If an image is uploaded and the user adds a link to the site where they found it (source in the {{Information}} template), this might be a good signal for whether the image is likely to be deleted - for example if the link is to google or facebook
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T357587 [Research EPIC] Media quality investigation on Commons FY24/25 | |||
Open | None | T349641 [EPIC] MVP Logo machine detection on Commons | |||
Resolved | Cparle | T369273 [Spike] Investigate the effect of external links on the likelihood of deletion of an image |
Event Timeline
Comment Actions
Uploaded in 2023
source contains a link to | deleted | not deleted | proportion deleted |
2868 | 360 | 0.89 | |
835 | 804 | 0.51 | |
1361 | 9 | 0.99 | |
228 | 24 | 0.90 | |
youtube | 2026 | 83 | 0.96 |
gettyimages | 193 | 17 | 0.92 |
155 | 4 | 0.97 | |
shutterstock | 20 | 1 | 0.95 |
alamy | 44 | 13 | 0.77 |
istockphoto | 23 | 1 | 0.96 |
tiktok | 20 | 1 | 0.96 |
fbcdn | 0 | 0 | - |
cdninstagram | 3 | 0 | 1 |
amazon | 98 | 11 | 0.90 |
media-amazon | 34 | 0 | 1 |
Ignoring google because it's only ~50/50 gives us 7597 files uploaded in 2023 that we can identify as having a ~93% chance of being deleted
Note that's around 7k deletions in 2023, compared to ~8.5k for logos
Comment Actions
Could you add how old is the account in the above query? And you could also add not only a link, just a mention, i.e. "source=Google", etc.