pjones235 wrote:I haven't mirrored any links yet. People aren't deleting the links..
Before 2010 or whenever it was that the {{SibleyScan|1802/xxx}} template was created, files were not mirrored with the word "Sibley" in the file name, so they were impossible to track after being uploaded. But that really only affected files with Sibley numbers below 15,000 or so. Now that we have gotten through most of those, the remaining links have been filtered several different ways by different scripts I've run to remove already uploaded files as best as I can. I have a script that I run every day or so that adds the latest Sibley Links to the link list and it automatically searches the uploaded files log to make sure the file hasn't already been uploaded. We can do that because every file mirrored from Sibley contains the sibley number in the file name when it is added to the IMSLP servers. I started running that around the time that 18,000 was the latest Sibley number so if you want a less frustrating experience I would work on the files above 18,000.
Now that we have the Link List most people are deleting the links as they go but some people are too fast (Like Massenetique) and follow the RSS feed for Sibley and upload files the day they are posted by Sibley and before my script transfers them to the link list. Even in this case, the script still catches them because it checks the IMSLP upload log before adding the link to the list and sees that the file has already been uploaded.
So once we get above 18,000 there should be zero instances of files on the link list having already been mirrored unless Sibley rescans the same file twice (which has happened) or duplicates a file we already have from somewhere else.
And thank you in advance for your extra time helping!