Wikipedia:Link rot/URL change requests/Archives/2024/May

From Wikipedia, the free encyclopedia


wikispot.org interwiki

The entire WikiSpot: interwiki is dead (around 250 uses). Sometimes the content can be recovered at localwiki (e.g. for Woodland, California: wikispot:woodland:Museums -> https://localwiki.org/woodland/Museums). Other times that's also a 404 and the content is just gone. * Pppery * it has begun... 04:04, 23 April 2024 (UTC)

LocalWiki and WikiSpot: 35 pages -- GreenC 16:39, 29 April 2024 (UTC)
 Done - Checked 35 pages and edited 35 pages. Converted 37 interwiki links to wikispot.org. Moved 17 wikispot links to localwiki. Added 17 {{dead link}}. Added 3 archive URLs (3 Wayback). -- GreenC 17:11, 29 April 2024 (UTC)

wikispot.org pass2 63 pages

 Done - Checked 63 pages and edited 20 pages. Added 14 {{dead link}}. Switched 2 |url-status=live to dead. Added 19 archive URLs (18 Wayback). Changed 1 citation metadata field. -- GreenC 18:13, 29 April 2024 (UTC)
Note: many of the archive.org links to wikispot.org appear to be soft-404 redirects to the home page, or some other useless place on the old website. My bot has trouble detecting these as there is no redirect in the headers. Probably all of the wikispot.org URLs should be checked manually and if there is no viable alternative I recommend nuking the citation entirely as unverifiable because placing a dead link tag will result in bots re-adding the useless archive URL. -- GreenC 18:24, 29 April 2024 (UTC)
I suspect there's some date (per http://wikispot.org/2015_Shutdown_Notice.html probably circa April 2015) when the site started redirecting to the home page, and all archives after that date are useless. * Pppery * it has begun... 19:18, 30 April 2024 (UTC)
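
The cutoff idea above can be sketched in Python. This is only an illustration, not the bot's actual logic: the exact cutoff date is an assumption (the thread says only "probably circa April 2015"), and the names `ASSUMED_CUTOFF`, `snapshot_time` and `is_suspect_snapshot` are hypothetical. It relies on Wayback URLs carrying a 14-digit `YYYYMMDDhhmmss` timestamp.

```python
import re
from datetime import datetime, timezone

# ASSUMPTION: the thread only says "probably circa April 2015".
ASSUMED_CUTOFF = datetime(2015, 4, 1, tzinfo=timezone.utc)

# Wayback URLs: https://web.archive.org/web/<YYYYMMDDhhmmss>[modifier]/<original URL>
WAYBACK_RE = re.compile(r"^https?://web\.archive\.org/web/(\d{14})[a-z_]*/(.+)$")


def snapshot_time(archive_url):
    """Return the snapshot time encoded in a Wayback URL, or None if absent."""
    match = WAYBACK_RE.match(archive_url)
    if match is None:
        return None
    parsed = datetime.strptime(match.group(1), "%Y%m%d%H%M%S")
    return parsed.replace(tzinfo=timezone.utc)


def is_suspect_snapshot(archive_url, cutoff=ASSUMED_CUTOFF):
    """True if the snapshot was taken on or after the assumed cutoff date."""
    taken = snapshot_time(archive_url)
    return taken is not None and taken >= cutoff
```

A snapshot flagged this way would still need a manual check; the date only narrows down which archives are likely to be home-page redirects.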

symantec.com

All URLs starting with http://www.symantec.com/security_response/writeup.jsp? seem to be soft 404. 97 pages. * Pppery * it has begun... 17:00, 27 April 2024 (UTC)

I processed every symantec link, as the site is mostly soft 404; I found 11 varieties.
 Done - Checked 384 pages and edited 319 pages. Moved 120 links to a new URL. Added 8 {{dead link}}. Switched 3 |url-status=dead to live. Switched 59 |url-status=live to dead. Added 351 archive URLs (330 Wayback). Changed 69 citation metadata fields. -- GreenC 22:35, 29 April 2024 (UTC)

wikisophia.org

Entire wikisophia.org site is dead, as well as the wikisophia: interwiki (which is soon going to point to a static page at m:Interwiki map/discontinued#Wikisophia). No replacement known. * Pppery * it has begun... 22:49, 28 April 2024 (UTC)

wikisophia interwiki

 Done Checked 15 pages and edited 15 pages. Converted 14 interwikis. Added 13 {{dead link}}. Added 1 archive URL. -- GreenC 19:43, 29 April 2024 (UTC)
The above also includes all wikisophia.org links. -- GreenC 19:56, 29 April 2024 (UTC)

koreatimes.co.kr

We seem to have some 3k articles with url=http://www.koreatimes.co.kr. The website loads fine over HTTPS for me, so these links should be upgraded. Nemo 04:30, 29 April 2024 (UTC)

The Korea Times - 5,439 pages -- GreenC 18:00, 30 April 2024 (UTC)
 Done - Checked 5,445 pages and edited 3,573 pages. Moved 5,983 links to a new URL. Removed 3 {{dead link}} templates. Added 15 {{dead link}}. Switched 662 |url-status=dead to live. Switched 25 |url-status=live to dead. Added 327 archive URLs (213 Wayback). Changed 92 citation metadata fields. -- GreenC 16:31, 1 May 2024 (UTC)
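
For the HTTP to HTTPS part of the request, a minimal Python sketch might look like the following. It covers only the scheme change, not the URL moves reported above, and the function name `upgrade_to_https` and the example path are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def upgrade_to_https(url, hosts=("koreatimes.co.kr",)):
    """Rewrite http:// to https:// for the listed hosts and their subdomains."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    if parts.scheme != "http":
        return url  # already https, or not a web URL
    if not any(host == h or host.endswith("." + h) for h in hosts):
        return url  # different site: leave untouched
    return urlunsplit(parts._replace(scheme="https"))


# Hypothetical article path, for illustration only.
print(upgrade_to_https("http://www.koreatimes.co.kr/www/news/a.html?x=1"))
```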

A new feature for this move can be seen at Special:Diff/1221731335/1221749231. The URL redirects with a client-side mechanism (JavaScript), so it was not possible to use the page headers, which only return status 200. I developed a headless browser script to retrieve the JS redirect. The script is a CLI utility, in case anyone would like a copy. It requires Node and Puppeteer. -- GreenC 20:55, 1 May 2024 (UTC)
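
The script mentioned above uses Node and Puppeteer and is not reproduced here. As a much weaker stand-in, the Python sketch below scans already-fetched HTML for literal client-side redirect targets (a meta refresh or a `location` assignment). It is a static approximation, not the headless-browser approach: it cannot follow targets that JavaScript computes at run time. The function name and patterns are hypothetical and far from exhaustive.

```python
import re
from urllib.parse import urljoin

# Common literal client-side redirect patterns (ASSUMPTION: not exhaustive).
META_REFRESH_RE = re.compile(
    r"""<meta[^>]+http-equiv=["']?refresh["']?[^>]*content=["'][^"']*url=([^"'>\s]+)""",
    re.IGNORECASE,
)
JS_LOCATION_RE = re.compile(
    r"""(?:window\.|document\.)?location(?:\.href)?\s*=\s*["']([^"']+)["']""",
    re.IGNORECASE,
)
JS_REPLACE_RE = re.compile(
    r"""location\.(?:replace|assign)\(\s*["']([^"']+)["']\s*\)""",
    re.IGNORECASE,
)


def find_client_side_redirect(html, base_url):
    """Return the absolute redirect target found in the HTML, or None."""
    for pattern in (META_REFRESH_RE, JS_LOCATION_RE, JS_REPLACE_RE):
        match = pattern.search(html)
        if match:
            # Resolve relative targets against the page's own URL.
            return urljoin(base_url, match.group(1))
    return None
```

A page that returns status 200 with no `Location` header but yields a target here is a candidate for a client-side redirect; anything it misses would still need a real browser.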

wikilivres.org

Another dead interwiki: the entire site https://wikilivres.org/ is soft 404 of the "redirect to the homepage" variety, as well as the "wikilivres:" and "BiblioWiki:" interwikis that point to it.

I also noticed while investigating this that the wikilivres.ca domain appears to have been usurped, with it originally being a wiki similar to wikisource, and now being a spammy blog. But do note that https://wikilivres.ru/ (with its own wikilivresru: interwiki) is still up. * Pppery * it has begun... 19:24, 30 April 2024 (UTC)

59 pages for interwiki and .org. I'll add wikilivres.ca to WP:JUDI (40 pages). -- GreenC 21:02, 1 May 2024 (UTC)
Some pages inexplicably work, e.g. [1] -- GreenC 12:59, 14 May 2024 (UTC)
 Done - Checked 58 pages and edited 52 pages. Converted 62 interwiki. Added 38 {{dead link}}. Added 4 archive URLs (2 Wayback). Changed 3 citation metadata fields. -- GreenC 13:06, 14 May 2024 (UTC)

wikinvest.com

Yet another dead interwiki: wikinvest:/https://wikinvest.com. See m:Talk:Interwiki map/Archives/2018#Discontinue Wikinvest. * Pppery * it has begun... 19:28, 30 April 2024 (UTC)

141 pages for interwiki and .com. -- GreenC 00:15, 10 May 2024 (UTC)
 Done - Checked 141 pages and edited 105 pages. Converted 91 interwiki. Added 12 {{dead link}}. Switched 4 |url-status=live to dead. Added 129 archive URLs (129 Wayback). Changed 11 citation metadata fields. -- GreenC 18:45, 14 May 2024 (UTC)

gutenberg.org

Entire path https://gutenberg.org/wiki/* is dead. About 40 pages. Also has an interwiki at gutenbergwiki: but it doesn't seem to be used. See m:Interwiki map/discontinued#Gutenbergwiki * Pppery * it has begun... 19:30, 30 April 2024 (UTC)

40 pages. -- GreenC 00:17, 10 May 2024 (UTC)
 Done - Checked 39 pages and edited 37 pages. Switched 10 |url-status=live to dead. Added 36 archive URLs (36 Wayback). -- GreenC 21:11, 14 May 2024 (UTC)

bigten.org

Hello. The links to articles on the Big Ten Conference are broken as their URLs have changed. For instance, this 2018 article is now here. The string at the end seems to be a unique ID, so I can't predict what the new URL is without searching through the website. Not sure if it's more useful to: 1) use the archived copies where possible then convert the other ones to the new URLs 2) convert all to the new URLs. Almost 2,000 possible broken links. Thanks! MrLinkinPark333 (talk) 03:40, 2 May 2024 (UTC)

1,317 pages. -- GreenC 00:20, 10 May 2024 (UTC)

Hi User:MrLinkinPark333: Unless there is an undocumented API, like the one that exists for Wikipedia:Link_rot/URL_change_requests#dinamalar.com, that translates old to new, I don't see much option but to convert to archive URLs. You could also contact them to see if they have plans to add redirects. If they ever do, I can go back and unwind the archive URLs and replace them with the new URLs. -- GreenC 21:22, 14 May 2024 (UTC)

 Done - Checked 1,326 pages and edited 866 pages. Moved 65 links to a new URL. Added 99 {{dead link}}. Switched 56 |url-status=live to dead. Added 1,966 archive URLs (1,945 Wayback). Changed 745 citation metadata fields.

webcitation.org

Expand URLs to longform. Fix http->https. Fix |archive-date= offsets due to relative time-zone differences. Unpack archive.org doubles (they won't work correctly). Note: this work was made possible by a discovery in how to access the WebCite API, which normally gives the appearance of being down/inaccessible due to SSL misconfiguration on server-side. I don't know how long this hack will work, but I am updating the links while it's working. -- GreenC 14:36, 2 May 2024 (UTC)

 Done - Converted about 11,000 links to other providers. Converted about 1,300 links from short to long form and other misc fixes. Includes 100s of templates. There are still many WebCitation.org URLs remaining unfortunately. -- GreenC 02:44, 14 May 2024 (UTC)

freeuk.com

Some (but not all) pages/subdomains of freeuk.com currently redirect to [2]. It's not clear to me whether this is more of a small-scale link rot issue or one that affects multiple pages, so listing here out of an abundance of caution. All the best, ‍—‍a smart kitten[meow] 15:35, 11 May 2024 (UTC)

I'll check it out, thanks. The domain is in 313 pages. -- GreenC 16:01, 11 May 2024 (UTC)
 Done - Checked 319 pages and edited 98 pages. Added 5 {{dead link}}. Switched 2 |url-status=live to dead. Added 113 archive URLs (109 Wayback). -- GreenC 20:19, 15 May 2024 (UTC)

iaboterr

Fixing about 800 pages that have an error by IABot adding duplicate archives and incorrect url-status -- GreenC 04:32, 16 May 2024 (UTC)

 Done - Checked 806 pages and edited 748 pages. -- GreenC 06:36, 16 May 2024 (UTC)

Found about 200 pages more in Category:CS1 errors: redundant parameter, and removing duplicate |access-date=. -- GreenC 16:47, 16 May 2024 (UTC)

 Done - Checked 209 pages and edited 168 pages -- GreenC 18:07, 16 May 2024 (UTC)
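
As a rough illustration of removing a duplicate |access-date=, here is a naive Python sketch. It is not the bot's code: the function name `drop_duplicate_param` is hypothetical, keeping the first occurrence is an assumption (the thread does not say which copy was kept), and it must be given the text of a single citation template, not a whole page.

```python
import re


def drop_duplicate_param(template_text, name="access-date"):
    """Keep the first |name=... in one template and remove later repeats.

    ASSUMPTION: the value contains no pipes or closing braces, which
    holds for plain dates but not for arbitrary parameters.
    """
    pattern = re.compile(r"\|\s*" + re.escape(name) + r"\s*=[^|}]*")
    seen = False

    def keep_first(match):
        nonlocal seen
        if seen:
            return ""  # drop every occurrence after the first
        seen = True
        return match.group(0)

    return pattern.sub(keep_first, template_text)
```

Because the pattern requires the pipe to sit directly before the name, a parameter such as |archive-date= is left alone.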

South Asia Analysis Group

www.southasiaanalysis.org - domain has been usurped. not sure if it's used anywhere other than Major non-NATO ally (where I already fixed the cite template). thanks, Kdroo (talk) 22:14, 23 May 2024 (UTC)

 Done - added to WP:JUDI for later processing: Special:Diff/1225704304/1225804735 -- GreenC 20:44, 26 May 2024 (UTC)

donjohnsonbigband.com

This domain seems to have been usurped: in 2020, it was still a normal band site https://web.archive.org/web/20201202185840/http://www.donjohnsonbigband.com/[usurped] vs since 2021 it's "DJ Son Band - Rock Music Review" https://web.archive.org/web/20211115153056/https://www.donjohnsonbigband.com/[usurped].

New official URL for the band is https://www.donjohnsonbigband.fi/ TuukkaH (talk) 22:10, 28 May 2024 (UTC)

 Done Amused the usurpers interpreted "donjohnson" as "DJ Son" ie. Don John Son. Or maybe a computer algorithm, stupid AI. Well, I added it to WP:JUDI for future processing: Special:Diff/1225804735/1226183097 and the URL is in one article, Support de Microphones, which I sortafixed: Special:Diff/1171308429/1226183692 -- GreenC 01:35, 29 May 2024 (UTC)