Page MenuHomePhabricator

HighBeam site dead
Closed, ResolvedPublic

Description

*.highbeam.com is now dead -- the site closed without any warning a few days ago. On Enwiki alone there are over 15,000 links. I tried to set the domain to dead through the interface but it doesn't work.

Testcases page:

https://en.wikipedia.org/wiki/User:GreenC/testcases/highbeam

Event Timeline

Kizule subscribed.

This happening and I can reproduce this, so I no see reason for closing this as invalid because IAB don't recongize links to *.highbeam.com as dead.

K - i was seeing threads and chatter various places figured this was already being tracked somewhere else but I don't know for certain.

Could this be because clicking on a Highbeam.com link now redirects to https://www.questia.com/hbr-welcome, rather than giving you a 404?

@Cyberpower678 Do you know why IABot isn't picking up Highbeam links as dead?

If it redirects to a live link, that would be why.

Is there any way to pick up that it's redirecting to a different URL, or flag Highbeam URLs specifically as dead despite the redirect?

@Samwalton9 - there are about 21k pages on enwiki with a highbeam. WaybackMedic will begin converting them to archives. If no archive exists it will add a {{dead link}}. Example diffs:

https://en.wikipedia.org/w/index.php?title=Actrius&diff=prev&oldid=883290952

https://en.wikipedia.org/w/index.php?title=Axiom&diff=prev&oldid=883291006

Should take a couple days since it is doing archive checking at the same time which slows it down some. It can't do the other language wikis. Hopefully @Cyberpower678 will discover if the Interface tool is crashing/time-out when converting highbeam to dead

All HighBeam links on Enwiki are now archived or a {{dead link}} if no archive available. Also the IABot database archive URL fields are updated, for those URLs it found on enwiki.

The next step is find all the highbeam URLs in the database that don't yet have an archive URL, add them and blacklist each HighBeam URL individually so IABot knows they are dead.

This will require downloading the entire IABot database via the API as there is no way to search for URLs using a domain or search pattern. It will take a while and sometimes aborts times out etc. so we will see how this goes.

Wow, that's some amazing progress @Green_Cardamom - thank you so much for your efforts on this!

Livestate is now "Blacklisted" for *highbeam.com across all language wikis. Was finally able by logging into a different language in the interface (alswiki) and from there it worked. For some reason from within Enwiki it crashes. This means it isn't possible to queue highbeam.com bot jobs on Enwiki pages, but I was able to mirror the same functionality of IABot using WaybackMedic (including updating the IABot database with the new archive links through the API). With global livestate now Blacklisted, it's possible to queue highbeam.com bot jobs for each language wiki (except enwiki) as normal. This has been done for all languages.

@Samwalton9 - thanks for the words of encouragement! Not sure what else needs to be done things seems complete and will close the ticket. Feel free to reopen if something shows up.