Web Science and Digital Libraries Research Group

Posts

Showing posts with the label memgator

2023-11-22: Auditing Web Archiving Livestreams

By Travis Reid - November 22, 2023

Figure 1: Using audit mode to replay mementos of https://oduwsdl.github.io/ from the Wayback Machine and archive.today While working on the Game Walkthroughs and Web Archiving project , we created web archiving livestreams where viewers would be able to watch two web crawlers archive a set of seed URIs and watch the replay of the archived web pages. We recently created a new mode that can audit web archives so that we can view archived web pages, or mementos , from different web archives at the same time. Viewing two mementos from different web archives is useful when the content on the original web page could vary based on personalization, location, or was different each time the web page was loaded. Audit mode will allow viewers to watch an audit of two web archives for the same URI-R. (A URI-R identifies the live web version of a web resource.) In audit mode, we show a replay of all of the unique mementos associated with a given URI-R for two web archives. Being able to view...

2019-08-03: Searching Web Archives for Unattributed Deleted Tweets From Politwoops

By Nauman Siddique - August 03, 2019

Tweet URL: https://twitter.com/derekwillis/status/1127234631865118731 On May 11th 2019, Derek Willis , who works at Propublica and also maintains the Politwoops project, tweeted a list of deleted tweet ids found by Politwoops that could not be attributed to any Twitter handle being tracked by Politwoops. This was an opportunity for us to revisit our interest in using web archives to uncover the deleted tweets . Although we were unsuccessful in finding any of the deleted tweet ids in web archives provided by Politwoops, we are documenting our process for coming to this conclusion. Politwoops Politwoops is a web service which tracks deleted tweets of elected public officials and candidates running for office in the USA and 55 other countries . The Politwoops USA is supported by Propublica . Creating Twitter handles list for the 116th Congress In a previous post , we discussed the challenges involv...

2018-04-30: A High Fidelity MS Thesis, To Relive The Web: A Framework For The Transformation And Archival Replay Of Web Pages

By Unknown - April 30, 2018

It is hard to believe that the time has come for me to write a wrap up blog about the adventure that was my Masters Degree and the thesis that got me to this point. If you follow this blog with any regularity you may remember two posts, written by myself, that were the genesis of my thesis topic: 2017-01-20: CNN.com has been unarchivable since November 1st, 2016 2017-03-09: A State Of Replay or Location, Location, Location Bonus points if you can guess the general topic of the thesis from the titles of those two blog posts. However, it is ok if you can not as I will give an oh so brief TL;DR;. The replay problems with cnn.com were, sadly, your typical here today gone tomorrow replay issues involving this little thing, that I have come to , known as JavaScript. What we also found out, when replaying mementos of cnn.com from the major web archi...