Cerf rocks

After I wrote about digital preservation and the need to save everything, not just the so-called “important” stuff, Jason wrote a lovely piece with his own thoughts on the matter:

In order to write a history, you need evidence of what happened. When we talk about preserving the stuff we make on the web, it isn’t because we think a Facebook status update, or those GeoCities sites have such significance now. It’s because we can’t know.

In a timely coincidence, Vint Cerf also spoke about the importance of digital preservation:

When you think about the quantity of documentation from our daily lives that is captured in digital form, like our interactions by email, people’s tweets, and all of the world wide web, it’s clear that we stand to lose an awful lot of our history.

He warns of the dangers of rapidly-obsoleting file formats:

We are nonchalantly throwing all of our data into what could become an information black hole without realising it. We digitise things because we think we will preserve them, but what we don’t understand is that unless we take other steps, those digital versions may not be any better, and may even be worse, than the artefacts that we digitised.

It was a little weird that the Guardian headline refers to Vint Cerf as “Google boss”. On the BBC he’s labelled as “Google’s Vint Cerf”. Considering he’s one of the creators of the internet itself, it’s a bit like referring to Neil Armstrong as a NASA employee.

I have to say, I just love listening to him talk. He’s so smooth. I’m sure that the character of The Architect from The Matrix Reloaded is modelled on him.

Vint Cerf knows a thing or two about long-term thinking when it comes to data formats. He has written many RFCs for the IETF (my favourite being RFC 2468). Back in 1969, he wrote RFC 20, proposing the ASCII format for network interchange. If you’ve ever used the keypress event in JavaScript and wondered why, for example, the number 13 corresponds to a carriage return, this is where all those numbers come from.
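
To make those numbers a bit more concrete, here is a minimal sketch of looking up the codes that RFC 20 assigns to a handful of characters. It is written in Python rather than JavaScript, and the helper name `ascii_code` and the sample characters are assumptions picked for illustration, not anything taken from the RFC itself.

```python
# Sample characters to look up. The labels are illustrative only.
SAMPLES = {
    "carriage return": "\r",
    "line feed": "\n",
    "horizontal tab": "\t",
    "space": " ",
    "capital A": "A",
}


def ascii_code(char: str) -> int:
    """Return the 7-bit ASCII code for a single character (hypothetical helper)."""
    code = ord(char)
    if code > 127:
        # RFC 20 describes a 7-bit code, so anything above 127 is out of range.
        raise ValueError(f"{char!r} is outside the 7-bit ASCII range")
    return code


for name, char in SAMPLES.items():
    print(f"{name}: {ascii_code(char)}")
```

The carriage return comes out as 13, the same number a keypress handler reports for the Enter key.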

Last month, over 45 years after the RFC’s original publication, it became an official standard.

So when Vint Cerf warns about the dangers of digitising into file formats that could become unreadable, I think we should pay attention to him.

Responses

Ashton McAllan

Man, this is making me worry about the whole “database antipattern” thing again but really I don’t want to sacrifice the gorgeous UX I’ve got going on WordPress for the sake of something that can be pretty much handled with a decent export backup script. Which shows how little I’ve learned since I already lost a lot of data last year thanks to bad database manipulation.

Related posts

9,125 days later

219,000 hours of wonder.

Related links

The race to save our online lives from a digital dark age | MIT Technology Review

For many archivists, alarm bells are ringing. Across the world, they are scraping up defunct websites or at-risk data collections to save as much of our digital lives as possible. Others are working on ways to store that data in formats that will last hundreds, perhaps even thousands, of years.

Internet Artifacts

I love this timeline of internet firsts. Best of all:

You may touch the artifacts

The websites on display work—even the ones that used Flash!

File over app — Steph Ango

In the fullness of time, the files you create are more important than the tools you use to create them. Apps are ephemeral, but your files have a chance to last.

The Internet Archive on the future of the web - Protocol — The people, power and politics of tech

A profile of Brewster Kahle and the Internet Archive:

Tech’s walled gardens might make it harder to get a perfect picture, but the small team of librarians, digital archivists and software engineers at the Internet Archive plan to keep bringing the world the Wayback Machine, the Open Library, the Software Archive, etc., until the end of time. Literally.

The Cuneiform Tablets of 2015 [PDF]

A 2015 paper by Long Tien Nguyen and Alan Kay with a proposal for digital preservation.

We discuss the problem of running today’s software decades, centuries, or even millennia into the future.
