Tag Archives: Uncategorized

I just have to post this because it is making me drool.

A Visualization of the Metadata Universe

Also, why is the word ‘metadata’ showing up as a misspelling here?? Metadata is core to all things!!


Filed under Uncategorized

WARC file format becomes an ISO standard

WARC, an extension of the ARC file format used for archiving web material, has been made an ISO standard (ISO 28500:2009).

WARC format offers new possibilities, notably the recording of HTTP request headers, the recording of arbitrary metadata, the allocation of an identifier for every contained file, the management of duplicates and of migrated records, and the segmentation of the records. WARC files are intended to store every type of digital content, either retrieved by HTTP or another protocol.

Standardization offers a guarantee of durability and evolution for the WARC format. It will help web archiving entering into the mainstream activities of heritage institutions and other branches, by fostering the development of new tools and ensuring the interoperability of collections. Several applications are already WARC compliant, such as the Heritrix [http://crawler.archive.org/] crawler for harvesting, the WARC tools [http://code.google.com/p/warc-tools/] for data management and exchange, the Wayback Machine [http://archive-access.sourceforge.net/projects/wayback/], NutchWAX [http://archive-access.sourceforge.net/projects/nutch/] and other search tools [http://code.google.com/p/search-tools/] for access. The international recognition of the WARC format and its applicability to every kind of digital object will provide strong incentives to use it within and beyond the web archiving community.

– Abbie Grotke, IIPC Communications Officer, Library of Congress

See the IIPC press release.
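
For anyone curious what a WARC record looks like on disk, here is a minimal sketch in Python, using only the standard library. It assumes the basic WARC/1.0 layout: a version line, named header fields (including the per-record identifier mentioned above), a blank line, the content block, and two trailing CRLFs. The function name `build_warc_record` and the example URI are my own inventions for illustration, not part of any WARC tool, and real crawlers like Heritrix write far richer records than this.

```python
import uuid
from datetime import datetime, timezone


def build_warc_record(target_uri: str, payload: bytes,
                      content_type: str = "text/plain",
                      record_type: str = "resource") -> bytes:
    """Serialize one WARC/1.0 record as bytes.

    Hypothetical helper for illustration only; not a library API.
    """
    headers = [
        ("WARC-Type", record_type),
        # Every record gets its own globally unique identifier.
        ("WARC-Record-ID", f"<urn:uuid:{uuid.uuid4()}>"),
        ("WARC-Date", datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")),
        ("WARC-Target-URI", target_uri),
        ("Content-Type", content_type),
        # Length of the content block in bytes, not characters.
        ("Content-Length", str(len(payload))),
    ]
    # Version line, header fields, then a blank line before the block.
    head = "WARC/1.0\r\n" + "".join(f"{k}: {v}\r\n" for k, v in headers) + "\r\n"
    # Each record ends with two CRLFs after the content block.
    return head.encode("utf-8") + payload + b"\r\n\r\n"


if __name__ == "__main__":
    record = build_warc_record("http://example.org/", b"hello")
    print(record.decode("utf-8"))
```

A WARC file is just a run of records like this one concatenated together, which is part of why the format can hold any kind of digital content and not only HTTP responses.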

Digital Preservation and Nuclear Disaster: An Animation

I have posted this fabulous, fabulous video everywhere but here. It is things like this that make me proud to be in this field. Enjoy!

Internet Archive gets a new data center

Computerworld has the story here.

The machine fits in a 20-foot-long outdoor metal cargo container filled with 63 server clusters that offer 4.5 million gigabytes of data storage capacity and 1TB of memory.

That only makes me shiver a little bit.

New love for the longtermdata blog

After surviving my first semester as a PhD student, I think I just might be able to devote a little more time to the longtermdata blog. I’ve added some of my favorite blogs to the blogroll and am going through the painful process of finding a suitable theme. Bear with me on the theme thing... it’s rough out there.

In the past month or so I have started to collect some good links and stories, all somehow related to long-term digital preservation. Here’s a good chunk of them:

A Tool to Verify Digital Records, Even as Technology Shifts
This is mostly about research happening at the University of Washington, but Stewart Brand is quoted at the end of the article about the Format Exchange, a project which I have been dabbling in.

Digital Preservation Challenge
DPE’s latest digital preservation challenge. A great great great idea. I encourage anyone to take it on.

Got Data?
If you have an ACM subscription, check this out. “Tools for surviving a data deluge to ensure your data will be there when you need it.”

ExLibris Group Releases Digital Preservation System

Kevin Kelly’s ‘movage’ idea infiltrates the digital curation community. Hoo-ray!

Kevin Kelly on Movage
Yes, yes, and yes.

That is probably more than enough to keep you all busy. I’ll be bringing more of the old and more new stuff to you in the very near future.

Link of the Week: Another video game emulator being developed in the UK


Note the MAME love in the comments.
