<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ulcc da blog &#187; preservation</title>
	<atom:link href="http://dablog.ulcc.ac.uk/tag/preservation/feed/" rel="self" type="application/rss+xml" />
	<link>http://dablog.ulcc.ac.uk</link>
	<description>ulcc digital archives blog</description>
	<lastBuildDate>Fri, 03 Feb 2012 16:24:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
		<item>
		<title>Scanning is different from digitisation</title>
		<link>http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/#comments</comments>
		<pubDate>Tue, 26 Jul 2011 12:47:38 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Digitisation]]></category>
		<category><![CDATA[access]]></category>
		<category><![CDATA[accessibility]]></category>
		<category><![CDATA[digitisation]]></category>
		<category><![CDATA[e-books]]></category>
		<category><![CDATA[file formats]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1630</guid>
		<description><![CDATA[If you haven’t seen it, can I recommend Kristen Snawder&#8217;s recent post on the Library of Congress Digital Preservation blog, Digitization is different than digital preservation. Kristen reiterates familiar points about the long-term commitment necessary for serious digital preservation, contrasted with the quick hit of a scanning project. “In the hurry to meet user expectations, [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/' addthis:title='Scanning is different from digitisation '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/kylemcdonald/4287375982"><img class="alignright size-medium wp-image-1631" title="Autocorrelation scan by Kyle McDonald on Flickr" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/07/4287375982_5b5767939d_o-300x300.png" alt="" width="171" height="171" /></a></p>
<p>If you haven’t seen it, can I recommend Kristen Snawder&#8217;s recent post on the Library of Congress Digital Preservation blog, <a class="c3" href="http://blogs.loc.gov/digitalpreservation/2011/07/digitization-is-different-than-digital-preservation-help-prevent-digital-orphans/">Digitization is different than digital preservation</a>.  Kristen reiterates familiar points about the long-term commitment necessary for serious digital preservation, contrasted with the quick hit of a scanning project. “In the hurry to meet user expectations, institutions may scan large quantities of materials without having a solid plan for preserving the digital images into the future.”</p>
<p class="c2">However another recent find on the Web compels me to make an additional point, namely that we might do equally well to differentiate between scanning and digitisation. Anyone can set to work with a scanner and create a bunch of digital images &#8211; but that barely scratches the surface of what I think we should be expecting of a digitisation project in 2011.</p>
<p class="c0">First and foremost, we need metadata: the more the merrier, but something at least. Even if we expect to come back later and polish it up (once the images can be browsed and examined on screen). In the absence of any established metadata profiles for a project, at least try to cover as many <a class="c3" href="http://dublincore.org/documents/dces/">Dublin Core</a> elements as possible &#8211; title, creator, date, subject/keywords&#8230; Images, in particular, may prove tricky or time-consuming to find again, especially once there are thousands of them on a disk. We should probably keep the metadata in a database, and perhaps additionally store metadata with the objects. This can be as XML or plain text files stored alongside the digital images, or embedded in the files we create (many common file formats &#8211; TIFF, JPEG, MPEG, PDF &#8211; support metadata embedding, and there are many free tools available to help).</p>
<p class="c0">There is yet more, though, that we should be doing, particularly when we are scanning text-based objects (articles, books, magazines, reports, etc). Most importantly, we really should try and extract the text from the image if possible. <sup class="c1"><a name="ftnt_ref1" href="#ftnt1">[1]</a></sup></p>
<p class="c2">My recent web find was the teaching blog of Dr Toine Bogers at the <a class="c3" href="http://www.iva.dk/">Royal School of Library and Information Science</a> (RSLIS) in Copenhagen, Denmark. One fascinating post describes a Lab Session exercise, <a class="c3" href="http://itlab.dbit.dk/~toine/?page_id=304">From OCR To NER</a>, a set of comparatively simple command-line processes to get the most out of a scanned-text project.</p>
<p class="c0"><span id="more-1630"></span>Toine’s post walks us through the process. Once the article is scanned, we should apply some OCR. The exercise goes further and also describes the use of tools to clean up and spell-check the resulting OCR’d text. This will, at the very least, result in a separate text file, hopefully containing a fairly accurate version of the article text. Finally, the cleaned-up text can be submited to a Named Entity Recognition service. Toine’s exercise uses NER <a class="c3" href="http://cogcomp.cs.illinois.edu/demo/ner/">tools at University of Illinois</a>. (We’ve been using similar functionality provided by <a class="c3" href="http://www.opencalais.com/">OpenCalais</a> and <a class="c3" href="http://gate.ac.uk/">GATE</a> for our <a class="c3" href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/infrastructureforresourcediscovery/pathfinder.aspx">AIM25 Open Metadata</a> project.)</p>
<p class="c0">Why do all this? The most important, instant, result of this is that we can now easily index our article for full-text searching &#8211; in a local repository system, such as EPrints or DSpace provide &#8211; and of course by Google. None of this is possible if we leave the scanned image as just that &#8211; an image.</p>
<p class="c2">Another  side-effect of any successful OCR outcome, is that the text is now free to be re-flowed. This means that we might consider sharing it with users in a variety of forms enhancing usability and accessibility.</p>
<p class="c2">It’s important not to confuse preservation formats with formats for access and dissemination. You probably will have your scanned image masters in TIFF, RAW, JPEG2000, PostScript, SVG. None of these are likely to be of much use to your users over the Web. Not only are the formats not widely supported by Web browsers, but most users probably don’t need or want your master image. If it’s a high-resolution scan of a 100 page book, they might be looking at 100Mb download, or worse &#8211; slow to load, and probably slow to render and navigate.</p>
<p class="c2">Time taken thinking what formats will give users the best experience is time well spent. What platforms might they want to use now and in the foreseeable future? It’s less than 18 months since Kindle3 made e-book readers affordable, and the Ipad made them sexy. E-books look and function very impressively on both platforms (albeit in different ways): for an overview of some of the benefits of the EPUB format, see Martin Fenner&#8217;s post <a href="http://blogs.plos.org/mfenner/2011/01/23/beyond-the-pdf-%E2%80%A6-is-epub/">Beyond The PDF&#8230; is EPUB</a>. PDF outputs may yet have their uses, if users can at least search for text within them. The point is that only with properly digitised text, do these kinds of accessibility options become possible.</p>
<p class="c2">Even image collections can also be disseminated as E-books &#8211; nice offline items some users might care to flick through on their tablet computers, possibly even smartphones. I&#8217;ve demonstrated how we can <a href="http://sasopenjournals.blogspot.com/2011/07/populating-ojs-from-eprints.html">create OJS XML from EPrints XML on-the-fly with XSLT</a>: since EPUB and Mobi/Kindle are XML-based formats, we should be able to do something similar to create e-books using repository APIs. Also, by using appropriately sized images in dissemination formats (Ipad screen is 1024x768px; Iphone4 is 960x640px) we can not only ship our users a sensibly-sized download, we can protect any capital we may have in the master images, without having to resort to ugly tricks like watermarking. (Giving users full-size, high-res images with embedded watermarks seems to me the worst of all worlds.)</p>
<p class="c2">Therefore I&#8217;d suggest that, in order to get the best out of a digitisation project, consider what would you like to see at the end of the project &#8211; and, more importantly, what would give your  users the best experience, or even win you new users? Ask around, do some tests, with users if possible, and get an idea how they want to use the materials and how they will get the best out of them. Maybe there are comparable projects and systems that you admire, with features you’d like to be available for your collection. What about in five or ten years’ time: will your current project outputs help or hinder longer term accessibility goals?</p>
<p class="c2">This kind of vision of is essential. Without some conception of the end result, how the materials will be used and managed most effectively, all the scanning in the world isn’t going to amount to  a successful digitisation project.</p>
<p>&nbsp;</p>
<hr class="c9" style="text-align: left; width: 50%; margin: 0 auto 0 0;" />
<div>
<p class="c2"><a name="ftnt1" href="#ftnt_ref1">[1]</a> Of course manuscipts and ‘difficult’ print formats &#8211; early printing typefaces, multilingual objects &#8211; may be resistant to OCR. For that we may need specialised solutions or rekeying, as discussed in recent posts on DA Blog (<a class="c3" href="http://dablog.ulcc.ac.uk/2011/05/05/house-of-books-part-2/">House Of Books (Part 2)</a>, <a class="c3" href="http://dablog.ulcc.ac.uk/2011/02/21/synergies-abound/">Synergies Abound</a>). Or the kind of online tool we developed with UCL for <a class="c3" href="http://dablog.ulcc.ac.uk/2010/03/01/transcribing-bentham/">Transcribe Bentham</a>.</p>
</div>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/' addthis:title='Scanning is different from digitisation '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Asynchronicities in blog structure</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/#comments</comments>
		<pubDate>Mon, 11 Apr 2011 15:56:04 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[data model]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[time]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogforever.eu/?p=430</guid>
		<description><![CDATA[At an atomic level, a “blog” comprises “blog posts”, which are continually added to the blog corpus: that is the dynamic essence of a blog, and distinguishes it from old-fashioned, largely static Websites and hypertexts in which little content changed between major update iterations, which process was probably more akin to “publishing a new edition” [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/' addthis:title='Asynchronicities in blog structure '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<i>From the <a href="http://blogforever.eu/">BlogForever</a> blog.</i>
<p>At an atomic level, a “blog” comprises “blog posts”, which are continually added to the blog corpus: that is the dynamic essence of  a blog, and distinguishes it from old-fashioned, largely static Websites and hypertexts in which little content changed between major update iterations, which process was probably more akin to “publishing a new edition” in the world of non-digital publications.</p>
<p>The blog also displays, as part of its frame, other graphical and functional elements (sidebars, widgets, “blogrolls”, etc) which may themselves contain dynamically updated, constantly changing information. These can be added, removed, amended and rearranged at will by the blog author/editor. Blog posts that were “published” in the context of one set of framing elements, will persist through subsequent versions of that framework.</p>
<p>Similarly with design (layout, colours, mastheads, etc), though the persistence tends to be longer, the informal nature of blogs means that these may be easily changed by the blog editor/author, and are thus more volatile than a typical “corporate” website. Again, blog posts may persist, unchanged in themselves, through many iterations of the blog site design and layout.</p>
<div id="attachment_431" class="wp-caption aligncenter" style="width: 310px"><a href="http://blogforever.eu/wp-content/uploads/2011/04/blogatoms.png"><img class="size-medium wp-image-431" src="http://blogforever.eu/wp-content/uploads/2011/04/blogatoms-300x223.png" alt="blogatoms 300x223 Asynchronicities in blog structure" width="300" height="223" title="Asynchronicities in blog structure" /></a><p class="wp-caption-text">A simple view of blog elements and their temporal relationship</p></div>
<p>&nbsp;</p>
<p>This very simplified visualisations suggests where we might start conceptualising key elements of a blog. It indicates that they iterate over time, but in the cases of Design, Posts and Widgets (as we’ll call them for brevity), according to independent schedules. While Posts and Comments persist in the online view of a blog, designs and widget arrangements are overwritten.</p>
<p>With my earlier ArchivePress project we deliberately overlooked preservation of the blog&#8217;s framing elements, and (given the much smaller scope of that project) established an <a href="http://archivepress.ulcc.ac.uk/2009/07/31/archival-musings/">acceptable rationale</a> for doing so. The challenge for BlogForever is to find a solution to  precisely these issues. Unless we were simply to adopt the snapshot approach of Heritrix-based web archiving initiatives (e.g. Wayback/archive.org, UK Web Archive), we need to ensure the BlogForever repository supports a degree of granularity that can capture, describe and preserve atomic blog objects in a way that reflects the particular interdependencies, in order to understand and preserve them authentically, and permit the many possible authentic and valid “time slice” views and analyses that users of the archive will need.</p>
<p>(I appreciate, by the way that these objects themselves are compound objects, so not strictly &#8220;atomic&#8221;: but the same is also true of atoms, as our CERN colleagues can attest!)</p>
<div style="float: right; margin-left: 10px;"><a href="http://twitter.com/share?url=http://blogforever.eu/blog/2011/04/11/asynchronicities-in-blog-structure/&via=blogforever&text=Asynchronicities%20in%20blog%20structure&related=:&lang=en&count=vertical" class="twitter-share-button">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>File formats&#8230;or data streams?</title>
		<link>http://dablog.ulcc.ac.uk/2009/12/03/ffods/</link>
		<comments>http://dablog.ulcc.ac.uk/2009/12/03/ffods/#comments</comments>
		<pubDate>Thu, 03 Dec 2009 16:42:53 +0000</pubDate>
		<dc:creator>Ed Pinsent</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[DPC]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[file formats]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[Technical]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=811</guid>
		<description><![CDATA[On 1st December Malcolm Todd of The National Archives gave a good account of the work he&#8217;s been doing on File Formats for Preservation, resulting in a substantial new Technology Watch report for the DPC. It was a seminar hosted by William Kilbride, with participants from the BBC, the BL, NLW and others. The afternoon [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/12/03/ffods/' addthis:title='File formats&#8230;or data streams? '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>On 1st December Malcolm Todd of The National Archives gave a good account of the work he&#8217;s been doing on <strong>File Formats for Preservation</strong>, resulting in a substantial new <a href="http://www.dpconline.org/docs/reports/dpctw09-02.pdf">Technology Watch report for the DPC</a>. It was a seminar hosted by William Kilbride, with participants from the BBC, the BL, NLW and others. The afternoon was useful and interesting for me since I teach an elementary module on file formats in a preservation context for our DPTP courses.</p>
<p>My naïve thinking in the area has been characterised by the assumption that the process is rather static or linear, and that the problem we&#8217;re facing is broadly the same every time; migrate data from a format that&#8217;s about to become obsolete or unsupported, onto another format that&#8217;s stable, supported, and open. MS Word document to PDF or PDF/A…now <em>that</em>, I can understand!</p>
<p>In fact, I learned at least two ways of thinking about formats that hadn&#8217;t occurred to me before. One simple one is costs; some formats can cost more to preserve than others. This can be calculated in terms of storage costs, multiplied over time, and the costs associated with migrations to new versions of that format. <span id="more-811"></span>For example, we&#8217;ve tended to pin our faith on the TIFF format for images for many reasons, but there&#8217;s a high storage price to be paid for all that wonderful losslessness. This may be one reason why the DP world is looking with more favour on the JPEG2000 format, which is &#8216;virtually&#8217; lossless and smaller in size.</p>
<p>Secondly, the problems of preserving digital data which doesn&#8217;t actually have a specified stable preservation format. Chris Puttick of <a href="http://thehumanjourney.net/">Oxford Archaeology</a> gave a vivid description of the problems he&#8217;s facing with CAD and GIS files, where the data can&#8217;t easily be tied to a single format in the first place (nor can a stable format for migration be identified). As the NLA put it on their <a href="http://www.nla.gov.au/padi/topics/432.html">PADI page</a>, &#8220;At present there is little dealing specifically or comprehensively with the preservation of this particular type of data, although some aspects of database preservation are applicable to GIS. Some long term preservation issues include a lack of open source formats and metadata standards, large data volume and complex data objects.&#8221; Puttick suggests that his data doesn&#8217;t really perform at all unless it&#8217;s operated within a very specific environment of hardware and software. How do we preserve an environment? This appears to be quite a distinct preservation problem and much harder to solve than Word to PDF, to put it mildly.</p>
<p>William Kilbride suggested that such cases (and websites too, arguably, because they are time-based) are more like a <em>stream </em>of data &#8211; a handy image which conveys something about the dynamic of such information packages, and showing us that it&#8217;s much harder to nail them down into a single format. You can never step into the same river twice.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/12/03/ffods/' addthis:title='File formats&#8230;or data streams? '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2009/12/03/ffods/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>On the limits of preservation</title>
		<link>http://dablog.ulcc.ac.uk/2009/05/31/on-the-limits-of-preservation/</link>
		<comments>http://dablog.ulcc.ac.uk/2009/05/31/on-the-limits-of-preservation/#comments</comments>
		<pubDate>Sun, 31 May 2009 21:20:16 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[chiptunes]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[emulation]]></category>
		<category><![CDATA[historical computing]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=618</guid>
		<description><![CDATA[A recent article in New Scientist on the outer fringes of the chiptune scene prompted me to think about preservation, emulation and the fact that some digital things simply aren&#8217;t preservable in any useful sense. Chiptunes are typically created using early personal computers or videogames and/or their soundchips. In that respect, they depend on technology [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/05/31/on-the-limits-of-preservation/' addthis:title='On the limits of preservation '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>A recent article in New Scientist on the outer fringes of the <a href="http://en.wikipedia.org/wiki/Chiptune">chiptune</a> scene prompted me to think about preservation, emulation and the fact that some digital things simply aren&#8217;t preservable in any useful sense.</p>
<p>Chiptunes are typically created using early personal computers or videogames and/or their soundchips. In that respect, they depend on <a href="http://www.flickr.com/photos/fzero/483901133/"><img align="right" src="http://farm1.static.flickr.com/193/483901133_048282eeb5_m.jpg" alt="Harp and netbook - Fzero@flickr CC BY NC" /></a>technology preservation &#8211; the museum approach to digital preservation. Chiptune composers either use the systems as designed, programming them directly to create their music, or alter them in some way using techniques collectively known as &#8216;circuit-bending&#8217;, which makes the machines capable of producing sounds that they could not have originally produced. Some aspects of the chiptune scene utilise more modern synthetic techniques to recreate the sounds produced by these early chips &#8211; these are, in a loose sense, emulating the original systems, although not in a way that would allow you to use original software to create your sounds. But some adherents of the chiptune genre are going further, using the sounds of the systems themselves in their compositions.</p>
<p><a href="http://www.newscientist.com/article/mg20126999.900">The article</a> which set my train of thought going covered Matthew Applegate&#8217;s (aka <a href="http://www.pixelh8.co.uk">Pixelh8</a>) concert in late March 2009 at the National Museum of Computing, <span id="more-618"></span>which amongst other things used the electro-mechanical sounds produced by the Colossus and by hand-cranked adding machines. Now in this respect, as in many others, digital isn&#8217;t different to the analogue world. One could argue that found sounds have a long tradition in Western music, from Tchaikovsky&#8217;s use of cannon in the 1812 overture through to Leroy Anderson&#8217;s <a href="http://www.metacafe.com/watch/803796/the_typewriter_song/">typewriter symphony</a> and the work of the Boston Typewriter Orchestra, who eschew traditional instruments entirely and concentrate solely on percussive typewriter music, interleaved with the odd end-of-line bell. </p>
<p>But pixelh8&#8242;s work reminded me of some earlier, and unlikely, uses of computers to create music which I fear is already lost &#8211; although discerning listeners might well argue that we are all much the better for it being lost. One depended on the noise-making abilities of computer peripherals such as line and chain printers, the fiercest of which were capable of noise levels that drowned out conversation their vicinity. Devious users could create files which generated recognisable rhythms when printed on a specific printer. I&#8217;ve seen reference both to the &#8220;William Tell Overture&#8221; and &#8220;She&#8217;ll be Coming Round the Mountain&#8221; being produced by this method, which demanded detailed knowledge of the printer&#8217;s workings (but no programming knowledge &#8211; the effect was produced merely by printing a carefully-crafted file.) It seems that <a href="http://www.mmdigest.com/Archives/Digests/199812/1998.12.16.01.html">at least one recording</a> of this exists, from the mid-1960s. Good ideas like this don&#8217;t die, or are reimagined by later generations, and 1999 brought us the <a href="http://www.theuser.org/dotmatrix/en/intro.html">Symphony for dot matrix printers.</a></p>
<p>These examples, in common with some of pixelh8&#8242;s work, are dependent on the relevant hardware still existing and being in working order in order to allow us to continue to perform them, or to create new compositions. That&#8217;s certainly a challenge for things like line printers, and I imagine the same is true of mechanical typewriters. These, however, are relatively simple mechanical devices and, so long as someone is still interested in maintaining them, they will be repairable for many generations to come, even if they are no longer being manufactured.</p>
<p>But other forms of of computer music aren&#8217;t so easily re-created. At about the same time as people were creating line printer music, others realised that the computer&#8217;s own electronics generated sound in an unplanned-for way. Almost all computers of the 1960s generated substantial amounts of radio frequency interference of a type that&#8217;s now illegal. The clock speeds of those computers meant that the interference fell mainly in the medium wave band, between 500 Khz and just over 1Mhz, exactly the right spot for any consumer AM radio to pick it up. (Present-day systems, if not shielded, would be generating interference for microwaves and mobile phones, not AM radios.) Enterprising programmers with time on their hands and dedicated systems to play with wrote programs which generated RF interference which approximated a tune, since the exact nature of the interference depended on the instructions being processed, the data being fetched, and probably the addresses from which it was being fetched, as well as the properties of the CPU circuitry.</p>
<p>I encountered one version of this, using one of DEC&#8217;s PDP10 processors (probably a KI10, although I can&#8217;t be sure) and the DECUS program library also contains a similar program for the PDP8. I&#8217;ve come across one reference to a recording being made, this time of the same technique on an IBM 1401, but <a href="http://www.npr.org/templates/story/story.php?storyId=7408766">the available example</a> uses samples from a recording to make tunes rather than the original recording itself being musical.</p>
<p>All these examples illustrate, to different degrees, the digital preservation problem that pixelh8&#8242;s work brought to mind. The RFI music was exquisitely sensitive to properties of the original electronics. It wasn&#8217;t enough that the computer ran the same instruction set. DEC&#8217;s PDP10 series had four generations of processor &#8211; the KA10, KI10, KL10 and KS10 &#8211; each of which could have run the music program without error. But since each was entirely different at the circuit level, only one of those machines would produce the right sort of radio interference to generate the desired effect. <a href="http://www.inwap.com/pdp10/">Emulators</a> of these systems now exist, and a number of manufacturers have produced hardware-level recreations such as the Toad 1, Foonly and Systems Concepts machines. But neither the software nor the hardware emulators will have the same radio interference properties, so again, although they will run the original program without errors, it won&#8217;t produce the desired effects.</p>
<p>In fact, it would be extremely difficult to produce an electronic equivalent of those systems again, and as technology moves on such problems will only get more difficult. The early systems, which used primarily discrete components (transistors, capacitors, etc) could be built again by sufficiently dedicated enthusiasts. But chip fabrication is another thing entirely. The music that those programs and systems produced is essentially unpreservable except by recording it at the time. The significant properties that emerge from the combination of software and hardware are extremely difficult to characterise, and no environment that allows the software to run will emulate the one property of the original environment that really matters.</p>
<p>Some stuff really, really can&#8217;t be digitally preserved. It&#8217;s good to remember that sometimes.</p>
<p>(And if you are aware of any other recordings of computer &#8216;music&#8217; of this type, I and others would very much like to hear of them.)</p>
<p>[Updated to remove an embedded YouTube video which causes problems with the Iphone. Instead, you'll have to follow the link to <a href="http://www.youtube.com/watch?v=_uU4BzSQQmY">see and hear some hardware-based chiptune hacking.</a>]</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/05/31/on-the-limits-of-preservation/' addthis:title='On the limits of preservation '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2009/05/31/on-the-limits-of-preservation/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>rpmeet &#8211; the JISC Repositories and Preservation Programme Meeting</title>
		<link>http://dablog.ulcc.ac.uk/2009/05/10/rpmeet/</link>
		<comments>http://dablog.ulcc.ac.uk/2009/05/10/rpmeet/#comments</comments>
		<pubDate>Sun, 10 May 2009 20:32:31 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[AIDA]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[PRIMO]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[rpmeet]]></category>
		<category><![CDATA[SNEEP]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=575</guid>
		<description><![CDATA[Some of us at ULCC, and over 100 other people from around the UK, spent a couple of days this week at the Aston Business School reviewing the outcomes of JISC&#8217;s repositories and preservation programme and looking forward to what comes next. It was a useful and stimulating couple of days &#8211; the best programme [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/05/10/rpmeet/' addthis:title='rpmeet &#8211; the JISC Repositories and Preservation Programme Meeting '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.jisc.ac.uk/whatwedo/programmes/reppres.aspx"><img align="left" width="320" height="247" src="http://www.jisc.ac.uk/whatwedo/programmes/~/media/JISC/programmes/reppres/rpprog_structure_smaller3.ashx" alt="Diagram of programme elements" /></a><br />
Some of us at ULCC, and over 100 other people from around the UK, spent a couple of days this week at the Aston Business School reviewing the outcomes of JISC&#8217;s <a href="http://www.jisc.ac.uk/whatwedo/programmes/reppres.aspx">repositories and preservation programme</a> and looking forward to what comes next. It was a useful and stimulating couple of days &#8211; the best programme meeting I&#8217;ve attended so far. The few projects that weren&#8217;t represented at the meeting missed out in a lot of ways. If you&#8217;re involved in a JISC project, make sure you, your project manager, or both of you go to a programme meeting when you are invited. You&#8217;ll learn a lot, make some useful contacts, save some time, get some useful ideas and possibly lay the groundwork for future projects or collaborations.</p>
<p>I began the day by chairing the final meeting of <a href="http://www.jisc.ac.uk/aboutus/committees/workinggroups/repositoriespreservation.aspx">RPAG</a>(the repositories and preservation advisory group.) <span id="more-575"></span>We had a short meeting mainly to follow up on discussions we had been having on how the group had operated and how JISC might make use of advisory bodies in future. Those who expressed an opinion all felt it had been useful to them, but we all had concerns about how our time, and the JISC Executive&#8217;s time, might have been used more effectively. Future advisory groups may try to split responsibility for some areas into smaller working groups. All were agreed that the face-to-face meetings were invaluable, but we weren&#8217;t all agreed on which online technology would be best to use in between times. Enthusiasts for tools like ideascale were matched by those who found them unusable.</p>
<div style="width:425px;text-align:left" id="__ss_1399859"><object style="margin:0px" width="300" height="250"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=rpplenary200905-090507081547-phpapp01&#038;rel=0&#038;stripped_title=jisc-repositories-and-preservation-programme-plenary-presentation-2009" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=rpplenary200905-090507081547-phpapp01&#038;rel=0&#038;stripped_title=jisc-repositories-and-preservation-programme-plenary-presentation-2009" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="300" height="250"></embed></object>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/kevinashley">kevinashley</a>.</div>
</div>
<p>The meeting proper opened with some background and perspective from Rachel Bruce and Neil Grindley of JISC and myself. I tried &#8211; partly seriously, but without much expectation of accuracy &#8211; to give a one-line summary of what each project set out to do. But there were two things I meant to say which I failed to do. One was to look forward to the theme of day 2 (Value) and stress that repositories are not ends in themselves, but need to be thought of in terms of value, impact and benefits to someone. The second point I omitted was to remind us that , for innovation projects, failure in one sense can still mean success, as long as we understand the nature of the failure and are able to use it to improve and adapt future work. Not achieving what you set out to do is disappointing. Analysing the reasons for that and making sure others are aware of them can be of great value. </p>
<p>But it was the rest of the event that provided greatest interest. The discussion sessions on text mining, research data, teaching and learning repositories and more; presentations from projects from stakeholder, developer and other perspectives; posters and demos from many of the projects; and the fever of activities in the ideas room, which deployed technology ranging from post-it notes upwards to catalyse, capture and refine ideas from the attendees. These activities gave the event much more of a participatory feel &#8211; everyone became a contributor rather than being a consumer.</p>
<p><a href="http://www.wordle.net/gallery/wrdl/830199/AIDA_project_proposal" title="Wordle: AIDA project proposal"><img src="http://www.wordle.net/thumb/wrdl/830199/AIDA_project_proposal"  alt="Wordle: AIDA project proposal" style="padding:4px;border:1px solid #ddd" align="right" /></a> I learned a few things over the course of a day or two, most of them unexpected. David Flanders (via Chris Rusbridge) passed on the neat idea of feeding funding proposals through Wordle before marking them. That&#8217;s what ULCC&#8217;s <a href="http://aida.jiscinvolve.org/">AIDA</a> project looked like. Perhaps you ought to try the same with your proposals prior to submitting them?</p>
<p>I learned that talking unprepared and unscripted to a video camera doesn&#8217;t produce great results unless you&#8217;ve had practice or training &#8211; neither of which I&#8217;ve had. I knew that in an abstract sense and now have the unfortunate experience to back it up. But Andy McGregor and Dave Flanders did capture some other people talking far more sense than I did and far more clearly, and you can see the results on the <a href="http://www.youtube.com/user/dev8D">dev8D youtube channel</a>.</p>
<p>Andrew Prescott&#8217;s overview of the Welsh Repository Network provided us with the surprising finding that smaller institutions are more, not less, likely to want to run their own repository rather than contract it out to someone else.</p>
<p>And via a serendipitous typo, we all contemplated whether working in a repositoire might not be an altogether more rarified and sophisticated career option than working with a repository.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/05/10/rpmeet/' addthis:title='rpmeet &#8211; the JISC Repositories and Preservation Programme Meeting '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2009/05/10/rpmeet/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>DPC sponsors DPTP scholarships for May</title>
		<link>http://dablog.ulcc.ac.uk/2009/04/16/dpc-sponsors-dptp-scholarships-for-may/</link>
		<comments>http://dablog.ulcc.ac.uk/2009/04/16/dpc-sponsors-dptp-scholarships-for-may/#comments</comments>
		<pubDate>Thu, 16 Apr 2009 15:29:42 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[DPC]]></category>
		<category><![CDATA[DPTP]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[survey]]></category>
		<category><![CDATA[training]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=517</guid>
		<description><![CDATA[We&#8217;re pleased to say that the DPC has agreed to sponsor two places at the forthcoming open run of the Digital Preservation Training Programme (DPTP) at SOAS, 18-20 May 2009. Attendance at DPTP itself is open to everyone, but the sponsored places are only available to staff of DPC member institutions. We&#8217;re pleased that this [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/04/16/dpc-sponsors-dptp-scholarships-for-may/' addthis:title='DPC sponsors DPTP scholarships for May '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>We&#8217;re pleased to say that the DPC has <a href="http://www.dpconline.org/graphics/training/0905Leadership.html">agreed to sponsor</a> two places at the forthcoming open run of the <a href="http://dptp.org/">Digital Preservation Training Programme</a> (DPTP) at SOAS, 18-20 May 2009. Attendance at DPTP itself is open to everyone, but the sponsored places are only available to staff of DPC member institutions. We&#8217;re pleased that this continues the valuable relationship we&#8217;ve had between the training programme and DPC since its inception. It also gives us the ideal excuse to welcome William Kilbride back as one of the tutors on the course &#8211; he&#8217;s a talented teacher and a joy to work with.</p>
<p>DPTP is of value to anyone with responsibility for digital preservation in an institutional context &#8211; its aim is to equip you with the knowledge to effect change in the organisation to allow the right things to happen. (If your primary responsibility is scientific data curation, you may find the DCC&#8217;s <a href="http://www.dcc.ac.uk/events/digital-curation-101-2009/">DC 101</a> course more applicable.)</p>
<p>Applications need to be in by May 5th &#8211; it&#8217;s not an onerous process, so don&#8217;t delay.  </p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/04/16/dpc-sponsors-dptp-scholarships-for-may/' addthis:title='DPC sponsors DPTP scholarships for May '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2009/04/16/dpc-sponsors-dptp-scholarships-for-may/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Set a blog to catch a blog…</title>
		<link>http://dablog.ulcc.ac.uk/2009/03/23/set-a-blog-to-catch-a-blog%e2%80%a6/</link>
		<comments>http://dablog.ulcc.ac.uk/2009/03/23/set-a-blog-to-catch-a-blog%e2%80%a6/#comments</comments>
		<pubDate>Mon, 23 Mar 2009 14:01:43 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[web 2.0]]></category>

		<guid isPermaLink="false">http://jiscpowr.jiscinvolve.org/2009/03/23/set-a-blog-to-catch-a-blog/</guid>
		<description><![CDATA[Much discussion of blog preservation focuses on how to preserve the blogness of blogs: how can we make a web archive store, manage and deliver preserved blogs in a way that is faithful to the original?


Since it is blogging applications that provide this stucture and behaviour (usually from simple database tables of Posts, Comments, Users, [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2009/03/23/set-a-blog-to-catch-a-blog%e2%80%a6/' addthis:title='Set a blog to catch a blog… '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>Much discussion of blog preservation focuses on how to preserve the blogness of blogs: how can we make a web archive store, manage and deliver preserved blogs in a way that is faithful to the original?</p>
<p><a href="http://jiscpowr.jiscinvolve.org/wp/files/2009/03/6698812_00a14b18c2_o1.jpg" title="Russian Dolls (275)" ></a></p>
<p><a href="http://jiscpowr.jiscinvolve.org/wp/files/2009/03/6698812_00a14b18c2_o1.jpg" title="Russian Dolls (275)" ><img src="http://jiscpowr.jiscinvolve.org/wp/files/2009/03/6698812_00a14b18c2_o1.jpg" alt="Nesting..." align="right" /></a></p>
<p>Since it is blogging applications that provide this stucture and behaviour (usually from simple database tables of Posts, Comments, Users, etc), perhaps we should consider making blogging software behave more like an archive. How difficult would that be? Do we need to hire a developer?</p>
<p>One interesting thing about Wordpress is the number of uses its simple blog model has been put to. Under-the-hood it is based on a remarkably <a href="http://www.artandsoul.co.uk/downloads/wp_schema.pdf" onclick="javascript:urchinTracker ('/outbound/article/www.artandsoul.co.uk');">simple data base schema</a> of about 10 tables and a suite of PHP scripts, functions and libraries that provide the interface to that data. Its huge user-base has contributed a wide variety of themes and additional functions. It can be turned into a Twitter-like microblog (<a href="http://en.blog.wordpress.com/2009/03/11/p2-the-new-prologue/" onclick="javascript:urchinTracker ('/outbound/article/en.blog.wordpress.com');">P2</a> and <a href="http://en.blog.wordpress.com/2008/01/28/introducing-prologue/" onclick="javascript:urchinTracker ('/outbound/article/en.blog.wordpress.com');">Prologue</a>) or a fully-fledged social network (Wordpress MU, <a href="http://buddypress.org/" onclick="javascript:urchinTracker ('/outbound/article/buddypress.org');">Buddypress</a>).</p>
<p>Another possibility exploited by a 3rd-party plugin is that of using Wordpress as an aggregating blog, collecting posts automatically via RSS from other blogs: this seems like a promising basis for starting to develop an archive of blogs, in a blog.</p>
<p>The plugin in question is called <a href="http://projects.radgeek.com/feedwordpress/" onclick="javascript:urchinTracker ('/outbound/article/projects.radgeek.com');">FeedWordPress</a>. It uses the Links feature of Wordpress as the basis of a list of feeds which it checks regularly, importing new content when it finds it, as Posts within Wordpress.</p>
<p>I installed FeedWordPress a while ago on ULCC&#8217;s <a href="http://dablog.ulcc.ac.uk/2009/03/20/if-you-can-keep-your-blog/" onclick="javascript:urchinTracker ('/outbound/article/dablog.ulcc.ac.uk');">DA Blog</a>, and set it up to import all of the ULCC-contributed posts to JISC-PoWR, i.e. those by Ed Pinsent, Kevin Ashley and myself. I did this because I felt that these contributions warrant being part of ULCC&#8217;s insitutional record of its activities, and  that DA Blog was the best to place to address this, as things stand.</p>
<p>JISC-PoWR also runs on Wordpress, therefore I knew that, thanks to Wordpress&#8217;s REST-like interface and <a href="http://dablog.ulcc.ac.uk/2008/05/23/powring-up-the-powr-project/" onclick="javascript:urchinTracker ('/outbound/article/dablog.ulcc.ac.uk');">Cool URIs</a>, it is easy not only to select an individual author&#8217;s posts (<a href="http://jiscpowr.jiscinvolve.org/author/kevinashley"><code>/author/kevinashley</code></a>) but also the RSS feed thereof (<a href="http://jiscpowr.jiscinvolve.org/author/kevinashley/feed"><code>/author/kevinashley/feed</code></a>). This, for each of the three author accounts, was all I needed to start setting up FeedWordPress in DA Blog to take an automatic copy each time any of us contributed to JISC-PoWR.The &#8220;author&#8221; on the original post has been mapped to an author in DA Blog, so posts are automatically (and correctly) attributed. The import also preserves, in custom fields, a considerable amount of contextual information about the posts in their original location.</p>
<p>In many cases, I&#8217;ve kept the imported post private in DA Blog. &#8220;Introductory&#8221; posts for the JISC-PoWR project blog, for example: as editor of DA Blog, I didn&#8217;t feel we needed to trouble our readers there with them; nevertheless they are stored in the blog database, as part of &#8220;the record&#8221; of our activities.</p>
<p>This is, admittedly, a very small-scale test of this approach, but the kind of system I&#8217;ve described is unquestionably a rudimentary blog archive, that can be set up relatively easily using WordPress and FeedWordPress &#8211; no coding necessary. Content is then searchable, sortable, exportable (SQL, RSS, etc). (Note, by the way, what happens when you use the Search box on the <a href="http://www.webarchive.org.uk/wayback/archive/20090101223818/http%3A//jiscpowr.jiscinvolve.org/" onclick="javascript:urchinTracker ('/outbound/article/www.webarchive.org.uk');">JISC-PoWR blog copy</a> in UKWAC: this won&#8217;t happen with this approach!)</p>
<p>For organisations with many staff blogging on diverse public platforms this would be one approach to ensuring that these activities are recorded and preserved. UKOLN, for example, manages its own <a href="http://blogs.ukoln.ac.uk/" onclick="javascript:urchinTracker ('/outbound/article/blogs.ukoln.ac.uk');">blog farm</a>, while <a href="http://ukwebfocus.wordpress.com/" onclick="javascript:urchinTracker ('/outbound/article/ukwebfocus.wordpress.com');">Brian</a> and <a href="http://remoteworker.wordpress.com" onclick="javascript:urchinTracker ('/outbound/article/remoteworker.wordpress.com');">Marieke</a> have blogs at Wordpress.com (as well as contributing to this one), and <a href="http://blog.paulwalk.net/" onclick="javascript:urchinTracker ('/outbound/article/blog.paulwalk.net');">Paul Walk</a> appears to manage his own blog and web space. This kind of arrangement is not uncommon, nor the problem of how an institution get a grasp on material in all these different locations (it&#8217;s been at the heart of many JISC-PoWR workshop discussions). A single, central, self-hosted, aggregating blog, automatically harvesting the news feeds of all these blogs, might be a low-cost, quick-start approach to securing data in The Cloud, and safeguarding the corporate memory.</p>
<p>There are more issues to address. What of comments or embedded images? Can it handle <a href="http://digitalarchiving.wordpress.com/2009/03/17/archiving-twitter/" onclick="javascript:urchinTracker ('/outbound/article/digitalarchiving.wordpress.com');">Twitter tweets</a> as well as blog posts? Does it scale? What of look-and-feel, individual themes, etc? Now we start needing some more robust tests and decisions, maybe even a developer or two to build a dedicated suite of &#8220;ArchivePress&#8221; plugins. But thanks to the power and Open-ness of  Wordpress, and the endless creativity of its many users, we have a promising and viable short-term solution, and a compelling place to start further exploration.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2009/03/23/set-a-blog-to-catch-a-blog%e2%80%a6/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Hot off the preservation press: JISC-PoWR and the Beagrie Survey</title>
		<link>http://dablog.ulcc.ac.uk/2008/11/21/hot-off-the-preservation-press-jisc-powr-and-the-beagrie-survey/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/11/21/hot-off-the-preservation-press-jisc-powr-and-the-beagrie-survey/#comments</comments>
		<pubDate>Fri, 21 Nov 2008 11:44:58 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[JISC-PoWR]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[web 2.0]]></category>
		<category><![CDATA[web preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/11/21/hot-off-the-preservation-press-jisc-powr-and-the-beagrie-survey/</guid>
		<description><![CDATA[We were pleased to have finally made available version 1.0 of the JISC PoWR Handbook. The Handbook is the result of our extensive work with UKOLN on the JISC Preservation of Web Resources project, which included three hugely valuable workshops, and extensive discussion on the PoWR blog. In the Handbook we&#8217;ve tried to cover a [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/11/21/hot-off-the-preservation-press-jisc-powr-and-the-beagrie-survey/' addthis:title='Hot off the preservation press: JISC-PoWR and the Beagrie Survey '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/avantgardener4/2110782575/" title="Raspberry Jam by avantgardener4 on Flickr, CC by-nc-nd"><img src="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/11/2110782575_a2e3d0d0b9_m.jpg" alt="Raspberry Jam by avantgardener4 on Flickr, CC by-nc-nd" align="right" width="120" /></a> We were pleased to have finally made available version 1.0 of the <a href="http://jiscpowr.jiscinvolve.org/handbook/">JISC PoWR Handbook</a>. The Handbook is the result of our extensive work with UKOLN on the JISC Preservation of Web Resources project, which included three hugely valuable workshops, and extensive discussion on the <a href="http://jiscpowr.jiscinvolve.org">PoWR blog</a>.</p>
<p>In the Handbook we&#8217;ve tried to cover a huge, and sometimes controversial, area in as accessible a way as possible. The workshops, attended by both web-management and records-management professionals from HE institutions, brought  a wide range of concerns and issues to light. It&#8217;s been quite a job fitting it all in.</p>
<p>Even as the project progressed, we became aware of new developments in thinking about how to approach the special issues of managing web resources, including everybody&#8217;s favourite new fast automatic Web 2.0 applications. We saw the publication of Steve Bailey&#8217;s Records Management 2.0 book, TNA&#8217;s Web Continuity project, and further web archiving developments at UKWAC. We&#8217;ve even heard it whispered in some quarters that approaches to preservation may need a more profound reassessment in the context of the Web and the Cloud. Many of these issues were recorded on the PoWR blog, and we tried to reflect as much of this in the Handbook as possible.</p>
<p>Another recent JISC publication, <a href="http://www.jisc.ac.uk/Home/publications/publications/jiscpolicyfinalreport.aspx" target="_blank">The Digital Preservation Policies Study </a>by Charles Beagrie Ltd, published at the same time, is complementary in many ways, and reassured us that many of the conclusions we groped towards in the Handbook were not so wide of the mark!<span id="more-234"></span> Like PoWR, the  Digital Preservation Policies Study identified the necessity of high-level policy engagement as the <em>sine qua non</em> of effective digital preservation.</p>
<p style="margin-left: 40px">Digital preservation solutions are undoubtedly partly technical, and the tools being created will enhance digital longevity, but these solutions are also equally dependent on organisational issues. It is important to remember that digital preservation relies on the interaction between the digital preservation environment and wider organisational objectives and procedural issues. These could be financial and staffing issues, collection management, legal obligations, auditing requirements, and other strategies and policies. In this respect, recognition by organisational divisions that digital data is important and key to the successful running of an organisation is crucial.</p>
<p style="margin-left: 40px" align="right"><em><a href="http://www.jisc.ac.uk/Home/publications/publications/jiscpolicyfinalreport.aspx" target="_blank">The Digital Preservation Policies Study</a></em>, p.11</p>
<p>Among the other recommendations the Study shares with PoWR include:</p>
<ul>
<li>Analysis of existing policies and strategies, and how our work can support them even if said polices don&#8217;t explicitly refer to preservation or digital assets</li>
<li>Taking a phased approach &#8211; nothing happens all at once. (PoWR recommends pilot projects and working with supportive departments.)</li>
<li>Careful scoping of preservation requirements. (With regard to web resources, PoWR suggests not everything, not every version, and not forever.)</li>
<li>Identifying if and where existing systems will do the job</li>
<li>Consideration of lifecycle, publication, and retention schedules.</li>
</ul>
<p>The Charles Beagrie survey is a very concise and accessible contribution to the field, and we hope the PoWR Handbook, with its specific focus on established and emerging Web issues, and attention to the detailed and everyday concerns of our many contributors and correspondents, will be similarly useful. We also hope that the work of PoWR will continue in some form, on the blog and perhaps in the form of new projects and workshops, to fill in the gaps we left, and deal with the constantly emerging Web developments. Anyone for PoWR 2.0?</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/11/21/hot-off-the-preservation-press-jisc-powr-and-the-beagrie-survey/' addthis:title='Hot off the preservation press: JISC-PoWR and the Beagrie Survey '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/11/21/hot-off-the-preservation-press-jisc-powr-and-the-beagrie-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ULCC/Portico/DPC consortium to undertake JISC preservation study</title>
		<link>http://dablog.ulcc.ac.uk/2008/10/29/ulccporticodpc-consortium-to-undertake-new-jisc-preservation-study/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/10/29/ulccporticodpc-consortium-to-undertake-new-jisc-preservation-study/#comments</comments>
		<pubDate>Wed, 29 Oct 2008 11:00:37 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digitisation]]></category>
		<category><![CDATA[DART]]></category>
		<category><![CDATA[digitisation]]></category>
		<category><![CDATA[DPC]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Portico]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/10/29/ulccporticodpc-consortium-to-undertake-new-jisc-preservation-study/</guid>
		<description><![CDATA[We&#8217;ve just heard that a consortium of ULCC, Portico and the Digital Preservation Coalition has been awarded the contract by JISC to undertake a Preservation Study of recent digitisation activities. The JISC Digitisation Programme has made a wide variety of valuable resources digitally accessible, including: British Newspapers (1620-1900) Newsfilm Online First World War Poetry Newspaper [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/29/ulccporticodpc-consortium-to-undertake-new-jisc-preservation-study/' addthis:title='ULCC/Portico/DPC consortium to undertake JISC preservation study '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.jisc.ac.uk/whatwedo/programmes/digitisation" style="margin: 0pt 1ex 1ex; float: right" title="JISC Digitisation Projects"><img src="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/10/digipressurvey.jpg" alt="JISC Digitisation Projects" /></a>We&#8217;ve just heard that a consortium of ULCC, <a href="http://www.portico.org/" target="_blank">Portico</a> and the <a href="http://www.dpconline.org/" target="_blank">Digital Preservation Coalition</a> has been awarded the contract by JISC to undertake a <a href="http://digitisation.jiscinvolve.org/2008/10/27/study-of-preservation-plans-of-digitiation-projects/" target="_blank">Preservation Study</a> of  recent digitisation activities.</p>
<p>The JISC Digitisation Programme has made a wide variety of valuable resources digitally accessible, including:</p>
<ul>
<li>British Newspapers (1620-1900)</li>
<li>Newsfilm Online</li>
<li>First World War Poetry</li>
<li>Newspaper Cartoons</li>
<li>Welsh Periodicals</li>
<li>Pre Raphaelite drawings</li>
<li>East London Theatre Archive</li>
</ul>
<p>More information about these, and other projects, is available on the <a href="http://www.jisc.ac.uk/whatwedo/programmes/digitisation">JISC Digitisation</a> web page.</p>
<p>The project will review the preservation plans and processes of the sixteen projects funded under Phase 2 of the JISC Digitisation Programme, and identify any medium or long-term access risks to the digitised content. It will also produce recommendations &#8211; for individual projects and for JISC as a whole &#8211; for processes and strategies to mitigate the risks, and case studies which would be helpful to the broader community.</p>
<p>This is an exciting opportunity for us to apply and extend the experience we have gained working on a range of  projects in the field, including the European Visual Archive Market-validation Project (EVAMP) and risk assessments for the recently launched <a href="/2008/10/23/all-the-news-thats-fit-to-download/" target="_blank">Newsfilm Online</a> project. We will shortly be creating an online home to for the project collaboration and development, and will use DA Blog and the <a href="http://digipressurvey.jiscinvolve.org/" target="_blank">DigiPresSurvey Blog</a> (on <a href="http://jiscinvolve.org/" target="_blank">JISCInvolve</a>) to keep you updated.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/29/ulccporticodpc-consortium-to-undertake-new-jisc-preservation-study/' addthis:title='ULCC/Portico/DPC consortium to undertake JISC preservation study '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/10/29/ulccporticodpc-consortium-to-undertake-new-jisc-preservation-study/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digital Preservation Training Programme at SOAS</title>
		<link>http://dablog.ulcc.ac.uk/2008/10/28/digital-preservation-training-programme-at-soas/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/10/28/digital-preservation-training-programme-at-soas/#comments</comments>
		<pubDate>Tue, 28 Oct 2008 14:00:53 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[DPTP]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[survey]]></category>
		<category><![CDATA[training]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/10/28/digital-preservation-training-programme-at-soas/</guid>
		<description><![CDATA[We have just finished our most recent Digital Preservation Training Programme which took place last week at the School of Oriental and African Studies. We had a good mixture of data asset managers and IT specialists, as well as archivists and the like, mainly from the public sector, but also a good crew from Rogers [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/28/digital-preservation-training-programme-at-soas/' addthis:title='Digital Preservation Training Programme at SOAS '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p> <a href="http://www.ulcc.ac.uk/dptp" title="DPTP logo"><img src="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/10/dptp_logo.thumbnail.jpg" alt="DPTP logo" style="float: right" align="right" /></a> We have just finished our most recent <a href="http://www.ulcc.ac.uk/dptp" target="_blank">Digital Preservation Training Programme</a> which took place last week at the School of Oriental and African Studies. We had a good mixture of data asset managers and IT specialists, as well as archivists and the like, mainly from the public sector, but also a good crew from <a href="http://www.richardrogers.co.uk/r" target="_blank">Rogers Stirk Harbour &amp; Partners</a>.  Comments included:</p>
<ul>
<li>&#8216;It was all good. Thank you for the extra bits of reading in the pack too.&#8217;</li>
<li>&#8216;This course came highly recommended and didn&#8217;t disappoint!&#8217;</li>
<li>&#8216;Even the things that don&#8217;t directly apply to me now were useful to know for the future, and to understand the needs of our collaborators.&#8217;</li>
</ul>
<p>We are now studying the feedback with a view to further improving the course and looking at developing some social networking opportunities as well as looking at e-learning. The next DPTP will be in February  2009. Once dates are confirmed we will post them here.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/28/digital-preservation-training-programme-at-soas/' addthis:title='Digital Preservation Training Programme at SOAS '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/10/28/digital-preservation-training-programme-at-soas/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DCC 101 at the NeSC</title>
		<link>http://dablog.ulcc.ac.uk/2008/10/08/dcc-101-at-the-nesc/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/10/08/dcc-101-at-the-nesc/#comments</comments>
		<pubDate>Wed, 08 Oct 2008 22:26:50 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[curation]]></category>
		<category><![CDATA[DCC]]></category>
		<category><![CDATA[DPTP]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[lifecycle]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[survey]]></category>
		<category><![CDATA[training]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/10/08/dcc-101-at-the-nesc/</guid>
		<description><![CDATA[I&#8217;m writing this whilst listening to Michael Day of UKOLN talk about preservation as one component of the data curation lifecycle espoused by the DCC. He is one of a number of people invited by the DCC to give lectures and create exercises as part of the pilot of DCC 101, a course aimed at [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/08/dcc-101-at-the-nesc/' addthis:title='DCC 101 at the NeSC '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m writing this whilst listening to Michael Day of UKOLN talk about preservation as one component of the <a href="http://www.dcc.ac.uk/FAQs/dcc-curation-lifecycle-model/">data curation lifecycle</a> espoused by the DCC. He is one of a number of people invited by the DCC to give lectures and create exercises as part of the pilot of <a href="http://www.dcc.ac.uk/events/digital-curation-101-2008/">DCC 101</a>, a course aimed at researchers and the data curators who work with them.</p>
<p>I&#8217;m here because I was invited to speak this morning about Ingest, the previous stage in the <a href='http://dablog.ulcc.ac.uk/wp-content/uploads/2008/10/nesc.jpg' title='NeSC - the view from reception'><img src='http://dablog.ulcc.ac.uk/wp-content/uploads/2008/10/nesc.thumbnail.jpg' alt='NeSC - the view from reception' class="float-right" style="border: 1px solid #dddddd; padding: 4px"/></a>lifecycle. That was followed by a stimulating set of exercises developed and led by <a href="http://www.cs.man.ac.uk/~embury/">Suzanne Embury</a> which got attendees to think about data quality and the varied measures we might use for it. Suzanne&#8217;s exercise was based on work by Wang and Strong (<a href="http://web.mit.edu/tdqm/www/tdqmpub/beyondaccuracy_files/beyondaccuracy.html">&#8220;Beyond accuracy: what data quality means for data consumers&#8221;</a>) which starts from the premise that we can never directly measure data quality &#8211; rather, we must construct measures which are proxies for quality, or measure only one facet of it. This was new work to me, but it was reassuring to see <span id="more-198"></span>that it fitted very well with my own experience and with what I know of the ways in which others have approached quality in data. Their original paper took 176 measures of data quality and condensed them into 15 generic aspects of quality. Not all are applicable to all settings, and part of the exercise today got our students considering which were appropriate to them and how they might measure them. This struck me as being particularly useful, as it provides a formal means for deciding which aspects of quality are important to your consumers. Effort can then be focussed on dealing with those aspects well, and not wasting effort on areas that are of lesser relevance, or for which measures are difficult to establish.</p>
<p>The DCC 101 looks to be a promising addition to the canon of training and education in digital curation. We&#8217;re looking forward to working with Joy Davidson and her colleagues at the DCC to explore how our training provision and theirs can complement, rather than compete with, each other. There are plans to discuss these issues at a workshop following the IDCC this year, and I&#8217;m interested in opinions from a wider audience that would help inform our discussions. If you are a potential attendee at one or more of these training courses, would it help you to see them more integrated ? Do you find it easy to identify the training and education that best suits you at present, and what could improve that situation ? If you are responsible for staff that may need training now and in the future, what type of training provision best suits you and your organisation and what constraints &#8211; budgets, time, place &#8211; should we all be aware of ? Responses as comments to this post, or <a href="mailto:K.Ashley@ulcc.ac.uk">by email</a>, would help us all greatly to plan future training that will meet the community&#8217;s needs.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/08/dcc-101-at-the-nesc/' addthis:title='DCC 101 at the NeSC '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/10/08/dcc-101-at-the-nesc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>iPres2008 &#8211; first impressions</title>
		<link>http://dablog.ulcc.ac.uk/2008/10/01/ipres2008-first-impressions/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/10/01/ipres2008-first-impressions/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 22:15:10 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[British Library]]></category>
		<category><![CDATA[conferences]]></category>
		<category><![CDATA[DPC]]></category>
		<category><![CDATA[DPTP]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[iPres2008]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[JISCPoWR]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/10/01/ipres2008-first-impressions/</guid>
		<description><![CDATA[iPres2008 finished yesterday, and overall it was a useful and informative event. It took place a mere 15 minutes walk from our current home, so we took advantage of its proximity and attended en masse. Chris Rusbridge has already done an excellent job of some near-real-time reporting on the sessions, and I&#8217;m not going to [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/01/ipres2008-first-impressions/' addthis:title='iPres2008 &#8211; first impressions '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.bl.uk/ipres2008/">iPres2008</a> finished yesterday, and overall it was a useful and informative event. It took place a mere 15 minutes walk from our current home, so we took advantage of its proximity and attended <em>en masse.</em></p>
<p>Chris Rusbridge has already done an excellent job of some <a href="http://digitalcuration.blogspot.com/search/label/iPres-2008">near-real-time reporting</a> on the sessions, and I&#8217;m not going to try to replicate that level of detail in this post. As a first-time attendee at iPres, I was impressed by the professional mix attending, which took in hard-core computer science, digital preservation and curation folk, repository managers and those from the traditional custodial professions. In that respect it was very reminiscent of the <a href="http://ec.europa.eu/transparency/archival_policy/dlm_forum/index_en.htm">early DLM-Forums</a>, which were eye-opening for me when I attended the first one in 1996. But it was also interesting to observe that, just as DLM was dominated by archivists and records managers, iPres is a very library-oriented event. For example, those who expressed a desire for a Europe-wide event bringing together all those with an interest in digital preservation didn&#8217;t seem to be aware that the DLM-Forums existed.</p>
<p>One positive observation (of many) is that there is more reassuring news on the oft-vexed issue of IPR barriers to digital preservation. At the close of day 1, we heard a summary of the findings of the international survey on the impact of copyright law on digital preservation. <span id="more-196"></span>That indicated that the UK had one of the strictest set of constraints of all the countries looked at &#8211; in terms of who is permitted to carry out certain acts in the name of preservation and what those acts are. Other countries have more relaxed exemptions and that doesn&#8217;t appear to be causing the major rightsholdfers any significant financial loss. That should give us hope for some change in the law in the UK at least. And Horst Foster, making the keynote speech opening day 2, appeared to echo this at the European level, implying that the case for change had been made and accepted, although he was notably cautious about making any promises as to when this change might come about.</p>
<p>That&#8217;s an improvement on the situation in Europe a few years ago, though, when I was at one of a number of expert panels helping the European Commission to frame the forthcoming research questions and challenges in the digital preservation arena. At that time we were all warned off mentioning the (C) word at all &#8211; it seemed to have a somewhat toxic flavour. It&#8217;s really heartening to see that things have changed.</p>
<p>One should add a note of caution, however. After Adrienne Muir had commented favourably on how Australian law allows institutions such as the national library and archives to bypass DRM systems in order to preserve material, Colin Webb injected a note of caution. It is apparently still illegal to manufacture or to import a device to Australia which allows a DRM system to be bypassed. But if the national library happens to find such a device on its premises, it can use it without fear of breaking the law. Still some way to go, then, before the law is &#8216;joined up and working&#8217; &#8211; the strapline of iPres2008.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/10/01/ipres2008-first-impressions/' addthis:title='iPres2008 &#8211; first impressions '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/10/01/ipres2008-first-impressions/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Draft standard for long-term archiving of CAD data</title>
		<link>http://dablog.ulcc.ac.uk/2008/09/08/draft-standard-for-long-term-archiving-of-cad-data/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/09/08/draft-standard-for-long-term-archiving-of-cad-data/#comments</comments>
		<pubDate>Mon, 08 Sep 2008 16:17:05 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[CAD]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[standards]]></category>
		<category><![CDATA[Technical]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/09/08/draft-standard-for-long-term-archiving-of-cad-data/</guid>
		<description><![CDATA[September&#8217;s edition of BSI&#8217;s Update Standards magazine alerted me to another batch of standards, currently at the public comment stage, which are of particular relevance to digital preservation. The BS EN 9300 family is entitled &#8216;Long term archiving and retrieval of digital technical product documentation such as 3D, CAD and PDM data&#8216; and 5 parts [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/09/08/draft-standard-for-long-term-archiving-of-cad-data/' addthis:title='Draft standard for long-term archiving of CAD data '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>September&#8217;s edition of BSI&#8217;s Update Standards magazine alerted me to another batch of standards, currently at the public comment stage, which are of particular relevance to digital preservation. The BS EN 9300 family is entitled &#8216;<strong>Long term archiving and retrieval of digital technical product documentation such as 3D, CAD and PDM data</strong>&#8216; and 5 parts (100; 110; 007; 005; 002 and 115) are open for comment until September 30th. I was initially surprised that I had heard nothing of this series of standards before, and wasn&#8217;t sure if this was simply lack of observation on my part or because they had come from an entirely different domain. They clearly aren&#8217;t new &#8211; unlike BS10008, which I <a href="http://dablog.ulcc.ac.uk/2008/06/26/bs-10008-time-is-running-out/">wrote about in June</a>, this is not a home-grown British Standard but one which is being proposed &#8216;for adoption&#8217; &#8211; which means that it&#8217;s already been adopted by another body somewhere. That status also means that you can&#8217;t use BSI&#8217;s excellent online commenting system. You have to buy the drafts from BSI on paper, so far as I can see.</p>
<p>In fact, as I was relieved to note, this group of standards isn&#8217;t entirely new to the digital preservation community, and the authors are also aware of general DP standards such as OAIS.  They derive from a group of standards known as <a href="http://www.steptools.com/library/standard/">STEP</a> (Standard for Exchange of Product Model Data), codified in ISO 10303. <span id="more-190"></span>STEP came up in a presentation and discussions at <a href="http://homepages.inf.ed.ac.uk/hmueller/presdb07/">PresDB&#8217;07</a>, although it&#8217;s been in development since the turn of the century at least. But STEP is a huge family of standards, and this particular work appears to have emerged from specific work going on in the aerospace industry, which is known to have had concerns about long-term survival of CAD data for some time.</p>
<p><a href="http://www.asd-stan.org/Lotar.html">LOTAR</a> is one of the results of this concern, and <a href="http://www.aristote.asso.fr/sem/sem0804.d/005-Duchier-airbus.pdf">a presentation by Pierre Duchier</a> of AIRBUS at an <a href="http://www.aristote.asso.fr/sem/semnext.html">Aristote conference</a> in April this year gives a clear picture of the concerns of industry and the approach they are taking. LOTAR and related work were also covered at a invitation-only event at Bath in 2007, <a href="http://www.ukoln.ac.uk/events/ltkr-2007/programme.html">Atlantic Workshop on Long-Term Knowledge Retrieval</a> whose attendees included many names familiar from the DCC and the repository community.</p>
<p>Overall, this is reassuring. Here is a set of digital preservation standards being developed and driven by concerns in industry, but where the work is taking place in dialogue with with the academic research and development community. CAD data in particular has long been a concern at the Archaeology Data Service, which has significant holdings of Autocad files, and it will be interesting to see to what extent LOTAR has relevance for such activities. In the meantime, I can see I&#8217;ve got a lot of reading to do to catch up, but it would be interesting to hear from others who have more insight into the possible use of these standards outside the aerospace industry.</p>
<p>Incidentally, BS EN 9300 is not to be confused with BS EN ISO 9300, an entirely different standard concerned with &#8220;<em>Measurement of gas flow by means of critical flow Venturi nozzles</em>&#8220;.  <img src='http://dablog.ulcc.ac.uk/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/09/08/draft-standard-for-long-term-archiving-of-cad-data/' addthis:title='Draft standard for long-term archiving of CAD data '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/09/08/draft-standard-for-long-term-archiving-of-cad-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>JISC-PoWR @ IWMW2008</title>
		<link>http://dablog.ulcc.ac.uk/2008/07/25/jisc-powr-iwmw2008/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/07/25/jisc-powr-iwmw2008/#comments</comments>
		<pubDate>Fri, 25 Jul 2008 10:24:47 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[IWMW2008]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[JISC-PoWR]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/07/25/jisc-powr-iwmw2008/</guid>
		<description><![CDATA[Seems I turned up just in time for the UKOLN IWMW 2008 event at Aberdeen. The sun was shining, weather was sweet, and the University buildings in Old Aberdeen looked magnificent. Not only outside &#8211; the Conference Hall in the Old King’s Library is a beautiful example of state of the art conferencing facilities, complete [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/07/25/jisc-powr-iwmw2008/' addthis:title='JISC-PoWR @ IWMW2008 '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/07/aberdeen-crombie-towers.jpg" title="Aberdeen - Crombie Gates"><img src="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/07/aberdeen-crombie-towers.thumbnail.jpg" class="float-right" alt="Aberdeen - Crombie Gates" /></a>Seems I turned up just in time for the UKOLN <a href="http://www.ukoln.ac.uk/web-focus/events/workshops/webmaster-2008/" target="_blank">IWMW 2008</a> event at Aberdeen. The sun was shining, weather was sweet, and the University buildings in Old Aberdeen looked magnificent. Not only outside &#8211; the Conference Hall in the Old King’s Library is a beautiful example of state of the art conferencing facilities, complete with individual microphones and voting panels &#8211; as impressive a debating chamber as I’ve seen since we attended DLM 1999, at the European Commission&#8217;s Charlemagne Conference Centre in Brussels.</p>
<p>To warm up my brain before the afternoon’s PoWR workshop on preserving web resources, I sampled James Currall’s enjoyable <a href="http://www.ukoln.ac.uk/web-focus/events/workshops/webmaster-2008/talks/currall/slides/IWMW08-JC-outline_Outline.html" title="discussion of web archiving" id="p:.w">discussion of web archiving</a>, and also a talk on Institutional Repositories by Stephanie Taylor of UKOLN and <a href="http://www.ukoln.ac.uk/repositories/digirep/index/Repositories_Research" title="RRT" id="v08s">RRT</a>.</p>
<p>In <em id="ni9y">The Tangled Web is but a Fleeting Dream &#8230; but then again&#8230;, </em>James Currall covered the essentials of web archiving in a clear and engaging way, drawing comparisons between the survival of WWI soldiers’ diaries, and the blogs of present-day servicemen in Iraq. Another example given was the trials and tribulations of the website for the Lockerbie Trial Briefing Unit: outsourced, host ceased trading, domain name lapsed, original website content dependent on outdated Coldfusion and Access environment. Thankfully a remote-harvested, static HTML image of the site does survive.</p>
<p>Why have there not been more conspicuous successes in web archiving in the past two decades? <span id="more-151"></span>Partly because it can be difficult to decide exactly whether and when a website should be treated as an authentic (and authenticable) record, a publication channel, or a publication itself &#8211; among other things. Partly because there are always  waves of innovation continually washing up more interesting things to do. Partly because archiving is a policy issue, yet is generally addressed as purely a techn(olog)ical problem (where it is addressed at all).</p>
<p>It was reassuring, as ever, to hear it said again that there is not one single tool that addresses all possible web preservation issues (behaviour, dynamic content, scripts, versioning, emerging standards, etc.); that depending on the Internet Archive is at best a partial and risky solution; and that &#8220;whatever you do is likely to be imperfect&#8221;.</p>
<p>James put the chamber’s electronic voting systems to entertaining and informative use with a number of snap votes: would, for example, present-day soldiers’ blogs still be available in 90 years’ time? Of course I could confidently vote ‘yes’, knowing that the publication of the JISC PoWR handbook is barely months away! Other ad hoc vox pops revealed that the audience was about as familiar with OAIS as with the Book of Ezra.</p>
<p>James is director of the <a href="http://www.gla.ac.uk/espida/" title="Espida" id="ho4c">Espida</a> Project, and this was a timely reminder that we must consider the relevance of that project’s work on assessing and controlling costs to the guidance we’re assembling for the forthcoming PoWR handbook. James and I share what must be a fairly unusual distinction of citing Thomas Carlyle in support of our cause in <a href="/2008/04/02/open-repositories-2008-in-southampton/" title="SNEEPing at OR08">recent presentations</a>. Unlike James, however, I don&#8217;t bear such an <a href="http://www.ukoln.ac.uk/web-focus/events/workshops/webmaster-2008/speakers/#currall" title="Currall or Carlyle?" target="_blank">uncanny likeness</a> to the great man.</p>
<p>Stephanie Taylor&#8217;s talk, <em id="ni9y0">Institutional Repositories: Asset or Obstacle?</em>, gave us a brief history of the Insitutional Repository, and a well-paced explanation of the value and purpose of IRs &#8211; rather more substantial, thankfully, than my Bluffer&#8217;s Guide To IRs last year. I was initially intrigued that her presentation might be leading to the conclusions that IRs are &#8220;obstacles&#8221;, but I quickly realised the alarmist title was rhetorical in nature. The talk was, among other things, an engaging appeal to the better nature of web managers and other techies to appreciate the value of, and issues faced by librarians.</p>
<p>In fact, the emergence of repositories, and the recognition of their place among institutional information systems, is a watershed in the evolution of electronic resources. Leaving it to researchers and teachers to manage what they create &#8211; variously on websites, blogs, Google Docs, thumb drive, or what you will &#8211; is no more sensible now than it ever was, if we want to ensure that they are consistently managed and accessible, let alone think about their preservation over the longer term.</p>
<p>There are many ways to approach it, none the only right and proper way. You may add extra value to it, through your choice of software or implementation (in-house or outsourced), or by using Web-Two-Oh-ish features and services. Institutional Repositories are only &#8220;one of many small conversations going on in different ways in different mediums”, but the need for an institution to manage valuable academic outputs in an orderly way is unarguable.</p>
<p>However one tweet that flew over the <a href="http://twemes.com/iwmw2008" target="_blank">Twitosphere</a> during the afternoon suggests that there may be more to do in assessing the pros and cons of different approaches &#8211; particularly the merits or otherwise of Google&#8217;s omnipresent panaceas. Mike Ellis commented: &#8220;I&#8217;ve never been clear why it is that institutions would trust a repository vendor more than someone like Google&#8230;.?&#8221;</p>
<p><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/07/aberdeen-art-richard.jpg" title="Aberdeen City Gallery"><img src="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/07/aberdeen-art-richard.thumbnail.jpg" class="float-right" alt="Aberdeen City Gallery" /></a>I won’t write up the JISC-PoWR workshop &#8211; the results of that will be on the <a href="http://jiscpowr.jiscinvolve.org/" target="_blank">JISC-PoWR blog </a>- except to say that Marieke succinctly summarised the many web preservation issues we’ve accumulated, while Brian found effective waysto draw out issues and concerns around the growing range of Web 2.0 applications finding favour among staff and students at our institutions; and we were pleased that both Stephanie and James were able to contribute to the discussion.</p>
<p>And then it was off to the Aberdeen City Art Gallery for a glass of wine or three and some inspirational art&#8230;.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/07/25/jisc-powr-iwmw2008/' addthis:title='JISC-PoWR @ IWMW2008 '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/07/25/jisc-powr-iwmw2008/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>How sticky is your wiki?</title>
		<link>http://dablog.ulcc.ac.uk/2008/07/13/how-sticky-is-your-wiki/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/07/13/how-sticky-is-your-wiki/#comments</comments>
		<pubDate>Sun, 13 Jul 2008 20:05:17 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[JISC-PoWR]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[web 2.0]]></category>
		<category><![CDATA[wikis]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/07/13/how-sticky-is-your-wiki/</guid>
		<description><![CDATA[Wetpaint wiki is just one of the many enticing, powerful, quick-fix web apps that have sprung up around Web 2.0 and Social Networking. You&#8217;ll have your own favourites no doubt: I won&#8217;t start listing them here. Wikis have grown up a lot since the first WikiWikiWeb, and now are at the online heart of many [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/07/13/how-sticky-is-your-wiki/' addthis:title='How sticky is your wiki? '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><em>Originally published on <a href="http://jiscpowr.jiscinvolve.org/2008/07/13/sticky-wiki/">JISC-PoWR blog</a>. </em></p>
<hr /><a href="http://jiscpowr-20080627.wetpaint.com/" onclick="javascript:urchinTracker ('/outbound/article/jiscpowr-20080627.wetpaint.com');">Wetpaint wiki</a> is just one of the many enticing, powerful, quick-fix web apps that have sprung up around Web 2.0 and Social Networking. You’ll have your own favourites no doubt: I won’t start listing them here. Wikis have grown up a lot since the first <a href="http://c2.com/cgi/wiki?WikiWikiWeb" onclick="javascript:urchinTracker ('/outbound/article/c2.com');">WikiWikiWeb</a>, and now are at the online heart of many educational projects at all levels, from <a href="http://flatclassroomproject2006.wikispaces.com/" onclick="javascript:urchinTracker ('/outbound/article/flatclassroomproject2006.wikispaces.com');">classroom</a>, to research and publishing.We’ve been using Wetpaint’s wiki feature as a collaborative space for our workshop feedback, and this suits us fine: once we have collated all the input for our project outputs, in a few weeks it’ll probably be no loss to us to delete the wiki, or just set it adrift among all the other jettisoned flotsam in cyberspace.</p>
<p>But what’s often given less serious consideration, in the excitement of using a third-party provider of wikis, blogs, Ning, etc., to get your collaborative hypertext project off the ground so quickly and easily &#8211; and without having to go cap or cheque in-hand to whoever guards your web space &#8211; is this key preservation issue: <em>what happens when you want to get your painstakingly intricate web of hyperlinked pages <strong>out</strong>?</em></p>
<p>There are many good reasons why you might want to do this: you might want to migrate to another wiki system or CMS, as the shape and nature of your content evolves; or put it on a permanent, persistent footing by moving it into your own domain; you might simply want to back it up or take a snapshot; or you might want to pull out information for publication in a different form. When you had one or two pages, it might have seemed trivial; but what if you now have hundreds?</p>
<p><img src="http://jiscpowr.jiscinvolve.org/files/2008/07/29921948_79b7448227_m.jpg" alt="Old Style Wiki" title="Old Style Wiki by teemow on Flickr (CC:By-Nc-Sa)" style="margin: 0pt 2ex 1ex 0pt; float: left" />Unfortunately, just as exporting the information is often a secondary consideration for wiki content creators, so it also is for the wiki farm systems. The Wetpaint Wiki discussion boards indicate that an export feature was a long time in coming (and its absence quite a blocker to adoption by a number of serious would-be users). And what was eventually provided leaves a lot to be desired.</p>
<p><a href="http://www.wetpaint.com/" onclick="javascript:urchinTracker ('/outbound/article/www.wetpaint.com');">Wetpaint’s</a> backup option “lets” you download your wiki content as a set of HTML files. Well, not really HTML files: text files with some embedded HTML-like markup. (Which version? Not declared.) Don’t expect to open these files locally in your browser and carry on surfing your wiki hypertext (even links between wiki pages need fixing). The export doesn’t include comment threads or old versions. Restoring it to your online wiki is not possible. But, for what it’s worth, you have at least salvaged some sort of raw content, that might be transformed into something like the wiki it came from, if hit with a bit of Perl script or similar.</p>
<p>I checked out <a href="http://www.wikidot.com/" onclick="javascript:urchinTracker ('/outbound/article/www.wikidot.com');">Wikidot</a> &#8211; another impressively-specced, free “wiki farm”. Wikidot’s backup option will deliver you a zip file containing each wiki page as a separate text file, containing your wiki markup as entered, as well as all uploaded file attachments. However, according to Wikidot support:</p>
<blockquote><p>you can not restore from it automatically, it does not include all page revisions, only current (latest), it does not include forum discussion or page comments.</p></blockquote>
<p>To reconstruct your wiki locally, you’ll, again, need some scripting, including using the Wikidot code libraries to reconvert its non-standard wiki-markup into standard HTML.</p>
<p>A third approach can be seen with a self-hosted copy of <a href="http://www.mediawiki.org/" onclick="javascript:urchinTracker ('/outbound/article/www.mediawiki.org');">Mediawiki</a>. Here you can select one or more pages by name, and have them exported as an XML file, which also contains revisions and assorted other metadata. Within the XML framework, the page text is stored as original wiki markup, raising the same conversion issues as with Wikidot. However, the XML file can be imported fairly easily into a different or blank instance of Mediawiki, recreating both hypertext and functionality more or less instantly.</p>
<p>In contrast to all these approaches, if you set a spidering engine like <a href="http://www.httrack.com/" onclick="javascript:urchinTracker ('/outbound/article/www.httrack.com');">HTTrack</a> or <a href="http://www.gnu.org/software/wget/" onclick="javascript:urchinTracker ('/outbound/article/www.gnu.org');">Wget</a> to work “remotely harvesting” the site, you would get a working local copy of your wiki looking pretty much as it does on the web. This might be an attractive option if you simply want to preserve a record of what you created, a snapshot of how it looked on a certain date; or <em>just in case</em> a day should come when Wetpaint.com Inc., and the rest, no longer exist.</p>
<p>However, this will only result in something like a preservation copy &#8211; not a backup that can be easily restored to the wiki, and further edited &#8211; in the event, say, the wiki is hacked/cracked, or otherwise disfigured. For security alone, it may be sufficient to depend on regular backups of the underlying database, files and scripts: but you still ought to reassure yourself exactly what backup regime your host is operating, and whether they can restore them in a timely fashion. (Notwithstanding the versioning features of most wikis, rolling back a raft of abusive changes across a whole site is not usually a quick, easy or particularly enjoyable task.)</p>
<p>All this suggests some basic questions that one needs to ask when setting up a wiki for a project:</p>
<ul>
<li>How long do we need it for?</li>
<li>Will it need preserving at intervals, or at a completion date?</li>
<li>Is it more important to preserve its text content, or its complete look?</li>
<li>Should we back it up? If so, what should we back up?</li>
<li>Does the wiki provide backup features? If so, what does it back up (e.g. attachments, discussions, revisions)?</li>
<li>Once “backed up”, how easily can it be restored?</li>
<li>Will the links still work in our preservation or backup copy?</li>
<li>If the backup includes raw wiki markup, do you have the capabilities to re-render this as HTML?</li>
</ul>
<p>And questions like these are no less relevant when considering your uses of blogs and other social software: I hope we’ll be able to look at them more closely in another post.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/07/13/how-sticky-is-your-wiki/' addthis:title='How sticky is your wiki? '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/07/13/how-sticky-is-your-wiki/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DCC discussions on image formats</title>
		<link>http://dablog.ulcc.ac.uk/2008/07/04/dcc-discussions-on-image-formats/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/07/04/dcc-discussions-on-image-formats/#comments</comments>
		<pubDate>Thu, 03 Jul 2008 23:14:12 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digitisation]]></category>
		<category><![CDATA[DART]]></category>
		<category><![CDATA[DCC]]></category>
		<category><![CDATA[file formats]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[JPEG]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[RAW]]></category>
		<category><![CDATA[Technical]]></category>
		<category><![CDATA[TIFF]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/07/04/dcc-discussions-on-image-formats/</guid>
		<description><![CDATA[Rich pickings in a couple of fascinating posts on the DCC Digital Curation Blog, in which Chris Rusbridge summarises recent discussions on the DCC-Associates email list about appropriate photo image file formats for preservation, specifically TIFF, RAW and JPEG 2000. A sibling post also discusses the merits of RAW versus TIFF from the perspective of [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/07/04/dcc-discussions-on-image-formats/' addthis:title='DCC discussions on image formats '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>Rich pickings in a couple of fascinating posts on the <a href="http://digitalcuration.blogspot.com/2008/07/responses-to-raw-versus-tiff.html" title="Responses to RAW versus TIFF on DCC Blog" target="_blank">DCC Digital Curation Blog</a>, in which Chris Rusbridge summarises recent discussions <span class="postbody">on the DCC-Associates email list </span>about appropriate photo image file formats for preservation, specifically TIFF, RAW and JPEG 2000. A <a href="http://digitalcuration.blogspot.com/2008/07/responses-to-raw-versus-tiff-image.html" title="Responses to RAW versus TIFF image on DCC Blog" target="_blank">sibling post</a> also discusses the merits of RAW versus TIFF from the perspective of different users and uses.</p>
<p>The proprietary nature of RAW formats (an emerging OpenRAW standard notwithstanding) and the relative newness on the block of JPEG 2000 would both tend to bolster the longstanding preference for TIFF, but as Chris&#8217;s posts make clear, each preservation project should nevertheless weigh the options based on its own requirements and resources.</p>
<p>If in doubt, the &#8220;keep everything&#8221; approach is attractive, as ever, but &#8211; in spite of the old mantras about ever-cheaper filestore &#8211; the implications for storage space and management are potentially very costly once one enters the world of Terabytes and Petabytes. <span id="more-134"></span>In one example, Sean Martin of BL concludes that</p>
<blockquote><p>probably only a small amount of additional value is created for the additional expense approaching £200K. This leads to the question &#8220;if we had £200K on what would we spend it?&#8221; and probably the answer is &#8220;not in this way&#8221;.</p></blockquote>
<p>The posts contain many interesting examples of costings, based on the experience of BL, SDSC and others. I won&#8217;t even attempt to summarise (Chris&#8217;s summary of) them here, but I hope that this post will still be available for me to consult next time I have to dabble in the murky science of DP costings.</p>
<p>In some instances, the case for preserving the RAW image may nevertheless be compelling. One can imagine that the risk of missing a new planet or virus (or identifying a non-existent one), or other potential infelicities in scientific and medical imaging, is not worth contemplating. By contrast, it&#8217;s hard to see what value might be added to a collection like the <a href="http://dablog.ulcc.ac.uk/2007/11/02/7/" title="Launch of Linnean Online">Linnean Society&#8217;s</a> by keeping raw camera data alongside the TIFFs, and more than doubling the amount of storage capacity required.</p>
<p>It&#8217;s for making decisions like this that the intellectual exercise of identifying <a href="http://dablog.ulcc.ac.uk/2008/04/08/significant-properties/">significant properties</a> <em>and</em> the needs of all stakeholders &#8211; creators, curators and users &#8211;  seems particularly essential.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/07/04/dcc-discussions-on-image-formats/' addthis:title='DCC discussions on image formats '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/07/04/dcc-discussions-on-image-formats/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Web Continuity Project at The National Archives</title>
		<link>http://dablog.ulcc.ac.uk/2008/06/11/web-continuity-project-at-the-national-archives/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/06/11/web-continuity-project-at-the-national-archives/#comments</comments>
		<pubDate>Tue, 10 Jun 2008 23:00:15 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[continuity]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[TNA]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/06/11/web-continuity-project-at-the-national-archives/</guid>
		<description><![CDATA[Ed and I were pleased to come across an interesting document, recently received from The National Archives, describing their Web Continuity Project. This is the latest of the many digital preservation initiatives undertaken by TNA/PRO, that began with EROS and NDAD in the mid 1990s, leading to the UK Government Web Archive and other recent [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/06/11/web-continuity-project-at-the-national-archives/' addthis:title='Web Continuity Project at The National Archives '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<em>From the <a href="http://jiscpowr.jiscinvolve.org/">JISC-PoWR Project blog</a>.</em>
<hr />
Ed and I were pleased to come across an interesting document, recently received from The National Archives, describing their <a href="http://nationalarchives.gov.uk/webcontinuity/">Web Continuity Project</a>. This is the latest of the many digital preservation initiatives undertaken by TNA/PRO, that began with <a href="http://web.archive.org/web/20010208135401/www.pro.gov.uk/recordsmanagement/eros/default.htm">EROS</a> and <a href="http://web.archive.org/web/20010305231332/www.pro.gov.uk/recordsmanagement/uknda/default.htm">NDAD</a> in the mid 1990s, leading to the <a href="http://www.nationalarchives.gov.uk/preservation/archivedwebsites.htm">UK Government Web Archive</a> and other recent <a href="http://www.nationalarchives.gov.uk/preservation/digital.htm">digital preservation initiatives</a> (many in conjunction with BL and the JISC).

The Web Continuity Project arises from a request by Jack Straw, as leader of the House of Commons in 2007, that government departments ensure continued access to online documents. Further research revealed that:
<ul>
	<li>Government departments are increasingly citing URLs in answer to Parliamentary Questions</li>
	<li>60% of links in Hansard to UK government websites for the period 1997 to 2006 are now broken</li>
	<li>Departments vary considerably: for one, every link works; for another every link is broken. (TNA’s own website is not immune!)</li>
</ul>
<span id="more-121"></span>Among the objectives if the Web Continuity Project are to ensure that:
<ul>
	<li>All links work in perpetuity</li>
	<li>No cited information is lost through deletion</li>
	<li>Information is preserved long-term, even if the Web is no longer the dominant publishing medium it is today</li>
</ul>
Its outputs will include:
<ul>
	<li>Guidance on creation and use of XML Sitemaps</li>
	<li>A website component (for MS IIS and Apache) that will redirect users to the Web archive if a link is no longer active but is in the archive</li>
	<li>Guidance to government webmasters on best practice for website design and maintenance for archiving purposes</li>
</ul>
We can see that the approach and aims of the Web Continuity Project are particularly pertinent to our deliberations. Its aims are to address both “persistence” and “preservation” in a way that is seamless and robust: in many ways, “continuity” seems a very apposite concept with which to address the particular nature of web resources (though I doubt we’d want to swap our nice acronym, PoWR, for CoWR!).

Many of the issues facing departmental web and information managers are likely to have analogues in HE and FE institutions, so the opportunity to share in research and expertise emanating from the National Archives and the British Library, among others, is an exciting one. We will be following the WCP closely over the coming months.

<hr />Post first published on <a href="http://jiscpowr.jiscinvolve.org/">JISC-PoWR Project blog</a>.
<hr />]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/06/11/web-continuity-project-at-the-national-archives/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digital preservation in a nutshell, part II</title>
		<link>http://dablog.ulcc.ac.uk/2008/06/10/digital-preservation-in-a-nutshell-part-ii/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/06/10/digital-preservation-in-a-nutshell-part-ii/#comments</comments>
		<pubDate>Tue, 10 Jun 2008 13:04:32 +0000</pubDate>
		<dc:creator>Ed Pinsent</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[JISC-PoWR]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[web 2.0]]></category>
		<category><![CDATA[web preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/06/10/digital-preservation-in-a-nutshell-part-ii/</guid>
		<description><![CDATA[As Richard noted in Part I, digital preservation is a &#8220;series of managed activities necessary to ensure continued access to digital materials for as long as necessary.&#8221; But what sort of digital materials might be in scope for the PoWR project?
We think it extremely likely that institutional web resources are going to include digital materials [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/06/10/digital-preservation-in-a-nutshell-part-ii/' addthis:title='Digital preservation in a nutshell, part II '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p><em>Originally published on the <a href="http://jiscpowr.jiscinvolve.org/2008/06/10/digital-preservation-in-a-nutshell-part-ii/">JISC-PoWR blog</a>.</em><br />
<hr /></p>
<p>As Richard noted in <a href="/2008/05/23/digital-preservation-in-a-nutshell-part-i/">Part I</a>, digital preservation is a “series of managed activities necessary to ensure continued access to digital materials for as long as necessary.” But what sort of digital materials might be in scope for the PoWR project?</p>
<p>We think it extremely likely that institutional web resources are going to include digital materials such as “records created during the day-to-day business of an organisation” and “born-digital materials created for a specific purpose”.</p>
<p>What we want is to “maintain access to these digital materials beyond the limits of media failure or technological change”. This leads us to consider the longevity of certain file formats, the changes undergone by proprietary software, technological obsolescence, and the migration or emulation strategies we’ll use to overcome these problems.</p>
<p>By <strong>migration</strong> we mean “a means of overcoming technological obsolescence by transferring digital resources from one hardware/software generation to the next.” In contrast, <strong>emulation</strong> is “a means of overcoming technological          obsolescence of hardware and software by developing techniques for imitating          obsolete systems on future generations of computers.”</p>
<p>Note also that when we talk about preserving anything, “for as long as necessary” doesn’t always mean “forever”. For the purposes of the PoWR project, it may be worth us considering <strong>medium-term preservation</strong> for example, which allows “continued access to digital materials beyond changes in technology for a defined period of time, but not indefinitely.”</p>
<p>We also hope to consider the idea of <strong>life-cycle management</strong>. According to DPC, “The major implications for life-cycle management of digital resources is the need actively to manage the resource at each stage of its life-cycle and to recognise the inter-dependencies between each stage and commence preservation activities as early as practicable.”</p>
<p>From these definitions alone, it should be apparent that success in the preservation of web resources will potentially involve the participation and co-operation of a wide range of experts: information managers, asset managers, webmasters, IT specialists, system administrators, records managers, and archivists.</p>
<p>(All the quotations and definitions above are taken from the <a href="http://www.dpconline.org/graphics/intro/definitions.html">DPC’s online handbook</a>.)</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/06/10/digital-preservation-in-a-nutshell-part-ii/' addthis:title='Digital preservation in a nutshell, part II '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/06/10/digital-preservation-in-a-nutshell-part-ii/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digital preservation in a nutshell (Part I)</title>
		<link>http://dablog.ulcc.ac.uk/2008/05/23/digital-preservation-in-a-nutshell-part-i/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/05/23/digital-preservation-in-a-nutshell-part-i/#comments</comments>
		<pubDate>Thu, 22 May 2008 23:00:06 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[JiSC-PoWR]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/05/23/digital-preservation-in-a-nutshell-part-i/</guid>
		<description><![CDATA[One of the goals of PoWR is to make current trends in digital preservation meaningful and relevant to information professionals with the day-to-day responsibility for looking after web resources. Anyone coming for the first time to the field of digital preservation can find it a daunting area, with very distinct terminology and concepts. Some of [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/05/23/digital-preservation-in-a-nutshell-part-i/' addthis:title='Digital preservation in a nutshell (Part I) '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<em>From the <a href="http://jiscpowr.jiscinvolve.org/">JISC-PoWR Project blog</a>.</em>
<hr />
One of the goals of PoWR is to make current trends in digital preservation meaningful and relevant to information professionals with the day-to-day responsibility for looking after web resources. Anyone coming for the first time to the field of digital preservation can find it a daunting area, with very distinct terminology and concepts. Some of these are drawn from time-honored approaches to managing things like government records or institutional archives, while others have been developed exclusively in the digital domain. It is an emerging and evolving field that can take some time to get your head round: so we thought it was a good idea to offer a series of brief primers.

Starting, naturally, with <strong>digital preservation</strong>: this is defined as a "series of managed activities necessary to ensure continued access to digital materials for as long as necessary" (<a href="http://www.dpconline.org/graphics/intro/definitions.html" title="DPC">Digital Preservation Coalition</a>, 2002). <span id="more-118"></span>It's best to consider the scope of digital preservation as much broader than <strong>digital archiving</strong>, though the terms are often used interchangeably. Because, in computing generally, “archiving” is the process of backup and offline storage of data, the term "digital preservation” helps avoid confusion when referring to the broader issues of managing digital materials and information in and about them.

The Digital Preservation Coalition (DPC) is a consortium of many leading institutions working in the field, including The British Library and The National Archives. Its <a href="http://www.dpconline.org/graphics/handbook/">online handbook</a> contains much excellent information (though its online format could be improved), and includes a useful <a href="http://www.dpconline.org/graphics/intro/definitions.html">glossary</a>.

A third term, <strong>digital curation</strong>, has recently gained prominence. This places greater emphasis on the activities required to maintain the integrity of digital collections over time, and keep them usable. It promotes a pro-active approach to managing digital resources and the use of technological solutions, like web services, to address the problems that technology itself has created. It also paves the way for the emergence of “digital curators”, continually monitoring collections and intevening when necessary - a role analogous to their non-digital counterparts. The best source of information about digital curation is the <a href="http://www.dcc.ac.uk/">Digital Curation Centre</a>, based at Edinburgh University.

In the next part we'll look at some of the key concepts in digital preservation, including migration, emulation, and life-cycle models for digital objects. This will help us identify some of the things we should be considering when trying to preserve web resources.
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/05/23/digital-preservation-in-a-nutshell-part-i/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Significant Properties Workshop @ BL</title>
		<link>http://dablog.ulcc.ac.uk/2008/04/08/significant-properties/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/04/08/significant-properties/#comments</comments>
		<pubDate>Tue, 08 Apr 2008 17:57:43 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[British Library]]></category>
		<category><![CDATA[DPC]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[significant properties]]></category>
		<category><![CDATA[sigprops]]></category>
		<category><![CDATA[SPeLOs]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/04/08/significant-properties/</guid>
		<description><![CDATA[I don&#8217;t think I could begin to do justice in a few words to the wide-ranging debate at the JISC/BL/DPC Workshop on Significant Properties at the British Library on Monday: I&#8217;d rather leave it to others to analyse the significant outcomes in more detail, or to further discussions like the one started by Chris Rusbridge [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/04/08/significant-properties/' addthis:title='Significant Properties Workshop @ BL '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p style="text-align: center"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/04/image042.jpg" title="Neil Grindley and Chris Rusbridge kick off the Significant Properties Workshop"><img src="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/04/image042.jpg" alt="Neil Grindley and Chris Rusbridge kick off the Significant Properties Workshop" style="width: 95%" /></a></p>
<p>I don&#8217;t think I could begin to do justice in a few words to the wide-ranging debate at the <a href="http://www.dpconline.org/graphics/events/080407workshop.html" target="_blank">JISC/BL/DPC Workshop on Significant Properties</a> at the British Library on Monday: I&#8217;d rather leave it to <a href="http://digitalcuration.blogspot.com/2008/04/seriiously-seeking-significance.html" title="Maureen Pennock on Sig Props @ DCC Blog" target="_blank">others </a>to analyse the significant outcomes in more detail, or to further discussions like the one started by Chris Rusbridge (our cucumber-cool chairman on the day) on the <a href="http://digitalcuration.blogspot.com/2008/03/significant-properties-workshop.html" title="Chris Rusbridge on Sig Props @ DCC Blog" target="_blank">DCC Blog</a>. Suffice it to say there was a sack of food for thought in all the presentations, and lots of opportunities to wonder &#8220;now why didn&#8217;t <em>I</em> think of that?&#8221;</p>
<p><span id="more-79"></span>I think the most enjoyable presentation was that of Cal Lee from UNC Chapel Hill, who managed to lift us with a few laughs about his work &#8211; with Microsoft Word file format specifications &#8211; just as we were beginning to flag at the end of a long day of considering many things from many angles. The <a href="http://www.jisc.ac.uk/whatwedo/programmes/programme_preservation/project_movingimagesound.aspx" target="_blank">Moving Images</a> study also looks like it may have useful information for our deliberations on standards and formats for <a href="http://primo.sas.ac.uk" target="_blank">PRIMO</a>. If I wasn&#8217;t so taken with the presentation on E-learning objects, that was chiefly because I gave it &#8211; a mind-bending task to give our account of not only Learning Objects but also Significant Properties in 15 mins. But the <a href="http://www.dpconline.org/docs/events/080407sigpropsDavis.pdf" target="_blank">slides</a> look nice on the DPC website. (All credit to Ed for the excellent diagrams.)</p>
<p><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/04/image041.jpg" title="Avant le deluge: an empty BL Conference Centre"><img src="http://dablog.ulcc.ac.uk/wp-content/uploads/2008/04/image041.thumbnail.jpg" alt="Avant le deluge: an empty BL Conference Centre" class="float-right" /></a>Neil Grindley and the rest of the JISC/DPC/BL organisers did well to run it so smoothly and facilitate the discussion, and it was a rare opportunity to catch Andrew Wilson on a flying visit to this hemisphere and hear his first hand account of <a href="http://www.naa.gov.au/records-management/secure-and-store/e-preservation/at-NAA/index.aspx" target="_blank">NAA&#8217;s</a> approaches and models.  I can&#8217;t remember if I&#8217;ve been to the British Library Conference Centre before or not, but it is certainly an impressive space, and with super-slick AV facilities.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/04/08/significant-properties/' addthis:title='Significant Properties Workshop @ BL '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/04/08/significant-properties/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The MS Office 2003 format debacle</title>
		<link>http://dablog.ulcc.ac.uk/2008/01/11/the-ms-office-2003-format-debacle/</link>
		<comments>http://dablog.ulcc.ac.uk/2008/01/11/the-ms-office-2003-format-debacle/#comments</comments>
		<pubDate>Fri, 11 Jan 2008 17:45:13 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[file formats]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[Technical]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/2008/01/11/the-ms-office-2003-format-debacle/</guid>
		<description><![CDATA[The consternation that has been caused by Office 2003 Service Pack 3 has provoked a great deal of discussion amongst the electronic records community about the vulnerability of proprietary file formats. (It&#8217;s provoked a lot of other discussion as well, much of it not complimentary to Microsoft.) If you&#8217;re not familiar with the problem, it [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/01/11/the-ms-office-2003-format-debacle/' addthis:title='The MS Office 2003 format debacle '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>The consternation that has been caused by <a href="http://support.microsoft.com/kb/923618">Office 2003 Service Pack 3</a> has provoked a great deal of discussion amongst the electronic records community about the vulnerability of proprietary file formats. (It&#8217;s provoked a lot of other discussion as well, much of it not complimentary to Microsoft.) If you&#8217;re not familiar with the problem, it boils down to this: the latest major update to Office 2003 silently disables the ability to open many files that Office would previously handle. They include formats such as Corel Draw, but worryingly for many people also include older MS Office formats, such as Excel 97 and Word 97.</p>
<p>I say &#8216;silently&#8217; because the information about the effect that this update will have is extremely well hidden. <span id="more-41"></span> Many individuals, and many large organisations, will do as MS recommends and allow their systems to download and install updates like this automatically. They&#8217;ll be caught unawares. Others may be more like me and read through the release information to see what the update is going to do beforehand. They would be caught unawares. The information linked to at the front of this post tells you that this is a major security update for Office. It says nothing about losing access to old files. It links to another page which goes into <a href="http://support.microsoft.com/kb/923618">greater detail</a> about what the update contains. That doesn&#8217;t mention the issue either, except that under the &#8220;known issues&#8221; section, it says:</p>
<blockquote><p> After you install Office 2003 SP3, you may receive an error message when you try to open or to save a file. For more information about this issue, click the following article number to view the article in the Microsoft Knowledge Base:</p></blockquote>
<p>And even that article doesn&#8217;t really list things in detail, although it does mention that older office formats are amongst those affected. The recommended solutions are not ideal either.</p>
<p>Microsoft has belatedly realised that it was a bad idea and released a patch to circumvent the problem. But you need to explicitly request that and many people won&#8217;t realise they need it for some time.</p>
<p>There&#8217;s been a good deal of <a href="http://listserv.albany.edu:8080/cgi-bin/wa?A1=ind08&amp;L=erecs-l#10"> discussion on the e-recs list </a> about this, including this remark from Marc Fresko:</p>
<blockquote><p> Personally, I think this is one of the more &#8220;visible&#8221; wake-up calls that<br />
should contribute to everyone realising that long-term access to<br />
electronic records, even over less than ten years, cannot be ignored.<br />
Active management is needed.  Anything that helps to get this message<br />
across is welcome, so long as it is (as is the case here) practically<br />
harmless.</p></blockquote>
<p>I agree with Marc that it&#8217;s a useful wake-up call. Consider that this could cause problems even to those organisations that had taken explicit steps to ensure continued access:</p>
<ul>
<li>You checked when moving to Office 2003 that you could still access your older documents using it.</li>
<li>You checked the documentation with each update to ensure that it wasn&#8217;t going to invalidate your earlier checks</li>
</ul>
<p>With those precautions, you would <strong>still</strong> be caught out by a silent update that you <strong>cannot uninstall.</strong> Only if yours is the sort of organisation that explicitly tests all updates in a safe environment, and specifically includes a check for access to legacy material as part of those tests, would you be alerted to the problem in time.</p>
<p>Microsoft aren&#8217;t the only software providers to build in automated update features to their products &#8211; the practice is common with both propretary and open-source products. Incidents like this <strong>will</strong> occur again. It reinforces the belief that one needs to have good control over the application(s) needed to access preserved content as well as the formats used to preserve it, whether one is looking at 5-year retention or 500-year retention.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2008/01/11/the-ms-office-2003-format-debacle/' addthis:title='The MS Office 2003 format debacle '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2008/01/11/the-ms-office-2003-format-debacle/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Digital Preservation Awards 2007</title>
		<link>http://dablog.ulcc.ac.uk/2007/11/27/digital-preservation-awards/</link>
		<comments>http://dablog.ulcc.ac.uk/2007/11/27/digital-preservation-awards/#comments</comments>
		<pubDate>Tue, 27 Nov 2007 14:16:18 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[DPC]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://dash.ulcc.ac.uk/blog/?p=17</guid>
		<description><![CDATA[The third digital preservation award was announced a few weeks ago at the conservation awards ceremony at the British Museum. As in previous years, there was a strong shortlist but TNA were deserving winners. The awards are sponsored by the Digital Preservation Coalition who held an event today to give the shortlisted entries a chance [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2007/11/27/digital-preservation-awards/' addthis:title='Digital Preservation Awards 2007 '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>The third digital preservation award was announced a few weeks ago at the <a href="http://www.conservationawards.org.uk/index.php?option=com_content&amp;task=view&amp;id=31&amp;Itemid=21">conservation awards ceremony</a> at the British Museum. As in previous years, there was a <a href="http://www.conservationawards.org.uk/index.php?option=com_content&amp;task=view&amp;id=27">strong shortlist</a> but TNA were deserving winners. <img src="http://www.dpconline.org/graphics/awards/images/2007awardGroup1.jpg" alt="2007 conservation awards - winners, judges, uncle tom cobley, etc." class="float-right" width="350" /> The awards are sponsored by the Digital Preservation Coalition who held an event today to give the shortlisted entries a chance to promote themselves at more length to DPC members, and for members to ask questions. I chaired the morning session (the afternoon was devoted to some forward planning for the DPC) and it seemed to be an interesting and valuable exercise for all participants. If you weren&#8217;t there, you may still find the <a href="http://www.dpconline.org/graphics/awards/2007ceremony.html">videos explaining the winning entries</a> instructive, entertaining, or both.</p>
<p>I&#8217;ve been a judge for each of the awards and chaired the judging panel on this occasion; it&#8217;s been a fascinating and enjoyable experience and certainly makes one think again about what makes a project valuable to others. But we&#8217;re concerned that the awards might not be having the impact that they could be, <span id="more-17"></span> and that we need to do something to change the type of entry we receive &#8211; as well as thinking about how we publicise the event and our association with it. The DPC has established a working group to look at a number of these issues, and I&#8217;m interested in views from the wider world &#8211; particularly from those outside the DPC itself &#8211; about the awards, their relevance to you and your work, and what might encourage you to submit an entry.</p>
<p>Have you considered submitting an entry ? What held you back ? What do you think of the entries that have been shortlisted in the past ? One contributor has suggested that the final judging process (in which each entrant does a 25 minute presentation, followed by about 25 minutes of questions from the judges) should be held in public, instead of in private as it is at present. What do you think of that idea ?</p>
<p>As well as using the comment facility on this blog to share your thoughts, you&#8217;re welcome to email me (<a href="mailto:K.Ashley@ulcc.ac.uk">K.Ashley@ulcc.ac.uk</a>) if you would rather not share your thoughts with the rest of the world.</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2007/11/27/digital-preservation-awards/' addthis:title='Digital Preservation Awards 2007 '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2007/11/27/digital-preservation-awards/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Datasets @ TNA</title>
		<link>http://dablog.ulcc.ac.uk/2007/10/10/datasets-tna/</link>
		<comments>http://dablog.ulcc.ac.uk/2007/10/10/datasets-tna/#comments</comments>
		<pubDate>Wed, 10 Oct 2007 15:58:55 +0000</pubDate>
		<dc:creator>Kevin Ashley</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[archives]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[National Archives]]></category>
		<category><![CDATA[NDAD]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://dash.ulcc.ac.uk/blog/?p=11</guid>
		<description><![CDATA[We were pleased to receive confirmation yesterday from The National Archives that ULCC Digital Archives has been selected to provide the Datasets@TNA service from February 2008. This is the next generation of the NDAD service which we began developing in 1997, and will run for three to five years, with the service moving in house [...]<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2007/10/10/datasets-tna/' addthis:title='Datasets @ TNA '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></description>
			<content:encoded><![CDATA[<p>We were pleased to receive confirmation yesterday from <a href="http://www.nationalarchives.gov.uk/">The National Archives</a> that ULCC Digital Archives has been selected to provide the Datasets@TNA service from February 2008. This is the next generation of the <a href="http://ndad.ulcc.ac.uk/">NDAD</a> service which we began developing in 1997, and will run for three to five years, with the service moving in house to TNA at the end of  the contract as their Seamless Flow project for managing all digital government records comes online. It&#8217;s going to mean a different way of working to some extent &#8211; moving from a service developed in the spirit of the Private Finance Initiative to one in which our role is to transfer expertise and knowledge to TNA (as well as continuing to provide a service to government departments and the public.)</p>
<div class="addthis_toolbox addthis_default_style " addthis:url='http://dablog.ulcc.ac.uk/2007/10/10/datasets-tna/' addthis:title='Datasets @ TNA '  ><a class="addthis_button_facebook_like" fb:like:layout="button_count"></a><a class="addthis_button_tweet"></a><a class="addthis_button_google_plusone" g:plusone:size="medium"></a><a class="addthis_counter addthis_pill_style"></a></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2007/10/10/datasets-tna/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

