<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ulcc da blog</title>
	<atom:link href="http://dablog.ulcc.ac.uk/feed/" rel="self" type="application/rss+xml" />
	<link>http://dablog.ulcc.ac.uk</link>
	<description>ulcc digital archives blog</description>
	<lastBuildDate>Fri, 03 Feb 2012 16:24:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
		<item>
		<title>The House of Books: Manuscripts and religious identity in Iraq</title>
		<link>http://dablog.ulcc.ac.uk/2011/11/21/house-of-books-manuscripts-and-religious-identity-in-iraq/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/11/21/house-of-books-manuscripts-and-religious-identity-in-iraq/#comments</comments>
		<pubDate>Mon, 21 Nov 2011 21:12:18 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Ankawa]]></category>
		<category><![CDATA[Aramaic]]></category>
		<category><![CDATA[Assyrians]]></category>
		<category><![CDATA[Chaldeans]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[digitisation]]></category>
		<category><![CDATA[Dominicans]]></category>
		<category><![CDATA[Erbil]]></category>
		<category><![CDATA[House of Books]]></category>
		<category><![CDATA[Iraq]]></category>
		<category><![CDATA[Iraq National Library and Archives]]></category>
		<category><![CDATA[Iraqi christians]]></category>
		<category><![CDATA[Kurdistan]]></category>
		<category><![CDATA[Mandaeans]]></category>
		<category><![CDATA[Middle Eastern Christians]]></category>
		<category><![CDATA[Mosul]]></category>
		<category><![CDATA[Shabaks]]></category>
		<category><![CDATA[Yezidi]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=2492</guid>
		<description><![CDATA[Father Najeeb Michaeel is an Iraqi Christian priest who speaks Arabic, English, French, Aramaic and Syriac, not to mention being able to read Latin and Greek. In the garden of Zaytun library, Erbil I hear this gentle man tell me how his community of friars used to live in Mosul, a traditional centre for Christianity [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_2554" class="wp-caption alignleft" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/DSC02779.jpg"><img class="size-medium wp-image-2554" title="DSC02779" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/DSC02779-300x225.jpg" alt="" width="300" height="225" /></a><p class="wp-caption-text">Father Najeeb Michaeel examines a manuscript</p></div>
<p>Father Najeeb Michaeel is an Iraqi Christian priest who speaks Arabic, English, French, Aramaic and Syriac, not to mention being able to read Latin and Greek. In the garden of Zaytun library, Erbil I hear this gentle man tell me how his community of friars used to live in <a href="http://en.wikipedia.org/wiki/Mosul">Mosul</a>, a traditional centre for Christianity in Iraq, having the highest proportion of Assyrian Christians of all the Iraqi cities. Father Najeeb&#8217;s community has  had to leave Mosul due to <a href="http://news.bbc.co.uk/1/hi/world/middle_east/7671609.stm">persecution</a>.  Later on during The <a href="http://dablog.ulcc.ac.uk/2011/10/11/house-of-books-erbil-iraq/">House of Books workshop</a> he gives us a presentation of the magnificent early Christian manuscripts they are digitising.  Over coffee he gives us a moving rendition of the &#8216;Our Father&#8217; sung in Aramaic.  I wasn&#8217;t expecting to feel so moved by a  religion I have become increasingly frustrated by, and in Iraq.</p>
<div id="attachment_2501" class="wp-caption alignright" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/MDM-N121-90.jpg"><img class="size-medium wp-image-2501" title="MDM N121-90" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/MDM-N121-90-300x225.jpg" alt="" width="300" height="225" /></a><p class="wp-caption-text">Early Christian manuscript, Centre Numerique des Manuscrits Orientaux, Mosul, Iraq.</p></div>
<p>Iraq has often compared to a mosaic in terms of the diversity of its religious diversity.  Iraq is a<a href="http://en.wikipedia.org/wiki/Shia_Islam"> Shia</a> majority country and contains the sacred Shia cities of Najaf and Karbala. Most sources estimate that around 65% of Iraqis follow Shia Islam, and around 35% follow <a href="http://en.wikipedia.org/wiki/Sunni_Islam">Sunni</a> Islam. What is not so well known is that Christians have inhabited what is modern day Iraq for about 2,000 years. The person who is supposed to be respnsible for the transmission of Christianity in Iraq is <a href="http://en.wikipedia.org/wiki/File:Caravaggio_-_The_Incredulity_of_Saint_Thomas.jpg">St Thomas</a> the Apostle. <a title="Assyrian people" href="http://en.wikipedia.org/wiki/Assyrian_people" target="_blank">Assyrians</a> (also called Syriacs and Chaldeans) most of whom are adherents of the <a title="Chaldean Catholic Church" href="http://en.wikipedia.org/wiki/Chaldean_Catholic_Church" target="_blank">Chaldean Catholic Church</a>, <a title="Syriac Orthodox Church" href="http://en.wikipedia.org/wiki/Syriac_Orthodox_Church" target="_blank">Syriac Orthodox Church</a> and the <a title="Assyrian Church of the East" href="http://en.wikipedia.org/wiki/Assyrian_Church_of_the_East" target="_blank">Assyrian Church of the East</a> account for most of Iraq&#8217;s <a title="Iraqi Christians" href="http://en.wikipedia.org/wiki/Iraqi_Christians" target="_blank">Christian</a> population, along with Armenians.  Tariq Aziz was born to an <a title="Assyrian people" href="http://en.wikipedia.org/wiki/Assyrian_people">Assyrian</a> family and is a member of the <a title="Chaldean Catholic" href="http://en.wikipedia.org/wiki/Chaldean_Catholic">Chaldean Catholic</a> church. There are also small populations of <a title="Mandaeanism" href="http://en.wikipedia.org/wiki/Mandaeanism" target="_blank">Mandaeans</a>, <a title="Shabaks" href="http://en.wikipedia.org/wiki/Shabaks" target="_blank">Shabaks</a>, <a title="Yarsan" href="http://en.wikipedia.org/wiki/Yarsan" target="_blank">Yarsan</a> and <a title="Yezidi" href="http://en.wikipedia.org/wiki/Yezidi" target="_blank">Yezidis</a>. The <a title="Iraqi Jewish" href="http://en.wikipedia.org/wiki/Iraqi_Jewish" target="_blank">Iraqi Jewish</a> community, numbering around 150,000 in 1941, almost entirely left the country.<sup> </sup>There are also Gnostics in the form of Mandeans and sub sects thereof, Yazidis who believe in a god but have a blue peacock angel in their pantheon, and of course the Zoroastrians which the ancient Babylonians followed.</p>
<div id="attachment_2502" class="wp-caption alignright" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/منمنمات-مخطوط-حنا-الكاتب-1-001.jpg"><img class="size-medium wp-image-2502 " title="منمنمات مخطوط حنا الكاتب 1-001" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/منمنمات-مخطوط-حنا-الكاتب-1-001-300x200.jpg" alt="" width="300" height="200" /></a><p class="wp-caption-text">Early Christian manuscript, Centre Numerique des Manuscrits Orientaux, Mosul, Iraq.</p></div>
<p style="text-align: left;">Despite this diversity they share some things, one being religious persecution within Iraq.  Estimates for the numbers of Christians suggest a decline from 8–10% in the mid-20th century to 5% at the turn of the century, to 3% in 2008.  About 600,000 Iraqi Christians have fled to Syria, Jordan or other countries or relocated to <a title="Iraqi Kurdistan" href="http://en.wikipedia.org/wiki/Iraqi_Kurdistan" target="_blank">Iraqi Kurdistan</a>.which is also the traditional homeland of the Assyrian people. Those who remain are very aware of their minority status and the threat to their lives. Another shared thing they have in common is a respect for their heritage, both in terms of artefacts and records.  In the aftermath of the war in Iraq where countless manuscripts where destroyed, these communities are well aware of the importance of their documentary heritage. Each group have their own manuscript collection. In addition, an attempt by Saddam Hussein to centralise private collections of archives has also made them wary of any notion of centralisation/government control. This has resulted in many collections being hidden away and lost.</p>
<p><strong> </strong>It was in this context that I met <a href="http://www.secours-catholique.org/actualite/les-chretiens-d-irak-face-a-un,7537.html">Father Najeeb </a>in Erbil. He was there to speak about his work at the Centre Numérique des Manuscrits Orientaux (CNMO) Mosul <span style="font-family: Calibri; font-size: small;"> </span>and their work involving the digitisation of their  Iraqi Christian manuscripts. This is a small scale project conducted by the Dominican community in Kurdistan. Their move to Iraqi Kurdistan was due to their persecution in Mosul and the government in Kurdistan ensures as much they can the community&#8217;s safety.  Father Najeeb and his community are being helped by Father Columba Stewart, a Benedictine monk in St John&#8217;s monastery in Minnesota. Based at the the <a href="http://www.hmml.org/">Hill Museum and Manuscript Library</a>, the library began in the cold war, hoping to retain a record of Europe’s heritage in case the Soviets came.  Father Stewart&#8217;s  goal since 2003 has been to  digitise as many Eastern Christian manuscripts in the Middle East as possible, because  these manuscripts are endangered from a variety of causes.<strong> </strong>The main danger is the ethnic genocide which has afflicted Iraq but also neglect.</p>
<div id="attachment_2558" class="wp-caption aligncenter" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/بدون-عنوان-6.jpg"><img class="size-medium wp-image-2558" title="بدون عنوان-6" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/بدون-عنوان-6-300x300.jpg" alt="" width="300" height="300" /></a><p class="wp-caption-text">CD of a collection for donor</p></div>
<p>Father Najeeb&#8217;s aim is clear, to preserve and generate awareness and interest in these ancient Iraqi Christian manuscripts and protect their heritage from disaster and cultural genocide. The Centre actively collects and digitises collections from private donors, who in turn get a copy of their manuscripts on CD (see above). I met other members of the Christian community in Iraq who had found a safe haven in <a href="http://en.wikipedia.org/wiki/Ankawa">Ankawa</a> a district of Erbil which we visited. They all speak several languages including Arabic and Syriac, an ancient language closely related to Aramaic.  While Erbil is known as a safe haven withing Iraq, driving around in Father Najeeb&#8217;s car  one realises that hanging a rosary bead from a rear view mirror is not a casual gesture a brave declaration of faith as is wearing the traditional clothes of a Catholic priest.</p>
<p>The manuscripts which Father Najeeb is digitising are extarordinarily beautiful, and all the more so considering their provenance, age and the number of destructive forces they have endured. They have of some similarity to other e arly Christian manuscripts of the age, as the religion spread as far east as Iraq and also to the wilds of western Ireland where scribes worked on  similar texts.  In some way digitisation and the digital age has brought people together again  to protect these texts and hopefully raise awareness of this and other vulnerable communities.</p>
<div id="attachment_2557" class="wp-caption aligncenter" style="width: 427px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/مخطوط-ابراهيم-ككي-قره-قوش-1.jpg"><img class="size-medium wp-image-2557 " title="مخطوط ابراهيم ككي- قره قوش (1)" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/11/مخطوط-ابراهيم-ككي-قره-قوش-1-300x224.jpg" alt="" width="417" height="271" /></a><p class="wp-caption-text">Young Iraqi Christian at Centre Numérique des Manuscrits Orientaux</p></div>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/11/21/house-of-books-manuscripts-and-religious-identity-in-iraq/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Next Digital Preservation Training Programme in London</title>
		<link>http://dablog.ulcc.ac.uk/2011/10/13/next-digital-preservation-training-programme-in-london/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/10/13/next-digital-preservation-training-programme-in-london/#comments</comments>
		<pubDate>Thu, 13 Oct 2011 14:00:53 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Training]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=2243</guid>
		<description><![CDATA[We are very pleased to announce that the next Digital Preservation Training Programme will take place on the 14th -16th November 2011, in SOAS, London. The Digital Preservation Coalition is providing three scholarships to attend and applications are invited from DPC members. Full details are available on the DPTP website.]]></description>
			<content:encoded><![CDATA[<p style="text-align: center;"><a href="http://www.dptp.org/wp-content/uploads/2011/10/visuals.jpg"><img title="visuals" src="http://www.dptp.org/wp-content/uploads/2011/10/visuals-212x300.jpg" alt="" width="212" height="300" /></a></p>
<p>We are very pleased to announce that the next Digital Preservation Training Programme will take place on the <strong> 14th -16th November 2011,</strong> in <a href="http://www.soas.ac.uk/about/">SOAS</a>, London.  The Digital Preservation Coalition is providing three <a href="http://www.dpconline.org/training/leadership-programme">scholarships</a> to  attend and applications are invited from DPC members.</p>
<p>Full details are available on the <a  href="http://www.dptp.org/dptp-london-soas-november-2011/">DPTP website</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/10/13/next-digital-preservation-training-programme-in-london/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The House of Books: Erbil, Iraq</title>
		<link>http://dablog.ulcc.ac.uk/2011/10/11/house-of-books-erbil-iraq/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/10/11/house-of-books-erbil-iraq/#comments</comments>
		<pubDate>Tue, 11 Oct 2011 16:58:20 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[American University of Beirut]]></category>
		<category><![CDATA[Arabic]]></category>
		<category><![CDATA[Arabic authority files]]></category>
		<category><![CDATA[Arabic ontologies]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[digitisation]]></category>
		<category><![CDATA[Erbil]]></category>
		<category><![CDATA[EU]]></category>
		<category><![CDATA[House of Books]]></category>
		<category><![CDATA[Iraq]]></category>
		<category><![CDATA[Iraq National Library and Archives]]></category>
		<category><![CDATA[Kurdish]]></category>
		<category><![CDATA[Kurdistan]]></category>
		<category><![CDATA[Lebanon]]></category>
		<category><![CDATA[National Library of Jordan]]></category>
		<category><![CDATA[Ontologies]]></category>
		<category><![CDATA[Syriac]]></category>
		<category><![CDATA[UNESCO]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1887</guid>
		<description><![CDATA[&#8220;What you destroy, we will rebuild, only better&#8221; &#8211; Slogan of Kurdish Peshmerga. The garden I am standing in is so beautiful that I find it difficult to imagine that it was a former detention centre  operated by Saddam Hussain&#8217;s Ba’ath party, a place  of imprisonment and torture.  It is now a garden full of  [...]]]></description>
			<content:encoded><![CDATA[<p><img src="file:///C:/DOCUME%7E1/ULCC%7E1.STA/LOCALS%7E1/Temp/moz-screenshot.png" alt="" /></p>
<p><img src="file:///C:/DOCUME%7E1/ULCC%7E1.STA/LOCALS%7E1/Temp/moz-screenshot-1.png" alt="" /></p>
<p style="text-align: center;"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/152.jpg"></a></p>
<div class="mceTemp mceIEcenter">
<div id="attachment_1968" class="wp-caption aligncenter" style="width: 610px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/Erbil-at-night.jpg"><img class="size-full wp-image-1968" title="Erbil at night" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/Erbil-at-night.jpg" alt="" width="600" height="400" /></a><p class="wp-caption-text">Flying into Erbil at night</p></div>
</div>
<p style="text-align: center;">&#8220;What you destroy, we will rebuild, only better&#8221; &#8211; Slogan of Kurdish Peshmerga.</p>
<p style="text-align: left;">The garden I am standing in is so beautiful that I find it difficult  to imagine that it was a former detention  centre  operated by <a href="http://en.wikipedia.org/wiki/Saddam_Hussain">Saddam Hussain&#8217;s</a> <a href="http://news.bbc.co.uk/1/hi/world/middle_east/2886733.stm">Ba’ath party</a>, a place  of imprisonment and torture.  It is now a  garden full of  flowers  and trees and in its centre rises the impressive <a href="http://www.kurdishglobe.net/displayArticle.jsp?id=E257CA0FD3AE8E22D88534AA204F3C1F"> Zaytun Library</a> of Erbil.  This is no accident, the Kurdish Peshmerga vowed that all these  sites would be rebuilt this way once Saddam&#8217;s regime ended and the people  would reclaim such poisoned land for purposes such as libraries and gardens. Erbil or or Hawler as it is called by locals like much of Iraq has seen a lot of history pass its way, Alexander the Great sorted out the Persian King Darius near here and the citadel of Erbil is the oldest inhabited city in the world and a soon to be UNESCO heritage site.</p>
<div id="attachment_1952" class="wp-caption aligncenter" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/erbilcitadel.jpg"><img class="size-medium wp-image-1952" title="erbilcitadel" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/erbilcitadel-300x210.jpg" alt="" width="300" height="210" /></a><p class="wp-caption-text">Erbil citadel</p></div>
<p style="text-align: center;">&nbsp;</p>
<div class="mceTemp" style="text-align: left;">
<dl id="attachment_1940" class="wp-caption alignright" style="width: 190px;">
<dt class="wp-caption-dt"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/IMGP09201.jpg"><img class="size-medium wp-image-1940 " title="IMGP0920" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/IMGP09201-225x300.jpg" alt="" width="180" height="240" /></a></dt>
<dd class="wp-caption-dd">Flag of Kurdistan</dd>
</dl>
</div>
<p>But let&#8217;s take a step back. What is a London based <a href="http://www.peoplesrepublicofcork.com/">Corkonian</a> doing in the middle of former detention centre/ garden in Iraqi <a href="http://en.wikipedia.org/wiki/Iraqi_Kurdistan">Kurdistan</a>? This  region in the north is the ancestral homelands of the  Kurds &#8211; the oft persecuted minority in Iraq.  The Kurds  constitute the largest minority without a homeland. I was at the library as part of the third House of Books workshop funded by the EU and <a href="http://www.unesco.org/new/en/iraq-office/">UNESCO</a> and run by a Humanitarian NGO called <a href="http://www.unponteper.it/english/">Un Ponte Per</a>&#8230;. You can read more about their involvement <a href="http://dablog.ulcc.ac.uk/2011/05/03/the-house-of-booksdar-el-kataub-part-1/">here.</a> It is the last in a series of workshops which has been looking at digitisation of texts and their preservation and its main partner is the<a href="http://www.iraqnla.org"> Iraq National Library and Archives (INLA</a>). Many institutes from Iraq joined us including the <a href="http://www.theiraqmuseum.com/">National Museum of Iraq</a>, <a href="http://www.dominicains.fr/menu/nav_magazine/Actualite/Lu-vu-ou-entendu/Les-chretiens-d-Irak-victimes-d-un-genocide">Centre Numérique des Manuscrits Orientaux </a>and other projects. From the Middle East the  <a href="http://www.nl.gov.jo/EN/Pages/default.aspx">National Library of Jordan </a>and the  <a href="http://www.aub.edu.lb/main/Pages/index.aspx">American University of Beirut</a> also took part. My story with the INLA goes back to 2004 when I managed after some effort to persuade Dr Saad Eskander to write his  <a href="http://www.webarchive.org.uk/wayback/archive/20100427123118/http://www.bl.uk/iraqdiary.html">diary</a> about his day to day life reconstructing the destroyed library in Baghdad.</p>
<p><strong>Iraq National Library and Archives</strong></p>
<p>The INLA was destroyed during and post war in 2003.  Of its 417,000 books, 2,618 periodicals dating from the late Ottoman era to  modern times, and a collection of 4,412 rare books and manuscripts,  an estimated 60 percent of its total archival materials, 25 percent of  its books, newspapers, rare books, and most of its historical  photographs and maps were destroyed in various ways. This was not just a loss for Iraq, it was a catastrophe for the world on many levels.</p>
<p><span id="more-1887"></span>In 2011, <a href="http://howtomakeadifference.net/2008/12/19/saad-eskander/">Dr Eskander</a> has built up a library few could ever have imagined possible. The INLA now leads the way in much best practise in librarianship for both traditional and digital material in Iraq.  There are also many digitisation projects being hatched in Iraq and around the Middle East, big and small and the drive to join up previously physically separated collections of journals, manuscripts, photographs is strong. The potential of the digital has long been recognised as a powerful means of disseminating information in the Middle East. The workshop  is trying to make projects understand that digitisation has a catch, and that is preservation or how to ensure that access is maintained over time. This idea was well introduced in<a href="http://dablog.ulcc.ac.uk/2011/05/03/the-house-of-booksdar-el-kataub-part-1/"> Jordan</a> during sessions such as &#8221;Digitisation is not preservation&#8217; and other catchy titles, and the event this time saw progression and developments since Jordan. In fact it seemed that projects had reassessed their approach to digitisation. Some would admit that where previously they were just scanning, (<a href="http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/">blogs passim</a>) they revisited their projects in light of what they had learned in Jordan. Some  added technical metadata which they had not done at point of digitisation, others looked at having master copies as well as access copies of their digitised content and kept in different locations.  Policies were reconsidered. Storage solutions were considered. All steps in the right direction. The value of having 2 workshops in a year with a lot of the same people in both proved useful  as there seemed to be a lot of consolidation and desire to demonstrate improvements between one meeting and the next.</p>
<p><strong>Workshop overview</strong></p>
<p>The workshop was kicked off by UPP, and some short contributions by the EU&#8217;s representative in Kurdistan, Hala Al Sharifa , followed by the UNESCO programme officer for Kurdistan, Sami Al Khoji who is clearly dedicated to his role and the revitalisation and distribution by digital means of information about Iraq&#8217;s cultural assets. Kanan Mufti who is  <span class="st">director </span><span class="st">general </span><span class="st">for  antiquities in the western Kurdish region</span> in Erbil and who resided in the ancient citadel reflected on the importance of documentary heritage for Kurdistan and declared the protection of documents as a priority for them.  Presentations the INLA showed us they have been digitising and accessioning digital content since 2008  and we heard about their  plan to develop an Iraq digital library making its  materials available to all Iraqis online.  The material will cover all  aspects of Iraqi life and society and all forms of document. The INLA  sees as vital its contribution to intellectual and scientific research  in Iraq and also endeavours to support programmes which will end  illiteracy in Iraq. The INLA has embarked on training programmes in many  aspects of digital library management. They have even managed to send 2  people to attend the <a href="http://dptp.org">Digital Preservation Training Programme</a>, thanks to the British Council and BISI.</p>
<p>The <a href="http://www.nl.gov.jo/EN/Pages/default.aspx">National Library of Jordan</a> spoke about the need for standardisation in the region in relation to cataloguing and indexing. The  <a href="http://www.aub.edu.lb/main/Pages/index.aspx">American University of Beirut</a> &#8216;s presentation lead on nicely as Basma Chebani reflected  on Arabic ontologies and the need for authority files in the Arabic speaking world. My 2nd session on metadata fitted in well here and we did a nice hands on exercise, working with the group of 30 through Arabic and Kurdish, it seemed to hit  the mark and the right level. The translator also did a great job  helping me, he is now a metadata expert!</p>
<p>Father Najeeb of the Dominican Order in Iraq who spoke movingly about their ancient texts and manuscripts and their ongoing digitisation. A small project with great ambitions from a community constantly under threat. I plan to write more about this in another blog.</p>
<div id="attachment_1957" class="wp-caption aligncenter" style="width: 501px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/152.jpg"><img class="size-large wp-image-1957" title="15" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/152-1024x768.jpg" alt="" width="491" height="368" /></a><p class="wp-caption-text">Early Christian manuscript, Centre Numerique des Manuscrits Orientaux, Mosul, Iraq.</p></div>
<p>&nbsp;</p>
<p>Many copyright issues arose, in particular that of forgeries in the traditional manuscript environment. It seems that a lot of illegal copying takes place and it is difficult to contain. Issues such as translation were interesting. During my session on digitisation and preservation the Iraqi born but Aberystwyth-reared translator ran out of his booth proclaiming, &#8216;What is a plug in?!&#8217; To which everyone loudly  had an opinion in return.</p>
<div id="attachment_1954" class="wp-caption alignleft" style="width: 280px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/IMGP0979.jpg"><img class="size-medium wp-image-1954 " title="IMGP0979" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/IMGP0979-300x225.jpg" alt="" width="270" height="203" /></a><p class="wp-caption-text">Translator in action</p></div>
<div id="attachment_1941" class="wp-caption alignright" style="width: 235px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/IMGP09781.jpg"><img class="size-medium wp-image-1941 " title="IMGP0978" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/IMGP09781-225x300.jpg" alt="" width="225" height="300" /></a><p class="wp-caption-text">The translator explains metadata</p></div>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>I led the concluding session, working with the group about recommendations for next steps arising from the workshop. This proved interesting, considering I was again working with a translator with a group who spoke Kurdish, Arabic and<a href="http://en.wikipedia.org/wiki/Syriac_language"> Syriac</a>, not to mention an Arabic English keyboard! Getting individuals to think by themselves and for themselves about what they would accomplish on their return to work is good in a big group. They then worked as teams of 4 to consider what the next steps would be in Iraq for libraries and archives.  The key issues were networking, the establishment of a national network or syndicate of librarians and archivists and information specialists involved in libraries and archives is deemed of great importance. The group also want to consolidate the information from the workshops and ensure that people are not reinventing the wheel in terms of developing best practise. The establishment of education and training programmes in all aspects of librarianship and archives is also vital.</p>
<p><strong>Metadata matters but other things do too.</strong></p>
<p>What was also key and hidden beneath the discussion of texts and metadata and the like was that this was a   moment or space away from the day to day. These people work in circumstances which we cannot begin to comprehend. Just to come to the workshop involved endless checkpoints and danger. Life is unsafe and violent. Civil society as we know it here in the UK is almost non existent in most of Iraq. Electricity cuts are regular, resulting in me being stuck in a lift fo 5 minutes. This is not the same for Erbil on the whole but most colleagues came from Baghdad, Mosul and other regions which are the news for tragic reasons.</p>
<p>The value of this little group of archivists and librarians from different ethnic and religious groups is more than just about metadata and file formats (as important as they are) but about bringing disparate groups of people together with a view to the flowering of a new nation where religious and ethnic difference no longer matter, where censorship doesn;t exist, where ideas flow freely once again. This is the vision of the INLA director Dr Saad Eskander.  It is not an easy vision in a divided society where sectarianism is rife. However it is not so unusual to consider the power of libraries as a social phenomenom and yet we seem to treat them purely as an informational phenomenon. The House of Books demonstrates that it works on at least 2 levels</p>
<p>Iraq as everyone knows has  a violent history of occupation and war, however during periods of  serenity, the emergence of civilisations who have made numerous extraordinary  contributions to the history of civilisation, these include  writing,  and the concept of zero or  sifr to name but a few.  Original texts survive from the era of Babylonian  mathematics. On day 1 of archives school baby archivists learn that the Babylonians  wrote on tablets of unbaked clay, using cuneiform writing. The symbols were pressed into soft clay  tablets with the slanted edge of a stylus and so had a wedge-shaped  appearance (and hence the name cuneiform). Experts studying these  learned that the Babylonians had developed the concept of sifr or zero.<a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/220px-EgyptphoneKeypad.jpg"><br />
</a></p>
<p style="text-align: center;">&nbsp;</p>
<div id="attachment_1971" class="wp-caption aligncenter" style="width: 237px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/babyloniannumbers.jpg"><img class="size-full wp-image-1971 " title="babyloniannumbers" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/10/babyloniannumbers.jpg" alt="" width="227" height="222" /></a><p class="wp-caption-text">Cunieform from Babylonian times: Top: 64 (1 sixty + 4 ones) bottom: 3604 (1 sixty2 + 0 sixty + 4 ones)</p></div>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>Sifr is also used in Arabic to denote a clean slate, a blank page. In Iraq hard work has begun of the rebuilding from scratch  of a rich cultural heritage of Iraq for the future.  I am glad that the preservation of digital heritage of Iraq is a part of this.  More later!</p>
<div id="_mcePaste" class="mcePaste" style="position: absolute; left: -10000px; top: 196px; width: 1px; height: 1px; overflow: hidden;"><!--[if gte mso 9]><xml> <o:DocumentProperties> <o:Author>it</o:Author> <o:Version>11.9999</o:Version> </o:DocumentProperties> </xml><![endif]--><!--[if gte mso 9]><xml> <w:WordDocument> <w:View>Normal</w:View> <w:Zoom>0</w:Zoom> <w:PunctuationKerning /> <w:ValidateAgainstSchemas /> <w:SaveIfXMLInvalid>false</w:SaveIfXMLInvalid> <w:IgnoreMixedContent>false</w:IgnoreMixedContent> <w:AlwaysShowPlaceholderText>false</w:AlwaysShowPlaceholderText> <w:Compatibility> <w:BreakWrappedTables /> <w:SnapToGridInCell /> <w:WrapTextWithPunct /> <w:UseAsianBreakRules /> <w:DontGrowAutofit /> </w:Compatibility> <w:BrowserLevel>MicrosoftInternetExplorer4</w:BrowserLevel> </w:WordDocument> </xml><![endif]--><!--[if gte mso 9]><xml> <w:LatentStyles DefLockedState="false" LatentStyleCount="156"> </w:LatentStyles> </xml><![endif]--><!--[if gte mso 10]> <mce:style><!   /* Style Definitions */  table.MsoNormalTable 	{mso-style-name:"Table Normal"; 	mso-tstyle-rowband-size:0; 	mso-tstyle-colband-size:0; 	mso-style-noshow:yes; 	mso-style-parent:""; 	mso-padding-alt:0cm 5.4pt 0cm 5.4pt; 	mso-para-margin:0cm; 	mso-para-margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:10.0pt; 	font-family:"Times New Roman"; 	mso-ansi-language:#0400; 	mso-fareast-language:#0400; 	mso-bidi-language:#0400;} table.MsoTableGrid 	{mso-style-name:"Table Grid"; 	mso-tstyle-rowband-size:0; 	mso-tstyle-colband-size:0; 	border:solid windowtext 1.0pt; 	mso-border-alt:solid windowtext .5pt; 	mso-padding-alt:0cm 5.4pt 0cm 5.4pt; 	mso-border-insideh:.5pt solid windowtext; 	mso-border-insidev:.5pt solid windowtext; 	mso-para-margin:0cm; 	mso-para-margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:10.0pt; 	font-family:"Times New Roman"; 	mso-ansi-language:#0400; 	mso-fareast-language:#0400; 	mso-bidi-language:#0400;} --> <!--[endif]--><span style="font-size: 10pt; font-family: &quot;Times New Roman&quot;;">Well, is this a format? </span></div>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/10/11/house-of-books-erbil-iraq/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>House of Books 3</title>
		<link>http://dablog.ulcc.ac.uk/2011/09/21/house-of-books-3/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/09/21/house-of-books-3/#comments</comments>
		<pubDate>Wed, 21 Sep 2011 08:58:13 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[House of Books]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1837</guid>
		<description><![CDATA[Map of Erbil &#160; I have been invited to the 3rd House of Books workshop organised by Un Ponte Per&#8230;. It will take place in Erbil, Kurdistan and is under the auspices of the National Library and Archives of Iraq. The workshop is looking at best practise in relation to digitisation and preservation, follwing on [...]]]></description>
			<content:encoded><![CDATA[<div class="mceTemp" style="text-align: left;">
<dl id="attachment_1840" class="wp-caption  alignleft" style="width: 274px;">
<dt class="wp-caption-dt"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/09/erbilmap.png"><img class="size-medium wp-image-1840" title="erbilmap" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/09/erbilmap-264x300.png" alt="" width="264" height="300" /></a></dt>
<dd class="wp-caption-dd">Map of Erbil</dd>
</dl>
</div>
<p>&nbsp;</p>
<p>I have been invited to the <a href="http://dablog.ulcc.ac.uk/2011/05/03/the-house-of-booksdar-el-kataub-part-1/">3rd House of Books</a> workshop organised by<a href="http://www.unponteper.it/english/"> Un Ponte Pe</a>r&#8230;. It will take place in <a href="http://en.wikipedia.org/wiki/Arbil">Erbil</a>, Kurdistan and is under the auspices of the <a href="http://www.iraqnla.org/wpeng/">National Library and Archives of Iraq</a>. The workshop is looking at best practise in relation to digitisation and preservation, follwing on on topics discussed at the last workshop. Our keen and alert followers will notice that this project has already been on DA blog. I will be posting when I get back about this workshop.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/09/21/house-of-books-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Scanning is different from digitisation</title>
		<link>http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/#comments</comments>
		<pubDate>Tue, 26 Jul 2011 12:47:38 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Digitisation]]></category>
		<category><![CDATA[access]]></category>
		<category><![CDATA[accessibility]]></category>
		<category><![CDATA[digitisation]]></category>
		<category><![CDATA[e-books]]></category>
		<category><![CDATA[file formats]]></category>
		<category><![CDATA[preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1630</guid>
		<description><![CDATA[If you haven’t seen it, can I recommend Kristen Snawder&#8217;s recent post on the Library of Congress Digital Preservation blog, Digitization is different than digital preservation. Kristen reiterates familiar points about the long-term commitment necessary for serious digital preservation, contrasted with the quick hit of a scanning project. “In the hurry to meet user expectations, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/kylemcdonald/4287375982"><img class="alignright size-medium wp-image-1631" title="Autocorrelation scan by Kyle McDonald on Flickr" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/07/4287375982_5b5767939d_o-300x300.png" alt="" width="171" height="171" /></a></p>
<p>If you haven’t seen it, can I recommend Kristen Snawder&#8217;s recent post on the Library of Congress Digital Preservation blog, <a class="c3" href="http://blogs.loc.gov/digitalpreservation/2011/07/digitization-is-different-than-digital-preservation-help-prevent-digital-orphans/">Digitization is different than digital preservation</a>.  Kristen reiterates familiar points about the long-term commitment necessary for serious digital preservation, contrasted with the quick hit of a scanning project. “In the hurry to meet user expectations, institutions may scan large quantities of materials without having a solid plan for preserving the digital images into the future.”</p>
<p class="c2">However another recent find on the Web compels me to make an additional point, namely that we might do equally well to differentiate between scanning and digitisation. Anyone can set to work with a scanner and create a bunch of digital images &#8211; but that barely scratches the surface of what I think we should be expecting of a digitisation project in 2011.</p>
<p class="c0">First and foremost, we need metadata: the more the merrier, but something at least. Even if we expect to come back later and polish it up (once the images can be browsed and examined on screen). In the absence of any established metadata profiles for a project, at least try to cover as many <a class="c3" href="http://dublincore.org/documents/dces/">Dublin Core</a> elements as possible &#8211; title, creator, date, subject/keywords&#8230; Images, in particular, may prove tricky or time-consuming to find again, especially once there are thousands of them on a disk. We should probably keep the metadata in a database, and perhaps additionally store metadata with the objects. This can be as XML or plain text files stored alongside the digital images, or embedded in the files we create (many common file formats &#8211; TIFF, JPEG, MPEG, PDF &#8211; support metadata embedding, and there are many free tools available to help).</p>
<p class="c0">There is yet more, though, that we should be doing, particularly when we are scanning text-based objects (articles, books, magazines, reports, etc). Most importantly, we really should try and extract the text from the image if possible. <sup class="c1"><a name="ftnt_ref1" href="#ftnt1">[1]</a></sup></p>
<p class="c2">My recent web find was the teaching blog of Dr Toine Bogers at the <a class="c3" href="http://www.iva.dk/">Royal School of Library and Information Science</a> (RSLIS) in Copenhagen, Denmark. One fascinating post describes a Lab Session exercise, <a class="c3" href="http://itlab.dbit.dk/~toine/?page_id=304">From OCR To NER</a>, a set of comparatively simple command-line processes to get the most out of a scanned-text project.</p>
<p class="c0"><span id="more-1630"></span>Toine’s post walks us through the process. Once the article is scanned, we should apply some OCR. The exercise goes further and also describes the use of tools to clean up and spell-check the resulting OCR’d text. This will, at the very least, result in a separate text file, hopefully containing a fairly accurate version of the article text. Finally, the cleaned-up text can be submited to a Named Entity Recognition service. Toine’s exercise uses NER <a class="c3" href="http://cogcomp.cs.illinois.edu/demo/ner/">tools at University of Illinois</a>. (We’ve been using similar functionality provided by <a class="c3" href="http://www.opencalais.com/">OpenCalais</a> and <a class="c3" href="http://gate.ac.uk/">GATE</a> for our <a class="c3" href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/infrastructureforresourcediscovery/pathfinder.aspx">AIM25 Open Metadata</a> project.)</p>
<p class="c0">Why do all this? The most important, instant, result of this is that we can now easily index our article for full-text searching &#8211; in a local repository system, such as EPrints or DSpace provide &#8211; and of course by Google. None of this is possible if we leave the scanned image as just that &#8211; an image.</p>
<p class="c2">Another  side-effect of any successful OCR outcome, is that the text is now free to be re-flowed. This means that we might consider sharing it with users in a variety of forms enhancing usability and accessibility.</p>
<p class="c2">It’s important not to confuse preservation formats with formats for access and dissemination. You probably will have your scanned image masters in TIFF, RAW, JPEG2000, PostScript, SVG. None of these are likely to be of much use to your users over the Web. Not only are the formats not widely supported by Web browsers, but most users probably don’t need or want your master image. If it’s a high-resolution scan of a 100 page book, they might be looking at 100Mb download, or worse &#8211; slow to load, and probably slow to render and navigate.</p>
<p class="c2">Time taken thinking what formats will give users the best experience is time well spent. What platforms might they want to use now and in the foreseeable future? It’s less than 18 months since Kindle3 made e-book readers affordable, and the Ipad made them sexy. E-books look and function very impressively on both platforms (albeit in different ways): for an overview of some of the benefits of the EPUB format, see Martin Fenner&#8217;s post <a href="http://blogs.plos.org/mfenner/2011/01/23/beyond-the-pdf-%E2%80%A6-is-epub/">Beyond The PDF&#8230; is EPUB</a>. PDF outputs may yet have their uses, if users can at least search for text within them. The point is that only with properly digitised text, do these kinds of accessibility options become possible.</p>
<p class="c2">Even image collections can also be disseminated as E-books &#8211; nice offline items some users might care to flick through on their tablet computers, possibly even smartphones. I&#8217;ve demonstrated how we can <a href="http://sasopenjournals.blogspot.com/2011/07/populating-ojs-from-eprints.html">create OJS XML from EPrints XML on-the-fly with XSLT</a>: since EPUB and Mobi/Kindle are XML-based formats, we should be able to do something similar to create e-books using repository APIs. Also, by using appropriately sized images in dissemination formats (Ipad screen is 1024x768px; Iphone4 is 960x640px) we can not only ship our users a sensibly-sized download, we can protect any capital we may have in the master images, without having to resort to ugly tricks like watermarking. (Giving users full-size, high-res images with embedded watermarks seems to me the worst of all worlds.)</p>
<p class="c2">Therefore I&#8217;d suggest that, in order to get the best out of a digitisation project, consider what would you like to see at the end of the project &#8211; and, more importantly, what would give your  users the best experience, or even win you new users? Ask around, do some tests, with users if possible, and get an idea how they want to use the materials and how they will get the best out of them. Maybe there are comparable projects and systems that you admire, with features you’d like to be available for your collection. What about in five or ten years’ time: will your current project outputs help or hinder longer term accessibility goals?</p>
<p class="c2">This kind of vision of is essential. Without some conception of the end result, how the materials will be used and managed most effectively, all the scanning in the world isn’t going to amount to  a successful digitisation project.</p>
<p>&nbsp;</p>
<hr class="c9" style="text-align: left; width: 50%; margin: 0 auto 0 0;" />
<div>
<p class="c2"><a name="ftnt1" href="#ftnt_ref1">[1]</a> Of course manuscipts and ‘difficult’ print formats &#8211; early printing typefaces, multilingual objects &#8211; may be resistant to OCR. For that we may need specialised solutions or rekeying, as discussed in recent posts on DA Blog (<a class="c3" href="http://dablog.ulcc.ac.uk/2011/05/05/house-of-books-part-2/">House Of Books (Part 2)</a>, <a class="c3" href="http://dablog.ulcc.ac.uk/2011/02/21/synergies-abound/">Synergies Abound</a>). Or the kind of online tool we developed with UCL for <a class="c3" href="http://dablog.ulcc.ac.uk/2010/03/01/transcribing-bentham/">Transcribe Bentham</a>.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/07/26/scanning-is-different-from-digitisation/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Populating OJS from EPrints</title>
		<link>http://dablog.ulcc.ac.uk/2011/07/14/populating-ojs-from-eprints/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/07/14/populating-ojs-from-eprints/#comments</comments>
		<pubDate>Thu, 14 Jul 2011 21:34:00 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Amicus Curiae]]></category>
		<category><![CDATA[eprints]]></category>
		<category><![CDATA[IALS]]></category>
		<category><![CDATA[journals]]></category>
		<category><![CDATA[OJS]]></category>
		<category><![CDATA[Open Journals]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[SAS-OJS]]></category>
		<category><![CDATA[xml]]></category>
		<category><![CDATA[XSLT]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1618</guid>
		<description><![CDATA[From the SAS Open Journals project blog. Now that a full complement of Amicus Curiae articles has been loaded into the SAS-Space repository, I have been looking at ways to populate the OJS database automatically using the metadata available in the repository. We are fortunate, as ever, that EPrints provides a wide range of export formats [...]]]></description>
			<content:encoded><![CDATA[<em>From the <a href="http://sasopenjournals.blogspot.com/">SAS Open Journals project blog</a>.</em>
<br/><br/>
Now that a full complement of <a href="http://sas-space.sas.ac.uk/view/collections/ialsac.html">Amicus Curiae</a> articles has been loaded into the SAS-Space repository, I have been looking at ways to populate the OJS database automatically using the metadata available in the repository.
<br/><br/>
We are fortunate, as ever, that EPrints provides a wide range of export formats for individual item records and for sets of records. On the <a href="http://sas-space.sas.ac.uk/view/collections/ialsac.html">Amicus Curiae</a> Collection page, we can see that EPrints gives us the option to export the metadata for the whole collection as a bibliographic citation (plain text or HTML), in formats for reference management software (Reference Manager, BibTex, EndNote) and in several other bibliographic data formats, including Dublin Core and METS.
<br/><br/>
However, I've chosen to base our process on the EP3 XML format of EPrints, which I've worked with before (when we <a href="http://dablog.ulcc.ac.uk/2011/06/22/open-repositories-2011-part-3-changing-platforms/">migrated SAS-Space</a> from DSpace to EPrints). It is the native EPrints export/import format, and arguably contains the most faithful serialisation of item metadata in the repository.
<br/><br/>
I've now created an XSLT stylesheet that transforms the EP3 XML for the Amicus Curiae collection into the "<a href="http://pkp.sfu.ca/wiki/index.php/Importing_and_Exporting_Data">native.dtd</a>" XML format which is the native import/export format for OJS. The biggest challenge in XSLT was grouping the journal articles by issue number, as required by the OJS native format, but once I'd found a way to do that, the rest is just fiddling about, as it so often is with metadata mapping.
<br/><br/>
<span id="more-1618"></span>Once the EP3 XML is transformed to OJS format, then, with a Journal already defined in OJS, we can use the OJS import function to import a complete set of issues, each containing its full complement of articles. It's also possible to include a cover image for each issue (if one is available), and the article in PDF form can either be embedded in the XML using Base64 encoding, or linked-to using a URL. Since our articles are already online in the SAS-Space repository, I used the URL option. (It seems that this imports the object into OJS filestore: we will investigate whether it's possible to prevent this, and have the online journal simply link straight to the item in SAS-Space.)
<br/><br/>
At the moment the XSLT stylesheet is working for our purposes, but offers the intriguing prospect that it could be enhanced to work over any result set in an EPrints repository, and made available as an EPrints Export Plug-in. This way, anyone wanting to quickly assemble, or reassemble, an online journal in OJS, can do so from articles deposited in a repository.
<br/><br/>
This could be an attractive scenario for anyone trying to retrospectively assemble an online journal from scans of a printed journal: once the materials are deposited in the repository (with sufficient metadata, of course, and, ideally, OCRed), then the data needed to implement a fully working journal in OJS is only a click away.
<br/><br/>
Existing OJS journal managers might even choose to manage their deposit and review workflow using the repository, and export to OJS when ready. This project gives us an interesting opportunity to compare the two approaches to item submission workflow, and I hope we'll be able to report back on that later.]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/07/14/populating-ojs-from-eprints/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The BlogForever survey is live!</title>
		<link>http://dablog.ulcc.ac.uk/2011/07/11/the-blogforever-survey-is-live/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/07/11/the-blogforever-survey-is-live/#comments</comments>
		<pubDate>Mon, 11 Jul 2011 14:58:24 +0000</pubDate>
		<dc:creator>Silvia Arango-Docio</dc:creator>
				<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1606</guid>
		<description><![CDATA[After weeks of design work, the BlogForever survey is live, available in 6 languages and running for 28 days. This survey is part of BlogForever, an EU-funded collaborative project that ULCC collaborates through the Digital Archives department. The results of the survey, available at the end of the summer, will help to develop digital preservation, [...]]]></description>
			<content:encoded><![CDATA[<p>After weeks of design work, the BlogForever survey is live, available in 6 languages and running for 28 days.</p>
<p>This survey is part of <a href="http://blogforever.eu/">BlogForever</a>, an EU-funded collaborative project that ULCC collaborates through the Digital Archives department.</p>
<p>The results of the survey, available at the end of the summer, will help to develop digital preservation, management and  dissemination  facilities for weblogs. Hence, we are keen to gather information about the content, context and usage patterns of current weblogs, so we could identify blogs users&#8217; views on their long-term preservation, management, analysis, access and use. If you would like to take part on the survey please use the following link:</p>
<p><a href="http://iprobe.gr/Surveys/BlogForever/"><img class="alignnone size-medium wp-image-1607" title="banner_web" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/07/banner_web-300x200.jpg" alt="" width="300" height="200" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/07/11/the-blogforever-survey-is-live/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Digital Forensics and creation of a narrative</title>
		<link>http://dablog.ulcc.ac.uk/2011/07/04/forensics/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/07/04/forensics/#comments</comments>
		<pubDate>Mon, 04 Jul 2011 10:11:08 +0000</pubDate>
		<dc:creator>Ed Pinsent</dc:creator>
				<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[digital forensics]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[Records management]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1591</guid>
		<description><![CDATA[I was very interested to hear Jeremy Leighton John speaking about the nature of digital forensics at the recent DPC event on Digital Forensics. He worked on the &#8220;Digital Lives&#8221; project at the BL as part of their eManuscripts lab. The day began with his overview and ended with the demo of a powerful analysis [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/773px-PersonalStorageDevices.agr_.jpg"><img class="alignleft size-medium wp-image-1598" style="margin: 5px;" title="773px-PersonalStorageDevices.agr" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/773px-PersonalStorageDevices.agr_-300x232.jpg" alt="" width="300" height="232" /></a>I was very interested to hear Jeremy Leighton John speaking about the nature of digital forensics at the recent <a href="http://www.dpconline.org/events/details/31-Forensics?xref=30" target="_blank">DPC event on Digital Forensics</a>. He worked on the <a href="http://www.bl.uk/digital-lives/index.html " target="_blank">&#8220;Digital Lives&#8221; project at the BL</a> as part of their eManuscripts lab. The day began with his overview and ended with the demo of a powerful analysis tool called BitCurator in a Box from Chapel Hill which can analyse, decompress, and extract at a super-fast rate and generate a bunch of XML on the side.</p>
<p>For a naïve non-tech archivist like me, this felt like a new way of looking at and thinking about digital data. The principle seems to be that it&#8217;s possible to capture an image of an entire disk, and then perform stepped analysis on all the data contained within that image. Most models of digital preservation, management or curation that I&#8217;m familiar with tend to focus on the <em>file</em> as the unit which we must identify, migrate, catalogue or preserve; here the target is different, and we&#8217;re dealing with a whole wodge of related digital spew that includes files, technical metadata, automated logs, and lots of system elements from the registry.</p>
<p>Digital forensics is often applied to &#8220;personal archives&#8221;, such as the disk image of a single author / researcher; or is used by commercial organisations seeking to prove something in a court of law; or in extreme cases in tracking down evidence of a crime. As presented by Leighton John, a number of characteristics of the forensic approach struck me as being quite resonant with the aims of digital preservation and/or electronic records management:<br />
<span id="more-1591"></span>
<ul>
<li><strong>No change</strong>: you have to capture information without changing it; demonstrate you have not changed it; and analyse it without changing it.</li>
<li><strong>Audit trail</strong>: forensics requires working out what happened in the past from traces of information, then having to defend it in a public place.</li>
<li>It looks at <strong>everything</strong>: context is provided by examination of all data, records, files and system components in a case.</li>
<li>It tells a <strong>story</strong>: forensics is the creation of a narrative, with lots of details and evidence.</li>
</ul>
<p>This struck me as overlapping with aspects of records management, and descriptive of many features of what we&#8217;d expect an EDRMS to deliver. The &#8220;No Change&#8221; dimension is a crucial part of records management when it comes to the authenticity of the record, and also aligns with the requirements of <a href="http://www.thecabinetoffice.co.uk/page28.html" target="_blank">BIP 0008 on legal admissibility of digital documents</a>. EDRMS systems offer audit trailing as standard, but it&#8217;s mainly the audit trail of what happens to file objects once in the EDRMS itself. I wondered if digital forensics could indeed build this detailed technical &#8220;narrative&#8221; of record creation activity through analysis of an individual user&#8217;s disk image, and I asked a bemused question along these lines.</p>
<p>Naturally, forensics alone wouldn&#8217;t satisfy all the MOREQ requirements for records management, and we&#8217;d still need our mechanisms for managing retention and disposal. Some of the more obvious barriers to success would include:</p>
<ul>
<li><strong>Privacy and rights</strong>: what member of staff working anywhere would welcome having their entire profile analysed? At the very least they&#8217;d feel pretty sensitive about it! But maybe it need not be all or nothing; there&#8217;s plenty of &#8220;rich data&#8221; in any given system and a lot of it could be associated with individual files without capturing the entire data image.</li>
<li>The <strong>costs </strong>of doing it: I&#8217;m not sure but I get the impression it&#8217;s neither cheap nor trivial to carry out a forensic analysis of a disc image; it requires specialist technical skills, a dedicated environment, there are numerous steps to the process, and it takes a long time to complete.</li>
<li>Potential <strong>loss of context</strong>, a problem which may be partially resolved by a specialist form of search and analysis, which can be used as means of indexing one&#8217;s forensics findings.</li>
</ul>
<p>This is all just speculation of course, but I do like the mental exercise of thinking about digital data in ways that don&#8217;t involve mapping to paper analogues, a place where I think we have been trapped for too long.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/07/04/forensics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Moderation in everything</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/23/moderation-in-everything/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/23/moderation-in-everything/#comments</comments>
		<pubDate>Thu, 23 Jun 2011 11:24:40 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[Training]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1571</guid>
		<description><![CDATA[Question: What do alcohol and a VLE discussion forum have in common? Clue: you&#8217;ll have to read to the bottom of this article to find out. History Spot at Institute of Historial Research ( IHR) looks very good.  IHR is one of the many jewels in the crown of ten member Institutes of the School [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright" src="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAQQAAACvCAIAAACUxjfEAAAgAElEQVR4nO1d+VeUR9bu/+ibZGYUen3X3thEIILsxoUxESdGHQUVVFCjnkgwQT1GM6PRGU2URcCIyRBhVIi4xzEqssumzSZbr+/3w0PX9CS2Q0M3bzfUc+7hQHf1rVtV97n3VtXbqkiap0j0Al/by6WHYu6hkNsACopggSLUI5m/Ineg+6UIflAyzBMy+GseAj2fctk5HfhMBgqK+QqfybAsyBCISZlLBHoe/DVvge430HZOBzQzUFBMgZKBgmIKCn+lM4qZQa559rVff7WXS8909NPMIDP85QSB7jfQzhrofqcDr2SQa5EoKOSC38hAyUMxlwiEv3ndM/iK97zAX/opKDwRCH9TeFNKMTdI8IJg69df7eXSMx39Cn8ppZgZfF1sCiAQ80nJEKSYzaJOB77266/2cumZjn6vZKCgWGigZKCgmAIlAwXFFCgZZEa8F8ht10IEJcMU/OWUvuqhZAgeUDJMQS4yUAQPKBmmQMlAQckwBUoGigVEhpldylDMDYIhaijiFhLi4+Pf8tYb4a/2FMEPxVKKpUuXLl3qbYL81Z5iBpiD+fwvMviRWEGFN452Ou3fMlmzaU8R/Ji3ZPAVlAwLE54lLi2TKCim4JUMsQGGr/3Opv1c2iPXfIY65JoozzWat5kBSVBuKyhmiDc6a6Axb8lAQeErFHOclSgoghYhT4YlXuBre7kwl3NF8XYo5HYGCopgASUDBcUUKBkoKKZAyUBBMYUFR4aYIIPc80HxHyi8LVK0n+Av/b7qma2TzhVmN7sLF/6aT8/PeiUDBcVCw7wtk+Se2OlC7nmi+A+8kiHQiy2XHor5gUD4FS2TKCimoAjE5oaCIhRByUBBMYV5S4aoEIHc80TxHygivcBfix3q+uWyx5ueYEOg7femMxD9KnxdJAqKuUT0HCZ5r2WSvzqQSz8Fha+Yt2Twl3657Jn+Ei4oBHR+5m2ZNLtKdebwlz1zOVcUgNcNNAXFQgMlA8WsEOEF/tU/N/16JYO3znyFXPop5gfmcpUV/nJKCopQByUDBcUUKBkoKKagMFPICm8LI1f7UO93NnooGSgopkDJIDNMXiBX+1DvdzZ6KBkoKKZAyUBBMQWFt/QxX2H2njGn334GSnz9CMXcY8GR4bcwGo1Go3GOm/nUkmJusNDJYHRjms2m39IvzSjmEgudDIB/M4N/+6WYM1AyUFBMgZLBZKKZgcJkMplMCiOF0WgwGKbTZprN/NsvxZxBYaCgoDAYDAbDfCCDXq+PiIiIiorieV4URZPJpNfr8brBYOB53mw2cxyH100mE3lLEASTycTzfEREhDflRndO+NWLer0eP0lfoigaDAb8ZFk2IiICxnhTMjN4swddYzgYJgYuCILZbOZ5HsM3Go0cx5nNZn/ZM58Q8mQQRVEURUEQDAYDlhyrLggCfEIUxcjIyOjoaLzO87zRXalDA7zH134FQYDfwxGhgdys4XfPxv4Y69sA78c3tjAhhAAGg4HnedhgNpsJgSl+hZAnQ2RkpNFohB9gpVmWhSOazeaIiAiO43Q6ncGdIgwGg8lkgq+Qe/gZ9BsVFQVqiaKIiAviwRiwEYQxm81+zAzegCxnNBox2CVLlsAeRASkPlgVaEv8C70XBKKvgJPB22B8hTf9Wq0WkRjORzyb4zi4JgnVcFlRFCMiIlDhiKKo0+lInfOWfn/7CjIPnolHJBZFkeM4vAUewjtJ4vLL/Hizx+CRrERRZFmW4zgMk+d5juMEQcBswCRf+52v8FyLkCcDKdxRL6EeQJgUBAFhkhTNYAjaeJ6pIZb7BMRdjuOQHIxGI2okYhhcEIniLcnHX/NjMplYljW4U5bBYIiMjGQYBhsJnJYgH4Kx/uo31OG5Fgq5jZktUBYnJSUlJiYuX7582bJlqampWHiESbPZzLJsTExMWlpaQkJCWloaWsbFxanVaswCx3H/syOQzfNPsA4a4HbLli2Lj49PTU1dvnw5fC4qKipAzvcrexiGiYyMFAQBRwU8z7Msq9frkRB4nmcYZunSpZio9957z+/2BAiiFwSiL4W3zgINXwfvDTzP79u3r7e3t7+/32KxDA8Pt7W1FRYWkgbwibKyshcvXoyNjfX29r569aqvr+/8+fPwEnjP2+387StwcQRdhOQDBw709fUNDAx0dnaWl5dHRESQ+sTg3ujPHt7sYRgmIiKChABBENLT0/fs2YOsqNPpYmJiqqqqenp6Xr16dffuXX/ZM58Q8mQQRfGTTz6x2WySJDkcDrvdLklSRUWF3iMoRkVFdXR0SJLkdDodDockSZIkVVZWonbieR6ljjc7vb2IMEwYdfDgQafTKUmSzWa7ceMGdKIL+Khf8BZ7WJYFq00m04kTJ7q7uxsbG81mM8wQRbGurs5qtUqS9Pz5c3/ZM58gGxm8QfACb+15nt+/fz+8ED8lSfrll19iY2MZhkG9lJGR4XK5wBZJkux2u8PhqK6u1uv1Op2O7HGhUK1Wo/TneR57ElxEcByHbECYg4+QXvbt2wcbXC5XfX09tivIIXq9HjsW9AWga0InURTRBr+jR5PJxDCMwb0pZxiGHFUZ3Ce2sFOv1yNHGY3GmzdvTkxMSJLU2NhI6rSoqKja2lqn02m321taWjA0lmXBZCQW7KnMZjOOJQRBYFkWBohuypGYhfsK0iliCrE/FKHw5nyhAlEU9+/fb7fbXS4XPF6SpIGBgT/96U+oXsxm88GDB10uFwnbaFNWVsYwjNls1ul08Cf4B5zAYDBotVqwhWEYvALf9Qz5OMwVBIHn+QMHDoBs4+Pj169fhwMRV0a1plKp4P06nS4qKgpE0mg0onsvLrg3IXBTURRxY4hSB8fHOCkS3Ue6Wq0W9qM7vV7/4sULm83mdDqbmprIxYJarV69evWGDRs2bdr0wQcfiG42khNhlmUxFSqVivAEx2Isy6INBo4jOLPZrFKpMEX4p+8QNbB9ktUpZoiQJ4Ner9+3b5/NZgMZRkdH4ZFFRUXwIaPR+MMPPyBg412woqKiQqvVEn9FbEhJSVm3bl1OTs7u3bs/+uijxMREcjkdFhYmCAJuDHCVsWbNmq1bt+bn569atcpkMu3duxc0Gx8fb2hoIA7BMAzHcZGRkZs2bSooKCgoKFi7du3y5csNBgOyB3gSFRWVmpqanJyclpbG83xKSsqWLVtycnIyMjIMHtd2KSkp27ZtKyws3L59e25u7vr165OSkjBSnucTExMTExP7+/sRF+7cubNy5cqMjAzknISEhKSkpJUrVyYkJMA2kAoHAEuWLNm4cWNubu7u3bvz8vIyMzNjY2OR1liWZVkW7dPT01NTUzHkiIiIxMTErVu3btmyJTs7OyUlhfBZHm+YHRR8kMGbod7ai6JYWFiIzOBwOB4+fIjYf/XqVbhjdHR0V1cXnKOpqUmSJKvVijKJ53m4I8dxcXFxX3/9NbYWSCCTk5Pt7e1Hjx6Njo5WqVS42dXpdDqdbu3atQ8fPhwbGwPxxsbGrly5cuLECexYJicnr1+/jmgqCEJkZOTOnTufPn1qsVisVqvNZrNarR0dHSUlJUuWLMEQtFptfn7+q1evhoeH29vbDx061NraCks6OjpWrFhhNpuzsrIePXqE0YHPdrt9bGysubm5oKAAVNm/f//w8PDk5CSGabfbBwYG2traMjMzDQbD999/PzQ09OLFi8ePH/PuYkypVMbGxhYXFz9+/HhsbEySpImJidHR0fHx8QcPHhQWFpLT54iIiBUrVjx9+nRkZKSnpyc/P//zzz9vb2+X3Hjx4sX+/fujo6PBnJBD0GUGX0kiiuLu3bvhlDab7fz58zabzeFwtLe3x8TEGAyG7OxsMOH169enT58m++zKykpBEBCYU1JSHj58CPeyWq1wJhDMbrfX1tYmJCRotVpU1StXruzt7YWrod/JyUmXyzU5OQkyTExM1NfX8zxvMBg0Gk1xcXFPTw8pz0ZHR+GpY2NjZWVler0ep0A7d+602Wyo9bu6umw2G0qdrq6u6Ojo5ORki8UCik5MTGBEdjdGRkZycnIYhjl8+DAGCLZgu2yxWLKysliWvXr1Kmzo7OxESuQ4LjY2try8HDQAzfAp/JQk6fTp0zqdDjnkvffee/nyJV5/+PAhxjs2NjY+Pg57Jicn8/Pz9e4L+NDCfCBDQUEBqiCr1bpnz57h4WE4aFZWFsMwJSUl4Mm///3vY8eOwUscDkdVVRWqW1EUa2pq4NBOp7O7u/uHH34oKysbHx8ni11aWopaQq/X19fX22w2vPX48ePjx4+fOXOmra3t9evX8EKbzYbMIAjC8uXLu7q6YIDFYjl37tzRo0d//vlnWDgxMZGTkwPa5OXlwQXHx8dBwoGBAYvF8s033/A8X1VV5XA4wIGKiorDhw8XFRX985//RFUmSdKlS5dMJlNubu7PP/9Mxjg+Pv7s2bOmpqaVK1cyDHP9+nW0f/bsGbYuWq32yy+/nJiYsNvtTqdzcHCwsbHx0qVLz58/dzgcExMTk5OTNptt7969GHtGRsbg4ODY2NjIyIgkSa9evTp79uypU6fa2tpAJ5fLde3aNVomyQNBEAoLC0nZ8MEHHzx69AircuzYMVEU6+rqELb//ve/Hz9+/Fdk0Gq1GzduRKgbHh5+8uRJUlISAltqampzczNi/+DgYFZWlsFgSE9Ph/M5HI4nT57ExcWpVCpBEFauXPny5Uun04nI2tDQoNPpWJb929/+Bjo5HI4tW7ZotVpcft27dw9huKmpied5hmFyc3PJKCwWS15entlsTklJSUhIiImJOXbs2E8//TQwMPDVV1/hsplhmPj4+EePHtnt9omJiVu3buFRWZPJ1NLSgrje0NCADTRC+48//ojhdHR0YIu8ZMkScFWSpP7+/o8//litVhuNxpiYmMOHDyPS2+32tra2mJgYjUaTkZExNDSE4Q8NDX300UeCIHAcl5iYaLFYECB++eWXuLg4ed16Zgh5MrAsu2fPHqyxw+FITU397rvvUAg1NjYuW7ast7cXHpaXl1dcXIwFs9vtFy9eRAC7cOHCxMQEqvnNmzfDX5E09u7di3JIkqQjR46wLHvo0CGr1Qqn37VrF8/zuPAymUz/+Mc/4FUjIyM3btzAoefz589RIN27dy86OloQBOybDx06BDKMjo7iQWv05XA4rFbr999/LwgCOcPRaDTYxeIKj2EYlmWXLl26devWFy9eoNN79+5hm8txXE9PD2bj1q1b5CEUtVrd0NCAxNXa2oqDrG3btsEMm8124sQJHMKCOYIgnD17luxP1q9fz/N8ZmYmSjj0iINjnErdunULJWJra2tsbKzMbjEjyEYGzke8RRXcCMuWnp7+2Wef4TS9p6dn7dq1kvt6ITY2tqSkhFw1lJeX8zwfGRn5r3/9C85tt9txsKPT6XDWnpmZ+fLlS3ykqqqK5/mTJ09iLyFJUnp6Os43OY5Tq9WffvopLvVcLteNGzfgyv39/XixpaXlzJkzf/3rX0+ePHny5MkrV66gpdVqTU5O1ul0O3bswBmAy+U6evQodtU6nU4QBBxoImYnJiZeuHChp6cHQyZ3iPfv38fZqCAInZ2dGOPt27dxUqxWqzmOq6urwyy1tbUJgsBx3BdffIGSzGazffjhh7hwEAQBFdT27duxfXK5XJ999plWq33//ff7+vpAj7KyMrKUPM9XVlbiQK+5uXmhkMFXJ54DFBYWOhwOuEVGRsbmzZuxARgZGblw4QIct6enJzIysri4mGwTq6qq4K937951Op2Tk5NdXV1Il1qtFqyIi4vD+ZLdbv/xxx/1ev2RI0fghXa7PSsri9xCGI3GrVu3kszQ2NiIw3jirNgkvH79enJyEhsD8mJ2drbJZNq2bRvZohw4cIBzP1PIcRz49umnn9bW1g4MDDidTuyzx8bGMDqbzXbnzh2e5/V6vVar7e7uxiaqsbERZ2U8z4uiWFtbiw13e3u7SqUSRfH8+fOk06SkJAwf9yE4Oyb6jx07xvN8ZmZmX18fnP7ChQtIRNybyCCfO8wcCrkN8AN2794tuW/TMjMz4+Pju7u7EfAGBwex2NeuXRMEAWSAo1RXV6MWevDgAVy2p6dHrVaLooj7Jp7nU1JS8ESTJEn19fUGg+Hzzz/HHtTpdK5YsQJkQBwtKCggaae+vh5nNePj49hVj46O9vX1WSyWoaEhi8Xy8uXLgYGBvr6+3t7e1atXsyxbWFj4+vVrON/OnTt5ntdoNLhxU6vVx48fJ8c4NputpaXl/v37paWlvb296PTevXuwmWXZV69eIRzcuXMnMjJSp9Mhg9XX1yOZPH36FJpPnTrlcrlQJa5YsQKsw1ZKpVKtX7+ePOFSUlJiMBgyMjL6+vowh5QMwQiQAWuWmZnJcVx9fb3nlbPT6SwuLtbpdCUlJXjR5XKVl5fDj2tqasCc8fHxqKgoHLYuWrTIaDSuWrUKxyZOp/Ps2bMMwxw5coRE+nXr1iEScxyn0Wi+/PJLci5ZV1fHcZwoigjkkiTdvHkzKysrNjY2Pj5+2bJliYmJCQkJ8fHxycnJJpNJq9Xm5eWhTJIkadu2baIoKpVKbOWXLl0KM6xWa1tb27p16/D/FjMM09jYCPPu3r0LY3ie7+vrw7Hv7du3IyMjNRqNTqfTarW1tbWwHHsGURSLiopAdavVmp+fHxYWhkQXFRUliuKuXbswsTabbceOHaIopqen9/f3YwLLysrIEhAyOJ1OSgY5UVhYSB49Sk9PZ1kWx6lONxwOx6pVq1iW/eKLL8hWu7S0FMXDwYMHEexdLte33367aNEibHM5jrt06RKuHaxWa05ODsdxH374ITzebrfX1NQgeAN37tyBcpRJqLVu3ryJI86enh5cVsBli4qKSktLDx48mJ2djfIdZACXduzYwXEcyhVBEHJzc3HDYLfbjxw5IgiCyWRSKpU4t8VHbt26hTtBlmWxZ3C5XD/99JPB/VVPQRDq6+sxG83NzUqlUhTFDz/8ELcEDofj2rVrcXFxPM+r1Wo8qvTo0SOS1lJSUhiGwSYKqbWiogLzj/xAyRAU2Lt3LzkjSktLwxpbrVbEe0mSLBYLDo6KiorgJQ6Ho6KiAjvLtLS0jo4ObItHRka+/vrr1atXr1q16ty5c4iaNpvt/v37cXFxeIKoubkZTBsaGjp16lRiYmJycnJlZSUp1UZHR2/cuKHT6dRqNTYSSA6XLl1asWJFVFRUbm7u8PAwyvfHjx/jpKigoIBc2+Xn5+MhEY1Gw/P8hg0bSOqrq6tDVZaQkHDmzBmyBXr48KFGo0GZh1sCSZI6Ojo2bty4ZcsWs9ms0WiuXbuG4be2tuIQTBCEpqYm7JKdTmd1dXV2dvb69esPHz787Nmz8fFxjKi0tBRMIycKTqezoqIC51rzhwxsiINhmD179mDNXC5XSkqKIAixsbEtLS3E769du4ZiBnsG+D0Cm0ajwV4C92jQ09/fTy4TrFZrb2/vpk2bNBoNLp527drleUdrsVgGBwedTufIyAjOZx0OR11dHe/+Pl1TUxMJ+V1dXe3t7aj+EZLz8vLUajXP89u2bUN+m5iY2L59O8uysJnjuMjISHJaOjEx8eTJkwcPHjx58gTX2IjTnZ2dSBco/LBnwNnU4ODgpk2beJ5HmWS321tbW6Fcp9NlZ2dbLBaUYXh8a2hoCIe8GCPuDbA7SktLI9cp5eXlSETsm8ggp0/MFAomROBtADqdbs+ePeS4E9lcr9cjCuK45vDhwxzHabXao0ePkvh98eJF7CzBh6Kiou7ubsnjsNLlctlstubm5s2bNzMMw3EcNrUGg+HEiRMjIyMulwucsdlsXV1dZWVlcCm73X7z5k3UIQzDJCYm3r59G09hwNfRy+DgYElJiSAIcKn8/HzcBEuShBsM2MyyLM/zxcXFZPdit9tfv34tSVJ3d3dNTQ1MxTMXULV37148AYUQbrfbN2zYwDAMyGCz2dra2nAIizn8+OOPW1paUCiS+cGTKQ0NDatXr4ZajuOSk5PBZJCB8Z4ZZHWWGULBMWxIC8uya9asuXz5ck1NzTfffIO7T41Gs3PnzosXL165cqW0tDQlJQUutWnTpsrKysuXL1dUVODhtvDwcBQAWq02IyOjpKSkoaGhs7Ozvb29trb2yJEj0dHRaMAwDDmuYVn2L3/5S1VVVVtb2/Pnz7/99tv3338/MzPz7Nmz3333XWVl5YEDB3AqhbpFr9d/8skn1dXVLS0tnZ2d9+7dO3/+/J///Gcc/7Msq1Kp1qxZU11dXV1dXVZWtnr1arAUdFWr1TqdbvPmzTU1Na2tra2trbW1tV999VVGRkZSUtK5c+cuX75cWVm5YcMGg8GAp8R37NjR2NjY3d399OnTixcvpqam4jG+q1evlpeXnz59mud5jBqIjY0tKCiAha2trQ8ePLh48WJOTk5MTAzGzjCMVqtNTEw8ffr0pUuXampqtm/fjk0X3t23b19VVVVFRcWJEyeioqK8BS/ZHeYt4jMZWB0TVCKKokqlUqlUKK/hN1hpjuNUKhWeKRIEASfrGo0G3x/geT4sLAy3p2gMaLVajUaDlSPRHTUSy7IMwyBMog0iN778AF5hyysIQlhYmOfXITiOw+v4FC7UwBZ8NwiqwsPD4dDoAoMit844+uQ4Tq1WwzylUonqiGXZxYsXg3swEkWgWq2GGYsXLyalERgIAwRB4Hke+slnSSogf/I8j0k2Go1KpRKhgRiJb5MzDKNWq/E9B2/RV3aHeYsofP0Ao9UFlcCHcEqD5cTakKWC3yCwYQk1Gg3cgoQrjUaDlUZLhGRSqKA9UYLfCesYhlGpVKg6yKew8PBalUrF87xSqQwPDwcPSS5CTiAPgICucHokDehUqVRqtRqWKJVK4tPk4Q64MigH30WAwJ9arRbXgvgToyC+jhFh1OHh4SAeZsCTD+QaGy/id1wsknnAMRR61HmB7A7zFgl5MsD54J2eXg4OwJlI8S24v8dIVgs+TZ6qADdI5CZVBDybvMUwDEkyJAxzHBceHk7IiVzBuq+l8N0G+Bk8GwTAqQ54iLgbHh4Or/qvgMqySBGc+8kUpVKp0Wg4jlMqlUgC8EhcNeB33EmDGLATA9FqtZgf8hFYiEHBNjJ86EEXhLEkcIB4hGB6vR5EWhBkCDbBYsCTPKMUVhorCg9AWMXvJPXjg3B9EuwJwRAdiTuS5Sc/Se3kyRBogFsgXyGcE6YR9yKsQ7/kd+QH/Im6nNhP+Ek8m1Q+sApkI1SH13qOCL1Puab7YABxPSwsDM1IZiOJjmzlkX6hnDw9BYWwEGl5QZRJwSYAPAYJATEPPxHMSNQkIQ1ugcUm5yHwD61WixUla0/yviclUHtAOTYDarUatTscBTsWNCaOqHOfRZK4i1egDQ/n/Wq3ALYj3pM9A4aMHkmhjzbYfjAMQw6L4LswBsMkOZOUQ6TQwnx6DhmPZmAqSJLh3f8KDuYTL3puaUKSDN6MDhUgviqVShLz8DopzfEnqfIZdzRl3BULWKFz73RBKnyEFAwkjnrmB/gH6Me4axISoQnNSIEO7ye9w03JiS1KF8/qBZyBv5LajwRgxu2RqJQY984ePZJ0RyI6ISQGRXhIAj/JTvByz50PLMSskjRCtlKe2RWvkGkPLSjkNmC2wLyTJxFILetZ7iMKku0EVo6ckzJu32U8CmXPCPerAgkgJ1eeJQ1xXMZdOBFu4E+ih2Qh8i7Ygoc1CD917t0I6ZRxk9nTEUkSwNBIWYVgj3qGJBzPQogMyrPY85xAYifhA+iK6ggafpVsCcFCDgotBQWFVqvVaikZKCimQMlAQTEFhSbI4M1Que2aa8ylE1AAQUcGCgq5EHRkUHuB3HbNNbzNA0XgEHRkCDS8TUSw9RuY5aZ4GygZpjBf+6WYPhQalZoKFSoalVqhVWuoBKF4W7Bga++rnkDrn017SgYqVKaEkiFIhWYGGTKDr0ZQoTJfRaFWqqhQoaJWqigZqFCZEkqGIBVVuPKNEmztfdUTaP2zaa/w1QgqVOarUDL4LMqwcGVY+Nz08lsJtJ6F3K/fyODNiEBPir8m0deRBk5/8DjHQuuXkmGGYwyE/kDMg78kVNZ9Nv6gCPRkyeX0fh+XvxQGyGmCbT5DsV9KhmmNy1/a5mZRAyGhsu5BQYb5KpQM85sMnkLJ8D9EFibMzM6AOl+g9QRDv/9FhvDFYfNGZPfOAImv4w30/PhLfzDYqfifrUNUZPfaAAklQ+DsVMjutVSoBIlQMlChMiWUDFSoTAklAxUqU6IIW7QYIrsp/hUyrnkmvo430PPjL/3BYCclQ4gJJUPg7FT4qzMqVEJdKBmoUJkSrxtouQwKNnuoLByhmYEKlSlRLP7jojeK7JZRoTLHQslAhcqUBF2ZRMlJRS6hZKBCZUoWHBm86feXyDUuKrMXr3sGuWTRH/74RpHdMCrzXhTenI8KlYUmfiPDH3//hzdKoPVToeIvCRkyUJJQCbQovL1ByUBloYlXMlChstBE8Yd3fx9Q8dUgf+mhQsVXCXhm8NW5A022UBd/zae/5j/Q9szlPARdmUSdfn7PZ7DZ4ylBRwYqVOQSv+0Z5BpAsNlDJXTXJeBkCDY9VOZGQnFdZNtA+4sMss8glXkjQUcGmgFmNp/+ah9s9s9lv7JtoKnT+3feAt0+2DJ8IPoN+dOkUKeKXM4daDvlmp/Z6An5S7c5W5WQmJ+ZtQyEnXLNz2z0BPxxjDmWabrCPJYFPvzZyHwjAxUqPsnv33mXyHwjAxmY7JbMYDE8xdf2v/3U9FsGwk655mdm8zZFhuk0okJlIcj8JMO7v3tHdhuohJzMNzK8+7t3ILJb4qvBvxJf27/xIz419q+dcs3PbPQoUE55vvGWGQzCyfWXMwV6UakEv0xlhtmTIdTF2wTJpSfQ/Qa6fSjqUfz+N6VFoJeNCpXgFOpNMN8AAABSSURBVMUbvf+d//udr4re+b/f+SRy6QmVcck1D8G2XnOpR+GpYiGTYZa+KLtQMviHDL994y2dBfNgKBkoGWajR/HGN97S2XwVXyc90HqozL38P8hdDDPa/ksrAAAAAElFTkSuQmCC" alt="" width="260" height="175" /></p>
<p>Question: What do alcohol and a VLE discussion forum have in common?</p>
<p>Clue: you&#8217;ll have to read to the bottom of this article to find out.</p>
<p><a href="https://historyspot.org.uk/frontpage">History Spot</a> at<a href="http://www.history.ac.uk/"> Institute of Historial Research</a> ( IHR) looks very good.  IHR is one of the many jewels in the crown of ten member Institutes of the <a href="http://www.sas.ac.uk/">School of Advanced Study</a>, part of the <a href="http://www.london.ac.uk/">University of London</a>. It offers many training opportunities for historians and the like. History SPOT is a platform and an opportunity seized by IHR to launch and disseminate IHR activities.  It will significantly increase and enhance access to the materials by anyone who needs these skills/information/knowledge.  What we saw so far, looks intelligently designed with the user at the heart of the model.</p>
<p>On Tuesday we gathered to hear more about HistorySPOT and also from the Open University about their issues with VLEs, they knowing more than a little about such matters.  History SPOT was launched in March 2011 thanks to a grant from an  anonymous donor to develop the IHR’s online presence (especially to  widen participation of its research seminar programmes).  It was soon  decided that they would use this donation to develop two core areas of the  IHR’s remit.  The first was to widen knowledge of and participation in their internationally renowned research seminars through the use of  podcasting, live streamed events, and vodcasting (video).  The second  was to provide for the first time, free access to content from our  research training modules and courses.</p>
<p><span id="more-1571"></span>Dr Matt Phillpott gave us an overview of presently available online training courses relating to the  Humanities and the concluded that there were not that many. The  Open University is the closest we have and of course   iTunes is an option but this is not perfect as it neglects much contextual  information about the courses on offer.  The OU also ran us through their various offerings, including the online tasters they provide (all in Moodle) which used to be called Learning Space but now is called Open Learn.</p>
<p>So all in all putting courses into a VLE is a good thing right? Well, of course! At IHR it increases access to the wealth of material being built up at IHR and hopefully adds to IHR&#8217;s income stream which will enable it to deliver more high quality and useful stuff to put into History Spot.</p>
<p>An interesting discussion arose about the age cohorts of people using our VLEs, many young people will be very happy with a purely online experience but the quite high drop out rate at OU reflects that not all can motivate themselves in this way. Agreement was reached that it is very difficult to teach about the meaning of something as well as it is to train via a VLE.</p>
<p>Of course putting our courses on a VLE is not as simple as it seems. Transferring the pedagogical experience to the environment of a VLE is not straightforward.  To recreate an experience of the  group dynamic which occurs in a training course or that magic of the learning moment, we can resort to forums, discussion groups and the like. However these will only go so far and the more is not the merrier. In fact the more forums the less likely people are to contribute and more problems can arise.</p>
<p>A positive side to online discussions is that people who normally wouldn&#8217;t contribute in a face to face course may be more inclined to do so in a VLE.  However, and here is the pitfall, the experience of many hardened and wise VLE providers will indicate that all these courses and discussion groups need  constant moderation. People can write things and express themselves in ways online which they wouldn&#8217;t dream of doing face to face. Like alcohol it seems to have an emboldening effect and people will say things which whether intentional or not can be misconstrued and cause great upset. This has to be decided on when planning and designing the course.  The moderator will need to be alert to this at all times while running their course, and so the lesson is again &#8216;Moderation in everything&#8217;.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/23/moderation-in-everything/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DA visits CERN</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/22/da-visits-cern/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/22/da-visits-cern/#comments</comments>
		<pubDate>Wed, 22 Jun 2011 15:41:57 +0000</pubDate>
		<dc:creator>Silvia Arango-Docio</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[CERN]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[FP7]]></category>
		<category><![CDATA[Organisations]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1540</guid>
		<description><![CDATA[Last week I was in a very welcoming Geneva, exactly in the European Organization for Nuclear Research (CERN) to meet other partners working on BlogForever and to have several Invenio workshops. I felt very lucky to be in the hub of such an organization and to see how many young international students are getting the [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1556" class="wp-caption alignleft" style="width: 202px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computercentre_cern-300x225.jpg"><img class="size-full wp-image-1556 " title="CERN Computer Centre" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computercentre_cern-300x225.jpg" alt="CERN Computer Centre" width="192" height="144" /></a><p class="wp-caption-text">CERN Computer Centre</p></div>
<div id="attachment_1557" class="wp-caption alignright" style="width: 164px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computer_raks_cern-300x300.jpg"><img class="size-full wp-image-1557  " title="CERN Computer Racks" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computer_raks_cern-300x300.jpg" alt="" width="154" height="154" /></a><p class="wp-caption-text">CERN Computer Racks</p></div>
<p>Last week I was in a very welcoming Geneva, exactly in the <a href="http://public.web.cern.ch/public/en/About/About-en.html">European Organization  for Nuclear Research</a> (CERN) to meet other partners working on <a href="http://blogforever.eu/">BlogForever</a> and to have several <a href="http://invenio-software.org/">Invenio</a> workshops. I felt very lucky to be in the hub of such an organization and to see how many young international students are getting the opportunity to be in the forefront of high physics research.</p>
<div id="attachment_1558" class="wp-caption alignleft" style="width: 164px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/Outside_Globe_Cern.jpg"><img class="size-medium wp-image-1558 " title="The Globe at CERN" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/Outside_Globe_Cern-300x300.jpg" alt="The Globe at CERN" width="154" height="154" /></a><p class="wp-caption-text">The Globe at CERN</p></div>
<p>CERN is home of the world&#8217;s biggest and most powerful particle accelerator, the <a href="http://public.web.cern.ch/public/en/lhc/lhc-en.html">Large Hadron Collider</a> (LHC). This machine is installed in a 27 km circumference tunnel. The LHC records around 15 petabytes per year. All the data is stored in their vast <a href="http://cdsweb.cern.ch/record/1103476/">computer centre</a>, where open access and sharing has been the driving principle since their foundation in 1954 and an inspirational environment for the Web to be born there.</p>
<p>Invenio&#8217;s workshops showed us that their electronic document management system is robust and versatile, targets the management of more than 1.2 million documents and it can be used in 19 different languages. Its content is clean and complete. In just their High Energy Physics domain, they have around 700 collections and approximately 20K queries a day. As well Invenio is used for special programs like the UNESCO funded digital repos in Africa and EU funded projects like <a href="http://www.d4science.eu/">D4Science</a> and <a href="http://www.openaire.eu/">OpenAIRE.</a></p>
<div id="attachment_1559" class="wp-caption alignright" style="width: 250px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/me_at_CERN-300x225.jpg"><img class="size-full wp-image-1559  " title="Silvia at CERN" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/me_at_CERN-300x225.jpg" alt="Silvia at CERN" width="240" height="180" /></a><p class="wp-caption-text">Me at CERN</p></div>
<p>In the case of BlogForever and Invenio, plenty of work to be done by the Invenio Team at the User and Document Service Group. At the moment, they have more than 30 readily available <a href="http://www.python.org/">Python</a> modules that can be adapted to the case of preserving huge amount of blogs. From the point of view of my work with repositories as part of the <a href="http://www.ulcc.ac.uk/content/repositories-he-and-research">Digital Archives and Repositories Team</a> at <a href="http://www.ulcc.ac.uk/">ULCC</a>, I was inspired by Invenio&#8217;s advance search engines; indexing and ranking methods.</p>
<p>In a more personal level, if you are ever crossing the border between France and Switzerland near Geneva, get Tram 18 and hop off at CERN to see their <a href="http://microcosm.web.cern.ch/microcosm/Welcome.html">Microcosm</a> and <a href="http://public.web.cern.ch/public/en/spotlight/SpotlightGlobe-en.html">Globe</a> exhibitions.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/22/da-visits-cern/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Open Repositories 2011 (Part 1)</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/22/open-repositories-2011-part-1/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/22/open-repositories-2011-part-1/#comments</comments>
		<pubDate>Wed, 22 Jun 2011 10:00:46 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Repositories]]></category>
		<category><![CDATA[dspace]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[eprints]]></category>
		<category><![CDATA[Open Repositories]]></category>
		<category><![CDATA[OR11]]></category>
		<category><![CDATA[repositories]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1487</guid>
		<description><![CDATA[Rory and I had a fun, productive and informative time at Open Repositories 2011 in Austin: everyone involved agreed that this year&#8217;s OR conference at the University of Texas was a great success. The conference kicked off with a keynote from Jim Jagielski of the Apache Software Foundation, describing the history and organisation behind Apache [...]]]></description>
			<content:encoded><![CDATA[<p>Rory and I had a fun, productive and informative time at Open Repositories 2011 in Austin: everyone involved agreed that this year&#8217;s OR conference at the University of Texas was a great success.</p>
<div id="attachment_1490" class="wp-caption alignright" style="width: 205px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/IMAG0559.jpg"><img class="size-medium wp-image-1490  " title="Chris Awre, William Nixon, Rory McNicholl at the Longhorns stadium" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/IMAG0559-217x300.jpg" alt="" width="195" height="270" /></a><p class="wp-caption-text">Chris Awre, William Nixon, Rory McNicholl at the Texas Longhorns stadium</p></div>
<p>The conference kicked off with a keynote from Jim Jagielski of the<a href="http://www.apache.org/"> Apache Software Foundation</a>, describing the history and organisation behind Apache and its projects. It was observed by some in the Twitter backchannel that the talk could as easily have been from 2001 as 2011, but for all that it was a worthwhile reminder that, in all our efforts, we stand on the shoulders of the giants who created and maintain the infrastructure of the Web and the Internet. And also that many our endeavours benefit from a little more dedication and commitment than you can usually squeeze between 9-to-5.</p>
<p>The closing keynote was by repositories stalwart Clifford Lynch, who managed to touch on so many perennial repository themes, I won&#8217;t attempt to summarise them. There is a handy <a href="http://storify.com/datag/clifford-lynch-keynote-at-open-repositories-2011/">anthology of tweets about his talk on Storify</a>.</p>
<p>In between were plenty of presentations and opportunities to meet friends old and new from the United States of Repoland &#8211; some we have worked with, some we would like to work with, and many with challenging ideas and insights into the many facets of working with repositories.</p>
<p><span id="more-1487"></span>The OR conference hops back and forth across the Atlantic (I&#8217;ve previously attended <a href="http://dablog.ulcc.ac.uk/2008/04/02/open-repositories-2008-in-southampton/">OR08 in Southampton</a>, <a href="http://dablog.ulcc.ac.uk/2009/06/10/open-repositories-2009/">OR09 in Atlanta</a> and <a href="http://dablog.ulcc.ac.uk/2010/07/09/open-repositories-2010-in-madrid/">OR10 in Madrid</a>). Unfortunately when the conference is held Stateside, the representation of the EPrints community tends to be noticeably smaller. Not that there aren&#8217;t EPrints users in the USA (we were particularly pleased to meet the team from <a href="http://library.caltech.edu/">Cal Tech Library</a>, very happy users and advocates of EPrints), but the distribution of software platforms is significantly different from Europe in general, and the UK in particular (if you are interested in such things, you can check out the statistics at <a href="http://www.opendoar.org/find.php?format=charts">OpenDOAR</a>). And of course travel logistics (and costs) are non-trivial. Luckily Rory and I had been saving our prize money from <a href="http://devcsi.ukoln.ac.uk/blog/2010/07/13/we-have-a-winner-developer-challenge-at-open-repositories-2010-madrid/">last year&#8217;s Developer Challenge</a>!</p>
<p>While it eluded me in previous years, I think at last I am starting to grasp at least some of the salient points of the thing they call <a href="http://www.duraspace.org/">Duraspace</a> (launched, if I recall, in Atlanta)! I&#8217;m certainly hoping to find time to take my <a href="http://duracloud.org/trial_account">free Duracloud trial</a>. However other aspects still remain opaque to me. At one panel discussion about the prospects for implementing DSpace over Fedora (or Fedora under DSpace, depending which way up you look at it), I was surprised to hear a description of ongoing DSpace-Fedora alignment efforts as &#8220;more about the journey than the destination&#8221;. An enviable luxury: for the time being we need tangible outcomes for our repositories and customers, and that&#8217;s one reason why we&#8217;ll be sticking with EPrints for the foreseeable future.</p>
<p>Personal highlights for me are described elsewhere: the <strong><a href="http://dablog.ulcc.ac.uk/2011/06/14/open-repositories-2011-part-2-the-developer-challenge/">Developer Challenge</a></strong>, which we enjoyed immensely, and <strong><a href="http://dablog.ulcc.ac.uk/?p=1499">Changing Platforms</a></strong> the talk that I presented with Imma Subirats, of the UN Food &amp; Agricultural Organisation, where we discussed migrating between repository platforms. Rory also had a chance to meet developers from Yale, who had worked on the other end of the <a href="http://dablog.ulcc.ac.uk/2011/02/21/synergies-abound/">SOAS-Yale Islamic Manuscripts</a> collaboration, and show off some of his work for the <a href="http://digital.info.soas.ac.uk/cgi/c">SOAS repository</a>. We were also hugely appreciative of the generosity of the <a href="http://www.eprints.org/">EPrints t</a>eam, who kept us generally amused and amazed, and kindly included us in their group dinner on the last evening.</p>
<div id="attachment_1502" class="wp-caption alignright" style="width: 280px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/241367_860021243981_61013483_44532206_7367009_o.jpg"><img class="size-medium wp-image-1502 " title="Ade Stevenson on stage at the Blue Moon" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/241367_860021243981_61013483_44532206_7367009_o-300x225.jpg" alt="Ade Stevenson on stage at the Blue Moon" width="270" height="203" /></a><p class="wp-caption-text">Adrian Stevenson&#39;s got them all-night late bar open repository blues...</p></div>
<p>The facilities at UT&#8217;s AT&amp;T Conference Centre were outstanding, as was the surrounding campus generally, including the Longhorns football stadium (with its insanely massive west stand) where the conference dinner was held. Austin has far more attractions than we could see in such a short time, and it is an impressive and vibrant city, from the spectacular grandeur of the Texas state capitol, to the noisy entertainment on 6th Street, where virtually every bar has some kind of rock or blues band playing. We were most impressed by UKOLN&#8217;s Adrian Stevenson who jammed on a borrowed guitar with the blues band in the Blue Moon bar at 2am. As if that wasn&#8217;t enough, our visit also coincided with the massive Republic Of Texas biker rally &#8211; an insanely noisy procession of up to 50,000 bikers through the main streets of the city. Our ears won&#8217;t forget OR11 in a hurry.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/22/open-repositories-2011-part-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Open Repositories 2011 (Part 3): Changing Platforms</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/22/open-repositories-2011-part-3-changing-platforms/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/22/open-repositories-2011-part-3-changing-platforms/#comments</comments>
		<pubDate>Wed, 22 Jun 2011 09:38:22 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Repositories]]></category>
		<category><![CDATA[dspace]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[eprints]]></category>
		<category><![CDATA[Open Repositories]]></category>
		<category><![CDATA[OR11]]></category>
		<category><![CDATA[repositories]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1499</guid>
		<description><![CDATA[To OR11 I took a presentation, jointly with Imma Subirats, from UN FAO in Rome, which we called Changing Platforms. The aim of the presentation was to discuss the subject of migrating repositories between different software platforms. In addition to her work at FAO, Imma is Chief Executive for the E-LIS repository, a major international [...]]]></description>
			<content:encoded><![CDATA[<p>To OR11 I took a presentation, jointly with Imma Subirats, from UN FAO in Rome, which we called <em>Changing Platforms</em>. The aim of the presentation was to discuss the subject of migrating repositories between different software platforms.</p>
<p>In addition to her work at <a href="http://www.fao.org/">FAO</a>, Imma is Chief Executive for the <a href="http://eprints.rclis.org/">E-LIS</a> repository, a major international and multi-lingual repository of articles about Library and Information Science. E-LIS has operated since 2003 on EPrints, but last year migrated to DSpace, because <a href="http://www.cilea.it/">CILEA</a> in Italy, who generously donate support and hosting, now focuses exclusively on working with DSpace. The E-LIS migration has been largely successful, however a number of EPrints features on which the E-LIS editors and users depended, have been difficult to replicate in DSpace, or had to be put on ice. This is no reflection on the specialists at CILEA, but perhaps indicative of more profound differences between EPrints and DSpace, that aren&#8217;t always reflected in the usual comparisons of repository platforms, such as the otherwise informative <a href="http://www.rsp.ac.uk/start/software-survey/results-2010/">JISC RSP Repository Software survey</a>.</p>
<p>ULCC of course has just completed a repository migration from DSpace to EPrints for the School of Advanced Study. Our motivation was in many respects the same as that of CILEA &#8211; our expertise lies firmly in the EPrints camp. But I think the outcomes for our end-user community are more demonstrably positive: in fact I don&#8217;t think there&#8217;s a single feature of the new SAS-Space-on-EPrints that isn&#8217;t a major improvement over its previous incarnation.</p>
<p>Migration of metadata and data (at least from DSpace to EPrints) presented few issues (that weren&#8217;t of my own making!) &#8211; export, transform, import. Here the similarities between the models of the two platforms was extremely valuable. But we did encounter other significant differences, some of which are set out in more detail below.</p>
<div id="attachment_1510" class="wp-caption aligncenter" style="width: 562px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/241364_859817033221_61013483_44527715_7459153_o.jpg"><img class="size-large wp-image-1510" title="Richard presenting Changing Platforms at OR11" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/241364_859817033221_61013483_44527715_7459153_o-1024x288.jpg" alt="Richard presenting Changing Platforms at OR11" width="552" height="155" /></a><p class="wp-caption-text">Richard presenting Changing Platforms at OR11</p></div>
<p style="text-align: center;"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/241364_859817033221_61013483_44527715_7459153_o.jpg"></a></p>
<p><strong id="eprints"><span id="more-1499"></span>Issues in EPrints</strong></p>
<p>Perhaps the most significant issue we encountered with re-implementing SAS-Space on EPrints was the absence of built-in support for Handle persistent identifiers. Handle support comes out-of-the-box with DSpace, but not with EPrints, so the choice we faced was between re-implementing Handle support, or dropping it. We chose the latter, since the benefits of Handles to a relatively small IR like SAS-Space were not obvious, and so it was hard to justify the extra cost and effort. By <a href="http://www.slideshare.net/bezbozhnik/changing-platforms/20">ensuring that items kept the same ID</a> when migrated from DSpace to EPrints, and implementing a simple rewrite rule, we have ensured that Handle URIs created while DSpace was operational continue to point to the same item &#8211; but for items added since EPrints went live, no new Handle URIs are coined.</p>
<p>(Shortly after we returned from OR11, an extended discussion broke out on Twitter, amongst several well-respected gentlemen in our field, about the benefits of using Handles. A considerable amount of scepticism was expressed about their usefulness.)</p>
<p><strong>Issues in DSpace</strong></p>
<p>Imma described some workflow issues encountered with the new implementation of her repository. The E-LIS team is accustomed to a very flexible EPrints-based workflow that allows items to have their workflow status changed quite freely. DSpace, by contrast, has a unidirectional workflow model, so that items cannot (for example) be reverted from Live to Pending, if some kind of error is spotted, but effectively need to be deleted and resubmitted. This is obviously a significant divergence between the superficially similar repository platforms.</p>
<p>Another example Imma gave of a perplexing feature of the default DSpace UI is the button on each abstract page that says &#8220;View Full Item Record&#8221;. It leads to a rather intimidating web page displaying the item metadata as Qualified Dublin Core. It&#8217;s not a very attractive display, nor is it actually a &#8220;data&#8221; rendering of the metadata (as you would get by explicitly choosing to Export As XML, or from some new-fangled Linked Data features). It&#8217;s not clear why this view would be of interest to general users of the repository: why is it there?</p>
<p>At OR11 I talked to several people working with DSpace, and all agreed that there&#8217;s room for improvement in the default Web UI. In some cases they have completely reimplemented the web templates. It&#8217;s also worth noting that the page layout in the default JSP UI is entirely implemented using HTML tables, and doesn&#8217;t pass W3C validation. For a Web application that&#8217;s nearly 10 years old, this is disappointing. (The alternative Manakin XML UI implements an attractive vision of UI abstraction using XSLT, but reports suggest that configuring/maintaining it is not for the faint-hearted.)</p>
<p>Quite a few Web design infelicities are perpetrated in the default Community, Collection and Abstract page templates. (During the conference, many of us enjoyed and applauded Simeon Warner&#8217;s timely rant,&#8221;Don&#8217;t <strong>bold</strong> the field name&#8221;.) Of course we can change them &#8211; it&#8217;s Open Source, isn&#8217;t it? &#8211; but is it unreasonable to expect default Web templates that are at least potentially usable as is? Of course the natural and reasonable response of the DSpace community is to ask that we report the issue as a bug or feature-request to the development team. Or fix it ourselves and share the fix. But where an absent feature is really important to a user (by which I probably mean a repository manager), then the choice faced is between &#8220;getting by&#8221; until it&#8217;s implemented in the core distribution, or doing it themselves (which probably means hiring a specialist developer to implement it for them).</p>
<p><iframe src="http://www.slideshare.net/slideshow/embed_code/8305736" width="400" height="337" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe><br/><br/><br />
<strong>Out-of-the-box</strong></p>
<p>At the <a href="http://www.aepic.it/conf/DSUG2007/viewabstract8587.html?id=331&amp;cf=11">DSpace User Group meeting</a> in 2007, I described how we considered that, back in 2005, DSpace offered a better &#8220;out-of-the-box&#8221; experience than EPrints. I never thought it was anything to write home about &#8211; in fact I remember being disappointed by the very UI issues I&#8217;ve described above &#8211; but to my untrained eye it did seem better than EPrints, at the time. But, as I&#8217;ve <a href="http://dablog.ulcc.ac.uk/2009/12/21/our-new-eprints-repository-is-not-just-for-christmas/">mentioned elsewhere</a>, EPrints has improved remarkably since.</p>
<p>Of course a lot of people we admire have proved that you can create impressive repository systems using DSpace. It performs and provides a lot of essential repository functionality. Its Lucene search engine is certainly better than anything EPrints currently offers. But I&#8217;m still surprised how much more work seems to be necessary to make a DSpace installation as readily useful and usable as EPrints, and this seems to represent considerable additional cost in setting up DSpace.</p>
<p>I&#8217;ve heard it sometimes argued &#8211; in both EPrints and DSpace camps &#8211; that Repository setup shouldn&#8217;t <em>be</em> too easy, lest repository managers get in a mess and endanger the integrity of their system. In my opinion, as developers and solution providers, our job is to provide as many features and tools as possible to enable Repository Managers to manage their collections effectively and easily &#8211; not act as as gatekeepers to their systems and data.</p>
<p>By way of contrast, we have recently supported the Institute of Education (IOE) in setting up an EPrints repository of UK government publications, and we were pleased to see the repository manager called on us very little, other than to answer some questions and apply a few small configuration changes. The experience with SAS-Space has also confirmed to me that EPrints now has strong out-of-the-box appeal, and a rich set of features available through the Web UI, that enable a reasonably confident repository manager to get to work without needing to initiate a major technical project.</p>
<p>In the current climate, of straitened library budgets, this could make a considerable difference to the viability of a repository startup project. For a growing number of libraries and information services &#8211; not least at smaller research institutions, or in developing countries &#8211; that could be the difference between having a repository, or not.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/22/open-repositories-2011-part-3-changing-platforms/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Open Repositories 2011 (Part 2): The Developer Challenge</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/14/open-repositories-2011-part-2-the-developer-challenge/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/14/open-repositories-2011-part-2-the-developer-challenge/#comments</comments>
		<pubDate>Tue, 14 Jun 2011 10:50:06 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Repositories]]></category>
		<category><![CDATA[challenge]]></category>
		<category><![CDATA[Challenges]]></category>
		<category><![CDATA[DevCSI]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[MERLIN]]></category>
		<category><![CDATA[OR11]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[SHERPA-LEAP]]></category>
		<category><![CDATA[University of London]]></category>
		<category><![CDATA[University of Texas]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1463</guid>
		<description><![CDATA[An event that asked developers to demonstrate the Future of Repositories can only be considered a great success when it receives entries that include: Multiple real-time examples of using &#8220;Repositories As A Service (RaaS)&#8221;, not only exchanging data but also sharing sophisticated functionality between EPrints and DSpace &#8211; and even including an Android application A [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1465" class="wp-caption alignright" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/4d849b5ea9d545a146cb5119f3cb07af08202f38_wmeg_00001.jpg"><img class="size-medium wp-image-1465 " title="OR11 Developer Challenge" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/4d849b5ea9d545a146cb5119f3cb07af08202f38_wmeg_00001-300x224.jpg" alt="" width="300" height="224" /></a><p class="wp-caption-text">Excitement at the OR11 Developer Challenge Show-and-Tell (Photo by @sparrowbarley)</p></div>
<p>An event that asked developers to demonstrate the Future of Repositories can only be considered a great success when it receives entries that include:</p>
<ul>
<li>Multiple real-time examples of using &#8220;Repositories As A Service (RaaS)&#8221;, not only exchanging data but also sharing sophisticated functionality between EPrints and DSpace &#8211; and even including an Android application</li>
<li>A tool for bundling and depositing a whole raft of research related outputs from the Web via RDF</li>
<li>A tactile repository search interface with dynamic search suggestions, specifically designed for tablets and smartphones</li>
<li>A complete gesture and voice-driven system for depositing and searching in repositories</li>
</ul>
<p>All these &#8211; and other great entries too &#8211; were achieved in a couple of days&#8217; work during the course of the conference, for the annual OR Developer Challenge, and presented at a packed Show-and-Tell session on Thursday afternoon (true, there was free beer).</p>
<p>Stuart Lewis&#8217;s team were worthy winners with their RaaS project, particularly as they showed a genuine commitment to a cross-platform approach &#8211; something which, sensibly, backgrounds the individual software platforms, that often receive too much attention, and focuses on the Repository as an application and entity in its own right.</p>
<p>We were also really pleased to see a prize go to Patrick McSweeney and Matt Taylor. And enjoyed seeing Dave Tarrant stealing the show (again) with his live demonstration of using a Microsoft Xbox <a href="http://www.xbox.com/en-US/Kinect/GetStarted">Kinect</a> to submit items to a repository.</p>
<p><a href="http://is.gd/texasslides">Our own entry </a>may not have won, but several people liked it, and you may see more of it in future. For the second year running, the Developer Challenge was a great opportunity for Rory and me to concentrate on an idea that we&#8217;ve been kicking around, without having found a home for it in existing work (yet). This was true for our Semantic Metadata popup tools that <a href="http://devcsi.ukoln.ac.uk/blog/2010/07/13/we-have-a-winner-developer-challenge-at-open-repositories-2010-madrid/" target="_blank">won the challenge</a> with last year.</p>
<p><span id="more-1463"></span>This year we set about achieving our  longstanding desire to take the very tactile and dynamic <a href="http://lasso.ucl.ac.uk/merlin-ui/" target="_blank">search interface that Rory created for the MERLIN project</a>, and turn it into a touchscreen app, for smartphones and tablets.  The results were pretty effective.</p>
<div id="attachment_1464" class="wp-caption alignleft" style="width: 190px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/texas-app-screenshot1.png"><img class="size-medium wp-image-1464" title="MERLIN Mobile interface (&quot;TEXAS&quot;)" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/texas-app-screenshot1-180x300.png" alt="" width="180" height="300" /></a><p class="wp-caption-text">The MERLIN Mobile interface (&quot;TEXAS&quot;) as demonstrated at OR11</p></div>
<p>The MERLIN interface on LASSO is quite complex, but at the heart of it is the tag cloud of related terms that the Termine text-mining suggests. This always looked like it might be good on a touchscreen, so we stripped it down, rearranged and tweaked it to make it viable on an Ipad screen. If you&#8217;ve got an Ipad, you can try it out by pointing your Safari browser at <a href="http://is.gd/texasweb" target="_blank">http://is.gd/texasweb</a>. (It will work on desktop browsers too, but it looks best on a portrait oriented screen.)</p>
<p>If you&#8217;ve got an Android device, you can even download an app-based version of it from <a href="http://is.gd/texasapp" target="_blank">http://is.gd/texasapp</a>. (It&#8217;s a bit cramped in a smartphone display, but still essentially working.)</p>
<p>If you&#8217;ve got neither we created <a href="http://is.gd/texasdemo" target="_blank">this page</a> to give you a rough idea what it looks like.</p>
<p>It&#8217;s worth mentioning that this is a <em>real, live, working application</em>: enter a search term (&#8220;logical positivism&#8221;, &#8220;climate change&#8221;, &#8220;Jeremy Bentham&#8221;,  &#8230;) and it searches over the full-text corpus of all articles in University of London Open Access repositories (<a href="http://www.sherpa-leap.ac.uk">SHERPA-LEAP</a> consortium members), and makes suggestions about additional or alternative search terms, based on the results of the text-mining analysis of the articles</p>
<p>The hackathon approach of working closely together to create something quickly worked well again: Rory hacks, and I test and review each time he hits &#8216;Save&#8217;! All very agile and iterative.</p>
<p>In honor of our hosts in Austin, we decided to call the new interface <em>Touchscreen Enhanced Cross-Search with Augmented Serendipity</em> &#8211; or TEXAS for short.</p>
<p>Kudos to Mahendra Mahey, <a href="http://ptsefton.com/" target="_blank">Peter Sefton</a> and the <a href="http://devcsi.ukoln.ac.uk/" target="_blank">DevCSI</a> project for putting (and keeping) it all together, and to the awesome panel of judges (even though they didn&#8217;t pick us)!</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/14/open-repositories-2011-part-2-the-developer-challenge/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Fáilte gu Ghlaschu!</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/01/failte-gu-ghlaschu/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/01/failte-gu-ghlaschu/#comments</comments>
		<pubDate>Wed, 01 Jun 2011 14:11:51 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[continuity]]></category>
		<category><![CDATA[data curation]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[DPTP]]></category>
		<category><![CDATA[standards]]></category>
		<category><![CDATA[survey]]></category>
		<category><![CDATA[training]]></category>
		<category><![CDATA[ULCC]]></category>
		<category><![CDATA[web preservation]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1447</guid>
		<description><![CDATA[&#160; The whole purpose of education is to turn mirrors into windows.  ~Sydney J. Harris Football and digital preservation seem an unlikely combination but on May 15th Ed and I arrived to a slightly damp Glasgow celebrating en masse the end of the football season, to deliver our DPTP north of the border.  There, I [...]]]></description>
			<content:encoded><![CDATA[<p>&nbsp;</p>
<div id="attachment_1448" class="wp-caption alignright" style="width: 235px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/486242591_f77da45ef3.jpg"><img class="size-medium wp-image-1448 " style="margin: 5px;" title="486242591_f77da45ef3" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/486242591_f77da45ef3-225x300.jpg" alt="" width="225" height="300" /></a><p class="wp-caption-text">Fáilte go Sráid na Banrighinn</p></div>
<p><span style="font-family: georgia,bookman old style,palatino linotype,book antiqua,palatino,trebuchet ms,helvetica,garamond,sans-serif,arial,verdana,avante garde,century gothic,comic sans ms,times,times new roman,serif;"><em>The whole purpose of education is to turn mirrors into windows</em>.  ~Sydney J. Harris</span></p>
<p>Football and digital preservation seem an unlikely combination but on May 15th Ed and I arrived to a slightly damp Glasgow celebrating en masse the end of the football season, to deliver our DPTP north of the border.  There, I am afraid the comparison between football and digital curation ends but it is worth noting that on Monday our students did not consist, in the main of bleary eyed Scots.  As I am myself half Scottish I was looking forward to hearing about the digital state of play in my mother&#8217;s country but curiously we had a very international group, many had travelled far from places such as the <a href="http://www.ecb.int/home/html/index.en.html">European Central Bank</a>,  <a href="http://ec.europa.eu/index_en.htm">European Commission</a>, <a href="http://www.proni.gov.uk/">PRONI</a>,  and near  such as from the <a href="http://www.nas.gov.uk/">National Records Scotland</a> (who clearly cannot get enough of a good thing as we were with them 4 weeks previous).  The group was very creative who seemed to work well together. I know  I know, we say this a lot but this group seemed to shine for many reasons.  They were keen to work together and shout out about their ideas right from the start. And though being a disparate international group they managed to work together very neatly and develop some great case studies for us.</p>
<p>Internationally (and sometimes nationally!) language has always been a barrier  for communication, but  it is also an enabler  and this is again where we see something such as the OAIS being a useful way of empowering people to be able to communicate with each other, and other professionals.  The DPTP rarely (and rightly so) has students just from the traditional information management world (libraries/archives/etc).  Many people now being redeployed from different parts of their organisation and as such are often unfamilair with the idea/concept of an archive or indeed of the traditional notion of the lifecycle of a document/record. This is where OAIS is very valuable due to the way it which it expresses the way a digital object should be ideally kept and uses a language describe this.</p>
<p>In many of our case studies with OAIS, we often find that many organisations are actually mapping very well to the OAIS which indicates that it reflects  good and real practise when it comes to managing our digital repository. Our job in the DPTP is to take someone who has never heard of the OAIS (suprisingly more often than not) and by the end of the three days have them fluent in the OAIS concepts both through listening and through application to their own environments.  Our feedback from the course has proven that this is a job well done&#8230;.DPTP abú!</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/01/failte-gu-ghlaschu/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Your friendly neighborhood Digital Archives team</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/01/your-friendly-neighborhood-digital-archives-team/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/01/your-friendly-neighborhood-digital-archives-team/#comments</comments>
		<pubDate>Wed, 01 Jun 2011 09:59:28 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[DART]]></category>
		<category><![CDATA[Ed]]></category>
		<category><![CDATA[Patricia]]></category>
		<category><![CDATA[Richard]]></category>
		<category><![CDATA[Rory]]></category>
		<category><![CDATA[Silvia]]></category>
		<category><![CDATA[team]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1440</guid>
		<description><![CDATA[In May, the University of London Communications Office invited us to answer a few questions about ourselves for an Intranet article. We thought we&#8217;d reproduce some of our answers here, for the benefit of anyone else who wants a quick introduction to what we do (and ourselves, in case we forget). Please introduce yourself We [...]]]></description>
			<content:encoded><![CDATA[<p>In May, the University of London Communications Office invited us to answer a few questions about ourselves for an Intranet article. We thought we&#8217;d reproduce some of our answers here, for the benefit of anyone else who wants a quick introduction to what we do (and ourselves, in case we forget).</p>
<hr />
<div id="attachment_1304" class="wp-caption alignright" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2007/12/DART-Team-2011.jpg"><img class="size-medium wp-image-1304 " style="margin: 5px;" title="ULCC Digital Archives &amp; Repositories Team" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2007/12/DART-Team-2011-300x225.jpg" alt="ULCC Digital Archives &amp; Repositories Team" width="300" height="225" /></a><p class="wp-caption-text">ULCC Digital Archives &amp; Repositories Team, May 2011</p></div>
<p><strong>Please introduce yourself<br />
</strong>We are ULCC&#8217;s Digital Archives &amp; Repositories Team, and you can generally find us in the basement of Senate House along with the rest of ULCC. We are five:</p>
<ul>
<li>Ed Pinsent and Patricia Sleeman, our digital archivists, provide digital preservation training and consultancy.</li>
<li>Rory McNicholl is our lead developer and repository systems manager.</li>
<li>Silvia Arango-Docio is project officer for Web archiving projects and repository support.</li>
<li>Richard Davis is team leader, and actively contributes to all of the team&#8217;s projects, as well as pursuing new and interesting opportunities.</li>
</ul>
<p><strong>What does your team do?<br />
</strong>Our department was created in 1997 to develop the National Digital Archive of Datasets (NDAD), and operate it for The National Archives. NDAD was originally a joint project of ULCC and Senate House Libraries, providing a specialised cataloguing and preservation service for government databases.</p>
<p>Since then we&#8217;ve worked on many digital archive and library projects. The NDAD service ceased in 2010, but we continue working on innovative projects for born-digital and digitised records, developing repositories and related information systems for education and research. We provide specialist training and consultancy for the HE and cultural heritage sectors through the highly-acclaimed <a href="http://www.dptp.org/" target="_blank">Digital Preservation Training Programme</a> (DPTP).</p>
<p>Our current or recent partners and customers include most of the University&#8217;s colleges and institutes, the British Library, the Parliamentary Archives, the JISC, the UK Web Archiving Consortium, and the European Commission. We work closely with the <a href="http://dpconline.org/members/list-of-members" target="_blank">Digital Preservation Coalition</a> (DPC), and local groups such as the <a href="http://www.aim25.ac.uk/" target="_blank">AIM25</a> archives network and the<a href="http://www.sherpa-leap.ac.uk/" target="_blank">SHERPA-LEAP</a> consortium of University of London repository managers.</p>
<p><strong>Tell us one thing about your work that colleagues might not know.<br />
</strong>Digital preservation issues might seem esoteric, but they are fundamental to every electronic and computer-based activity &#8211; in other words, pretty much everything in business, education and research. Understanding the best ways to manage, describe and preserve all our electronic information &#8211; from a single email, to complex Web sites, or collections of digital photos or videos &#8211; is an essential 21st century skill.</p>
<p><strong>What aspect of your work gives you the most satisfaction?<br />
</strong>We are very lucky to work closely with information professionals from around the UK and internationally. Patricia has recently delivered preservation workshops at the National Library of Jordan; Ed and Silvia are working on Web archiving projects for the JISC and the EU; Richard and Rory won the annual JISC Developer Challenge at last year&#8217;s Open Repositories conference in Madrid.</p>
<p><strong>What is the most challenging part of your job?<br />
</strong>Keeping up-to-date with constantly changing ICT landscape in education, libraries and archives. Electronic information systems evolve continually, as do the tools and methods for managing them. Luckily there is a wealth of current information on the Web, if you know where to look! We actively share our thoughts and experiences through our long-running <a href="http://dablog.ulcc.ac.uk/" target="_blank">Digital Archives Blog</a> and on <a href="http://twitter.com/#!/digitalarchives" target="_blank">Twitter</a>.</p>
<p><strong>Name one thing that would make your working environment better.<br />
</strong>Sometimes it seems we work with everyone except colleagues in Senate House! It would be great to work more with colleagues at the Central University, and bring more of our accumulated experience to bear on some of the University&#8217;s information and records management challenges.</p>
<p><strong>If you could meet anyone (dead or alive) who would it be and why?<br />
</strong>We&#8217;re not going to agree on this any time soon. Suggestions so far include <a href="http://en.wikipedia.org/wiki/Sofia_Loren" target="_blank">Sofia Loren,</a> <a href="http://en.wikipedia.org/wiki/Hypatia" target="_blank">Hypatia of Alexandria</a>, <a href="http://en.wikipedia.org/wiki/Richard_Brautigan" target="_blank">Richard Brautigan</a> and <a href="http://en.wikipedia.org/wiki/Pete_Townshend" target="_blank">Pete Townshend</a>.</p>
<p><strong>Name three hobbies team members pursue &#8211; the more unusual the better.<br />
</strong>Most of us have small children, so hobbies take a back seat. Ed is an accomplished artist, writer, broadcaster, musician and samizdat publisher. Patricia is about to learn how to stain glass before starting Arabic lessons for the third time.</p>
<p><strong>If, as a group, you stormed the musical charts, what genre of music would that be in, what would be your stage name and who would be your lead vocalist?<br />
</strong>We could be a psychedelic beat group called the Dates Of Creation. Our lead singer would have to be Patricia.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/01/your-friendly-neighborhood-digital-archives-team/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>A Tab in the Ocean</title>
		<link>http://dablog.ulcc.ac.uk/2011/05/24/a-tab-in-the-ocean/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/05/24/a-tab-in-the-ocean/#comments</comments>
		<pubDate>Tue, 24 May 2011 14:52:23 +0000</pubDate>
		<dc:creator>Ed Pinsent</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[UKWAC]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1426</guid>
		<description><![CDATA[I&#8217;ve been using Web Curator Tool (WCT) to curate the JISC website collection at UKWA since 2008. I&#8217;ve long been aware that the system offered me the opportunity to record a lot of metadata, in tabs called General, Annotations, Groups and Access. It&#8217;s a mix of technical metadata (about the gather / website) and descriptive [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/tabs.jpg"><img class="alignleft size-medium wp-image-1434" style="margin: 5px;" title="tabs" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/tabs-300x148.jpg" alt="" width="300" height="148" /></a><br />
I&#8217;ve been using Web Curator Tool (WCT) to curate the <a href="http://www.jisc.ac.uk/whatwedo/programmes/preservation/ajw.aspx" target="_blank">JISC website collection</a> at <a href="http://www.webarchive.org.uk/ukwa/"  target="_blank">UKWA</a> since 2008. I&#8217;ve long been aware that the system offered me the opportunity to record a lot of metadata, in tabs called General, Annotations, Groups and Access. It&#8217;s a mix of technical metadata (about the gather / website) and descriptive metadata. It&#8217;s mainly of value to the curator who wants to keep track of what they&#8217;re doing with the website gathering; but WCT also allows us to create some descriptive metadata for exposure. At the bare minimum, we&#8217;re required to use Groups; despite its name, this component is actually a simple subject classification scheme, allowing me to tag all my websites with &#8220;Higher Education&#8221; for example. Once stored in the WCT database and rendered through Wayback Machine, this subject selection translates into <a href="http://www.webarchive.org.uk/ukwa/subject/79/page/1" target="_blank">this useful view of the collection</a>.</p>
<p>Recently the British Library team approached all the users of the shared WCT tool. It seems that the curators involved in UKWA have been using these metadata fields slightly differently and the BL team have initiated a project to move towards more consistency. The project will involve deciding on definitive interpretations of how to use these fields, followed by a process of cleaning up legacy data stored in the system. Some of it is potentially useful, some of it not so useful; some is legacy from the earlier PANDAS phase of the project, mostly not needed, or entered into the wrong field.</p>
<p>As noted, a lot of this metadata is mainly to do with selection and evaluation decisions, curation information such as changes in status of the site, and as such it&#8217;s never been exposed anywhere except within WCT. However, one descriptive field will eventually end up exposed on the UKWA live site, and provide us cataloguer types an opportunity to describe the resources in more detail. It will appear on the Title Entry Page (TEP) for each instance.</p>
<p>I welcome any move towards exposing more descriptive metadata on the UKWA public site. I have always taken the view that the phrase which currently appears alongside a Title “The live site may provide more information” is not really very helpful in the context of a web archive, for three reasons. (1) we don’t want our users clicking away from UKWA; (2) the link to the live site may be dead by now, and; (3) as archivists and curators, I feel strongly that we are the ones who should be providing that “more information” in the shape of a catalogue description of some kind.</p>
<p>The JISC project sites, as a collection, have high evidentiary value as stages in development of very specific tools, services and activities that benefit the UK Higher Education community. The sites by themselves don’t always explain their history or intentions; I would argue that a lot of rich contextual detail about the reasons these sites existed (the JISC programme under which they were developed, the dates, the staff involved, the themes, the outputs) would help interpret the collection to the users and make it more intelligible.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/05/24/a-tab-in-the-ocean/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>House of Books Part 2: OCR and Arabic texts</title>
		<link>http://dablog.ulcc.ac.uk/2011/05/05/house-of-books-part-2/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/05/05/house-of-books-part-2/#comments</comments>
		<pubDate>Thu, 05 May 2011 09:41:56 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[Amman]]></category>
		<category><![CDATA[Arabic fonts]]></category>
		<category><![CDATA[Arabic OCR]]></category>
		<category><![CDATA[Arabic script OCR]]></category>
		<category><![CDATA[Baghdad]]></category>
		<category><![CDATA[digitisation]]></category>
		<category><![CDATA[Digitisation Service]]></category>
		<category><![CDATA[House of Books]]></category>
		<category><![CDATA[Iraq]]></category>
		<category><![CDATA[Jordan]]></category>
		<category><![CDATA[OCR]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1280</guid>
		<description><![CDATA[&#8216;Machine replication of human functions, like reading, is an ancient dream&#8217; * One of the many topics discussed in the House of Books project in Amman was the issue of OCR and Arabic texts. Optical character recognition or OCR has become one of the most successful applications of technology in the field of pattern recognition [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1310" class="wp-caption alignleft" style="width: 232px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/glimpseoftreasury1.jpg1.jpg"><img class="size-medium wp-image-1310 " title="glimpseoftreasury.jpg" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/glimpseoftreasury1.jpg1-222x300.jpg" alt="" width="222" height="300" /></a><p class="wp-caption-text">A glimpse of Petra, Jordan</p></div>
<p>&#8216;Machine replication of human functions, like reading, is an ancient dream&#8217; *</p>
<p>One of the many topics discussed in the House of Books project in Amman was the issue of OCR and Arabic texts. Optical character recognition or OCR has become one of the most  successful applications of technology in the field of pattern  recognition and artificial intelligence. It is now a necessary step in  the transition from analogue text to the elctronic world, particularly due to the  quantity of information now available in the electronic age as it enables rapdi searching and scanning. In the last five decades, machine reading of text has grown from a dream to reality.</p>
<p>Software for OCR is now almost 100% sucessful for Roman scripts. Middle Eastern library content however, particularly for Arabic and other non-Roman language materials, poses special challenges to the creation of digital repositories of arabic texts.  Arabic, being a diacritic language has many characters (letters) which have exactly the same form, and are distinguished only by the position of various dots over, above, or inside the main character block. This  poses special difficulty for OCR, as dots can be ignored by software as speckling or error, or even removed. Most institutions digitising Arabic manuscripts  use Sakhr OCR software, but it does not seem to pick up the intricacies of  Arabic script. What to do?</p>
<div id="attachment_1332" class="wp-caption aligncenter" style="width: 274px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/arabic-fonts.jpg"><img class="size-medium wp-image-1332 " title="arabic fonts" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/arabic-fonts-264x300.jpg" alt="" width="264" height="300" /></a><p class="wp-caption-text">Some arabic fonts http://university.arabsbook.com/</p></div>
<p><img src="file:///C:/DOCUME%7E1/cziapas/LOCALS%7E1/Temp/moz-screenshot-8.png" alt="" /></p>
<p><img src="file:///C:/DOCUME%7E1/cziapas/LOCALS%7E1/Temp/moz-screenshot-7.png" alt="" /></p>
<p>It seems that if prepared well the Sakhr recognition software package has the capability to recognize generic Arabic fonts (called Naskh or Kūfī) with a fair degree of accuracy. However the software has to be taught to recognize any peculiarities or unusual characteristics in the font of the scanned volume in question. This is extremely time consuming and requires technical expertise. Also it is taken for granted in such a process that the font will be more or less consistent throughout any given volume; in many cases the hand can change in any manuscript so I imagine it would need to be reinstructed according to each section where the hand or the font changes. In addition the quality of any OCR depends on the quality of the original scanned file.  Also not everyone wants to use generic fonts, think of how much we like to personalise our own? Another headache for Sakhr.</p>
<p>Our group in Amman as a whole expressed frustration with  Sakhr and really hoped that it could in some way be  generally instructed to recognise characters which it consistently fails to pick up. We felt sure that it will be solved soon and I personally cannot imagine that the military have not got a  solution up their sleeve  about this considering the politics of the world these days.</p>
<p>Interestingly in terms of resources discovery, Google Scholar does not  allow searching in Arabic, while it allows for searching of both Japanese and Chinese scholarly texts. Surely as complex for an OCR piece of software to recgnise as Arabic?   This means that any texts written in Arabic  cannot be accessed, which means that scholarship in Arabic is not being  picked up by one of  the biggest and widest search engines for scholarly  literature. Why such an oversight by Google scholar? I have contacted them and have yet to find out!</p>
<p>This of course brought home the real need for more collaboration between libraries and archives involved in digitsiation projects in the Middle East itself. There are many  projects based in North America such as <a href="http://www.library.yale.edu/ameel/index.htm">Ameel</a> and in the UK such as<a href="http://www.soas.ac.uk/ysimg/"> SOAS</a> (which our own Repository folk in DART have been working on!) which unify and make available digital resources from the Middle East.  There was also an interesting <a href="http://www.soas.ac.uk/ysimg/">JISC study</a> with the University of Exeter about user requirements for digitised resources in Islamic studies. These are of course a western approach to arabic material, albeit in their own collections. It often also is concerned with transalations of arabic texts to greek or Latin as was the norm.</p>
<p>The issue of OCR and its sucess rate for non Roman fonts also raises  questions about the power of the digital and askes the question that if  OCR canot serve one of the great languages &#8211; Arabic,  how many minority  languages which are also very diacritic are not being served well by the  OCR sofwtare available.  The result of this must a tip in the balance  of available reserach material in favour of texts in Roman script and  sees an imbalance in what is being made available online.</p>
<div id="attachment_1326" class="wp-caption aligncenter" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/baghdad.jpg"><img class="size-medium wp-image-1326 " title="baghdad" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/05/baghdad-300x226.jpg" alt="" width="300" height="226" /></a><p class="wp-caption-text">Baghdad at night, 2011</p></div>
<p>There is a need for the countries which created this material to work together on such projects. Many very interesting and topical projects  to do with the emergence of which were being proposed in Amman relating to digitisation and working together to track missing journals as well as trying to avoid duplicating efforts.</p>
<p>So how to do this? Several libraries attending our workshop in Amman highlighted the necessity to coordinate the effort for  Arabic texts digitization in order to avoid duplication, share best practices and develop common standards, index and software. To enable this  a decision was made to work on  developing new cultural cooperation interventions for digitisation in the Middle East; to fund-raise for this and to set up groups in a social network (facebook, linkedin) including all the participants from the House of Books project. Importantly further workshops will be run to encourage this cooperation and hopefully see strides being made in cooperation and digitisation of arabic texts in the Middle East.</p>
<p>* http://www.nr.no/~eikvil/OCR.pdf</p>
<p>**Thanks to Qaiss Hatef  Saeed of the Iraq National Library and Archives for his help.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/05/05/house-of-books-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The House of Books/Dar El Kataub/دار الكتب والوثائق العراقية   Part 1</title>
		<link>http://dablog.ulcc.ac.uk/2011/05/03/the-house-of-booksdar-el-kataub-part-1/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/05/03/the-house-of-booksdar-el-kataub-part-1/#comments</comments>
		<pubDate>Tue, 03 May 2011 07:17:17 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[Arab texts]]></category>
		<category><![CDATA[Dar El Kataub]]></category>
		<category><![CDATA[Egypt]]></category>
		<category><![CDATA[House of Books]]></category>
		<category><![CDATA[Iraq]]></category>
		<category><![CDATA[Iraq National Library and Archives]]></category>
		<category><![CDATA[Jordan]]></category>
		<category><![CDATA[Lebanon]]></category>
		<category><![CDATA[Qatar]]></category>
		<category><![CDATA[Spring revolution]]></category>
		<category><![CDATA[survey]]></category>
		<category><![CDATA[UNESCO]]></category>
		<category><![CDATA[دار الكتب]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1275</guid>
		<description><![CDATA[Baghdad, 2003 -  when Domenico Chirico, Director of Un Ponte Per&#8230; first asked various organisations for support and resources for the reconstruction of the Iraq National Library and Archives (INLA) destroyed during the Iraq invasion and occupation, he was met with cries of bemusement and disbelief: &#8216;Why worry about books and archives when we have [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1289" class="wp-caption alignright" style="width: 286px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/Jordanian_newspapers1.jpg"><img class="size-medium wp-image-1289 " title="Jordanian_newspapers" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/Jordanian_newspapers1-276x300.jpg" alt="" width="276" height="300" /></a><p class="wp-caption-text">Newspapers in National Library of Jordan</p></div>
<p style="text-align: left;"><strong>Baghdad, 2003</strong> -  when Domenico Chirico, Director of <a href="http://www.unponteper.it/bibliotecabaghdad/">Un Ponte Per&#8230;</a> first asked various organisations for support and resources for the reconstruction of the Iraq National Library and Archives (INLA) destroyed during the Iraq invasion and occupation, he was met with cries of bemusement and disbelief:</p>
<p>&#8216;Why worry about books and archives when we have lives to save?!&#8217;.</p>
<p>Some if not many  struggled to understand that a library in Baghdad could be a priority during such horrific times.  But they had not yet met <a href="http://www.guardian.co.uk/world/2008/jun/09/iraq.iraqandthearts">Dr Saad Eskander</a>, Director of the INLA&#8230;but that is another <a href="http://dablog.ulcc.ac.uk/2010/06/01/iraqi-visit-to-ulcc/">story</a> which has been told.</p>
<p><strong>Cairo,  2011</strong> and the &#8216;Spring revolution&#8217; is happening in the Middle East.  In Egypt almost as soon as unrest broke out, two public libraries in Cairo were  burnt to the ground however cheeringly in Alexandria, <a href="http://www.guardian.co.uk/books/2011/feb/01/alexandria-youth-protecting-library">news </a>came of staff and citizens forming a human chain around the Biblioteca Alexandrina (much supported by Susan Mubarack) and another around the National Mususem of Egypt to protect it, though it wasn&#8217;t so <a href="http://www.reuters.com/article/2011/01/29/us-egypt-museum-idUSTRE70R7K820110129">lucky</a>. These stories as told via Twitter cited what had happened in Iraq. Iraq, sadly is not alone in having suffered such destruction to its cultural property. These attacks,  fire, pillage, looting to order and just plain old theft  do more than just destroy a building or some documents. They are attacks on civil society, denying Iraqi people  access to engage in democracy, to access information as well as to their cultural memory.</p>
<p><strong>Amman, 2011</strong> &#8211; In light of all this, it was with great interest that I agreed to participate in the &#8216;The House of Books/ Dar El Kataub&#8217; workshop run by the aforementioned Domenico Chirico&#8217;s NGO UPP and UNESCO.   Its purpose was to work on challenges facing the digitisation of Arabic texts.  I was there to look at the preservation aspect of digitisation, a shock to some as it is often thought that digitisation is itself a preservation strategy! So there was work to be done&#8230;</p>
<div id="attachment_1290" class="wp-caption aligncenter" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/hard_at_work.jpg"><img class="size-medium wp-image-1290 " title="hard_at_work" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/hard_at_work-300x225.jpg" alt="" width="300" height="225" /></a><p class="wp-caption-text">William Kopycki and I leading a session</p></div>
<p>We had a wide range of people from Iraq, Lebanon, Qatar/Australia, Jordan, Egypt/USA, Italy and the United Kingdom.  From an digital archivist&#8217;s point of view (and in my view the right one),  the projects represented presented us with the gamut of activities which are now present in a digital library or archive, from the development of impressives copyright legisalation in Jordan to a start to finish overview of a project digitsing  journals relating to the 19th centroy arab cultural renaissance Al-Nahda. We also heard about training and devloping infrastructures for digital object management in Iraq, as well as an overview of the current project in the INLA to reconstitute many of its collections through digitisation. We had a good deal of course about preservation from myself and Giovanni Bergamin from the University of Florence and Maurizio Messina.</p>
<p>My work consisted partly of leading the group in a consolidation of ideas discussed during day one.  A very important goal of this workshop is collaboration and as a result we wanted the group to think individually and in groups of four about why they saw the need to collaborate and then to tell us how they would approach collaboration.  We ended up with great points which reinforced the absolute need for collaboration in terms of standardisation, best practise, resource discovery, lobbying and important networking opportunities across the arab speaking world.</p>
<p>I also was there to speak and work with the group a lot about digital preservation and digitisation. This is an area which can be neglected often as many considered digitisation to be simply about simply delivering access to materials.  In addition these digital objects are considered digital surrogates and little consideration is given to their preservation as the analogue copies are available. It is important to consider preservation of these digital surrogates over time at the point of their creation.  Do we really want to invest the time and money again in their re-digitisation? I very much doubt it! However unless consideration is given to their long term sustainability this is what will happen, data loss or re-digitisation. This is time and money few of course can afford to spend.</p>
<p>Touching of some of the issues we covered in terms of preservation we looked at planning digitisation projects in light of preservation and their sustainability and explored the main points to consider for preservation of digitised content, drawing a lot from our very popular <a href="'The House of Books/ Dar El Kataub'">report</a> on digitisation and preservation produced by DPC/ULCC and PORTICO. I reinforced the real need for honest sharing and for sharing failures as well as successes, we have all had them so we can only progress through acknowledgement of both.</p>
<p>Looking at the 1923 magazine on women&#8217;s issues called <a href="http://www.wdl.org/en/item/2857/?ql=eng&amp;s=layla&amp;view_type=gallery"> Layla,</a> we explored in a session what characteristics we would like to preserve over time.  Many things which we assume will be kept in the paper based world, have to planned for in detail  in the digital world, not much if anything can be left to chance.</p>
<div id="attachment_1291" class="wp-caption aligncenter" style="width: 310px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/National_library_Jordan1.jpg"><img class="size-medium wp-image-1291" title="National_library_Jordan" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/National_library_Jordan1-300x225.jpg" alt="" width="300" height="225" /></a><p class="wp-caption-text">Workshop over!</p></div>
<p>All speakers were extremely interesting individually and collectively as they gave an outsider like me the overview of the digital library/archives world in this part of the Middle East.  Qaiss Hatef  Saeed of the INLA spoke about the motivation of the INLA to establish a digital library  due to the loss of their holdings in the library during and post conflict.  Much of the library is being rebuilt literally from ashes by means of digital content,  and popular holdings need to be digitised to give as Quaiss said &#8216; books a rest&#8217; from handling.</p>
<p>Qaiss also proposed that in 100 years hard copy books will no longer exist and everything will be digital. Fighting talk! But  digital libraries however will only survive as long as we invest in their sustainability.  Digital resources have great power in terms of access but they are also very vulneable in terms of long term sustainabilty.  As such action needs to be taken an now to stop bit rot and technology obsolesence being the next threat to access to information which is now increasingly digital.</p>
<p>*Note: The House of Books workshop was picked up by 16  newspapers in Jordan and Iraq. These include  articles in the<a href="http://www.baghdad-news.com/home.asp?mode=more&amp;NewsID=32281&amp;catID=14"> Baghdad press,</a> <a href="http://jordantimes.com/index.php?news=36718">The Jordan Times</a> and even the Iraq National Congress.</p>
<div>
<dl id="attachment_1284">
<dt><a href="../wp-content/uploads/2011/04/National_library_Jordan.jpg"><br />
</a> </dt>
<dd><strong><br />
</strong></dd>
</dl>
</div>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/05/03/the-house-of-booksdar-el-kataub-part-1/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Statistically relevant</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/27/statistically-relevant/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/27/statistically-relevant/#comments</comments>
		<pubDate>Wed, 27 Apr 2011 11:47:28 +0000</pubDate>
		<dc:creator>Rory McNicholl</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Repositories]]></category>
		<category><![CDATA[Ars Technica]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[eprints]]></category>
		<category><![CDATA[IRStats]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[SHERPA-LEAP]]></category>

		<guid isPermaLink="false">http://www.sherpa-leap.ac.uk/?p=277</guid>
		<description><![CDATA[Over the last year or so we&#8217;ve installed and configured (in some cases reconfigured) the IRStats package for several of the LEAP repositories, including those hosted by ULCC. It seemed a good moment to share a few thoughts about the process of getting &#8220;all statted up&#8221; with EPrints. By default, and without any further action, [...]]]></description>
			<content:encoded><![CDATA[<i>From the <a href="http://www.sherpa-leap.ac.uk/">SHERPA-LEAP</a> blog.</i>
<p>Over the last year or so we&#8217;ve installed and configured (in some cases reconfigured) the IRStats package for several of the LEAP repositories, including those hosted by ULCC. It seemed a good moment to share a few thoughts about the process of getting &#8220;all statted up&#8221; with EPrints.</p>
<p>By default, and without any further action, IRStats provides a kind of smorgasbord control panel, demonstrating the many optional graphs, charts and list available. You can see <a href="http://pubs.ulcc.ac.uk/cgi/irstats.cgi">an example</a> on our own ULCC Publications repository.</p>
<p>More recently we&#8217;ve seen growing demand among repository managers to share data on downloads with both their depositors and  users at large. It&#8217;s really important for repository managers to select carefully which statistics views they actually want or need to display &#8211; we can only suggest things we think might work. Once you&#8217;ve decided on the views you want, we can look at the most effective ways to display them: and this is why I&#8217;ve been having fun souping up some of the displays already offered by IRstats.</p>
<p>The first display we&#8217;ve been working on is the Statistics digest. These are common enough and we&#8217;ve used the example of <a href="http://discovery.ucl.ac.uk/past-statistics.html">UCL Discovery</a> repository as the basis of work for both SAS-Space and SOAS institutional repository.</p>
<p>The second approach has been to re-style the IRstats &#8220;dashboard&#8221; view to lay the graphs on top of each other and then use some Javascript to handle the tabbed navigation. This seemed a more elegant approach than inserting lots of charts in the abstract page itself (as, for example, at <a href="http://eprints.ecs.soton.ac.uk/18493/">ECS EPrints</a>). I&#8217;ve used this display technique to display statistics for individual eprints for the School of Pharmacy, as well as SAS and SOAS.</p>
<p><a href="http://www.sherpa-leap.ac.uk/wp-content/uploads/2011/04/irstats-pharmacy-1.png"><img class="aligncenter size-medium wp-image-278" src="http://www.sherpa-leap.ac.uk/wp-content/uploads/2011/04/irstats-pharmacy-1-300x212.png" alt="IRStats on School of Pharmacy EPrints" width="300" height="212" /></a><br />
The tabbed display of graphs and tables was also combined with a &#8216;modal box&#8217; display that keeps the height of page the same (for example on <a href="http://eprints.soas.ac.uk/3316/">this Abstract page</a> at SOAS. At the bottom of the Abstract page I&#8217;ve added a statistics section showing the number full-text downloads, and a link that displays detailed stats in an overlaid box.</p>
<p>This method doesn&#8217;t just work for individual items, but can be used on other datasets in too. For example, on SAS-Space we have added it to the bottom of their <a href="http://sas-space.sas.ac.uk/view/collections/ialsac.html">Collection browse pages</a>, so that at the bottom of each Collection view there is an opportunity to view download statistics for that collection as a whole.</p>
<p>Additionally in SAS-Space, since it is a repository for a number of discrete institutes, there was a requirement for institutional editors to have access to their own institute&#8217;s statistics. To achieve this, I allowed access to a constrained version of the IRStats control panel for editor-users who had the appropriate editorial permissions for the institute in question. (Unless you are a SAS-Space editor, you won&#8217;t be able to access this.)</p>
<p>Which statistics views to insert as tabs is the decision of the repository manager. Views we&#8217;ve used include:</p>
<ul>
<li>Monthly downloads</li>
<li>Daily downloads</li>
<li>Unique visitors</li>
<li>Referrers</li>
<li>Search Engines</li>
<li>Top 10 items downloaded (only for a Collection, Repository or Division)</li>
<li>Top 10 search terms</li>
</ul>
<p>From a technical point-of-view, we will have to review these configurations when we upgrade to EPrints version 3.3, possibly later in the year (if it&#8217;s released!!), in conjunction with our VM infrastructure migration, and start doing things with EPStats rather than IRStats. But we now have an effective framework for adding statistics quickly to any EPrints installation.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/27/statistically-relevant/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thoughts about blog data and metadata</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/25/thoughts-about-blog-data-and-metadata/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/25/thoughts-about-blog-data-and-metadata/#comments</comments>
		<pubDate>Mon, 25 Apr 2011 20:39:18 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[Atom]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data model]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[newsfeeds]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogforever.eu/?p=487</guid>
		<description><![CDATA[During the ArchivePress project at ULCC, we briefly considered the data and metadata generally made available with blogs and blog posts. As ArchivePress focused on the representations of blogs in newsfeeds, we examined the metadata that is generated in common, and exposed in the newsfeeds of three of the most common blog platforms, WordPress, Blogger [...]]]></description>
			<content:encoded><![CDATA[<i>From the <a href="http://blogforever.eu/">BlogForever</a> blog.</i>
<p>During the <a href="http://jiscpowr.jiscinvolve.org/wp/2009/06/24/archivepress-when-one-size-doesnt-fit-all/">ArchivePress</a> project at ULCC, we briefly considered the data and metadata generally made available with blogs and blog posts. As ArchivePress focused on the representations of blogs in newsfeeds, we examined the metadata that is generated in common, and exposed in the newsfeeds of three of the most common blog platforms, WordPress, Blogger and TypePad. Blogger and Typepad prefer the Atom newsfeed format; WordPress (particularly WordPress.com) prefers RSS (though it can be made to publish Atom feeds too). This analysis was done, about a year ago, things may have changed, but here is a summary of what we found.</p>
<p>For each <strong>Blog</strong>, the following core information is available in the feeds:</p>
<table style="border-collapse: collapse" border="1" cellspacing="1" cellpadding="2" width="100%">
<tbody>
<tr>
<th></th>
<th><strong>WordPress (RSS)</strong></th>
<th><strong>Blogger (Atom)</strong></th>
<th><strong>Typepad (Atom)</strong></th>
</tr>
<tr>
<th>Feed Unique ID</th>
<td style="vertical-align: top">NA </td>
<td style="vertical-align: top">feed/id </td>
<td style="vertical-align: top">feed/id</td>
</tr>
<tr>
<th>Blog URL</th>
<td style="vertical-align: top">rss/channel/link </td>
<td style="vertical-align: top">feed/link@rel=&#8221;alternate&#8221; </td>
<td style="vertical-align: top">feed/link@rel=&#8221;alternate&#8221;</td>
</tr>
<tr>
<th>Blog Title</th>
<td style="vertical-align: top">rss/channel/title </td>
<td style="vertical-align: top">feed/title </td>
<td style="vertical-align: top">feed/title</td>
</tr>
<tr>
<th>Blog Description</th>
<td style="vertical-align: top">rss/channel/description </td>
<td style="vertical-align: top">feed/subtitle </td>
<td style="vertical-align: top">feed/subtitle</td>
</tr>
<tr>
<th>Date of last update</th>
<td style="vertical-align: top">rss/channel/lastBuildDate </td>
<td style="vertical-align: top">feed/updated </td>
<td style="vertical-align: top">feed/updated</td>
</tr>
<tr>
<th>Generating software</th>
<td style="vertical-align: top">rss/channel/generator </td>
<td style="vertical-align: top">feed/generator </td>
<td style="vertical-align: top">feed/generator</td>
</tr>
</tbody>
</table>
<p></p>
<p>For each <strong>Post</strong>, we established that the following core information is available in the newsfeeds:</p>
<table border="1" cellspacing="1" cellpadding="2" width="100%" style="border-collapse: collapse">
<tbody>
<tr>
<th></th>
<th>WordPress (RSS)</th>
<th>Blogger (Atom)</th>
<th>Typepad (Atom)</th>
</tr>
<tr>
<th>Post Unique ID</th>
<td style="vertical-align: top">rss/channel/item/guid@isPermaLink </td>
<td style="vertical-align: top">feed/entry/id </td>
<td style="vertical-align: top">feed/entry/id</td>
</tr>
<tr>
<th>Post Title</th>
<td style="vertical-align: top">rss/channel/item/title </td>
<td style="vertical-align: top">feed/entry/title </td>
<td style="vertical-align: top">feed/entry/title</td>
</tr>
<tr>
<th>Post Summary</th>
<td style="vertical-align: top">rss/channel/item/description </td>
<td style="vertical-align: top">NA </td>
<td style="vertical-align: top">feed/entry/summary</td>
</tr>
<tr>
<th>Post URL</th>
<td style="vertical-align: top">rss/channel/item/link </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;alternate&#8221; </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;alternate&#8221;</td>
</tr>
<tr>
<th>Date of publication</th>
<td style="vertical-align: top">rss/channel/item/pubDate </td>
<td style="vertical-align: top">feed/entry/published </td>
<td style="vertical-align: top">feed/entry/published </td>
</tr>
<tr>
<th>Date of last update</th>
<td style="vertical-align: top">NA </td>
<td style="vertical-align: top">feed/entry/updated </td>
<td style="vertical-align: top">feed/entry/updated</td>
</tr>
<tr>
<th>Post Author</th>
<td style="vertical-align: top">rss/channel/item/dc:creator<br />
rss/xmlns:dc
</td>
<td style="vertical-align: top">feed/entry/author/name</td>
<td style="vertical-align: top">feed/entry/author/name</td>
</tr>
<tr>
<th>Post Category</th>
<td style="vertical-align: top">rss/channel/item/category </td>
<td style="vertical-align: top">feed/entry/category@term </td>
<td style="vertical-align: top">feed/entry/category@term</td>
</tr>
<tr>
<th>Post Content</th>
<td style="vertical-align: top">rss/channel/item/content:encoded<br />
rss/xmlns:content
</td>
<td style="vertical-align: top">feed/entry/content
</td>
<td style="vertical-align: top">feed/entry/content
</td>
</tr>
<tr>
<th>Post Comments</th>
<td style="vertical-align: top">rss/channel/item/comments </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;replies&#8221; </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;replies&#8221;</td>
</tr>
<tr>
<th>Post Comments Feed</th>
<td style="vertical-align: top">rss/channel/item/wfw:commentRss </td>
<td style="vertical-align: top">NA
</td>
<td style="vertical-align: top">NA
</td>
</tr>
</tbody>
</table>
<p></p>
<p>One interesting point we noted was that neither Blogger nor Typepad published a link to a Comments Feed for each post. This made our work on ArchivePress more difficult since it was predicated on being able to easily identify the Comments feed for each post, and harvest new Comments as they were published. Obviously for blogs generated other than by WordPress, this was not going to be so easy. (Our ace developer Emanuele found some workarounds, but that&#8217;s another story.)</p>
<p>I think this offers us an interesting overview of the core of standard, structured blog data and metadata, in three of the leading blog platforms. This is the data structure and metadata profile that is maintained in blog databases, in one of its native forms, and I&#8217;d expect it to be present in all blog platforms, since it arguably represents the essence of blogs. I hope this will be useful background when considering the core models for data and metadata handling that will be developed for BlogForever.</p>
<div style="float: right; margin-left: 10px;"><a href="http://twitter.com/share?url=http://blogforever.eu/blog/2011/04/25/thoughts-about-blog-data-and-metadata/&via=blogforever&text=Thoughts%20about%20blog%20data%20and%20metadata&related=:&lang=en&count=vertical" class="twitter-share-button">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/25/thoughts-about-blog-data-and-metadata/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nominate blogs for our survey</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/20/nominate-blogs-for-our-survey/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/20/nominate-blogs-for-our-survey/#comments</comments>
		<pubDate>Wed, 20 Apr 2011 07:59:46 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogforever.eu/?p=443</guid>
		<description><![CDATA[Is there a particular blog or blogger you would like to see included in the BlogForever survey? We invite you to use this form to nominate them, and we will try to ensure that the blog is reviewed or the blogger contacted to participate in our survey. Tweet]]></description>
			<content:encoded><![CDATA[<i>From the <a href="http://blogforever.eu/">BlogForever</a> blog.</i>
<p>Is there a particular blog or blogger you would like to see included in the BlogForever survey? We invite you to use this form to nominate them, and we will try to ensure that the blog is reviewed or the blogger contacted to participate in our survey.</p>
<div class="pageview">
	
  <iframe src="https://spreadsheets.google.com/embeddedform?formkey=dDlxOEgwRUpKVU9KZHFxSGo3MGxIb0E6MQ" frameborder="0" style="" scrolling="yes" height="700px" width="100%">Get a better browser!</iframe>
</div>

<div style="float: right; margin-left: 10px;"><a href="http://twitter.com/share?url=http://blogforever.eu/blog/2011/04/20/nominate-blogs-for-our-survey/&via=blogforever&text=Nominate%20blogs%20for%20our%20survey&related=:&lang=en&count=vertical" class="twitter-share-button">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/20/nominate-blogs-for-our-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>To have and to hold</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/18/to-have-and-to-hold/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/18/to-have-and-to-hold/#comments</comments>
		<pubDate>Mon, 18 Apr 2011 11:00:09 +0000</pubDate>
		<dc:creator>Patricia Sleeman</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Digital Archives]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1261</guid>
		<description><![CDATA[We gave a 2 day in house workshop to the National Archives of Scotland last week.  As Ed Pinsent has noted in his post about legal admissability, our timing was quite interesting; we arrived on the Monday the week after NAS had merged with the General Register Office, to become  the National Records of Scotland.  [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1264" class="wp-caption aligncenter" style="width: 458px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/nas.jpeg1.jpg"><img class="size-full wp-image-1264 " title="nas.jpeg" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/nas.jpeg1.jpg" alt="" width="448" height="205" /></a><p class="wp-caption-text">Susan Corrigall(NRS), Patricia Sleeman &amp; Ed Pinsent (ULCC).</p></div>
<p style="text-align: center;"><a href="../wp-content/uploads/2011/04/nas.jpeg.jpg"></a></p>
<p>We gave a 2 day in house workshop to the National Archives of Scotland last week.  As Ed Pinsent has noted in his <a href="http://dablog.ulcc.ac.uk/2011/04/07/dptp-nas/">post </a>about legal admissability, our timing was quite interesting; we arrived on the Monday the week after NAS had merged with the General Register Office, to become  the <a href="http://www.nas.gov.uk/about/110331.asp" target="_blank">National Records of Scotland</a>.  Interesting times for all and while we couldn&#8217;t have known in 2010 what a pivotal day it would be for NAS, we hoped  our training session on digital preservation would be in some way timely.</p>
<p>The 20 staff attending were very varied in their reponsibility and roles in the NAS/NRS, however wisely it was felt that all needed to have a entry level awareness of the issues relating to digital preservation.   NAS has been engaged for some time in the management of its digital resources and it is looking at cost effective ways of doing so.  Staff are being skilled up in relation to digital preservation, in this way NAS are &#8216;maximising their current resource potential&#8217;, to use a buzz word.</p>
<p>Working with a group from one organisation in situ is quite different for us as normally the <a href="http://www.dptp.org/">DPTP</a> is an open course which sees a variety of delegate hailing from many very varied organsiations. As a result we cover many topics to give a good grounding in what we feel are the most important aspects fo digital preservation, while trying to accomodate varying levels of expertise and knowledge.  This type of course is sucessful in many ways &#8211; it can often lift spirits, particularly for peope working in a solitary environment to see how much in common they have in terms of digital preservation and its challenges. The social aspect of the course is important as people recognise that digital preservation affects all communities involved in any kind of management of digital content in the long term.  A lot of what we aspire to do is build confidence in people to tackle the issue.</p>
<p>Working in an organisation who has commissioned the DPTP is quite a different experience. We see the coming together of a group already known to each other to a greater or lesser degree. However very often they come from different sections and this &#8216;time out&#8217; together should be of value in many ways:  as a collective learning opportunity, socialisation and in a time of change an opportunity for supporting each other.</p>
<p>We also get the chance during a class project to investigate a particular issue of note for the organisation.  This hopefully enables the group to leave with a finished product.  In the &#8216;open to all&#8217; DPTP we want our group to leave with a better understanding of everything but more precisely to have gone some way regarding the implmentation of theory to real life work situatsions through our classwork.  To leave the course with soemthign immediately implementable within the workplace we feel is important. Our class session at NAS/NRS remains confidential but we thoroughly enjoyed it and felt that it translated a lot of what was hitherto theory into a an enhancement of an existing practise to enable improved digital object management.</p>
<p>Interesting times for all and I feel perhaps it was timely that NAS are skilling up their staff to embark on digital preservation now.  Digital preservation is a concern for all government departments but who can manage it? Archivists as a profession have a good set of tools and concepts from their profession which translate well to digital preservation.  In fact OAIS tells many archivists nothing new &#8211; they are doing most of it already in the analogue world.</p>
<p>Thus archivists hold one major key on our large key set which we need to unlock digital preservation.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/18/to-have-and-to-hold/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Asynchronicities in blog structure</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/#comments</comments>
		<pubDate>Mon, 11 Apr 2011 15:56:04 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[data model]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[time]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogforever.eu/?p=430</guid>
		<description><![CDATA[At an atomic level, a “blog” comprises “blog posts”, which are continually added to the blog corpus: that is the dynamic essence of a blog, and distinguishes it from old-fashioned, largely static Websites and hypertexts in which little content changed between major update iterations, which process was probably more akin to “publishing a new edition” [...]]]></description>
			<content:encoded><![CDATA[<i>From the <a href="http://blogforever.eu/">BlogForever</a> blog.</i>
<p>At an atomic level, a “blog” comprises “blog posts”, which are continually added to the blog corpus: that is the dynamic essence of  a blog, and distinguishes it from old-fashioned, largely static Websites and hypertexts in which little content changed between major update iterations, which process was probably more akin to “publishing a new edition” in the world of non-digital publications.</p>
<p>The blog also displays, as part of its frame, other graphical and functional elements (sidebars, widgets, “blogrolls”, etc) which may themselves contain dynamically updated, constantly changing information. These can be added, removed, amended and rearranged at will by the blog author/editor. Blog posts that were “published” in the context of one set of framing elements, will persist through subsequent versions of that framework.</p>
<p>Similarly with design (layout, colours, mastheads, etc), though the persistence tends to be longer, the informal nature of blogs means that these may be easily changed by the blog editor/author, and are thus more volatile than a typical “corporate” website. Again, blog posts may persist, unchanged in themselves, through many iterations of the blog site design and layout.</p>
<div id="attachment_431" class="wp-caption aligncenter" style="width: 310px"><a href="http://blogforever.eu/wp-content/uploads/2011/04/blogatoms.png"><img class="size-medium wp-image-431" src="http://blogforever.eu/wp-content/uploads/2011/04/blogatoms-300x223.png" alt="blogatoms 300x223 Asynchronicities in blog structure" width="300" height="223" title="Asynchronicities in blog structure" /></a><p class="wp-caption-text">A simple view of blog elements and their temporal relationship</p></div>
<p>&nbsp;</p>
<p>This very simplified visualisations suggests where we might start conceptualising key elements of a blog. It indicates that they iterate over time, but in the cases of Design, Posts and Widgets (as we’ll call them for brevity), according to independent schedules. While Posts and Comments persist in the online view of a blog, designs and widget arrangements are overwritten.</p>
<p>With my earlier ArchivePress project we deliberately overlooked preservation of the blog&#8217;s framing elements, and (given the much smaller scope of that project) established an <a href="http://archivepress.ulcc.ac.uk/2009/07/31/archival-musings/">acceptable rationale</a> for doing so. The challenge for BlogForever is to find a solution to  precisely these issues. Unless we were simply to adopt the snapshot approach of Heritrix-based web archiving initiatives (e.g. Wayback/archive.org, UK Web Archive), we need to ensure the BlogForever repository supports a degree of granularity that can capture, describe and preserve atomic blog objects in a way that reflects the particular interdependencies, in order to understand and preserve them authentically, and permit the many possible authentic and valid “time slice” views and analyses that users of the archive will need.</p>
<p>(I appreciate, by the way that these objects themselves are compound objects, so not strictly &#8220;atomic&#8221;: but the same is also true of atoms, as our CERN colleagues can attest!)</p>
<div style="float: right; margin-left: 10px;"><a href="http://twitter.com/share?url=http://blogforever.eu/blog/2011/04/11/asynchronicities-in-blog-structure/&via=blogforever&text=Asynchronicities%20in%20blog%20structure&related=:&lang=en&count=vertical" class="twitter-share-button">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DPTP at the NAS &#8211; legal admissibility</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/07/dptp-nas/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/07/dptp-nas/#comments</comments>
		<pubDate>Thu, 07 Apr 2011 13:35:12 +0000</pubDate>
		<dc:creator>Ed Pinsent</dc:creator>
				<category><![CDATA[All]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[digital archives]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[DPTP]]></category>
		<category><![CDATA[legal admissibility]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1247</guid>
		<description><![CDATA[We recently gave a two-day version of the Digital Preservation Training Programme to the National Archives of Scotland. Our timing was quite interesting; we arrived on the Monday the week after NAS had merged with the General Register Office, to become a new body called the National Records of Scotland. And just days before, the [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-medium wp-image-1257" style="margin: 5px;" title="1087_18_1---National-Archives-of-Scotland--Edinburgh_web" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/04/1087_18_1-National-Archives-of-Scotland-Edinburgh_web-300x200.jpg" alt="" width="300" height="200" /><br />
We recently gave a two-day version of the <a href="http://www.dptp.org">Digital Preservation Training Programme</a> to the National Archives of Scotland. Our timing was quite interesting; we arrived on the Monday the week after NAS had merged with the General Register Office, to become a new body called the <a href="http://www.nas.gov.uk/about/110331.asp" target="_blank">National Records of Scotland</a>. And just days before, the Public Records (Scotland) Bill was amended to strengthen its powers for the preservation of digital records. I was personally very encouraged to <a href="http://www.guardian.co.uk/government-computing-network/2011/mar/17/public-records-scotland-bill-passed" target="_blank">read what the Minister for Culture had to say about the latter</a>, as it confirms what I&#8217;ve always believed about digital preservation; it has a lot of common ground with traditional archives and records management.</p>
<p>Our brief was to introduce digital preservation topics to traditional archivists. Among other things I was asked to deliver a module on <a href="http://www.thecabinetoffice.co.uk/page28.html" target="_blank">BIP0008 and the legal admissibility of electronic records</a>. In preparing this I discovered that the standard is very comprehensive, requiring written policies for classes of records that are in scope, a stated &#8220;duty of care&#8221; and written procedures for what staff should be doing, an extremely meticulous and well-documented methodology for the production of scanned versions of records, plus a reliable IT framework in which this can work. And of course, an audit trail marked with date and timestamps generated at every possible link in the chain of custody. That&#8217;s a lot of boxes to tick, but the payoff is a scanned document (or born-digital record) which is regarded in the eyes of the law as an authentic unaltered copy, hence legally admissible.</p>
<p>My personal take on the standard it goes a long way to satisfying the requirements of auditors who seem to take legal admissibility to rather extreme lengths; to my mind, any good EDRMS or records management system ought to be providing enough audit trails to keep them happy. Nonetheless the archivists at Scotland seem to have a requirement not only to observe this standard, but also to ensure they continue the chain of custody from current records management into archival storage. In other words, legally admissible records must also be legally admissible archives; and any preservation actions performed on these objects while in digital custody must not compromise that authenticity.</p>
<p>This may also have been reflected in their IT manager&#8217;s interest in use of the checksum and file format validation tools which can be used in the repository; he seemed to be wondering if such tools could verify authenticity, perhaps by indicating whether a file had been tampered with at any stage between leaving the EDRM system and entering the archival repository. It&#8217;s an interesting line of thought, and an approach that would probably involve a heightened degree of audit trailing for any organisation that wanted to work this way. My off-the-cuff contribution to this discussion on the day involved something about providing evidence that &#8220;best effort&#8221; had been made, but I suppose the real proof would be in a court of law and a legal precedent.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/07/dptp-nas/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AIM25 Open Metadata Project update</title>
		<link>http://dablog.ulcc.ac.uk/2011/03/25/aim25-open-metadata-project-latest/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/03/25/aim25-open-metadata-project-latest/#comments</comments>
		<pubDate>Fri, 25 Mar 2011 09:00:22 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Digital Archives]]></category>
		<category><![CDATA[AIM25]]></category>
		<category><![CDATA[AIM25OMP]]></category>
		<category><![CDATA[JISC]]></category>
		<category><![CDATA[Open Metadata]]></category>
		<category><![CDATA[Semantic Web]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1238</guid>
		<description><![CDATA[From the Open Metadata Pathway Blog: Rory and I met with Richard Gartner and Gareth Knight at CeRch today, to catch up with their investigations into using GATE and OpenCalais to process the EAD outputs from AIM25. Results look very encouraging. OpenCalais, in particular, generates a post-processing set of identified entities (personal names, place names, [...]]]></description>
			<content:encoded><![CDATA[<p><em>From the <a href="http://openmetadatapathway.blogspot.com/2011/03/semantic-analysis-of-aim25-ead.html">Open Metadata Pathway Blog</a>:</em></p>
<p>Rory and I met with Richard Gartner and Gareth Knight at CeRch today, to catch up with their investigations into using <a href="http://gate.ac.uk/">GATE</a> and <a href="http://www.opencalais.com/">OpenCalais</a> to process the <a href="http://www.loc.gov/ead/">EAD</a> outputs from <a href="http://www.aim25.ac.uk/">AIM25</a>.</p>
<p>Results look very encouraging. OpenCalais, in particular, generates a post-processing set of identified entities (personal names, place names, corporate names) which Richard G has then created regular expressions to locate these in the body of the EAD and wrap in appropriate EAD tags (<span style="font-family: 'Courier New', Courier, monospace;">&lt;persname&gt;</span> etc).</p>
<p>This suggests that the way forward for enhancing the existing data entry processes for AIM25 will involve dispatching the EAD-compliant data entered by collections manager to OpenCalais, and returning the data, with enhanced markup, for checking by the submitter. This hook should be easy enough to insert for manual, form-based entry; for batch entry processes we will need to assess whether any significant delays are introduced.</p>
<p>We&#8217;ve also started to consider ideas for a URI scheme for the entities identified. Our current working hypothesis is that this will involve defining a &#8220;data&#8221; namespace for AIM25, binding to <span style="font-family: 'Courier New', Courier, monospace;">http://data.aim25.ac.uk</span>/. Within that we can develop a structure along the lines <span style="font-family: 'Courier New', Courier, monospace;">/person</span>, <span style="font-family: 'Courier New', Courier, monospace;">/place</span>, <span style="font-family: 'Courier New', Courier, monospace;">/corporate_body</span>, and append our unique IDs for each entity. Further research is necessary, particularly into the recommendations of the Cabinet Office recommendations for <a href="http://www.cabinetoffice.gov.uk/resource-library/designing-uri-sets-uk-public-sector">Designing URI Sets for the UK Public Sector</a>.</p>
<p>These URIs can then be used in identifier attributes for our EAD elements (<span style="font-family: 'Courier New', Courier, monospace;">&lt;persname&gt;</span>, etc.), and thence easily transformed into an RDFa format for the Web-based HTML rendering of the AIM25 catalogues.</p>
<p>Next steps include further investigating how to implement and assert relationships between our entities and other open datasets (e.g. <span style="font-family: 'Courier New', Courier, monospace;">our_entity</span> <em><span style="font-family: 'Courier New', Courier, monospace;">is_the_same_as</span></em> <span style="font-family: 'Courier New', Courier, monospace;">your_entity</span>). And how to make the authority data, duly marked-up, available as open metadata.</p>
<p>Rory and I can now start to consider suitable approaches to embedding this in our development copy of the existing AIM25 system, and we&#8217;ll continue to liaise closely with CeRch for advice on  the relative merits of Gate and OpenCalais processing, and guidance on URI implementation.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/03/25/aim25-open-metadata-project-latest/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

