<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ulcc da blog &#187; European Commission</title>
	<atom:link href="http://dablog.ulcc.ac.uk/tag/european-commission/feed/" rel="self" type="application/rss+xml" />
	<link>http://dablog.ulcc.ac.uk</link>
	<description>blogging about digital archives &#38; repositories since 2007</description>
	<lastBuildDate>Tue, 14 May 2013 10:42:32 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.2</generator>
		<item>
		<title>DA visits CERN</title>
		<link>http://dablog.ulcc.ac.uk/2011/06/22/da-visits-cern/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/06/22/da-visits-cern/#comments</comments>
		<pubDate>Wed, 22 Jun 2011 15:41:57 +0000</pubDate>
		<dc:creator>Silvia Arango-Docio</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[CERN]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[FP7]]></category>
		<category><![CDATA[Organisations]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://dablog.ulcc.ac.uk/?p=1540</guid>
		<description><![CDATA[Last week I was in a very welcoming Geneva, exactly in the European Organization for Nuclear Research (CERN) to meet other partners working on BlogForever and to have several Invenio workshops. I felt very lucky to be in the hub of such an organization and to see how many young international students are getting the [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1556" class="wp-caption alignleft" style="width: 202px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computercentre_cern-300x225.jpg"><img class="size-full wp-image-1556 " title="CERN Computer Centre" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computercentre_cern-300x225.jpg" alt="CERN Computer Centre" width="192" height="144" /></a><p class="wp-caption-text">CERN Computer Centre</p></div>
<div id="attachment_1557" class="wp-caption alignright" style="width: 164px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computer_raks_cern-300x300.jpg"><img class="size-full wp-image-1557 " title="CERN Computer Racks" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/computer_raks_cern-300x300.jpg" alt="" width="154" height="154" /></a><p class="wp-caption-text">CERN Computer Racks</p></div>
<p>Last week I was in a very welcoming Geneva, exactly in the <a href="http://public.web.cern.ch/public/en/About/About-en.html">European Organization for Nuclear Research</a> (CERN) to meet other partners working on <a href="http://blogforever.eu/">BlogForever</a> and to have several <a href="http://invenio-software.org/">Invenio</a> workshops. I felt very lucky to be in the hub of such an organization and to see how many young international students are getting the opportunity to be in the forefront of high physics research.</p>
<div id="attachment_1558" class="wp-caption alignleft" style="width: 164px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/Outside_Globe_Cern.jpg"><img class="size-medium wp-image-1558 " title="The Globe at CERN" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/Outside_Globe_Cern-300x300.jpg" alt="The Globe at CERN" width="154" height="154" /></a><p class="wp-caption-text">The Globe at CERN</p></div>
<p>CERN is home of the world&#8217;s biggest and most powerful particle accelerator, the <a href="http://public.web.cern.ch/public/en/lhc/lhc-en.html">Large Hadron Collider</a> (LHC). This machine is installed in a 27 km circumference tunnel. The LHC records around 15 petabytes per year. All the data is stored in their vast <a href="http://cdsweb.cern.ch/record/1103476/">computer centre</a>, where open access and sharing has been the driving principle since their foundation in 1954 and an inspirational environment for the Web to be born there.</p>
<p>Invenio&#8217;s workshops showed us that their electronic document management system is robust and versatile, targets the management of more than 1.2 million documents and it can be used in 19 different languages. Its content is clean and complete. In just their High Energy Physics domain, they have around 700 collections and approximately 20K queries a day. As well Invenio is used for special programs like the UNESCO funded digital repos in Africa and EU funded projects like <a href="http://www.d4science.eu/">D4Science</a> and <a href="http://www.openaire.eu/">OpenAIRE.</a></p>
<div id="attachment_1559" class="wp-caption alignright" style="width: 250px"><a href="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/me_at_CERN-300x225.jpg"><img class="size-full wp-image-1559 " title="Silvia at CERN" src="http://dablog.ulcc.ac.uk/wp-content/uploads/2011/06/me_at_CERN-300x225.jpg" alt="Silvia at CERN" width="240" height="180" /></a><p class="wp-caption-text">Me at CERN</p></div>
<p>In the case of BlogForever and Invenio, plenty of work to be done by the Invenio Team at the User and Document Service Group. At the moment, they have more than 30 readily available <a href="http://www.python.org/">Python</a> modules that can be adapted to the case of preserving huge amount of blogs. From the point of view of my work with repositories as part of the <a href="http://www.ulcc.ac.uk/content/repositories-he-and-research">Digital Archives and Repositories Team</a> at <a href="http://www.ulcc.ac.uk/">ULCC</a>, I was inspired by Invenio&#8217;s advance search engines; indexing and ranking methods.</p>
<p>In a more personal level, if you are ever crossing the border between France and Switzerland near Geneva, get Tram 18 and hop off at CERN to see their <a href="http://microcosm.web.cern.ch/microcosm/Welcome.html">Microcosm</a> and <a href="http://public.web.cern.ch/public/en/spotlight/SpotlightGlobe-en.html">Globe</a> exhibitions.</p>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/06/22/da-visits-cern/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>BlogForever: Thoughts about blog data and metadata</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/25/thoughts-about-blog-data-and-metadata/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/25/thoughts-about-blog-data-and-metadata/#comments</comments>
		<pubDate>Mon, 25 Apr 2011 20:39:18 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[Atom]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data model]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[newsfeeds]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogforever.eu/?p=487</guid>
		<description><![CDATA[During the ArchivePress project at ULCC, we briefly considered the data and metadata generally made available with blogs and blog posts. As ArchivePress focused on the representations of blogs in newsfeeds, we examined the metadata that is generated in common, and exposed in the newsfeeds of three of the most common blog platforms, WordPress, Blogger [...] ]]></description>
			<content:encoded><![CDATA[<p><i>From the <a href="http://blogforever.eu/">BlogForever</a> blog.</i></p>
<p>During the <a href="http://jiscpowr.jiscinvolve.org/wp/2009/06/24/archivepress-when-one-size-doesnt-fit-all/">ArchivePress</a> project at ULCC, we briefly considered the data and metadata generally made available with blogs and blog posts. As ArchivePress focused on the representations of blogs in newsfeeds, we examined the metadata that is generated in common, and exposed in the newsfeeds of three of the most common blog platforms, WordPress, Blogger and TypePad. Blogger and Typepad prefer the Atom newsfeed format; WordPress (particularly WordPress.com) prefers RSS (though it can be made to publish Atom feeds too). This analysis was done, about a year ago, things may have changed, but here is a summary of what we found.</p>
<p>For each <strong>Blog</strong>, the following core information is available in the feeds:</p>
<table style="border-collapse: collapse" border="1" cellspacing="1" cellpadding="2" width="100%">
<tbody>
<tr>
<th></th>
<th><strong>WordPress (RSS)</strong></th>
<th><strong>Blogger (Atom)</strong></th>
<th><strong>Typepad (Atom)</strong></th>
</tr>
<tr>
<th>Feed Unique ID</th>
<td style="vertical-align: top">NA </td>
<td style="vertical-align: top">feed/id </td>
<td style="vertical-align: top">feed/id</td>
</tr>
<tr>
<th>Blog URL</th>
<td style="vertical-align: top">rss/channel/link </td>
<td style="vertical-align: top">feed/link@rel=&#8221;alternate&#8221; </td>
<td style="vertical-align: top">feed/link@rel=&#8221;alternate&#8221;</td>
</tr>
<tr>
<th>Blog Title</th>
<td style="vertical-align: top">rss/channel/title </td>
<td style="vertical-align: top">feed/title </td>
<td style="vertical-align: top">feed/title</td>
</tr>
<tr>
<th>Blog Description</th>
<td style="vertical-align: top">rss/channel/description </td>
<td style="vertical-align: top">feed/subtitle </td>
<td style="vertical-align: top">feed/subtitle</td>
</tr>
<tr>
<th>Date of last update</th>
<td style="vertical-align: top">rss/channel/lastBuildDate </td>
<td style="vertical-align: top">feed/updated </td>
<td style="vertical-align: top">feed/updated</td>
</tr>
<tr>
<th>Generating software</th>
<td style="vertical-align: top">rss/channel/generator </td>
<td style="vertical-align: top">feed/generator </td>
<td style="vertical-align: top">feed/generator</td>
</tr>
</tbody>
</table>
<p>For each <strong>Post</strong>, we established that the following core information is available in the newsfeeds:</p>
<table border="1" cellspacing="1" cellpadding="2" width="100%" style="border-collapse: collapse">
<tbody>
<tr>
<th></th>
<th>WordPress (RSS)</th>
<th>Blogger (Atom)</th>
<th>Typepad (Atom)</th>
</tr>
<tr>
<th>Post Unique ID</th>
<td style="vertical-align: top">rss/channel/item/guid@isPermaLink </td>
<td style="vertical-align: top">feed/entry/id </td>
<td style="vertical-align: top">feed/entry/id</td>
</tr>
<tr>
<th>Post Title</th>
<td style="vertical-align: top">rss/channel/item/title </td>
<td style="vertical-align: top">feed/entry/title </td>
<td style="vertical-align: top">feed/entry/title</td>
</tr>
<tr>
<th>Post Summary</th>
<td style="vertical-align: top">rss/channel/item/description </td>
<td style="vertical-align: top">NA </td>
<td style="vertical-align: top">feed/entry/summary</td>
</tr>
<tr>
<th>Post URL</th>
<td style="vertical-align: top">rss/channel/item/link </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;alternate&#8221; </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;alternate&#8221;</td>
</tr>
<tr>
<th>Date of publication</th>
<td style="vertical-align: top">rss/channel/item/pubDate </td>
<td style="vertical-align: top">feed/entry/published </td>
<td style="vertical-align: top">feed/entry/published </td>
</tr>
<tr>
<th>Date of last update</th>
<td style="vertical-align: top">NA </td>
<td style="vertical-align: top">feed/entry/updated </td>
<td style="vertical-align: top">feed/entry/updated</td>
</tr>
<tr>
<th>Post Author</th>
<td style="vertical-align: top">rss/channel/item/dc:creator</p>
<p>rss/xmlns:dc</p>
</td>
<td style="vertical-align: top">feed/entry/author/name</td>
<td style="vertical-align: top">feed/entry/author/name</td>
</tr>
<tr>
<th>Post Category</th>
<td style="vertical-align: top">rss/channel/item/category </td>
<td style="vertical-align: top">feed/entry/category@term </td>
<td style="vertical-align: top">feed/entry/category@term</td>
</tr>
<tr>
<th>Post Content</th>
<td style="vertical-align: top">rss/channel/item/content:encoded</p>
<p>rss/xmlns:content</p>
</td>
<td style="vertical-align: top">feed/entry/content</p>
</td>
<td style="vertical-align: top">feed/entry/content</p>
</td>
</tr>
<tr>
<th>Post Comments</th>
<td style="vertical-align: top">rss/channel/item/comments </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;replies&#8221; </td>
<td style="vertical-align: top">feed/entry/link@rel=&#8221;replies&#8221;</td>
</tr>
<tr>
<th>Post Comments Feed</th>
<td style="vertical-align: top">rss/channel/item/wfw:commentRss </td>
<td style="vertical-align: top">NA</p>
</td>
<td style="vertical-align: top">NA</p>
</td>
</tr>
</tbody>
</table>
<p>One interesting point we noted was that neither Blogger nor Typepad published a link to a Comments Feed for each post. This made our work on ArchivePress more difficult since it was predicated on being able to easily identify the Comments feed for each post, and harvest new Comments as they were published. Obviously for blogs generated other than by WordPress, this was not going to be so easy. (Our ace developer Emanuele found some workarounds, but that&#8217;s another story.)</p>
<p>I think this offers us an interesting overview of the core of standard, structured blog data and metadata, in three of the leading blog platforms. This is the data structure and metadata profile that is maintained in blog databases, in one of its native forms, and I&#8217;d expect it to be present in all blog platforms, since it arguably represents the essence of blogs. I hope this will be useful background when considering the core models for data and metadata handling that will be developed for BlogForever.</p>
<div style="float: right; margin-left: 10px;"><a href="http://twitter.com/share?url=http://blogforever.eu/blog/2011/04/25/thoughts-about-blog-data-and-metadata/&#038;via=blogforever&#038;text=Thoughts%20about%20blog%20data%20and%20metadata&#038;related=:&#038;lang=en&#038;count=vertical" class="twitter-share-button">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/25/thoughts-about-blog-data-and-metadata/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>BlogForever: Nominate blogs for our survey</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/20/nominate-blogs-for-our-survey/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/20/nominate-blogs-for-our-survey/#comments</comments>
		<pubDate>Wed, 20 Apr 2011 07:59:46 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogforever.eu/?p=443</guid>
		<description><![CDATA[Is there a particular blog or blogger you would like to see included in the BlogForever survey? We invite you to use this form to nominate them, and we will try to ensure that the blog is reviewed or the blogger contacted to participate in our survey. Tweet]]></description>
			<content:encoded><![CDATA[<p><i>From the <a href="http://blogforever.eu/">BlogForever</a> blog.</i></p>
<p>Is there a particular blog or blogger you would like to see included in the BlogForever survey? We invite you to use this form to nominate them, and we will try to ensure that the blog is reviewed or the blogger contacted to participate in our survey.</p>
<div class="pageview">
<p><iframe src="https://spreadsheets.google.com/embeddedform?formkey=dDlxOEgwRUpKVU9KZHFxSGo3MGxIb0E6MQ" frameborder="0" style="" scrolling="yes" height="700px" width="100%">Get a better browser!</iframe></p>
</div>
<div style="float: right; margin-left: 10px;"><a href="http://twitter.com/share?url=http://blogforever.eu/blog/2011/04/20/nominate-blogs-for-our-survey/&#038;via=blogforever&#038;text=Nominate%20blogs%20for%20our%20survey&#038;related=:&#038;lang=en&#038;count=vertical" class="twitter-share-button">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/20/nominate-blogs-for-our-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>BlogForever: Asynchronicities in blog structure</title>
		<link>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/</link>
		<comments>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/#comments</comments>
		<pubDate>Mon, 11 Apr 2011 15:56:04 +0000</pubDate>
		<dc:creator>Richard M. Davis</dc:creator>
				<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[BlogForever]]></category>
		<category><![CDATA[blogs]]></category>
		<category><![CDATA[data model]]></category>
		<category><![CDATA[European Commission]]></category>
		<category><![CDATA[preservation]]></category>
		<category><![CDATA[time]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogforever.eu/?p=430</guid>
		<description><![CDATA[At an atomic level, a “blog” comprises “blog posts”, which are continually added to the blog corpus: that is the dynamic essence of a blog, and distinguishes it from old-fashioned, largely static Websites and hypertexts in which little content changed between major update iterations, which process was probably more akin to “publishing a new edition” [...] ]]></description>
			<content:encoded><![CDATA[<p><i>From the <a href="http://blogforever.eu/">BlogForever</a> blog.</i></p>
<p>At an atomic level, a “blog” comprises “blog posts”, which are continually added to the blog corpus: that is the dynamic essence of a blog, and distinguishes it from old-fashioned, largely static Websites and hypertexts in which little content changed between major update iterations, which process was probably more akin to “publishing a new edition” in the world of non-digital publications.</p>
<p>The blog also displays, as part of its frame, other graphical and functional elements (sidebars, widgets, “blogrolls”, etc) which may themselves contain dynamically updated, constantly changing information. These can be added, removed, amended and rearranged at will by the blog author/editor. Blog posts that were “published” in the context of one set of framing elements, will persist through subsequent versions of that framework.</p>
<p>Similarly with design (layout, colours, mastheads, etc), though the persistence tends to be longer, the informal nature of blogs means that these may be easily changed by the blog editor/author, and are thus more volatile than a typical “corporate” website. Again, blog posts may persist, unchanged in themselves, through many iterations of the blog site design and layout.</p>
<div id="attachment_431" class="wp-caption aligncenter" style="width: 310px"><a href="http://blogforever.eu/wp-content/uploads/2011/04/blogatoms.png"><img class="size-medium wp-image-431" src="http://blogforever.eu/wp-content/uploads/2011/04/blogatoms-300x223.png" alt="blogatoms 300x223 Asynchronicities in blog structure" width="300" height="223" title="Asynchronicities in blog structure" /></a>
<p class="wp-caption-text">A simple view of blog elements and their temporal relationship</p>
</div>
<p>&nbsp;</p>
<p>This very simplified visualisations suggests where we might start conceptualising key elements of a blog. It indicates that they iterate over time, but in the cases of Design, Posts and Widgets (as we’ll call them for brevity), according to independent schedules. While Posts and Comments persist in the online view of a blog, designs and widget arrangements are overwritten.</p>
<p>With my earlier ArchivePress project we deliberately overlooked preservation of the blog&#8217;s framing elements, and (given the much smaller scope of that project) established an <a href="http://archivepress.ulcc.ac.uk/2009/07/31/archival-musings/">acceptable rationale</a> for doing so. The challenge for BlogForever is to find a solution to  precisely these issues. Unless we were simply to adopt the snapshot approach of Heritrix-based web archiving initiatives (e.g. Wayback/archive.org, UK Web Archive), we need to ensure the BlogForever repository supports a degree of granularity that can capture, describe and preserve atomic blog objects in a way that reflects the particular interdependencies, in order to understand and preserve them authentically, and permit the many possible authentic and valid “time slice” views and analyses that users of the archive will need.</p>
<p>(I appreciate, by the way that these objects themselves are compound objects, so not strictly &#8220;atomic&#8221;: but the same is also true of atoms, as our CERN colleagues can attest!)</p>
<div style="float: right; margin-left: 10px;"><a href="http://twitter.com/share?url=http://blogforever.eu/blog/2011/04/11/asynchronicities-in-blog-structure/&#038;via=blogforever&#038;text=Asynchronicities%20in%20blog%20structure&#038;related=:&#038;lang=en&#038;count=vertical" class="twitter-share-button">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>
]]></content:encoded>
			<wfw:commentRss>http://dablog.ulcc.ac.uk/2011/04/11/asynchronicities-in-blog-structure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
