Transcribing Bentham

March 1st, 2010 Richard M. Davis Posted in Linnean Online, Museums 7 Comments »

Jeremy Bentham, Bloomsbury WC1 by Ewan-M on Flickr (CC:BY)Did I mention that we are very excited to be contributing to UCL’s Bentham Transcription Initiative. This is an AHRC-funded project to complete the digitisation of the manuscripts of 18th Century philosopher Jeremy Bentham, and transcribe them using a wiki-based collaborative approach. It is being run by the Bentham Project at UCL, with support from ourselves and UCL’s newly-launched Centre for Digital Humanities. You can read an overview of the project on Melissa Terras’s blog.

Obviously, transcription of manuscript materials is an important digitisation activity that can rarely, if ever, be left to computers, in the way that printed texts can be, using OCR. But it’s painstaking and laborious work, and anything that eases the burden is welcome.

The project is already throwing up some very interesting conversations about transcription.  At ULCC we have thought about transcription before, particularly with regard to our ongoing work for the Linnean Society archives, and we hope that there will yet be synergies to exploit. It is a great feeling to be so closely involved with disseminating the work of two such seminal figures as Linnaeus and Bentham.

We’re not naïve enough to think that collaborative web-based transcription is new, but we’ve yet to find any substantial comparable examples. A comment on UCL’s Digital Humanities blog teases us with the prospect of information about other similar projects, but fails to provide even a single link or hint, so is effectively useless: hardly in the collaborative spirit! A more useful lead was Joanne Evans’ link to the National Library of Australia’s Australian Newspapers project, which is crowdsourcing the proof-reading and correcting of OCR outputs, and has an impressive-looking site – I’m sure we’ll be borrowing some ideas from there.

Another useful lead has been from Ben Brumfield of Austin, Texas, directing us to his blog about collaborative manuscript transcription which has been going even longer than DA Blog, and looks like it’s going to make interesting reading. Ben’s recent blog post about a distributed transcription exercise of the US Geological Survey’s Bird Phenology Program includes a link to a training video for volunteers (it even sounds like it’s been recorded in a birdhouse).  In the video we can see a database-form approach to transcription, which is particularly appropriate for transcribing data already entered on structured forms.

For more heterogeneous and free-form texts, such as the Bentham manuscripts, wikis seem to me much more appropriate, being in essence discrete hypertext engines. As for collaborative features, MediaWiki in particular has strong and proven features: there can be few better advertisements for effective virtual, global collaboration and crowdsourcing than Wikipedia.

One thing that is particularly compelling about the BPP video is that it is an excellent example of a thorough approach to online collaboration, giving clear and unequivocal guidance to contributors. Now that screencast tools are so readily available, it’s clear that for many activities like this, video-based instruction is the ideal tool, and often preferable to any number of written instructions. No less than for online teaching and learning environments, the need for effective induction and inclusive management of the online community must never be overlooked.

AddThis Social Bookmark Button

Open Repositories 2009

June 10th, 2009 Richard M. Davis Posted in Events, Linnean Online, SNEEP 1 Comment »

Georgia Aquarium by Driek Heesakkers on Flickr (CC:by-nc-sa)Less than three weeks have passed since I found myself at Open Repositories 2009 (#OR09) in Atlanta, and it already seems a long time ago. For the record, Georgia Tech put on an excellent show, overflowing with fascinating presentations, people and ideas – far too many to take in – and (most importantly) an excellent and entertaining dinner at the Georgia Aquarium.

I took a smashing poster describing our work on Linnean Online and the SNEEP extensions for EPrints, and also spoke about these projects to the EPrints User Group sessions and had to endure the now inevitable Minute Madness. I was pleased to spot the SNEEP Comments plugin in use when Jessie Hey demonstrated EdShare, another of Southampton’s learning resource repository projects. It was also great to meet up again with Patrick McSweeney who has been tweaking SNEEP at Southampton, and discuss ways of keeping ongoing work on the plugins in sync. Regular readers may remember Patrick from OR08, and he cut an even more unforgettable figure this time.

The talk of the event seemed to be the relentless buzz around the unification of DSpace/Fedora Commons, engendering the new creation that is DuraSpace (and DuraCloud). This offers a lot of exciting possibilities that we’ll need to keep track of, though it won’t be the first repositories event that has offered us a surfeit of jam tomorrow… For now, for the curious, here’s the Duraspace FAQ.

By contrast, it’s slightly disappointing that, over the water, the EPrints user group seemed a tad under-subscribed. Features available in EPrints 3.1.x, and those imminent for 3.2, from cloud storage controllers and desktop folder visualisations to preservation support, promise quick wins for anyone wanting to push the repository model further: Les and the EPrints team waste no time in responding to the latest demands of the zeitgeist. All the same, informal discussions with users and non-users of EPrints suggested substantial resistance to its Perl-based core. Yet EPrints continues to push more configurability away from its Perl source: in the kind of repository-driven future oft foretold – from WordPress-type exensibility to modular service-oriented solutions – the underlying code base ought to become increasingly irrelevant as long as the package does what it says on the tin.

As usual it was great to meet some old friends, and lots of people for the first time. Memorably serendipitous (re-)discoveries included:

  • Bibapp – “a Campus Research Gateway and Expert Finder”. There have been many attempts to integrate personalised, portfolio pages with repositories, and this looks like an effort worth investigating further, particularly as it claims to be repository neutral (and a good excuse to try out Ruby for real?).
  • ParallelArchive – another variant on the repository model: “a personal scholarly workspace, a collaborative research environment, and a digital repository”. Run by Open Society Archives (OSA) at Central European University in Budapest – of particular interest to students of cold war and related issues
  • E-Lis – still a superb multilingual collection of LIS resources, and undoubtedly the acid test of all EPrints internationalisation efforts
  • MIT Open CourseWare – the mother of all OERs?
  • The great Peter Sefton – great to meet him at last, at 6′ 7″, someone I can truly look up to. For a much more thorough account of the conference, see Pete’s Blog

I didn’t manage anything in the way of sightseeing, though the Aquarium seemed to be top of most locals’ list of recommendations, and we went there. Perhaps I should have made more of an effort to see the Civil War museum. For the visual record of OR09, content and context, you might like to see Jim Downing’s photos from the event, and the official photo OR09 set on Flickr.

AddThis Social Bookmark Button

Fish and Correspondence on Linnean-Online

November 6th, 2008 Rory McNicholl Posted in Linnean Online, News No Comments »

Hippocampus hippocampus

Since November 2007 the linnean-online collection has allowed the public access to digitised images of plant (and since this summer insect) specimens held by the Linnean Society of London. This week the on-line collection has been expanded to include 144 images of fish specimens held by the society. This part of the ongoing digitisation project was made possible thanks to generous funding from amongst others His Majesty the Emperor of Japan and the Worshipful Company of Fishmongers.

As well as these further specimens, 3845 images of correspondence to and from Carl Linnaeus have also joined the on-line collection. The correspondence enrich the collection providing important insight and historical context. Most of the letters are written in latin, however where possible links are provided to translated summaries at The Linnaean correspondence, an electronic edition prepared by the Swedish Linnaeus Society, Uppsala, and published by the Centre international d’étude du XVIIIe siècle, Ferney-Voltaire.

AddThis Social Bookmark Button

Open Repositories 2008 in Southampton

April 2nd, 2008 Richard M. Davis Posted in Events, Linnean Online, Reports, SNEEP 1 Comment »

Enjoyed the last couple of days at OR08 in Southampton, catching up with the OR crowd and developments, as well as presenting on our work with Eprints for Linnean Online and SNEEP.

The conference was organised with gusto by Les Carr and the Southampton team, who kept things moving at a rapid pace, seemingly unphased by any of the inevitable challenges of staging an event like this for over 300 people. Encouragingly, as well as the usual crowd from western Europe and anglophone countries, other countries represented this time included Serbia, Ukraine and Sri Lanka.

One interesting innovation was using Crowdvine to create an online community of delgates, which proved very simple and effective. And of course there’s an Eprints repository of all the conference papers and proceedings.

Richard at OR08. Photo: R. Akerman.

Read the rest of this entry »

AddThis Social Bookmark Button

Launch of Linnean Online

November 2nd, 2007 Richard M. Davis Posted in Linnean Online, Reports 3 Comments »

Delphinium grandiflorum from the Linnaean HerbariumLast night the Linnean Society launched the new image repository that we have developed for it using Eprints. The address of the system is http://www.linnean-online.org/. It contains over 14,000 high-resolution images of the Linnaean Herbarium – a collection of botanical specimens begun by Carl Linnaeus.

At the Society’s headquarters in Piccadilly, Julia Hoare, Linnean Society’s Project Manager for the extensive digitisation project, delivered a consummate and well-received demonstration to a colloquium of orchid specialists. This event is being treated as a “soft launch”, and feedback is being sought from this small community initially, before the system gets a full public launch in December.

Read the rest of this entry »

AddThis Social Bookmark Button