<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>nostuff.org &#187; libraries, library technologies &amp; open data</title>
	<atom:link href="http://www.nostuff.org/words/category/information-searching-and-libraries/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.nostuff.org/words</link>
	<description>living up to its name</description>
	<lastBuildDate>Mon, 19 Mar 2012 00:22:49 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com"/><atom:link rel="hub" href="http://superfeedr.com/hubbub"/>		<item>
		<title>MARC Tools &amp; MARC::Record errors</title>
		<link>http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/</link>
		<comments>http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/#comments</comments>
		<pubDate>Fri, 15 Jul 2011 12:17:59 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[code]]></category>
		<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[catalogue]]></category>
		<category><![CDATA[library technology]]></category>
		<category><![CDATA[marc]]></category>
		<category><![CDATA[marc21]]></category>
		<category><![CDATA[perl]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=698</guid>
		<description><![CDATA[I know next to nothing about MARC,though being a shambrarian I have to fight it sometimes. My knowledge is somewhat binary, absolutely nothing for most fields/subfields/tags but &#8216;fairly ok&#8217; for the bits I&#8217;ve had to wrestle with. [If you don't know that MARC21 is an ageing bibliographic metadata standard, move on. This is not the blog post you're looking [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/ircount-repository-record-statistics/' rel='bookmark' title='ircount : Repository Record Statistics'>ircount : Repository Record Statistics</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/" data-text="MARC Tools &#038; MARC::Record errors" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>I know next to nothing about MARC,though being a <a href="http://www.shambrarian.org/">shambrarian</a> I have to fight it sometimes. My knowledge is somewhat binary, absolutely nothing for most fields/subfields/tags but &#8216;fairly ok&#8217; for the bits I&#8217;ve had to wrestle with.</p>
<p>[If you don't know that MARC21 is an ageing bibliographic metadata standard, move on. This is not the blog post you're looking for]</p>
<p>Recent encounters with MARC</p>
<ul>
<li>Importing MARC files in to our Library System (<del>Talis</del> Capita <a href="http://www.talis.com/alto/">Alto</a>), mainly for our e-journals (so users can search our catalogue and find a link to a journal if we subscribe to it online). Many of the MARC records were of poor quality and often did not even state the item was (a) a journal (b) online. Additionally Alto will only import if there is a <a href="http://www.loc.gov/marc/bibliographic/bd001.html">001</a> field, even though the first thing it does is move the 001 field to the <a href="http://www.loc.gov/marc/bibliographic/bd035.html">035</a> field and create its own. To handle these I used a very simple script to run through the MARC file &#8211; using <a href="http://search.cpan.org/dist/MARC-Record/">MARC::Record</a> &#8211; to add an 001/<a href="http://www.loc.gov/marc/bibliographic/bd006.html">006</a>/<a href="http://www.loc.gov/marc/bibliographic/bd007.html">007</a> where required.</li>
<li>Setting up <a href="http://sabre.sussex.ac.uk/">sabre</a> &#8211; a web catalogue which searches the records of both the University of Sussex and the University of Brighton &#8211; we need to pre-process the MARC records to add extra fields, in particular a field to tell the software (<a href="http://vufind.org/">vufind</a>) which organisation the record was from.</li>
</ul>
<h3>Record problems</h3>
<div>One of the issues was that not all the records from the University of Brighton were present in sabre. Where were they going missing? Were they being exported from the Brighton system? copied to the sabre server ok? Being output through the perl scritp? lost during the vufind import process?</div>
<div>To answer these questions I needed to see what was in the MARC files, the problem is that MARC is a binary format so you can&#8217;t just fire up vi to investigate. The first tool of the trade is a quick script using MARC::Record to convert a MARC file to text file. But this wasn&#8217;t getting to the bottom of it. This lead me to a few PC tools that were of use.</div>
<h3>PC Tools</h3>
<div><strong><a href="http://people.oregonstate.edu/~reeset/marcedit/html/index.php">MarcEdit</a></strong> : Probably the best known PC application. It allows you to convert a MARC file to text, and contains an editor as well as a host of other tools. A good swiss army knife.</div>
<div><strong>MARCView</strong> : Originally from <a href="http://www.systemsplanning.com/marconvert/">Systems Planning</a> and now <a href="http://oclc.org/developer/content/marcviewmarconvert-software-description">provided by OCLC</a>, I had not come across MARCView until recently. It allows you to browse and search through a file containing MARC records. Though the browsing element does not work on larger files.</div>
<div>
<div id="attachment_700" class="wp-caption alignright" style="width: 624px"><a href="http://www.nostuff.org/words/wp-content//2011/07/marcview.jpg"><img class="size-large wp-image-700 " title="marcview" src="http://www.nostuff.org/words/wp-content//2011/07/marcview-1024x867.jpg" alt="marcview" width="614" height="520" /></a><p class="wp-caption-text">marcview</p></div>
<p>&nbsp;</p>
<p><strong>USEMARCON</strong> is the final utility. It comes with a GUI interface, both of which can be<a href="http://www.nationallibrary.fi/libraries/format/usemarcon.html"> downloaded from The National Library of Finland</a>. The British Library also have <a href="http://www.bl.uk/bibliographic/usemarcon.html">some information</a> on it. Its main use is to convert MARC files from one type of MARC to another, something I haven&#8217;t looked in to, but the GUI provides a way to delve in to a set of MARC records.</p>
<h3>Back to the problem&#8230;</h3>
<p>So we were pre-processing MARC records from two Universities before importing them in to vufind using a Perl script which had been supplied by another University.</p>
<p>It turns out the script was crashing on certain records, all records after the problematic record were not being processed. It wasn&#8217;t just that script, any perl script using MARC::Record (and MARC::batch) would crash when it hit a certain point.</p>
<p>By writing a simple script that just printed out each record we could as least see what the record was immediately <em>before</em> the record causing it to crash (i.e. the last in the list of output). This is where the PC applications were useful. Once we know the record before the problematic record, we could find it using the PC viewers and then move to the next record.</p>
<p style="text-align: center;"><a href="http://www.nostuff.org/words/wp-content//2011/07/marcedit.jpg"><img class="size-full wp-image-702 aligncenter" title="marcedit" src="http://www.nostuff.org/words/wp-content//2011/07/marcedit.jpg" alt="" width="690" height="667" /></a></p>
<p style="text-align: left;">The issue was certain characters (here in the 245 and 700 fields). I haven&#8217;t got to the bottom of what the exact issue is. There are two kinds of popular encodings: <a href="http://en.wikipedia.org/wiki/MARC-8">MARC-8</a> and records in UTF-8, and this can be designated in the Leader (<a href="http://www.loc.gov/marc/bibliographic/bdleader.html">9th</a> character). I think Alto (via it&#8217;s marcgrabber tool) exports in MARC-8 but perhaps the characters in the record did not match the specified encoding.</p>
<p style="text-align: left;">The title (245) on the orignal catalogue looks like this:</p>
<p style="text-align: left;"><a href="http://www.nostuff.org/words/wp-content//2011/07/problemcahronprism.jpg"><img class="aligncenter size-full wp-image-708" title="problem title/author on original catalogue" src="http://www.nostuff.org/words/wp-content//2011/07/problemcahronprism.jpg" alt="" width="661" height="70" /></a></p>
<p style="text-align: left;">One work around was to use a slightly hidden feature of MarcEdit to convert the file to UTF:</p>
<p style="text-align: left;"><a href="http://www.nostuff.org/words/wp-content//2011/07/marcedit2.jpg"><img class="aligncenter size-full wp-image-703" title="marcedit2" src="http://www.nostuff.org/words/wp-content//2011/07/marcedit2.jpg" alt="" width="599" height="403" /></a></p>
<p style="text-align: left;">I was then able to run the records through the perl script, and import it in to vufind.</p>
<p style="text-align: left;">But clearly this was not a sustainable solution. Copying files to my PC and running MarcEdit was not something that would be easy to automate.</p>
<h3 style="text-align: left;">Back to MARC::Record</h3>
<p style="text-align: left;">The error message produced looked something like this:</p>
<pre style="text-align: left;">utf8 "\xC4" does not map to Unicode at /usr/lib/perl/5.10/Encode.pm line 174</pre>
<p style="text-align: left;">I didn&#8217;t find much help via Google, though did find a few mentions of this error related to working with MARC Records.</p>
<p style="text-align: left;">The issue was that the script loops through each record, the moment it tries to start a loop with a record it does not like it crashes, so there is no way to check for certain characters in the record as it will already be too late.</p>
<p style="text-align: left;">Unless we use something like exceptions. The closest to this perl has out-of-the-box is <em>eval</em>.</p>
<p style="text-align: left;">By putting the whole loop in to an <em>eval</em>, if it hits a problem the eval simply passed the flow down to the <em>or do</em> part of the code. But we want to continue processing the records, so this simply calls the <em>eval</em> again, until it reaches the end of the record. You can <a href="http://code.google.com/p/credaul/source/browse/branches/110715/marc/list_records.pl">see a basic working example of this here</a>.</p>
<p style="text-align: left;">So if you&#8217;ve having problems processing a file of MARC records using perl MARC::Record / MARC::batch try wrapping it in a eval. You&#8217;ll still loose the records it can not process but it wont stop in it&#8217;s tracks (and you can output an error log to record the record number of the records with errors).</p>
<h3 style="text-align: left;">Post-script</h3>
<p style="text-align: left;">So, after pulling my hair out, I finally found a way to process a filewhich contains records which cause MARC::Record to crash. It had caused me much stress as I needed to get this working, and quickly, in an automated manner. As I said, the script had been passed to us by another University and it already did quite a few things so I was a little unwilling to rewrite using another language (though a good candidate would be php as the vufind script was written in that language and didn&#8217;t seem to have these problems).</p>
<p style="text-align: left;">But in writing this blog post, I was searching using Google to re-find the various sites and pages I had found when I encountered the problem. And in doing so I had found this: <strong><a href="http://keeneworks.posterous.com/marcrecord-and-utf">http://keeneworks.posterous.com/marcrecord-and-utf</a> </strong></p>
<p style="text-align: left;">Yes. I had actually already resolved the issue,<em> and blogged about it</em>, back in early May. I had somehow &#8211; worryingly - completely forgotten any of this. Unbelievable! You can find a copy of a script based on that solution (which is a little similar to the one above) <a href="http://code.google.com/p/credaul/source/browse/branches/110715/marc/list3.pl">here</a>.</p>
<p style="text-align: left;">So there you are, a few PC applications and a couple of solutions to perl/MARC issue.</p>
</div>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/ircount-repository-record-statistics/' rel='bookmark' title='ircount : Repository Record Statistics'>ircount : Repository Record Statistics</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>VuFind in 8 minutes using Amazon EC2</title>
		<link>http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/</link>
		<comments>http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/#comments</comments>
		<pubDate>Wed, 26 Jan 2011 13:02:24 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[code]]></category>
		<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[ec2]]></category>
		<category><![CDATA[vufind]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=670</guid>
		<description><![CDATA[I&#8217;ve created a screencast showing how easy it can be to install VuFind. Here I go from nothing (no server, no OS) to full VuFind install in under 8 minutes. It would probably take less than two minutes (under 10mins in total) to add MARC records to the installation, but I didn&#8217;t have any to [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/' rel='bookmark' title='Amazon AWS EC2 and vufind'>Amazon AWS EC2 and vufind</a></li>
<li><a href='http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/' rel='bookmark' title='MARC Tools &amp; MARC::Record errors'>MARC Tools &#038; MARC::Record errors</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/" data-text="VuFind in 8 minutes using Amazon EC2" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>I&#8217;ve created a screencast showing how easy it can be to install VuFind. Here I go from nothing (no server, no OS) to full VuFind install in under 8 minutes.</p>
<p><object id="scPlayer" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="640" height="562" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="data" value="http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick_controller.swf" /><param name="quality" value="high" /><param name="bgcolor" value="#FFFFFF" /><param name="flashVars" value="containerwidth=640&amp;containerheight=562&amp;showstartscreen=true&amp;showendscreen=true&amp;loop=false&amp;autostart=false&amp;color=000000,000000&amp;thumbscale=45&amp;content=http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick.mp4&amp;blurover=false" /><param name="allowFullScreen" value="true" /><param name="scale" value="showall" /><param name="allowScriptAccess" value="always" /><param name="base" value="http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/" /><param name="src" value="http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick_controller.swf" /><param name="flashvars" value="containerwidth=640&amp;containerheight=562&amp;showstartscreen=true&amp;showendscreen=true&amp;loop=false&amp;autostart=false&amp;color=000000,000000&amp;thumbscale=45&amp;content=http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick.mp4&amp;blurover=false" /><param name="allowfullscreen" value="true" /><embed id="scPlayer" type="application/x-shockwave-flash" width="640" height="562" src="http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick_controller.swf" base="http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/" allowscriptaccess="always" scale="showall" allowfullscreen="true" flashvars="containerwidth=640&amp;containerheight=562&amp;showstartscreen=true&amp;showendscreen=true&amp;loop=false&amp;autostart=false&amp;color=000000,000000&amp;thumbscale=45&amp;content=http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick.mp4&amp;blurover=false" bgcolor="#FFFFFF" quality="high" data="http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick_controller.swf"></embed></object></p>
<p>It would probably take less than two minutes (under 10mins in total) to add MARC records to the installation, but I didn&#8217;t have any to hand at the time.</p>
<p>This demo cheats a bit by using a script that does the heavy work, the script was a mash up I created taking existing scripts and commands that come with VuFind with a few tweaks. It probably would have been only slightly slow to run most commands manually.</p>
<p>The script in <a href=http://092.me>question</a> is at <a href="http://www.nostuff.org/vufind-install.txt">http://www.nostuff.org/vufind-install.txt</a> and of course anyone is free to download (and improve, please share changes). There&#8217;s lot of potential to improve it&#8217;s ability to work on different flavours of Linux.</p>
<h3>Multi Instance</h3>
<p>One of the key aspects of the script is that is allows you to easily install multiple instances of VuFind on to a server. By default VuFind installs in to /usr/local/vufind and has other things (databases, apache conf) names vufind. The script prompts for an &#8216;instance name&#8217; and then uses that in place of &#8216;vufind&#8217;.</p>
<p>The rational for this is my feeling that VuFind is an excellent tool for creating niche catalogues that are a subset of the full Library collection (or as<a href="http://twitter.com/#!/Viche/status/29845394116579328"> someone put</a> it a &#8216;tranche of the catalogue&#8217;). A branch Library, particular collection, rare books, a particular School, Core reading (short loan books), specialist resources (AV / laptop items for loan) etc. The idea of a organisation&#8217;s records being in system, rather than many (of varying quality) makes sense, but it&#8217;s reasonable for those moving their records to a central system to want to be able to search their records independently of other records (and expecting users to go to an Advanced search of using a refine option of the main catalogue is not really not an option). VuFind seems like an obvious <a href=http://092.me>answer</a> to this. Especially if new instances can be set up quickly.</p>
<p>In fact it seems to be a failing of most of the main Library Manage Systems (ILS) and their web interfaces that being able to create lots of interfaces (slicing and dicing the underlying single large pool of records). Most require a large amount of effort to create a second web interface to a catalogue. This seems like such an obvious flaw and a barrier to moving to one system to manage resources such as books and other items for an entire organisation.</p>
<h3>Amazon EC2</h3>
<p>Amazon AWS is a great tool to use for this. A small instance will cost around $0.10 an hour, the micro instance is even cheaper (just over $0.02). Create an instance for ten hours and you have spent around a dollar. Mess it up, just destroy it and create a new one. No risk and no hassle (for first time users the hardest thing is probably the ssh private keys).</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/' rel='bookmark' title='Amazon AWS EC2 and vufind'>Amazon AWS EC2 and vufind</a></li>
<li><a href='http://www.nostuff.org/words/2011/marc-tools-marcrecord-errors/' rel='bookmark' title='MARC Tools &amp; MARC::Record errors'>MARC Tools &#038; MARC::Record errors</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://content.screencast.com/users/SussexLibrary/folders/Default/media/c438eb43-1971-4282-a01a-4e4f598126f6/vufind-quick.mp4&amp;amp" length="19288032" type="video/mp4" />
		</item>
		<item>
		<title>Jisc bid writing</title>
		<link>http://www.nostuff.org/words/2010/jisc-bid-writing/</link>
		<comments>http://www.nostuff.org/words/2010/jisc-bid-writing/#comments</comments>
		<pubDate>Mon, 15 Nov 2010 23:18:33 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[jisc]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=647</guid>
		<description><![CDATA[Today I submitted a JISC bid on behalf of a team, as part of the recent JISC call Infrastructure for Education and Research (&#8217;15/10&#8242; to its friends). The call was actually a set of broadly related different strands, we submitted (with a whole 30 mins to spare) under a strand called Infrastructure for Resource Discovery, [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/jisc-library-management-system-review/' rel='bookmark' title='JISC Library Management System Review'>JISC Library Management System Review</a></li>
<li><a href='http://www.nostuff.org/words/2009/jisc-monitter-and-dius-department-of-innovation-universities-and-skills/' rel='bookmark' title='JISC, Monitter and DIUS (Department of Innovation, Universities and Skills)'>JISC, Monitter and DIUS (Department of Innovation, Universities and Skills)</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2010/jisc-bid-writing/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2010/jisc-bid-writing/" data-text="Jisc bid writing" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2010/jisc-bid-writing/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>Today I submitted a JISC bid on behalf of a team, as part of the recent JISC call <a href="http://www.jisc.ac.uk/fundingopportunities/funding_calls/2010/10/grant1510.aspx">Infrastructure for Education and Research</a> (&#8217;15/10&#8242; to its friends). The call was actually a set of broadly related different strands, we submitted (with a whole 30 mins to spare) under a strand called Infrastructure for Resource Discovery, and there&#8217;s a <a href="http://infrastructurecalloct2010.jiscpress.org/"><a href=http://092.me>nice</a> web based version of the call</a> on Jiscpress.</p>
<p>Jiscpress was created in part by Joss Winn and a <a href="http://joss.blogs.lincoln.ac.uk/2010/11/15/writing-jisc-bids/">post of his I saw today</a> inspired me to knock out this this rambling.  Go <a href="http://joss.blogs.lincoln.ac.uk/2010/11/15/writing-jisc-bids/">read it</a> before you read this, thanks.</p>
<p>I admire his openness and I should strive to do the same. Funny that I try &#8211; and to an extent automatically do &#8211; make much of what I do open, but with this sort of thing there is a tendency to keep it close to your chest. There were very few tweets in the run up to the bid. Why are we not more open? He also talks about his JISC bid writing and tips, here&#8217;s mine.</p>
<p>My first experience was attending a &#8216;town hall meeting&#8217; in Birmingham about the JISC <a href="http://www.jisc.ac.uk/whatwedo/programmes/capital.aspx">Capital programme</a>, around 2006. For a starts I didn&#8217;t even know what a Town hall meeting was (I think it means a briefing day, everyone presumed you should know this). I do remember it felt daunting, there were lots of people in suits. Lots and lots of sentences I didn&#8217;t understand (We&#8217;re going to innovate to take vertical solutions and silos and break them in to a horizontal landscape to help foster a new wave of disruption to help inject in to the information environment testbed) and no one I knew. I looked at the massive documents that accompanied the programme, many of them, many times. And looked at what I needed to do to write a bid. Budgets, signatures, partners, matched funding. I didn&#8217;t submit one.</p>
<p>Since then the community has developed, in no small part thanks to Twitter, but also to things like mashlib and many one day events (which either never used to exist in the field that I work, or I was just more ignorant then than I am now). Beer has been a big part of forging links in the HE Library / tech community. Seriously. It really needs its own cost code.</p>
<p>I looked at a number of potential calls over the last few years &#8211; often they required a report or development that I had no really knowledge in, I almost came close to putting something in for the JISC <a href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/jiscri.aspx">rapid innovation</a> call (and helped mark it). When the JISC LMS call came out about a year a go the time and subject were right to submit a bid. I knew the subject matter, I had a natural interest and passion, and I knew the people who would be involved in these sorts of things.</p>
<p>These are tips for people who are thinking of putting in a bid, especially those who are stupidly disorganised like me:</p>
<ul>
<li>Time between a call being released and the submission deadline is short, normally about a month, which in HE terms is not long. Use the JISC funding timeline to get a heads up of future funding opportunities so that you can prepare for working on a bid (including blocking off time during a month, and arranging initial meetings with others) before it comes out and not taken by surprise. The JISC 15/10 call had a blog post a few weeks before the call came out giving a feel for the call and confirming the date it should be released. It helped me to start thinking about ideas and block out time to read it (even if some of that time was in the evening) on the day it was released.</li>
<li>Every organisation is different (that applies to everything I say) but for us, setting up a meeting a couple of days after the call was out was very useful. It included those who it could affect and relevant senior staff. The call had lots of areas which matched our goals (and some, not always the same, that matched my personal interests), it was good to prepare a briefing and then bounce those ideas around to see what had potential and see what other ideas came up. It helps in many ways, to quickly focus and refine potential ideas (and drop those that people show no interest in), keep everyone in the loop and see whose willing to work on it. It stops it being one person or departments little side project.</li>
<li>The briefing day was very useful, especially for talking to people, finding potential partners and getting great advice.</li>
<li>Now I have an incredible amount of bad points, but two of them are leaving everything to the last minute and working in a very linear fashion. Often things that feel like they are the last things you need to do are actually things you need to set in motion earlier on. This seems so simple typing it now but I&#8217;ll probably (be warned colleagues) do the same next time. These include budget, project name and supporting letters.</li>
<li>The budget is hard. See if your org offers support in doing this. The problem is certain magic numbers (the wonders of trac and fec) can only be calculated once you know all your other costs. However I tend to find that near the end of the bid writing process you suddenly think of some work a particular group/person/org will need to do so you need to factor in those hours and costs, or you invite someone from the other end of the country to be on a panel and need to cater for their travel and hotel costs. In best &#8216;do as I say don&#8217;t do as I do&#8217; tradition I would try and bash this out well in advance so it can be sent to those who can then check it over and fill in the magic numbers.</li>
<li>Asking a friend at another Uni if they don&#8217;t mind asking their (P)VC to drop everything so that he/she can write a <a href=http://092.me>nice</a> supporting letter for your project is hard. So try and avoid it by getting it done sooner. Again often easier said than done as projects tend to evolve during the bid writing process which can make letters reflect out of date ideas or stress the wrong areas.</li>
<li>Letters and other stuff need a project name. I&#8217;m guilty of really not thinking a name is that important. The acronym will be meaningless to all. On my first bid I just used a working name (all of 5 seconds thought) and right at the end asked everyone if they are happy to go with it. Mistake. Changing project name at the last minute is a pain.</li>
<li>A key point. You need a good idea. And a good idea is one that is a good fit to the call. You may have a perfect methodology but if the idea doesn&#8217;t fit with the call then you could be in trouble. I&#8217;m guessing &#8216;It&#8217;s not really what you&#8217;re after but it&#8217;s such a good idea you must want to fund it&#8217; is not a good sign.</li>
<li>Speak to people, I mentioned the briefing day above, but also speak to the programme manager, they&#8217;re <a href=http://092.me>nice</a> people! Talk about it on twitter.</li>
<li>You don&#8217;t need to be an expert. I was put off for years from writing a bid about things I was interested in but didn&#8217;t think I knew enough about. You can ask people to work with you! People who know how to do stuff. I&#8217;ve just submitted a bid about Linked Data. Now I&#8217;ve followed the rise of Linked Data for years and tried to learn about it, but taking an XML file and &#8216;converting&#8217; (is that the right term?) in to Linked Data, I had no idea how to even start. But I spoke to some people, who recommended someone, and they do know what to do</li>
<li>Approaching others out the blue is difficult, especially if you don&#8217;t feel &#8216;part of it&#8217;. All I can say is ask. And if you don&#8217;t know who to approach ask people (either at the JISC or via twitter) for advice.</li>
<li>If you have a clear(ish) idea of what you are going to do, broken down in to mini packages of work, andwho is going to do each one of them, then writing the actually bid is easy. Treat it like a job application. We all know that when writing a job app use the Person spec as a structure, a paragraph for each entry of the person spec, perhaps even with headings to help those marking your application. A bid is just the same, the clearly laid out structure of a bid is worth sticking to, it&#8217;s the same thing the markers will have to use to score each section. If the JISC Call document &#8216;proposal outline&#8217; refers to a section which talks about Project Management, leadership, copyright, evaluation and sustainability. Then write about those things together as clearly as possible. Long winded paragraphs which ramble on about everything and make subtle implied passing references are a bain to the marker and no help to you.</li>
<li>But, I have been involved in marking and assessing bids, what impressed me was the impartial way bids were judged, and the real desire of wanting to fund Good Ideas, even if the actually bid document needs a little clarity in some parts. Especially from first bidders. To stress, there was a real desire to see first time bidders (with a good idea) be successful.</li>
<li>So the actually bid write up can in a way be left later than other tasks mentioned above, as it mainly involves just you, (and probably a couple of people who work closely with you to check it).</li>
<li>In an ideal world this would all be done weeks before it needed submission. In real life other factors (and work) can mean it can be a last minute dash. That&#8217;s fine. But make sure a few days before it has to be submitted you put in to an email to yourself (and others involved in writing it up): The email address to send it to, the submission date time, cut and paste things such as the exact format it needs to be submitted in (how many fiels, how big), number of pages. Add a link below each of these facts to the actually source of information (jiscpress is excellent for this), so when you&#8217;re panicing and presume everything you know is wrong you can follow a link and see for yourself that it really is eight pages max for the proposal, direct from the horses mouth.</li>
</ul>
<p>The whole process of submitting a bid, and running a project, is good experience. It often involves working with people you would not normally, and doing differently to your normal job. Now if I get a chance in the next few days I will follow Joss&#8217; example and blog about our proposal.</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/jisc-library-management-system-review/' rel='bookmark' title='JISC Library Management System Review'>JISC Library Management System Review</a></li>
<li><a href='http://www.nostuff.org/words/2009/jisc-monitter-and-dius-department-of-innovation-universities-and-skills/' rel='bookmark' title='JISC, Monitter and DIUS (Department of Innovation, Universities and Skills)'>JISC, Monitter and DIUS (Department of Innovation, Universities and Skills)</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2010/jisc-bid-writing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Talis Aspire, checking if a course has a list</title>
		<link>http://www.nostuff.org/words/2010/talis-aspire-checking-if-a-course-has-a-list/</link>
		<comments>http://www.nostuff.org/words/2010/talis-aspire-checking-if-a-course-has-a-list/#comments</comments>
		<pubDate>Tue, 31 Aug 2010 14:26:18 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[code]]></category>
		<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[aspire]]></category>
		<category><![CDATA[curl]]></category>
		<category><![CDATA[json]]></category>
		<category><![CDATA[php]]></category>
		<category><![CDATA[talis]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/2010/talis-aspire-checking-if-a-course-has-a-list/</guid>
		<description><![CDATA[Talis Aspire is a new-ish Reading List system used at the University of Sussex Library. On Aspire, a url for a Department looks like this: http://liblists.sussex.ac.uk/departments/anthropology.html A page for a course looks like this (for course l6061): http://liblists.sussex.ac.uk/courses/l6061.html The nice thing is that you can replace the ‘.html’ with .json or .rdf – while the [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/nostuff-library-catalogue-using-the-talis-platform/' rel='bookmark' title='nostuff library catalogue using the Talis Platform'>nostuff library catalogue using the Talis Platform</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2010/talis-aspire-checking-if-a-course-has-a-list/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2010/talis-aspire-checking-if-a-course-has-a-list/" data-text="Talis Aspire, checking if a course has a list" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2010/talis-aspire-checking-if-a-course-has-a-list/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p><a href="http://www.talis.com/aspire/" target="_blank">Talis Aspire</a> is a new-ish Reading List system used at the University of Sussex Library.</p>
<p>On Aspire, a url for a <em>Department</em> looks like this:<br />
<a href="http://liblists.sussex.ac.uk/departments/anthropology.html">http://liblists.sussex.ac.uk/departments/anthropology.html</a></p>
<p>A page for a <em>course</em> looks like this (for course l6061):<br />
<a href="http://liblists.sussex.ac.uk/courses/l6061.html">http://liblists.sussex.ac.uk/courses/l6061.html</a></p>
<p>The <a href=http://092.me>nice</a> thing is that you can replace the ‘.html’ with <a href="http://liblists.sussex.ac.uk/courses/l6061.json" target="_blank">.json</a> or <a href="http://liblists.sussex.ac.uk/courses/l6061.rdf" target="_blank">.rdf</a> – while the html only has a link to the related list, the json and rdf expose other links and relationships, such as the department.</p>
<p>For us, most (but not all) courses only have one list. URLs for lists are not predictable in the same way as the courses URL. E.g.<br />
<a href="http://liblists.sussex.ac.uk/lists/EEC1E2AA-C350-DAFC-BDE4-1E9EF5EC69E5.html">http://liblists.sussex.ac.uk/lists/EEC1E2AA-C350-DAFC-BDE4-1E9EF5EC69E5.html</a></p>
<p><span id="more-618"></span>So what if you want to link to a list from an external system.</p>
<p>The University has a portal which includes for students a list of their courses and provides a link to those that have a reading list. Two issues: it needs to know which courses have a reading list on Talis Aspire, and it needs a way to link to them.</p>
<p><a href="http://www.nostuff.org/words/wp-content//2010/08/sussexdirect.png"><img style="display: inline; border: 0px;" title="Sussex Direct, links to reading lists" src="http://www.nostuff.org/words/wp-content//2010/08/sussexdirect_thumb.png" border="0" alt="Sussex Direct, links to reading lists" width="644" height="193" /></a></p>
<p>The easiest way to achieve the former is for the central admin/portal server to query (make a http request) for each course provided by the University, if it gets a 404, no list exists for that course. It can then flag each course that has a list (using a field in the database).</p>
<p>And what about linking to a list. We actually have three options. The first is to simply link to the course link above (second link from the top), this will show any lists for that course, but will normally require the user to click on the single reading list link to see the list items, which isn’t ideal. The second method is to process the json data in the step above (where we were checking for the existence of a course page), the data will include the URL for the list(s) which could then be stored in the portal DB.</p>
<p>But the third option is the easiest. Turns out there is a <a href=http://092.me>nice</a> URL you can use, similar to the one above, which will take the user directly to a list for a course, if there is only one, if there is more than more it will show a list (of, ummm, lists) instead, and this URL is predictable for any given course code.</p>
<p>Here’s an example of both scenarios. Thanks for Chris Clarke at Talis for the information on this.</p>
<p><a href="http://liblists.sussex.ac.uk/courses/l6061/lists.html">http://liblists.sussex.ac.uk/courses/l6061/lists.html</a> [one list, norm]<br />
<a href="http://liblists.sussex.ac.uk/courses/v1367/lists.html">http://liblists.sussex.ac.uk/courses/v1367/lists.html</a> (<a href="http://liblists.sussex.ac.uk/courses/v1367/lists.json" target="_blank">json</a>) [two lists]</p>
<p>Finally, I knocked up some bad php to demonstrate this. It queries the url above, and returns either the urls of the list(s) or a message to say there is no course or lists for that course code.</p>
<p><a href="http://www.lib.sussex.ac.uk/coursechecker.php?course=v1367">http://www.lib.sussex.ac.uk/coursechecker.php?course=v1367</a><br />
<a href="http://www.lib.sussex.ac.uk/coursechecker.php?course=l6066">http://www.lib.sussex.ac.uk/coursechecker.php?course=l6066</a><br />
<a href="http://www.lib.sussex.ac.uk/coursechecker.php?course=abc">http://www.lib.sussex.ac.uk/coursechecker.php?course=abc</a> [doesn’t exist]</p>
<blockquote><p>&lt;?php</p>
<p># This script is past a course code<br />
# it returns any reading lists attached to that coursecode</p>
<p>$baseurl = &#8220;<a href="http://liblists.sussex.ac.uk&quot;;">http://liblists.sussex.ac.uk&#8221;;</a></p>
<p>$coursecode = $_GET["course"];<br />
if ($coursecode == &#8220;&#8221; OR $coursecode == null) {<br />
echo &#8220;no course&#8221;;<br />
exit;<br />
}</p>
<p># this url will exist if there are lists for this course,<br />
# and will not exist (404) if there are not<br />
$url = $baseurl . &#8220;/courses/&#8221; . $coursecode . &#8220;/lists.json&#8221;;<br />
# &#8230;also available in html and rdf</p>
<p># Get webpage using curl<br />
$ch      = curl_init( $url );<br />
# OPTION: don&#8217;t put it on screen<br />
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);<br />
$page = curl_exec($ch);<br />
$header  = curl_getinfo( $ch );<br />
curl_close($ch);<br />
$statuscode = $header['http_code'];</p>
<p># if status code 404 then no page<br />
if ($statuscode == &#8217;404&#8242;) {<br />
echo &#8216;NOREADINGLIST&#8217;;<br />
}</p>
<p># we have a page for this course<br />
elseif ($statuscode == &#8217;200&#8242;) {<br />
# use json to return url to list(s), though if only one the original url,<br />
# ending in html, will take you to it.<br />
get_list_url($page);<br />
}</p>
<p># or maybe we have another status code<br />
else {<br />
echo &#8216;NOTSURE&#8217;;<br />
}</p>
<p>function get_list_url ($page) {<br />
# JSON<br />
$data = json_decode($page, true); #true=array</p>
<p>foreach ($data as $listurl =&gt; $stuff) {<br />
# we only want lists. not courses or departments<br />
if (preg_match(&#8220;/\/lists\//&#8221;, $listurl)) {<br />
echo &#8220;$listurl&lt;br&gt;\n&#8221;;<br />
}<br />
}</p>
<p>}</p>
<p>?&gt;</p></blockquote>
<p>So once a week, the central portal will query Talis Aspire for each course to see if there is a reading list, and store the result. And it will link to lists using the URL <a href="http://liblists.sussex.ac.uk/courses/{coursecode}/lists.html">http://liblists.sussex.ac.uk/courses/{coursecode}/lists.html</a></p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/nostuff-library-catalogue-using-the-talis-platform/' rel='bookmark' title='nostuff library catalogue using the Talis Platform'>nostuff library catalogue using the Talis Platform</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2010/talis-aspire-checking-if-a-course-has-a-list/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Library Catalogues need to cater for light-weight discovery clients</title>
		<link>http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/</link>
		<comments>http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/#comments</comments>
		<pubDate>Wed, 25 Aug 2010 13:11:26 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[aquabrowser]]></category>
		<category><![CDATA[catalogues]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[lms]]></category>
		<category><![CDATA[ngc]]></category>
		<category><![CDATA[opac]]></category>
		<category><![CDATA[resource discovery]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=610</guid>
		<description><![CDATA[Way back I wrote a piece about the changing model for library catalogues, you can see it here. The main premise was that trying to maintain records in a Library Management System (LMS/ILS) for all the items you want your users to discover is no longer feasible. This is especially true in this here digital [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/' rel='bookmark' title='Library search/discovery apps : intro'>Library search/discovery apps : intro</a></li>
<li><a href='http://www.nostuff.org/words/2009/academic-discovery-and-library-catalogues/' rel='bookmark' title='Academic discovery and library catalogues'>Academic discovery and library catalogues</a></li>
<li><a href='http://www.nostuff.org/words/2009/library_catalogues_changing_model/' rel='bookmark' title='Library catalogues, search systems and data'>Library catalogues, search systems and data</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/" data-text="Library Catalogues need to cater for light-weight discovery clients" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>Way back I wrote a piece <a href="http://www.nostuff.org/words/2009/library_catalogues_changing_model/">about the changing model for library catalogues, you can see it here</a>. The main premise was that trying to maintain records in a Library Management System (LMS/ILS) for all the items you want your users to discover is no longer feasible. This is especially true in this here digital age, trying to maintain records for all the e-journals a University has access to is an almost impossible task, and LMS were not designed for thousands of MARC records to be dropped and then re-imported (i.e. sync&#8217;d) with another source. And what about all the free stuff, is an e-book not worth being discovered by users because it is free?</p>
<p>So <em>let the LMS be a record of what your library physically holds</em>, and your discovery service a place where users can find (and see how to access) resources that are of interest to their research and work. The former being just one element (albeit a major one) of the latter. Meanwhile your LMS physically holdings can be shared with other discovery systems (such as union catalogues) to show what your library physically contains.</p>
<p><span id="more-610"></span>The blog post included a diagram. Some said it was rather crude, badly drawn and basic. Its critics weren&#8217;t so kind.</p>
<div id="attachment_287" class="wp-caption aligncenter" style="width: 310px"><a href="http://www.nostuff.org/words/wp-content//new_cat_model.png"><img class="size-medium wp-image-287" title="New Library catalogue model" src="http://www.nostuff.org/words/wp-content//new_cat_model-300x190.png" alt="" width="300" height="190" /></a><p class="wp-caption-text">New Library catalogue model</p></div>
<p>But I want to focus on the bit near the bottom, a line and &#8216;Endnote search&#8217; with not even a box around it. And the point has nothing to do with Endnote. (well a little bit but who&#8217;s counting).</p>
<p><strong>Story 1</strong></p>
<p>I had a phone call from a trainer here at Sussex. Their problem went a bit like this: When showing postgrads Endnote and reference management, a common query arose, they wanted to import multiple records from the Library catalogue in to Endnote. This seemed reasonable. She had discovered a number of ways to import records: (a) from our discovery service &#8211; <a href="http://catalogue.sussex.ac.uk/ABL/">Aquabrowser</a> (b) from our traditional library catalogue that comes with our LMS &#8211; Talis Prism (c) from within Endnote.</p>
<p>Aquabrowser has a <a href=http://092.me>nice</a> &#8216;export to Endnote&#8217; button on each record, we&#8217;d set that up. But there was no way to export multiple records. <a href="http://ustie1.lib.sussex.ac.uk/TalisPrism/">Talis Prism</a> was no better. Endnote has a feature for connecting directly to a database (in this case our catalogue) via <a href="http://en.wikipedia.org/wiki/Z39.50">Z39.50</a>. (This feature can be a real bane for libraries as it looks like a real time saver to researchers but often database vendors and &#8216;platform&#8217; middle men don&#8217;t support this type of connection for institutional subscribers.) This connects directly in to our catalogue and uses the stupidly complex and aging Z39.50 to retrieve results. And from within Endnote you can then easily add them to your Endnote library.</p>
<p>But the problem was these search results in no way reflected the results they had when using our normal catalogue, in fact they just seemed  to be of no real order. So they had to either use the web catalogue to download one record at a time, or use Endnote but with a strange set of search results which does not reflect what they found (and what they wanted to save) on the catalogue</p>
<p><strong>Story 2</strong></p>
<p>Sussex is about to launch an iphone app using <a href="http://www.ombiel.com/campusm.html">CampusM</a>. Very exciting. One of the developers was chatting to me and asked if our catalogue had an API which they could perhaps use in the future to search the catalogue from within the app. Now we Library geeks love standards and apis but the only one I could think of was <em>Z39.50</em>.</p>
<p>Problem: aside from Z39.50 being the most difficult thing to get your head around and use &#8211; is that, like Endnote above, the results would be not ordered, with no relevance ranking.</p>
<p>The issues here are the same. Neither Endnote, nor a University iphone app is trying to be a fully fledged catalogue. Even if they did have the resources to develop search relevance and rank, it would be reinventing the wheel (which can only be bad) and ultimately would differ from the results seen within our main catalogue (Aquabrowser).</p>
<p>What&#8217;s more, in the model described at the top, the resource discovery layer (e.g. Aquabrowser) may hold records for items not on the catalogue (e.g. if we were starting again, our e-journal records would not be in our LMS, but indexed straight in to Aquabrowser).</p>
<p>So these &#8216;thin clients&#8217; really just want to connect to a catalogue and get back some results in XML or similar which they just have to transform in to HTML (or display in an iphone app). They don&#8217;t want to implement a full bodied search interface, and nor should they have to.</p>
<p>The <a href=http://092.me>answer</a> is, of course, that these clients need to an API provided by the Resource Discovery system.</p>
<p>The good news is that I think this is already happening, a number provide XML interfaces which developers can use. Aquabrowser for example has a <a href="http://catalogue.sussex.ac.uk/ABL/sru.ashx?operation=searchRetrieve&amp;version=1.1&amp;query=dogs&amp;maximumRecords=100&amp;recordSchema=dc">SRU interface</a> (you may need to &#8216;browse source&#8217; to see this) and its own <a href="http://catalogue.sussex.ac.uk/ABL/result.ashx?q=cats&amp;output=xml">XML interface</a>, though this is not very well documented at all (thanks to Ed Chamberlain for the help in finding the url for it).</p>
<p>Until our front end discovery systems provide a way to allow other systems to search and retrieve results &#8211; preferably via a common standard such as SRU &#8211; we can&#8217;t expect third parties to develop new interfaces to our catalogues. Which is a shame and frankly an embarrassment. When next look at a resource discovery or web catalogue system, the ability for other systems to connect to its API and bring back ordered results will be an requirement.</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/' rel='bookmark' title='Library search/discovery apps : intro'>Library search/discovery apps : intro</a></li>
<li><a href='http://www.nostuff.org/words/2009/academic-discovery-and-library-catalogues/' rel='bookmark' title='Academic discovery and library catalogues'>Academic discovery and library catalogues</a></li>
<li><a href='http://www.nostuff.org/words/2009/library_catalogues_changing_model/' rel='bookmark' title='Library catalogues, search systems and data'>Library catalogues, search systems and data</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ircount : update</title>
		<link>http://www.nostuff.org/words/2010/ircount-update/</link>
		<comments>http://www.nostuff.org/words/2010/ircount-update/#comments</comments>
		<pubDate>Sun, 30 May 2010 17:45:42 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[ircount]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[roar]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=584</guid>
		<description><![CDATA[One Sunday morning in January this year I got an email sent automatically from the webhosting company. It contained the output of the script that ran weekly, when all ran fine the script produced no output. When something went wrong the error messages were emailed to me. Judging by the length of the email something big had gone wrong. [...]
No related posts.]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2010/ircount-update/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2010/ircount-update/" data-text="ircount : update" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2010/ircount-update/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>One Sunday morning in January this year I got an email sent automatically from the webhosting company. It contained the output of the script that ran weekly, when all ran fine the script produced no output. When something went wrong the error messages were emailed to me. Judging by the length of the email something big had gone wrong.</p>
<p>The script collected data from http://roar.eprints.org/ &#8211; to be used as this weeks &#8216;number of records&#8217; for each repository.</p>
<p>The reason became clear quickly. A major revamp to ROAR had just been launch, showing off a new interface, which used the Eprints software as a platform (essential a repository or repositories). This was a great leap forward but unfortunately removed the simple text file I used to collect the data, and what was more, the IDs for each IR had changed.</p>
<p>I finally got around to fixing this in May. The most fiddly bit was linking the data I collected now with the data I already had. This involved matching URLs and repository names.</p>
<p>Anyways. Things should be more or less as they were. A few little tweaks have been added. A few bugs still remain.</p>
<p>As ever you can view the code and changes here: <a href="http://trac.nostuff.org/ircount/browser/trunk">http://trac.nostuff.org/ircount/browser/trunk</a></p>
<p>And checkout the svn here: <a href="http://svn.nostuff.org/ircount/">http://svn.nostuff.org/ircount/</a></p>
<p>ircount can be found here: <a href="http://www.nostuff.org/ircount/">http://www.nostuff.org/ircount/</a></p>
<p>No related posts.</p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2010/ircount-update/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summon @ Huddersfield</title>
		<link>http://www.nostuff.org/words/2010/summon-huddersfield/</link>
		<comments>http://www.nostuff.org/words/2010/summon-huddersfield/#comments</comments>
		<pubDate>Tue, 25 May 2010 21:38:05 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[catalogue]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[jisclms]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[resource discovery]]></category>
		<category><![CDATA[summon]]></category>
		<category><![CDATA[summon4hn]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=562</guid>
		<description><![CDATA[I attended an event at Huddersfield looking at their and Northumberland&#8217;s experience of Summon. http://library.hud.ac.uk/blogs/summon4hn/?p=22 These are my rough notes. Take all with a pinch of salt. The day reaffirmed my view of Summon, it is ground breaking in the Library market, and with no major stumbling blocks. They are very aware that coverage is key [...]
No related posts.]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2010/summon-huddersfield/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2010/summon-huddersfield/" data-text="Summon @ Huddersfield" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2010/summon-huddersfield/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>I attended an event at Huddersfield looking at their and Northumberland&#8217;s experience of Summon. <a href="http://library.hud.ac.uk/blogs/summon4hn/?p=22">http://library.hud.ac.uk/blogs/summon4hn/?p=22</a> These are my rough notes. Take all with a pinch of salt.</p>
<p>The day reaffirmed my view of Summon, it is ground breaking in the Library market, and with no major stumbling blocks. They are very aware that coverage is key and seem to be adding items and publishers. It searches items that a organisation has access to (though users can tick a option to search all items in the kb, not just those they can access). They have good metadata, merging records from a number of sources, and making use of subject headings (to refine or exclude from the search).</p>
<p>There was general consensus that it made sense to maintain only one Knowledge-base, and therefore in this case, using 360 Link if implementing Summon. There was also general dissatisfaction for federated search tools.</p>
<p>To me, and I must stress this is a personal view, there are two products that I have seen which are worth future consideration: Summon and Primo. Summon&#8217;s strength is in the e-resources realm and as a resource discovery service. Primo&#8217;s strength, while offering these features/services, is as a OPAC (with My account features etc) and personalisation (tags, lists). Both products are in a stage of rapid development.</p>
<p>In my view, one decision to implement one of these products &#8211; which ever it is &#8211;  will have a chain reaction. And I think this is an important point. Using Sussex as an example, it currently has Aquabrowser (as a Library Catalogue), Talis Prism 2 (for Borrower Account, reservations, renewals), SFX (Link Resolver) and Metalib (Federated Search).</p>
<p>Two example scenarios (and I stress there are other products on the market and this is just my personal immediate thoughts):</p>
<p><strong>One:</strong> Let&#8217;s say Sussex first decide to replace Metalib with Summon. They would probably cancel Metalib (Summon replaces it). Probably move from SFX to 360 Linker (one Knowledge base). May then wish to review our Library Catalogue in a years time: Primo is no longer on the cards (too much cross over with Summon, which we now already have), so they either stick with Aquabrowser (but the new SaaS v3 release) or perhaps move to Prism 3 (Talis&#8217; new-ish SaaS Catalogue). Sussex would end up with no Ex Libris products, but would potentially subscribe to several Serial Solutions products.</p>
<p><strong>Two:</strong> Let&#8217;s say Sussex decide to replace Aquabrowser with Primo (which acts more like a Catalogue than AB). They cancel Aquabrowser. Primo would (in addition to being the primary OPAC) have Summon-like functionality, allowing users to search a large database of items instantly, with relevance and facets. So Summon would not be an option. Stick with SFX (Metalib would be a side feature of Primo, with a Primo-like interface). With a number of Ex Libris products they would want to keep an eye on the Ex Libris URM (next genration LMS), they would have no Serials Solutions products.</p>
<p>The following are some notes from the day:</p>
<p><strong>Sue White</strong> from Huddersfield Library started the day, saying it is probably the best decision they have ever made.</p>
<div><strong>Helle Lauridsen</strong> from Serials Solutions Europe started with a basic introduction of what it is and what their key aims were (i.e. be like google).</div>
<div>She emphasised that all content (different types and publishers) is treated equally.</div>
<div>
<p>&#8220;better metadata for better findability&#8221;. merge metadata elements. Use SerialSolutions, urichs, Medline, crossref to create the best record. &#8216;record becomes incredibly rich&#8217;.</p>
<p>She went through all the new features added in the last 12 months, including a notable size in the knowledge base. &#8216;dials&#8217; to play with relevancy of different fields. Recommender service coming.</p>
<p>Shows a list of example publishers, included many familiar names, have just signed with Jstor. She showed increase in &#8216;click throughs&#8217; for particular publishers, the biggest were for jstor and ScienceDirect. Newspapers have proved to be very popular.</p>
<p>There is an advanced search. There has been negative feedback &#8216;please bring back title/author search&#8217;.</p>
<p><strong>Eileen Hiller</strong> from Huddersfield talked about product selection. She mentioned having people from across the Library and campus on the selection/implementation group, getting student feedback and talking to academics. They used good feedback in their communications (e.g. in the student newspaper and their blog). Student feedback <a href=http://092.me>question</a>naire has been useful.</p>
<p><strong>Dave Pattern</strong> talks about the history of e-resource access at Huddersfield, started with a online word document and then a onenote version. Metalib was slow, and they found more students using Google Scholar than metalib.</p>
<p><a href="http://www.nostuff.org/words/wp-content//2010/05/IMG_0220.jpg"><img class="alignright size-medium wp-image-564" title="Huddersfield Summon Timeline" src="http://www.nostuff.org/words/wp-content//2010/05/IMG_0220-300x224.jpg" alt="" width="300" height="224" /></a></p>
<p>They started with a blank sheet of paper and as a group thrashed out their ideal product, without knowing about Summon. First class search engine, &#8216;one stop shop&#8217;, improved systems management, etc. Invited a number of suppliers in, showed them the vision and asked them to present their product against it, Huddersfield rated each one against The Vision. Report to Library Management Group. Summon was the clear fit.</p>
<p>Implementation: Starts off with a US conference call. MARC21 mapping spreadsheet, they went with defaults. they add a unique id to the 999|a field.</p>
<p>Be relistic with early implementation, e.g. lib cat and repository are only two local databases. Be aware of when you LMS deletes things flagged for deletion. Huddersfield had early issues with this.</p>
<p>Do you want your whole catalogue on Summon? ebook/ejournal records etc.</p>
<p>Summon originally screenscraped for holdings/availability (aquabrowser does this for Sussex) could bring the traditional catalogue to its knees.</p>
<p>Moving to 360 Link makes you life much easier if moving to Summon, only one Knowledgebase to maintain.</p>
<p>They asked Elsevier to create a custom file for their sciencedirect holdings to upload to 360.</p>
<p>Huddersfield found activating journals in 360 a quick process.</p>
<p>360 API more open than Summon API. for customers only. You can basically build your own interface. Virginia using it to produce a mobile friendly version of their catalogue. Hud used it to identify problem MARC records.</p>
<p>94% of Huddersfield subscribed journals are on Summon (No agreement with the following: BSP 80%, Sciencedirect, Jstor&#8230; Westlaw/LexisNexis 55%). They now have a agreement with LexisNexis and Jstor. In discussion with Elsevier. They manage to have this level of coverage for these reources by using other sources for the data (e.g. publishers for Business source premier and A&amp;I databases for ScienceDirect).</p>
<p>Dummy journal records for journal titles (print and e) so that they are easily found on Summon. <a href="http://hud.summon.serialssolutions.com/search?s.q=journal+of+clinical+nursing">See this example</a>.</p>
<p>Can recommend specific resources (&#8216;you might be interested in ACM Digital Library&#8217;), can be useful for subjects like Law.</p>
<p>Summon at Hudderfield now has 60 million items (see <a href="http://hud.summon.serialssolutions.com/search?s.q=">left hand side</a> for breakdown),  indexed. Judging by <a href="http://hud.summon.serialssolutions.com/search?s.cmd=setHoldingsOnly(false)">this</a> Summon seems to have 575 million items indexed in total.</p>
<p>Survey results: Users found screens easy to understand. many (43%) refined their results. Dave thinks that now Google has facets on the left may increase facet usage. 82% for results were relevant to their research topics.</p>
<p>They will go live in July. Currently working on training materials and staff training. Considering adding archives and Special Collections in the future.</p>
<p><strong>Annette Coates</strong>, Digital Services Manager, Northumbria Uni.</p>
<p><a href="http://www.nostuff.org/words/wp-content//2010/05/IMG_0221.jpg"><img class="alignright size-medium wp-image-565" title="Anette" src="http://www.nostuff.org/words/wp-content//2010/05/IMG_0221-300x224.jpg" alt="" width="300" height="224" /></a></p>
<p>She gave a history of e-resource provision, 2005 onwards: webfeat (they brand it nora, which they are keeping for Summon). &#8216;We have the same issues with federated search that everyone else has&#8217;. Both Northumbria and Huddersfield are keeping a seperate A-Z list for e-resources (N are using libguides, like Sussex).</p>
<p>User Evaluation: is it improving the user&#8217;s search experience? how can we improve it futher? NORA user survey. Timing important, Getting people involved, Incentives, Capturing the session. They will use all the user feedback in a number of ways, &#8216;triangulate to ensure depth&#8217;, use good quotes as a marketing tool (including to lib staff), feedback good/bad to Serials Solutions, use it to improve the way they show it to others&#8230;</p>
<p><a href="http://www.nostuff.org/words/wp-content//2010/05/IMG_0222.jpg"><img class="alignright size-medium wp-image-566" title="Northumbria Summon implementation timeline" src="http://www.nostuff.org/words/wp-content//2010/05/IMG_0222-300x225.jpg" alt="Northumbria Summon implementation timeline" width="300" height="225" /></a></p>
<p><strong>Q&amp;A</strong></p>
<p>Focus groups, guidance?<br />
very little guidance in focus group, and let them play with it</p>
<p>What is the position regarding authentication?</p>
<p>N use citrix. Will be Shibolising their 360link.</p>
<p>H channeling as much as possible through ezproxy. don&#8217;t have shib. promote usage though usage portal, which authenticates them.</p>
<p>No shibboleth integration at the moment.</p>
<p>(discussion about how summon may mean you can stop trying to add journal records, and can raise lots of <a href=http://092.me>question</a>s&#8230; should summon be the interface on your catalogue kiosks).</p>
<p>You can send list of ISSNs to Serial Solutions to see matches, to find out what your coverage would be.</p>
<p>There was a very vague indication that OPAC integration may be on the cards for Summon. This is an important thing IMO.</p>
<p>Number of comments about Library staff being far more critical than users.</p>
<p>Summon ingesting stuff (MARC) from LMS 4 times a day. Using DLF standard for getting holdings data from LMS. (this is a good thing). Huddersfield wrote the DLF protocol code.</p>
<p>Q: Are SerialSolutions (proquest) struggling to get metadata from their direct competitors?</p>
<p>A: SS: Ebsco the main one, but we go direct to publishers. And for Elsevier, able to index it from elsewhere (and in talks with them).</p>
<p>Q: lexis and westlaw, where only 50% coverage, how do students know to go elsewhere (i.e. direct to the resource)?</p>
<p>A: for law students point them to e-resource pages (wiki) as well as summon to promote direct access to them. also (and perhaps more importantly) will have recommender which can recommend lexis/westlaw for law searches.</p>
<p>Q: can you search the whole summon kb, not just those things we subscribe to?<br />
yes</p>
<p>Q: Are there personalisation options? (saving lists, items, marking records)<br />
May come in the future, summon are thinking about it.</p>
</div>
<p>No related posts.</p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2010/summon-huddersfield/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Dev8D</title>
		<link>http://www.nostuff.org/words/2010/dev8d/</link>
		<comments>http://www.nostuff.org/words/2010/dev8d/#comments</comments>
		<pubDate>Sun, 28 Feb 2010 20:07:22 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[code]]></category>
		<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[universities]]></category>
		<category><![CDATA[dev8d]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=500</guid>
		<description><![CDATA[This isn't a comprehensive review of dev8d, checkout the twitter stream (that link uses twapperkeeper by the way, the guy who created it was there and it was great talking to him), and wiki for and much else on the web for what went on.
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2012/dev8d-2/' rel='bookmark' title='Dev8D'>Dev8D</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2010/dev8d/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2010/dev8d/" data-text="Dev8D" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2010/dev8d/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>This isn&#8217;t a comprehensive review of dev8d, checkout the <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://twapperkeeper.com/dev8d/">twitter stream</a> (that link uses twapperkeeper by the way, the guy who created it was there and it was great talking to him), and <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://wiki.2010.dev8d.org/w/Main_Page">wiki</a> for and much else on the web for what went on.</p>
<p><span id="more-500"></span></p>
<p>Things I saw that I liked:</p>
<ul>
<li>An iphone app developed which allowed you to point your iphone in different directions (think <a href="http://www.youtube.com/watch?v=U2uH-jrsSxs">augmented reality</a>) and show what was currently on in the room your iphone was pointing in.</li>
<li>This was possible because the wiki and programme exposed data in a machine readable way, using RDF &amp; ical for each event, geo co-ordinates for rooms. With out this the above would not have been more difficult to produce. It proves the saying: <em>the coolest thing to be done with your data will be thought of by someone else</em>. And likewise, set your data free.</li>
<li>Another app, little more than a simple iphone switch, allowed you to <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://dev8d.jiscinvolve.org/2010/02/26/world-first-at-dev8d-open-source-iphone-app-for-home-automation/?utm_source=twitterfeed&amp;utm_medium=twitter">turn a light on and off</a>. Of course the iphone could have been anywhere in the world when it turns the light on and off, the app calls a REST web interface on a notebook running a webserver which was connected to the light. Simple implementation of a <a href=http://092.me>nice</a> idea, which much potential (someone was discussing how they want to be able to control their heating from their phone if they are out for the night, or if they are coming home early).</li>
<li>RFID tags connected to the door of each room, each attendee had a small RFID tag so they could record if they entered or passed a room. A web interface was built to display this data and lots of ideas of how we could build on this. The RFID readers where a simple <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://www.touchatag.com/e-store">consumer model</a> (from Touchatag), connected to netbooks and similar near by, and a mac mini acting as a central place to collect all the data. It was coded in ruby and used <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://code.google.com/p/redis/">Redis</a> as a quick and neat way to collect the data.</li>
<li>A Developer Happiness <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://twitpic.com/15fx3b">iphone app</a> which shows a number out of 10 representing on average how happy those at Dev8d are. It builds on the <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://samscam.co.uk/happier/">Happiness Pipe</a> site built last year.</li>
<li>I&#8217;m not sure how far they got, but some explored an open source OCR code library. They had the idea they could use these to try and OCR a frames from a videos on the web. This would have massive potential. indexing words from videos (say on youtube) whether super implanted on the screen, or signs and general wording in the image, would help improve searching for videos no end for starters. (of interest to us whose work includes resource discovery).</li>
<li>A cheap 3D printer, which can almost replicate (print) itself. Clearly a device that can build a huge array of items (from wine glasses to children&#8217;s shoes, though made out of a plastic like substance) has potential to change the world we live in. Was very impressive to see it.</li>
<li>Lots of hardware hacking going on and clever genetics stuff too, in fact loads not covered here.</li>
</ul>
<p><img title="IMG_0130.jpg" src="http://www.nostuff.org/words/wp-content//2010/02/IMG_0130.jpg" alt="IMG_0130.jpg" width="360" height="480" /></p>
<p>I created a twitter account which posts the latest blogs, videos and photos. You can <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://chriskeene.posterous.com/dev8d-google-reader-and-dipity-streams">read about it here</a> (this is more for info, sounds stupid when coming after the amazing things above). I also started to learn Ruby on Rails and python.</p>
<p>I was somewhat envious of those who by the end of the few days were able to say &#8216;look what I&#8217;ve built&#8217;, but then I&#8217;m no a developer as such, and certainly not as clever or geeky as some at the event. That is not to say I wasn&#8217;t busy, trying out lots of little things, discussing things with others, meeting lots of new people, attending expert talks and a couple of coding labs and trying out lots of things.</p>
<p>An example of 45 minutes: On seeing a tweet saying <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://inkdroid.org/journal/about/">Ed Summers</a> (LCSH Linked Data fame) was starting his expert talk for 15 minutes I went running in to that room to catch it (with power cable tailing behind me), great talk and was followed by a couple of guys from <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://www.mendeley.com/">Mendeley</a> &#8211; a great app and like last.fm they are collecting a lots of data which we should be able to do some interesting things with using the new API. This was followed by a great talk from Kate Pekacar from the MLA explaining their efforts to open the data of various collections and their keenness that the likes of Dev8D create useful new ways for people to discover these collections. Three different talks yet all interesting stuff <em>and</em> related to the day job.</p>
<p>I was lucky enough to attend the awards ceremony on the Friday night. One of those receiving an award was Garry Bulmer, who had run the popular Arduino workshops, who said teaching and working with those who were so enthusiastic and smart was a joy. This seemed to sum up the event well. It works because those attending want to challenge themselves, try something new, work with new people and move sightly outside their comfort zone. It would be easy to not do much at barcamp style events, but the reverse actually happens as people try and squeeze so much in to the few days, often working on tasks late in to the night.</p>
<p>If I had to sum up what I got out of this &#8211; a difficult task not due to lack of quantity but the sheer broadness and soft qualities of what has been gained &#8211; I&#8217;d add to what I&#8217;ve already out above:</p>
<ul>
<li>There were 15 minute expert lightening talks that have informed and got me thinking more than entire one day workshops and conferences.</li>
<li>In terms of hard facts, getting started with two programming languages both very useful when working and developing for the web. Learning these would have taken days in themselves, it&#8217;s hard to stress how much you can pack in to an afternoon when you have an excellent expert leading the session working with bright people in the room (my brain hurt a lot throughout).</li>
<li>Motivation is hard to identity, quantify and where exactly it originates, but I come away from this even with a higher opinion of the industry I work in, the Tech/dev community around Higher Education and our ability to break through old ways of working and needless bureaucracy. I think it helps those attend see a bigger picture to what they are doing. Tired yet with a fresh urge to build new things and Get Things Done.</li>
<li>I ended up talking to lots of people I wouldn&#8217;t do normally, neither personally or socially. This is useful in itself, but I think the experience itself is useful (especially for the socially inept like me). I found myself a few times suddenly talking to people (often from the States) who have created high profile web apps, including those which have nothing to do with HE or libraries.</li>
</ul>
<p>All in all. Tired. Good event. Good food. Metropolitan line crap. Well organised. Nice people. Brain hurts. Learnt lots. Shambrarian meeting a success . Chatted Lots. Drank a moderate amount while adhering to medical daily recommended limits. Obviously.</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2012/dev8d-2/' rel='bookmark' title='Dev8D'>Dev8D</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2010/dev8d/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ircount development</title>
		<link>http://www.nostuff.org/words/2009/ircount-development/</link>
		<comments>http://www.nostuff.org/words/2009/ircount-development/#comments</comments>
		<pubDate>Thu, 26 Nov 2009 20:59:18 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[code]]></category>
		<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[me me me]]></category>
		<category><![CDATA[ircount]]></category>
		<category><![CDATA[php]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=483</guid>
		<description><![CDATA[I've finally got around to spending a bit of time on the ircount code.

This post goes through some of the techy stuff behind it. If you're just interested in features, I'm afraid there's none yet, but you can now compare more than 4 repositories, but that's as far as you'll want to read.
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/ircount-new-location-new-functionality/' rel='bookmark' title='ircount : new location, new functionality'>ircount : new location, new functionality</a></li>
<li><a href='http://www.nostuff.org/words/2009/ircount-repository-record-statistics/' rel='bookmark' title='ircount : Repository Record Statistics'>ircount : Repository Record Statistics</a></li>
<li><a href='http://www.nostuff.org/words/2010/ircount-update/' rel='bookmark' title='ircount : update'>ircount : update</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2009/ircount-development/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2009/ircount-development/" data-text="ircount development" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2009/ircount-development/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>I&#8217;ve finally got around to spending a bit of time on the<em> <a href="http://www.nostuff.org/ircount/">ircount</a></em> code.</p>
<p>This post goes through some of the techy stuff behind it. If you&#8217;re just interested in features, I&#8217;m afraid there&#8217;s none yet, but you can now compare more than 4 repositories, but that&#8217;s as far as you&#8217;ll want to read.<span id="more-483"></span></p>
<p><strong>Subversion and Track</strong></p>
<p>The first thing I did a few months a go was create a Subversion repository. I seemed to time this quite well in learning how to use subversion just as every man and his dog gets excited by GIT.</p>
<p>I also installed <a href="http://trac.edgewall.org/">Trac</a> (using <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://www.nostuff.org/words/dreamhost-hosting/">Dreamhost</a>&#8216;s useful one-click install), which I haven&#8217;t really used other than to browse the code and view changes (time-line).</p>
<p>The repository is publicly available from here: <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://svn.nostuff.org/ircount/">http://svn.nostuff.org/ircount/</a></p>
<p>The trac site (which can be used to browse the code) is found at: <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://trac.nostuff.org/ircount/">http://trac.nostuff.org/ircount/</a></p>
<p>It took me a while to get svn working well. Originally I would edit the files in a local working copy of my Macbook, and then use subversion to load these to the server to test. Of course, this meant every little change had to be manually checked in to test it. I got apache and php working on the Macbook, and setup the Mysql db (on the Dreamhost server) to allow connections remotely which allowed me to test/use files located in my local copy. This seems to work well.</p>
<p>I use <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://code.google.com/p/svnx/">svnx</a> as a GUI Mac client. It&#8217;s free and is easy to use.</p>
<p><strong>The code. The rewrite</strong></p>
<p>I don&#8217;t do much code writing or developing. Anyone glancing at my work will take that as plain obvious.</p>
<p>I&#8217;ve realised when writing code I have a tendency to be very linear and unconsciously put efficiency before good design. Anything more than one database call per page was unthinkable, loading in other pages a sin, which often led to large unwieldy while loops processing the results from a massive database call. Database calls are mixed with html output mixed with logic.</p>
<p>This is just about ok when showing information about one Institutional Repository. But when comparing a number, it becomes unreadable, and not in any state to be reused. The key aspect of comparing a number of IRs is that you need to ascertain a number of facts for the page as a whole (the earliest data collection date for the page, the higher record count for the page &#8211; for the chart for example &#8211; which could be from any of the IRs).</p>
<p>My  aim was that the page files should be little more than calls to a few discrete functions.</p>
<p>I&#8217;m not quite there yet, but it&#8217;s a start. <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://trac.nostuff.org/ircount/browser/trunk/archive.php">archive.php</a> is mainly a set of function calls, but there is still too much in there, and random bits of html code dotted around. There&#8217;s also too many arrays holding information the repository (php) objects can provide. <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://trac.nostuff.org/ircount/browser/trunk/include.php">include.php</a> holds the functions, but is now a little bit unwieldy itself, with functions ordered randomly. The third file of note is <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://trac.nostuff.org/ircount/browser/trunk/class.archive.php">class.archive.php</a>. This is a repository object, which can grab data from the db for the repository, and return it to the calling page in various ways.</p>
<p>My original plan was to merge the code for showing one repository, with the code to show more than one, to make it easier to implement changes (not having to update two files). By the end of it, I&#8217;m now wondering if it would be easier to have two files again, for the little changes, which both call the same core functions.</p>
<p><strong>Google Chart</strong></p>
<p><a href="http://code.google.com/apis/chart/">Google Chart </a>is a great API and I recommend anyone to play with it.</p>
<p>However one of the problems of the last version of ircount is that the URLs for the chart images often became too long (more than 2,000 characters) resulting in no chart being shown. The Chart URL includes each data point separated by a comma, so four repositories multiplied by 100 weeks (for example), multiplied by 4 or 5 digits per datapoint (4 digit number plus a comma), it soon adds up.</p>
<p>One solution was to only pass data per month rather than per week (roughly reducing the number of data points to a quarter of the original). Another would have been (and probably will do in the future) to make use of Google Chart&#8217;s encoding function, made easy using<a href="http://bendodson.com/2008/02/28/google-extended-encoding-made-easy/"> these helpful functions</a>.</p>
<p>Overcoming this in an efficient way was a challenge. Originally I had a Google Chart PHP object. IR data would be passed to it in one method, and another method would return the URL.</p>
<p>This seemed sensible at the time, but deciding which php object did what became confusing. For example, the first thing the chart object needed to do was decide if the URL would be too long for the Repositories in <a href=http://092.me>question</a>, taking in to account the data we had for each repository. Should the chart object loop around each data point for each repository to first decide how many there are? or should the repository object be handling this by telling the chart object how much data it has? How to avoid the need to loop through the same data several times. Does it matter which object does the work? It&#8217;s for the chart, so the chart object should do it, yet other parts of the page may want this info about the repositories, so the repository objects should provide it for all.</p>
<p>In the end, I did away with the chart object and used a function instead, which is passed an array of repository objects, which in turn handle a lot of work.</p>
<p><strong>Future</strong></p>
<p>The foundations are about there. For any page I (or anyone else) wishes to create. A couple of lines are all that are needed to take one or more repository IDs passed in the URL and load in all the data for them, ready to be used as needed. We can then easily call a table or chart to display for these repositories (or a subset of them).</p>
<p>As I mentioned above, the only real improvements are the ability to show more than 4 repositories at once (the chart stops showing once you get to about nine repositories), and the chart is more robust and will now show when it would have failed to do so in the past.</p>
<p>Google Chart does have an encoding which allows far more to be passed in a condensed way, and <a class="libx-autolink" style="border-bottom: 1px dotted;" href="http://bendodson.com/2008/02/28/google-extended-encoding-made-easy/">this php function</a> looks very useful for using it.</p>
<p>If I was starting again today I would look to use a framework such as <a href="http://framework.zend.com/">Zend Framework</a> or <a href="http://cakephp.org/">CakePHP</a>, or maybe even have a go at <a href="http://rubyonrails.org/">Ruby on Rails</a>. But perhaps a third rewrite is a little over the top for now.</p>
<p>I need to tidy up the <a href="http://www.nostuff.org/ircount/table.php?country=uk">table view</a> a bit (some <a href="http://trac.nostuff.org/ircount/browser/tags/09nov/table.php">nasty code there</a>) and then look to a few new features, may be collecting more data, and exposing it in some computer friendly formats such as atom.</p>
<p>So<a href="http://www.nostuff.org/ircount/"> ircount</a> is really no more than a play thing for a bad coder to make mistakes and learn a little bit along the way. slowly. but if you have any ideas or thoughts I would love to hear them.</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/ircount-new-location-new-functionality/' rel='bookmark' title='ircount : new location, new functionality'>ircount : new location, new functionality</a></li>
<li><a href='http://www.nostuff.org/words/2009/ircount-repository-record-statistics/' rel='bookmark' title='ircount : Repository Record Statistics'>ircount : Repository Record Statistics</a></li>
<li><a href='http://www.nostuff.org/words/2010/ircount-update/' rel='bookmark' title='ircount : update'>ircount : update</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2009/ircount-development/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Amazon AWS EC2 and vufind</title>
		<link>http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/</link>
		<comments>http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/#comments</comments>
		<pubDate>Sat, 11 Jul 2009 21:53:36 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[interesting]]></category>
		<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[ec2]]></category>
		<category><![CDATA[vufind]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=419</guid>
		<description><![CDATA[Today I saw a tweet from juliancheal, which mentioned he was setting up his virtual server on slicehost. I hadn&#8217;t heard of this company but their offering looks interesting. This got me thinking about cloud hosting and I decided it was time to actually try out Amazon&#8217;s AWS EC2.This allows you to run a virtual [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/' rel='bookmark' title='VuFind in 8 minutes using Amazon EC2'>VuFind in 8 minutes using Amazon EC2</a></li>
<li><a href='http://www.nostuff.org/words/2007/amazon-censor-reviews-of-the-high-arts/' rel='bookmark' title='Amazon : censor reviews of the high arts'>Amazon : censor reviews of the high arts</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/" data-text="Amazon AWS EC2 and vufind" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>Today I saw a tweet from <a href="http://twitter.com/juliancheal">juliancheal</a>, which mentioned he was setting up his virtual server on <a href="http://www.slicehost.com/">slicehost</a>. I hadn&#8217;t heard of this company but their offering looks interesting. This got me thinking about cloud hosting and I decided it was time to actually try out <a href="http://aws.amazon.com/ec2/">Amazon&#8217;s AWS EC2</a>.This allows you to run a virtual server (or multiple servers) in the Amazon cloud, servers can be created and destroyed by a click of a button.</p>
<p>First thing is to get a server &#8216;image&#8217; to run in the cloud. Thankfully many have already been created. I went for a recent <a href="http://alestic.com/">ubuntu server by Eric Hammond</a>. This is basically a ubuntu server vanilla install, but with a few tweaks to work <a href=http://092.me>nice</a>ly as a EC2 virtual server. Perfect!</p>
<p>Signing up is quick and easy, it just uses your Amazon (the shop) credentials. Once created, you are taken back to the main control panel where you can see your new instance, including details like the all important public DNS name.  Just save a private key to your computer and use it to ssh in to your new server.</p>
<p>e.g.: ssh -i key1.pem root@ec2-174-129-145-xx.compute.amazonaws.com</p>
<p>(you may need to chmod 400 the key file, but all this is documented)</p>
<p>Once in, well it&#8217;s a new server, what do you want to do with it?</p>
<p>I installed a LAMP stack (very easy in ubuntu: <em>apt-get update</em> and then <em>tasksel install lamp-server</em>). I initally couldn&#8217;t connect to apache (but could from the server itself using &#8216;telent localhost 80&#8242;). I presumed it was a ubuntu firewall issue, but it turned out you also control these things from the AWS control panel. The solution was to go to &#8216;security groups&#8217; and modify the group I had created when setting things up and adding HTTP to &#8216;Allowed Connections&#8217;. This couldn&#8217;t of been easier. And then success, I could point my browser at the DNS name of the host and saw my test index page from the web server.</p>
<p><a href="http://www.nostuff.org/words/wp-content//2009/07/Picture-8.png"><img class="aligncenter size-medium wp-image-420" title="Amazon aws control panel, modify to allow http connections" src="http://www.nostuff.org/words/wp-content//2009/07/Picture-8-300x131.png" alt="Amazon aws control panel, modify to allow http connections" width="300" height="131" /></a></p>
<p>So now what? I pondered this out loud via Twitter, and got this reply:</p>
<p><a href="http://www.nostuff.org/words/wp-content//2009/07/Picture-9.png"><img class="aligncenter size-medium wp-image-421" title="vufind-twitter" src="http://www.nostuff.org/words/wp-content//2009/07/Picture-9-300x179.png" alt="vufind-twitter" width="300" height="179" /></a></p>
<p>Excellent idea!</p>
<p>Good news:<a href="http://vufind.org/"> vufind</a> has some good &#8211; and simple &#8211; documentation for installing on ubuntu;</p>
<p><a href="http://vufind.org/wiki/installation_ubuntu">http://vufind.org/wiki/installation_ubuntu</a></p>
<p>Following the instructions (and editing them as well, they specified an earlier release and lacked a couple of steps if you weren&#8217;t also reading the more general install instructions) I quickly had a vufind installation up and running. Took around 20-25 minutes in all.</p>
<p>Now to add some catalogue data to the installation. I grabbed a MARC file with some journal records from one of our servers at work and copied it across as a test (I copied it just by using a scp command logged in to my ec2 server). After running the import script I had the following:</p>
<p><a href="http://www.nostuff.org/words/wp-content//2009/07/Picture-7.png"><img class="aligncenter size-medium wp-image-422" title="vufind results." src="http://www.nostuff.org/words/wp-content//2009/07/Picture-7-300x218.png" alt="vufind results." width="300" height="218" /></a>If the server is still running when you read this then you can access it here:</p>
<p><a href="http://ec2-174-129-145-75.compute-1.amazonaws.com/vufind/">http://ec2-174-129-145-75.compute-1.amazonaws.com/vufind/</a></p>
<p>EC2 is charged by the hour, and while cheap, I can&#8217;t afford to leave it running for ever. <img src='http://www.nostuff.org/words/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>So, a successful evening. Mainly due to the ease of both Amazon EC2 and Vufind.</p>
<p>A final note that if you are interested in EC2 you may want to look at some notes made by Joss Winn as part of the jiscpress project: <a href="http://code.google.com/p/jiscpress/wiki/AmazonWebServices">http://code.google.com/p/jiscpress/wiki/AmazonWebServices</a></p>
<p>Both Ec2 and vufind are worth further investigation.</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2011/vufind-in-8-minutes-using-amazon-ec2/' rel='bookmark' title='VuFind in 8 minutes using Amazon EC2'>VuFind in 8 minutes using Amazon EC2</a></li>
<li><a href='http://www.nostuff.org/words/2007/amazon-censor-reviews-of-the-high-arts/' rel='bookmark' title='Amazon : censor reviews of the high arts'>Amazon : censor reviews of the high arts</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2009/amazon-aws-ec2-and-vufind/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Data Imperative: Libraries and Research Data : comment</title>
		<link>http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/</link>
		<comments>http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/#comments</comments>
		<pubDate>Wed, 03 Jun 2009 20:20:08 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[me me me]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[open access]]></category>
		<category><![CDATA[research]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=401</guid>
		<description><![CDATA[I put this in to a seperate post. It continues on from my previous post, but didn&#8217;t want my notes of the day to be taken over by my ill thought views. Personal Thoughts Reluctant to give some thoughts as I know so little about the service. However&#8230; (!) There seems to be two clear [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/' rel='bookmark' title='Event: The Data Imperative: Libraries and Research Data'>Event: The Data Imperative: Libraries and Research Data</a></li>
<li><a href='http://www.nostuff.org/words/2009/linked-data-rdf-draft-notes-for-comment/' rel='bookmark' title='Linked data &amp; RDF : draft notes for comment'>Linked data &#038; RDF : draft notes for comment</a></li>
<li><a href='http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/' rel='bookmark' title='Research in the Open: How Mandates Work in Practice'>Research in the Open: How Mandates Work in Practice</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/" data-text="The Data Imperative: Libraries and Research Data : comment" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>I put this in to a seperate post. It continues on from my previous post, but didn&#8217;t want my notes of the day to be taken over by my ill thought views.</p>
<p><strong>Personal Thoughts</strong></p>
<p>Reluctant to give some thoughts as I know so little about the service. However&#8230; (!)</p>
<p>There seems to be two clear areas here: Data formatting and Data storing. There is some linkage (Preserving surely covers both, formats can become obsolete, Servers die), yet the two seem to be somewhat seperate.</p>
<p>Both require IT skills, but IT is a broad church, the former is technical metadata (and is very much IT and library) and in the general area that I sees covered in the Eduserv <a href="http://efoundations.typepad.com/">efoundations</a> blog.</p>
<p>The latter in its simplest form is hard core infrastructure. Disks, sans, servers, security, but also has elements at the application level (how do we access it, using what software, repositories? CRIS? Fedora?).</p>
<p>On another issue, while it is easy to say that libraries should take the lead, I think we need to be cautious. With the current climate of frozen or decreasing budgets nationally, and journal subscription pressure, how wise is it to go to the University&#8217;s executive and demand funding for resources/staff for data management. We know it&#8217;s important and could make the process of research more efficient, but there are other things higher up a Universities list of priorities (NSS/atracting good students, REF, research funding). Even at a library level, journals help researchers do research (which brings funding), and keep students happy because we have the stuff they need (NSS). How many journals should we cancel to focus on Research Data? Why? The recent JISC call will help with providing a business case.</p>
<p>The problem at the moment is that there are not enough clear benefits for most Universities to steam ahead with this. Let&#8217;s clarify this: not enough benefits for the institution itself. The benefits are for the UK as a while (actually, the while world). It&#8217;s the UK-wide economy and research that will benefit. So maybe it needs UK-wide funding. It&#8217;s easier to convince someone (or something) to spend money when the benefits for them are clear. In this case the benefits are for UK so it should the UK which sets aside explicit cash (via HEFCE, JISC, and so on).</p>
<p>And this is happening, with the JISC call (talked about today), amongst other things it will help build examples.</p>
<p>But I&#8217;m not sure if the institutional level is the best one. Australia has been successful with a centralised approach. We have a number of small Universities, and those which only have one or two departments which are research active. Yet the resources/knowledge required of them will be similar to that of a large institution. Will this leave them at a disadvantage?</p>
<p>On another note, it seems the range of data is vast. When dicussing this, I always &#8211; incorrectly &#8211; picture text based data, of vearying size, perhaps using XML. Of course this is blinkered. For auido, images and similar should a data service just provide a method to download, or a method to browse and view/listen? When it comes to storage and delivery, should we just treat all data as &#8216;blobs&#8217; &#8211; things to be downloaded as a file, and we no nothing more with it? This makes it easy and repository softwareapplications (eprints/dspace/fedora) are well placed to cater to this need. But I get the impression that this is somewhat simplistic. Perhaps this means a data service needs a clear scope, otherwise we could end up building front end applications which mimic flickr, youtube and last.fm all in one. A costly path to go down.</p>
<p>[all views are my own. are wrong, badly worded, ill thought, why are you reading this?, just think the opposite and it will be right, etc]</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/' rel='bookmark' title='Event: The Data Imperative: Libraries and Research Data'>Event: The Data Imperative: Libraries and Research Data</a></li>
<li><a href='http://www.nostuff.org/words/2009/linked-data-rdf-draft-notes-for-comment/' rel='bookmark' title='Linked data &amp; RDF : draft notes for comment'>Linked data &#038; RDF : draft notes for comment</a></li>
<li><a href='http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/' rel='bookmark' title='Research in the Open: How Mandates Work in Practice'>Research in the Open: How Mandates Work in Practice</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Event: The Data Imperative: Libraries and Research Data</title>
		<link>http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/</link>
		<comments>http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/#comments</comments>
		<pubDate>Wed, 03 Jun 2009 17:26:06 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[or08]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[research]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=393</guid>
		<description><![CDATA[Today I&#8217;m at the one day event &#8216;The Data Imperative: Libraries and Research Data&#8217; at the Oxford e-research Centre. As usual, these are my own rough notes. There are mistakes, gaps and my own interpretation of what was said. Paul Jeffreys : Director of IT, Oxford University. Started off giving an overview of where this [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/' rel='bookmark' title='The Data Imperative: Libraries and Research Data : comment'>The Data Imperative: Libraries and Research Data : comment</a></li>
<li><a href='http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/' rel='bookmark' title='Research in the Open: How Mandates Work in Practice'>Research in the Open: How Mandates Work in Practice</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/" data-text="Event: The Data Imperative: Libraries and Research Data" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p class="documentFirstHeading">Today I&#8217;m at the one day event &#8216;<span id="parent-fieldname-title">The <span class="highlightedSearchTerm">Data</span> <span class="highlightedSearchTerm">Imperative</span>:  Libraries and Research <span class="highlightedSearchTerm">Data&#8217; at the Oxford e-research Centre. As usual, these are my own rough notes. There are mistakes, gaps and my own interpretation of what was said.</span></span></p>
<h3 class="documentFirstHeading"><span><span class="highlightedSearchTerm">Paul Jeffreys : Director of IT, Oxford University.</span></span></h3>
<p class="documentFirstHeading"><span><span class="highlightedSearchTerm">Started off giving an overview of where this has come from. e-Research is more than just e-Infrastructure. e-Research is not just about outputs, but outputs (articles/data) are a part of this, and an discreet area to work on. </span></span></p>
<p class="documentFirstHeading"><span><span class="highlightedSearchTerm">This is a cross-discipline area, it needs academics, University executive, research office, IT and Library. Libraries have skills that have to be fed in to this.</span></span></p>
<p class="documentFirstHeading"><span><span class="highlightedSearchTerm">EIDCSR : &#8216;Enough talking,  let&#8217;s try and do it&#8217;, selected two research groups to work with, but not a pilot, a long term commitment. He talks about Oxford&#8217;s commitment to a data repository, it stresses cross agencies, mentions business models and feeds in to a senior research committee (the quote is far too long to add here!).</span></span></p>
<p class="documentFirstHeading"><span><span class="highlightedSearchTerm">As each HEI is facing the same issue, it makes sense for national activity. but how much is done locally and how much is done nationally.<br />
</span></span>
</p>
<p class="documentFirstHeading"><span><span class="highlightedSearchTerm">What is the vision of research management data? To what extent is managing research data the role of the Library/librarians? Is data management and data repositories a new kind of activity? Is it Librarians or Information Professionals who are charged to take this forward? [cjk: i thought they were one and the same]</span></span></p>
<p class="documentFirstHeading">
<h3>John K Milner : Project Manager UKRDS</h3>
<p class="documentFirstHeading">Can&#8217;t just use existing subject specific data centres. Need for cross-discipline (eg climate change) and therefore universal standards and methods so one subject can use another subject&#8217;s data with ease.</p>
<p class="documentFirstHeading">Feasibility study:</p>
<p class="documentFirstHeading">Understand what is happening today? where are the gaps. Avoid re-inventing the wheel.</p>
<p class="documentFirstHeading">Four Case studies (Bristol, Leeds, Leicester, Oxford), views of ~700 researchers over all disciplines (inc the arts).</p>
<p class="documentFirstHeading">What did they learn?</p>
<p class="documentFirstHeading">About half of the data has a useful life of 10 years? 26% has &#8216;indefinite&#8217; value, ie keep for ever&#8217; Nearly all kept locally (memory stick, departmental server, [cjk: not good!]).</p>
<p class="documentFirstHeading">21% use a national/international data centre. 18% share with them.</p>
<p class="documentFirstHeading">UK has rich landscape of facilities, skills and infrastructure.</p>
<p class="documentFirstHeading">The management of data from a research project are now starting to be directly funded, which is important.</p>
<p class="documentFirstHeading">What are others doing? Are we in step with other countries? Yes. US spending $100 million on 5 large data centres. Australians are leading in this area, and have a central approach to it. Canada and Germany also have similar developments.</p>
<p class="documentFirstHeading">Aim: to set up a framework for research data.</p>
<p class="documentFirstHeading">Why Pathfinder: not a pilot but the start of a long term commitment.</p>
<p class="documentFirstHeading">[my notes miss a bit here, had to deal with a urgent work issue]</p>
<p class="documentFirstHeading">Service must be useful and accessible. Need a framework for stakeholder engagement.</p>
<p class="documentFirstHeading">This is non-trivial. Lots of parties involved, a lot of effort needed.</p>
<p class="documentFirstHeading">Citation of datasets is of growing interest to some researchers, this may help engage the research community.</p>
<p class="documentFirstHeading">Showing a diagram of UKRDS Basic processes. Split between &#8216;Research Project process&#8217;, Research data sharing process and UKRDS Services and Administration</p>
<p class="documentFirstHeading">Diagram doesn&#8217;t focus on curation but on accessibility (inc discovery, stable storage, identity) as this seems like the most important part. Discovery:Google, Identity(auth):Shibboleth.</p>
<p class="documentFirstHeading">Making it happen.</p>
<p class="documentFirstHeading">Need clearly defined service elements, will involve DCC, RIN and data centres.</p>
<p class="documentFirstHeading">HEIs need a reliable back-office service to handle working with data.</p>
<p class="documentFirstHeading">UKRDS is extremely challenging, nothing is easy and it is expensive. Needs support of funders and HEIs, need the right bodies to show leadership and shape policy. It will take time.</p>
<p class="documentFirstHeading">Q: Is it limited to HEI or public sector (museums etc). A: a more complicated issue, but they are working with the liked of Connecting for Health and DEFRA.</p>
<p class="documentFirstHeading">Q: Copyright. A: HEI often don&#8217;t own copyright. Data Management Plan (Wellcome are funding Data planning as part of funding)</p>
<p class="documentFirstHeading">Q: Is it retrospective? A: Could be. [he did say more]</p>
<p class="documentFirstHeading">Q: Could UKRDS influence &#8216;reputational kick back&#8217; [<a href=http://092.me>nice</a> phase!] e.g. for the REF. A: Yes, in discussion with HEFCE.</p>
<p class="documentFirstHeading">Q: Research Councils A: they are in discussion with RCs but Wellcome very much taking the lead (leap of faith) in the area. The whole key is a &#8216;value proposition&#8217; which makes a case for funding this.</p>
<p class="documentFirstHeading">Q/point: Engage government/politicians.</p>
<p class="documentFirstHeading">Q: Challenge in explaining what it is, especially for subjects which are already doing something with data. How can we tap in to those already doing it? A: there is sometimes a missing link between researchers and subject national data centres. No real relationship between the two. Which is a problem in cross-subject research.</p>
<h3 class="documentFirstHeading">Research <span class="highlightedSearchTerm">data</span> management at the University of <span class="highlightedSearchTerm">Oxford</span>:  a case study for institutional engagement &#8211; Luis Martinez, OeRC, Sally Rumsey, <span class="highlightedSearchTerm">Oxford</span> University <span class="highlightedSearchTerm">Library</span> Service</h3>
<p class="documentFirstHeading">More of a &#8216;in practice&#8217; talk, rather than high level.</p>
<p class="documentFirstHeading"><strong>Luis Martinez</strong></p>
<p class="documentFirstHeading">Scoping study: &#8216;<a href="http://www.disc-uk.org/datashare.html">DataShare project</a>&#8216;. Talking to researchers they found some couldn&#8217;t understand they own old data, some wanted to publish their own data, some found data was lost when academics moved on.</p>
<p class="documentFirstHeading">Requirements: Advice/support across research cycle (where to store it, how, etc), Secure Storage for large dataset. Sustainably infrastructure.</p>
<p class="documentFirstHeading">Lots of different Oxford units need to be consulted (library, it, research technology, academics, legal, repository etc).</p>
<p class="documentFirstHeading">Findings after consultation: there is actually widespread expertise in data management and curation amongst service units, and other findings. DataShare: new models, tools, workflows for academic data sharing.</p>
<p class="documentFirstHeading"><a href="http://www.data-audit.eu/">Data Audit Framework</a>: (DAF) adapted this to Oxford needs and used it to document practices in research groups.</p>
<p class="documentFirstHeading">&#8216;<a href="http://www.google.co.uk/url?sa=t&amp;source=web&amp;ct=res&amp;cd=1&amp;url=http%3A%2F%2Fwww.disc-uk.org%2Fdocs%2Fguide.pdf&amp;ei=_VQmStD6HsLMjAey9vHdBw&amp;usg=AFQjCNGevMHDV9M5yUYsL2981q0o2lxGmw&amp;sig2=ktq8WNDkPkA2E2vxqXmYgw">Policy-making for Research Data in Repositories : a guide</a>&#8216; [pdf]</p>
<p class="documentFirstHeading">The EIDCSR challenge: two units that both research around the human heart. The two groups share the data between them and agree to produce 3d models using the combined data. They are helping this groups do this, using a &#8216;life cycle approach&#8217;.</p>
<p class="documentFirstHeading">Using the DAF to capture the requirements. Participating in the UKRDS Pathfinder (as above).</p>
<p class="documentFirstHeading">They have a blog <a href="http://eidcsr.blogspot.com/">http://eidcsr.blogspot.com/</a></p>
<p class="documentFirstHeading"><strong>Sally Rumsey</strong></p>
<p class="documentFirstHeading">Starts of by talking about the roles required regarding the library. They have Repository staff, librarians, curators, but not so sure about &#8216;data librarians&#8217;.</p>
<p class="documentFirstHeading">What should of data should they be responsible for? Some stuff can go to a national service. There are vast datasets (eg Oxford Supercomputing centre), who has the expertise to make these specialised datasets available. Some departments already have provision in place, fine, why rock the boat.</p>
<p class="documentFirstHeading">Long tail. Every thing else (not above). No other home, lots of it, Academics asking for it, highly individual (ie unique), hums and sciences.</p>
<p class="documentFirstHeading">Things to consider: live or changing data Freely available or restricted? Long term post project?</p>
<p class="documentFirstHeading">Showing what looks like a list of random words/letters/strings of chars, an example of some data they were asked to look after from the English department.</p>
<p class="documentFirstHeading">Showing a diagram showing that Fedora (a repository system which is strong on metadata/structure but lacks an out of the box UI) is key to the setup. many applications can sit on top of it. Institutional Repository is just one application which runs on top of Fedora.</p>
<p class="documentFirstHeading">ORA (IR) for DATA: actual data can be held anywhere in University but ORA is a place of discovery. Allows for referencing of data. Might want to link to &#8216;<a href="http://databank.ouls.ox.ac.uk/">DataBank</a>&#8216; (a proof of concept to show what is possible).</p>
<p class="documentFirstHeading">Databank: how do you search/discover? First things added were audio files, perhaps then photos, how do you find them?</p>
<p class="documentFirstHeading">Showing Databank. Explaining that everything has a uid so we have cool URLs, and hence you can link to it [yes!]. Explaining how you can group an audio object, a related photo object and a related text object (perhaps explaining it).</p>
<p class="documentFirstHeading">End of morning discussion (I&#8217;ll just note some points I picked up):</p>
<p class="documentFirstHeading">This seems to raise such huge resource implications.</p>
<p class="documentFirstHeading">DAF is flexible, you can pick elements of it to use.</p>
<p class="documentFirstHeading">Non academic repositories, such as flickr, preservation issues, if they go down. [unlike the AHDS then!]</p>
<p class="documentFirstHeading"><strong>The Research <span class="highlightedSearchTerm">Data</span> Management Workforce &#8211; Alma Swan, Key Perspectives </strong></p>
<p class="documentFirstHeading">Study commissioned by JISC, looking at the &#8216;supply of DS [data scientists] skills&#8217;.</p>
<p class="documentFirstHeading">NSF Roles:</p>
<ul>
<li>Data Authors &#8211; produce data</li>
<li>Data Managers &#8211; more technical people &#8211; often work in partnership with data authors</li>
<li>Data Users</li>
<li>Data Scientists &#8211; expert data handlers and managers (perhaps &#8216;Data Manager&#8217; was a confusing name).</li>
</ul>
<p class="documentFirstHeading">Our Definitions (but in practice the roles and names are fuzzy):</p>
<ul>
<li>Data creators or authors</li>
<li>Data Scientists</li>
<li>Data Managers</li>
<li>Data Librarians</li>
</ul>
<p><strong>Data Creators</strong></p>
<p>Using <a href="http://www.dcc.ac.uk/docs/publications/DCCLifecycle.pdf">DCC Curation lifecycle model</a>, these are the out ring. But not all of it, and do things not on the ring, such as throw data away.</p>
<p>Shows picture of an academics office. Data is stored in random envelops.</p>
<p><strong>Data Scientists &#8211; the focus of this study</strong></p>
<p>Work with the researchers, in the same lab. Do most things in the DCC model. Are computer scientists (or can be one), experts in database technologies, ensure systems are in place, format migration. A &#8216;translation service&#8217; between Researchers and computer experts.</p>
<p>Lots of facts about this, based on the research. Often fallen in to the role by accident, often started out as a researcher. Domain (maths, chemistry) related or Computer training. Informatics Skills: well advanced in biology and chemistry. Majority have a further degree. Need People skills. Rapidly involving area.</p>
<p><strong>Data Librarians</strong></p>
<p>Only a handful in the UK. specific skills in data care, curation. Bottom half (or bottom two thirds) of DCC model.</p>
<p>Library schools have not yet geared up for training. Demand is low, no established career path. Good subject-based first degree is required.</p>
<p>Things are changing, eg library schools are creating courses/modules around this.</p>
<p>Future Roles of the library</p>
<p>train researchers to be more data aware</p>
<p>Pressing issue inform researchers on data principles, eg ownership.</p>
<p>Open Data : datasets</p>
<p>A growing recognition across all disciplines that articles aren&#8217;t enough, datasets are what are needed to be in the open.</p>
<p>Datasets are a resource in their own right.</p>
<p>Publishers do not normally claim ownership of datasets. Some are (usual suspects)</p>
<p>Funder may own Data, Employers may own data. No one seems sure. Several entities may own the data.</p>
<p>In some areas of research journals play role in enforcement.</p>
<p>Some journals are just data.</p>
<p>Using PDF for data is very very not good.</p>
<p>Do we leave preservation of data to publishers [cjk: no! they should have nothing to do with this, the actors are Universities, their employees and their funders]</p>
<h3><strong>Simon Hodson &#8211; JISC Data Management Infrastructure Programme<br />
</strong></h3>
<p>Something problem, not easy to tackle. Would be a mistake for institutions to wait. The Call is designed to better understand how its data management facility can be taken forward.</p>
<p>Detailed business cases are needed.</p>
<p>Needs everyone (HEI, funders, data centres, RIN, etc) to be on board.</p>
<p>the Call will have an Advisory Group.</p>
<p>&#8216;Exemplar projects and studies designed to help establish partnership between researchers, institutions, research councils.</p>
<p>See DCC as playing a major role in developing capacity and skills in the sector.</p>
<p>Tools and technologies: tools to help managers make business case internally, institutional planning tools (building on DAT, DRAMBORA, and costing tools). Workshop 1oth June DCC to review progress/outcomes of DAT project.</p>
<p>Two calls planned for the early Autumn.</p>
<p>2 June Call: Infrastructure. To build examples within the sector. Requirements analysis -&gt; Implementation plan -&gt; Execution thereof -&gt; business models.</p>
<p>Bids encouraged from consortia.</p>
<p>Briefing day 6 July. DCC will provide support for bids, including a specific helpdesk.</p>
<p>There may be a Digital Curation course in the next few weeks.</p>
<h3>Libraries and Research <span class="highlightedSearchTerm">Data</span> Management; conclusions &#8211; Martin Lewis, Director of <span class="highlightedSearchTerm">Library</span> Services and University Librarian, University of Sheffield.</h3>
<p>Martin had been chairing all day and here he sums up and bring the various threads together.</p>
<p>The library research data pyramid. Things at the bottom need to be in place before things higher up. At the bottom, training in library (confidence), Library schools. Then develop local data curation capacity, teach data literacy. Higher up: research data awareness, research data advice, Lead on local policy. At the very top &#8216;influence national data agenda&#8217;.</p>
<p><strong>Summary</strong></p>
<p>An excellent day and excellent knowledgeable speakers. Nice venue, and most importantly, I found the only plug socket in the room!</p>
<p>This is clearly an emerging area. Many are in the same posistion, they are aware of the (Opene) Research Data developments, but nothing has yet happened at their university, nor academics queuing up to demand such a service. This is a good thing and it needs to happen, and Universities need to start acting now. But there are many preasures on University resources at the moment. How high on the institutional priority list will this come?</p>
<p>[Very finally, I did another audioboo experiment. On the fly, with no pre-planning, I recorded about 2 minutes of talk during the lunch. It's random, with no thought, many umms, a pointless 'one more thing' and basically wrong. <a href="http://audioboo.fm/boos/27212-data-imperative-libraries-and-research-data">laugh at it here</a>]</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/' rel='bookmark' title='The Data Imperative: Libraries and Research Data : comment'>The Data Imperative: Libraries and Research Data : comment</a></li>
<li><a href='http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/' rel='bookmark' title='Research in the Open: How Mandates Work in Practice'>Research in the Open: How Mandates Work in Practice</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Research in the Open: How Mandates Work in Practice</title>
		<link>http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/</link>
		<comments>http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/#comments</comments>
		<pubDate>Fri, 29 May 2009 09:28:35 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[open access]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[rsp]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=370</guid>
		<description><![CDATA[Today I&#8217;m at the RSP/RIN Research in the Open: How Mandates Work in Practice at the impressive RIBA 66 Portland Place. Slides can be found here (not available when I made this post, as semi excuse as to why my notes miss so much). These are rough notes, which I&#8217;m making available in case others [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/' rel='bookmark' title='The Data Imperative: Libraries and Research Data : comment'>The Data Imperative: Libraries and Research Data : comment</a></li>
<li><a href='http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/' rel='bookmark' title='Event: The Data Imperative: Libraries and Research Data'>Event: The Data Imperative: Libraries and Research Data</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/" data-text="Research in the Open: How Mandates Work in Practice" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>Today I&#8217;m at the RSP/RIN <a href="http://www.rsp.ac.uk/events/index.php?page=MandatesDay2009/index.php">Research in the Open: How Mandates Work in Practice</a> at the impressive <a href="http://www.architecture.com/RIBAVenues/About66PP.aspx">RIBA 66 Portland Place</a>.</p>
<p>Slides can be found <a href="http://www.rsp.ac.uk/events/index.php?page=MandatesDay2009/index.php">here</a> (not available when I made this post, as semi excuse as to why my notes miss so much). These are rough notes, which I&#8217;m making available in case others are interested, apologies for mistakes and don&#8217;t take it as gospel!</p>
<p>After an introduction by <em>Stéphane Goldstein, kicking off with </em><strong><em>Robert Kiley from the Wellcome Trust.</em></strong></p>
<p><strong><em><a href="http://farm4.static.flickr.com/3658/3574952609_442086aa40.jpg?v=0"><img class="alignright" title="rsp1" src="http://farm4.static.flickr.com/3658/3574952609_442086aa40.jpg?v=0" alt="" width="259" height="300" /></a></em></strong></p>
<p>Wellcome trust mandate since 2006, anyone receiving funding from <a href="http://www.wellcome.ac.uk/">Wellcome Trust</a> must deposit in to pubmed, now <a href="http://ukpmc.ac.uk/">uk pubmed central</a>. <em><a href="http://www.sherpa.ac.uk/juliet/">SHERPA Juliet</a> lists 48 funder policies/mandates.</em></p>
<p>Two routes to complying to their mandate: (route 1) publisher in open access / hybrid journal (preferred), Wellcome will normally pay any associated fees. However when paying the publisher, they expect a certain level of service in return (deposited on behalf of author, final version available at time of publication, certain level of re-use. Route 2 Author self-archives author&#8217;s final version within 6 months of publication. It was stressed that the first option is very much preferred.<em> </em></p>
<p>&#8220;Publication costs are legitimate research costs&#8221;. To fund Open Access fees for ALL research they fund would, they estimate. take up 1-2% of their budget.</p>
<p>Risk of &#8216;Double payment&#8217; (author fees and subscriptions). <a href="http://www.oxfordjournals.org/">OUP</a> have a good model here.</p>
<p>Still to do:</p>
<ul>
<li>Improve compliance (roughly 33%, significant increase after letters to VCs),</li>
<li>improve mechanisms (Elsevier introduced OA workflow which resulted in significant increase in deposits, but funders/institutions/publishers all need to play a part here),</li>
<li>Clarifying Publishers OA Policies  (and re-use rights, didn&#8217;t catch this).</li>
</ul>
<h3>Research Councils UK &#8211; <em>Astrid Wissenburg, ESRC</em></h3>
<p>Starts of by talking about drivers for OA in the RC. Value for money, ensuring research is used, infrastructure and more.</p>
<p>Principles: Accessible, Quality (peer review), preservation (she&#8217;s moving through the slides fast)</p>
<p>April 2009 study in to OA impact, provides options for RC to consider.Findings</p>
<ul>
<li>Significant shift in favour of OA over last decade</li>
<li>Knowledge/awareness still limited. Confusion</li>
<li>Engagement with OA varies by subject area.</li>
<li>Too early to access impact of RCs policies.</li>
<li>Drivers
<ul>
<li>Not speed of dissemination</li>
<li>principles of free access</li>
<li>co-authors views are a big influence (mandates less so!)</li>
<li>some evidence that OA increases citation just after publication</li>
<li>limited compliance monitoring by finders</li>
<li>concern about impact of learned societies (but no evidence of libraries cancelling journals)</li>
<li>little evidence of use by non-researchers (CJK comment: interesting, I would imagine this may grow, wish newspapers would link/cite journal articles)</li>
</ul>
</li>
</ul>
<p>Both models (oa journals/repositories) supported by RCs, level playing field.</p>
<p>Pay to publish findings: limited use, barriers, costs, awareness, not RAE. would lead to redistribution of costs from non-academic to academic areas.</p>
<p>OA Deposit (repositories): from grant application from 1 Oct 2006, so a three year project starting then will only be finished in Autumn 2009. Acknowledges embargos but &#8216;at earliest opportunity&#8217;.</p>
<p>75% researchers were not aware of the mandate. diversity across subjects. &#8216;In general, no active deposit&#8217;.</p>
<p>A slide showing % of awareness broken down by RC, interesting.</p>
<p>From the highest level RCs are committed to supporting OA (this will increase). But change takes time.</p>
<p>Some issues: what do to with embargo periods, difficult for funders to manage (are there incentives we could use), depends on existence of repositories, multiple deposit options confusing to researchers, awareness/understanding.</p>
<h3>UKPubMed Central &#8211; <em>Paul Davey, Engagement Manager, UKPubMed Central</em></h3>
<p>Aims to become the information resource of choice for biomedical sector.</p>
<p>Principles: freely available, added to UK pubmed central, freely copied and reused.</p>
<p>Departmental of Health have clear policy to make research freely available.</p>
<p>95% of papers submitted are taken care of (deposited?) by the authors. only 0.5% submitted by academics (PIs/colleagues)</p>
<p>1.6 milion papers in uk pubmed central. 366 thousand downloads last month.</p>
<p>Core benefits: transparency, cutting down duplication, greater visibility.</p>
<p>Text mining, grabbing key terms from an article  (a little like  OpenCalais does)</p>
<p>Mentions EBI&#8217;s CiteXplore, encouraging academics to ink to other research.</p>
<p>Pubmed UK includes funding/grant facilities search. Can link articles to funding grants.</p>
<p>In short, backing from key funders, will make researchers more efficient, researcher&#8217;s visibility will increase.</p>
<p>Beta out in the Autumn, new site in Jan 2012.</p>
<p><em>Questions:</em></p>
<p>Worried about text mining, need for humans to moderate this.<em> response: Limited finding in this area so human intervention also limited. really need specialist to <a href=http://092.me>answer</a> this fully. </em></p>
<p>Question about increasing visibility of UK pubmed central, referring to Google, response: getting indexed by Google very much part of increasing visibility.</p>
<p>Question about Canadian &#8216;pubmed central&#8217;, response confirms this and mentions talk of a European pubmed central. Potential of European funders using UK pubmed central as a place to deposit research (like everything here, not sure if I&#8217;ve noted this right).</p>
<h3>PEER &#8211; Pioneering collaboration between publishers, repositories and researchers &#8211; <em>Julia Wallace</em></h3>
<p>Funded by EC, not a &#8216;publisher project&#8217;.</p>
<p>Three key stages of publication: NISO Author&#8217;s original, NISO Accepted Manuscript, NISO version of record.</p>
<p><em>Starts of talking about the project, interesting stuff but failed to take notes.</em></p>
<p>From the <a href="http://www.peerproject.eu/">website</a>:</p>
<p style="padding-left: 30px;">PEER (Publishing and the Ecology of European Research), supported by the <a class="external-link-new-window" title="Opens external link in new window" href="http://ec.europa.eu/information_society/activities/econtentplus/index_en.htm" target="_blank">EC eContent<em>plus</em> programme</a>, will investigate the effects of the large-scale, systematic depositing of authors’ final peer-reviewed manuscripts (so called Green Open Access or stage-two research output) on reader access, author visibility, and journal viability, as well as on the broader ecology of European research. The project is a collaboration between publishers, repositories and researchers and will last from 2008 to 2011.</p>
<p><em>Seven members: including a publisher group, university, funders etc. Various publishers involved, big and small and about six European repositories taking part.</em></p>
<p><em>Approach / content:</em></p>
<ul>
<li><em>Publishers contribute 300 journals, plus control</em></li>
<li><em>Maximises deposit and access in participating repositories</em></li>
<li><em> 50% publisher submitted 50% author submitted.</em></li>
<li><em>Good quality, range of impact factors. Publishers set embargo periods, up to 36 months.</em></li>
</ul>
<p>Publishers will deposit articles in to the repositories via a central depot for their 50% of articles submitted (50% fulltext, metadata for the remaining 50%). Publishers will invite authors to deposit for the &#8216;author&#8217; 50%</p>
<p>Technical: using PDFA-1 (where possible) and SWORD</p>
<p>Three strands: Behaviour, Usage (looking at raw log files), Economic. Also looking at Model Development (the three strands will look in to this).</p>
<p>Question about why they chose PDF (not very good for text mining). A: wide range of subjects and publishers means that PDF the best fit</p>
<h3>Economic Implications of Alternative Scholarly Publishing Models, also Loughborough University&#8217;s Institutional Mandate &#8211; <em>Charles Oppenheim, Loughborough University</em></h3>
<p>&#8216;Houghton report&#8217; looks at costs and benefits of scholarly publishing.</p>
<p><a href=" http://hdl.handle.net/2134/4137">Link to report <strong> <code>http://hdl.handle.net/2134/4137</code></strong></a></p>
<p><strong><a href="http://www.cfses.com/EI-ASPM/">Link to main website and models http://www.cfses.com/EI-ASPM/</a><br />
</strong></p>
<ul>
<li>Massive savings by using OA, UK would benefit from this.</li>
<li>Savings include: quicker searching, less negotiations, savings not just in library budgets</li>
<li>2,300 activity items costed.</li>
<li>This report currently final word in economics of OA.</li>
<li>Charles Talks about the various methods and work involved in producng this report.</li>
<li>a 5% incease in accessibility would lead to savings (or extra money to spend) in research/he/RCs</li>
<li>Hard to compare UK toll/open access publishing costs as one pays for UK access to content from across the world, the other pays for UK content to be world wide accessible.</li>
<li>Keen to role this out to other countires</li>
<li>Publishers response to report: furious!</li>
</ul>
<p>Now for something completly different: Loughborough approve a mandae a few months a go, to come in to affect Oct 09. An intergral part of academic personal research plans (only those research items in the IR will be considered at the review). Now have over<a href="http://www.nostuff.org/ircount/archive.php?id=http%3A%2F%2Fdspace.lboro.ac.uk%2Fdspace%2F20070419092801&amp;country=uk"> 4,000 items</a></p>
<h3>Lunch and audioboo</h3>
<p>During lunch I did an experiment using audioboo. Would I be able to summarise the morning, on the fly with no planning, in a brief audio recording. The <a href=http://092.me>answer</a>, as you can discover, is &#8216;no&#8217;, but fun to try, and made me think of what I had taken in during the morning. <a href="http://audioboo.fm/boos/25593-rsp-madates-in-practice">Link to audioboo recording</a>. or try the embedded version below.<br />
<object width="400" height="129" data="http://boos.audioboo.fm/swf/fullsize_player.swf" type="application/x-shockwave-flash"><param name="scale" value="noscale" /><param name="salign" value="lt" /><param name="bgColor" value="#FFFFFF" /><param name="wmode" value="window" /><param name="FlashVars" value="mp3Title=Rsp+Madates+In+Practice&amp;mp3=http%3A%2F%2Faudioboo.fm%2Fboos%2F25593-rsp-madates-in-practice.mp3&amp;mp3Time=01.45pm+29+May+2009&amp;mp3LinkURL=http%3A%2F%2Faudioboo.fm%2Fboos%2F25593-rsp-madates-in-practice&amp;mp3Author=chriskeene&amp;size=full&amp;playerWidth=400" /><param name="src" value="http://boos.audioboo.fm/swf/fullsize_player.swf" /></object></p>
<h3>Institutional Mandates &#8211; <em>Paul Ayris, University College London</em></h3>
<p><em>Paul starts off by shoing a number of Venn diagrams, for example: 90% of its research is available online, 40% available to an NHS hospital</em></p>
<p><em>What do UCL academics want</em></p>
<ul>
<li><em>as authors: visbility / impact</em></li>
<li><em>as readers: access</em></li>
<li><em>delivery 24&#215;7 anywhere</em></li>
</ul>
<p>UCL madate, a case study:</p>
<p>Looking global is an important part of UCL (for PIs rankings etc). Number of systems in their publication system: Symplectic, IRIS, eprints, data mart (and portico, FIS, HR). Symplectic (or similar tool) and IRIS seem central in this model. Plan to automatically extra metadata from other external places (publication repositories.</p>
<p>How did they get the mandate? Paul spoke at UCLs senate (Academic Board), the agreed: all academic staff should record they own publication on a UCL publication system, and, teaching materials should all be deposited in their eprints systems.</p>
<p>UCL are going to set up a publication board to over see the OA rollout; to advise, monitor, oversee presentation and more.</p>
<p>Next steps: market/exploit, set standards for online publication, to advise on ongoing resource issues in this area. Also, establish processes, Statistics and management information, advise on multimedia, copyright issues.</p>
<p>&#8216;Open Access is the natural way for a global university to achieve its objectives&#8217;</p>
<p>Question about blurring the line between dissemination and publication, and that some of UCLs aims seem more fitting of &#8216;publication&#8217;. Paul agrees, still trying to figure this out.</p>
<h3>HEFCE &#8211; <em>Paul Hubbard, Head of Research Policy, HEFCE</em></h3>
<p>Policy: Research is a process which leads to insights for sharing. So Scholarly Communication matters to HEFCE. Prompt and accessible publishing is essential for a world class research system.</p>
<p>Supporting research: JISC, RIN, Programmes to support national research libraries (<a href="http://www.ukrr.ac.uk/">UKRR</a>), <a href="http://www.ukrds.ac.uk/">UKRDS</a>. Mentions Boston Spa (BL) document centre as an example of our world class sharing.</p>
<p>Internet opens up new ways of scholarly communication and sharing.</p>
<p>What do HEFCE want to see:</p>
<ul>
<li>Widest and earliest dissemination of public research.</li>
<li>IP shared effectively with the people best placed to exploit it (CJK comment, i don&#8217;t think it is publishers!)</li>
</ul>
<p>Committed to: UK maintaining world leading research, funding that fosters autonomy and dynamism, research quality assessment regime that supports rather than inhibits new developments.</p>
<p>As we move forward, things may be unclear those HEIs with repositories will be at an advantage.</p>
<p>Paul finishes up with a personal view of scholarly communications in 2030. He sees to forms of communication: discussion (building up ideas), and writing up a formal firm idea/conclusion based on these. HEFCE supports &#8211; through the likes of JISC &#8211; a range of tools and systems to enable this. (sorry that was an awful summary, he said much more than that!).</p>
<p>Answered a <a href=http://092.me>question</a> as to why IRs, HEIs are the places to administrate/manage. Websites people go to see research for a particular subject need to be overlay systems harvesting from IRs.</p>
<p>[hmm, does 'university requirement' sound better than mandate?]</p>
<h3>Institutional Policies and Processes for Mandate Compliance &#8211; <em>Bill Hubbard, SHERPA, University of Nottingham</em></h3>
<p>99.9% of academics do not object to Open Access, but need to show it will not change how they work. Librarians going to be much more part of the research process. Most people (including most publishers) are in favour of Open Access.</p>
<p>Other pressures on the systems, lack of peer reviewers, rising prices of journals, growing need for different forms of scholarly communications (e-lab books, multimedia), public demand for highest value for money &#8216;public should get what they pay for&#8217;,<em><br />
</em></p>
<p>Not if we change, but how we change. Research has to change seamlessly. Mandates have a value-added basis with fast delivery of benefits. Need integrated processes, need integrated support (we don&#8217;t want researchers to hear different messages from their Uni, funder, publisher, etc).</p>
<p>Authors need to know &#8216;what do i meed to do&#8217;. Need to make it less confusing, need to make it clear when they can get help.</p>
<p>First step compliance: how can funders improve compliance, how can authors be supported?</p>
<p>All 1994 and Russel Group now have IR (Reading, I think, just setting one up now).</p>
<p>Compliance for mandates makes it better for us admin/support staff, and for the University generally.</p>
<p>Institutions need a compliance officer (perhaps repository manager). Funders need to ensure these people have the information they need. Share compliance information</p>
<p><em>I&#8217;ve missed so much of Bill&#8217;s talk here, he moves fast (and passionately) and lots of points.</em></p>
<p>After Bill&#8217;s talk there was a panel session.</p>
<h3>Twitter</h3>
<p>Finally check out some of the <a href="http://sn.im/j0hjp">useful tweets</a> from the day. (Twitter search only goes back about a month or so, so this link may not work after a certain date). <a href="http://twitter.com/jimrhiz">Jim Richardson</a> also created a <a href="http://www.webcitation.org/query?url=http%3A%2F%2Fsearch.twitter.com%2Fsearch%3Fq%3D%2523rsp%26rpp%3D50%26format%3Diphone&amp;date=2009-05-30">permanent copy</a> with the (new to me) webcitation website.</p>
<h3>Conclusions</h3>
<p>With such dodgy note taking I feel some concise summary is in order!</p>
<ul>
<li>Mandates are happening, by Universities and by Funders.</li>
<li>HEFCE want research to be accessible to as many as possible as quickly as possible.  Coming from HEFCE, this holds a lot of weight.</li>
<li>Funders (Research Councils / Wellcome) put mandates in place several years a go. They have not sat back and said &#8216;job done&#8217;. They are building on this foundation. How can they check? How can they enforce/encourage? How can they assist? How can they automate? How can they work with publishers and HE to share this information? Expect more to come in this area.</li>
<li>Wellcome Trust prefers submission to Open Access Journals rather than author depositing in to a repository at a later date.</li>
<li>HE Mandates are coming, we alreay have a few in the UK. Making them an intergral part of an academic&#8217;s review seems like a good idea. My opinion is that this is reasonable &#8211; even if there are those who disagree &#8211; surely an employer can (and <em>does</em> in every other sector) ask for a record of what an employee has been working on, and a copy of the end output, i.e. the full text in an IR.</li>
<li>The report &#8216;<a href="https://dspace.lboro.ac.uk/dspace-jspui/handle/2134/4137">Economic implications of alternative scholarly publishing models : exploring the costs and benefits. JISC EI-ASPM Project</a>&#8216; is a thourough comprehensive look at the economic costs of Open Access and new forms of Scholorarly Communications.</li>
<li>I think we are starting to see the larger Universities developing sophisticated network of systems to manage research/publications/OA/research-funding. See <a href="http://www.rsp.ac.uk/events/index.php?page=MandatesDay2009/index.php">slide 10 of Paul Ayris</a> presentation, and<a href="http://spiral.imperial.ac.uk/handle/10044/1/493"> this article about Imperial&#8217;s setup</a> as two examples.</li>
<li>It makes sense to share information (between IT systems) between funders, HE and publishers. Examples: Funders sharing (bibliographic) information to a University about publications from its researchers, Universities (or publishers) passing information to funders linking publications to funding (or even the other way round?).</li>
<li>This is an area which is still developing, fast, and will of course involve a culture change. Publishers seem unsure how to handle this new world.</li>
</ul>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2009/the-data-imperative-libraries-and-research-data-commet/' rel='bookmark' title='The Data Imperative: Libraries and Research Data : comment'>The Data Imperative: Libraries and Research Data : comment</a></li>
<li><a href='http://www.nostuff.org/words/2009/event-the-data-imperative-libraries-and-research-data/' rel='bookmark' title='Event: The Data Imperative: Libraries and Research Data'>Event: The Data Imperative: Libraries and Research Data</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2009/research-in-the-open-how-mandates-work-in-practice/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Library search/discovery apps : intro</title>
		<link>http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/</link>
		<comments>http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/#comments</comments>
		<pubDate>Wed, 27 May 2009 14:41:41 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[aquabrowser]]></category>
		<category><![CDATA[catalogue]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[library 2.0]]></category>
		<category><![CDATA[ngc]]></category>
		<category><![CDATA[opac]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=363</guid>
		<description><![CDATA[There’s a lot of talk in the Library world about ‘next generation catalogues’, library search tools and ‘discovery’. There’s good reason for this talk, in this domain the world has been turned on its head. History in a nutshell: The card catalogue became the online catalogue, the online catalogue let users search for physical items [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/' rel='bookmark' title='Library Catalogues need to cater for light-weight discovery clients'>Library Catalogues need to cater for light-weight discovery clients</a></li>
<li><a href='http://www.nostuff.org/words/2009/library_catalogues_changing_model/' rel='bookmark' title='Library catalogues, search systems and data'>Library catalogues, search systems and data</a></li>
<li><a href='http://www.nostuff.org/words/2009/academic-discovery-and-library-catalogues/' rel='bookmark' title='Academic discovery and library catalogues'>Academic discovery and library catalogues</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/" data-text="Library search/discovery apps : intro" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>There’s a lot of talk in the Library world about ‘next generation catalogues’, library search tools and ‘discovery’. There’s good reason for this talk, in this domain the world has been turned on its head.</p>
<p>History in a nutshell:</p>
<ul>
<li>The card catalogue became the online catalogue, the online catalogue let users search for physical items within the Library.</li>
<li>Journals became online journals. Libraries needed to let users find the online journals they subscribed to through various large and small publishers and hosts. They built simple in-house databases (containing journal titles and links to their homepages), <em>or</em> added them to the catalogue, <em>or</em> used a third party web based tool. As the number of e-journals grew, most ended up using the last option, a third party tool (which could offer other services, such as link resolving, and do much of the heavy lifting with managing a knowledge base).</li>
<li>But users wanted one place to search. Quite understandable. If you are after a journal, why should you look in one place to see if there is a physical copy, and another place if they had access to it online. Same with books/ebooks.</li>
<li>So libraries started to try and find ways to add online content to catalogue systems in bulk (which weren’t really designed for this). <a href="http://www.nostuff.org/words/wp-content//2009/05/betageneral.jpg"><img class="aligncenter" style="border: 0pt none; display: inline;" title="Aquabrowser : Uni Sussex beta catalogue" src="http://www.nostuff.org/words/wp-content//2009/05/betageneral-thumb.jpg" border="0" alt="Aquabrowser : Uni Sussex beta catalogue" width="354" height="226" align="right" /></a></li>
</ul>
<p>The online catalogues (OPAC) were simple web interfaces supplied with the much larger Library management system (ILS or LMS) which ran the backend the public never saw. These were nearly always slow, ugly, unloved and not very useful.</p>
<p>A couple of years a go(ish), we saw the birth of the <em>next generation catalogue</em>, or <em>search and discovery tools</em>. I could list them, but the <a href="http://dltj.org/article/niso-discovery-presentation-links/" target="_blank">Disruptive Technology Library Jester does an excellent job here</a>. I strongly suggest you take a look.</p>
<p>Personally, I think I first heard about <a href="http://www.medialab.nl/" target="_blank">Aquabrowser</a>. At the time a new OPAC which was miles ahead of those supplied with Library systems and was (I think) unique as a web catalogue interface not associated with a particular system, and shock, not from an established Library Company. The second system I heard about was probably <a href="http://www.exlibrisgroup.com/category/PrimoOverview" target="_blank">Primo</a> from Ex Libris. At first not understanding what it was: It sounds like Metalib (another product from the same company which cross-searches various e-resource), is Primo replacing it? Or replacing the OPAC? It took a while to appreciate that this was something that sat on top of the rest. From then, <a href="http://www.vufind.org/" target="_blank">VuFind</a>, <a href="http://libraryfind.org/" target="_blank">LibraryFind</a> and <a href="http://dltj.org/article/niso-discovery-presentation-links/" target="_blank">more</a>.</p>
<p>While some where traditional commercial products (Primo, Encore, Aquabrowser), many more were open source solutions, a number of which developed at American Libraries. Often built on common (and modern) technology stacks such as Apache solr/Lucene, Drupal, php/java, mysql/postgres etc.<a href="http://www.nostuff.org/words/wp-content//2009/05/bl-primo.jpg"><img class="aligncenter size-medium wp-image-368" title="Primo : British Library" src="http://www.nostuff.org/words/wp-content//2009/05/bl-primo-300x236.jpg" alt="Primo : British Library" width="300" height="236" /></a></p>
<p>In the last year or so a number of major Libraries have started to use one of these ‘Discovery Systems’ for example: the <a href="http://searchbeta.bl.uk" target="_blank">BL</a> and <a href="http://solo.ouls.ox.ac.uk/" target="_blank">Oxford</a> using Primo, National Libraries of <a href="http://discover.nls.uk/" target="_blank">Scotland</a> &amp; Wales and <a href="http://nrs.harvard.edu/urn-3:hul.eresource:hollisds" target="_blank">Harvard</a> have purchased Aquabrowser and the <a href="https://beta.catalogue.lse.ac.uk/" target="_blank">LSE</a> is trying VuFind. At Sussex (where I work) we have purchased and implemented <a href="http://catalogue.sussex.ac.uk/ABL/" target="_blank">Aquabrowser</a>. We’ve added data enrichments such as table of contents (searchable and visible on records), book covers<span style="text-decoration: line-through;"> and the ability to tag and review items</span> (tag/reviewing has been removed for various reasons) .</p>
<p>It would be a mistake to put all of these in to one basket. Some focus on being a OPAC replacement, others on being a unified search tool, searching both local and online items. Some focus on social tools, tagging &amp; reviewing. Some work out the box others are just a set of components which a Library can sow together, and some are ‘<a href="http://en.wikipedia.org/wiki/Software_as_a_service">SaaS</a>’.</p>
<p>It’s an area that is fast changing. Just recently an established Library web app Company announced a forthcoming product called ‘<a href="http://www.serialssolutions.com/summon/index.html" target="_blank">Summon</a>’, which takes searching a library’s online content a step further.</p>
<p>So what do libraries go for, it’s not just potentially backing the wrong horse, but backing the wrong horse when everyone one else had moved on to dog racing!</p>
<p>And within all this it is important to remember ‘what do users actually want’. From the conversations and articles I’ve read, they want a Google search box, but one which returns results from trusted sources and academic content. Whether they are looking for a specific book, specific journal, a reference/citation, or one/many keywords. And not just one which searches the metadata, but one which brings back results based on the full text of items as well. There are some that worry that too many results are confusing. As Google proves, an intelligent ranking system makes the number of results irrelevant.</p>
<p>Setting up (and even reviewing) most of these systems take time, and if users start to add data (tags, reviews) to one system, then changing could cause problems (so should we be using third party tag/rating/review systems?).</p>
<p>You may be interested in some other articles I’ve written around this:</p>
<ul>
<li><a href="http://www.nostuff.org/words/2009/library_catalogues_changing_model/">Library catalogues, search systems and data</a> – discusses the issues in putting online journal/item records on to the catalogue, and therefore why a ‘discovery systems’ can resolve these.</li>
<li><a href="http://www.nostuff.org/words/2009/academic-discovery-and-library-catalogues/">Academic discovery and library catalogues</a> – Discusses the eXtensible Catalogue, and also links to an Academics article about how they conduct research</li>
<li><a href="http://www.nostuff.org/words/2008/jisc-library-management-system-review/">JISC Library Management System Review</a> – Summarises a recent JISC report on Library Management systems</li>
<li><a href="http://www.nostuff.org/words/2009/free-e-books-online-via-university-of-pittsburgh-press/">Free e-books online via University of Pittsburgh Press</a> – from a couple of days a go, talks about the issues of Discovery related to free academic items (books) on the web, and ensuring Library Discovery Tools include them.</li>
</ul>
<p>There’s a lot talk about discovery tools, but what sort to go for, who to back? And many issues have yet to be resolved. I’m come on to those next…</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2010/library-catalogues-need-to-cater-for-light-weight-discovery-clients/' rel='bookmark' title='Library Catalogues need to cater for light-weight discovery clients'>Library Catalogues need to cater for light-weight discovery clients</a></li>
<li><a href='http://www.nostuff.org/words/2009/library_catalogues_changing_model/' rel='bookmark' title='Library catalogues, search systems and data'>Library catalogues, search systems and data</a></li>
<li><a href='http://www.nostuff.org/words/2009/academic-discovery-and-library-catalogues/' rel='bookmark' title='Academic discovery and library catalogues'>Academic discovery and library catalogues</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2009/library-searchdiscovery-apps-intro/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Free e-books online via University of Pittsburgh Press</title>
		<link>http://www.nostuff.org/words/2009/free-e-books-online-via-university-of-pittsburgh-press/</link>
		<comments>http://www.nostuff.org/words/2009/free-e-books-online-via-university-of-pittsburgh-press/#comments</comments>
		<pubDate>Tue, 26 May 2009 09:27:09 +0000</pubDate>
		<dc:creator>Chris Keene</dc:creator>
				<category><![CDATA[libraries, library technologies & open data]]></category>
		<category><![CDATA[discovery]]></category>
		<category><![CDATA[ebooks]]></category>
		<category><![CDATA[open access]]></category>
		<category><![CDATA[worldcat]]></category>

		<guid isPermaLink="false">http://www.nostuff.org/words/?p=355</guid>
		<description><![CDATA[The University of Pittsburgh Press has put nearly 500 out of print books online and Open Access. You can access them via their Digital Editions website.  This is excellent news, making work which could be lost openly available to all. For years there has been a movement towards making Journal articles Open Access, i.e. publicly [...]
Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/google-books-api/' rel='bookmark' title='Google Books API'>Google Books API</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.nostuff.org/words/2009/free-e-books-online-via-university-of-pittsburgh-press/"></g:plusone></div><div class="socialize-in-button socialize-in-button-right"><a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.nostuff.org/words/2009/free-e-books-online-via-university-of-pittsburgh-press/" data-text="Free e-books online via University of Pittsburgh Press" data-count="vertical" data-via="socializeWP" data-related="chriskeene"><!--Tweetter--></a></div><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.nostuff.org/words/2009/free-e-books-online-via-university-of-pittsburgh-press/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div></div><p>The <a href="http://www.upress.pitt.edu/upressIndex.aspx">University   of Pittsburgh Press</a> has put nearly 500 out of print books online and Open Access. You can access them via their <a href="http://digital.library.pitt.edu/p/pittpress/">Digital Editions website</a>.  This is excellent news, making work which could be lost openly available to all.</p>
<div id="attachment_356" class="wp-caption alignright" style="width: 310px"><a href="http://www.nostuff.org/words/wp-content//2009/05/picture-1.png"><img class="size-medium wp-image-356" title="University of Pittsburgh Press Digital Editions - Open Access free ebooks " src="http://www.nostuff.org/words/wp-content//2009/05/picture-1-300x171.png" alt="iversity of Pittsburgh Press Digital Editions - Open Access free ebooks " width="300" height="171" /></a><p class="wp-caption-text">University of Pittsburgh Press Digital Editions - Open Access free ebooks </p></div>
<p>For years there has been a movement towards making Journal articles Open Access, i.e. publicly available. However some subjects (especially in the Humanities) publish much of their research in books, not journals. Letting the world gain from the (normally publicly funded) research contained within books is more complex, and it&#8217;s not an area I fully understand. The author normally receives royalties from book sales. However I understand this are normally very small 99% of the time, and normally tail down to tiny amounts after a few years. What if funders and Universities demanded that any book written with their money (or during their employment) must be made publicly available after x number of years (let&#8217;s say 10 years)? Academics and Publishers would not welcome the move, but would still allow a window where they can gain revenue, and if this became the norm it would be something they just have to accept. Meanwhile, once open access, the book becomes much easier to archive and preserve, and ensure the knowledge is available to all in the long term. Just a thought.<span id="more-355"></span></p>
<p>Back to the Pittsburgh Press website (created with the University Library), the Copyright remains with the Press and I note this somewhat restrictive quote at the bottom of their homepage: &#8220;<em>This  material is provided for scholarly, educational, and research use only.</em>&#8221; In an ideal world they would be under a more liberal licence (creative commons for example), though I imagine this would lead to many legal complexities to resolve, and don&#8217;t want this to detract from the good work that has been done here. It&#8217;s a massive step to release these so that they are publicly available.</p>
<p>It occurs to me that this raises the <a href=http://092.me>question</a> of &#8216;discovery&#8217;. Not as a criticism of this service, but as an example of the problems we face in fully utilising its existence.</p>
<p>How will our (a University Library) users find these books? I can think of two  situations: first, they are looking for a particular book (perhaps due a reference) and discovering the online version will be much quicker than trying to find any existing print copy in a Library around the world, second: when a book contained in this service will be relevant to their research &#8211; and they are searching (catalogues/Google) using keywords to find relevant material.</p>
<p>Should users find this when they search our catalogue, or when they search our online resources (using a federated search service)? With the next generation library search products appearing, should they be &#8216;discoverable&#8217; within these systems, and if so, how?</p>
<p>The above focuses on how a library should add these items in to its systems, the other side of the coin is adding them to universal worldwide systems. I selected a particular book and searched Worldcat for the title, it shows the book twice in the search results, once for the paper version and<a href="http://www.worldcat.org/oclc/7671714"> once for the ebook at Pittsburgh</a>, the latter linking to the free e-book. A Good start!</p>
<p>Searching <a href="http://www.google.co.uk/search?q=After+innocence+%3A+visions+of+the+Fall+in+modern+literature&amp;ie=utf-8&amp;oe=utf-8&amp;aq=t&amp;rls=org.mozilla:en-GB:official&amp;client=firefox-a">Google for the same title</a>, the Worldcat entry was the fifth link, so someone searching Google for this book would &#8211; if their attention span can extend to trying the fifth link &#8211; find the free online version via the Worldcat record. An even better situation would be for Google to link to the free book directly.</p>
<p>Some thoughts:</p>
<ul>
<li>While the above is promising it relies on a user searching for the title of the book, which doesn&#8217;t cater for those who don&#8217;t know it exists, yet it&#8217;s content would be useful for their research.</li>
<li>Users are saying to us they want one place to search for sources of information and content useful for their work. They have to use many different systems and it can be confusing when to use which one, and each has their own interface and rules. The Library catalogue, VLE/e-learning platform, <a href="www.isiknowledge.com/">Web of Science</a>, <a href="http://www.worldcat.org/">Wordcat</a>, <a href="http://www.scopus.com/">Scopus</a>, Google, Federated (online resources) search, <a href="http://copac.ac.uk/about/">COPAC</a>, etc. Which ones to use, and when? The ultimate aim of providing one search box is to highlight information (content) which could be useful &#8211; both that which the institution subscribes to, and that which is free. How will records of Open Access books such as these &#8216;get in to&#8217; such a search index?  Each Library adding such content is not very efficient, better would be for it to be added in one place/database (along with similar items) which can be openly searched/harvested by other systems using an API. Libraries could then simply add such a service to their own search system to make the content available to users. <a href="http://www.doaj.org/">DOAJ</a> (a directory of Open Access Journals) <a href=http://092.me>answer</a>ed a similar concern for Open Access Journals.</li>
</ul>
<p>Content is useless unless people can discover it, and how will Libraries develop systems which help users to discover everything they can potentially access in such an easy way that it is intuitive and requires no specialist training?</p>
<p>Related posts:<ol>
<li><a href='http://www.nostuff.org/words/2008/google-books-api/' rel='bookmark' title='Google Books API'>Google Books API</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.nostuff.org/words/2009/free-e-books-online-via-university-of-pittsburgh-press/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 9.387 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2012-03-20 01:05:22 -->
<!-- Compression = gzip -->
