<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-2425168083034317580</id><updated>2012-02-17T14:28:37.870+13:00</updated><category term='TEI'/><category term='GIS'/><category term='mike'/><category term='teReo'/><category term='red hat'/><category term='New Zealand'/><category term='l10n'/><category term='selenium'/><category term='open source'/><category term='creative commons'/><category term='baduk'/><category term='mugshot'/><category term='suffix tree'/><category term='library'/><category term='MARC'/><category term='ndha'/><category term='firefox'/><category term='awk'/><category term='RSS'/><category term='portrait'/><category term='national library'/><category term='configuration'/><category term='invitation to tinker'/><category term='nzetc'/><category term='community building'/><category term='script'/><category term='librarything'/><category term='code'/><category term='work'/><category term='usability'/><category term='rant'/><category term='place names'/><category term='social network'/><category term='taxonomy'/><category term='linux'/><category term='facebook'/><category term='computer science'/><category term='xml'/><category term='i18n'/><category term='catalyst.net'/><category term='jgogears'/><category term='authority'/><category term='authentication'/><category term='vmware'/><category term='maori'/><category term='macrons'/><category term='topic maps'/><category term='ohloh'/><category term='semantic web'/><category term='sqlite'/><category term='bloglines'/><category term='xslt'/><category term='bookmarks'/><category term='nzetc catalyst.net.nz'/><category term='go'/><category term='networking'/><category term='OPML'/><category term='photo'/><category term='geolocation'/><category term='sql'/><category term='NDF'/><category term='flickr'/><category term='twitter'/><category term='kernel'/><category term='LINZ'/><category term='google reader'/><category term='epubs'/><category term='folksomony'/><category term='mozilla'/><category term='Māori'/><category term='binomial names'/><category term='communications'/><category term='ubuntu'/><category term='NewZealand'/><category term='metadata'/><category term='#opengovt'/><title type='text'>Open Source Exile</title><subtitle type='html'>An open sourcer in exile</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>48</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-6632414432456807912</id><published>2011-12-02T14:11:00.003+13:00</published><updated>2011-12-02T14:25:44.790+13:00</updated><title type='text'>Prep notes for NDF2011 demonstration</title><content type='html'>I didn't really have a presentation for my demonstration at the NDF, but the event team have asked for presentations, so here are the notes for my practice demonstration that I did within the library. The notes served as an advert to attract punters to the demo; as a conversation starter in the actual demo and as a set of bookmarks of the URLs I wanted to open.&lt;br /&gt;&lt;br /&gt;&lt;hr/&gt;&lt;br /&gt;&lt;br /&gt;Depending on what people are interested in, I'll be doing three things&lt;br /&gt;&lt;br /&gt;*) Demonstrating basic editing, perhaps by creating a page from the requested articles at &lt;a href="http://en.wikipedia.org/wiki/Wikipedia:WikiProject_New_Zealand/Requested_articles"&gt;http://en.wikipedia.org/wiki/Wikipedia:WikiProject_New_Zealand/Requested_articles&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;*) Discussing some of the quality control processes I've been involved with (&lt;a href="http://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion"&gt;http://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion&lt;/a&gt; and &lt;a href="http://en.wikipedia.org/wiki/New_pages_patrol"&gt;http://en.wikipedia.org/wiki/New_pages_patrol&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;*) Discussing how wikipedia handles authority control issues using redirects (&lt;a href="https://secure.wikimedia.org/wikipedia/en/wiki/Wikipedia:Redirect"&gt;https://secure.wikimedia.org/wikipedia/en/wiki/Wikipedia:Redirect&lt;/a&gt; ) and disambiguation (&lt;a href="https://secure.wikimedia.org/wikipedia/en/wiki/Wikipedia:Disambiguation"&gt;https://secure.wikimedia.org/wikipedia/en/wiki/Wikipedia:Disambiguation&lt;/a&gt; )&lt;br /&gt;&lt;br /&gt;I'm also open to suggestions of other things to talk about.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-6632414432456807912?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/6632414432456807912/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=6632414432456807912' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6632414432456807912'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6632414432456807912'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/12/prep-notes-for-ndf2011-demonstration.html' title='Prep notes for NDF2011 demonstration'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8546580500262349533</id><published>2011-12-01T20:20:00.004+13:00</published><updated>2011-12-01T20:36:25.491+13:00</updated><title type='text'>Metadata vocabularies  LODLAM NZ cares about</title><content type='html'>At today's LODLAM NZ, in Wellington, I co-hosted a vocabulary schema /  interoperability session. I kicked off the session with a list of the  metadata schema we care about and counts of how many people in the room  cared about it. Here are the results:&lt;br /&gt;&lt;br /&gt;8 &lt;a href="http://www.loc.gov/catdir/pcc/naco/nacopara.html"&gt;Library of Congress / NACO Name Authority List&lt;/a&gt;&lt;br /&gt;7 &lt;a href="http://mshupoko.natlib.govt.nz/mshupoko/index.htm"&gt;Māori Subject Headings&lt;/a&gt;&lt;br /&gt;6 &lt;a href="http://www.loc.gov/aba/cataloging/subject/"&gt;Library of Congress Subject Headings&lt;/a&gt;&lt;br /&gt;5 &lt;a href="http://www.vocabularyserver.com/sonz/"&gt;SONZ&lt;/a&gt;&lt;br /&gt;5 &lt;a href="https://en.wikipedia.org/wiki/Linnaean_taxonomy"&gt;Linnean&lt;/a&gt;&lt;br /&gt;4 &lt;a href="http://www.getty.edu/research/tools/vocabularies/index.html"&gt;Getty Thesauri&lt;/a&gt;&lt;br /&gt;3 &lt;a href="http://www.arc.gov.au/era/ANZSRC.htm"&gt;Marsden Research Subject Codes / ANZRSC Codes&lt;/a&gt;&lt;br /&gt;3 &lt;a href="http://www2.curriculum.edu.au/scis/connections/schools_online_thesaurus_%28scot%29.html"&gt;SCOT&lt;/a&gt;&lt;br /&gt;3 &lt;a href="http://iwihapu.natlib.govt.nz/iwi-hapu/"&gt;Iwi Hapū List&lt;/a&gt;&lt;br /&gt;2 &lt;a href="http://www.picturethesaurus.gov.au/"&gt;Australian Pictorial Thesaurus&lt;/a&gt;&lt;br /&gt;1 &lt;a href="http://www.powerhousemuseum.com/collection/database/thesaurus.php"&gt;Powerhouse Object Names Thesaurus&lt;/a&gt;&lt;br /&gt;0 &lt;a href="http://www.ncbi.nlm.nih.gov/mesh"&gt;MESH&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This straw poll naturally only reflects on the participants who attended this particular session and counting was somewhat haphazard (people were still coming into the room), but is gives a sample of the scope.&lt;br /&gt;&lt;br /&gt;I don't recall whether the heading was "Metadata we care about" or "Vocabularies we care about," but it was something very close to that.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8546580500262349533?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8546580500262349533/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8546580500262349533' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8546580500262349533'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8546580500262349533'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/12/metadata-lodlam-nz-cares-about.html' title='Metadata vocabularies  LODLAM NZ cares about'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-7688811774663202296</id><published>2011-11-30T21:08:00.002+13:00</published><updated>2011-11-30T21:27:43.385+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Māori'/><category scheme='http://www.blogger.com/atom/ns#' term='NDF'/><category scheme='http://www.blogger.com/atom/ns#' term='metadata'/><title type='text'>Unexpected advice</title><content type='html'>During the &lt;a href="http://ndf.natlib.govt.nz/about/2011Programme.htm"&gt;NDF2011&lt;/a&gt; today I was in "Digital initiatives in Māori communities" put on the the talented Honiana Love and Claire Hall from the &lt;a href="http://www.taranakireo.co.nz/"&gt;Te Reo o Taranaki Charitable Trust&lt;/a&gt; about their work on &lt;a href="http://kete.taranakireo.co.nz/"&gt;He Kete Kōrero&lt;/a&gt;. At the end I asked a question "Most of us [the audience] are in institutions with te Reo Māori holdings or cultural objects of some description. What small thing can we do to help enable our collections for the iwi and hapū source communities? Use &lt;a href="http://mshupoko.natlib.govt.nz/mshupoko/index.htm"&gt;Māori Subject Headings&lt;/a&gt;? The&lt;a href="http://iwihapu.natlib.govt.nz/iwi-hapu/"&gt; Iwi / Hapū list&lt;/a&gt;? &lt;a href="https://en.wikipedia.org/wiki/Geotagging"&gt;Geotagging&lt;/a&gt;? ..." Quick-as-a-blink the response was "Geotagging." If I understood the answer (given mainly by Honiana) correctly, the point was that geotagging is much more useful because it's much more likely to be done right in contexts like this. Presumably because geotagging lends itself to checking, validation and visualisations that make errors easy to spot in ways that these other metadata forms don't; it's better understood by those processing the documents and processing the data.&lt;br /&gt;&lt;br /&gt;I think it's fabulous that we're getting feedback from indigenous groups using information systems in indigenous contexts, particularly feedback about previous attempts to cater to their needs. If this is the experience of other indigenous groups, it's really important.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-7688811774663202296?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/7688811774663202296/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=7688811774663202296' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7688811774663202296'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7688811774663202296'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/11/unexpected-advice.html' title='Unexpected advice'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-6120951398147781692</id><published>2011-11-26T21:58:00.000+13:00</published><updated>2011-11-26T21:59:25.763+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='social network'/><category scheme='http://www.blogger.com/atom/ns#' term='facebook'/><category scheme='http://www.blogger.com/atom/ns#' term='twitter'/><title type='text'>Goodbye 'social-media' world</title><content type='html'>You may or may not have noticed, but recently a number of 'social media' services have begun looking and working very similarly. Facebook is the poster-child, followed by google+ and twitter. Their modus operandi is to entice you to interact with family-members, friends and acquaintances and then leverage your interactions to both sell your attention advertisers and entice other members of you social circle to join the service.&lt;br /&gt;&lt;br /&gt;There are, naturally, a number of shiny baubles you get for participating it the sale of your eyeballs to the highest bidder, but recently I have come to the conclusion that my eyeballs (and those of my friends, loved ones and colleagues) are worth more.&lt;br /&gt;&lt;br /&gt;I'll be signing off google plus, twitter and facebook shortly. I my return for particular events, particularly those with a critical mass the size of Jupiter, but I shall not be using them regularly. I remain serenely confident that all babies born in my extended circle are cute, I do not need to see their pictures.&lt;br /&gt;&lt;br /&gt;I will continue using other social media as before (email, wikipedia, irc, skype, etc) as usual. My deepest apologies to those who joined at least party on my account.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-6120951398147781692?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/6120951398147781692/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=6120951398147781692' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6120951398147781692'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6120951398147781692'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/11/goodbye-social-media-world.html' title='Goodbye &apos;social-media&apos; world'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-5130106030056557823</id><published>2011-11-06T21:24:00.004+13:00</published><updated>2011-11-06T21:58:32.995+13:00</updated><title type='text'>Recreational authority control</title><content type='html'>Over the last week or two I've been having a bit of a play with &lt;a href="http://mshupoko.natlib.govt.nz/mshupoko/index.htm"&gt;Ngā Ūpoko Tukutuku / The Māori Subject Headings&lt;/a&gt; (for the uninitiated, think of the widely used Library of Congress Subject Headings, done Post-Colonial and bi-lingually but in the same technology) the main thing I've been doing is trying to &lt;a href="http://catb.org/jargon/html/M/munge.html"&gt;munge&lt;/a&gt; the MSH into Wikipedia (Wikipedia being my addiction du jour).&lt;br /&gt;&lt;br /&gt;My thinking has been to increase the use of MSH by taking it, as it were, to where the people are. I've been working with the &lt;a href="http://en.wikipedia.org/"&gt;English language Wikipedia&lt;/a&gt;, since the &lt;a href="http://mi.wikipedia.org/"&gt;Māori language Wikipedia&lt;/a&gt; has fewer pages and sees much less use. &lt;br /&gt;&lt;br /&gt;My first step was to download the MSH in &lt;a href="http://en.wikipedia.org/wiki/MARC_standards#MARC-XML"&gt;MARC XML&lt;/a&gt; format (available from the website) and use XSL to transform it into a &lt;a href="http://en.wikipedia.org/wiki/User:Stuartyeates/Ng%C4%81_%C5%AApoko_Tukutuku"&gt;wikipedia table&lt;/a&gt; (warning: large page). When looking at that table, each row is a subject heading, with the first column being the the te reo Māori term, the second being permutations of the related terms and the third being the scope notes. I started a &lt;a href="http://en.wikipedia.org/wiki/Wikipedia:Administrators%27_noticeboard/Archive228#Bulk_creation_of_redirects"&gt;discussion about my thoughts&lt;/a&gt; (warning: large page) and got a clear green light to create redirects (or 'related terms' in librarian speak) for MSH terms which are culturally-specific to Māori culture.&lt;br /&gt;&lt;br /&gt;I'm about 50% of the way through the 1300 terms of the MSH and have 115 redirects in the newly created &lt;a href="http://en.wikipedia.org/wiki/Category:Redirects_from_M%C4%81ori_language_terms"&gt;Category:Redirects from Māori language terms&lt;/a&gt;. That may sound pretty average, until you remember that institutions are increasingly rolling out tools such as Summon, which use wikipedia redirects for auto-completion, taking these mappings to the heart of most Māori speakers in higher and further education.&lt;br /&gt;&lt;br /&gt;I don't have a time-frame for the redirects to appear, but they haven't appeared in &lt;a href="http://otago.summon.serialssolutions.com/"&gt;Otago's Summon&lt;/a&gt;, whereas redirects I created ~ two years ago have; type 'jack yeates' and pause to see it at work.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-5130106030056557823?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/5130106030056557823/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=5130106030056557823' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5130106030056557823'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5130106030056557823'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/11/recreational-authority-control.html' title='Recreational authority control'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-6872329588216559015</id><published>2011-08-16T19:46:00.003+12:00</published><updated>2011-08-16T19:53:56.805+12:00</updated><title type='text'>Thoughts on "Letter about the TEI" from Martin Mueller</title><content type='html'>&lt;p&gt;Thoughts on "&lt;a href="http://ariadne.northwestern.edu/mmueller/teiletter.pdf"&gt;Letter about the TEI&lt;/a&gt;" from Martin Mueller&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Note: I am a member of the TEI&lt;span class="rwRRO" id="s5uk" title="tei-council@lists.village.Virginia.EDU"&gt; council&lt;/span&gt;, but this message is should be read as personal position at the time of writing, not a council position, nor the position of my employer.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Reading Martin's missive was painful. I should have responded  earlier, I think perhaps I was hoping someone else could say what I  wanted to say and I could just say "me too." They haven't so I've become  the someone else.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-weight:bold;"&gt;I don't think that Martin's "fairly radical  model" is nearly radical enough. I'd like to propose a significantly  more radical model as strawman&lt;/span&gt;:&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;hr/&gt;&lt;p&gt;1) The TEI shall maintain a document called the 'The TEI Principals.' The purpose of The TEI is to advance The TEI Principals.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;2)  Institutional membership of The TEI is open to  groups which  publish, collect and/or curate documents in formats released by The TEI.  Institutional membership requires members acknowledge The TEI Principals  and permits the members to be listed at &lt;a href="http://www.tei-c.org/Activities/Projects/"&gt;http://www.tei-c.org/Activities/Projects/&lt;/a&gt; and use The TEI logos and  branding.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;3) Individual membership of The TEI is open to  individuals; individual membership requires members acknowledge The TEI  Principals and subscribe to The TEI mailing list at  &lt;a href="http://listserv.brown.edu/?A0=TEI-L"&gt;http://listserv.brown.edu/?A0=TEI-L&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;4) All business of The TEI  is conducted in public. Business which needs be conducted in private  (for example employment matters, contract negotiation, etc) shall be  considered out of scope for The TEI.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;5) Changes to the structure  of The TEI will be discussed on the TEI mailing list and put to a  democratic vote with a voting period of at least one month, a two-thirds  majority of votes cast is required to pass a motion, which shall be in  English.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;6) Groups of members may form for activities from  time-to-time, such as members meetings, summer schools, promotions of The TEI or collective digitisation efforts, but these groups are not The TEI, even  if the word 'TEI' appears as part of their name.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;hr/&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;I'll  admit that there are a couple of issues not covered here (such as who  holds the IPR), but it's &lt;span style="font-weight:bold;"&gt;only a straw man for discussion. Feel free to  fire it as necessary&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-6872329588216559015?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/6872329588216559015/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=6872329588216559015' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6872329588216559015'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6872329588216559015'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/08/thoughts-on-letter-about-tei-from.html' title='Thoughts on &quot;Letter about the TEI&quot; from Martin Mueller'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-7855191486421740573</id><published>2011-06-23T22:02:00.002+12:00</published><updated>2011-06-23T22:10:56.475+12:00</updated><title type='text'>unit testing framework for XSL transformations?</title><content type='html'>I'm part of the  &lt;a href="http://www.tei-c.org/"&gt;TEI community&lt;/a&gt;, which maintains an XML standard which is commonly  transformed to HTML for presentation (more rarely PDF). The TEI standard  is relatively large but relatively well documented, the transformation  to HTML has thus far been largely piecemeal (from a software engineering point of view) and not error free.&lt;br /&gt;&lt;br /&gt;Recently  we've come under pressure to introduce significantly more complexity  into transformations, both to produce ePub (which is wrapped HTML bundled with media and metadata files) and HTML5 (which can represent more of the formal semantics in TEI). The  software engineer in me sees &lt;a href="http://en.wikipedia.org/wiki/Unit_testing"&gt;unit testing&lt;/a&gt; the a way to reduce our errors  while opening development up to a larger more diverse group of people  with a larger more diverse set of features they want to see implemented.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;The problem is, that I can't seem to find a decent unit testing framework for XSLT. Does anyone know of one?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Our  requirements are: XSLT 2.0; free to use; runnable on our ubuntu build  server; testing the transformation with multiple arguments; etc;&lt;br /&gt;&lt;br /&gt;We're already using: XSD, RNG, DTD and schematron schemas, epubcheck, xmllint, standard HTML validators, etc. Having the framework drive these too would be useful.&lt;br /&gt;&lt;br /&gt;The kinds of things we want to test include:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Footnotes appear once and only once&lt;/li&gt;&lt;li&gt;Footnotes are referenced in the text and there's a back link from the footnote to the appropriate point in the text&lt;/li&gt;&lt;li&gt;Internal references (tables of contents, indexes, etc) point somewhere&lt;/li&gt;&lt;li&gt;Language encoding used xml:lang survives from the TEI to the HTML&lt;/li&gt;&lt;li&gt;That all the paragraphs in the TEI appear at least once in the HTML&lt;/li&gt;&lt;li&gt;That local links work&lt;/li&gt;&lt;li&gt;Sanity check tables&lt;/li&gt;&lt;li&gt;Internal links within parallel texts&lt;/li&gt;&lt;li&gt;....&lt;/li&gt;&lt;/ol&gt;Any of many languages could be used to represent these tests, but ideally it should have a DOM library and be able to run that library across entire directories of files. Most of our community speak XML fluently, so leveraging that would be good.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-7855191486421740573?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/7855191486421740573/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=7855191486421740573' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7855191486421740573'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7855191486421740573'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/06/unit-testing-framework-for-xsl.html' title='unit testing framework for XSL transformations?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2142345088746622487</id><published>2011-03-23T20:13:00.002+13:00</published><updated>2011-03-23T20:26:03.367+13:00</updated><title type='text'>Is there a place for readers' collectives in the bright new world of eBooks?</title><content type='html'>The transition costs of migrating from the world of books-as-physical-artefacts-of-pulped-tree to the world of books-as-bitstreams are going to be non-trivial.&lt;br /&gt;&lt;br /&gt;Current attempts to drive the change (and by implication apportion those costs to other parties) have largely been driven by publishers, distributors and resellers of physical books in combination with the e-commerce and electronics industries which make and market the physical eBook readers on which eBooks are largely read. The e-commerce and electronics industries appear to see traditional publishing as an industry full of lumbering giants unable to compete with the rapid pace of change in the electronics industry and the associated turbulence in business models, and have moved to poach market-share. By-and-large they've been very successful. Amazon and Apple have shipped millions of devices billed as 'eBook readers' and pretty much all best-selling books are available on one platform or another.&lt;br /&gt;&lt;br /&gt;This top tier, however, is the easy stuff. It's not surprising that money can be made from the latest bodice-ripping page-turner, but most of the interesting reading and the majority of the units sold are outside the best-seller list, on the so-called 'long tail.'&lt;br /&gt;&lt;br /&gt;There's a whole range of books that I'm interested in that don't appear to be on the business plan of any of the current eBook publishers, and I'll miss them if they're not converted:&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;The back catalogue of local poetry. Almost nothing ever gets reprinted, even if the original has a tiny print run and the author goes on to have a wonderfully successful career. Some gets anthologised and a few authors are big enough to have a posthumous collected works, when their work is no longer cutting edge.&lt;/li&gt;&lt;li&gt;Some fabulous theses. I'm thinking of things like: &lt;a href="http://ir.canterbury.ac.nz/handle/10092/1978"&gt;http://ir.canterbury.ac.nz/handle/10092/1978&lt;/a&gt;, &lt;a href="http://victoria.lconz.ac.nz/vwebv/holdingsInfo?bibId=69659"&gt;http://victoria.lconz.ac.nz/vwebv/holdingsInfo?bibId=69659&lt;/a&gt; and  &lt;a href="http://otago.lconz.ac.nz/vwebv/holdingsInfo?bibId=241527"&gt;http://otago.lconz.ac.nz/vwebv/holdingsInfo?bibId=241527&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Lots of te reo Māori material (pick your local indigenous language if you're reading this outside New Zealand)&lt;/li&gt;&lt;li&gt;Local writing by local authors.&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;Note that all of these are local content---no foreign mega-corporation is going to regard this as their home-turf. Getting these documents from the old world to the new is going to require a local program run by (read funded by) locals.&lt;br /&gt;&lt;br /&gt;Would you pay for these things? I would, if it gave me what I wanted.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;What is it that readers want?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;We're all readers, of one kind or another, and we all want a different range of things, but I believe that what readers want / expect out of the digital transition is:&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;To genuinely own books. Not to own them until they drop their eReader in the bath and lose everything. Not to own them until a company they've never heard of goes bust and turns off a DRM server they've never heard of. Not to own them until technology moves on and some new format is in use. To own them in a manner which enables them to use them for at least their entire lifetime. To own them in a manner that poses at least a question for their heirs.&lt;/li&gt;&lt;li&gt;A choice of quality books. Quality in the broadest sense of the word. Choice in the broadest sense of the word. Universality is a pipe-dream, of course, but with releasing good books faster than I can read them.&lt;/li&gt;&lt;li&gt;A quality recommendation service. We all have trusted sources of information about books: friends, acquaintances, librarians or reviewers that history have suggested have similar ideas as us about what a good read is.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;To get some credit for already having bought the book in pulp-of-murdered-tree work. Lots of us have collections of wood-pulp and like to maintain the illusion that in some way that makes us well read.&lt;/li&gt;&lt;li&gt;Books bought to their attention based on whether they're worth reading, rather than what publishers have excess stock of. Since the concept of 'stock' largely vanishes with the transition from print to digital this shouldn't be too much of a problem.&lt;/li&gt;&lt;li&gt;Confidentially for their reading habits. If you've never come across it, go and read the ALA's &lt;a href="http://www.ala.org/ala/aboutala/offices/oif/statementspols/ftrstatement/freedomreadstatement.cfm"&gt;The Freedom to Read Statement&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;A not-for-profit readers' collective&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;It seems to me that the way to manage the transition from the old world to the new is as a not-for-profit readers' collective. By that I mean a subscription-funded system in which readers sign up for a range of  works every year. The works are digitised by the collective (the expensive step, paid for up-front), distributed to the subscribers in open file formats such as ePub (very cheap via the internet) and kept in escrow for them (a tiny but perpetual cost, more on this later).&lt;br /&gt;&lt;br /&gt;Authors, of course, need to pay their mortgage, and part of the digitisation would be obtaining the rights to the work. Authors of new work would be paid a 'reasonable' sum, based on their statue as authors (I have no idea what the current remuneration of authors is like, so I won't be specific). The collective would acquire (non-exclusive) the rights to digitise the work if not born digital, to edit it, distribute it to collective members and to sell it to non-members internationally (i.e. distribute it through 'conventional' digital book channels). In the case of sale to non-members through conventional digital book channels the author would get a cut. Sane and mutually beneficial deals could be worked out with libraries of various sizes.&lt;br /&gt;&lt;br /&gt;Generally speaking, I'd anticipate the rights to digitise and distribute in-copyright but out-of-print poetry would would be fairly cheap; the rights to fabulous old university theses cheaper; and rights to out-of-copyright materials are, of course, free.  The cost of rights to new novels / poetry would hugely depend on statue of the author and the quality of the work, which is where the collective would need to either employ a professional editor to make these calls or vote based on sample chapters / poems or some combination of the two. Costs of quality digitisation is non-trivial, but costs are much lower in bulk and dropping all the time. Depending on the platform in use, members of the collective might be recruited as proof-readers for OCR errors.&lt;br /&gt;&lt;br /&gt;That leaves the question of how to fund the the escrow. The escrow system stores copies of all the books the collective has digitised for the future use of the collectives' members and is required to give efficacy to the promise that readers really own the books. By being held in escrow, the copies survive the collective going bankrupt, being wound up, or evolving into something completely different, but requires funding. The simplest method of obtaining funding would be to align the collective with another established consumer of local literature and have them underwrite the escrow, a university, major library, or similar.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;The difference between a not-for-profit readers' collective and an academic press?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Of hundreds of years, major universities have had academic presses which publish quality content under the universities' auspices. The key difference between the  not-for-profit readers' collective I am proposing and an academic press is that the collective would attempt to publish the unpublished and out-of-print books that the members wanted rather than aiming to meet some quality criterion. I acknowledge a popularist bias here, but it's the members who are paying the subscriptions.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Which links in the book chain do we want to cut out?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;There are some links in the current book production chain which we need to keep, there are others wouldn't have a serious future in a not-for-profit. Certainly there is a role for judgement in which works to purchase with the collective's money. There is a role for editing, both large-scale and copy-editing. There is a role for illustrating works, be it cover images or icons. I don't believe there is a future for roles directly relating to the production, distribution, accounting for, sale, warehousing or pulping of physical books. There may be a role for the marketing books, depending on the business model (I'd like to think that most of the current marketing expense can be replaced by combination of author-driven promotion and word-of-month promotion, but I've been known to dream). Clearly there is an evolving techie role too.&lt;br /&gt;&lt;br /&gt;The role not mentioned above that I'd must like to see cut, of course, is that of the multinational corporation as gatekeeper, holding all the copyrights and clipping tickets (and wings).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2142345088746622487?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2142345088746622487/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2142345088746622487' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2142345088746622487'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2142345088746622487'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2011/03/is-there-place-for-readers-collectives.html' title='Is there a place for readers&apos; collectives in the bright new world of eBooks?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-6917338734970082655</id><published>2010-11-20T10:20:00.003+13:00</published><updated>2010-11-20T10:45:12.875+13:00</updated><title type='text'>HOWTO: Deep linking into the NZETC site</title><content type='html'>As the heaving mass of activity that is the &lt;a href="http://www.mixandmash.org.nz/"&gt;mixandmash competition&lt;/a&gt; heats up, I have come to realise that I should have better documented a feature of the NZETC site, the ability to extract the &lt;a href="http://www.tei-c.org/release/doc/tei-p5-doc/html/REF-ELEMENTS.html#"&gt;TEI xml&lt;/a&gt; annotated with the IDs for deep linking. &lt;br /&gt;&lt;br /&gt;Our content's archival form is TEI xml, which we massage for various output formats. There is a link from the top level of every document to the TEI for the document, which people are welcome to use in their mashups and remixes. Unfortunately, between that TEI and our HTML output is a deep magic that involves moving footnotes, moving page breaks, breaking pages into nicely browsable chunks, floating marginal notes, etc., and this makes it hard to deep link back to the website from anything derived from that TEI.&lt;br /&gt;&lt;br /&gt;There is another form of the TEI available which is annotated with whether or not each structural element maps 1:1 to an HTML: &lt;span style="font-weight:bold;"&gt;nzetc:has-text&lt;/span&gt; and what the ID of that page is: &lt;span style="font-weight:bold;"&gt;nzetc:id&lt;/span&gt; This annotated XML is found by replacing the 'tei-source' in the URL with 'etexts'&lt;br /&gt;&lt;br /&gt;Thus for &lt;span style="font-style:italic;"&gt;The Laws of England, Compiled and translated into the Māori language&lt;/span&gt; at &lt;a href="http://www.nzetc.org/tm/scholarly/tei-GorLaws.html"&gt;http://www.nzetc.org/tm/scholarly/tei-GorLaws.html&lt;/a&gt; there is the raw TEI at &lt;a href="http://www.nzetc.org/tei-source/GorLaws.xml"&gt;http://www.nzetc.org/tei-source/GorLaws.xml&lt;/a&gt; and the annotated TEI at &lt;a href="http://www.nzetc.org/etexts/GorLaws.xml"&gt;http://www.nzetc.org/etexts/GorLaws.xml&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Looking in the annotated TEI at &lt;a href="http://www.nzetc.org/etexts/GorLaws.xml"&gt;http://www.nzetc.org/etexts/GorLaws.xml&lt;/a&gt; we see for example:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;span style="font-weight:bold;"&gt;&amp;lt;div xml:id="t1-g1-t1-front1-tp1" xml:lang="en" rend="center" type="titlePage" nzetc:id="tei-GorLaws-t1-g1-t1-front1-tp1" nzetc:depth="5" nzetc:string-length="200" nzetc:has-text="true"&amp;gt;&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;This means that this &lt;span style="font-weight:bold;"&gt;div&lt;/span&gt; has it's own page (because it has &lt;span style="font-weight:bold;"&gt;nzetc:has-text="true"&lt;/span&gt; and that the ID of that page is tei-GorLaws-t1-g1-t1-front1-tp1 (because of the &lt;span style="font-weight:bold;"&gt;nzetc:id="tei-GorLaws-t1-g1-t1-front1-tp1"&lt;/span&gt;). The ID can be plugged into: http://www.nzetc.org/tm/scholarly/&amp;lt;ID&amp;gt;.html to get a URL for the HTML. Thus the URL for this div is  &lt;a href="http://www.nzetc.org/tm/scholarly/tei-GorLaws-t1-g1-t1-front1-tp1.html"&gt;http://www.nzetc.org/tm/scholarly/tei-GorLaws-t1-g1-t1-front1-tp1.html&lt;/a&gt; This process should work for both text and figures. &lt;br /&gt;&lt;br /&gt;Happy remixing everyone!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-6917338734970082655?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/6917338734970082655/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=6917338734970082655' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6917338734970082655'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6917338734970082655'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2010/11/howto-deep-linking-into-nzetc-site.html' title='HOWTO: Deep linking into the NZETC site'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-6521823705590946247</id><published>2009-11-08T20:36:00.002+13:00</published><updated>2009-11-08T20:41:17.942+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='macrons'/><category scheme='http://www.blogger.com/atom/ns#' term='maori'/><category scheme='http://www.blogger.com/atom/ns#' term='library'/><category scheme='http://www.blogger.com/atom/ns#' term='nzetc'/><title type='text'>ePubs and quality</title><content type='html'>You may have heard news about the release of "&lt;a title="bookserver" href="http://www.archive.org/bookserver/" id="o3uc"&gt;bookserver&lt;/a&gt;" by the good folks at the Internet Archive. This is a DRM-free ePub ecosystem, initially stocked with the prodigious output of Google's book scanning project and the Internet Archive's own book scanning project.&lt;br /&gt;&lt;br /&gt;To see how the NZETC stacked up against the much larger (and better funded) collection I picked one of our Maori Language dictionaries. Our Maori and Pacifica dictionaries month-after-month make up the bulk of our top five must used resources, so they're in-demand resources. They're also an appropriate choice because when they were encoded by the NZETC into TEI, the decision was made not to use full dictionary encoding, but a cheaper/easier tradeoff which didn't capture the linguistic semantics of the underlying entries, but treated them as typeset text. I was interested in how well this tradeoff was wearing.&lt;br /&gt;&lt;br /&gt;I did my comparison using the new&lt;a href="https://addons.mozilla.org/en-US/firefox/addon/45281"&gt; firefox ePub plugin&lt;/a&gt;, things will be slightly different if you're reading these ePubs on an iPhone or Kindle.&lt;br /&gt;&lt;p class="author"&gt;The ePub I looked at was &lt;a title="A Dictionary of the Maori Language" href="http://www.nzetc.org/tm/scholarly/tei-WillDict.html" id="n.h3"&gt;A Dictionary of the Maori Language&lt;/a&gt; by &lt;a href="http://www.nzetc.org/tm/scholarly/name-209644.html" title="Herbert William Williams — Wiremu Hapata"&gt;Herbert W. Williams&lt;/a&gt;. The NZETC has the 1957 sixth edition. There are two versions of the work on bookserver. A &lt;span class="opds-entry-item"&gt;&lt;span class="opds-entry-value"&gt;1852 second edition scanned by Google books (original at the New York Public library) and a &lt;/span&gt;&lt;/span&gt;&lt;span class="opds-entry-item"&gt;&lt;span class="opds-entry-value"&gt;1871 third edition scanned by the Internet Archive in association with Microsoft (original in the University of California library system). All the processing of both works appear to be been done in the U.S. The original print used macrons (NZETC), acutes (Google) and breves (Internet Archive) to mark long vowels. Find them &lt;a title="here" href="http://www.archive.org/bookserver/catalog/search?q=Dictionary+of+the+Maori+Language&amp;amp;submit=Search" id="otle"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="author"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="author"&gt;Lets take a look at some entries from each, starting at &lt;b&gt;'kapukapu':&lt;/b&gt;&lt;/p&gt;&lt;p class="author"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="author"&gt;&lt;a title="NZETC:" href="http://www.nzetc.org/tm/scholarly/tei-WillDict-t1-body-d1-d5.html" id="qy5w"&gt;NZETC:&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt; &lt;b&gt;kapukapu&lt;/b&gt;. &lt;b&gt;1&lt;/b&gt;. n. &lt;i&gt;Sole of the foot&lt;/i&gt;.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;2&lt;/b&gt;. Apparently a synonym for &lt;b&gt;kaunoti&lt;/b&gt;, the firestick which was kept steady with the foot. Tena ka riro, i runga i nga hanga a Taikomako, i te kapukapu, i te kaunoti (M. 351).&lt;/p&gt;            &lt;p&gt; &lt;b&gt;3&lt;/b&gt;. v.i. &lt;i&gt;Curl&lt;/i&gt; (as a wave). Ka kapukapu mai te ngaru.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;4&lt;/b&gt;. &lt;i&gt;Gush&lt;/i&gt;.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;5&lt;/b&gt;. &lt;i&gt;Gleam, glisten&lt;/i&gt;. Katahi ki te huka o Huiarau, kapukapu ana tera.&lt;/p&gt;                    &lt;div class="section" id="Kapua"&gt;            &lt;p class="hang"&gt; &lt;b&gt;Kapua&lt;/b&gt;, n. &lt;b&gt;1&lt;/b&gt;. &lt;i&gt;Cloud, bank of clouds&lt;/i&gt;. E tutakitaki ana nga kapua o te rangi, kei runga te Mangoroa e kopae pu ana (P.).&lt;/p&gt;            &lt;p&gt; &lt;b&gt;2&lt;/b&gt;. A flinty stone. = &lt;b&gt;kapuarangi&lt;/b&gt;.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;3&lt;/b&gt;. &lt;i&gt;Polyprion oxygeneios&lt;/i&gt;, a fish. = &lt;b&gt;hapuku&lt;/b&gt;.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;4&lt;/b&gt;. An edible species of fungus.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;5&lt;/b&gt;. Part of the titi pattern of tattooing.&lt;/p&gt;          &lt;/div&gt;          &lt;div class="section" id="Kapuarangi"&gt;            &lt;p class="hang"&gt; &lt;b&gt;Kapuarangi&lt;/b&gt;, n. A variety of &lt;b&gt;matā&lt;/b&gt;, or cutting stone, of inferior quality. = &lt;b&gt;kapua, 2&lt;/b&gt;.&lt;/p&gt;          &lt;/div&gt;&lt;div class="section" id="Ka.puhi"&gt;            &lt;p class="hang"&gt; &lt;b&gt;Kāpuhi, kāpuhipuhi&lt;/b&gt;, n. &lt;i&gt;Cluster of branches&lt;/i&gt; at the top of a tree.&lt;/p&gt;          &lt;/div&gt;          &lt;div class="section" id="Ka.pui"&gt;            &lt;p class="hang"&gt; &lt;b&gt;Kāpui&lt;/b&gt;, v.t. &lt;b&gt;1&lt;/b&gt;. &lt;i&gt;Gather up in a bunch&lt;/i&gt;. Ka kapuitia nga rau o te kiekie, ka herea.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;2&lt;/b&gt;. &lt;i&gt;Lace up&lt;/i&gt; or &lt;i&gt;draw in&lt;/i&gt; the mouth of a bag.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;3&lt;/b&gt;. &lt;i&gt;Earth up&lt;/i&gt; crops, or &lt;i&gt;cover up&lt;/i&gt; embers with ashes to keep them alight.&lt;/p&gt;            &lt;p&gt; &lt;b&gt;kāpuipui&lt;/b&gt;, v.t. &lt;i&gt;Gather up&lt;/i&gt; litter, etc.&lt;/p&gt;          &lt;/div&gt;          &lt;div class="section" id="Ka.puka"&gt;            &lt;p class="hang"&gt; &lt;b&gt;Kāpuka&lt;/b&gt;, n. &lt;i&gt;Griselinia littoralis&lt;/i&gt;, a tree. = &lt;b&gt;papauma&lt;/b&gt;.&lt;/p&gt;          &lt;/div&gt;          &lt;div class="section" id="Kapukiore"&gt;            &lt;p class="hang"&gt; &lt;b&gt;Kapukiore&lt;/b&gt;, n. &lt;i&gt;Coprosma australis&lt;/i&gt;, a shrub. = &lt;b&gt;kanono&lt;/b&gt;.&lt;/p&gt;          &lt;/div&gt;          &lt;div class="section" id="Ka.puku"&gt;            &lt;p class="hang"&gt; &lt;b&gt;Kāpuku&lt;/b&gt; = &lt;b&gt;kōpuku&lt;/b&gt;, n. &lt;i&gt;Gunwale&lt;/i&gt;.&lt;/p&gt;&lt;p class="hang"&gt;&lt;br /&gt;&lt;/p&gt;&lt;p class="hang"&gt;&lt;br /&gt;&lt;/p&gt;          &lt;/div&gt;&lt;p class="author"&gt;&lt;u&gt;&lt;i&gt;Google Books:&lt;/i&gt;&lt;/u&gt;&lt;br /&gt;&lt;/p&gt;       &lt;p&gt; Kapukapu,  s. Sole of the foot,&lt;/p&gt;       &lt;p&gt; Eldpukdpu,   v.    To curl* as a&lt;/p&gt;       &lt;p&gt; wave.&lt;/p&gt;       &lt;p&gt; Ka kapukapu mai te ngaru; The wave curls over.&lt;/p&gt;       &lt;p&gt; Kapunga,  v. To take up with both hands held together,&lt;/p&gt;       &lt;p&gt; Kapungatia he kai i te omu; Take up food from the oven.&lt;/p&gt;       &lt;p&gt; (B.  C,&lt;/p&gt;       &lt;p&gt; Kapura,  s.    Fire, -'      Tahuna he kapura ;   Kindle a fire.&lt;/p&gt;       &lt;p&gt; Kapurangi,  s. Rubbish; weeds,&lt;/p&gt;       &lt;p&gt; Kara,  s.    An old man,&lt;/p&gt;&lt;p&gt;       &lt;/p&gt;&lt;p&gt; Tena korua ko kara ? How are you and the old man ?&lt;/p&gt;       &lt;p&gt; Kara,  s&gt;    Basaltic stone.&lt;/p&gt;       &lt;p&gt; He kara te kamaka nei; This stone is kara.&lt;/p&gt;       &lt;p&gt; Karaha,  s.   A calabash. ♦Kardhi, *.     Glass,&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;u&gt;&lt;i&gt;Internet Archive:&lt;/i&gt;&lt;/u&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt; kapukapu, n.  sole of the foot.&lt;/p&gt;       &lt;p&gt; kapukapu, v. i. 1.  curl  (as a wave). Ka kapukapu mai te ngaru.    2.  gush.&lt;/p&gt;       &lt;p&gt; kakapii,  small basket  for cooked food.&lt;/p&gt;       &lt;p&gt; Kapua, n.  cloud; hank of clouds,&lt;/p&gt;       &lt;p&gt; Kapunga, n.  palm of the hand.&lt;/p&gt;       &lt;p&gt; kapunga,  \.  t.  take up in  both hands together.&lt;/p&gt;       &lt;p&gt; Kapiira, n.  fire.&lt;/p&gt;       &lt;p&gt; Kapiiranga, n.  handful.&lt;/p&gt;       &lt;p&gt; kapuranga, v. t.  take up by hand-fuls.  Kapurangatia nga otaota na e ia. v. i.  dawn.  Ka kapuranga te ata.&lt;/p&gt;       &lt;p&gt; Kapur&amp;amp;ngi, n.  rubbish; uveds.&lt;/p&gt;I.  K&amp;amp;r&amp;amp;, n.  old man.  Tena korua ko kara.       &lt;p&gt; II.  K&amp;amp;r&amp;amp;,  n.  secret  plan; conspiracy.  Kei te whakatakoto kara mo Te Horo kia patua.&lt;/p&gt;       &lt;p&gt; k&amp;amp;k&amp;amp;r&amp;amp;, D.  scent; smell.&lt;/p&gt;       &lt;p&gt; k&amp;amp;k&amp;amp;r&amp;amp;, a.    savoury; odoriferous.&lt;/p&gt;       &lt;p&gt; k^ar&amp;amp;,  n.   a shell-iish.&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;Unlike the other two, the NZETC version has accents, bold and italics in the right place. It' the only one with a workable and useful table of contents. It is also edition which has been extensively revised and expanded. Google's second edition has many character errors, while the Internet Archive's third edition has many 'á' mis-recognised as '&amp;amp;.' The Google and Internet Achive versions are also available as PDFs, but of course, without fancy tables of contents these PDFs are pretty challenging to navigate and because they're built from page images, they're huge.&lt;br /&gt;&lt;br /&gt;It's tempting to say that the NZETC version is better than either of the others, and from a naïve point of it is, but it's more accurate to say that it's different. It's a digitised version of a book revised more than a hundred years after the &lt;span class="opds-entry-item"&gt;&lt;span class="opds-entry-value"&gt;1852 second edition scanned by Google books&lt;/span&gt;&lt;/span&gt;. People who're interested in the history of the language are likely to pick the 1852 edition over the 1957 edition nine times out of ten.&lt;br /&gt;&lt;br /&gt;Technical work is currently underway to enable third parties like the Internet Archive's bookserver to more easily redistribute our ePubs. For some semi-arcane reasons it's linked to upcoming new search functionality.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-6521823705590946247?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/6521823705590946247/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=6521823705590946247' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6521823705590946247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6521823705590946247'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/11/epubs-and-quality.html' title='ePubs and quality'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-5562252431568975315</id><published>2009-11-08T14:56:00.002+13:00</published><updated>2009-11-08T15:00:44.028+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='librarything'/><category scheme='http://www.blogger.com/atom/ns#' term='library'/><category scheme='http://www.blogger.com/atom/ns#' term='nzetc'/><category scheme='http://www.blogger.com/atom/ns#' term='epubs'/><title type='text'>What LibraryThing metadata can the NZETC reasonable stuff inside it's CC'd epubs?</title><content type='html'>This is the second blog  following on from an excellent talk about &lt;a href="http://www.librarything.com/home/stuartyeates"&gt;librarything&lt;/a&gt; by &lt;a href="http://www.librarything.com/profile/timspalding"&gt;LibraryThing's Tim&lt;/a&gt; given the VUW in Wellington after his trip to &lt;a href="http://www.lianza.org.nz/events/conference2009/index.html"&gt;LIANZA&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The &lt;a href="http://www.nzetc.org/"&gt;NZETC&lt;/a&gt; publishes all of it's works as epubs (a file format primarily aimed at mobile devices), which are literally processed crawls of it's website bundled with some metadata. For some of the NZETC works (such as Erewhon and The Life of Captain James Cook), LibraryThing has a lot more metadata than the NZETC, becuase many LibraryThing users have the works and have entered metadata for them. Bundling as much metadata into the epubs makes sense, because these are commonly designed for offline use---call-back hooks are unlikely to be avaliable.&lt;br /&gt;&lt;br /&gt;So what kinds of data am I interested in?&lt;br /&gt;1) Traditional bibliographic metadata. Both LT and NZETC have this down really well.&lt;br /&gt;2) Images. LT has many many cover images, NZETC has images of plates from inside many works too.&lt;br /&gt;3) Unique identification (ISBNs, ISSNs, work ids, etc). LT does very well at this, NZETC very poorly&lt;br /&gt;4) Genre and style information. LT has tags to do fancy statistical analysis on, and does. NZETC has full text to do fancy statistical analysis on, but doesn't.&lt;br /&gt;5) Intra-document links. LT has work as the smallest unit. NZETC reproduces original document tables of contents and indexes, cross references and annotations.&lt;br /&gt;6) Inter-document links. LT has none. NZETC captures both 'mentions' and 'cites' relationships between documents.&lt;br /&gt;&lt;br /&gt;While most current-generation ebook readers, of course, can do nothing with most of this metadata, but I'm looking forward to the day when we have full-fledged OpenURL resolvers which can do interesting things, primarily picking the best copy (most local / highest quality / most appropiate format / cheapest) of a work to display to a user; and browsing works by genre (LibraryThing does genre very well, via tags).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-5562252431568975315?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/5562252431568975315/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=5562252431568975315' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5562252431568975315'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5562252431568975315'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/11/what-librarything-metadata-can-nzetc.html' title='What LibraryThing metadata can the NZETC reasonable stuff inside it&apos;s CC&apos;d epubs?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-5828136464717378879</id><published>2009-10-15T21:36:00.004+13:00</published><updated>2009-10-16T07:51:59.513+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='authority'/><category scheme='http://www.blogger.com/atom/ns#' term='community building'/><category scheme='http://www.blogger.com/atom/ns#' term='social network'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='metadata'/><category scheme='http://www.blogger.com/atom/ns#' term='taxonomy'/><title type='text'>Interlinking of collections: the quest continues</title><content type='html'>After an excellent talk today about&lt;a href="http://www.librarything.com/profile/stuartyeates"&gt; LibraryThing&lt;/a&gt; by &lt;a href="http://www.librarything.com/profile/timspalding"&gt;LibraryThing's Tim&lt;/a&gt;, I got enthused to see how LibraryThing stacks up against other libraries for having matches in it's authority control system for entities we (the &lt;a href="http://www.nzetc.org/"&gt;NZETC&lt;/a&gt;) care about.&lt;br /&gt;The answer is averagely.&lt;br /&gt;For copies of printed books less than a hundred years old (or reprinted in the last hundred years), and their authors, LibraryThing seems to do every well. These are the books likely to be in active circulation in personal libraries, so it stands to reason that these would be well covered.&lt;br /&gt;I tried half a dozen books from our &lt;a href="http://www.nzetc.org/tm/scholarly/tei-corpus-nineteenthCenturyNovels.html"&gt;Nineteenth-Century Novels Collection&lt;/a&gt;, and most were missing, &lt;a href="http://www.nzetc.org/tm/scholarly/tei-ButErew.html"&gt;Erewhon&lt;/a&gt;, of course, was &lt;a href="http://www.librarything.com/work/19407"&gt;well represented&lt;/a&gt;. LibraryThing doesn't have the "&lt;a href="http://www.nzetc.org/tm/scholarly/name-122436.html"&gt;Treaty of Waitangi&lt;/a&gt;" (a set of manuscripts) but it does have "&lt;a href="http://www.librarything.com/work/5312337"&gt;Facsimiles of the Treaty of Waitangi&lt;/a&gt;." It's not clear to me whether these would be merged under their cataloguing rules.&lt;br /&gt;Coverage of non-core bibliographic entities was lacking. Places get a little odd. Sydney is "&lt;a href="http://www.librarything.com/place/Sydney,%20New%20South%20Wales,%20Australia"&gt;http://www.librarything.com/place/Sydney,%20New%20South%20Wales,%20Australia&lt;/a&gt;" but Wellington is "&lt;a href="http://www.librarything.com/place/Wellington"&gt;http://www.librarything.com/place/Wellington&lt;/a&gt;" and &lt;span class="fwikiSubnavBreadcrumbItem"&gt;&lt;a href="http://www.nzetc.org/tm/scholarly/name-123470.html"&gt;Anzac Cove&lt;/a&gt; appears to be is missing altogether. This doesn't seem like a sane &lt;/span&gt;authority control system for places, as far as I can see. People who are the subjects rather than the authors of books didn't come out so well. I couldn't find Abel Janszoon Tasman, Pōtatau Te Wherowhero or Charles Frederick Goldie, all of which are near and dear to our hearts.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://spreadsheets.google.com/ccc?key=0AtkIjlDqC2H4cFlrdFdSX2ZQNFhiSGdDQWNRc1FrZEE&amp;amp;hl=en_GB"&gt;Here is the spreadsheet of how different web-enabled systems map entities we care about.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(51, 255, 51);"&gt;Correction:&lt;/span&gt; It seems that the correct URL for Wellington is &lt;a href="http://www.librarything.com/place/Wellington,%20New%20Zealand"&gt;http://www.librarything.com/place/Wellington,%20New%20Zealand&lt;/a&gt; which brings sanity back.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-5828136464717378879?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/5828136464717378879/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=5828136464717378879' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5828136464717378879'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5828136464717378879'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/10/interlinking-of-collections-quest.html' title='Interlinking of collections: the quest continues'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-5246952927527204101</id><published>2009-09-19T21:57:00.002+12:00</published><updated>2009-09-19T22:03:28.387+12:00</updated><title type='text'>eBook readers need OpenURL resolvers</title><content type='html'>Everyone's talking about the next generation of eBook readers having larger reading area, more battery life and more readable screen. I'd give up all of those, however, for an eBook reader that had an internal &lt;a href="http://en.wikipedia.org/wiki/OpenURL"&gt;OpenURL&lt;/a&gt; resolver.&lt;br /&gt;&lt;br /&gt;OpenURL is the nifty protocol that libraries use to find the closest copy of a electronic resources and direct patrons to copies that the library might have already licensed from commercial parties. It's all about finding the version of a resource that is most accessible to the user, dynamically.&lt;br /&gt;&lt;p&gt;Say I've loaded 500 eBooks into my eBook reader: a couple of encyclopedias and dictionaries; a stack of books I was meant to read in school but only skimmed and have been meaning to get back to; current block-busters; guidebooks to the half-dozen countries I'm planning on visiting over the next couple of years; classics I've always meant to read (Tolstoy, Chaucer, Cervantes, Plato, Descartes, Nietzsche); and local writers (Baxter, Duff, Ihimaera, Hulme, ...). My eBooks by Nietzsche are going to refer to books by Descartes and Plato; my eBooks by Descartes are going to refer to books by Plato; my encyclopaedias are going to refer to pretty much everything; most of the works in translation are going to contain terms which I'm going to need help with (help which theencyclopedias and dictionaries can provide). &lt;br /&gt;&lt;/p&gt;  &lt;p&gt;Ask yourself, though, whether you'd want to flick between works on the current generation of readers---very painful, since these devices are not designed for efficient navigation between eBooks, but linear reading of them. You can't follow links between them, of course, because on current systems links must point either with the same eBook or out on to the internet---pointing to other eBooks on the same device is verboten. OpenURL  can solve this by catching those URLs and making them point to local copies of works (and thus available for free even when the internet is unavailable) where possible while still retaining their &lt;br /&gt;&lt;/p&gt;  Until eBook readers have a mechanism like this eBooks will be at most a replacement only for paperback novels---not personal libraries.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-5246952927527204101?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/5246952927527204101/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=5246952927527204101' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5246952927527204101'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5246952927527204101'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/09/ebook-readers-need-openurl-resolvers.html' title='eBook readers need OpenURL resolvers'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2446413394973576114</id><published>2009-09-15T21:30:00.001+12:00</published><updated>2009-09-16T06:44:59.785+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='library'/><title type='text'>Thoughts on koha</title><content type='html'>&lt;a href="http://koha.org/"&gt;&lt;br /&gt;&lt;/a&gt;The &lt;a href="http://koha.org/"&gt;Koha&lt;/a&gt; community is currently undergoing a spasm, with a company apparently forking the code.&lt;br /&gt;As a result a bunch of people are looking at where the community should go from here and how it should be led. In particular the idea of a &lt;a href="http://wiki.koha.org/doku.php?id=kohafoundation"&gt;not-for-profit foundation&lt;/a&gt; has been floated and is to be discussed at a &lt;a href="http://wiki.koha.org/doku.php?id=meetingnotes09sep15"&gt;meeting early tomorrow morning&lt;/a&gt; .&lt;br /&gt;My thoughts on this issue are pretty simple:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;A not-for-profit is a fabulous idea&lt;/li&gt;&lt;li&gt;Reusing one of the existing software not-for-profit (Apache, Software in the Public Interest, etc) introduces a layer of non-library complexity. Libraries are have a long history with consortia, but tend to very much flock together with their own kind, I can see them being leary of a non-library entity.&lt;/li&gt;&lt;li&gt;A clear description of a forward-looking plan written in plain language that everyone can understand is vital to communicate the vision of the community, particularly to those currently on the fringes&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2446413394973576114?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2446413394973576114/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2446413394973576114' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2446413394973576114'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2446413394973576114'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/09/thoughts-on-koha.html' title='Thoughts on koha'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-7346253659968330712</id><published>2009-09-01T18:44:00.004+12:00</published><updated>2009-09-01T18:51:45.965+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='place names'/><category scheme='http://www.blogger.com/atom/ns#' term='NewZealand'/><category scheme='http://www.blogger.com/atom/ns#' term='GIS'/><category scheme='http://www.blogger.com/atom/ns#' term='geolocation'/><category scheme='http://www.blogger.com/atom/ns#' term='binomial names'/><category scheme='http://www.blogger.com/atom/ns#' term='taxonomy'/><category scheme='http://www.blogger.com/atom/ns#' term='#opengovt'/><title type='text'>Data and data modelling and underlying assumptions</title><content type='html'>I feel that there was a huge disconnect between some groups of participants at #opengovt (&lt;a href="http://groups.google.co.nz/group/nzopengovtbarcamp"&gt;http://groups.google.co.nz/group/nzopengovtbarcamp&lt;/a&gt;) in Wellington last weekend. This is my attempt to illuminate the gaps.&lt;br /&gt;&lt;br /&gt;The gaps were about data and data modelling and underlying assumptions that the way one person / group / institution viewed a kind of data was the same as the way others viewed it.&lt;br /&gt;&lt;br /&gt;This gap is probably most pronounced in geo-location.&lt;br /&gt;&lt;br /&gt;There's a whole bunch of very bright people doing wonderful mashups in geo-location using a put-points-on-a-map model. Typically using google maps (or one of a small number of competitors) they give insights into all manner of things by throwing points onto maps, street views, etc, etc. It's a relatively new field and every time I look they seem to have a whizzy new toy. Whizzy thing of the day for me was &lt;a href="http://groups.google.com/group/digitalnz/browse_thread/thread/b5b0c96ce08ca441"&gt;http://groups.google.com/group/digitalnz/browse_thread/thread/b5b0c96ce08ca441&lt;/a&gt; .  Unfortunately the very success of the 'data as points' model encourages the view that location is a lat / long pair and the important metric is the number of significant digits in the lat / long.&lt;br /&gt;&lt;br /&gt;In the GLAM (Galleries, Libraries, Archives and Museums) sector, we have a tradition of using thesauri such as the Getty Thesaurus of Geographic Names. Take all look at the entry for The Wellington region:&lt;a href="http://www.getty.edu/vow/TGNFullDisplay?find=wellington&amp;place=&amp;nation=New+Zealand&amp;prev_page=1&amp;english=Y&amp;subjectid=7000512"&gt;http://www.getty.edu/vow/TGNFullDisplay?find=wellington&amp;place=&amp;nation=New+Zealand&amp;prev_page=1&amp;english=Y&amp;subjectid=7000512&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Yes, if has a lat and a long (with laughable precision), but the lat and long are arguably the least important information on the page. There's a faceted hierarchy, synonyms, linked references and type data. Te Papa have just moved to Getty for place names in their new site (&lt;a href="http://collections.tepapa.govt.nz/"&gt;http://collections.tepapa.govt.nz/&lt;/a&gt;) and frankly, I'm jealous. They paid a few thousand dollars for a licence to thesaurus and it's a joy to use.&lt;br /&gt;&lt;br /&gt;The idea of #opengovt is predicated on institutions and individuals speaking the same languages, being able to communicate effectively, and this is clearly a case where we're not. Learning to speak each others languages seems like it's going to be key to this whole venture.&lt;br /&gt;&lt;br /&gt;As something of a worked example, here's something that I'm working on at the moment. It's a page from The Manual of the New Zealand Flora by Thomas Frederick Cheeseman, a core text in New Zealand botany, see  &lt;a href="http://www.nzetc.org/tm/scholarly/tei-CheManu-t1-body1-d22-d5.html"&gt;http://www.nzetc.org/tm/scholarly/tei-CheManu-t1-body1-d22-d5.html&lt;/a&gt; The text is live on our website, but it's not yet fully marked up. I've chosen it because it illustrates two separate kinds of languages and their disparities.&lt;br /&gt;&lt;br /&gt;What are the geographic locations on that page?&lt;br /&gt;&lt;br /&gt;    * Nelson-Mountains flanking the Clarence Valley&lt;br /&gt;    * Marlborough—Kaikoura Mountains&lt;br /&gt;    * Canterbury—Kowai River&lt;br /&gt;    * Canterbury—Coleridge Pass&lt;br /&gt;    * Otago—Mount St. Bathan's&lt;br /&gt;&lt;br /&gt;The qualifier "2000–5000 ft" (which I believe is an elevation range at which these flourish) applies across these. Clearly we're going to struggle to represent these with a finite number of lat/long points, no matter how accurate. In all likelihood, I'll not actually mark up these locations, since the because no one's working with complex locations, the cost benifit isn't within sight of being worth it.&lt;br /&gt;&lt;br /&gt;Te Papa and the NZETC have a small-scale binomial name exercise underway, and for that I'll be scripting the extraction of the following names from that page:&lt;br /&gt;&lt;br /&gt;    * Notospartium carmichœliœ (synonym Notospartium carmichaeliae)&lt;br /&gt;    * Notospartium torulosum&lt;br /&gt;&lt;br /&gt;There were a bunch of folks at the #opengovt barcamp who're involved in the "New Zealand Organisms Register" (&lt;a href="http://www.nzor.org.nz/"&gt;http://www.nzor.org.nz/&lt;/a&gt;) project. As I understand it, they want me to expose the following names from that page:&lt;br /&gt;&lt;br /&gt;    * Notospartium carmichœliœ, Hook. f.&lt;br /&gt;    * Notospartium torulosum, Hook. f.&lt;br /&gt;&lt;br /&gt;Of course the name the public want is:&lt;br /&gt;&lt;br /&gt;    * New Zealand Pink Broom&lt;br /&gt;    * ? (Notospartium torulosum appears not to have a common name)&lt;br /&gt;&lt;br /&gt;Note that none of these taxonomic names actually appear in full on the page...&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Yes is, clearly, an area where the best can be the good and visa versa, but the good needs to at least be aware of the best.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-7346253659968330712?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/7346253659968330712/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=7346253659968330712' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7346253659968330712'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7346253659968330712'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/09/data-and-data-modelling-and-underlying.html' title='Data and data modelling and underlying assumptions'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1065249124190247727</id><published>2009-07-27T22:31:00.002+12:00</published><updated>2009-07-27T22:47:02.165+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='TEI'/><category scheme='http://www.blogger.com/atom/ns#' term='code'/><category scheme='http://www.blogger.com/atom/ns#' term='xslt'/><title type='text'>Learning XSLT 2.0 Part 1; Finding Names</title><content type='html'>&lt;p&gt;We mark up a lot of names, so one of the first things I decided to do was to build an XSLT stylesheet that takes a list of names and tags those names when they occur in a separate XSLT file. To make things easier and clearer, I've ignored little things like namespaces, conformant TEI, etc, etc.&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;First up, the list of names, these are multi-word names. Notice the simple structure, this could easily be built from a comma seperated list or similar:&lt;/p&gt;&lt;br /&gt; &lt;br /&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;&lt;br /&gt;&amp;lt;names&amp;gt;&lt;br /&gt;   &amp;lt;name&amp;gt;Papaver argemone&amp;lt;/name&amp;gt;&lt;br /&gt;   &amp;lt;name&amp;gt;Papaver dubium&amp;lt;/name&amp;gt;&lt;br /&gt;   &amp;lt;name&amp;gt;Papaver Rhceas&amp;lt;/name&amp;gt;&lt;br /&gt;&amp;lt;name&amp;gt;Zanthoxylum novæ-zealandiæ&amp;lt;/name&amp;gt;&lt;br /&gt;&amp;lt;/names&amp;gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;Next, some sample text:&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;&lt;br /&gt;&amp;lt;doc&amp;gt;&lt;br /&gt;&amp;lt;p&amp;gt; There are several names Papaver argemone in this document Papaver argemone&amp;lt;/p&amp;gt;&lt;br /&gt;&amp;lt;p&amp;gt; Some of them are the same as others (Papaver Rhceas Papaver rhceas P. rhceas)&amp;lt;/p&amp;gt;&lt;br /&gt;&amp;lt;p&amp;gt; Non ASCII characters shouldn't cause a problem in names like Zanthoxylum novæ-zealandiæ AKA Zanthoxylum novae-zealandiae&amp;lt;/p&amp;gt;&lt;br /&gt;&amp;lt;/doc&amp;gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;Finally the stylesheet. It consists of three parts: the regexp variable that builds a regexp from the names in the file; a default template for everything but text(); and a template for text()s that applies the rexexp.&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&amp;lt;?xml version="1.0"?&amp;gt;&lt;br /&gt;&amp;lt;xsl:stylesheet  version="2.0"&lt;br /&gt;         xmlns:xsl="http://www.w3.org/1999/XSL/Transform"  &amp;gt;&lt;br /&gt;&lt;br /&gt;  &amp;lt;!-- build a regexp of the names --&amp;gt;&lt;br /&gt;  &amp;lt;xsl:variable name="regexp"&amp;gt;&lt;br /&gt;    &amp;lt;xsl:value-of select="concat('(',string-join(document('name-list.xml')//name/text(), '|'), ')')"/&amp;gt;&lt;br /&gt;  &amp;lt;/xsl:variable&amp;gt;&lt;br /&gt;&lt;br /&gt;  &amp;lt;!-- generic copy-everything-except-texts template --&amp;gt;&lt;br /&gt;  &amp;lt;xsl:template match="@*|*|processing-instruction()|comment()"&amp;gt;&lt;br /&gt;    &amp;lt;xsl:copy&amp;gt;&lt;br /&gt;      &amp;lt;xsl:apply-templates select="@*|*|processing-instruction()|comment()|text()"/&amp;gt;&lt;br /&gt;    &amp;lt;/xsl:copy&amp;gt;&lt;br /&gt;  &amp;lt;/xsl:template&amp;gt;&lt;br /&gt; &lt;br /&gt;  &amp;lt;!-- Look for binomal names in appreviated form where the genus name is in the immediately preceeding head --&amp;gt;&lt;br /&gt;  &amp;lt;xsl:template match="text()"&amp;gt;&lt;br /&gt;    &amp;lt;xsl:analyze-string select="." regex="{$regexp}"&amp;gt;&lt;br /&gt;      &amp;lt;xsl:matching-substring&amp;gt;&lt;br /&gt;    &amp;lt;name type="taxonomic" subtype="matched"&amp;gt;&lt;br /&gt;      &amp;lt;xsl:value-of select="regex-group(1)"/&amp;gt;&lt;br /&gt;    &amp;lt;/name&amp;gt;&lt;br /&gt;      &amp;lt;/xsl:matching-substring&amp;gt;&lt;br /&gt;      &amp;lt;xsl:non-matching-substring&amp;gt;&lt;br /&gt;      &amp;lt;xsl:value-of select="."/&amp;gt;&lt;br /&gt;      &amp;lt;/xsl:non-matching-substring&amp;gt;&lt;br /&gt;    &amp;lt;/xsl:analyze-string&amp;gt;&lt;br /&gt;  &amp;lt;/xsl:template&amp;gt;&lt;br /&gt;&lt;br /&gt;&amp;lt;/xsl:stylesheet&amp;gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;The output looks like:&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;&amp;lt;doc&amp;gt;&lt;br /&gt;&amp;lt;p&amp;gt; There are several names &amp;lt;name type="taxonomic" subtype="matched"&amp;gt;Papaver argemone&amp;lt;/name&amp;gt; in this document &amp;lt;name type="taxonomic" subtype="matched"&amp;gt;Papaver argemone&amp;lt;/name&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;&amp;lt;p&amp;gt; Some of them are the same as others (&amp;lt;name type="taxonomic" subtype="matched"&amp;gt;Papaver Rhceas&amp;lt;/name&amp;gt; Papaver rhceas P. rhceas)&amp;lt;/p&amp;gt;&lt;br /&gt;&amp;lt;p&amp;gt; Non ASCII characters shouldn't cause a problem in names like &amp;lt;name type="taxonomic" subtype="matched"&amp;gt;Zanthoxylum novæ-zealandiæ&amp;lt;/name&amp;gt; AKA Zanthoxylum novae-zealandiae&amp;lt;/p&amp;gt;&lt;br /&gt;&amp;lt;/doc&amp;gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;As you may notice, I've not yet worked out the best way to handle the 'æ'&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1065249124190247727?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1065249124190247727/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1065249124190247727' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1065249124190247727'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1065249124190247727'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/07/learning-xslt-20-part-1-finding-names.html' title='Learning XSLT 2.0 Part 1; Finding Names'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1923868213302505974</id><published>2009-06-06T23:22:00.002+12:00</published><updated>2009-06-06T23:25:31.317+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='TEI'/><category scheme='http://www.blogger.com/atom/ns#' term='maori'/><category scheme='http://www.blogger.com/atom/ns#' term='Māori'/><category scheme='http://www.blogger.com/atom/ns#' term='nzetc'/><category scheme='http://www.blogger.com/atom/ns#' term='teReo'/><title type='text'>Legal Māori Archive</title><content type='html'>&lt;span style="font-size:85%;"&gt;&lt;br /&gt;Now that the &lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;a title="Legal Māori Archive" href="http://www.nzetc.org/tm/scholarly/tei-corpus-legalMaori.html" id="de9n"&gt;Legal Māori Archive&lt;/a&gt; is live, I thought I'd highlight a couple of my favourite texts from the corpus.&lt;br /&gt;&lt;br /&gt;The first is a great example of reinforcing cultural confusion. &lt;/span&gt;&lt;a href="http://www.nzetc.org/tm/scholarly/tei-GorLaws.html"&gt;"The Laws of England, Compiled and translated into the Māori language"&lt;/a&gt; by judge &lt;a href="http://www.nzetc.org/tm/scholarly/name-160002.html"&gt;Francis Dart Fenton&lt;/a&gt; is a bi-lingual compendium of the laws of England, but extraordinarily uses &lt;a title="bible quotes as examples" href="http://www.nzetc.org/tm/scholarly/tei-GorLaws-t1-g1-t2-body1-d1-d55.html" id="o-35"&gt;bible quotes as examples&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The second example is actaully a collection of texts, the &lt;a title="works of Rev. Henry Hanson Turton" href="http://www.nzetc.org/tm/scholarly/name-401540.html" id="a9l9"&gt;works of Rev. Henry Hanson Turton&lt;/a&gt;, who compiled thousands of pages of land deeds and associated documents into six volumes. I can see these seeing a lot of use by Treaty researchers.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1923868213302505974?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1923868213302505974/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1923868213302505974' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1923868213302505974'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1923868213302505974'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/06/legal-maori-archive.html' title='Legal Māori Archive'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-7555004382390805753</id><published>2009-05-05T21:54:00.004+12:00</published><updated>2009-06-04T19:28:09.289+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='rant'/><category scheme='http://www.blogger.com/atom/ns#' term='MARC'/><category scheme='http://www.blogger.com/atom/ns#' term='library'/><category scheme='http://www.blogger.com/atom/ns#' term='metadata'/><title type='text'>Why card-based records aren't good enough</title><content type='html'>Card catalogs have a long tradition in librarianship, dating back, I'm told, to the &lt;a href="http://liswiki.org/wiki/History_of_the_card_catalog"&gt;book stock-take in the French revolution&lt;/a&gt;. Librarians understand card catalogs in a deep way that comes from generations of librarians having used them as a core professional tool all their professional lives. Librarians understand card catalogs in ways that I, as a computer scientist, never will. I still recall on one of my first visits to a university library, I asked a librarian where I might find books by a particular author, they found the work for me arguably as fast as I can now find works with the new wizzy electronic catalog.&lt;br /&gt;&lt;br /&gt;It is natural, when faced with something new, to understand it in terms of what we already know and already understand. Unfortunately, understanding the new by analogy to the old can lead to form of the old being assumed in the new. It was true that when libraries digitized their card catalogs in the 1970s and 1980s, they were more or less exactly digital versions of the card catalog predecessors, because their content was limited to old data from the cards and new data from cataloging processes (which were unchanged from the card catalog era) and because librarians and users had come to equate a library catalog with a card catalog---it was what they expected.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://en.wikipedia.org/wiki/MARC_standards"&gt;MARC&lt;/a&gt; is a perfect example of this kind of thing. As a data format to directly replace a card catalog of printed books, it can hardly be faulted.&lt;br /&gt;&lt;br /&gt;Unfortunately, digital metadata has capabilities undreamt of at the time of the French revolution, and card catalogs and MARC do a poor job of handling these capabilities.&lt;br /&gt;&lt;br /&gt;A whole range of people have come up with criticisms of MARC that involve materials and methodologies not routinely held in libraries at the time of the French revolution (digital journal subscriptions and music, for example), but I view these as postdating card catalogs and thus the criticism as unfair.&lt;br /&gt;&lt;br /&gt;So what was held in libraries in 1789 that MARC struggle with? Here's a list:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Systematically linking discussion of particular works with instances of those works&lt;/li&gt;&lt;li&gt;Systematically linking discussion of particular instances with those instances ("Was person X the transcriber of manuscript Y?")&lt;/li&gt;&lt;li&gt;Handling ambiguity ("This play may have been written by Shakespeare. It might also have been a later forgery by Francis Bacon, Christopher Marlowe or Edward de Vere")&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;All of these relate to core questions which have been studed in libraries for centuries. They're well understood issues, which changed little in the hundred years until the invention of the computer (which is when all the usually-cited issues with MARC began).&lt;br /&gt;&lt;br /&gt;The real question is why we're still expecting an approach that didn't solve the problems two hundred years ago to solve our problems now? Computers are not magic in this area they just seem to be helping us do the wrong things faster, more reliably and for larger collections.&lt;br /&gt;&lt;br /&gt;We need a new approach to bibliographic metadata, one which is not ontologically bound to little slips of paper. There are a whole range of different alternatives out there (including a bevy of RDF vocabularies), but I've yet to run into one which both allowed clear representation of existing data (because lets face it, I'm not going to re-enter worldcat, and neither are you, not in our lifetimes) and admitting non-card-based metadata as first class elements.&lt;br /&gt;&lt;br /&gt;&amp;lt;/rant&amp;gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-7555004382390805753?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/7555004382390805753/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=7555004382390805753' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7555004382390805753'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7555004382390805753'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/05/why-card-based-records-arent-good.html' title='Why card-based records aren&apos;t good enough'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8363114012937380185</id><published>2009-05-01T09:34:00.001+12:00</published><updated>2009-05-01T09:34:45.612+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='national library'/><title type='text'>LoC gets semantic</title><content type='html'>&lt;p&gt;This morning, the Library of Congress launched &lt;a mce_href="http://id.loc.gov/authorities/" href="http://id.loc.gov/authorities/"&gt;http://id.loc.gov/authorities/&lt;/a&gt;, their first serious entry into the semantic web. &lt;/p&gt;&lt;p&gt;The site makes the Library of Congress Subject Headings available as defererenable URLs. For example &lt;a mce_href="http://id.loc.gov/authorities/sh90005545" href="http://id.loc.gov/authorities/sh90005545"&gt;http://id.loc.gov/authorities/sh90005545&lt;/a&gt;.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8363114012937380185?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8363114012937380185/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8363114012937380185' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8363114012937380185'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8363114012937380185'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/05/loc-gets-semantic.html' title='LoC gets semantic'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1063739577637906320</id><published>2009-02-04T21:11:00.002+13:00</published><updated>2009-02-04T21:15:57.797+13:00</updated><title type='text'>NDHA demo and the National Library</title><content type='html'>This morning I went to the &lt;a href="ttp://www.natlib.govt.nz/about-us/current-initiatives/ndha"&gt;&lt;span class="misspell" suggestions="ND HA,ND-HA,DOHA,NADA,NEDA"&gt;NDHA&lt;/span&gt; demonstration&lt;/a&gt; where a National Library techie talked us through the &lt;span class="misspell" suggestions="ND HA,ND-HA,DOHA,NADA,NEDA"&gt;NDHA&lt;/span&gt; ingest tools. The tools are the most visible piece of the &lt;span class="misspell" suggestions="ND HA,ND-HA,DOHA,NADA,NEDA"&gt;NDHA&lt;/span&gt; infrastructure, and are designed to unify the ingest of digital documents, whether they are born-digital documents physically submitted (i.e. producers mail in &lt;span class="misspell" suggestions="Cd's,Cads,Cods,Cuds,CD"&gt;CDs&lt;/span&gt;/DVDs etc); born-digital documents electronically submitted (i.e. producers upload content via &lt;span class="misspell" suggestions="whizz,woozy,Izzy,Lizzy,dizzy"&gt;wizzy&lt;/span&gt; web tools); or digital scans of current holdings produced as part of the on-going digitisation efforts. The tools have a unified system with different &lt;span class="misspell" suggestions="work flows,work-flows,workforces,forkfuls,workforce"&gt;workflows&lt;/span&gt; for unpublished material (=archive) and published material (=&lt;span class="misspell" suggestions="library,literary,Libra,library's,Libras"&gt;libarary&lt;/span&gt;). The unification of library and archival functionality seemed like futile ground for miscommunication.&lt;br /&gt;&lt;br /&gt;The infrastructure is (correctly) embedded across the library, and uses all the current tools for collection maintenance, searching and access.&lt;br /&gt;&lt;br /&gt;As a whole it looks like the system is going to save time and money for large content producers and capture better metadata for small donors of content, which is great. By moving the capture of metadata closer to the source (while still allowing professional curatorial processes for selection and cataloguing), it looks like more context is going to be captured, which is fabulous.&lt;br /&gt;&lt;br /&gt;A couple of things struck me as odd:&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;The first feedback to the producer/&lt;span class="misspell" suggestions="up loader,up-loader,upload er,upload-er,upload"&gt;uploader&lt;/span&gt; is via a human. Despite having an elaborate validation suite, the user wasn't told immediately "that .doc file looks like a &lt;span class="misspell" suggestions="PD,PF,PDQ,PDT,PD'S"&gt;PDF&lt;/span&gt;, would you like to try again?" or "Yep, that *.&lt;span class="misspell" suggestions="XML,XL,ml"&gt;xml&lt;/span&gt; file is valid and conforming XML" Time and again studies have shown that immediate feedback to allow people to correct their mistakes immediately is important and effective. &lt;/li&gt;&lt;li&gt;The range of metadata fields available for tagging content was very limited. For example there was no &lt;a href="http://iwihapu.natlib.govt.nz/iwi-hapu/index.htm"&gt;&lt;span class="misspell" suggestions="WI,Ii,Kiwi,WWI,Iii"&gt;Iwi&lt;/span&gt;/&lt;span class="misspell" suggestions="Hap,Happy,Harpy,Harp,Heap"&gt;Hapu&lt;/span&gt; list&lt;/a&gt;, no &lt;a href="http://mshupoko.natlib.govt.nz/mshupoko/"&gt;Maori Subject Headings&lt;/a&gt;, no &lt;a title="New Zealand Gazetteer of Official Geographic Names" href="http://www.linz.govt.nz/placenames/find-names/nz-gazetteer-official-names/index.aspx" id="m5bk"&gt;Gazetteer of Official Geographic Names&lt;/a&gt;. When I asked about these I was told "that's the &lt;span class="misspell" suggestions="CM S's,CM-S's,Cm's,Cam's,Camus's"&gt;CMS's&lt;/span&gt; role" (=Collection Management Software, i.e. those would be added by professional cataloguers later), but if you're going to move the metadata collection to as close to content generation, it makes sense to at least have the option of proper authority control over it. &lt;/li&gt;&lt;/ol&gt;Or that's my take, anyway. Maybe I'm missing something.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1063739577637906320?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1063739577637906320/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1063739577637906320' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1063739577637906320'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1063739577637906320'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/02/ndha-demo-and-national-library.html' title='NDHA demo and the National Library'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8132649278280858203</id><published>2009-02-02T21:40:00.001+13:00</published><updated>2009-02-02T21:58:03.046+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='place names'/><category scheme='http://www.blogger.com/atom/ns#' term='NewZealand'/><category scheme='http://www.blogger.com/atom/ns#' term='national library'/><category scheme='http://www.blogger.com/atom/ns#' term='ndha'/><title type='text'>Report from the NDHA's International Perspectives on Digital Preservation</title><content type='html'>NOTE: I'm a computer scientist by training and this was largely librarian/archivist gig, so it's entirely possibly I've got the wrong end of the stick on one or more point in the summary below. It's also my own summary, and not the position of my employer, even though I was on work time during the event.&lt;br /&gt;&lt;br /&gt;The NDHA is about to announce that the &lt;a title="NDHA project" href="http://www.natlib.govt.nz/about-us/current-initiatives/ndha" id="ri8g"&gt;NDHA project&lt;/a&gt; has been completed on time and under budget. This is particularly pleasing in light of the poor history of government IT failures over the course of the last 30 years and a tribute to all concerned. Indeed, when I was taking undergraduate courses in software engineering a contemporary national library project was used as a text-book example of how not to run a software development undertaking. It's good to see how far they've come.&lt;br /&gt;&lt;br /&gt;The &lt;a title="event itself was a one-day event in the national library auditorium" href="http://ndha-wiki.natlib.govt.nz/ndha/pages/InternationalPerspectives" id="fp6o"&gt;event itself was a one-day event in the national library auditorium&lt;/a&gt;, with a handful of overseas speakers. I'm not entirely certain that a handful of foreigners counts as "international," but maybe that's just me being a snob. Certainly there was a fine turn-out of locals, including many from the &lt;a title="National Library" href="http://www.natlib.govt.nz/" id="txk3"&gt;National Library&lt;/a&gt;, the &lt;a title="Ministry of Culture and Heritage" href="http://www.mch.govt.nz/" id="y1r3"&gt;Ministry of Culture and Heritage&lt;/a&gt; and from &lt;a title="VUW" href="http://www.victoria.ac.nz/" id="q70l"&gt;VUW&lt;/a&gt;, including a number of students, who couldn't possibly have been there for the free food.&lt;br /&gt;&lt;br /&gt;There seemed to be an underlying tension between librarianship and archivistship running through the event. I see this as being a really crazy turfwar, personally, since I see the chances of libraries and archives existing as separate entities and disciplines in fifty years seems pretty slim. The separation between the two, the "uniqueness" of objects in an archive seems to be to be obliterated by the free-duplication of digital objects. I've heard people say that archives also work access controls and embargoes for their depositors, but then so can libraries, particularly those in the military and those working with classified documents.&lt;br /&gt;&lt;br /&gt;It seemed to me that the word &lt;span style="font-style: italic;"&gt;"reliability"&lt;/span&gt; was used in a confusing number different of ways by different people. Without naming the guilty parties:&lt;br /&gt;&lt;ol&gt;&lt;li&gt; reliability as the truthfulness of the documents in the library/archive. This is the old problem of ingestors having to determine the absolute veracity of documents&lt;/li&gt;&lt;li&gt; reliability as getting the same metadata every time. This seems odd to me, since systems with audit control give _different_ results every time, because information on the previous accesses is included in the metadata of subsequent accesses&lt;br /&gt;&lt;/li&gt;&lt;li&gt; reliability as the degree to which the system conformed to a standard/specification&lt;/li&gt;&lt;/ol&gt;On reflection this may have been a symptom of the different vocabulary used by librarians and archivists. Whatever the cause, if we're wanting to spend public money, we have to be able to explain to the public what we're doing, and this isn't helping.&lt;br /&gt;&lt;br /&gt;The organisers told us the presentations would be up by tonight (the evening of the presentation), but you won't find them on google if you go looking, because they tell google to &lt;a title="please fuck off" href="http://ndha-wiki.natlib.govt.nz/robots.txt" id="ye-y"&gt;please f**k off&lt;/a&gt;. I guess this is what someone was referring to when they said we had to work to make content accessible to google. The link is &lt;a title="http://ndha-wiki.natlib.govt.nz/ndha/pages/IPoDP%2009%20Presentations" href="http://ndha-wiki.natlib.govt.nz/ndha/pages/IPoDP%2009%20Presentations" id="hiu."&gt;http://ndha-wiki.natlib.govt.nz/ndha/pages/IPoDP%2009%20Presentations&lt;/a&gt; and most were up at the time of writing.&lt;br /&gt;&lt;br /&gt;I was hugely encouraged by the number of pieces of software that seemed to be being open sourced, as I see this as being a much better economic model than paying vendors for custom software, particularly since it's potentially scalable out from the national and top-tier libraries/archives/museums out to the second and third tier libraries/archives/museums, which by dint of their much larger numbers actually serve the most users and have the most content. It was unfortunate that the national library hasn't looked beyond propriety software for non-specialist software but continues to use AbodePhotoshop / Microsoft Windows, which are available only for limited periods of time on certain platforms (which will inevitably become obsolete), rather than &lt;a title="openoffice" href="http://www.openoffice.org/" id="f2nu"&gt;openoffice&lt;/a&gt;, &lt;a title="GIMP" href="http://www.gimp.org/" id="s:yg"&gt;GIMP&lt;/a&gt;, etc, which are cross platform and licensed under perpetual licences which include the right to port the software from one platform to another. I guessPhotoshop / Windows is what their clients and funders know and use.&lt;br /&gt;&lt;br /&gt;With a number of participants I had conversations about preservation. Andrew Wilson in his presentation used the quote:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;“traditionally, preserving things meant keeping them unchanged; however our digital environment has fundamentally changed our concept of preservation requirements. If we hold on to digital information without modifications, accessing the information will become increasingly difficult, if not impossible”&lt;/span&gt; &lt;a title="Su-Sing Chen, “The Paradox of Digital Preservation”, Computer, March 2001, 2-6" href="http://portal.acm.org/citation.cfm?id=621669" id="pt5y"&gt;Su-Sing Chen, “The Paradox of Digital Preservation”, Computer, March 2001, 2-6&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;If you think about what intellectual objects we have from the Greeks (which is were us Westerners traditionally trace our intellectual history from), the majority fall into two main classes: (a) art works, which have survived primarily through roman copies and (b) texts, which have survived by copying, including a body of mathematics which were kept alive in the Arabic translation during a period when we Westerners were burning the works in Latin and Greek and claiming that the bible was the only book we needed. I'll grant you that a high-quality book will last maybe 500 years in a controlled environment, maybe even 1000, but for real permanence, you just can't get past physical ubiquity. If we have things truly worthy of long-term preservation, we should be striking deals with the Warehouse to get them into every home in the country, and setting them as translation exercises in our language learning courses.&lt;br /&gt;&lt;br /&gt;I had some excellent conversations with other participants at the event, including Phillipa Tocker from  &lt;a title="Museums Aotearoa ~ Te Tari o Nga Whare Taonga o te Motu" href="http://www.museums-aotearoa.org.nz/" id="b41e"&gt;Museums Aotearoa / Te Tari o Nga Whare Taonga o te Motu&lt;/a&gt; who told me about the &lt;a title="http://www.nzmuseums.co.nz/" href="http://www.nzmuseums.co.nz/" id="wnst"&gt;http://www.nzmuseums.co.nz/&lt;/a&gt; site they put together for their members.&lt;br /&gt;&lt;br /&gt;Looking at the site I'm struck by how similar the search functionality is to &lt;a title="http://www.nram.org.nz/" href="http://www.nram.org.nz/" id="p_nx"&gt;http://www.nram.org.nz/&lt;/a&gt;. I'm not sure whether their relative similarity is a good thing (because it enables non-experts to search the holdings) or a bad thing (because by lowering themselves to the lowest common denominator they've devalued their uniqueness). While I'm certain that these websites have vital roles in the museums and archives community respectively, I can't help but feel that from an end-users perspective have two sites rather than one seems redundant, and the fact that they don't seem to reference/suggest any other information sources doesn't help. I can't imagine a librarian/archivist not being forth-coming with a suggestion of where to look next if they've run out of local relevant content---why should our websites be any different?&lt;br /&gt;&lt;br /&gt;I recently changed the NZETC &lt;a title="to point to likely-relavent memory institutions when a search returns no results" href="http://www.nzetc.org/tm/scholarly/search/search.html?text=asdf&amp;amp;submit=Search" id="fyby"&gt;to point to likely-relevant memory institutions when a search returns no results&lt;/a&gt; (or when a user pages through to the end of any list of results).&lt;br /&gt;&lt;br /&gt;I also talked to some chaps from Te Papa about the metadata they're using to to represent places names (&lt;a href="http://www.getty.edu/research/conducting_research/vocabularies/tgn/"&gt;Getty Thesaurus of Geographic Names&lt;/a&gt;) and species names (ad-hoc). At the NZETC we have many place names marked up (in NZ, Europe and the Pacific), but are not currently syncing with an external authority. Doing so would hugely enable interoperability. Ideally we'd be using the shiny new &lt;a title="New Zealand Gazetteer of Official Geographic Names" href="http://www.linz.govt.nz/placenames/find-names/nz-gazetteer-official-names/index.aspx" id="m5bk"&gt;New Zealand Gazetteer of Official Geographic Names&lt;/a&gt;, but it doesn't yet have enough of the places we need (it basically only covers places mentioned in legislation or treaty settlements). It does have macrons in all the right places though, which is an excellent start. We currently don't mark up species names, but would like to, and again an external authority would be great.&lt;br /&gt;&lt;br /&gt;It might have been useful if the day had included an overview of what the NDHA actually was and what had been achieved (maybe I missed this?).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8132649278280858203?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8132649278280858203/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8132649278280858203' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8132649278280858203'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8132649278280858203'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/02/report-from-ndhas-international.html' title='Report from the NDHA&apos;s International Perspectives on Digital Preservation'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2449104320816804219</id><published>2009-02-01T11:38:00.004+13:00</published><updated>2009-02-01T12:14:00.663+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='photo'/><category scheme='http://www.blogger.com/atom/ns#' term='creative commons'/><category scheme='http://www.blogger.com/atom/ns#' term='flickr'/><category scheme='http://www.blogger.com/atom/ns#' term='national library'/><title type='text'>flickr promoting the commons / creative commons</title><content type='html'>&lt;a href="http://www.flickr.com/"&gt;flickr&lt;/a&gt; is promoting photos in what it calls "&lt;a href="http://www.flickr.com/commons/"&gt;The Commons&lt;/a&gt;", but only to logged in users. Normal users (who can't comment / tag the photos in the commons anyway) don't get an obvious link to them (except via about a billion third party sites such as blogs and google search). The page also shows how the national library's choice not to include text in their logo has come up trumps.&lt;br /&gt;&lt;br /&gt;The confusingly similarly named "&lt;a href="http://www.flickr.com/commons/"&gt;The Commons&lt;/a&gt;" and "&lt;a href="http://www.flickr.com/creativecommons/"&gt;Creative Commons&lt;/a&gt;" parts of the website apparently don't reference each other. Odd.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2449104320816804219?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2449104320816804219/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2449104320816804219' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2449104320816804219'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2449104320816804219'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/02/flickr-promoting-and-commons.html' title='flickr promoting the commons / creative commons'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1318819464157586254</id><published>2009-01-09T21:00:00.002+13:00</published><updated>2009-01-09T21:42:47.398+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NewZealand'/><category scheme='http://www.blogger.com/atom/ns#' term='LINZ'/><category scheme='http://www.blogger.com/atom/ns#' term='GIS'/><category scheme='http://www.blogger.com/atom/ns#' term='nzetc'/><category scheme='http://www.blogger.com/atom/ns#' term='New Zealand'/><title type='text'>Excellent stuff from New Zealand Geographic Board Ngā Pou Taunaha o Aotearoa</title><content type='html'>A while ago, motivated by the need for an authoritative list of New Zealand place names for our with at the &lt;a href="http://www.nzetc.org/" id="muw9" title="NZETC"&gt;NZETC&lt;/a&gt;, I criticised the NZGB fairly roundly.&lt;br /&gt;While they haven't produced what I/we want/need, in the last couple of months they've made huge progress in an unambiguously right direction.&lt;br /&gt;   Their primary work is the &lt;a href="http://www.linz.govt.nz/placenames/find-names/nz-gazetteer-official-names/index.aspx" id="l465" title="New Zealand Gazetteer of Official Geographic Names"&gt;New Zealand Gazetteer of Official Geographic Names&lt;/a&gt;, a list of all official place names in New Zealand. It uses have a peculiar definition of "official" (= mentioned in legislation or a Treaty of Waitangi settlement), they have very few names of inhabited places (and no linking with the much larger ones maintained by official bodies such as the police and fire service), They have no elevation data for mountains and pass (which are defined by their height) and they define some things as points when they appear to be areas (such as Arthur Pass National Park), but it's &lt;b&gt;much&lt;/b&gt; better than the &lt;a href="http://www.linz.govt.nz/placenames/find-names/topographic-names-db/index.aspx" id="z5t-" title="New Zealand Place Names Database"&gt;New Zealand Place Names Database&lt;/a&gt; since:&lt;br /&gt; &lt;ol&gt;&lt;li&gt;  It has a statutory reference for every place, given the source of the officialness of the name  &lt;/li&gt;&lt;li&gt;  It fully support Macrons  &lt;/li&gt;&lt;li&gt; It has a machine readable-list of DoC administered lands --- I can imagine this being used for all sorts of interesting things, getting people out in other scenic and marine reserves. &lt;/li&gt;&lt;/ol&gt;    NZGB sent around an email in which they explicitly addressed some of the points &lt;a href="http://opensourceexile.blogspot.com/2008/10/place-name-inconsistencies.html" id="hbqz" title="I'd earlier raised"&gt;I'd earlier raised&lt;/a&gt; (I'm sure I wasn't the only one):&lt;br /&gt; &lt;div style="margin-left: 40px;"&gt; It should be noted that some of the naming practices of the past will have to be lived with, despite inconsistencies.  Moving forward, the rules of nomenclature followed by the NZGB are designed to promote standardisation, consistency, and non-ambiguity.  The modern format for dual names is '&amp;lt;Maori name&amp;gt; / &amp;lt;non-Maori name', which the NZGB has applied for the past 10 years, though Treaty settlement dual names sometimes deviate from this convention, because the decision is ultimately made by the Minister for Treaty of Waitangi Negotiations.  Older forms of dual names, with brackets, will remain depicted as such until changed through the statutory processes of the NZGB Act 2008.  These are not generally regarded as alternative names.&lt;br /&gt;Macrons in Maori names have posed problems for electronic databases.  Nevertheless they are part of the orthography, recommended by the Maori Language Commission, and the Board endorses their use.  The Gazetteer will include macrons where they are formalised as part of the official name.  When Section 32 of the new Act comes into force, official documents will be required to show official names, and these will need to include macrons where they have been included as part of the official name (unless the proviso is used).  A list of those official names which have macrons is at &lt;a href="http://www.linz.govt.nz/placenames/researching-place-names/macrons/index.aspx" target="_blank"&gt;http://www.linz.govt.nz/&lt;wbr&gt;placenames/researching-place-&lt;wbr&gt;names/macrons/index.aspx&lt;/a&gt; .  LINZ's Customer Services has some solutions for showing macrons in LINZ's own databases and on published maps and charts, and is currently investigating how bulk data extracts might include information about macrons, for the customer's benefit.&lt;br /&gt; &lt;/div&gt;   Despite the name, it isn't clear in my mind exactly what's official and what isn't. Is the content of the "coordinates" column official? For railway lines this is a reference to the description, which in the cases of railways is usually of the form "From X to Y", where X and Y are place names, frequently place names that aren't on the list, so are thus presumably not official. Unless I'm going blind there is also no indication of accuracy on the physical measurements.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1318819464157586254?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1318819464157586254/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1318819464157586254' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1318819464157586254'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1318819464157586254'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2009/01/excellent-stuff-from-new-zealand.html' title='Excellent stuff from New Zealand Geographic Board Ngā Pou Taunaha o Aotearoa'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2164023884190396711</id><published>2008-10-09T01:04:00.002+13:00</published><updated>2008-10-09T01:08:56.523+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='social network'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='nzetc'/><category scheme='http://www.blogger.com/atom/ns#' term='topic maps'/><category scheme='http://www.blogger.com/atom/ns#' term='bookmarks'/><title type='text'>fuzzziness</title><content type='html'>I've been using &lt;a href="http://en.wikipedia.org/wiki/Topic_Maps"&gt;topic maps&lt;/a&gt; in my day job, so I decided to try out &lt;a href="http://www.fuzzzy.com/"&gt;http://www.fuzzzy.com/&lt;/a&gt;, a social bookmark engine that uses an underlying topic map engine.&lt;br /&gt;I tried to approach fuzzzy with an open mind, but the increasingly stumbling on &lt;span style="font-weight: bold;"&gt;really&lt;/span&gt; annoying (mis-)features.&lt;br /&gt;&lt;ol&gt;&lt;li&gt;This is the first bookmark engine I've ever used hat doesn't let users migrate their bookmarks with them. This is perhaps the biggest single feature  fuzzzy could add to attract new users, since it seems that most people who're likely to use a bookmark engine have already played with another one long enough to have dozens or hundreds of bookmarks they'd like to bring with them. I know this is non-ideal from the point of view of the social bookmark engine they're migrating too, since it makes it hard to do things completely differently, but users have baggage.&lt;/li&gt;&lt;li&gt;While it'd possible to vote up or vote down just about everything (bookmarks, tags, bookmark-tags, users, etc), very little is actually done with these votes. If I've viewed a bookmark once and voted it down, why is it added to my "most used Bookmarks"? Surely if I've indicated I don't like it the bookmark should be hidden from me, not advertised to me.&lt;/li&gt;&lt;li&gt;For all the topic map goodness on the site, there is no obvious way to link from the fuzzzy topic map to other topic maps.&lt;/li&gt;&lt;li&gt;There doesn't seem to be much in the way of interfacing with other semantic web standards (i.e. RDF).&lt;br /&gt;&lt;/li&gt;&lt;li&gt;The help isn't. Admittedly this may be partly because many of the key participants have English as a second language.&lt;/li&gt;&lt;li&gt;There's a spam problem. But then everywhere has a spam problem.&lt;/li&gt;&lt;li&gt;It's not obvious that I can export my bookmarks out of fuzzzy in a form that any other bookmark engine understands.&lt;/li&gt;&lt;/ol&gt;These (mis-)features are a pity, because at &lt;a href="http://www.nzetc.org/"&gt;NZETC&lt;/a&gt; we use topic maps for authority (in the librarianship sense), and it would be great to have a compatible third party that could be used for non-authoritative stuff and which would just work seamlessly.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2164023884190396711?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2164023884190396711/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2164023884190396711' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2164023884190396711'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2164023884190396711'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/10/fuzzziness.html' title='fuzzziness'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2104369756756501635</id><published>2008-10-05T12:41:00.005+13:00</published><updated>2008-10-05T20:14:10.900+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='place names'/><category scheme='http://www.blogger.com/atom/ns#' term='macrons'/><category scheme='http://www.blogger.com/atom/ns#' term='authority'/><category scheme='http://www.blogger.com/atom/ns#' term='LINZ'/><category scheme='http://www.blogger.com/atom/ns#' term='maori'/><category scheme='http://www.blogger.com/atom/ns#' term='Māori'/><category scheme='http://www.blogger.com/atom/ns#' term='nzetc'/><category scheme='http://www.blogger.com/atom/ns#' term='New Zealand'/><title type='text'>Place name inconsistencies</title><content type='html'>I've been looking at the &lt;a href="http://www.linz.govt.nz/placenames/search/place-names-dataset-download/index.aspx"&gt;"Dataset of New Zealand Geographic Place Names"&lt;/a&gt; from LINZ. This appears to be as close as New Zealand comes to an Official list of place names. I've been looking because it would be great to use as an authority in the &lt;a href="http://authority.nzetc.org/"&gt;NZETC&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Coming to the data I was aware of a number of issues:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Unlike most geographical data users, I'm primarily interested in the names rather than the relative positions&lt;/li&gt;&lt;li&gt;New Zealand is currently going through an extended period of renaming of geographic features to their original Māori names&lt;/li&gt;&lt;li&gt;The names in the dataset are primarily map labels and are subject to cartographic licence&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;What I didn't expect was the insanity in the names. I know that there are some good historical reasons for this insanity, but that doesn't make it any less insane.&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Names can differ only by punctuation. There is a "No. 1 Creek" and a "No 1 Creek".&lt;/li&gt;&lt;li&gt;Names can differ only by presentation. There is a "Crook Burn or 8 Mile Creek", an "Eight Mile Creek or Boundary Creek" and an "Eight Mile Creek" (each in a different province).&lt;/li&gt;&lt;li&gt;There is no consistent presentation of alternative names. There is "Saddle (Mangaawai) Bivouac", "Te Towaka Bay (Burnside Bay)", "Queen Charlotte Sound (Totaranui)", "Manawatawhi/Three Kings Islands", "Mount Hauruia/Bald Rock", "Crook Burn or 8 Mile Creek" and "Omere, Janus or Toby Rock"&lt;/li&gt;&lt;li&gt;There is no machine-readable source of the Māori place names with macrons, and the &lt;a href="http://www.linz.govt.nz/placenames/researching-place-names/macrons/index.aspx"&gt;human readable version&lt;/a&gt; has contains subtle difference to the machine-readable database (which contains no non-ASCII characters). For example "Franz Josef Glacier/Kā Roimata o Hine Hukatere (Glacier)" and "Franz Josef Glacier/Ka Roimata o Hine Hukatere" differ by more than the macrons. There appears to be no information on which are authoritative.&lt;/li&gt;&lt;/ol&gt;Right now I'm find finding this rather frustrating.&lt;br /&gt;&lt;br /&gt;(grammar edit)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2104369756756501635?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2104369756756501635/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2104369756756501635' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2104369756756501635'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2104369756756501635'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/10/place-name-inconsistencies.html' title='Place name inconsistencies'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-6537330597185944727</id><published>2008-09-02T21:10:00.006+12:00</published><updated>2008-09-03T11:23:29.159+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sql'/><category scheme='http://www.blogger.com/atom/ns#' term='awk'/><category scheme='http://www.blogger.com/atom/ns#' term='GIS'/><category scheme='http://www.blogger.com/atom/ns#' term='sqlite'/><category scheme='http://www.blogger.com/atom/ns#' term='xml'/><category scheme='http://www.blogger.com/atom/ns#' term='New Zealand'/><category scheme='http://www.blogger.com/atom/ns#' term='script'/><title type='text'>Does anyone publish the Dataset of New Zealand Geographic Place Names already in XML form?</title><content type='html'>&lt;p&gt;I've been playing with the &lt;a href="http://www.linz.govt.nz/placenames/search/place-names-dataset-download/index.aspx"&gt;Dataset of New Zealand Geographic Place Names&lt;/a&gt; which is a set of CSV files published by Toitū te whenua / Land Information New Zealand (LINZ). The data takes quite a bit of massaging, and I was wondering whether anyone else had already done the work of making acceptable XML out of the data rather than doing all the work myself.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;I've attached the script I have so far, but it's not perfect. In particular:&lt;/p&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;It doesn't include place names with Macrons&lt;br /&gt;&lt;/li&gt;&lt;li&gt;It makes lots of ASCII-type assumptions &lt;br /&gt;&lt;/li&gt;&lt;li&gt;Many of the element names are poorly named and map non-obviously to fields in the CSV files.&lt;/li&gt;&lt;li&gt;The script isn't very generic and does little or no checking&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;&lt;p&gt;Anyway, here's he script, hopefully it's successfully escaped. The basics are that it creates an sqlite database and streams the CSV files into it direct from the zip (which it expects to have been downloaded into the current directory). It then streams each point out using awk to transform it to XML.&lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;pre&gt;&lt;br /&gt;#!/bin/bash&lt;br /&gt;# script to import data from&lt;br /&gt;# http://www.linz.govt.nz/placenames/search/place-names-dataset-download/index.aspx&lt;br /&gt;# into an XML file.&lt;br /&gt;# this script licensed under the GPL/BSD/Apache 2 licences&lt;br /&gt;&lt;br /&gt;echo \(re\)creating the database, expect DROP errors the first time you run this &lt;br /&gt;sqlite nzgeonames.db &amp;lt;&amp;lt; EOF&lt;br /&gt;DROP TABLE name;&lt;br /&gt;CREATE TABLE  name (id, name, east, north, pdescription, district, sheet, lat, long);&lt;br /&gt;&lt;br /&gt;DROP TABLE district;&lt;br /&gt;CREATE TABLE  district (district, description);&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;DROP TABLE pdescription;&lt;br /&gt;CREATE TABLE  pdescription (pdescription, short, description);&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;DROP TABLE sheet;&lt;br /&gt;CREATE TABLE  sheet (edition, map, sheet);&lt;br /&gt;&lt;br /&gt;VACUUM;&lt;br /&gt;EOF&lt;br /&gt;&lt;br /&gt;echo importing the names&lt;br /&gt;unzip -p nznames_6Aug08.zip namedata.txt  | sed 's/\r//'  |  sed 's/`/","/g' | awk -F^ '{print "INSERT INTO name VALUES (\"" $0 "\");"}' | sqlite nzgeonames.db &lt;br /&gt;&lt;br /&gt;echo importing the districts&lt;br /&gt;unzip -p nznames_6Aug08.zip landdist.txt  | sed 's/\r//'  |  sed 's/`/","/g' | awk -F^ '{print "INSERT INTO district VALUES (\"" $0 "\");"}' | sqlite nzgeonames.db&lt;br /&gt;&lt;br /&gt;echo importing the point descriptions \(expect two lines of errors\) &lt;br /&gt;unzip -p nznames_6Aug08.zip pointdes.txt  | sed 's/\r//'  |  sed 's/`/","/g' | awk -F^ '{print "INSERT INTO pdescription VALUES (\"" $0 "\");"}' | sed 's/:/","/' | sqlite nzgeonames.db&lt;br /&gt;&lt;br /&gt;echo importing the sheet names&lt;br /&gt;unzip -p nznames_6Aug08.zip sheetnam.txt  | sed 's/\r//'  |  sed 's/`/","/g' | awk -F^ '{print "INSERT INTO sheet VALUES (\"" $0 "\");"}' | sqlite nzgeonames.db &lt;br /&gt;&lt;br /&gt;&lt;br /&gt;# pick up the ugly duckling&lt;br /&gt;sqlite nzgeonames &amp;lt;&amp;lt; EOF&lt;br /&gt; INSERT INTO pdescription VALUES ("MRFM","MARINE ROCK FORMATION","Marine Rock Formation");&lt;br /&gt;EOF&lt;br /&gt;&lt;br /&gt;echo exporting points as xml&lt;br /&gt;echo "&amp;lt;document source=\"Sourced from Land Information New Zealand, [date]. Crown copyright reserved.\"&amp;gt;" &amp;gt; nzgeonames.xml&lt;br /&gt;sqlite nzgeonames.db "SELECT name.id, name.name, name.east, name.north, name.pdescription, name.district, name.sheet, name.lat, name.long, district.description, pdescription.short, pdescription.description AS descriptionA, sheet.edition, sheet.map FROM name, district, pdescription, sheet WHERE name.district = district.district AND name.pdescription = pdescription.pdescription AND name.sheet = sheet.sheet;" | awk -F\| '{print "&amp;lt;point&amp;gt;&amp;lt;id&amp;gt;" $1 "&amp;lt;/id&amp;gt;&amp;lt;name&amp;gt;" $2 "&amp;lt;/name&amp;gt;&amp;lt;east&amp;gt;" $3 "&amp;lt;/east&amp;gt;&amp;lt;north&amp;gt;" $4 "&amp;lt;/north&amp;gt;&amp;lt;pdescription&amp;gt;" $5 "&amp;lt;/pdescription&amp;gt;&amp;lt;district&amp;gt;" $6 "&amp;lt;/district&amp;gt;&amp;lt;sheet&amp;gt;" $7 "&amp;lt;/sheet&amp;gt;&amp;lt;lat&amp;gt;" $8 "&amp;lt;/lat&amp;gt;&amp;lt;long&amp;gt;" $9 "&amp;lt;/long&amp;gt;&amp;lt;description&amp;gt;" $10 "&amp;lt;/description&amp;gt;&amp;lt;short&amp;gt;" $11 "&amp;lt;/short&amp;gt;&amp;lt;descriptionA&amp;gt;" $12 "&amp;lt;/descriptionA&amp;gt; &amp;lt;edition&amp;gt;" $13 "&amp;lt;/edition&amp;gt; &amp;lt;map&amp;gt;" $14 "&amp;lt;/map&amp;gt;  &amp;lt;/point&amp;gt;"}' | sed 's/&amp;amp;/&amp;amp;amp;/' &amp;gt;&amp;gt; nzgeonames.xml &lt;br /&gt;echo "&amp;lt;/document&amp;gt;" &amp;gt;&amp;gt; nzgeonames.xml&lt;br /&gt;&lt;br /&gt;echo formatting the points nicely&lt;br /&gt;xmllint --format nzgeonames.xml  &amp;gt; nzgeonames-formatted.xml&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-6537330597185944727?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/6537330597185944727/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=6537330597185944727' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6537330597185944727'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6537330597185944727'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/09/does-anyone-publish-dataset-of-new.html' title='Does anyone publish the Dataset of New Zealand Geographic Place Names already in XML form?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2791786781570124104</id><published>2008-09-02T18:08:00.002+12:00</published><updated>2008-09-02T18:09:28.866+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='flickr'/><category scheme='http://www.blogger.com/atom/ns#' term='folksomony'/><title type='text'>Library of Congress flickr experiment</title><content type='html'>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt;&lt;a href="http://www.flickr.com/photos/library_of_congress/2809105456/" title="photo sharing"&gt;&lt;img src="http://farm4.static.flickr.com/3261/2809105456_8f548edd01_m.jpg" alt="" style="border: solid 2px #000000;" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="font-size: 0.9em; margin-top: 0px;"&gt;&lt;a href="http://www.flickr.com/photos/library_of_congress/2809105456/"&gt;Native troops, Cuba (LOC)&lt;/a&gt;&lt;br /&gt;Originally uploaded by &lt;a href="http://www.flickr.com/people/library_of_congress/"&gt;The Library of Congress&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;&lt;p&gt;While processing the photos from my parent's ruby wedding anniversary, I ran into the Library of Congress's &lt;a href="http://www.loc.gov/rr/print/flickr_pilot_faq.html"&gt;flickr experiment&lt;/a&gt;. &lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;I probably shouldn't have been, but I was astounded. It looks the the bastion of old-school cataloguing is coming to bathe in the fountain of social tagging. &lt;/p&gt;&lt;br /&gt;&lt;br /&gt;&lt;p&gt;This is part of a larger effort described at &lt;a href="http://www.flickr.com/commons/"&gt;http://www.flickr.com/commons/&lt;/a&gt;&lt;/p&gt;&lt;br clear="all" /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2791786781570124104?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2791786781570124104/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2791786781570124104' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2791786781570124104'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2791786781570124104'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/09/library-of-congress-flickr-experiment.html' title='Library of Congress flickr experiment'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://farm4.static.flickr.com/3261/2809105456_8f548edd01_t.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-7760890155601720775</id><published>2008-08-26T22:14:00.003+12:00</published><updated>2008-09-05T15:51:49.139+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='xml'/><category scheme='http://www.blogger.com/atom/ns#' term='xslt'/><title type='text'>Saxon joy!</title><content type='html'>I've just moved to &lt;a href="http://saxon.sourceforge.net/"&gt;saxon&lt;/a&gt; from &lt;a href="http://xmlsoft.org/"&gt;libxml&lt;/a&gt; for some XSLT stuff I'm doing, and I'm really loving it.&lt;br /&gt;&lt;br /&gt;Not only does saxon run take much less memory, it also speaks &lt;a href="http://www.w3.org/TR/xslt20/"&gt;XSLT 2.0&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-7760890155601720775?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/7760890155601720775/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=7760890155601720775' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7760890155601720775'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/7760890155601720775'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/08/saxon-joy.html' title='Saxon joy!'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-388845421985784887</id><published>2008-08-10T08:42:00.001+12:00</published><updated>2008-08-10T08:42:10.695+12:00</updated><title type='text'>kowhai flowers at waikanae</title><content type='html'>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt;&lt;a href="http://www.flickr.com/photos/stuartyeates/2745645669/" title="photo sharing"&gt;&lt;img src="http://farm4.static.flickr.com/3156/2745645669_7f412e85fb_m.jpg" alt="" style="border: solid 2px #000000;" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="font-size: 0.9em; margin-top: 0px;"&gt;&lt;a href="http://www.flickr.com/photos/stuartyeates/2745645669/"&gt;kowhai flowers at waikanae&lt;/a&gt;&lt;br /&gt;Originally uploaded by &lt;a href="http://www.flickr.com/people/stuartyeates/"&gt;Stuart Yeates&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;&lt;br clear="all" /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-388845421985784887?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/388845421985784887/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=388845421985784887' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/388845421985784887'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/388845421985784887'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/08/kowhai-flowers-at-waikanae.html' title='kowhai flowers at waikanae'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://farm4.static.flickr.com/3156/2745645669_7f412e85fb_t.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1310698480560509081</id><published>2008-08-05T21:54:00.005+12:00</published><updated>2008-08-06T18:46:46.794+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='usability'/><category scheme='http://www.blogger.com/atom/ns#' term='google reader'/><category scheme='http://www.blogger.com/atom/ns#' term='RSS'/><category scheme='http://www.blogger.com/atom/ns#' term='bloglines'/><category scheme='http://www.blogger.com/atom/ns#' term='OPML'/><title type='text'>moving back to google reader from bloglines</title><content type='html'>A couple of months ago I migrated to &lt;a href="http://www.bloglines.com"&gt;bloglines&lt;/a&gt; from google reader, not because I was necessarily unhappy with google reader, but because I was interested in seeing what else was available and how it might differ. I've just moved back to google reader.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://en.wikipedia.org/wiki/OPML"&gt;OMPL&lt;/a&gt; just worked. I was able to move my RSS "reading list" from google reader to bloglines and back again with no fuss, no hassle and no duplication.&lt;br /&gt;&lt;br /&gt;The advantages of google reader over bloglines are:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;AJAX - whereas bloglines marks all items on a page as read when you browse to it, google reader marks them as read when you scroll past them. &lt;/li&gt;&lt;li&gt;Ordering - google entwines items from all feeds in time order, bloglines presents items feed by feed&lt;/li&gt;&lt;li&gt;Better integration with other services &lt;/li&gt;&lt;/ol&gt;The advantages of bloglines over google reader are:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Fast scanning of voluminous feeds&lt;/li&gt;&lt;li&gt;Fast browsing (it seems _much_ faster when there are thousands of items)&lt;/li&gt;&lt;li&gt;Less integration with other services&lt;/li&gt;&lt;/ol&gt;You'll notice that better integration is both a positive and a negative.&lt;br /&gt;&lt;br /&gt;The fact that I have several google accounts and and only one of them is tied to my RSS reading means that there are tasks I can't multi-task between, even at the coarsest of levels and also means that contacts from the google account almost never get forwarded articles I discover via RSS.&lt;br /&gt;&lt;br /&gt;The fact that my blogger.com account and my google reader accounts magically know about each other is great, as is being able to sign in once to a whole suite of tools.&lt;br /&gt;&lt;br /&gt;In the end the reason for changing back was ordering. I read too many RSS feeds that cover the same topic for reading them out of order to make sense.&lt;br /&gt;&lt;br /&gt;I've also just culled some of my RSS feeds, with the a prime criterion being the quality of their RSS. A number of web comics require one to click a link to read the strip and I no longer read them, but I still read &lt;a href="http://www.unshelved.com/"&gt;Unshelved&lt;/a&gt;, which has the strip (and an ad) in the RSS.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1310698480560509081?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1310698480560509081/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1310698480560509081' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1310698480560509081'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1310698480560509081'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/08/moving-back-to-google-reader-from.html' title='moving back to google reader from bloglines'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8897584299215685706</id><published>2008-08-04T21:45:00.002+12:00</published><updated>2008-08-05T09:29:27.259+12:00</updated><title type='text'>Decent editor for blogger.com?</title><content type='html'>Can someone recommend a decent replacement for the default editor for blogger.com?&lt;br /&gt;&lt;br /&gt;Before it drives me insane...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8897584299215685706?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8897584299215685706/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8897584299215685706' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8897584299215685706'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8897584299215685706'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/08/decent-editor-for-bloggercom.html' title='Decent editor for blogger.com?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8581938736898020099</id><published>2008-08-04T19:55:00.004+12:00</published><updated>2008-08-04T20:16:12.464+12:00</updated><title type='text'>KDE/Gnome Māori localisation on the rocks?</title><content type='html'>&lt;span style="color: rgb(0, 0, 0);font-family:times new roman;font-size:100%;" id="sydl0"  &gt;&lt;/span&gt;It looks like Maori localisation has been removed from the KDE 4.0 repository:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;font-family:courier new;" &gt;stuartyeates@stuartyeates:~/tmp/mi$ svn co svn://anonsvn.kde.org/home/kde/trunk/l10n-kde4/mi/messages&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;font-family:courier new;" &gt;svn: URL 'svn://anonsvn.kde.org/home/kde/trunk/l10n-kde4/mi/messages' doesn't exist&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;font-family:courier new;" &gt;stuartyeates@stuartyeates:~/tmp/mi$ svn co svn://anonsvn.kde.org/home/kde/trunk/l10n-kde4/mi/docmessages&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;font-family:courier new;" &gt;svn: URL 'svn://anonsvn.kde.org/home/kde/trunk/l10n-kde4/mi/docmessages' doesn't exist&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;font-family:courier new;" &gt;stuartyeates@stuartyeates:~/tmp/mi$ svn co svn://anonsvn.kde.org/home/kde/branches/stable/l10n-kde4/mi/messages&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;font-family:courier new;" &gt;svn: URL 'svn://anonsvn.kde.org/home/kde/branches/stable/l10n-kde4/mi/messages' doesn't exist&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Things don't look good for the upcoming 4.* releases, with the stats for translation at 0%: &lt;a href="http://l10n.kde.org/stats/gui/trunk-kde4/team/"&gt;http://l10n.kde.org/stats/gui/trunk-kde4/team/ &lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Gnome Māori localisation is not much better: stable at 1%: &lt;a href="http://l10n.gnome.org/teams/mi"&gt;http://l10n.gnome.org/teams/mi&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;In the medium/long term there is hope that much of this localisation can be bootstrapped by application-centric localisation that appears to be thriving, particularly with respect to &lt;a href="http://kiharoa.dear.maori.nz/"&gt;firefox, thunderbird and OOo.&lt;/a&gt;&lt;span style="color: rgb(0, 0, 0);font-size:100%;" &gt;&lt;span style="font-family:times new roman;"&gt;&lt;a href="http://kiharoa.dear.maori.nz/"&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8581938736898020099?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8581938736898020099/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8581938736898020099' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8581938736898020099'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8581938736898020099'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/08/kdegnome-mori-localisation-on-rocks.html' title='KDE/Gnome Māori localisation on the rocks?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8744500193200028253</id><published>2008-08-03T18:54:00.004+12:00</published><updated>2008-08-03T19:09:09.904+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NewZealand'/><category scheme='http://www.blogger.com/atom/ns#' term='work'/><category scheme='http://www.blogger.com/atom/ns#' term='nzetc catalyst.net.nz'/><category scheme='http://www.blogger.com/atom/ns#' term='catalyst.net'/><title type='text'>Leaving catalyst :( joining NZETC :)</title><content type='html'>Last week I gave notice at my current employer (&lt;a href="http://catalyst.net.nz/"&gt;Catalyst.net.nz&lt;/a&gt;) and accepted a job at Victoria University's &lt;a href="http://www.nzetc.org/"&gt;New Zealand Electronic Text Centre&lt;/a&gt;. The NZETC is primarily a TEI/XSLT/Cocoon-house which publishes digital versions of culturally significant works. It also runs a number of other digital services for the university library (into which it is currently being integrated). As such it's significantly closer to what I've been doing previously in terms of environment, content and technology. &lt;br /&gt;Exciting things about the NZETC from my point of view:&lt;br /&gt;&lt;ul&gt;&lt;li&gt; &lt;a id="sfkg0" href="http://www.nzetc.org/tm/scholarly/tei-BaxWeWi.html" title=""&gt;We Will Not Cease&lt;/a&gt; by &lt;a id="sfkg1" href="http://www.nzetc.org/tm/scholarly/name-207373.html" class="toc-author" title="Archibald McColl Learmond Baxter"&gt;Archibald Baxter&lt;/a&gt;, a first hand account of conscientious objection in the first world war by the father of New Zealand's greatest poet&lt;/li&gt;&lt;li&gt; Works by &lt;a id="tnov" href="http://www.nzetc.org/tm/scholarly/name-207379.html" class="toc-author" title=""&gt;John Cawte Beaglehole&lt;/a&gt;, including letters that mention him meeting my grandfather ("Jack Yeates") at Cambridge &lt;/li&gt;&lt;li&gt; Work with &lt;a id="g:_1" class="xref" href="http://www.learningmedia.co.nz/"&gt;Learning Media&lt;/a&gt;, (who I've previously published poetry with) on their publishing process&lt;/li&gt;&lt;li&gt; Extensive use of the  &lt;a id="sc7b" class="xref" href="http://creativecommons.org/licenses/by-sa/3.0/nz/"&gt;New Zealand Creative Commons Attribution Share-Alike License (CC BY-CA)&lt;/a&gt; license for works &lt;/li&gt;&lt;/ul&gt;The commute to work will be slightly longer, with me either getting off the bus one stop earlier and catching the &lt;a href="http://www.flickr.com/photos/glutnix/6063846/in/set-151661/"&gt;cablecar&lt;/a&gt; up the hill, or getting off at my current stop and walking up. I'm hoping to do mainly the later.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8744500193200028253?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8744500193200028253/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8744500193200028253' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8744500193200028253'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8744500193200028253'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/08/leaving-catalyst-joining-nzetc.html' title='Leaving catalyst :( joining NZETC :)'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2757543482127738935</id><published>2008-07-09T21:11:00.003+12:00</published><updated>2008-07-09T21:21:32.036+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='NewZealand'/><category scheme='http://www.blogger.com/atom/ns#' term='Māori'/><category scheme='http://www.blogger.com/atom/ns#' term='ubuntu'/><title type='text'>Who should I nominate for the NZ Open Source Awards?</title><content type='html'>So nominations are open for the &lt;a href="http://www.nzosa.org.nz/"&gt;New Zealand Open Source Awards&lt;/a&gt; and I have to decide who I should nominate. There doesn't seem to be anything stopping me nominating several, but picking one contender and throwing my weight behind them seems like the right thing to do. The ideas I've come up with so far are:&lt;br /&gt;&lt;br /&gt;Kiharoa Dear for excellent work in getting firefox, thunderbird and open office working in Māori contexts:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://kiharoa.dear.maori.nz/"&gt;http://kiharoa.dear.maori.nz/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Standards New Zealand for sanity control in the OOXML fiasco:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.standards.co.nz/news/Media+releases/NZ+maintains+negative+vote+on+OOXML+Standard.htm"&gt;http://www.standards.co.nz/news/Media+releases/NZ+maintains+negative+vote+on+OOXML+Standard.htm&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Hagley Community College for rolling out Ubuntu in a secondary school:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://computing.hagley.school.nz/about/opensource"&gt;http://computing.hagley.school.nz/about/opensource&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Who should I nominate? Is there someone I've missed?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2757543482127738935?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2757543482127738935/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2757543482127738935' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2757543482127738935'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2757543482127738935'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/07/who-should-i-nominate-for-nz-open.html' title='Who should I nominate for the NZ Open Source Awards?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-5182844215638089571</id><published>2008-06-30T20:13:00.001+12:00</published><updated>2008-06-30T20:27:53.785+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><category scheme='http://www.blogger.com/atom/ns#' term='configuration'/><category scheme='http://www.blogger.com/atom/ns#' term='ubuntu'/><title type='text'>I'm confused about hardy heron and default applications</title><content type='html'>Back in the day you told your linux system which applications you wanted to use with environmental variables things like:&lt;br /&gt; &lt;blockquote&gt;export EDITOR=/usr/bin/emacs &lt;/blockquote&gt; Then along came the wonderful debianness of the apt-family and the alternatives system.&lt;br /&gt;&lt;blockquote&gt; update-alternatives --config vi&lt;/blockquote&gt; Now this system too is being undermined by various systems, leaving me uncertain where to set things. What I'm trying to do is:&lt;br /&gt;&lt;ul&gt;&lt;li&gt; have Sound Juicer and not Music Player (RhythmBox) launched when a CD is inserted. There is an entry for "Multimedia" under the "Preferred Applications" menu option, but this seems to be about opening files, not responding to newly-mounted media and Sound Juicer is not listed as an option. There doesn't seem to be anything about CDs under the "Removable Drives and Media Preferences" (although this is where the setting are that automatically load F-Spot when I attach my camera, which seems like the same kind of thing).&lt;/li&gt;&lt;li&gt; configure which applications I can launch on the .cr2/TIFF/Canon RAW files produced by my digital camera I want the same applications to appear in both the file browser and F-Spot (which look like they're presenting the same interface but apparently aren't). ufraw seems to be the tool of choice here (either standalone or as a gimp plugin), but I'd like to pass it some command line args. I can find no entry or this under the "Preferred Applications" menu option.&lt;/li&gt;&lt;/ul&gt;  There are lots of menus with a "Help" as an option, but very few of them seem to be.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-5182844215638089571?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/5182844215638089571/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=5182844215638089571' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5182844215638089571'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5182844215638089571'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/06/im-confused-about-hardy-heron-and.html' title='I&apos;m confused about hardy heron and default applications'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1008662576703238580</id><published>2008-06-30T16:34:00.002+12:00</published><updated>2008-06-30T16:40:41.779+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='photo'/><category scheme='http://www.blogger.com/atom/ns#' term='mike'/><category scheme='http://www.blogger.com/atom/ns#' term='portrait'/><category scheme='http://www.blogger.com/atom/ns#' term='catalyst.net'/><title type='text'>Mike O'Connor at Friday drinks</title><content type='html'>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt;&lt;a href="http://www.flickr.com/photos/stuartyeates/2619680029/" title="photo sharing"&gt;&lt;img src="http://farm4.static.flickr.com/3281/2619680029_b077837255_m.jpg" alt="" style="border: 2px solid rgb(0, 0, 0);" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="margin-top: 0px;font-size:0;" &gt;&lt;a href="http://www.flickr.com/photos/stuartyeates/2619680029/"&gt;Mike O'Connor&lt;/a&gt;&lt;br /&gt;Originally uploaded by &lt;a href="http://www.flickr.com/people/stuartyeates/"&gt;Stuart Yeates&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;I took some photos at Friday drinks, trying to do the whole wide-aperture-to-isolate-visual-elements thing. I wasn't really aware of just how much it is dependent on the relative position of the photographer, subject and background.&lt;br /&gt;&lt;br /&gt;Some of them turned out better than others.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1008662576703238580?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1008662576703238580/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1008662576703238580' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1008662576703238580'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1008662576703238580'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/06/mike-o-at-friday-drinks.html' title='Mike O&amp;#39;Connor at Friday drinks'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://farm4.static.flickr.com/3281/2619680029_b077837255_t.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8859111687857695541</id><published>2008-06-29T13:01:00.002+12:00</published><updated>2008-06-29T13:07:24.201+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='community building'/><category scheme='http://www.blogger.com/atom/ns#' term='ohloh'/><title type='text'>What should the ohloh homepage look like?</title><content type='html'>&lt;p&gt;In a previous post I criticised &lt;a href="http://www.ohloh.net/"&gt;ohloh&lt;/a&gt; homepage for being completely useless for current users of the site. This was somewhat unfair, since I provided no concrete constructive suggestions as to what should be on the page. This blog post, hopefully, fixes that.&lt;/p&gt;&lt;p&gt;To my mind there are two classes of information that should be on the homepage: (a) things that lots of users are confused about and (b) things that are 'new' (think customised rss feeds) (c) combinations of both.&lt;/p&gt;&lt;p&gt;Finding out what people are confused about is easy, just look in the forums, where people are most confused about:&lt;/p&gt;&lt;ol&gt;&lt;li&gt;changes in their kudos&lt;/li&gt;&lt;li&gt;why their enlistment hasn't been updated&lt;/li&gt;&lt;li&gt;why version control system of choice isn't supported&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;The list of 'new' things is:&lt;/p&gt;&lt;ol&gt;&lt;li&gt;new / updated projects&lt;/li&gt;&lt;li&gt;new / updated users&lt;/li&gt;&lt;li&gt;new / updated enlistments&lt;/li&gt;&lt;li&gt;new forum posts &lt;/li&gt;&lt;li&gt;new RSS items in projects RSS feeds&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;These (1), (3) and (5) can be filtered by the users connection to the project (contributor/user/none).&lt;/p&gt;&lt;p&gt;So the trick now is to find combinations which help users understand what's going on and encourage users to engage with ohloh and the projects.&lt;/p&gt;&lt;p&gt;Idea X: A feed of updated enlistments a user is a contributor or user of:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;  Project A's enlistment at http://example.com/git updated 24st June 2008 at 24:50 GMT. A, B and C are the biggest commiters to this project, which is in Java and XML. Last updated 1st Feb 1970.&lt;/li&gt;&lt;li&gt;  Project B's enlistment at http://example.com/git updated 24st June 2008 at 24:50 GMT. D, E and F are the biggest commiters to this project, which is in C and shell script. Last updated 1st Feb 1970.&lt;/li&gt;&lt;li&gt;  Project C's enlistment at http://example.com/git failed at or about revision 12345. Click here for instrustions on what to do about this. Last updated 1st Feb 1970.&lt;/li&gt;&lt;li&gt;  ...&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;This not only tells user the status of their projects, but that enlistments are being processed, the expected time between each processing of enlistments, that some processing fails and that there's a link to find out more information. Such a feed also focuses attention on the processing of enlistments---which is the heart of ohloh and the key differentiating factor that seperates ohloh from 15 billion other open source sites.&lt;/p&gt;&lt;p&gt;Idea Y: A mixed feed of upstream bugs that effect ohloh performance and functionality:&lt;/p&gt;&lt;ul&gt;&lt;li&gt;  Ticket "support for .xcu file format" updated in ohcount by user "batman"&lt;/li&gt;&lt;li&gt;  Post "jabber message length" updated in Help! forum by user "someone else"&lt;/li&gt;&lt;li&gt;  Ticket "svn branch support" updated in ohloh by user "robin"&lt;/li&gt;&lt;li&gt;  Ticket "bzr support in ohloh" created in ohloh by user "joker"&lt;/li&gt;&lt;li&gt;  Post "jabber message length" created in Help! forum by user "someone"&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;This lets people keep up with the status of ohloh progress on issues such as the implementation branch support for svn and support for hg.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8859111687857695541?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8859111687857695541/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8859111687857695541' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8859111687857695541'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8859111687857695541'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/06/what-should-ohloh-homepage-look-like.html' title='What should the ohloh homepage look like?'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1624899669758395371</id><published>2008-06-23T18:27:00.000+12:00</published><updated>2008-06-23T18:29:23.171+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='usability'/><category scheme='http://www.blogger.com/atom/ns#' term='ohloh'/><title type='text'>New ohloh look and feel</title><content type='html'>&lt;a href="http://www.ohloh.net/"&gt;ohloh&lt;/a&gt; have changed their look and feel, and I've got to say I hate it.&lt;br /&gt;&lt;br /&gt;Once you're logged in, almost nothing above the scroll cut on the front page is useful---we already know what ohloh is and don't need bandwidth-hogging ads to tell us. What we need are deep links into new stuff---projects, users and forum posts.&lt;br /&gt;&lt;br /&gt;How about logged in users see content rather than ads on the homepage?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1624899669758395371?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1624899669758395371/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1624899669758395371' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1624899669758395371'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1624899669758395371'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/06/new-ohloh-look-and-feel.html' title='New ohloh look and feel'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2779397838284971663</id><published>2008-06-15T11:07:00.003+12:00</published><updated>2008-06-15T11:16:56.094+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NewZealand'/><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><category scheme='http://www.blogger.com/atom/ns#' term='kernel'/><category scheme='http://www.blogger.com/atom/ns#' term='ubuntu'/><title type='text'>Kernel Hell and what to do about it</title><content type='html'>&lt;p&gt;I've been in kernel hell with my home system for the past couple of days. What I want to build is a custom kernel that'll do xen, vserver, vmware, selinux, support both my wireless chipsets and support my video chipset. Ideally it should be built the Debian/Ubuntu way, so it just works on my Ubuntu Hardy Heron system.&lt;/p&gt;&lt;p&gt;So far I've had various combinations of four or five out of six working at once.&lt;/p&gt;&lt;p&gt;I'm not a kernel hacker, but I have a PhD in computer science, so I should be able to at least make progress on this, and the fact that I can't is very frustrating.  At work I grabbed a kernel off a co-worker, but it wasn't built the Debian/Ubuntu way.&lt;/p&gt;&lt;p&gt;Standing back and looking at the problem, there seem to be two separate contributing factors:&lt;/p&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;  There are a huge number of organically-grown structural layers. I count git, the kernel build scripts, make, Linus's release system, the Debian kernel building system and the ubuntu kernel building system. I won't deny that each of these service a purpose, but that's is different points a which each of the six different things I'm trying to make work can begin their explanation of how to make them work and six different places for things to go wrong.&lt;/li&gt;&lt;li&gt;There are about many Linux distributions and each of the things I'm trying to get working caters to a different set of them.&lt;/li&gt;&lt;/ol&gt;In many ways the distribution kernel packagers are victims of their own success, most Ubuntu, debian and RedHat kernels just work because they're packagers keep adding more and more features and more and more drivers to the default kernels. With the default kernels working for so many people, fewer and fewer people build their own kernels and the pool of knowledge shrinks. The depth of that knowledge increases too, with the each evolution of the collective build system.&lt;p&gt;Wouldn't it be great if someone (ideally under the auspices of the OSDL) stepped in and said "This is insane, we need a system to allow users to build their own kernels from a set of &amp;lt;git repository, tag&amp;gt;  pairs and a set of flags (a la the current kernel config system). It would download the git repositories and sync to the tags and then compile to the set of flags.  Each platform can build their own GUI and their own backend so it works with their widget set and their low level architecture, but here's a prototype."&lt;/p&gt;&lt;p&gt;The system would take the set of repositories and tags in those repositories and download the sources with git, merge the results, use the flags to configure the build and build the kernel. Of course, sometimes the build won't work (in which case the system sends a copy of the config and the last N lines of output to a central server) and sometimes it will (in which case the system sends a copy of the config and an md5 checksum of the kernel to a central server and optionally uploads the kernel to a local repository), but more than anything it'll make it easy and safe for regular users to compile their own kernels.  The system would supplant "building kernels the Debian way" or "building kernels the RedHat way" and enable those projects working at the kernel level to provide meaningful support and help to their users on distributions other than slackware.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;Potential benefits I can see are:&lt;/p&gt;&lt;ol&gt;&lt;li&gt;  increasing the number of crash-tolerant users willing to test the latest kernel features (better testing of new kernels and new features, which is something that's frequently asked for on lkml)&lt;/li&gt;&lt;li&gt;  easing the path of new device drivers (users get to use their shiny new hardware on linux faster)&lt;/li&gt;&lt;li&gt;  increasing the feedback from users to developers, in terms of which features people are using/interested in (better, more responsive, kernel development)&lt;/li&gt;&lt;li&gt;  reduce the reliance on linux packagers to release kernels in which an impossible-to-test number of features work flawlessly (less stressed debian/ubuntu/redhat kernel packages)&lt;/li&gt;&lt;li&gt;  ease the path to advanced kernel use such as virtualisation&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;You know the great thing about that list? Everyone who would need to cooperate gets some benefit, which means that it might just happen...&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2779397838284971663?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2779397838284971663/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2779397838284971663' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2779397838284971663'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2779397838284971663'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/06/kernel-hell-and-what-to-do-about-it.html' title='Kernel Hell and what to do about it'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2114180494282979079</id><published>2008-06-15T10:38:00.003+12:00</published><updated>2008-06-15T13:35:40.637+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NewZealand'/><category scheme='http://www.blogger.com/atom/ns#' term='macrons'/><category scheme='http://www.blogger.com/atom/ns#' term='maori'/><category scheme='http://www.blogger.com/atom/ns#' term='Māori'/><category scheme='http://www.blogger.com/atom/ns#' term='i18n'/><category scheme='http://www.blogger.com/atom/ns#' term='l10n'/><title type='text'>Macrons and URLs</title><content type='html'>Macrons are allowed in the path part of URLs, but not currently in the machine-name part (or at least, &lt;a href="http://dnc.org.nz/story/30294-29-1.html"&gt;not yet&lt;/a&gt;), so &lt;a href="http://extensions.services.openoffice.org/project/m%C4%81ori-papakupu"&gt;http://extensions.services.openoffice.org/project/māori-papakupu&lt;/a&gt; is good, but  &lt;a href="http://www.taiuru.M%C3%84%C2%81ori.nz/"&gt;http://www.taiuru.Māori.nz/&lt;/a&gt; is not (use &lt;a href="http://www.taiuru.maori.nz/"&gt;http://www.taiuru.maori.nz/&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;A review of how lots of programs handle macrons is at &lt;a href="http://research.elabs.govt.nz/macron-support-in-open-source-web-applications/"&gt;http://research.elabs.govt.nz/macron-support-in-open-source-web-applications/&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2114180494282979079?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2114180494282979079/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2114180494282979079' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2114180494282979079'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2114180494282979079'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/06/macrons-and-urls.html' title='Macrons and URLs'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-2151972969924259813</id><published>2008-06-14T09:06:00.005+12:00</published><updated>2008-06-14T09:26:43.187+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sql'/><category scheme='http://www.blogger.com/atom/ns#' term='firefox'/><category scheme='http://www.blogger.com/atom/ns#' term='sqlite'/><category scheme='http://www.blogger.com/atom/ns#' term='selenium'/><category scheme='http://www.blogger.com/atom/ns#' term='mozilla'/><title type='text'>Exporting firefox 3.0 history to selenium</title><content type='html'>In the new firefox 3.0 they've completely changed he way history is recorded, using a SQL engine to record it (details &lt;a href="http://www.forensicswiki.org/index.php?title=Mozilla_Firefox_3_History_File_Format"&gt;here&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;I wrote a quik hack to export the history as a series of selenium tests:&lt;br /&gt;&lt;br /&gt;sqlite3 .mozilla/firefox/98we5tz3.default/places.sqlite 'select * from moz_places' | awk -F\| '{print   "&amp;lt;tr&amp;gt;&amp;lt;td&amp;gt;open&amp;lt;/td&amp;gt;&amp;lt;td&amp;gt;"$2"&amp;lt;/td&amp;gt;&amp;lt;td&amp;gt;&amp;lt;/td&amp;gt;&amp;lt;/tr&amp;gt;" }'&lt;br /&gt;&lt;br /&gt;sqlite3 locks the file, so you'll need to close firefox (or take a copy of the file) first. Cut and paste the results into an empty selenium test.&lt;br /&gt;&lt;br /&gt;Obviously, your profile will have a different name.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-2151972969924259813?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/2151972969924259813/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=2151972969924259813' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2151972969924259813'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/2151972969924259813'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/06/exporting-firefox-30-history-to.html' title='Exporting firefox 3.0 history to selenium'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-8526585748933284747</id><published>2008-05-04T18:23:00.004+12:00</published><updated>2008-05-04T18:48:45.033+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='social network'/><category scheme='http://www.blogger.com/atom/ns#' term='authentication'/><category scheme='http://www.blogger.com/atom/ns#' term='networking'/><title type='text'>OpenID - Everybody Wants To Go To Heaven, Nobody Wants To Die</title><content type='html'>I've been looking recently at &lt;a href="http://openid.net/"&gt;OpenId&lt;/a&gt; a little more closely. It's a great system, that I've been using for a number of years. &lt;br /&gt;&lt;br /&gt;My first free openid provider (&lt;a href="http://stuartyeates.videntity.org/"&gt;http://stuartyeates.videntity.org/&lt;/a&gt;) went belly up for reasons I'm still not too sure about, but the life expectancy of internet startups and free services has never been that long. I have a new identity (&lt;a href="http://stuartyeates.myopenid.com/"&gt;http://stuartyeates.myopenid.com/&lt;/a&gt;) which does me just fine.&lt;br /&gt;&lt;br /&gt;One thing I've been noticing is that while a host of the internet services that I use want to be my identity provider, very few of them want to let me login using an openid provided by another identity provider. From a business point of view I can completely understand this: they want to know everything about the user so they can (a) off better, more integrated services and (b) see more customised advertising. I have no problem with (a), but (b) is one of the reasons I'm eying up openid in the first place.&lt;br /&gt;&lt;br /&gt;They don't understand that to get to openid heaven, they're going to have to die. By giving away the user-authentication-and-give-us-your-personal-information step, they can drive significantly more logins and significantly deeper interaction with their website and with their content. Sure, they can't necessarily get access to the users email address and whatever fake personal details they submitted at sign up, but I've yet to see this used particularly well anyway. The personal details certainly don't seem to be used to anywhere near the same effect as analysis of user behaviour.&lt;br /&gt;&lt;br /&gt;What would be really good is an openid identity provider with (a) shared no information with any of the large advertising groups or dominant internet companies, (b) really clear information about privacy expectations (and I'm not talking here about a page of legalese titled "privacy policy," but discussion and disclosure of things which jurisdictions user data is stored in, steps taken to avoid collection of user data, etc) and (c) a clear sustainability model to prevent it being bought-out for the user data and to ensure it's continuity (I can imagine paying a subscription for it).&lt;br /&gt;&lt;br /&gt;I can imagine that this is the kind of thing that google or yahoo might sponsor in their efforts to promote their  next generation of web authentication standards. The existance of such an outfit would hugely boost the reputation of such a standard and lacking the huge advertising and existing lockin it would hardly challenge their own identity providers while answering a whole range of independence, privacy and monopoly questions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-8526585748933284747?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/8526585748933284747/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=8526585748933284747' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8526585748933284747'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/8526585748933284747'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/05/openid-everybody-wants-to-go-to-heaven.html' title='OpenID - Everybody Wants To Go To Heaven, Nobody Wants To Die'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-64593083719989468</id><published>2008-05-04T10:04:00.001+12:00</published><updated>2008-05-04T10:11:40.336+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='community building'/><category scheme='http://www.blogger.com/atom/ns#' term='social network'/><category scheme='http://www.blogger.com/atom/ns#' term='semantic web'/><category scheme='http://www.blogger.com/atom/ns#' term='go'/><category scheme='http://www.blogger.com/atom/ns#' term='ohloh'/><title type='text'>Tinkering with ohloh</title><content type='html'>&lt;a href="http://www.ohloh.net/accounts/8380"&gt;I've&lt;/a&gt; been tinkering with &lt;a href="http://www.ohloh.net/"&gt;www.ohloh.net&lt;/a&gt;, which is probably best described as a web 2.0 &lt;a href="http://freshmeat.net/"&gt;freshmeat&lt;/a&gt;. Rather than tracking manually-updated releases it relies on automatically detected updates to version control repositories, RSS  and geo-urls. It relies on wiki like editing rather than the strict ownership rules of freshmeat. ohloh does automatic language detection and source code analysis based on the version control repository and attributes individual commits to specific developers and their ohloh user account. &lt;br /&gt;&lt;br /&gt;I've added and am actively curating &lt;a href="http://www.ohloh.net/projects/search?q=baduk"&gt;a group of go/baduk projects&lt;/a&gt;. The overall goal is to encourage reuse and reduce the willingness of hackers to rewrite go/baduk systems from scratch.&lt;br /&gt;&lt;br /&gt;My next step on the technical side is to write some  &lt;a href="http://www.w3.org/TR/grddl/"&gt;GRDDL (Gleaning Resource Descriptions from Dialects of Languages)&lt;/a&gt; to transform the XML returned by the API into RDF, which I can them import into &lt;a href="http://code.google.com/p/simal/"&gt;simal.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;My next step on the social side is to mention what I'm doing in some of the go/baduk mailing lists, but I want to wait until I've got something concrete to provide that &lt;a href="http://senseis.xmp.net/?Software"&gt;Sensei's Library&lt;/a&gt; (the current repository of information about go/baduk programs) hasn't already got.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-64593083719989468?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/64593083719989468/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=64593083719989468' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/64593083719989468'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/64593083719989468'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/05/tinkering-with-ohloh.html' title='Tinkering with ohloh'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-6792821643925822973</id><published>2008-05-02T22:46:00.002+12:00</published><updated>2008-05-02T22:59:33.465+12:00</updated><title type='text'>Happiness is tests passing</title><content type='html'>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt; &lt;a href="http://www.flickr.com/photos/stuartyeates/2459209170/" title="photo sharing"&gt;&lt;img src="http://farm3.static.flickr.com/2354/2459209170_ba1aec3591_o.png" alt="" style="border: solid 2px #000000;" /&gt;&lt;/a&gt; &lt;br /&gt; &lt;span style="font-size: 0.9em; margin-top: 0px;"&gt;  &lt;a href="http://www.flickr.com/photos/stuartyeates/2459209170/"&gt;All tests passed&lt;/a&gt;  &lt;br /&gt;  Originally uploaded by &lt;a href="http://www.flickr.com/people/stuartyeates/"&gt;Stuart Yeates&lt;/a&gt; &lt;/span&gt;&lt;br /&gt;&lt;br /&gt; &lt;a href="http://www.flickr.com/photos/stuartyeates/2458381561/" title="photo sharing"&gt;&lt;img src="http://farm4.static.flickr.com/3222/2458381561_176200f8df_o.png" alt="" style="border: solid 2px #000000;" /&gt;&lt;/a&gt; &lt;br /&gt; &lt;span style="font-size: 0.9em; margin-top: 0px;"&gt;  &lt;a href="http://www.flickr.com/photos/stuartyeates/2458381561/"&gt;2 tests failing&lt;/a&gt; &lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;p&gt;I've just (re-)attained the happy state of the unit tests passing in the key package of &lt;a href="http://code.google.com/p/jgogears/"&gt;jgogears&lt;/a&gt;. I've invested a surprising amount of time and energy in the unit tests, mainly as a form of requirements analysis, and I'm really pleased with the result.&lt;/p&gt;&lt;br /&gt;&lt;p&gt; Sadly, the core package that contains much of the code has some outstanding, longstanding failures which are going to be challenging to fix. They are represent failures in &lt;a href="http://code.google.com/p/jgogears/"&gt;jgogears&lt;/a&gt;' ability to round-trip board states between GNU-Go ASCII and SGF  files and will require a bug-for-bug reimplementation of the GNU-Go ASCII board printer.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;jgogears has reached the stage where it now plays games that are obviously meant to be go, but is not yet a serious contender.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-6792821643925822973?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/6792821643925822973/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=6792821643925822973' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6792821643925822973'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/6792821643925822973'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/05/happiness-is-tests-passing.html' title='Happiness is tests passing'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-5256166235701608993</id><published>2008-04-18T12:10:00.003+12:00</published><updated>2008-04-18T12:23:34.417+12:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='vmware'/><category scheme='http://www.blogger.com/atom/ns#' term='ubuntu'/><category scheme='http://www.blogger.com/atom/ns#' term='networking'/><title type='text'>Resetting vmware networking</title><content type='html'>I had a hardware glitch (apparently chained failure of on-board network card, auto-negotiation on switch, auto-negotiation on larger switch) which left networking on my ubuntu edgy box somewhat randomised. Ubuntu recovered after a reboot, successfully finding the new network card (but calling it eth11). vmware-server didn't recover, and no amount of fiddling with the magic of  vmware-config-network.pl would cause vmware to forget all it knew about networking and look only at the current state of play.&lt;br /&gt;&lt;br /&gt;In an attempt to erase it's memory I removed /etc/vmware/locations but this caused vmware-config-network.pl not to run at all. &lt;br /&gt;&lt;br /&gt;Eventually I worked out that the trick was to remove everything in /etc/vmware/locations other than the lines near the start telling it where to find directories and executables and rerun vmware-config-network.pl selecting all the defaults. &lt;br /&gt;&lt;br /&gt;Suddenly everything worked.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-5256166235701608993?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/5256166235701608993/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=5256166235701608993' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5256166235701608993'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/5256166235701608993'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/04/resetting-vmware-networking.html' title='Resetting vmware networking'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-677934788472410979</id><published>2008-03-20T08:28:00.002+13:00</published><updated>2008-03-20T09:06:39.095+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='computer science'/><category scheme='http://www.blogger.com/atom/ns#' term='communications'/><title type='text'>Quality explanations of technical issues</title><content type='html'>I'm a big fan of clear explanations. If you want to explain something to someone (and given the alternative is letting everyone learn from their own mistakes, this has got to be good), clear explanations are really important. I've tutored computer science at uni and I've explained open source concepts to a whole range of people as part of my work at &lt;a href="http://www.oss-watch.ac.uk/"&gt;OSS Watch&lt;/a&gt; and come to learn that an analogy can be very useful.&lt;br /&gt;&lt;br /&gt;Imagine my pleasure at reading &lt;a href="http://it.slashdot.org/comments.pl?sid=492266&amp;cid=22793474"&gt;this analogy&lt;/a&gt; of a &lt;a href="http://lwn.net/SubscriberLink/272048/ecf14f359bcdcd15/"&gt;really rather complex compiler / interrupt issue&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;For the record, I know nothing about the x86 interrupts, since we were taught interrupts using the much simpler RISC SPARC system.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-677934788472410979?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/677934788472410979/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=677934788472410979' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/677934788472410979'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/677934788472410979'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/03/quality-explanations-of-technical.html' title='Quality explanations of technical issues'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-294813060857692969</id><published>2008-03-18T22:47:00.001+13:00</published><updated>2008-03-18T22:54:16.007+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='baduk'/><category scheme='http://www.blogger.com/atom/ns#' term='jgogears'/><category scheme='http://www.blogger.com/atom/ns#' term='suffix tree'/><category scheme='http://www.blogger.com/atom/ns#' term='go'/><category scheme='http://www.blogger.com/atom/ns#' term='computer science'/><title type='text'>Tinkering with suffix-trees and algorithms</title><content type='html'>I've been tinkering with learning algorithms for my computer-go player, &lt;a href="http://code.google.com/p/jgogears/"&gt;jgogears&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;It linearises board positions and then uses classic string processing techniques, principally a large suffix-tree. &lt;a href="http://www.allisons.org/ll/AlgDS/Tree/Suffix/"&gt;Suffix-trees&lt;/a&gt; are widely used in information processing, information theory and compression fields of computer science. I also used them extensively in my recent Ph.D.&lt;br /&gt;&lt;br /&gt;Currently I'm training with about 200 go games (~40k moves), giving me about 950K nodes in my suffix tree.&lt;br /&gt;&lt;br /&gt;I've just switched my linearisation method from a strict distance measure to one which capitalises on adjacency much better.  &lt;br /&gt;&lt;br /&gt;There are a number of tuning parameters for the rate at which I grow the tree. I'll be tinkering with them as I increase the number of boards I'm using for training.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-294813060857692969?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/294813060857692969/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=294813060857692969' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/294813060857692969'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/294813060857692969'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/03/tinkering-with-suffix-trees-and.html' title='Tinkering with suffix-trees and algorithms'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-2425168083034317580.post-1729419664506681059</id><published>2008-03-14T12:09:00.001+13:00</published><updated>2008-03-14T12:42:24.393+13:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='social network'/><category scheme='http://www.blogger.com/atom/ns#' term='invitation to tinker'/><category scheme='http://www.blogger.com/atom/ns#' term='red hat'/><category scheme='http://www.blogger.com/atom/ns#' term='mugshot'/><title type='text'>RedHat's MugShot and lockin</title><content type='html'>I've been playing with &lt;a href="http://mugshot.org/"&gt;MugShot&lt;/a&gt;, the &lt;a href="http://www.redhat.com/"&gt;RedHat&lt;/a&gt;'s venture into the social networking sphere. I was initially impressed by the site, which has builds a dynamic (if HTML-only) website while asking for really very little information and not asking for any confidential information (i.e. no asking of gmail passwords to pester friends to join up). The website is slick  and glossy, if a little heavy on my dialup connection.&lt;br /&gt;&lt;br /&gt;The more I use MugShot, however, the more I see it supporting rather than undermining lockin in the social networking sphere.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;1 Login with Gnome account or local username/password&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;While many of MugShot's Gnome account initial target audiance may have had Gnome accounts suitable for logging into to MugShot, I'm guessing that a really small proportion of people they knew did. Login using openid or similar would be great.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;2 Supporting only a small number of services, and not flexibly&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;It's great that mugshot supports &lt;a href="http://del.icio.us/"&gt;delicious&lt;/a&gt;. However, by only supporting a single service of this kind, not only are my &lt;a href="http://ma.gnolia.com/"&gt;magnolia&lt;/a&gt; bookmarks unsupported, but the current delicious control over the social bookmarking market is strengthened. I understand that the differences between the delicious and magnolia APIs are largely cosmetic, and find it hard to believe that much effort would have been required to support it. Similarly, there are a number of exciting competitors to youtube, amazon, facebook and other MugShot supported services which could be supported with very little time and effort on the part of MugShot.&lt;br /&gt;&lt;br /&gt;The services that are added, use a one-size-fits all approach, when I know of no two long-term users of (for example) facebook who use it in the same way.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;3 No machine-readable export&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;There is (that I can see) no constructive machine readable export of any kind from MugShot. Two basic RSS feeds (which aren't advertised in the GUI). No RDF/FOAF, no blogroll for integration into the next generation of social networking and web services. MugShot may see itself as the top of the social networking heap, but until that's evident to the rest of us, they need to play nicely with third parties, both below them and above them on the heap. Failing to export anything useful in a machine-readable format is not playing nicely.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;4 No "deep connection" back to the data sources &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Having configured MugShot, I can see lots of books, photos and links that those in my network find interesting. But even though Mugshot can tell the difference between  books, photos and links when it's displaying them and knows about the accounts I have on services for bookmarking, favouriting or commenting on  books, photos and links it doesn't offer functionality to bookmark, foavourite or comment on them. By not driving content and information back to the underlying datasources, MugShot undermines the underlying services and devalues them, in so doing it also (a) devalues the commitment and investment I've made to those systems and (b) reduces the likelihood that those services will go out of their way to help MugShot.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;5 "Invitation to tinker" only covers look-and-feel&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://mugshot.org/"&gt;MugShot&lt;/a&gt; has a great system for creating skins for music players, and several hundred people appear to have accepted this invitation to tinker and made come great skins. Unfortunately only cosmetic changes are possible. What I would like to see is a generic feed subscription creator which let me add new services that MugShot could listen to. Just like the music player skins, only a very small proportion users would bother, and most of those who did would not produce anything to shout about, but with a feed subscription creator, each success would lead to a whole new service that Mugshot could access and channel to their users. Such a effort would completely alleviate issue problems with the small number of supported services.&lt;br /&gt;&lt;br /&gt;MugShot has an invitation to tinker in the traditional open source web 1.0 sense, with a wiki and version control system, but it doesn't have an invitation to tinker in the web 2.0 sense in which users can scratch their itch (and make the project better) from within their browser in the way the &lt;a href="http://pipes.yahoo.com"&gt;yahoo pipes&lt;/a&gt;, for example, does.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/2425168083034317580-1729419664506681059?l=opensourceexile.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://opensourceexile.blogspot.com/feeds/1729419664506681059/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=2425168083034317580&amp;postID=1729419664506681059' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1729419664506681059'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/2425168083034317580/posts/default/1729419664506681059'/><link rel='alternate' type='text/html' href='http://opensourceexile.blogspot.com/2008/03/redhats-mugshot-and-lockin.html' title='RedHat&apos;s MugShot and lockin'/><author><name>Stuart Yeates</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
