Re: TEI indexing

Erik Hatcher Tue, 22 May 2007 05:46:24 -0700


On May 21, 2007, at 10:52 PM, Gary Browne wrote:

I'm wondering if anyone has any hints on how to prepare TEI documents
for indexing - I was about to write some XSLT but didn't want to
reinvent the wheel (unless it's punctured)?

I'm using Ruby to index TEI files, and leveraging the XPathMapperfunctionality built into the solr-ruby gem.

The (not so) funny thing about TEI is that every project uses itslightly differently, so whatever solution you come up with is likelynot to be exactly right for other projects (sadly). So the wheel ispunctured. For the Rossetti Archive (a way forked TEI variant), weuse XSLT to generate RDF/XML that then gets fed into a Java-basedindexer which uses Sesame's API to parse the RDF for sending intoSolr. [The reason we go to RDF first is that is the convention we'vedeveloped for getting data into NINES for all archives, not just ours]


        Erik

Re: TEI indexing

Reply via email to