If you need a framework to build your enhancement pipeline on I think Apache UIMA [1] is good as it's also able to store annotated documents into Lucene and Solr so it may be a good fit for your needs. Just consider that you have to learn how to use / develop on top of it, it's not a big deal but needs it to be taken into account (especially because you're running away from GATE).
My 2 cents, Tommaso 2014/1/16 Philippe de Rochambeau <phi...@free.fr> > Hello, > > can anyone suggest alternatives to GATE (http://gate.ac.uk/download/)? I > would like to index place and person names in PDFs using gazetteers (ie, > dictionaries) and normalize dates ( (eg, December 1st, 2001 will be indexed > as 20011201) and feed the indexes to SOLR? > > GATE is a great tool, but the search engine, Mimir, is unfortunately not > customizable (well-document enough) enough for my purposes, which are to > return the found documents (PDFs) ordered by document or entity (eg, > {Date}, {Person}) name. > > Many thanks. > > Philippe > > >