If you need a framework to build your enhancement pipeline on I think
Apache UIMA [1] is good as it's also able to store annotated documents into
Lucene and Solr so it may be a good fit for your needs. Just consider that
you have to learn how to use / develop on top of it, it's not a big deal
but needs it to be taken into account (especially because you're running
away from GATE).

My 2 cents,
Tommaso


2014/1/16 Philippe de Rochambeau <phi...@free.fr>

> Hello,
>
> can anyone suggest alternatives to GATE (http://gate.ac.uk/download/)? I
> would like to index place and person names in PDFs using gazetteers (ie,
> dictionaries) and normalize dates ( (eg, December 1st, 2001 will be indexed
> as 20011201) and feed the indexes to SOLR?
>
> GATE is a great tool, but the search engine, Mimir, is unfortunately not
> customizable (well-document enough) enough for my purposes, which are to
> return the found documents (PDFs) ordered by document or entity (eg,
> {Date}, {Person}) name.
>
> Many thanks.
>
> Philippe
>
>
>

Reply via email to