Re: Anybody knows what IBM FileNet search looks like?

2013-05-13 Thread Oleg Tikhonov
:-) Alex, it seems to be a copyright ... Think about Lucene + ManifoldCF. FileNet is file repository saved in DB2. ManifoldCF has a connector that helps retrieve files/directories from DB & using Lucene it may index the context of the files. I am not sure if Solr has such handler like Tika, howeve

Re: Development inside or outside of Solr?

2012-02-19 Thread Oleg Tikhonov
Hi, I cannot say about two mentioned approaches however take a look at Tika CLI with --language option. Hope it helps, Oleg On Mon, Feb 20, 2012 at 4:44 AM, bing wrote: > Hi, all, > > I am deploying a multicore solr server runing on Tomcat, where I want to > achieve language detection during

Re: Debugging on Tika

2012-02-03 Thread Oleg Tikhonov
Hi Arkadi, You can try to extract text from your documents using Tika's CLI (more details http://tika.apache.org/0.7/gettingstarted.html). If you were succeeded that means that something goes wrong during the indexing. Tika only extracts text and metadata from the documents and sends this text to

LineEntityProcessor

2011-12-06 Thread Oleg Tikhonov
Hello everybody, I'm trying to use LineEntityProcessor of DIH but somehow without success. I've create data-lep-config.xml, added request handler in solrconfig.xml. During full-import I get a response saying that x rows were fetched, 0 docs added/updated. I defined also very basic regex for Reg

Fwd: Assist please

2011-11-04 Thread Oleg Tikhonov
-- Forwarded message -- From: NDIAYE Bacar Date: Fri, Nov 4, 2011 at 12:05 PM Subject: Assist please To: d...@tika.apache.org, u...@tika.apache.org Hi, I need your assist please for to configuration the Apache Tika to Sorl attachment in Drupal 7. I have try to confi

Re: Indexing Lotus Notes database using API

2011-09-05 Thread Oleg Tikhonov
Hi Tobias, I've been using ManifoldFC in context of FileNet P8 but not in the context of your use case. I think that you should use Notes API because it has very specific IBM tailored format. If you will find some open source project that can "extract" Lotus Notes DB, please share the informatio

Re: Find out why a document wasn't found

2011-08-31 Thread Oleg Tikhonov
Hi, why don't you index the file metadata, i.e. file name for instance. So when file's metadata is indexed you could start querying by file name. BR, Oleg On Wed, Aug 31, 2011 at 12:02 PM, occurred < schaubm...@infodienst-ausschreibungen.de> wrote: > Hi, > > I'm looking for a solution to find ou

Re: TikaEntityProcessor

2011-04-19 Thread Oleg Tikhonov
I don't know, will ask him. On Tue, Apr 19, 2011 at 7:02 PM, Li wrote: > Looks like dependencies. Did you or him included the dependencies in the > solrconfig? > > Sent from my iPhone > > On Apr 19, 2011, at 8:35 AM, Oleg Tikhonov wrote: > > >> Hello eve

TikaEntityProcessor

2011-04-19 Thread Oleg Tikhonov
Hello everybody, Recently, I got a message from a guy who was asking about TikaEntityProcessor. He uses Solr 1.4 and Tika 0.8. Here is a stack: SEVERE: Full Import failed org.apache.solr.handler. dataimport.DataImportHandlerException: Unable to load En tityProcessor implementation for entity:99464

Re: TikaEntityProcessor

2011-04-19 Thread Oleg Tikhonov
> Hello everybody, > > Recently, I got a message from a guy who was asking about > TikaEntityProcessor. > He uses Solr 1.4 and Tika 0.8. > Here is a stack: > SEVERE: Full Import failed > org.apache.solr.handler. > dataimport.DataImportHandlerException: Unable to load En > tityProcessor implementati