Re: TikaEntityProcessor + multivalue field as url source

2014-01-29 Thread Bustaa
Thanks for you suggestions Ahmet. We are using the Typo3 CMS (with custom extensions / db-schemas). We are using Solarium to connect to the Solr instance. The schema is pretty simple:

Re: TikaEntityProcessor + multivalue field as url source

2014-01-29 Thread Ahmet Arslan
Hi Bustaa, Can you paste your data-config.xml?  Also, did you consider using ManifoldCF [1] to crawl/index your CMS? What CMS are you using? [1] http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#repositoryconnectiontypes On Wednesday, January 29, 2014 1:03 PM,

TikaEntityProcessor + multivalue field as url source

2014-01-29 Thread Bustaa
Hello Solr Users, i'm trying to get Tika's "BinFileDataSource" to take the filenames from a multivalue field (array) but I'm getting the following exception: Debug output from running dataimport (shortenend): "query", "<<< LONG SQL-QUERY >>>", "time-taken",