Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Bernd Fehling
Hi Erik, thanks for your concerns and thoughts. There is no XY problem because we decouple input (storing) from, searching, faceting, ... What you see is just the input for storing and output of the original text in the results. There is no need to do any analysis on this. So don't worry, it works

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Erick Erickson
I've got to ask _how_ are you intending to search this field? On the surface, this feels like an XY problem. It's a "string" type. Therefore, if this is the input: 102, 111, 114, 32, 97, 32, 114, 101, 118, 105, 101, 119, 32, 115, 101, 101, 32, 66, 114 you'll only ever get a match if you search ex

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Bernd Fehling
It turned out that I didn't recognized that dcdescription is not indexed, only stored. So the next in "chain" ist f_dcperson where dccreator and dcdescription is combined and indexed. And this is why the error shows up on f_dcperson. ("delay of error") Thanks for your help, regards. Bernd Am 11.

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Shawn Heisey
On 5/11/2015 7:19 AM, Bernd Fehling wrote: > After reading https://issues.apache.org/jira/browse/LUCENE-5472 > one question still remains. > > Why is it complaining about f_dcperson which is a copyField when the > origin problem field is dcdescription which definately is much larger > than 32766? >

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Bernd Fehling
Hi Emir, ahhh, yes you're right. I missed that. Now I understand why it is not complaining about dcdescription and the error shows up on f_dcperson. "delay of error" ;-) Thanks Bernd Am 11.05.2015 um 15:25 schrieb Emir Arnautovic: > Hi Bernrd, > dcdescription field is not indexed. > > Thanks,

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Bernd Fehling
Hi Shawn, that means if I set a length limit on dcdescription or make dcdescription multivalue than the problem is solved because f_dcperson is already multivalue? Regards Bernd Am 11.05.2015 um 15:17 schrieb Shawn Heisey: > On 5/11/2015 6:13 AM, Bernd Fehling wrote: >> Caused by: java.lang.Il

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Emir Arnautovic
Hi Bernrd, dcdescription field is not indexed. Thanks, Emir -- Monitoring * Alerting * Anomaly Detection * Centralized Log Management Solr & Elasticsearch Support * http://sematext.com/ On 11.05.2015 15:22, Bernd Fehling wrote: Hi Emir, the dcdescription field is definately to big. But why i

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Bernd Fehling
After reading https://issues.apache.org/jira/browse/LUCENE-5472 one question still remains. Why is it complaining about f_dcperson which is a copyField when the origin problem field is dcdescription which definately is much larger than 32766? I would assume it complains about dcdescription field.

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Bernd Fehling
Hi Emir, the dcdescription field is definately to big. But why is it complaining about f_dcperson and not dcdescription? Regards Bernd Am 11.05.2015 um 15:12 schrieb Emir Arnautovic: > Hi Bernd, > Issue is with f_dcperson and what ends up in that field. It is configured to > be string, which m

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Shawn Heisey
On 5/11/2015 6:13 AM, Bernd Fehling wrote: > Caused by: java.lang.IllegalArgumentException: Document contains at least one > immense term > in field="f_dcperson" (whose UTF8 encoding is longer than the max length > 32766), all of which were skipped. > Please correct the analyzer to not produce su

Re: SOLR 4.10.4 - error creating document

2015-05-11 Thread Emir Arnautovic
Hi Bernd, Issue is with f_dcperson and what ends up in that field. It is configured to be string, which means it is not tokenized so if some huge value is in either dccreator or dccontributor it will end up as single term. Nemes suggest that it should not contain such values, but double check

SOLR 4.10.4 - error creating document

2015-05-11 Thread Bernd Fehling
I'm getting the following error with 4.10.4 WARN org.apache.solr.handler.dataimport.SolrWriter – Error creating document : SolrInputDocument(fields: [dcautoclasscode=310, dclang=unknown, ..., dcdocid=dd05ad427a58b49150a4ca36148187028562257a77643062382a1366250112ac]) org.apache.solr.comm