RE: Delimited payloads input issue

2015-03-11 Thread Markus Jelsma
Thanks David for sharing! The custom attribute approach sounds interesting indeed. Markus -Original message- > From:david.w.smi...@gmail.com > Sent: Tuesday 10th March 2015 16:53 > To: solr-user@lucene.apache.org > Subject: Re: Delimited payloads input issue &g

RE: Delimited payloads input issue

2015-03-06 Thread Markus Jelsma
st be modified to carry the original PayloadAttribute to its descendants. Markus -Original message- > From:Markus Jelsma > Sent: Friday 27th February 2015 17:28 > To: solr-user > Subject: Delimited payloads input issue > > Hi - we attempt to use payloads to iden

Delimited payloads input issue

2015-02-27 Thread Markus Jelsma
Hi - we attempt to use payloads to identify different parts of extracted HTML pages and use the DelimitedPayloadTokenFilter to assign the correct payload to the tokens. However, we are having issues for some language analyzers and issues with some types of content for most regular analyzers. If