In my solr searching i need to calculate distance between words in html text
and also i need tracking if there any tag between words occures. i've used
the following structure:

field:data - tokenized, indexed, not stored. this is array of words.
field:data-orig - stored , not indexed. this is simplified html markup of
original doc

When search matches a document i need to test result against tag absence
between matched words. In my case, i've decided to do this with regex way,
checking that no <> (&lt;&gt;) entities occured between words. 

Currently i'm fetching data-orig and verifying it on client side deciding to
show or not this hit and than reconecting to solr to fetch more. This cause
huge amount of data being set to client.

I've decided that best way to regexp on server side. I've tried to inject
this at FQ paramenter but came to result that i need to modify lucene query
language in this case. I suppose that there is more easy way to do this.
Could you help me in this?
-- 
View this message in context: 
http://www.nabble.com/XY-Problem-with-regular-expressions-%3A%29-tp16914089p16914089.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to