Re: Basic sentence parsing with the regex highlighter fragmenter

2010-01-07 Thread Caleb Land
ure what the solution is, but currently I'm just trimming leading punctuation + a space off on the client side, and leaving the sentence terminator-less.

-- Caleb Land

Re: Basic sentence parsing with the regex highlighter fragmenter

2010-01-06 Thread Caleb Land
> > In the grammatical sense, PowerShot should just be
> PowerShot, not power shot (which is what WordDelimiterFactory
> gives you, options permitting). So I think you probably want
> one of the other analyzers
>
> Have you tried any other analyzers? StandardAnalyzer might be

Re: Basic sentence parsing with the regex highlighter fragmenter

2010-01-05 Thread Caleb Land
. On Tue, Jan 5, 2010 at 2:05 PM, Caleb Land wrote:
> Hello,
> I'm using Solr 1.4, and I'm trying to get the regex fragmenter to parse
> basic sentences, and I'm running into a problem.
>
> I'm using the default regex specified in the example solr configuratio

Basic sentence parsing with the regex highlighter fragmenter

2010-01-05 Thread Caleb Land
ried everything I can think of, regex-wise, to get this to work, and I always end up with this problem. For example, I've tried:

\w[^.!?]{0,200}[.!?]

Which seems like it should include the ending punctuation, but it doesn't, so I think I'm missing something. Does anybody know a regex that works?

-- Caleb Land
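
One way to narrow this down is to run the pattern above against plain text outside Solr. The sketch below does that with Python's re module rather than Java's java.util.regex, which is what Solr uses; for this pattern and an ASCII sample the two should behave the same, but that is an assumption. The sample text is made up for illustration and is not from the thread.

```python
import re

# Pattern quoted in the post: a word character, up to 200 characters
# that are not sentence terminators, then one terminator.
SENTENCE = re.compile(r"\w[^.!?]{0,200}[.!?]")

# Hypothetical sample text (not taken from the thread).
text = "This is one sentence. Is this another? Yes, it is!"

# findall returns each non-overlapping match as a string.
fragments = SENTENCE.findall(text)

for fragment in fragments:
    print(repr(fragment))
```

On its own, the pattern keeps the closing punctuation in every match, so the missing terminator probably comes from how the fragmenter applies the pattern to the field text rather than from the expression itself. This check does not exercise the fragmenter at all, so it only rules out the regex.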