The text field in the schema configuration looks like this. I changed
catenateNumbers to 0 but it still doesn't work as aspected.
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
<analyzer type="index">
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<!-- in this example, we will only use synonyms at query time
<filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt"
ignoreCase="true" expand="false"/>
-->
<!-- Case insensitive stop word removal.
add enablePositionIncrements=true in both the index and query
analyzers to leave a 'gap' for more accurate phrase queries.
-->
<filter class="solr.StopFilterFactory"
ignoreCase="true"
words="stopwords_en.txt"
enablePositionIncrements="true"
/>
<filter class="solr.StopFilterFactory"
ignoreCase="true"
words="stopwords_du.txt"
enablePositionIncrements="true"
/>
<filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
generateNumberParts="1" catenateWords="1" catenateNumbers="1"
catenateAll="0" splitOnCaseChange="1"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.SnowballPorterFilterFactory" language="English"
protected="protwords.txt"/>
<filter class="solr.SnowballPorterFilterFactory" language="Dutch" />
<filter class="solr.NGramFilterFactory" minGramSize="3" maxGramSize="15"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
ignoreCase="true" expand="true"/>
<filter class="solr.StopFilterFactory"
ignoreCase="true"
words="stopwords_en.txt"
enablePositionIncrements="true"
/>
<filter class="solr.StopFilterFactory"
ignoreCase="true"
words="stopwords_du.txt"
enablePositionIncrements="true"
/>
<filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
generateNumberParts="1" catenateWords="0" catenateNumbers="0"
catenateAll="0" splitOnCaseChange="1"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.SnowballPorterFilterFactory" language="English"
protected="protwords.txt"/>
<filter class="solr.SnowballPorterFilterFactory" language="Dutch" />
</analyzer>
</fieldType>
On 01/31/2012 03:03 PM, Erick Erickson wrote:
Unless you provide your schema configuration, there's
not much to go on here. Two things though:
1> look at the admin/analysis page to see how your
data is broken up into tokens.
2> at a guess you have WordDelimiterFilterFactory
in your chain and perhaps catenateNumbers="1"
Best
Erick
On Mon, Jan 30, 2012 at 3:21 AM, Arkadi Colson<ark...@smartbit.be> wrote:
Hi
I'm using the pecl PHP class to query SOLR and was wondering how to query
for a part of a sentence exactly.
There are 2 data items index in SOLR
1327497476: 123 456 789
1327497521. 1234 5678 9011
However when running the query, both data items are returned as you can see
below. Any idea why?
Thanks!
SolrObject Object
(
[responseHeader] => SolrObject Object
(
[status] => 0
[QTime] => 5016
[params] => SolrObject Object
(
[debugQuery] => true
[shards] =>
solr01:8983/solr,solr02:8983/solr,solr03:8983/solr
[fl] =>
id,smsc_module,smsc_ssid,smsc_description,smsc_content,smsc_courseid,smsc_date_created,smsc_date_edited,score,metadata_stream_size,metadata_stream_source_info,metadata_stream_name,metadata_stream_content_type,last_modified,author,title,subject
[sort] => smsc_date_created asc
[indent] => on
[start] => 0
[q] => (smsc_content:\"123 456\" ||
smsc_description:\"123 456\")&& (smsc_module:Intradesk)&&
(smsc_date_created:[2011-12-25T10:29:51Z TO NOW])&& (smsc_ssid:38)
[distrib] => true
[wt] => xml
[version] => 2.2
[rows] => 55
)
)
[response] => SolrObject Object
(
[numFound] => 2
[start] => 0
[docs] => Array
(
[0] => SolrObject Object
(
[smsc_module] => Intradesk
[smsc_ssid] => 38
[id] => 1327497476
[smsc_courseid] => 0
[smsc_date_created] => 2011-12-25T10:29:51Z
[smsc_date_edited] => 2011-12-25T10:29:51Z
[score] => 10.028017
)
[1] => SolrObject Object
(
[smsc_module] => Intradesk
[smsc_ssid] => 38
[id] => 1327497521
[smsc_courseid] => 0
[smsc_date_created] => 2011-12-25T10:29:51Z
[smsc_date_edited] => 2011-12-25T10:29:51Z
[score] => 5.541335
)
)
)
[debug] => SolrObject Object
(
[rawquerystring] => (smsc_content:\"123 456\" ||
smsc_description:\"123 456\")&& (smsc_module:Intradesk)&&
(smsc_date_created:[2011-12-25T10:29:51Z TO NOW])&& (smsc_ssid:38)
[querystring] => (smsc_content:\"123 456\" ||
smsc_description:\"123 456\")&& (smsc_module:Intradesk)&&
(smsc_date_created:[2011-12-25T10:29:51Z TO NOW])&& (smsc_ssid:38)
[parsedquery] => +(smsc_content:123 smsc_content:456
smsc_description:123 smsc_content:456) +smsc_module:intradesk
+smsc_date_created:[2011-12-25T10:29:51Z TO 2012-01-25T13:33:21.098Z]
+smsc_ssid:38
[parsedquery_toString] => +(smsc_content:123 smsc_content:456
smsc_description:123 smsc_content:456) +smsc_module:intradesk
+smsc_date_created:[2011-12-25T10:29:51 TO 2012-01-25T13:33:21.098]
+smsc_ssid:`#8;#0;#0;#0;&
[QParser] => LuceneQParser
[timing] => SolrObject Object
--
Smartbit bvba
Hoogstraat 13
B-3670 Meeuwen
T: +32 11 64 08 80
F: +32 89 46 81 10
W: http://www.smartbit.be
E: ark...@smartbit.be