$ cat add_test.sh
DATA='
<add>
  <doc>
    <field name="docid">666</field>
    <field name="titi_txt_fr">toto titi tata toto tutu titi</field>
  </doc>
</add>
'
$ sh add_test.sh
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int
name="QTime">484</int></lst>
</response>


$ curl 
'http://localhost:8983/solr/alian_test/terms?terms.fl=titi_txt_fr&terms.sort=index'
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int
name="QTime">0</int></lst><lst name="terms"><lst
name="titi_txt_fr"><int name="tata">1</int><int
name="titi">1</int><int name="toto">1</int><int
name="tutu">1</int></lst></lst>
</response>


So it's not only on Luke Side, it's come from Solr. Does it sound normal ?

2017-11-21 11:43 GMT+01:00 Barbet Alain <alian123sol...@gmail.com>:
> Hi,
>
> I build a custom analyzer & setup it in solr, but doesn't work as I expect.
> I always get 1 as frequency for each word even if it's present
> multiple time in the text.
>
> So I try with default analyzer & find same behavior:
> My schema
>
>   <fieldType name="text_ami" class="solr.TextField">
>     <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
>   </fieldType>
>   <field name="docid" type="string" indexed="true" required="true"
> stored="true"/>
>   <field name="test_text" type="nametext"/>
>
> alian@yoda:~/solr> cat add_test.sh
> DATA='
> <add>
>   <doc>
>     <field name="docid">666</field>
>     <field name="test_text">toto titi tata toto tutu titi</field>
>   </doc>
> </add>
> '
> curl -X POST -H 'Content-Type: text/xml'
> 'http://localhost:8983/solr/alian_test/update?commit=true'
> --data-binary "$DATA"
>
> When I test in solr interface / analyze, I find the right behavior
> (find titi & toto 2 times).
> But when I look in solr index with Luke or solr interface / schema,
> the top term always get 1 as frequency. Can someone give me the thing
> I forget ?
>
> (solr 6.5)
>
> Thank you !

Reply via email to