$ cat add_test.sh
DATA='
<add>
  <doc>
    <field name="docid">666</field>
    <field name="titi_txt_fr">toto titi tata toto tutu titi</field>
  </doc>
</add>
'
$ sh add_test.sh
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int name="QTime">484</int></lst>
</response>
$ curl 'http://localhost:8983/solr/alian_test/terms?terms.fl=titi_txt_fr&terms.sort=index'
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int name="QTime">0</int></lst>
<lst name="terms"><lst name="titi_txt_fr"><int name="tata">1</int><int name="titi">1</int><int name="toto">1</int><int name="tutu">1</int></lst></lst>
</response>

So it's not only on the Luke side; it comes from Solr itself. Does that sound normal?

2017-11-21 11:43 GMT+01:00 Barbet Alain <alian123sol...@gmail.com>:
> Hi,
>
> I built a custom analyzer and set it up in Solr, but it doesn't work as I expect.
> I always get 1 as the frequency for each word, even if it's present
> multiple times in the text.
>
> So I tried with the default analyzer and found the same behavior.
> My schema:
>
> <fieldType name="text_ami" class="solr.TextField">
>   <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
> </fieldType>
> <field name="docid" type="string" indexed="true" required="true" stored="true"/>
> <field name="test_text" type="nametext"/>
>
> alian@yoda:~/solr> cat add_test.sh
> DATA='
> <add>
>   <doc>
>     <field name="docid">666</field>
>     <field name="test_text">toto titi tata toto tutu titi</field>
>   </doc>
> </add>
> '
> curl -X POST -H 'Content-Type: text/xml' \
>   'http://localhost:8983/solr/alian_test/update?commit=true' \
>   --data-binary "$DATA"
>
> When I test in the Solr interface (Analysis), I see the right behavior
> (titi and toto are found 2 times).
> But when I look at the Solr index with Luke, or in the Solr interface (Schema),
> the top terms always show 1 as the frequency. Can someone tell me what
> I'm missing?
>
> (Solr 6.5)
>
> Thank you!