How does facet_counts work with a facet field that is defined with
solr.PathHierarchyTokenizerFactory?

I have multiple records that contain a field Parameter whose type uses
PathHierarchyTokenizerFactory.
E.g.:
        "Parameter": [
          "EARTH SCIENCE>OCEANS>OCEAN TEMPERATURE>WATER TEMPERATURE",
          "EARTH SCIENCE>OCEANS>OCEAN PRESSURE>WATER PRESSURE",
          "EARTH SCIENCE>OCEANS>OCEAN ACOUSTICS>ACOUSTIC VELOCITY",
          "EARTH SCIENCE>ACOUSTIC",
          "EARTH SCIENCE>VELOCITY",
          "EARTH SCIENCE>ACOBAR | ACOUSTIC TECHNOLOGY FOR OBSERVING THE INTERIOR OF THE ARCTIC OCEAN",
          "EARTH SCIENCE>GEOGRAPHIC REGION>POLAR",
          "EARTH SCIENCE>GEOGRAPHIC REGION>ARCTIC"
        ],

But when I run this query to get all facet counts for Parameter:
http://localhost:8983/solr/nmdc/query?q=*:*&facet=true&rows=0&facet.mincount=1&facet.field=Parameter

the last two entries from this record,
"EARTH SCIENCE>GEOGRAPHIC REGION>POLAR",
"EARTH SCIENCE>GEOGRAPHIC REGION>ARCTIC"

are missing from facet_counts, which looks like:

  "facet_counts":{
    "facet_queries":{},
    "facet_fields":{
      "Parameter":[
        "EARTH SCIENCE",228,
        "EARTH SCIENCE>OCEANS",128,
        "EARTH SCIENCE>OCEANS>OCEAN TEMPERATURE",100,
        "EARTH SCIENCE>OCEANS>SALINITY/DENSITY",90,
...
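
For anyone who wants to reproduce this from a script, the request can be assembled like this. It is only a minimal Python sketch, assuming the same host, core name (nmdc) and parameters as in my URL; build_facet_url is a hypothetical helper name, not part of any Solr client.

```python
from urllib.parse import urlencode


def build_facet_url(base, field, mincount=1):
    """Build a facet-only query URL (rows=0) for a single facet field."""
    params = {
        "q": "*:*",
        "facet": "true",
        "rows": 0,
        "facet.mincount": mincount,
        "facet.field": field,
    }
    # safe="*:" leaves the match-all query unescaped so the URL stays readable.
    return base + "?" + urlencode(params, safe="*:")


print(build_facet_url("http://localhost:8983/solr/nmdc/query", "Parameter"))
```

This prints the same URL I used above.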

I'm running Solr 5.0.

Why does the query seem to omit some of the Parameter entries from records?
The path field type is configured with:
        <fieldType name="path" class="solr.TextField">
                <analyzer type="index">
                        <tokenizer class="solr.PathHierarchyTokenizerFactory" delimiter=">" />
                </analyzer>
                <analyzer type="query">
                        <tokenizer class="solr.KeywordTokenizerFactory" />
                </analyzer>
        </fieldType>
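
To make clear what I expect: as far as I understand, the index-time tokenizer emits one token per path prefix, so every prefix of every value should be a candidate facet term. Here is a minimal Python sketch of that expectation. It only mimics the tokenizer with delimiter ">" and otherwise default settings; path_hierarchy_tokens and expected_facet_terms are hypothetical helper names, not Solr APIs.

```python
def path_hierarchy_tokens(value, delimiter=">"):
    """Mimic the path hierarchy tokenizer: one token per path prefix."""
    parts = value.split(delimiter)
    return [delimiter.join(parts[:i]) for i in range(1, len(parts) + 1)]


def expected_facet_terms(values, delimiter=">"):
    """Collect the distinct terms that one record's values would produce."""
    terms = set()
    for value in values:
        terms.update(path_hierarchy_tokens(value, delimiter))
    return terms


# The two values that I cannot find in the facet output.
record = [
    "EARTH SCIENCE>GEOGRAPHIC REGION>POLAR",
    "EARTH SCIENCE>GEOGRAPHIC REGION>ARCTIC",
]
for term in sorted(expected_facet_terms(record)):
    print(term)
```

For these two values the sketch lists EARTH SCIENCE, EARTH SCIENCE>GEOGRAPHIC REGION, and the two full paths, which is what I expected to find somewhere in facet_fields.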

Cheers
Endre
