I have a different problem, so I created a new thread.
I have a custom field type:
<fieldType name="customfield" class="solr.TextField"
positionIncrementGap="1000">
<analyzer type="index">
<tokenizer class="solr.StandardTokenizerFactory" />
<filter class="solr.ICUFoldingFilterFactory" />
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.GreekStemFilterFactory"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.StandardTokenizerFactory"/>
<filter class="solr.ICUFoldingFilterFactory" />
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.GreekStemFilterFactory"/>
</analyzer>
</fieldType>
In this field I have to search custom tags and their attributes (I mean
tags like HTML tags, such as <div>). I would like to be able to search for:
a tag with an attribute equal to something, like: <div
attribute="ablock">*</div>
a tag that has a given attribute and contains a certain word, like: <span
attribute="lang" * >word</span> or like <div attribute="ablock">*word*</div>
a tag that has a given attribute and contains another tag that contains a
certain word: <div attribute="ablock">*<span attribute="lang"
*>word</span>*</div>. In this case it is important to match the final
closing </div>.
In the highlighter, if I search for a div, I want to get the content inside
the div.
I think I have to change the tokenizer, but I do not know which tokenizer
to use. The tokenizer must be compatible with ICUFoldingFilterFactory
because I need to make accent-insensitive searches.
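
To make the question concrete, here is a rough, untested sketch of the kind of
change I have in mind. It only swaps StandardTokenizerFactory (which discards
punctuation) for solr.WhitespaceTokenizerFactory, on the assumption that
splitting on whitespace alone would keep the tag markup inside the tokens. The
field type name customfield_ws is made up for illustration. Token filters run
on whatever the tokenizer emits, so ICUFoldingFilterFactory should still apply,
but I do not know whether this is enough for the nested-tag and highlighting
cases described above.

```xml
<!-- Hypothetical variant of the field type above: only the tokenizer differs. -->
<fieldType name="customfield_ws" class="solr.TextField"
           positionIncrementGap="1000">
  <!-- An analyzer without a type attribute is used for both indexing and querying. -->
  <analyzer>
    <!-- Splits on whitespace only, so angle brackets, equals signs and quotes stay in the tokens. -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Same filter chain as in my current field type. -->
    <filter class="solr.ICUFoldingFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.GreekStemFilterFactory"/>
  </analyzer>
</fieldType>
```

With this sketch a fragment such as <div attribute="ablock"> would presumably
become the two tokens <div and attribute="ablock">, so matching a tag together
with its closing </div> would still need something more than the tokenizer.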