By "changing the record", i mean translate them word by word using software.
Sorry i m new for this kind of modification. For synonyms filter, would
there be
a big table and result in degrade of indexing performance?
I have tried using filter like ICUTransformFilterFactory but it seems
not works
<analyzer type="index" class="org.apache.lucene.analysis.cjk.CJKAnalyzer">
<tokenizer class="org.apache.lucene.analysis.cjk.CJKTokenizer"/>
<filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
generateNumberParts="1" catenateWords="1" catenateNumbers="1"
catenateAll="0" splitOnCaseChange="1"/>
<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" enablePositionIncrements="true"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.SnowballPorterFilterFactory" language="English"
protected="protwords.txt"/>
<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
<filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j"
composed="false" remove_diacritics="true" remove_modifiers="true"
fold="true"/>
<filter class="solr.ISOLatin1AccentFilterFactory"/>
<filter class="solr.ICUTransformFilterFactory" id="Traditional-Simplified"/>
</analyzer>
Am i setting it wrong?
Regards,
Wayne
On 6/21/2011 2:30 AM, François Schiettecatte wrote:
Wayne
I am not sure what you mean by 'changing the record'.
One option would be to implement something like the synonyms filter to generate
the TC for SC when you index the document, which would index both the TC and
the SC in the same location. That way your users would be able to search with
either TC or SC.
Another option would be to use the same synonyms filter but do the expansion at
search time.
Cheers
François
On Jun 20, 2011, at 5:41 AM, waynelam wrote:
Hi,
I 've recently make change to my schema.xml to support import of Chinese Record.
What i want to do is to search both Traditional Chinese(TC) (e.g. ?? )and
Simplified Chinese (SC) (e.g. ??) Record
when in the same query. I know I can do that by encoding all SC Record to TC. I
want to change to way to index
rather that change the record.
Anyone should show me the way in much appreciated.
Thanks
Wayne
--
-----------------------------------------
Wayne Lam
Assistant Library Officer I
Systems Development& Support
Fong Sum Wood Library
Lingnan University
8 Castle Peak Road
Tuen Mun, New Territories
Hong Kong SAR
China
Phone: +852 26168585
Email: wayne...@ln.edu.hk
Website: http://www.library.ln.edu.hk
--
-----------------------------------------
Wayne Lam
Assistant Library Officer I
Systems Development& Support
Fong Sum Wood Library
Lingnan University
8 Castle Peak Road
Tuen Mun, New Territories
Hong Kong SAR
China
Phone: +852 26168585
Email: wayne...@ln.edu.hk
Website: http://www.library.ln.edu.hk