Hello, I would like to know the best way to load a huge synonym list into Solr.
I would like to do concept indexing (a.k.a. category indexing) with Solr. For example, I want to be able to index all cities and be able to search for all of them using a special keyword, say 'CONCEPTcity', where 'CONCEPTcity' will match anything that IS-A city, as specified in the index_synonyms.txt file.

I believe the best way to do this is via the SynonymFilterFactory with index-time synonym expansion. Or is there a better alternative?

I would still like to keep the original city names and do not want to replace them with 'CONCEPTcity', so that if someone searches for 'Lake', the city name 'Salt Lake City' still matches. Also, obviously, I do not want two different city names to be synonyms of each other.

Is the correct way to specify the index_synonyms.txt file like this?

-----------------------------------------------------
CONCEPTcity, Salt Lake City
CONCEPTcity, New York
CONCEPTcity, San Jose
.
.
.
-----------------------------------------------------

and then keep expand="true" for SynonymFilterFactory?

I tried to load a synonym file with 10K entries like this, and Solr/Jetty took a few seconds to start, but when I try to load a synonym file with 1M+ entries, it takes a long time. What is the best way to do this?

Thanks,
Arun.
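
P.S. In case it helps, the index-time setup I have in mind looks roughly like the sketch below. Only SynonymFilterFactory, index_synonyms.txt and expand="true" come from what I described above; the field type name text_concept, the whitespace tokenizer, and the ignoreCase/lowercase handling are placeholders I picked for illustration and may not be the right choices. The query analyzer deliberately has no synonym filter, so the expansion would happen only at index time.

```xml
<!-- Sketch only: "text_concept" is a hypothetical field type name. -->
<fieldType name="text_concept" class="solr.TextField" positionIncrementGap="100">
  <!-- Index-time analysis: city names get 'CONCEPTcity' added alongside them. -->
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- expand="true" keeps the original tokens as well as the synonyms. -->
    <filter class="solr.SynonymFilterFactory"
            synonyms="index_synonyms.txt"
            ignoreCase="true"
            expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <!-- Query-time analysis: no synonym filter, so queries are not expanded. -->
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

With something like this, my expectation (not yet verified at the 1M+ scale) is that a search for 'CONCEPTcity' matches any document containing a listed city, while a search for 'Lake' still matches 'Salt Lake City'.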