: We've had some issues with people searching for a document with the
: search term '200 movies'. The document is actually titled 'two hundred
: movies'.
:
: Do we need to add every number to our synonyms dictionary to
: accomplish this? Is it best done at index or search time?

If all you care about is English, there's actually an "English.longToEnglish" method in the Lucene test-framework that was used to generate test corpuses back in the Lucene 1.x days. I don't actually think it's used in any Lucene tests anymore at all. You could probably whip up a filter using that in about a dozen lines of code, but it still wouldn't handle things like "dozen" (or "half dozen" or "gross"). It's there if you want to try.

-Hoss
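
For illustration only, here is a rough sketch of the kind of conversion such a filter would need to do. It does not use the Lucene test-framework at all: the class NumberWords and its methods toEnglish and expandNumericTokens are hypothetical names made up for this example, the output format (space-separated words, no hyphens, no "and") is an assumption, and a real solution would wrap something like this in a TokenFilter rather than operate on raw strings.

```java
public class NumberWords {
    // Words for 0..19; index equals the value.
    private static final String[] ONES = {
        "zero", "one", "two", "three", "four", "five", "six", "seven", "eight", "nine",
        "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen", "sixteen",
        "seventeen", "eighteen", "nineteen"
    };
    // Words for multiples of ten; indexes 0 and 1 are unused.
    private static final String[] TENS = {
        "", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy", "eighty", "ninety"
    };
    // Scale word for each group of three digits, lowest group first.
    private static final String[] SCALES = {"", "thousand", "million", "billion"};

    // Hypothetical helper: spell out 0 <= n < 10^12 in lowercase English words.
    public static String toEnglish(long n) {
        if (n < 0 || n >= 1_000_000_000_000L) {
            throw new IllegalArgumentException("out of supported range: " + n);
        }
        if (n == 0) {
            return ONES[0];
        }
        String result = "";
        // Walk the number three digits at a time, from the lowest group up.
        for (int scale = 0; n > 0; scale++, n /= 1000) {
            int group = (int) (n % 1000);
            if (group == 0) {
                continue; // nothing to say for an all-zero group
            }
            String piece = groupToEnglish(group);
            if (scale > 0) {
                piece = piece + " " + SCALES[scale];
            }
            result = result.isEmpty() ? piece : piece + " " + result;
        }
        return result;
    }

    // Spell out a single group in the range 1..999.
    private static String groupToEnglish(int g) {
        StringBuilder sb = new StringBuilder();
        if (g >= 100) {
            sb.append(ONES[g / 100]).append(" hundred");
            g %= 100;
            if (g > 0) {
                sb.append(' ');
            }
        }
        if (g >= 20) {
            sb.append(TENS[g / 10]);
            if (g % 10 > 0) {
                sb.append(' ').append(ONES[g % 10]);
            }
        } else if (g > 0) {
            sb.append(ONES[g]);
        }
        return sb.toString();
    }

    // Hypothetical helper: replace whitespace-separated tokens made only of
    // digits (up to 12 of them) with their spelled-out form.
    public static String expandNumericTokens(String text) {
        String[] tokens = text.trim().split("\\s+");
        for (int i = 0; i < tokens.length; i++) {
            if (tokens[i].matches("\\d{1,12}")) {
                tokens[i] = toEnglish(Long.parseLong(tokens[i]));
            }
        }
        return String.join(" ", tokens);
    }

    public static void main(String[] args) {
        System.out.println(expandNumericTokens("200 movies")); // two hundred movies
    }
}
```

Applied the same way to both the indexed title and the query, this would make '200 movies' and 'two hundred movies' produce the same terms. As noted above, it does nothing for words like "dozen" or "gross".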