I got a similar question: how would one normalize or even detect if a string is a phone number?
On Mon, Jun 9, 2008 at 4:17 PM, dudes dudes <[EMAIL PROTECTED]> wrote: > > great info ,,, thanks a lot all > > > ---------------------------------------- > > Date: Mon, 9 Jun 2008 05:58:50 -0700 > > From: [EMAIL PROTECTED] > > Subject: Re: Solr system and numbers > > To: solr-user@lucene.apache.org > > > > Hi, > > Solr/Lucene can treat phone numbers as strings. If you want to clean > them up and normalize them outside of Solr, you can do that and feed them > into Solr as pure numbers. > > > > How the phone numbers will be treated after you pump them into Solr > depends on the analyzer you choose to use for this data. If you don't need > to search on subsets of phone numbers, then just don't tokenize them (i.e. > use string type if the phone numbers contain any non-numeric characters, > sint otherwise). > > > > Otis > > -- > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > > > ----- Original Message ---- > >> From: dudes dudes > >> To: solr-user@lucene.apache.org > >> Sent: Monday, June 9, 2008 2:10:20 PM > >> Subject: Solr system and numbers > >> > >> > >> Hello experts, > >> > >> How does Solr deal with numbers or phone numbers .. For example if you > have 1234 > >> and 12 34 or 1 234... with spaces between the numbers .. > >> Or this is dealt by lucene ? > >> > >> any documentations or tutorial on this ? > >> > >> many thanks, > >> ak > >> _________________________________________________________________ > >> > >> All new Live Search at Live.com > >> > >> http://clk.atdmt.com/UKM/go/msnnkmgl0010000006ukm/direct/01/ > > > > _________________________________________________________________ > > All new Live Search at Live.com > > http://clk.atdmt.com/UKM/go/msnnkmgl0010000006ukm/direct/01/ >