Not sure. Perhaps it can be done by training a language model and treating phone numbers as named entities? Not sure if it would work. But I know there are a few NLP people subscribed, maybe they'll have some good ideas.
Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- > From: Cam Bazz <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday, June 9, 2008 4:24:48 PM > Subject: Re: Solr system and numbers > > I got a similar question: > how would one normalize or even detect if a string is a phone number? > > On Mon, Jun 9, 2008 at 4:17 PM, dudes dudes wrote: > > > > > great info ,,, thanks a lot all > > > > > > ---------------------------------------- > > > Date: Mon, 9 Jun 2008 05:58:50 -0700 > > > From: [EMAIL PROTECTED] > > > Subject: Re: Solr system and numbers > > > To: solr-user@lucene.apache.org > > > > > > Hi, > > > Solr/Lucene can treat phone numbers as strings. If you want to clean > > them up and normalize them outside of Solr, you can do that and feed them > > into Solr as pure numbers. > > > > > > How the phone numbers will be treated after you pump them into Solr > > depends on the analyzer you choose to use for this data. If you don't need > > to search on subsets of phone numbers, then just don't tokenize them (i.e. > > use string type if the phone numbers contain any non-numeric characters, > > sint otherwise). > > > > > > Otis > > > -- > > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > > > > > > ----- Original Message ---- > > >> From: dudes dudes > > >> To: solr-user@lucene.apache.org > > >> Sent: Monday, June 9, 2008 2:10:20 PM > > >> Subject: Solr system and numbers > > >> > > >> > > >> Hello experts, > > >> > > >> How does Solr deal with numbers or phone numbers .. For example if you > > have 1234 > > >> and 12 34 or 1 234... with spaces between the numbers .. > > >> Or this is dealt by lucene ? > > >> > > >> any documentations or tutorial on this ? > > >> > > >> many thanks, > > >> ak > > >> _________________________________________________________________ > > >> > > >> All new Live Search at Live.com > > >> > > >> http://clk.atdmt.com/UKM/go/msnnkmgl0010000006ukm/direct/01/ > > > > > > > _________________________________________________________________ > > > > All new Live Search at Live.com > > > > http://clk.atdmt.com/UKM/go/msnnkmgl0010000006ukm/direct/01/ > >