Re: Email regular expression.

2013-07-30 Thread Luis Cappa Banda
I've been re-reading about that in older solr-mail-list messages, and it seems that a query like 'field:*' implies that internally all the indexed terms are checked one by one, even if some caches are already filled for that field. That would explain the poor performance I saw in the past. However, it ma

Re: Email regular expression.

2013-07-30 Thread Luis Cappa Banda
I've tried this kind of query in the past, but I found that they perform poorly and are incredibly slow. But that's just my experience; maybe someone can share another opinion with us.
2013/7/30 Raymond Wiker
> On Jul 30, 2013, at 22:05, Luis Cappa Banda wrote:
> > Anyw

Re: Email regular expression.

2013-07-30 Thread Raymond Wiker
On Jul 30, 2013, at 22:05, Luis Cappa Banda wrote:
> Anyway, I still need to do a query like the following to retrieve those
> documents with at least one E-mail detected:
>
> http://localhost:8080/mysolr/select?q=emails:[* TO *]&start=0&rows=10&sort=mydate desc
Can't you just use emails:* ?
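For anyone who wants to compare the two query forms discussed here, the following is a minimal Python sketch that builds both request URLs with the parameters percent-encoded. The host, core name (`mysolr`), field names (`emails`, `mydate`) and parameter values are taken from the quoted URL; the helper `build_select_url` is a hypothetical name introduced only for this illustration and is not part of Solr or any client library.

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, start=0, rows=10, sort=None):
    """Return a /select URL with percent-encoded query parameters.

    Hypothetical helper for illustration only; it just assembles a URL
    string and does not contact a Solr server.
    """
    params = [("q", query), ("start", start), ("rows", rows)]
    if sort is not None:
        params.append(("sort", sort))
    return base_url + "?" + urlencode(params)


# Base URL as quoted in the thread.
base = "http://localhost:8080/mysolr/select"

# Open-ended range query: documents with at least one value in "emails".
range_url = build_select_url(base, "emails:[* TO *]", sort="mydate desc")

# Wildcard form suggested in the reply.
wildcard_url = build_select_url(base, "emails:*", sort="mydate desc")

print(range_url)
print(wildcard_url)
```

Encoding the parameters this way avoids sending raw spaces and brackets in the URL. Which of the two forms is faster is the open question in this thread, so timing both against your own index is the safest way to decide.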

Re: Email regular expression.

2013-07-30 Thread Luis Cappa Banda
Hello guys, I think I've found how to do this just by adding a filter. Just for anyone's curiosity: Anyway, I still need to do a query like the following to retrieve those documents with at least one E-mail detected: http://localhost:8080/mysolr/select

Re: Email regular expression.

2013-07-30 Thread Luis Cappa Banda
Hello, Jack, Steve. Thank you for your answers. I've never used UAX29URLEmailTokenizerFactory, but I read about it before trying RegExp queries. As far as I know, UAX29URLEmailTokenizerFactory tokenizes an input text value into tokens that match URLs, E-mails, etc. Reading the docu

Re: Email regular expression.

2013-07-30 Thread Jack Krupansky
Just use the UAX29URLEmailTokenizerFactory, which recognizes email addresses. Any particular reason that you're trying to reinvent the wheel?
-- Jack Krupansky
-----Original Message-----
From: Luis Cappa Banda
Sent: Tuesday, July 30, 2013 10:53 AM
To: solr-user@lucene.apache.org
Subject: Ema

Re: Email regular expression.

2013-07-30 Thread Steve Rowe
Luis, do you know about UAX29URLEmailTokenizerFactory?:
On Jul 30, 2013, at 10:53 AM, Luis Cappa Banda wrote:
> Hello everyone!
>
> Unfortunately I have to search all E-mail addresses found in a te

Re: Email regular expression.

2013-07-30 Thread Andy Lester
On Jul 30, 2013, at 9:53 AM, Luis Cappa Banda wrote:
> The syntax is the following:
>
> *E-mail: *
> text:/[a-z0-9_\|-]+(\.[a-z0-9_\|-]|)*@[a-z0-9-]|(\.[a-z0-9-]|)*\.([a-z]{2,4})/
Please note that the question of "How do I write a regex to match an email address" is one of the most discussed