Re: DIH regex remove email + extract url

2011-02-25 Thread Koji Sekiguchi
(11/02/26 5:24), Rosa (Anuncios) wrote: Hi Koji, Yes of course i have RegexTransformer" in my . What i'm not sure is the syntax of this i don't need any other parameter here? Hi Rosa, So I've mentioned groupNames attribute for field element in previous mail. Did you try it? Koji -- http://

Re: DIH regex remove email + extract url

2011-02-25 Thread Rosa (Anuncios)
Hi Koji, Yes of course i have RegexTransformer" in my . What i'm not sure is the syntax of this xpath="/product/url" regex=" /> i don't need any other parameter here? Rosa Le 25/02/2011 12:21, Koji Sekiguchi a écrit : Hi Rosa, Are you sure you have transformer="RegexTransformer" in your

Re: DIH regex remove email + extract url

2011-02-25 Thread Koji Sekiguchi
Hi Rosa, Are you sure you have transformer="RegexTransformer" in your ? My question was more about the solr DIH syntax. It doesn't work either with the new regex. Especially the syntax for this: <--- Is it correct? (not the regex, the syntax)? In this case, I think you need to have two f

Re: DIH regex remove email + extract url

2011-02-25 Thread Rosa (Anuncios)
Hi Koji, My question was more about the solr DIH syntax. It doesn't work either with the new regex. Especially the syntax for this: /> <--- Is it correct? (not the regex, the syntax)? Example: url=http://www.abcd.com/product.php?id=324 --> i want to index source = abcd.com thanks for you

Re: DIH regex remove email + extract url

2011-02-24 Thread Koji Sekiguchi
Hi Rosa, Shouldn't it be regex="[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-z]{2,4}"? Example: url=http://www.abcd.com/product.php?id=324 --> i want to index source = abcd.com Probably it could be regex="http:\/\/(.*?)\/(.*)" I use a regex web tool: http://www.regexplanet.com/simple/index.h