Different behavior (bug?) for RegExTransformer in Solr5

2015-05-26 Thread Carrie Coy
I'm experimenting with Solr5 (5.1.0 1672403 - timpotter - 2015-04-09 10:37:54). In my custom DIH, I use a RegExTransformer to load several columns, which may or may not be present. If present, the regexp matches and the data loads correctly in both Solr4 and 5. If not present and the r

Re: regexTransformer returns no results if there is no match

2014-08-15 Thread Shalin Shekhar Mangar
construct wikipedia page url from page title using > regexTransformer > with > > > sourceColName="title" /> > > This does not work for titles that have no space, so title_underscore for > them is empty. > > Any ideas what is wrong here? > > This is with solr-4.8.1 > > Thanks. Alex. > -- Regards, Shalin Shekhar Mangar.

regexTransformer returns no results if there is no match

2014-08-11 Thread alxsss
Hello, I try to construct wikipedia page url from page title using regexTransformer with This does not work for titles that have no space, so title_underscore for them is empty. Any ideas what is wrong here? This is with solr-4.8.1 Thanks. Alex.

RegexTransformer and xpath in DataImportHandler

2014-03-03 Thread eShard
Good afternoon, I have this DIH: https://redacted/"; processor="XPathEntityProcessor" forEach="/rss/channel/item" transformer="DateFormatTransformer,Templa

Re: Facet Count and RegexTransformer>splitBy

2013-09-06 Thread Raheel Hasan
the facet component, but first you > should describe exactly what your requirements are. > > -- Jack Krupansky > -Original Message- From: Raheel Hasan > Sent: Friday, September 06, 2013 9:58 AM > To: solr-user@lucene.apache.org > Subject: Facet Count and RegexTransformer&

Re: Facet Count and RegexTransformer>splitBy

2013-09-06 Thread Raheel Hasan
in the facet component, but first you >> should describe exactly what your requirements are. >> >> -- Jack Krupansky >> -Original Message- From: Raheel Hasan >> Sent: Friday, September 06, 2013 9:58 AM >> To: solr-user@lucene.apache.org >> Subject:

Re: Facet Count and RegexTransformer>splitBy

2013-09-06 Thread Raheel Hasan
here - are the commas delimiting fields or do you >> have one value per row? >> >> Yes, you can tokenize a comma-delimited value in Solr. >> >> >> -- Jack Krupansky >> >> -Original Message- From: Raheel Hasan >> Sent: Friday, Septemb

Re: Facet Count and RegexTransformer>splitBy

2013-09-06 Thread Raheel Hasan
ave one value per row? > > Yes, you can tokenize a comma-delimited value in Solr. > > > -- Jack Krupansky > > -Original Message- From: Raheel Hasan > Sent: Friday, September 06, 2013 11:54 AM > To: solr-user@lucene.apache.org > Subject: Re: Facet Count and RegexTr

Re: Facet Count and RegexTransformer>splitBy

2013-09-06 Thread Jack Krupansky
e.org Subject: Re: Facet Count and RegexTransformer>splitBy Hi, What I want is very simple: The "query" results: row 1 = a,b,c,d row 2 = a,f,r,e row 3 = a,c,ff,e,b .. facet count needed: 'a' = 3 occurrence 'b' = 2 occur. 'c' = 2 occur. .

Re: Facet Count and RegexTransformer>splitBy

2013-09-06 Thread Jack Krupansky
, September 06, 2013 9:58 AM To: solr-user@lucene.apache.org Subject: Facet Count and RegexTransformer>splitBy Hi guyz, Just a quick question: I have a field that has CSV values in the database. So I will use the DataImportHandler and will index it using RegexTransformer's splitBy attribute. Howeve

Facet Count and RegexTransformer>splitBy

2013-09-06 Thread Raheel Hasan
Hi guyz, Just a quick question: I have a field that has CSV values in the database. So I will use the DataImportHandler and will index it using RegexTransformer's splitBy attribute. However, since this is the first time I am doing it, I just wanted to be sure if it will work for Facet Count? For

Re: PlainTextEntityProcessor and RegexTransformer in DataImport Handler

2011-12-26 Thread meghana
Thanks Matthew , Its really helped a lot. i am about to done with this. -- View this message in context: http://lucene.472066.n3.nabble.com/PlainTextEntityProcessor-and-RegexTransformer-in-DataImport-Handler-tp3608449p3612674.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: PlainTextEntityProcessor and RegexTransformer in DataImport Handler

2011-12-24 Thread Matthew Parker
> > if we can convert this as my desired result then it would be great. > Thanks in Adcance. > Meghana > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/PlainTextEntityProcessor-and-RegexTransformer-in-DataImport-Handler-tp3608449p3608726.html > S

Re: PlainTextEntityProcessor and RegexTransformer in DataImport Handler

2011-12-23 Thread meghana
: Enjoy if we can convert this as my desired result then it would be great. Thanks in Adcance. Meghana -- View this message in context: http://lucene.472066.n3.nabble.com/PlainTextEntityProcessor-and-RegexTransformer-in-DataImport-Handler-tp3608449p3608726.html Sent from the Solr - User mailing

PlainTextEntityProcessor and RegexTransformer in DataImport Handler

2011-12-23 Thread meghana
with RegexTransformer and TemplateTransformer for that as below. but i am receiving ConfigurationError when i set that. I like to add here that i am able do this using TemplateTransformer and multivalued field by setting foreach on

PlainTexttransformer and RegexTransformer in DataImport Handler

2011-12-23 Thread meghana
, RegexTransformer and TemplateTransformer for that as below. but i am receiving ConfigurationError when i set that. I like to add here that i am able do this using TempleteTransformer and multivalued field. but i need above format in

Re: RegexTransformer - need help with regex value

2011-09-14 Thread Pulkit Singhal
to match > even on the 1st regex=".*img src=.(.*)\.gif..alt=.*" and produce a > result like: > http://g-ecx.images-amazon.com/images/G/01/x-locale/common/customer-reviews/stars-5-0._V192240867_ > But it doesn't! > Can anyone tell me why that would be the case? > Is it something about the way RegexTransformer is wired or is it just > my regex value that isn't right? >

RegexTransformer - need help with regex value

2011-09-14 Thread Pulkit Singhal
roduce a result like: http://g-ecx.images-amazon.com/images/G/01/x-locale/common/customer-reviews/stars-5-0._V192240867_ But it doesn't! Can anyone tell me why that would be the case? Is it something about the way RegexTransformer is wired or is it just my regex value that isn't right?

Re: problem with RegexTransformer and delimited data

2010-04-13 Thread Lance Norskog
It would be nice if the RegexTransformer logged that the user does not know how to use the different parameters... On 4/13/10, Gerald wrote: > > AWESOME. may take me some time to understand the regex pattern but it > worked > > And many thanks for looking into RegexTransformer.p

RE: problem with RegexTransformer and delimited data

2010-04-13 Thread Gerald
-with-RegexTransformer-and-delimited-data-tp713846p716749.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: problem with RegexTransformer and delimited data

2010-04-13 Thread Steven A Rowe
hing like > "dataA1|^dataA2|?dataB1|^dataB2|?dataC1". > > to me this doesnt look like a regex pattern issue; instead looks more > like a > solr/lucene issue with regex. > > any other suggestions welcome. otherwise, will have to create custom > transformer -- View this message in context: > http://n3.nabble.com/problem-with- > RegexTransformer-and-delimited-data-tp713846p716206.html Sent from the > Solr - User mailing list archive at Nabble.com.

Re: problem with RegexTransformer and delimited data

2010-04-13 Thread Gerald
a regex pattern issue; instead looks more like a solr/lucene issue with regex. any other suggestions welcome. otherwise, will have to create custom transformer -- View this message in context: http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p716206.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: problem with RegexTransformer and delimited data

2010-04-13 Thread Gerald
tern issue; instead looks more like a solr/lucene issue with regex. any other suggestions welcome. otherwise, will have to create custom transformer -- View this message in context: http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p716203.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: problem with RegexTransformer and delimited data

2010-04-12 Thread Lance Norskog
t I DID use replaceWith="$1" in tests where the >> pattern >> was like "(.*)(\|.*)" in order to only get the first group >> -- >> View this message in context: >> http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p714636.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> > -- Lance Norskog goks...@gmail.com

Re: problem with RegexTransformer and delimited data

2010-04-12 Thread Erick Erickson
replaceWith="$1" in tests where the > pattern > was like "(.*)(\|.*)" in order to only get the first group > -- > View this message in context: > http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p714636.html > Sent from the Solr - User mailing list archive at Nabble.com. >

Re: problem with RegexTransformer and delimited data

2010-04-12 Thread Gerald
forgot to mention that I DID use replaceWith="$1" in tests where the pattern was like "(.*)(\|.*)" in order to only get the first group -- View this message in context: http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p714636.html Sent from the

problem with RegexTransformer and delimited data

2010-04-12 Thread Gerald
"(.*)(\|.*)", "(.*)\|(.*)", "(.*)\|.*", "(.*)(\|^.*)", ... -- View this message in context: http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p713846.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: RegexTransformer

2010-03-15 Thread blargy
ersTokenFilters. >> >> > Yes but only on the indexed values. Empty values will still be stored and > returned in the response unless you stop them from reaching the indexing > chain. > > -- > Regards, > Shalin Shekhar Mangar. > > -- View this message in context: http://old.nabble.com/RegexTransformer-tp27897870p27907090.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: RegexTransformer

2010-03-15 Thread Shalin Shekhar Mangar
On Mon, Mar 15, 2010 at 2:53 PM, Michael Kuhlmann < michael.kuhlm...@zalando.de> wrote: > On 03/15/10 08:56, Shalin Shekhar Mangar wrote: > > On Mon, Mar 15, 2010 at 2:12 AM, blargy wrote: > > > >> > >> How would I go about splitting a column by a certain delimiter AND > ignore > >> all > >> empt

Re: RegexTransformer

2010-03-15 Thread Michael Kuhlmann
On 03/15/10 08:56, Shalin Shekhar Mangar wrote: > On Mon, Mar 15, 2010 at 2:12 AM, blargy wrote: > >> >> How would I go about splitting a column by a certain delimiter AND ignore >> all >> empty matches. [...] > You will probably have to write a custom Transformer to remove empty values. > See ht

Re: RegexTransformer

2010-03-15 Thread Shalin Shekhar Mangar
On Mon, Mar 15, 2010 at 2:12 AM, blargy wrote: > > How would I go about splitting a column by a certain delimiter AND ignore > all > empty matches. > > For example: > > > > I have a some columns that dont have a value for values but so its getting > actually index as blank. I just want to totall

RegexTransformer

2010-03-14 Thread blargy
in context: http://old.nabble.com/RegexTransformer-tp27897870p27897870.html Sent from the Solr - User mailing list archive at Nabble.com.

DIH : RegexTransformer with groupNames requires all groups to be not empty?

2009-11-03 Thread Chantal Ackermann
Ok, I can confirm that the following configuration for RegexTransformer works as I would expect it: regex="[^\|]+\|\d+,\d+,\d+,(.+)" /> To the multivalued fields participant and role, values are only added if their corresponding regex matches. The following configuration

DIH : RegexTransformer with groupNames requires all groups to be not empty?

2009-11-03 Thread Chantal Ackermann
ion: - How can I achieve that the RegexTransformer adds a value only if it contains a non-empty value and avoiding at the same time that it only adds values when all of the groups contain values. Maybe the configuration with groupNames is meant to work like that. If that is the case, it

DIH : RegexTransformer with groupNames requires all groups to be not empty?

2009-11-03 Thread Chantal Ackermann
Dear all, my DIH config contains the following directive for the RegexTransformer: (this is SOLR 1.4.0 RC downloaded yesterday from Grant's URL) It expects input of the kind (version A): Daniel Radcliffe|24897,1,1,Harry Potter It should also work with (version B): Daniel Radcliffe|2489

Re: RegexTransformer debugging (DIH)

2008-10-16 Thread Noble Paul നോബിള്‍ नोब्ळ्
If it is a normal exception it is logged with the number of document where it failed and you can put it on debugger with start=&rows=1 We do not catch a throwable or Error so it gets slipped through. if you are adventurous enough wrap the RegexTranformer with your own and apply that say transform

RegexTransformer debugging (DIH)

2008-10-16 Thread Jon Baer
Is there a way to prevent this from occurring (or a way to nail down the doc which is causing it?): INFO: [news] webapp=/solr path=/admin/dataimport params={command=status} status=0 QTime=0 Exception in thread "Thread-14" java.lang.StackOverflowError at java.util.regex.Pattern$Single