I'm experimenting with Solr5 (5.1.0 1672403 - timpotter - 2015-04-09
10:37:54). In my custom DIH, I use a RegExTransformer to load several
columns, which may or may not be present. If present, the regexp
matches and the data loads correctly in both Solr4 and 5. If not present
and the r
construct wikipedia page url from page title using
> regexTransformer
> with
>
>
> sourceColName="title" />
>
> This does not work for titles that have no space, so title_underscore for
> them is empty.
>
> Any ideas what is wrong here?
>
> This is with solr-4.8.1
>
> Thanks. Alex.
>
--
Regards,
Shalin Shekhar Mangar.
Hello,
I try to construct wikipedia page url from page title using regexTransformer
with
This does not work for titles that have no space, so title_underscore for them
is empty.
Any ideas what is wrong here?
This is with solr-4.8.1
Thanks. Alex.
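
For reference, the intended result can be sketched outside DIH. This is only an illustration in Python of the expected mapping, not how RegexTransformer is implemented; the base URL and the function names are assumptions, since the original configuration is not shown. A title without spaces should pass through unchanged rather than come out empty.

```python
import re

# Assumed base URL; the original post does not show the one it used.
WIKI_BASE = "https://en.wikipedia.org/wiki/"


def title_to_underscore(title: str) -> str:
    """Replace each run of whitespace in the title with one underscore."""
    return re.sub(r"\s+", "_", title.strip())


def wikipedia_url(title: str) -> str:
    """Build a page URL from a title (hypothetical helper)."""
    return WIKI_BASE + title_to_underscore(title)
```

With this sketch, "Apache Solr" becomes "Apache_Solr", and a single-word title such as "Lucene" is returned as-is.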
Good afternoon,
I have this DIH:
https://redacted/";
processor="XPathEntityProcessor"
forEach="/rss/channel/item"
transformer="DateFormatTransformer,Templa
the facet component, but first you
> should describe exactly what your requirements are.
>
> -- Jack Krupansky
> -Original Message- From: Raheel Hasan
> Sent: Friday, September 06, 2013 9:58 AM
> To: solr-user@lucene.apache.org
> Subject: Facet Count and RegexTransformer&
in the facet component, but first you
>> should describe exactly what your requirements are.
>>
>> -- Jack Krupansky
>> -Original Message- From: Raheel Hasan
>> Sent: Friday, September 06, 2013 9:58 AM
>> To: solr-user@lucene.apache.org
>> Subject:
here - are the commas delimiting fields or do you
>> have one value per row?
>>
>> Yes, you can tokenize a comma-delimited value in Solr.
>>
>>
>> -- Jack Krupansky
>>
>> -Original Message- From: Raheel Hasan
>> Sent: Friday, Septemb
ave one value per row?
>
> Yes, you can tokenize a comma-delimited value in Solr.
>
>
> -- Jack Krupansky
>
> -Original Message- From: Raheel Hasan
> Sent: Friday, September 06, 2013 11:54 AM
> To: solr-user@lucene.apache.org
> Subject: Re: Facet Count and RegexTr
e.org
Subject: Re: Facet Count and RegexTransformer>splitBy
Hi,
What I want is very simple:
The "query" results:
row 1 = a,b,c,d
row 2 = a,f,r,e
row 3 = a,c,ff,e,b
..
facet count needed:
'a' = 3 occurrence
'b' = 2 occur.
'c' = 2 occur.
.
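
The counts asked for above can be sketched in plain Python. This is only an illustration of the expected numbers, assuming each row is one document and each comma-separated value is counted once per document; it does not use Solr, and the function name is hypothetical.

```python
from collections import Counter

# The three example rows from the message above.
rows = ["a,b,c,d", "a,f,r,e", "a,c,ff,e,b"]


def facet_counts(rows, sep=","):
    """Count, for each value, how many rows contain it."""
    counts = Counter()
    for row in rows:
        # A set counts each value once per row, as a facet count does per document.
        values = {v.strip() for v in row.split(sep) if v.strip()}
        counts.update(values)
    return counts


counts = facet_counts(rows)
```

Here `counts["a"]` is 3, and `counts["b"]` and `counts["c"]` are both 2, matching the numbers requested.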
, September 06, 2013 9:58 AM
To: solr-user@lucene.apache.org
Subject: Facet Count and RegexTransformer>splitBy
Hi guyz,
Just a quick question:
I have a field that has CSV values in the database. So I will use the
DataImportHandler and will index it using RegexTransformer's splitBy
attribute. Howeve
Hi guyz,
Just a quick question:
I have a field that has CSV values in the database. So I will use the
DataImportHandler and will index it using RegexTransformer's splitBy
attribute. However, since this is the first time I am doing it, I just
wanted to be sure if it will work for Facet Count?
For
Thanks Matthew,
It really helped a lot. I am almost done with this.
--
View this message in context:
http://lucene.472066.n3.nabble.com/PlainTextEntityProcessor-and-RegexTransformer-in-DataImport-Handler-tp3608449p3612674.html
Sent from the Solr - User mailing list archive at Nabble.com.
>
> If we can convert this to my desired result, that would be great.
> Thanks in advance.
> Meghana
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/PlainTextEntityProcessor-and-RegexTransformer-in-DataImport-Handler-tp3608449p3608726.html
> S
: Enjoy
If we can convert this to my desired result, that would be great.
Thanks in advance.
Meghana
--
View this message in context:
http://lucene.472066.n3.nabble.com/PlainTextEntityProcessor-and-RegexTransformer-in-DataImport-Handler-tp3608449p3608726.html
Sent from the Solr - User mailing
with RegexTransformer
and TemplateTransformer for that as below, but I am receiving a
ConfigurationError when I set that.
I would like to add that I am able to do this using TemplateTransformer and a
multivalued field by setting foreach on
, RegexTransformer and
TemplateTransformer for that as below, but I am receiving a ConfigurationError
when I set that.
I would like to add that I am able to do this using TemplateTransformer and a
multivalued field, but I need the above format in
to match
> even on the 1st regex=".*img src=.(.*)\.gif..alt=.*" and produce a
> result like:
> http://g-ecx.images-amazon.com/images/G/01/x-locale/common/customer-reviews/stars-5-0._V192240867_
> But it doesn't!
> Can anyone tell me why that would be the case?
> Is it something about the way RegexTransformer is wired or is it just
> my regex value that isn't right?
>
roduce a
result like:
http://g-ecx.images-amazon.com/images/G/01/x-locale/common/customer-reviews/stars-5-0._V192240867_
But it doesn't!
Can anyone tell me why that would be the case?
Is it something about the way RegexTransformer is wired or is it just
my regex value that isn't right?
It would be nice if the RegexTransformer logged that the user does not
know how to use the different parameters...
On 4/13/10, Gerald wrote:
>
> AWESOME. may take me some time to understand the regex pattern but it
> worked
>
> And many thanks for looking into RegexTransformer.p
-with-RegexTransformer-and-delimited-data-tp713846p716749.html
Sent from the Solr - User mailing list archive at Nabble.com.
hing like
> "dataA1|^dataA2|?dataB1|^dataB2|?dataC1".
>
> to me this doesnt look like a regex pattern issue; instead looks more
> like a
> solr/lucene issue with regex.
>
> any other suggestions welcome. otherwise, will have to create custom
> transformer -- View this message in context:
> http://n3.nabble.com/problem-with-
> RegexTransformer-and-delimited-data-tp713846p716206.html Sent from the
> Solr - User mailing list archive at Nabble.com.
a regex pattern issue; instead looks more like a
solr/lucene issue with regex.
any other suggestions welcome. otherwise, will have to create custom
transformer
--
View this message in context:
http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p716206.html
Sent from the Solr - User mailing list archive at Nabble.com.
tern issue; instead looks more like a
solr/lucene issue with regex.
any other suggestions welcome. otherwise, will have to create custom
transformer
--
View this message in context:
http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p716203.html
Sent from the Solr - User mailing list archive at Nabble.com.
t I DID use replaceWith="$1" in tests where the
>> pattern
>> was like "(.*)(\|.*)" in order to only get the first group
>> --
>> View this message in context:
>> http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p714636.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>
--
Lance Norskog
goks...@gmail.com
replaceWith="$1" in tests where the
> pattern
> was like "(.*)(\|.*)" in order to only get the first group
> --
> View this message in context:
> http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p714636.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
forgot to mention that I DID use replaceWith="$1" in tests where the pattern
was like "(.*)(\|.*)" in order to only get the first group
--
View this message in context:
http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p714636.html
Sent from the
"(.*)(\|.*)", "(.*)\|(.*)", "(.*)\|.*",
"(.*)(\|^.*)", ...
--
View this message in context:
http://n3.nabble.com/problem-with-RegexTransformer-and-delimited-data-tp713846p713846.html
Sent from the Solr - User mailing list archive at Nabble.com.
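
As a side note on patterns of this kind, the greedy `(.*)` is worth checking in isolation. The sketch below uses Python's `re` module rather than Java's, and the sample string is borrowed from elsewhere in the thread as an assumed input, so it shows general regex behaviour only and is not a statement about what RegexTransformer does internally. Python writes the group reference as `\1` where the DIH attribute uses `$1`.

```python
import re

# Assumed sample input, in the shape quoted elsewhere in this thread.
sample = "dataA1|^dataA2|?dataB1|^dataB2|?dataC1"

# Greedy: (.*) runs to the LAST pipe, so group 1 keeps most of the string.
greedy = re.sub(r"(.*)(\|.*)", r"\1", sample)

# Lazy: (.*?) stops at the FIRST pipe, so group 1 is only the first field.
lazy = re.sub(r"(.*?)(\|.*)", r"\1", sample)

# A negated character class gives the same first field without backtracking.
first_field = re.sub(r"([^|]*)\|.*", r"\1", sample)
```

With the greedy pattern only the final `|?dataC1` is dropped, while the lazy and character-class variants both leave `dataA1`.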
ersTokenFilters.
>>
>>
> Yes but only on the indexed values. Empty values will still be stored and
> returned in the response unless you stop them from reaching the indexing
> chain.
>
> --
> Regards,
> Shalin Shekhar Mangar.
>
>
--
View this message in context:
http://old.nabble.com/RegexTransformer-tp27897870p27907090.html
Sent from the Solr - User mailing list archive at Nabble.com.
On Mon, Mar 15, 2010 at 2:53 PM, Michael Kuhlmann <
michael.kuhlm...@zalando.de> wrote:
> On 03/15/10 08:56, Shalin Shekhar Mangar wrote:
> > On Mon, Mar 15, 2010 at 2:12 AM, blargy wrote:
> >
> >>
> >> How would I go about splitting a column by a certain delimiter AND
> ignore
> >> all
> >> empt
On 03/15/10 08:56, Shalin Shekhar Mangar wrote:
> On Mon, Mar 15, 2010 at 2:12 AM, blargy wrote:
>
>>
>> How would I go about splitting a column by a certain delimiter AND ignore
>> all
>> empty matches.
[...]
> You will probably have to write a custom Transformer to remove empty values.
> See ht
On Mon, Mar 15, 2010 at 2:12 AM, blargy wrote:
>
> How would I go about splitting a column by a certain delimiter AND ignore
> all
> empty matches.
>
> For example:
>
>
>
> I have some columns that don't have a value, so they are actually getting
> indexed as blank. I just want to totall
in context:
http://old.nabble.com/RegexTransformer-tp27897870p27897870.html
Sent from the Solr - User mailing list archive at Nabble.com.
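
The filtering that such a custom Transformer would need to do can be sketched as follows. This is only the split-and-drop-empties logic in Python, under the assumption that the delimiter is a literal string; an actual DIH Transformer would be a Java class, and the function name here is hypothetical.

```python
import re


def split_non_empty(value, delimiter=","):
    """Split on a literal delimiter and drop empty or whitespace-only parts."""
    # re.escape makes delimiters such as "|" safe to use as a literal.
    parts = re.split(re.escape(delimiter), value)
    return [p.strip() for p in parts if p.strip()]
```

For example, `split_non_empty("a,,b, ,c,")` keeps only `a`, `b` and `c`, and an empty input yields an empty list, so nothing blank would reach the index.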
Ok, I can confirm that the following configuration for RegexTransformer
works as I would expect it:
regex="[^\|]+\|\d+,\d+,\d+,(.+)" />
To the multivalued fields participant and role, values are only added if
their corresponding regex matches.
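
The regex itself can be checked in isolation. The sketch below assumes it is applied to a line in the "version A" format from the original post (`Daniel Radcliffe|24897,1,1,Harry Potter`) and uses Python's `re` module, so it illustrates the pattern only, not the transformer. The helper name is hypothetical.

```python
import re

# The regex from the configuration line above.
ROLE_RE = re.compile(r"[^\|]+\|\d+,\d+,\d+,(.+)")


def extract_role(line):
    """Return the captured group, or None when the regex does not match."""
    m = ROLE_RE.search(line)
    return m.group(1) if m else None


role = extract_role("Daniel Radcliffe|24897,1,1,Harry Potter")
missing = extract_role("Daniel Radcliffe|24897")
```

The first call captures `Harry Potter`; the second returns `None`, which corresponds to the case where no value should be added.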
The following configuration
ion:
- How can I make the RegexTransformer add a value only if the value is
non-empty, without it adding values only when all of the groups
contain values?
Maybe the configuration with groupNames is meant to work like that. If
that is the case, it
Dear all,
my DIH config contains the following directive for the RegexTransformer:
(this is SOLR 1.4.0 RC downloaded yesterday from Grant's URL)
It expects input of the kind (version A):
Daniel Radcliffe|24897,1,1,Harry Potter
It should also work with (version B):
Daniel Radcliffe|2489
If it is a normal exception, it is logged with the number of the document
where it failed, and you can run it under a debugger with start=&rows=1
We do not catch a Throwable or Error, so it slips through.
If you are adventurous enough, wrap the RegexTransformer with your own
and apply that say transform
Is there a way to prevent this from occurring (or a way to nail down
the doc which is causing it?):
INFO: [news] webapp=/solr path=/admin/dataimport
params={command=status} status=0 QTime=0
Exception in thread "Thread-14" java.lang.StackOverflowError
at java.util.regex.Pattern$Single