Re: Indexing files from HDFS

2017-10-12 Thread István
med responses from > the Cloudera folks, especially about Hue. > > Best, > Erick > > On Wed, Oct 11, 2017 at 6:05 AM, István wrote: > > Hi, > > > > I have Solr 4.10.3 part of a CDH5 installation and I would like to index > > huge amount of CSV files on

Indexing files from HDFS

2017-10-11 Thread István
Hi, I have Solr 4.10.3 part of a CDH5 installation and I would like to index huge amount of CSV files on HDFS. I was wondering what is the best way of doing that. Here is the current approach: data.csv: id, fruit 10, apple 20, orange Indexing with the following command using search-mr-1.0.0-cd

Re: Querying nested datastructures

2015-11-25 Thread István
rview/ I haven't > tried it, but it sounds like they implemented this approach. > - if you follow recent blog post, you see our favorite approach > http://blog.griddynamics.com/2013/09/solr-block-join-support.html > > Also, query time join {!join} and field collapsing are also alternati

Re: Querying nested datastructures

2015-11-25 Thread István
> As always, queries should drive your data model, so please specify what a > typical query might be like, in plain English. > > -- Jack Krupansky > > On Tue, Nov 24, 2015 at 4:39 AM, István wrote: > > > Hi all, > > > > I would like to find documents in a key-

Querying nested datastructures

2015-11-24 Thread István
Hi all, I would like to find documents in a key-value store (Riak) with Solr and I am running into a challenge. I have nested JSON documents with patent information. Patents have a one or many CPC ( http://www.cooperativepatentclassification.org/index.html) codes something like these: { // more

Re: WordDelimiterFilter and the dot character

2012-10-17 Thread Farkas István
ually query "123/2012 123 2012", which is not a term sequence that was indexed. -- Jack Krupansky -Original Message- From: Farkas István Sent: Wednesday, October 17, 2012 8:58 AM To: solr-user@lucene.apache.org Subject: WordDelimiterFilter and the dot character Hello, I'v

Re: WordDelimiterFilter and the dot character

2012-10-17 Thread Farkas István
not match the exact term sequence that was indexed. A query of "123 2012" would not split any terms and hence not generate the extra "preserved" term. But a query of "123/2012" would actually query "123/2012 123 2012", which is not a term sequence that

WordDelimiterFilter and the dot character

2012-10-17 Thread Farkas István
Hello, I've ran into an interesting problem. I am using Solr 3.5 on an Ubuntu server. I have some data with a code field, which contains some identifiers (mostly) in the following format: E.123/2012. I've set up a fieldType for this code field: | | If I search for the exact co