Thanks Otis,

I didn't know there was a Cloudera Search Mailing list. Will try it out.

-----Ursprüngliche Nachricht-----
Von: Otis Gospodnetic [mailto:otis.gospodne...@gmail.com] 
Gesendet: Dienstag, 7. Januar 2014 03:53
An: solr-user@lucene.apache.org
Betreff: Re: Index for csv-file created successfully, but no data is shown

Hi,

This may be a better question for the Cloudera Search mailing list.

Otis
--
Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch 
Support * http://sematext.com/


On Mon, Jan 6, 2014 at 11:06 AM, Huynh, Chi-Hao <hu...@initions.com> wrote:

> Dear solr users,
>
> I would appreciate if someone can help me out here. My goal is to 
> index a csv-file.
>
> First of all, I am using the CDH 5 beta distribution of Hadoop, which 
> includes solr 4.4.0, on a single node. I am following the hue tutorial 
> to index and search the data from the yelp dataset challenge 
> http://gethue.tumblr.com/post/65969470780/hadoop-tutorials-season-ii-7
> -how-to-index-and-search
> .
>
> Following the tutorial, I have uploaded the config files, including 
> the prepared schema.xml, to zookeeper via the solrctl-command:
> >solrctl instancedir --create reviews [path to conf]
>
> After this, I have created the collection via:
> >solrctl collection --create reviews -s 1
>
> This works fine, as I can see the collection created in the Solr Admin 
> Web UI and the instancedir in the zookeeper shell.
>
> Then, using the MapReduceIndexerTool and the provided morphline file 
> the index is created and uploaded to solr. According to the command 
> output the index was created successfully:
>
> 1481 [main] INFO  org.apache.solr.hadoop.MapReduceIndexerTool  - 
> Indexing
> 1 files using 1 real mappers into 1 reducers
> 52716 [main] INFO  org.apache.solr.hadoop.MapReduceIndexerTool  - Done.
> Indexing 1 files using 1 real mappers into 1 reducers took 51.233 secs
> 52774 [main] INFO  org.apache.solr.hadoop.GoLive  - Live merging of 
> output shards into Solr cluster...
> 52829 [pool-4-thread-1] INFO  org.apache.solr.hadoop.GoLive  - Live 
> merge
> hdfs://svr-hdp01:8020/tmp/load/results/part-00000 into 
> http://SVR-HDP01:8983/solr
> 53017 [pool-4-thread-1] INFO
>  org.apache.solr.client.solrj.impl.HttpClientUtil  - Creating new http 
> client, 
> config:maxConnections=128&maxConnectionsPerHost=32&followRedirects=fal
> se
> 53495 [main] INFO  org.apache.solr.hadoop.GoLive  - Committing live 
> merge...
> 53496 [main] INFO  org.apache.solr.client.solrj.impl.HttpClientUtil  - 
> Creating new http client, config:
> 53512 [main] INFO  org.apache.solr.common.cloud.ConnectionManager  - 
> Waiting for client to connect to ZooKeeper
> 53513 [main-EventThread] INFO
>  org.apache.solr.common.cloud.ConnectionManager  - Watcher 
> org.apache.solr.common.cloud.ConnectionManager@19014023name:ZooKeeperC
> onnection Watcher:SVR-HDP01:2181/solr got event WatchedEvent 
> state:SyncConnected type:None path:null path:null type:None
> 53513 [main] INFO  org.apache.solr.common.cloud.ConnectionManager  - 
> Client is connected to ZooKeeper
> 53514 [main] INFO  org.apache.solr.common.cloud.ZkStateReader  - 
> Updating cluster state from ZooKeeper...
> 53652 [main] INFO  org.apache.solr.hadoop.GoLive  - Done committing 
> live merge
> 53652 [main] INFO  org.apache.solr.hadoop.GoLive  - Live merging of 
> index shards into Solr cluster took 0.878 secs
> 53652 [main] INFO  org.apache.solr.hadoop.GoLive  - Live merging 
> completed successfully
> 53652 [main] INFO  org.apache.solr.hadoop.MapReduceIndexerTool  - 
> Succeeded with job: jobName:
> org.apache.solr.hadoop.MapReduceIndexerTool/MorphlineMapper, jobId:
> job_1388405934175_0013
> 53653 [main] INFO  org.apache.solr.hadoop.MapReduceIndexerTool  - Success.
> Done. Program took 53.719 secs. Goodbye.
>
> Now, when I go to the web UI and select the created core, I find the 
> core to be empty, there are 0 number of Docs and querying it bears no 
> result. My question is, if I have to upload the csv-file manually to 
> somewhere on the solr server as it seems as if the csv-file was parsed 
> and indexed successfully, but the data is missing that was indexed.
>
> I hope, the description of the problem was clear enough. Thanks a lot!
> Kind regards
>
> __________________
> initions AG
> Chi-Hao Huynh
> Weidestraße 120a
> D-22081 Hamburg
>
> t:   +49 (0) 40 / 41 49 60-62
> f:   +49 (0) 40 / 41 49 60-11
> e:  hu...@initios.com<mailto:hu...@initios.com>
> w: www.initions.com<http://www.initions.com>
> Vollständiger Name der Gesellschaft: initions innovative IT solutions 
> AG Sitz der Gesellschaft: Hamburg Handelsregister Hamburg B 83929
> Aufsichtsratsvorsitzender: Dr. Michael Leue
> Vorstand: Dr. Stefan Anschütz, André Paul Henkel, Dr. Helge Plehn
>
>

Reply via email to