Hi,

I was trying to use the Mapreduce Indexer tool from cloudera, to index my data 
in Hive table using Solr.


hadoop jar /path/to/lib/solr/contrib/mr/search-mr-*-job.jar  
org.apache.solr.hadoop.MapReduceIndexerTool -Djute.maxbuffer=<buff 
size>--morphline-file /path/to/morphlines.conf --output-dir 
hdfs://path/to/output/dir --reducers -1 --mappers -1 --verbose --go-live 
--zk-host <zookeeperHostIP>:2181/solr --shards 2 --collection <collection name> 
hdfs://location/of/hive/table

My MR job runs successfully and I am able to view _SUCCESS flag in the 
specified output loc

 hadoop fs -ls /path/to/output/results
Found 2 items
-rwxrwx--x+  3 hive hive          0 2016-07-26 11:35 
/path/to/output/results/_SUCCESS
drwxrwx--x+  - hive hive          0 2016-07-26 11:20 
/path/to/output/results/part-00000

But my go-live option is not working.

Exception:
java.util.concurrent.ExecutionException: 
org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: Expected 
mime type application/octet-stream but got text/html

I also tried using jaas-client.conf

Client {
 com.sun.security.auth.module.Krb5LoginModule required
 useKeyTab=false
 useTicketCache=true
 principal="<MyID@DOMAIN>";
 };

So, before executing the Mapreduce job, HADOOP_OPTS was set to the 
jass-client.conf
export HADOOP_OPTS="-Djava.security.auth.login.config=/path/to/jaas-client.conf"


1. What could be the issue?
2. Am I missing something?
3. Since I have my data indexed locally, is there a way to perform the go-live 
option separately?

Thanks in advance

Archana





Reply via email to