Re: Warming queries and Solr Cloud - just curious ...

2013-03-27 Thread santoash
This is interesting. I'm looking into doing something similar too. 

Quick question: Would you be targeting each of the shards with exactly the same 
set of queries? 


On Mar 27, 2013, at 12:59 PM, Mark Miller  wrote:

> Yup. You only want to warm locally. We should add that to the wiki.
> 
> - Mark
> 
> On Mar 27, 2013, at 3:54 PM, Timothy Potter  wrote:
> 
>> When running in SolrCloud mode, does it make sense to disable distributed
>> mode for warming queries? i.e. distrib=false in my warming query config
>> 
>> I actually asked this on Erik's informative Webinar this morning but had to
>> drop off before I heard the answer ... so Erik might have answered this
>> already ;-)
>> 
>> My thinking here is that a hard commit gets sent around the cluster
>> automatically. Say I have 36 nodes (18 leaders and 18 replicas), on hard
>> commit, all 36 nodes will be warming up. If my warming queries are
>> distributed, then all nodes are going to be sending the same query
>> needlessly around the cluster 36 times - seems unnecessary.
>> 
>> Thoughts?
>> 
>> Cheers,
>> Tim
> 
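
For reference, the effect of distrib=false discussed above can be sketched in Java as the request parameters a local-only query carries. In solrconfig.xml the same distrib parameter would be set on each warming query entry. This is only an illustration: the class name, the buildUrl helper and the localhost core address are assumptions, not anything from the thread, and the sketch assumes Java 10 or later.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

final class LocalWarmingQuery {
    /**
     * Builds a /select URL that the receiving core answers from its own index only.
     * coreUrl is a hypothetical core address such as http://host:8983/solr/collection1.
     */
    static String buildUrl(String coreUrl, String query) {
        String q = URLEncoder.encode(query, StandardCharsets.UTF_8);
        // distrib=false keeps the node from fanning the query out to the other shards.
        return coreUrl + "/select?q=" + q + "&distrib=false";
    }

    public static void main(String[] args) {
        System.out.println(buildUrl("http://localhost:8983/solr/collection1", "*:*"));
    }
}
```

With 36 nodes each warming on commit, every node then runs the query once against its own index instead of triggering a cluster-wide request.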



How to find the routing algorithm used?

2013-05-15 Thread santoash
I'm trying to find out which routing algorithm (implicit/compositeId) is being 
used in my cluster. We are running Solr 4.1. I was expecting to see it in my 
clusterstate (based on a previous thread that someone else posted), but I don't 
see it there. Could someone please help?

Thanks!

Santoash



SolrCloud UpdateRequestHandler Error [ unsupported Content-Type application/octet-stream ]

2013-01-26 Thread santoash
I use HttpSolrServer (going through an external load balancer) to connect to my 
SolrCloud. When I try to index an object via httpSolrServer.addBean(), I get a 
400 error back, and on the SolrCloud node I see the following exception:


SEVERE: org.apache.solr.common.SolrException: Unsupported ContentType: 
application/octet-stream  Not in: [application/xml, text/csv, text/json, 
application/csv, application/javabin, text/xml, application/json]
    at org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:86)
    at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1699)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:455)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:276)
    at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1088)
    at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:360)
    at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
    at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
    at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:729)
    at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:405)
    at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:206)
    at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
    at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
    at org.mortbay.jetty.Server.handle(Server.java:324)
    at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:505)
    at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:843)
    at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:729)
    at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:211)
    at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:380)
    at org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228)
    at org.mortbay.thread.BoundedThreadPool$PoolThread.run(BoundedThreadPool.java:450




All the fields in my bean are either simple strings, arrays of strings, or 
integers. In my solrconfig.xml, I have the following in my updateHandler 
section:


  
  

  ..
  

 application/json
   
  
  

 application/csv
   
  

  
  

  true
  ignored_

  
  true
  links
  ignored_

  


I would appreciate any pointers on how to get past this issue. 

Thanks in advance!

Santoash


Re: SolrCloud UpdateRequestHandler Error [ unsupported Content-Type application/octet-stream ]

2013-01-26 Thread santoash
My bad. It turns out that I was using the HttpSolrServer from 3.6.1 on my client 
side. When I changed it to CommonsHttpSolrServer in 3.6.1 or HttpSolrServer in 
4.1, everything worked correctly. 


On Jan 26, 2013, at 4:18 PM, santoash  wrote:

> I use HttpSolrServer (going through an external loadbalancer) to connect to 
> my solrCloud. When I try to index an object via httpSolrServer.addBean(), I 
> get a 400 error back and in the SolrCloud node, I see the following exception:
> 
> 
> SEVERE: org.apache.solr.common.SolrException: Unsupported ContentType: 
> application/octet-stream  Not in: [application/xml, text/csv, text/json, 
> application/csv, application/javabin, text/xml, application/json]
>   at org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:86)
>   at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
>   at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
>   at org.apache.solr.core.SolrCore.execute(SolrCore.java:1699)
>   at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:455)
>   at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:276)
>   at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1088)
>   at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:360)
>   at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
>   at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:181)
>   at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:729)
>   at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:405)
>   at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:206)
>   at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
>   at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
>   at org.mortbay.jetty.Server.handle(Server.java:324)
>   at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:505)
>   at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:843)
>   at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:729)
>   at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:211)
>   at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:380)
>   at org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228)
>   at org.mortbay.thread.BoundedThreadPool$PoolThread.run(BoundedThreadPool.java:450
> 
> 
> 
> 
> All the fields in my bean are either simple strings, array of strings or 
> integers. In my solrconfig.xml, I have the following in my updateHandler 
> section:
> 
> 
>  
>   class="solr.BinaryUpdateRequestHandler" />
> 
>  ..
>  
>
> application/json
>   
>  
>  
>
> application/csv
>   
>  
> 
>  
>startup="lazy"
>  class="solr.extraction.ExtractingRequestHandler" >
>
>  true
>  ignored_
> 
>  
>  true
>  links
>  ignored_
>
>  
> 
> 
> I would appreciate any pointers on how to get past this issue. 
> 
> Thanks in advance!
> 
> Santoash



Re: How to find the routing algorithm used?

2013-05-16 Thread Santoash Rajaram
I tried looking for it there, but I don't see the word "router" anywhere in my 
clusterstate.

I'm trying to figure out the router info because I have duplicate documents in my 
cluster (documents with the same id). In the worst case I was expecting to see 
something like "router": "implicit", but I don't see anything.

Any ideas? 

Thanks!

-scr

On May 16, 2013, at 12:31 AM, Furkan KAMACI  wrote:

> In the admin GUI, click the "Cloud" link and then the "Tree" link. On the page
> that opens, choose clusterstate.json from the list. Scroll down to the end and
> you will see something like: "router":"compositeId"
> 
> 
> 
> 2013/5/16 santoash 
> 
>> I'm trying to find out which routing algorithm (implicit/compositeId) is
>> being used in my cluster. We are running Solr 4.1. I was expecting to see
>> it in my clusterstate (based on a previous thread that someone else posted),
>> but I don't see it there. Could someone please help?
>> 
>> Thanks!
>> 
>> Santoash
>> 
>> 
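
If the clusterstate.json content has been copied out of the admin UI, the check described above can also be scripted. The following is a minimal Java sketch, assuming the router is recorded in the string form quoted in this thread ("router":"compositeId"). The class and method names are made up for illustration, and the sample JSON strings are simplified stand-ins, not real cluster state.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class RouterLookup {
    // Matches the string form quoted in this thread: "router":"compositeId"
    private static final Pattern ROUTER =
        Pattern.compile("\"router\"\\s*:\\s*\"([^\"]+)\"");

    /** Returns the first router name found in the JSON text, or empty if none is recorded. */
    static Optional<String> findRouter(String clusterStateJson) {
        Matcher m = ROUTER.matcher(clusterStateJson);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        // Simplified, hypothetical samples: one with a router entry, one without.
        String withRouter = "{\"collection1\":{\"shards\":{},\"router\":\"compositeId\"}}";
        String withoutRouter = "{\"collection1\":{\"shards\":{}}}";
        System.out.println(findRouter(withRouter));
        System.out.println(findRouter(withoutRouter));
    }
}
```

An empty result corresponds to the situation reported above, where the word "router" does not appear in the clusterstate at all.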


Re: SOLR Num Docs vs NumFound

2013-03-15 Thread Santoash Rajaram
I don't have an answer, but I have seen this before too. I assumed it was an 
issue with the admin UI. In my case the number returned by the query looked 
closer to the truth than the one in the UI. I even tried a hard commit and an 
optimize via the admin UI. It didn't help. 

If you want to try hard commits, they can be done via configuration in 
solrconfig.xml (specify a frequency), through the SolrJ API, or through the 
/update URL. All of these are explained here:

http://wiki.apache.org/solr/UpdateXmlMessages#A.22commit.22_and_.22optimize.22
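
As a rough illustration of the /update route, here is a minimal Java sketch that posts the XML commit message described on that wiki page. The class name, helper names and the core URL you pass in are assumptions for illustration, and nothing here has been run against a real cluster.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.charset.StandardCharsets;

final class HardCommit {
    // XML commit message, as described on the UpdateXmlMessages wiki page.
    static final String COMMIT_BODY = "<commit/>";

    /** coreUrl is a hypothetical core address such as http://host:8983/solr/collection1. */
    static String updateUrl(String coreUrl) {
        return coreUrl + "/update";
    }

    /** Posts the commit message to the update handler and returns the HTTP status code. */
    static int sendCommit(String coreUrl) throws IOException {
        HttpURLConnection conn =
            (HttpURLConnection) URI.create(updateUrl(coreUrl)).toURL().openConnection();
        conn.setRequestMethod("POST");
        conn.setDoOutput(true);
        conn.setRequestProperty("Content-Type", "text/xml; charset=utf-8");
        try (OutputStream out = conn.getOutputStream()) {
            out.write(COMMIT_BODY.getBytes(StandardCharsets.UTF_8));
        }
        return conn.getResponseCode();
    }
}
```

Calling HardCommit.sendCommit with your own core URL should return 200 if the commit was accepted.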

-santoash

On Mar 14, 2013, at 9:48 PM, Nathan Findley  wrote:

> On my Solr 4 setup a *:* query returns a higher "NumFound" value than the 
> "Num Docs" value reported on the statistics page of collection1. Why is 
> that? My data is split across 3 data import handlers, where each handler has 
> the same type of data but the ids are guaranteed to be different.
> 
> Are some of my documents not hard committed? If so, how do I hard commit? 
> Otherwise, why are these numbers different?
> 
> -- 
> CTO
> Zenlok株式会社
>