Re: How to exclude a mimetype in tika?

2014-09-19 Thread Jorge Luis Betancourt Gonzalez
Which crawler are you using? On Sep 18, 2014, at 10:14 AM, keeblerh wrote: > eShard wrote >> Good afternoon, >> I'm using solr 4.0 Final >> I need movies "hidden" in zip files that need to be excluded from the >> index. >> I can't filter movies on the crawler because then I would have to exclude

Re: Error Instantiating UpdateRequestProcessorFactory

2014-09-19 Thread Shalin Shekhar Mangar
Sounds like a class loader issue. Try adding your jar to $SOLR_HOME/lib instead of tomcat lib. Also, upgrade to Solr 4.x, 3.6 is ancient! :) On Sat, Sep 20, 2014 at 1:13 AM, Allistair C wrote: > Hi all, > > I’m in a bit of a cul de sac with an issue, hope you can help. > > I am creating a custo

Re: Issue Adding Filter Query

2014-09-19 Thread Erick Erickson
Hmmm, I'd have to see the schema definition for your description field. For this, the admin/analysis page is very helpful. Here's my guess: Your analysis chain doesn't break the incoming tokens up quite like you think it is. Thus you have the tokens in your index like 'protein,' (notice the comma)
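To make the guess above concrete, here is a minimal sketch in plain Java. It does not use Solr's or Lucene's analysis classes; it only mimics a whitespace-only split and a split that also strips surrounding punctuation. The class name, method names, and sample text are all hypothetical and are not taken from the poster's schema or data.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only: this is NOT a Solr analyzer, just a
// plain-Java imitation of two tokenization strategies.
class WhitespaceTokenDemo {

    // Splits on whitespace only, so trailing punctuation stays on the token.
    static List<String> whitespaceTokens(String text) {
        return Arrays.asList(text.trim().split("\\s+"));
    }

    // Splits on whitespace, then strips leading and trailing punctuation
    // from each token (inner hyphens such as "acid-binding" are kept).
    static List<String> strippedTokens(String text) {
        List<String> out = new ArrayList<>();
        for (String token : whitespaceTokens(text)) {
            String cleaned = token.replaceAll("^\\p{Punct}+|\\p{Punct}+$", "");
            if (!cleaned.isEmpty()) {
                out.add(cleaned);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample value for a description field.
        String description = "fatty acid-binding protein, found in liver";
        System.out.println(whitespaceTokens(description));
        System.out.println(strippedTokens(description));
    }
}
```

With the whitespace-only split, the indexed term is 'protein,' rather than 'protein', so a query term without the comma has nothing to match. The admin/analysis page shows the same kind of token-by-token view for the real field type.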

Error Instantiating UpdateRequestProcessorFactory

2014-09-19 Thread Allistair C
Hi all, I’m in a bit of a cul de sac with an issue, hope you can help. I am creating a custom UpdateRequestProcessor. The Solr documentation details that I need to write a factory class subclassing UpdateRequestProcessorFactory and this should return an instance of my class that subclasses Upd

Re: Issue Adding Filter Query

2014-09-19 Thread aaguilar
Hello Erick, Thanks for the response. I tried adding the debug=True to the query, but I do not know exactly what I am looking for in the output. Would it be possible for you to look at the results? I would really appreciate it. I attached two files, one of them is with the filter query descrip

Result grouping using externalfilefield

2014-09-19 Thread Sudhakar Maddineni
Hi, I'm just trying to understand the grouping feature with SolrCloud 4.2, and I have a 4 node/2 shard cluster setup. I am trying to group my search results based on an externalfilefield that I defined. Here is my configuration: core1: == schema.xml: solrconfig.xml: And, created empty file

Re: Issue Adding Filter Query

2014-09-19 Thread Erick Erickson
Your very best friend here is attaching &debug=query to the URL and looking at the parsed query results. Upon occasion there's some One possible explanation is that description field has something like "fatty acid-binding some words protein" in which case your query "fatty acid-binding protein" wo
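As a rough sketch of what "attaching &debug=query to the URL" can look like when the URL is built in code, the following plain-Java helper URL-encodes the query and filter query and appends the debug parameter. The host, port, core name (collection1), and the /select handler path are assumptions for illustration; only debug=query and the filter query text come from the thread.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper: builds a select URL with debug=query appended.
class DebugQueryUrl {

    // coreUrl is assumed to look like http://host:port/solr/<core>.
    static String build(String coreUrl, String q, String fq) {
        try {
            return coreUrl + "/select"
                    + "?q=" + URLEncoder.encode(q, "UTF-8")
                    + "&fq=" + URLEncoder.encode(fq, "UTF-8")
                    + "&debug=query";
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a compliant JVM.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // Assumed local Solr instance and core name.
        System.out.println(build(
                "http://localhost:8983/solr/collection1",
                "*:*",
                "description:\"fatty acid-binding protein\""));
    }
}
```

Opening the printed URL in a browser returns the normal response plus a debug section, which is where the parsed form of the query can be compared with what was typed.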

field collapsing related problem

2014-09-19 Thread sankalp gupta
Hi I have about 15 fields in my solr schema but there are two fields lets say them field1 and field2 in my schema. For most searches I feel I have a perfect schema but for one use case it is not apt: *problem*: I have to group by column using field1 and then I have to search a particular value "a"

Issue Adding Filter Query

2014-09-19 Thread aaguilar
Hello All, I recently came across a problem when I tried using description:"fatty acid-binding protein" as a filter query when doing a query through the query interface for Solr in the Tomcat server. Using that filter query did not give me any results at all, however if I used description:"fatty

Re: running solr in debug through eclipse

2014-09-19 Thread Erick Erickson
Yeah, it's usually pretty daunting to know where to start, the codebase is kinda big. Even "start from junit test" is often daunting, there are a lot of them too. Others have given you good places to start, good luck! Erick On Fri, Sep 19, 2014 at 12:23 AM, Bernd Fehling wrote: > Just start at

Re: SolrCloud replication is push or pull style

2014-09-19 Thread Walter Underwood
This architecture is impossible. There is no such thing as “the” leader for a shard in SolrCloud. The leader is chosen dynamically. You must use a different design. Put all of SolrCloud outside the firewall and push the db updates to it. wunder Walter Underwood wun...@wunderwood.org http://obs

traversing Automaton in lucene 4.10

2014-09-19 Thread Dmitry Kan
Hi, the o.a.l.u.automaton.Automaton API has changed in Lucene 4.10 ( https://issues.apache.org/jira/secure/attachment/12651171/LUCENE-5752.patch ). The method getNumberedStates() was dropped and the class State no longer exists. How do I traverse an Automaton with the new API? Dmitry -- Dmitry Kan Blog

RE: How to exclude a mimetype in tika?

2014-09-19 Thread Allison, Timothy B.
One option (I think--answer is untested!) is to remove the parsers you don't want from the tika config file. Make sure to specify the tika.config file parameter in your ExtractingRequestHandler in Solr (https://wiki.apache.org/solr/ExtractingRequestHandler). In response to this question, I ju

Re: SolrCloud replication is push or pull style

2014-09-19 Thread Shawn Heisey
> I have a requirement for SolrCloud where leader(Where the indexing will > take place) will be network protected using a firewall and will not accept > any incoming request/data from replicas since replicas will be outside the > firewall. Inside the firewall is our leader node and the database ser

SolrCloud replication is push or pull style

2014-09-19 Thread pawan pandey
Hi, I have a requirement for SolrCloud where leader(Where the indexing will take place) will be network protected using a firewall and will not accept any incoming request/data from replicas since replicas will be outside the firewall. Inside the firewall is our leader node and the database server

Re: AND operator in multi valued fields

2014-09-19 Thread Alexandre Rafalovitch
Well, if it works, open source it. Could even become an official contribution. You are not the only one asking for this kind of features. Though your use case does seem to be a bit further out than most. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and new

Re: AND operator in multi valued fields

2014-09-19 Thread lboutros
I've just finished a first implementation of a CrossFieldSpanNearQuery and it just works perfectly :D I can now play with position increments and slops to get "exact" results within two multi valued fields. And for the 1st proposal, my user queries can be bigger than 10k with lots of different b

Re: AND operator in multi valued fields

2014-09-19 Thread Alexandre Rafalovitch
I do not think the queries have access to multiple fields at once. Did you check the API? But I am not sure why 1 would be so hard. You know what event field names are, so you just need to copy their conditions into subquery. You could probably even do that in custom search component and not even

JSON Query Format

2014-09-19 Thread Ankit Jain
Hi All, I am using the below piece of code to execute a query: public void solrSearch() throws SolrServerException { CloudSolrServer server = new CloudSolrServer("192.168.41.122:2181"); SolrQuery query = new SolrQuery(); *String categoryFilter = "(( name:" + "\"Gou

Re: AND operator in multi valued fields

2014-09-19 Thread lboutros
Thx Alex for your answer. 1) This could be tricky, because the application users write very complex combined queries with main document fields and event fields too. A custom parser does the abstraction. I think that could be very tricky to extract event part of a complex query in order to filter o

Re: AW: AW: slorj -> httpclient 4, but we already have httpclient 3 in use

2014-09-19 Thread Guido Medina
Hi Clemens, You just found the carrot, let us know when you find the stick :D We had to do some refactor due to new HTTP client way of handling multiple connections to a host, also BASIC authentication, from HTTP client 3 to 4 is just put the jar there and it would just work. We also use HTT

Re: AW: AW: slorj -> httpclient 4, but we already have httpclient 3 in use

2014-09-19 Thread Siegfried Goeschl
Lucky you :-) Siegfried Goeschl On 19.09.14 07:31, Clemens Wyss DEV wrote: I'd like to mention, that substituting the httpcore.jar with the latest (4.3) "sufficed"... -Original Message- From: Guido Medina [mailto:guido.med...@temetra.com] Sent: Thursday, 18 September 201

Re: running solr in debug through eclipse

2014-09-19 Thread Bernd Fehling
Just start at the UpdateHandler and follow it down the line. I would start at org/apache/solr/update/UpdateHandler.java If you already know if it is add, delete or update then start with AddUpdateCommand.java, DeleteUpdateCommand.java or UpdateCommand.java. Just follow the red line :-) Regards

Re: running solr in debug through eclipse

2014-09-19 Thread Shalin Shekhar Mangar
You need to look at DistributedUpdateProcessor and specifically the getUpdatedDocument method. On Fri, Sep 19, 2014 at 12:17 PM, Anurag Sharma wrote: > Thanks Bernd for your insight. > As of now, I am focussing to fix the issue in the updater but not able to > localize which code to look in for