Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread preetesh dubey
Nope! If you are doing grouping, then "matches" is always the total no. of results and "ngroups" is the number of groups. Every group can have some docs belonging to it, the number of which depends on the provided "group.limit" parameter. If you get the sum of all the docs of each group, then it's equival

Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread tamanjit.bin...@yahoo.co.in
Ummm.. I think there is some confusion here. As per my understanding, matches is the total no. of docs which the original query/filter query returned. On these docs grouping is done. So matches may not actually be equal to the total no. of docs returned in your result, post grouping. It's just a subset of th

Re: SolrCloud - issues running with embedded zookeeper ensemble

2012-02-02 Thread Mark Miller
Thanks Dipti! One thing that seems off is that you are passing the bootstrap_confdir param on each instance? Other than that though, the problem you are seeing is indeed a bug - though hidden if using localhost. I'll fix it here: https://issues.apache.org/jira/browse/SOLR-3091 Again, thanks f

Re: Federation in SolrCloud?

2012-02-02 Thread Mark Miller
So it sounds like what you want is partial results. We don't support that yet, but there is a JIRA issue for it. Currently we require that the full index is available - which means if you want to survive any given instance going down, you need to have a replica for each shard A and sh

Re: Federation in SolrCloud?

2012-02-02 Thread Darren Govoni
Thanks for the reply Mark. I did example A. One of the instances had zookeeper. If I shut down the other instance, all searches on the other (running) instance produced an error in the browser. I don't have the error handy but it was one line. Something like missing shard in collection IIRC.

MLT and Content Streams

2012-02-02 Thread bordercore
Hello. I'm having issues getting MLT to work with content streams. If I make a query for a term like 'tomcat' using the 'stream.body' parameter, I get back a valid response from Solr. I use a curl command like this: $ curl "http://host/solr/mlt?stream.body=tomcat&mlt.mintf=1" (I've set a defa

Re: solr to index php files

2012-02-02 Thread Ahmet Arslan
> I am a  beginner with Solr and would like to index > dynamic php files ( page.php?ID=233) and static php files > and .shtml files.  This is for a small website, which > hits a small MySql database on the backend, however some php > files are static and are not part of the database. > > Can you t

RE: solr to index php files

2012-02-02 Thread Reid, Stephen
Thanks Emmanuel, I should have been more clear about my use of the word 'dynamic', I actually meant PHP files that do not pull content from a database. I will take a look at the links you provided, ...thanks again for putting me on the right path. Steve -Original Message- From: Emmanue

Re: solr to index php files

2012-02-02 Thread Emmanuel Espina
What do you mean by static php files? As far as I know PHP is to make pages look dynamic. If you want to index dynamic pages as if they were just HTML you will have to download them, and add them to Solr. Programming a small program in SolrJ and using some HTTP library (http://hc.apache.org/httpclien

Re: SolrCloud - issues running with embedded zookeeper ensemble

2012-02-02 Thread Dipti Srivastava
Hi Mark, I am trying to set up on 4 ami's, where 3 of the instances will have the embedded ZK running. Here are the startup commands for all 4. - Note that on the 4th instance I do not have the ZK host and bootstrap conf dir specified. The 4th instance throws exception (earlier in this email chain

solr to index php files

2012-02-02 Thread Reid, Stephen
Hi , I am a beginner with Solr and would like to index dynamic php files ( page.php?ID=233) and static php files and .shtml files. This is for a small website, which hits a small MySql database on the backend, however some php files are static and are not part of the database. Can you tell m

Re: Federation in SolrCloud?

2012-02-02 Thread Mark Miller
On Feb 2, 2012, at 9:51 AM, dar...@ontrenet.com wrote: > Hi, > I want to use SolrCloud in a more federated mode rather than > replication. The failover is nice, but I am more interested in > increasing capacity of an index through horizontal scaling (shards). > > How can I configure shards such

Re: solr taking too long to update a document

2012-02-02 Thread Carlos Alberto Schneider
Thanks Tomás, We are trying many combinations on master/slave and commit configurations. If we don't find the solution, we'll try NRT. Once we've found an acceptable solution, we'll report back ... 2012/2/2 Tomás Fernández Löbbe > The problem is that in order to make the changes visible to th

Re: solr.VelocityResponseWriter error in version 3.5.0

2012-02-02 Thread Erik Hatcher
On Feb 2, 2012, at 14:54 , Dipti Srivastava wrote: > Hrik, > Thanks for your response. I have a question though... > Will I be able to access the solr/admin console after disabling the > velocity response writer? If so I can disable it. The admin console in Solr does not use the velocity response

Re: search returns 'categories' instead of url

2012-02-02 Thread Ahmet Arslan
> Ok let's assume we have the basic solr engine that's able to > search and > give result of urls...now from those pages, I would like to > know which > terms are the most mentioned, e.g. iPad, Samsung, > Candy...the list can be > long but we could decide to only output the top#20 or so. I use mlt

Re: search returns 'categories' instead of url

2012-02-02 Thread remi tassing
Sincere apologies for the lack of clarity! I'm probably misusing technical terms such as 'category' ... Ok let's assume we have the basic solr engine that's able to search and give result of urls...now from those pages, I would like to know which terms are the most mentioned, e.g. iPad, Samsung, Candy...th

Re: Help: nothing is searchable in Solr

2012-02-02 Thread Yuhao
Erick (sorry for missing the "c" previously :D), After playing around with the edismax query parser, I'm starting to like it. Originally I just wanted the simplest search feature to get started, but I can see that I might take advantage of edismax's field booster feature later. Turns out the tric

Re: solr.VelocityResponseWriter error in version 3.5.0

2012-02-02 Thread Dipti Srivastava
Hrik, Thanks for your response. I have a question though... Will I be able to access the solr/admin console after disabling the velocity response writer? If so I can disable it. Otherwise, could you please advise on where and which libraries need to be copied and if any changes in solrconfig.xml a

Re: Help: nothing is searchable in Solr

2012-02-02 Thread Erick Erickson
You're getting confused between default search fields and the dismax query parser. Look in your solrconfig.xml file and you'll see a request handler I think. Take a look at: http://wiki.apache.org/solr/DisMaxQParserPlugin I think this will do what you want. The catch-all field you mentioned is al

Re: Solr sort param behavior clarification

2012-02-02 Thread Erick Erickson
See below: On Thu, Feb 2, 2012 at 12:28 PM, tiuser123 wrote: > Hello Erick, > > > Erick Erickson wrote >> >> Lucene (where the sorting happens) assembles a list of all the unique >> *values* for a sort field and sorts the result set by comparing to that >> list. It doesn't sort all the documents

Re: Help: nothing is searchable in Solr

2012-02-02 Thread Yuhao
Erik, Thanks for your suggestions.  After I made all field names [a-zA-Z0-9_] and turned on debugQuery=true, I saw that the query was using something like "text^0.5", which is beyond my current comprehension.  I commented out those "^0.5" type settings in solrconfig.xml.  Now the search works a

Re: AW: Advanced search in solr

2012-02-02 Thread Chris Hostetter
: I just want to ask, for example for all products from the category "X" name "Samsung" before you do anything else, *PLEASE* start with the Solr tutorial, it should really help you understand a lot of basic concepts in Solr... http://lucene.apache.org/solr/tutorial.html Another resource you

Re: Bad Request (Solr + Weblogic + Oracle DB)

2012-02-02 Thread Erick Erickson
No, I just used a SolrJ program I had lying around that dummied up some fields for the default schema. I was just looking at how you were using the update request Best Erick On Thu, Feb 2, 2012 at 10:13 AM, rzoao wrote: > Hey guys, I appreciate everybody's help... > > I solved what was going

Re: Complex query, need filtering after query not before

2012-02-02 Thread Chris Hostetter
: 700ms. The problem is, we are using geofiltering and that is happening * : before* the queries, so we have to geofilter on the 1B docs to restrict our : set of docs first, and then do the query on a name field. But it seems that ... : I've looked at the "cache=false" param, and the "co

Re: solr.VelocityResponseWriter error in version 3.5.0

2012-02-02 Thread Erik Hatcher
Dipti - the fix is purely configuration. There is no later version of Solr officially released at this point, but it has been corrected on the branch that will eventually become 3.6 (as well as the 4.0/trunk branch). Follow the instructions mentioned below to fix this in your environment.

Re: solr.VelocityResponseWriter error in version 3.5.0

2012-02-02 Thread Dipti Srivastava
Hi Erik, The latest release from the apache site for Solr 3.5 still has this issue. Where can I get a drop that has the fix for this issue? Thanks, Dipti On 12/9/11 1:22 PM, "Erik Hatcher" wrote: >My bad. > >To clarify the issue here... the problem manifests itself only on Solr >3.5 specifically

Re: search returns 'categories' instead of url

2012-02-02 Thread Chris Hostetter
: > Another alternative solution would be to add a category field to the : > already crawled content. : > >> Let's say Solr is setup and can return relevant urls. What if I wanted : > to get the most cited terms from a predefined list, instead? It could be : > from a list of products, names, citi

Re: Solr sort param behavior clarification

2012-02-02 Thread tiuser123
Hello Erick, Erick Erickson wrote > > Lucene (where the sorting happens) assembles a list of all the unique > *values* for a sort field and sorts the result set by comparing to that > list. It doesn't sort all the documents per-se. > Hmm, still confused. What do you mean by "sorts by comparing

Re: Help: Creating another handler and template to display document attributes

2012-02-02 Thread Erik Hatcher
On Feb 2, 2012, at 11:17 , Yuhao wrote: > Erik, > > You were right! The space in "Entrez ID" was the problem. It works fine > after I got rid of all spaces and capital letters. > Now I just have to come up with a way to display the original field names in > the UI, which the users would pref

Re: Help: Creating another handler and template to display document attributes

2012-02-02 Thread Yuhao
Erik, You were right!  The space in "Entrez ID" was the problem.  It works fine after I got rid of all spaces and capital letters. Now I just have to come up with a way to display the original field names in the UI, which the users would prefer. Is there a way I can stick the display value (with

Re: Fuzzy search not giving expected results

2012-02-02 Thread Ahmet Arslan
> I have the following problem. I am doing a search by street > name, but I > would like to use the fuzzy capabilities of Solr. > > My example: > I search for street name:ALCALA (query: NVIAC:ALCALA) and I > get this result: > > 079 > 2807900132 > 28 > MADRID > ALCALA > CALLE > > > but... if I

Re: Help: Creating another handler and template to display document attributes

2012-02-02 Thread Erik Hatcher
There should only be one document matching that query (provided "Entrez ID" is your unique key field name). Using a space in a field name is perhaps the problem. It's very much best practice that fields have only [a-zA-Z0-9_] in them. Maybe that space isn't the issue though, but try &debugQuery=tru
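The field-name advice above can be checked mechanically. Here is a minimal sketch, assuming the rule is exactly "letters, digits and underscore only"; the helper names `is_safe_field_name` and `suggest_field_name` are made up for illustration and are not part of Solr.

```python
import re

# Assumed rule, taken from the advice above: letters, digits and underscore only.
SAFE_FIELD_NAME = re.compile(r"[a-zA-Z0-9_]+")


def is_safe_field_name(name: str) -> bool:
    """Return True if the whole name uses only the recommended characters."""
    return SAFE_FIELD_NAME.fullmatch(name) is not None


def suggest_field_name(name: str) -> str:
    """Hypothetical helper: replace each run of other characters with '_' and lowercase."""
    return re.sub(r"[^a-zA-Z0-9_]+", "_", name).lower()
```

With this, "Entrez ID" is flagged because of the space, and `suggest_field_name("Entrez ID")` gives a candidate replacement that passes the check. The original label would still have to be kept somewhere else for display.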

Re: Help: Creating another handler and template to display document attributes

2012-02-02 Thread Yuhao
Erik, Thanks for the slides.  I followed the example on pages 24-25 (maybe too rigidly).  The first line is giving me trouble: #set($doc= $response.results.get(0)) This will always get the first document in the search results, which happens to be the first document I indexed.  So, no matter wh

Fuzzy search not giving expected results

2012-02-02 Thread m0rt0n
Hello, I have the following problem. I am doing a search by street name, but I would like to use the fuzzy capabilities of Solr. My example: I search for street name:ALCALA (query: NVIAC:ALCALA) and I get this result: 079 2807900132 28 MADRID ALCALA CALLE but... if I do NVIAC:ALCALO~ I would
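For context on the fuzzy query above: fuzzy matching in Lucene is based on edit distance between the query term and indexed terms. The following is only a sketch of the plain Levenshtein distance, not Lucene's actual implementation, and it says nothing about the similarity threshold or analyzers in use.

```python
def edit_distance(a: str, b: str) -> int:
    """Classic Levenshtein distance, keeping only one row of the DP table at a time."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep when equal)
            ))
        previous = current
    return previous[-1]
```

By this measure ALCALO and ALCALA are one substitution apart, so if the fuzzy query does not match, the cause is more likely in how the field is analyzed or how the query is parsed than in the distance itself.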

Re: Bad Request (Solr + Weblogic + Oracle DB)

2012-02-02 Thread rzoao
Hey guys, I appreciate everybody's help... I solved what was going on: my schema.xml had a UniqueKey tag that was specifying the id field as required. Since I got the schema from the example folders, it came with those extra configs. Thanks for helping me!!! Erick, did you create a new scheme.x

Federation in SolrCloud?

2012-02-02 Thread darren
Hi, I want to use SolrCloud in a more federated mode rather than replication. The failover is nice, but I am more interested in increasing capacity of an index through horizontal scaling (shards). How can I configure shards such that they retain their own documents and don't replicate (or replic

Re: solr taking too long to update a document

2012-02-02 Thread Tomás Fernández Löbbe
The problem is that in order to make the changes visible to the user you have to issue a commit. If you commit with every user change (I assume you may have concurrent users) you may have many commits per second. That's too much for Solr, as each commit will flush a new segment, reopen an index sea
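One common way to reduce commit pressure is to buffer updates and commit once per batch instead of once per change. A rough sketch follows, assuming the caller supplies its own `send` and `commit` callables; the class `BatchedIndexer` is hypothetical and does not correspond to any SolrJ or Solr API.

```python
from typing import Callable, Dict, List


class BatchedIndexer:
    """Hypothetical wrapper: buffers documents and commits once per batch."""

    def __init__(self, send: Callable[[List[Dict]], None],
                 commit: Callable[[], None], batch_size: int = 100) -> None:
        self._send = send          # caller-supplied: pushes a list of docs to the index
        self._commit = commit      # caller-supplied: issues a single commit
        self._batch_size = batch_size
        self._buffer: List[Dict] = []

    def add(self, doc: Dict) -> None:
        """Queue a document; flush automatically when the batch is full."""
        self._buffer.append(doc)
        if len(self._buffer) >= self._batch_size:
            self.flush()

    def flush(self) -> None:
        """Send whatever is buffered and commit once; no-op when empty."""
        if not self._buffer:
            return
        self._send(self._buffer)
        self._commit()
        self._buffer = []
```

The trade-off is visibility: a change only becomes searchable when its batch is committed, so the batch size (or a timer, not shown here) decides how stale the index may be.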

Re: Solr sort param behavior clarification

2012-02-02 Thread Erick Erickson
First, sorting completely overrides scoring. So if you specify a sort, scoring is essentially ignored. If you specify more than one sort, they are applied in order. That is, any ties in the first sort parameter are broken by the second sort parameter. If all sort parameters specified tie, the inter
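The tie-breaking rule described above behaves like an ordinary multi-key sort. A small illustration in plain Python, with made-up `price` and `popularity` fields; it mimics something like sort=price asc,popularity desc and does not call Solr.

```python
from typing import Dict, List


def sort_docs(docs: List[Dict]) -> List[Dict]:
    """Order by price ascending; ties on price are broken by popularity descending."""
    return sorted(docs, key=lambda d: (d["price"], -d["popularity"]))


# Hypothetical documents, for illustration only.
DOCS = [
    {"id": "a", "price": 10, "popularity": 5},
    {"id": "b", "price": 5, "popularity": 1},
    {"id": "c", "price": 10, "popularity": 9},
    {"id": "d", "price": 5, "popularity": 7},
]
```

Here `sort_docs(DOCS)` puts the two price-5 documents first and uses popularity only to order documents that share a price. Note that no relevance score appears in the key at all, which matches the point that an explicit sort replaces ordering by score.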

Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread preetesh dubey
Hi, I am talking about post grouping only. In a response with matches=5017 and ngroups=4062, "ngroups" gives you the total no. of groups. If you iterate through all groups and do the sum of counts in each group, you will get the same number as in "matches". In the above example the total no. of groups is 4062 and the total no. of docs is 5017. If you get the su
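To make the different counts in this thread concrete, here is a sketch that reads them out of a grouped response already parsed into a Python dict. The layout (matches, ngroups, groups, doclist, numFound, docs) follows Solr's grouped response format; the sample values and the function name `grouping_counts` are invented for illustration.

```python
def grouping_counts(grouped_field: dict) -> dict:
    """Summarise the different counts available in one grouped-field result."""
    groups = grouped_field["groups"]
    return {
        "matches": grouped_field["matches"],
        "ngroups": grouped_field.get("ngroups"),  # only present with group.ngroups=true
        "sum_of_group_numFound": sum(g["doclist"]["numFound"] for g in groups),
        "docs_returned": sum(len(g["doclist"]["docs"]) for g in groups),
    }


# Invented sample: two groups, fetched with group.limit=2 and group.ngroups=true.
SAMPLE = {
    "matches": 6,
    "ngroups": 2,
    "groups": [
        {"groupValue": "A",
         "doclist": {"numFound": 4, "start": 0, "docs": [{"id": "1"}, {"id": "2"}]}},
        {"groupValue": "B",
         "doclist": {"numFound": 2, "start": 0, "docs": [{"id": "5"}, {"id": "6"}]}},
    ],
}
```

In this sample the per-group numFound values add up to "matches" (6), while the number of docs actually returned is 4 because group.limit caps each group at 2. The sums only cover the groups present in the response, so they match "matches" only when every group has been fetched.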

Re: Bad Request (Solr + Weblogic + Oracle DB)

2012-02-02 Thread Erick Erickson
I just tried your code in 3.x (with no DB connection) and it works fine, so it looks like something in your environment Longer term, it's a bad idea to commit (or even update) single documents, you can use the add(Collection wrote: > Hi rzao! > > I think this is the problem: > > > On 02.02.201

solr taking too long to update a document

2012-02-02 Thread Carlos Alberto Schneider
Good morning everyone, I'm working on a project using solr 3.5, one master and two slaves. We run a grails app, and it has an update function. When the user clicks the button, we search for the message to be updated, clone it using SolrJ, delete the old message and save the new one. We do this up

Re: Bad Request (Solr + Weblogic + Oracle DB)

2012-02-02 Thread Michael Kuhlmann
Hi rzao! I think this is the problem: On 02.02.2012 13:59, rzoao wrote: UpdateRequest req = new UpdateRequest(); req.setAction(AbstractUpdateRequest.ACTION.COMMIT, false, false);

Re: Bad Request (Solr + Weblogic + Oracle DB)

2012-02-02 Thread rzoao
Hey Preetesh Dubey, thanks for your help, I tried it that way and I'm getting the same error over here: Bad Request I'm wondering if it's not my schema.xml ... or something else that I don't know. Joao -- View this message in context: http://lucene.472066.n3.nabble.com/Bad-Request-Solr-Weblogic

Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread tamanjit.bin...@yahoo.co.in
No my question is post grouping. If group.limit=5 then a group may have 5 docs or less. So if i have to find out total number of docs returned post grouping i will have to see the count of docs in each group which may be any one of these - 5 or 4 or 3 or 2 or 1. So does that mean i have to iterate

Re: Bad Request (Solr + Weblogic + Oracle DB)

2012-02-02 Thread preetesh dubey
Instead of CommonsHttpSolrServer, I am using StreamingUpdateSolrServer and just doing "server.add(solrInputDocument)". /** StreamingUpdateSolrServer server = new StreamingUpdateSolrServer(solrurl, 2, indexingThreads); server.add(solrInputDocument); **/ On Thu, Feb 2, 2012 at 6:29 PM, rzoao wrote:

Bad Request (Solr + Weblogic + Oracle DB)

2012-02-02 Thread rzoao
Hello all, I'm new to solr and I'm having a hard time trying to configure the schema.xml for a simple table. I'm using WebLogic with standard configs and Oracle Database. I have a simple table with the following fields Fase varchar Campo varchar Antes varchar Depois varchar Regras varchar My sc

Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread preetesh dubey
"ngroups" in the XML response shows the total no. of groups and "matches" shows the actual number of docs. On Thu, Feb 2, 2012 at 6:23 PM, tamanjit.bin...@yahoo.co.in < tamanjit.bin...@yahoo.co.in> wrote: > You are right it does. Kind of missed this altogether. Thanks > > And any way to know the

Re: SolrReplication configuration with frequent deletes and updates

2012-02-02 Thread Erick Erickson
First of all, what evidence do you have that you even need to shard? 12 M documents is quite a small index by Solr standards, just test it and see. As far as replication, 10 minutes is probably a good place to start, but you can experiment with reducing it. I've often found that "real time" is usu

Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread tamanjit.bin...@yahoo.co.in
You are right it does. Kind of missed this altogether. Thanks And any way to know the actual number of docs returned post grouping? -- View this message in context: http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3709743.html Sent from the Solr - Use

Re: $doc.getFieldNames() - what determines the order of fields?

2012-02-02 Thread Erick Erickson
Why do you care? That is, what is it you're doing that is sensitive to the order of the return relative to the order defined? Best Erick On Wed, Feb 1, 2012 at 3:27 PM, Yuhao wrote: > $doc.getFieldNames() will give you a list of field names as defined in your > schema.xml file.  However, the or

Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread O. Klein
group.ngroups=true doesn't give you the desired result? -- View this message in context: http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3709711.html Sent from the Solr - User mailing list archive at Nabble.com.

error in indexing

2012-02-02 Thread leonardo2
Hi all, I have a NullPointerException in SolrJetty when using CommonsHttpSolrServer.add(SolrInputDocument) that I cannot solve. Please help me. From the core stats, the document does not appear to have been indexed. The stack trace is: 2-feb-2012 13.05.04 org.apache.solr.common.SolrException log GRAVE

Re: Controll of results output

2012-02-02 Thread tamanjit.bin...@yahoo.co.in
Try sorting on category. That should do. http://wiki.apache.org/solr/CommonQueryParameters#sort -- View this message in context: http://lucene.472066.n3.nabble.com/Controll-of-results-output-tp3709288p3709691.html Sent from the Solr - User

Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread tamanjit.bin...@yahoo.co.in
I tried using grouping. Actually what works for me is group.main=true. However there is an issue. Collapsing, as in the earlier version when it returned docs, it gave a count of number of docs returned post collapsing. But I could not find a similar feature in grouping.. For eg. if before groupin

Re: Is complex query like this possible?

2012-02-02 Thread Sergei Ananko
Hello, Chris. Thank you and Mikhail for the explanation. I'll think about changing the model of indexing to be able to handle this case. > : DIR:true > : PATH:/root/folder1/folder2/ > : NAME:folder3 > : SIZE:0 > ... > : DIR:false > : PATH:/root/folder1/folder2/folder3/ > : NAME:image.jpg

Solr sort param behavior clarification

2012-02-02 Thread tiuser123
Hello new user here, Would just like to clarify the behavior of the solr/lucene sort param. In this post: http://lucene.472066.n3.nabble.com/Lucene-sort-performance-roots-tp3102493p3104294.html I somehow got the impression that solr would do the sort only on the top ranking documents taken from