Re: Wildcard query vs facet.prefix for autocomplete?

2012-07-17 Thread santamaria2
I'll consider using the other methods, but I'd like to know which would be faster among the two approaches mentioned in my opening post. -- View this message in context: http://lucene.472066.n3.nabble.com/Wildcard-query-vs-facet-prefix-for-autocomplete-tp3995199p3995458.html Sent from the Solr -

ICUCollation throws exception

2012-07-17 Thread Oliver Schihin
Hello According to release notes from 4.0.0-ALPHA, SOLR-2396, I replaced ICUCollationKeyFilterFactory with ICUCollationField in our schema. But this throws an exception, see the following excerpt from the log: Jul 16, 2012 5:27:48 PM org.apache.solr.common.

RE: Query facet count and its matching documents

2012-07-17 Thread Gnanakumar
Any ideas on this? > We're running Apache Solr v3.1 and SolrJ is our client. > > We're passing multiple Arbitrary Faceting Query (facet.query) to get the > number of matching documents (the facet count) evaluated over the search > results in a *single* Solr query. My use case demands the actual m

Re: Error 404 on every request

2012-07-17 Thread Nils Abegg
Hey Guys, I'm trying to get solr running. I got it installed and I can access the admin dashboard, but if I try to index some docs, i always get a 404 Error. I tried it with the following URLs: http://mydomain/solr/update/json http://mydomain/solr/mycore/update/json http://mydomain/update/json ht

Re: Error 404 on every request

2012-07-17 Thread Yonik Seeley
On Tue, Jul 17, 2012 at 6:01 AM, Nils Abegg wrote: > I have installed the 4.0 Alpha with the build-in Jetty Server on Ubuntu > Server 12.04…i followed this tutorial to set it up: > http://kingstonlabs.blogspot.de/2012/06/installing-solr-36-on-ubuntu-1204.html Instead of trying to "install" Solr,

Re: Error 404 on every request

2012-07-17 Thread Nils Abegg
Same issue with the stock server….i followed the steps of the wiki. XML via post.jar its working, JSON via Curl is not. Am 17.07.2012 um 12:05 schrieb Yonik Seeley: > On Tue, Jul 17, 2012 at 6:01 AM, Nils Abegg wrote: >> I have installed the 4.0 Alpha with the build-in Jetty Server on Ubuntu >>

Re: Grouping performance problem

2012-07-17 Thread Agnieszka Kukałowicz
Hi, I made some more tests to find what exaclty slows the queries. During debugging queries I found that queries using group.facet=true are much more slower than queries without it. For example: query with group.facet=true: 4524.0 1.0 878.0 3449.0 query without group.facet=true: < double

Re: Lost answers?

2012-07-17 Thread Bruno Mannina
Hi Michael, My answer are below, Thanks ! Le 16/07/2012 21:22, Michael Della Bitta a écrit : Hello Bruno, Jetty is a legitimate choice. I do, however, worry that you might be masking an underlying problem by making that choice, without a guarantee that it won't someday hurt you even if you us

Re: Error 404 on every request

2012-07-17 Thread Nils Abegg
Ok, i got it working with path /update not /update/json But it feels somewhat fishy to have solr sitting in my home dir.

AW: DIH XML configs for multi environment

2012-07-17 Thread Markus Klose
Hi There is one more approach using the property mechanism. You could specify the datasource like this: And you can specifiy the properties in the solr.xml in your core configuration like this: Viele Grüße aus Augsburg Markus Klose SHI Elektronische Medien GmbH  

Re: Error 404 on every request

2012-07-17 Thread Erik Hatcher
/update/json was removed from the example configuration in 4.0 because /update now handles content based on content-type internally. It may not be spelled out as clearly as it should be, but here's the CHANGES entry for it: * SOLR-2857: Support XML,CSV,JSON, and javabin in a single RequestHandl

Re: Solr facet multiple constraint

2012-07-17 Thread Erick Erickson
OK, maybe I'm finally getting it. When you do a facet.field=blahblah, you're telling Solr to take all the documents that match the query, look in field blahblah, and tally the documents that match _any_ value in the field. There's no restriction at all on the _values_ that that tally is made for.

Re: When shall index be split over shards?

2012-07-17 Thread Erick Erickson
not really. It's a matter of when your system starts to bog down, and unfortunately there's no good way to give general guidance, especially on a number like size of the index. 90% of the index size could be stored data (*.fdt and *.fdx files) that have no bearing on search requirements. My ad

Re: are stopwords indexed?

2012-07-17 Thread Erick Erickson
Two things: 1> did you re-index after you got your stopwords file set up? And I'd blow away the index directory before re-indexing. 2> If you _store_ your field, the stopwords will be in your results lists, but _not_ in your index. As a secondary check, try going into your admin/schema browser

Re: Metadata and FullText, indexed at different times - looking for best approach

2012-07-17 Thread Erick Erickson
In that case, I think your best option is to re-index the entire document when you have the text available, metadata and all. Which actually begs the question whether you want to index the bare metadata at all. Is it the use-case that the user actually gets value when there's no text? If not, forge

Re: Solr 3.5 DIH delta-import replicating full index or Admin UI problem?

2012-07-17 Thread Erick Erickson
What you're seeing is the replication of the changed segments (new segments actually). Replication only moves new or merged segments and they will be a varying portion of the total index. If you optimized, you'd see the entire index be moved (but you don't need to do that!). You should be able to

Result docs missing only when shards parameter present in query?

2012-07-17 Thread Bill Havanki
I had the same problem as the original poster did two years ago (!), but with Solr 3.4.0: > I cannot get hits back and do not get a correct total number of records when using shard searching. When performing a sharded query, I would get empty / missing results - no documents at all. Querying each

Indexing data in csv format

2012-07-17 Thread gopes
Hi , I am trying to index data in csv format. But while indexing I get this following message - HTTP ERROR 404 Problem accessing /solr/update/csv. Reason: NOT_FOUND/Powered by Jetty:/// solrconfig.xml has the following entries for CSVRequestHandler ; true publish_date " Thanks, Saral

Disable cache ?

2012-07-17 Thread Bruno Mannina
Hi Solr Users, I would like for my test disable the cache fonction, so I modified all information concerning cache in solrconfig.xml but after restarting my Tomcat cache is always here. Do you think I forgot something? Requests are done with QTime=1 or QTime=0 and with this rapidity my progra

Re: Disable cache ?

2012-07-17 Thread Tomás Fernández Löbbe
I think you could disable Solr caches by setting their size to 0 (deleting them won't work, as for example, the FieldValueCache will take default values, not sure about the other ones). I don't think you'll be able to disable Lucene's Field Cache. What's the test that you want to run? Why do you n

Re: Disable cache ?

2012-07-17 Thread lboutros
Hi Bruno, don't forget the OS disk cache. On linux you can clear it with this tiny script : #!/bin/bash sync && echo 3 > /proc/sys/vm/drop_caches Ludovic. - Jouve France. -- View this message in context: http://lucene.472066.n3.nabble.com/Disable-cache-tp3995575p3995589.html Sent from

Re: edismax not working in a core

2012-07-17 Thread Richard Frovarp
On 07/14/2012 05:32 PM, Erick Erickson wrote: Really hard to say. Try executing your query on the cores with &debugQuery=on and compare the parsed results (for this you can probably just ignore the explain bits of the output, concentrate on the parsed query). Okay, for the example core from th

solr home in jar?

2012-07-17 Thread Matt Mitchell
Hi, I'd like to bundle up a jar file, with a complete solr home and index. This jar file is a dependency for another application, which uses an instance of embedded solr, multi-core. Is there any way to have the application's embedded solr, read the configs/index data from jar dependency? I attem

java.lang.AssertionError: System properties invariant violated.

2012-07-17 Thread Roman Chyla
Hello, (Please excuse cross-posting, my problem is with a solr component, but the underlying issue is inside the lucene test-framework) I am porting 3x unittests to the solr/lucene trunk. My unittests are OK and pass, but in the end fail because the new rule checks for modifier properties. I know

Re: Metadata and FullText, indexed at different times - looking for best approach

2012-07-17 Thread Alexandre Rafalovitch
Thank you, Re-index does look like a real option then. I am looking now at storing text/files in MongoDB or like and indexing into SOLR from that. Initially, I was going to skip the DB part for as long as possible. Regarding the use case, yes it does make sense to have just metadata. It is rich,

Re: Using Solr 3.4 running on tomcat7 - very slow search

2012-07-17 Thread Mou
Brian, Thanks again. swappiness is set to 60 and from vmstat , I can see no swapping is going on. Also I am using fusion IO SSD for storing my index. I also used the visualVM and it shows me that it is blocked on lock=org.apache.lucene.index.SegmentCoreReaders@299172a7. Any clue? On Mon, Jul 1

Could I use Solr to index multiple applications?

2012-07-17 Thread Zhang, Lisheng
Hi, We have an application where we index data into many different directories (each directory is corresponding to a different lucene IndexSearcher). Looking at Solr config it seems that Solr expects there is only one indexed data directory, can we use Solr for our application? Thanks very

RE: SOLR 4 Alpha Out Of Mem Err

2012-07-17 Thread Nick Koton
After trying a number of things, I am successful in allowing the server to auto commit and without having it hit thread/memory errors. I have isolated the required client change to replacing ConcurrentUpdateSolrServer with HttpSolrServer. I am able to maintain index rates of 3,000 documents/sec w

Solr 4.0 ALPHA: AbstractSolrTestCase depending on LuceneTestCase

2012-07-17 Thread Koorosh Vakhshoori
Hi, I have been developing extensions to SOLR code using 4.0 truck. For JUnit testing I am extending AbstractSolrTestCase which in the ALPHA release is located in JAR apache-solr-test-framework-4.0.0-ALPHA.jar. However, this class extends LuceneTestCase which comes from JAR lucene-test-framework-

Re: Could I use Solr to index multiple applications?

2012-07-17 Thread Shashi Kant
Look up multicore solr. Another choice could be ElasticSearch - which is more straightforward in managing multiple indexes IMO. On Tue, Jul 17, 2012 at 7:53 PM, Zhang, Lisheng wrote: > Hi, > > We have an application where we index data into many different directories > (each directory > is cor

RE: Could I use Solr to index multiple applications?

2012-07-17 Thread Zhang, Lisheng
Thanks very much for quick help! Multicore sounds interesting, I roughly read the doc, so we need to put each core name into Solr config XML, if we add another core and change XML, do we need to restart Solr? Best regards, Lisheng -Original Message- From: shashi@gmail.com [mailto:shas

Re: Could I use Solr to index multiple applications?

2012-07-17 Thread Shashi Kant
My suggestion would be to look into Multi Tenancy http://www.elasticsearch.org/. It is easy to setup and use for multiple indexes. On Tue, Jul 17, 2012 at 9:26 PM, Zhang, Lisheng wrote: > Thanks very much for quick help! Multicore sounds interesting, > I roughly read the doc, so we need to put e

Re: SOLR 4 Alpha Out Of Mem Err

2012-07-17 Thread Mark Miller
On Jul 17, 2012, at 8:08 PM, Nick Koton wrote: > So could there be something amiss in the server side implementation of > ConcurrentUpdateSolrServer? See my earlier email. Once we decide on the appropriate change, we will address it. - Mark Miller lucidimagination.com

Re: Could I use Solr to index multiple applications?

2012-07-17 Thread Yury Kats
On 7/17/2012 9:26 PM, Zhang, Lisheng wrote: > Thanks very much for quick help! Multicore sounds interesting, > I roughly read the doc, so we need to put each core name into > Solr config XML, if we add another core and change XML, do we > need to restart Solr? You can add/create cores on the fly,

RE: Using Solr 3.4 running on tomcat7 - very slow search

2012-07-17 Thread Fuad Efendi
> FWIW, when asked at what point one would want to split JVMs and shard, > on the same machine, Grant Ingersoll mentioned 16GB, and precisely for > GC cost reasons. You're way above that. - his index is 75G, and Grant mentioned RAM heap size; we can use terabytes of index with 16Gb memory.

UTF-8

2012-07-17 Thread William Bell
-Dfile.encoding=UTF-8... Is this usually recommended for SOLR indexes? Or is the encoding usually just handled by the servlet container like Jetty? -- Bill Bell billnb...@gmail.com cell 720-256-8076

Re: Facet on all the dynamic fields with *_s feature

2012-07-17 Thread Rajani Maski
Hi Users, Any reply for the query below? On Mon, Jul 16, 2012 at 6:27 PM, Rajani Maski wrote: > In this URL - https://issues.apache.org/jira/browse/SOLR-247 > > there are *patches *and one patch with name "*SOLR-247-FacetAllFields*" > > Will that help me to fix this problem? > > If yes, ho

Re: configuring solr3.6 for a large intensive index only run

2012-07-17 Thread nanshi
1) In SolrConfig.xml, find ramBufferSizeMB and change to: 1024 2) Also, try decrease the mergefactor to see if it will give you less segments. In my experiment, it does. -- View this message in context: http://lucene.472066.n3.nabble.com/configuring-solr3-6-for-a-large-intensive-index-only-run

Re: UTF-8

2012-07-17 Thread Paul Libbrecht
My experience is that this property has made a whole lot of a difference. At least till solr 3.1. The servlet container has not been the only bit. paul Le 18 juil. 2012 à 05:12, William Bell a écrit : > -Dfile.encoding=UTF-8... Is this usually recommended for SOLR indexes? > > Or is the encod