General question on understanding Solr log output

2012-02-26 Thread Loren Siebert
I'm trying to understand what Solr is doing regarding commits based on the logs below. I have a 60 second autocommit and no explicit commits coming in. What I am seeing in my Solr log appears to be 2 (or 3?) commits per 60 second commit cycle. Everything below happens within a few seconds. I say ma

Does solrj support compound type for field?

2012-02-26 Thread SuoNayi
Hi all, I'm new to solr and just know that solrj generates index for instance of POJO by transform instance of pojo into an instance of type of SolrInputDocument with DocumentObjectBinder. Supposing my POJO has a property of List type and its element compound types which are my customized cl

Re: Fail to compile Java code (trying to use SolrJ with Solr)

2012-02-26 Thread Mark Miller
Java 6 supports classpath wildcards as well - so you can also do dist/* On Feb 24, 2012, at 3:33 AM, bing wrote: > Hi, Dmitry > > Thank you. It solved my problem. > > Best Regards, > Bing > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Fail-to-compile-Java-code

How to define a multivalued string type "langid.langsField" in solrconfig.xml

2012-02-26 Thread bing
Hi, all, I am using tika language detection. It is said that, if "langid.langsField" is set as multivalued string, and then a list of languages can be stored for the fields specified in "langid.fl". Following is how I configure the processor in soleconfig.xml. I tried using "text" only, and the

Customizing Solr score with DixMax query

2012-02-26 Thread Xiao
In my application logic, I want to implement the ranking (scoring) logic as follows: score = "Solr relecency score" * a_special_field_value. I tried to use DixMax to do this. My query statement is q={!type=dixmax qf='title content' bf=field1}data. However, when I open the debugquery option, I fi

Re: TikaLanguageIdentifierUpdateProcessorFactory(since Solr3.5.0) to be used in Solr3.3.0?

2012-02-26 Thread bing
Hi, Erick, My idea is to use Tika0.10 in Dspace1.7.2, which is based on two steps: 1. Upgrade Solr1.4.1 to Solr3.3.0 in Dspace1.7.2 In the following link, upgraded Solr & Lucene 3.3.0 has been resolved. https://jira.duraspace.org/browse/DS-980 2. Upgrade to Tika0.10 in Solr3.3.0 In the follo

TIKA Errors Importing MS Word Documents into SOLR Cloud

2012-02-26 Thread Matthew Parker
I tried to import some documents into SOLR Cloud using Apache Manifold. TIKA started throwing exceptions for various documents The exception reads like the following: org.apache.solr.common.SolrException at org.apache.solr.handler.extraction.ExtractionDocumentLoader.load( ExtractingDocumentLoade

Re: Solr Performance Improvement and degradation Help

2012-02-26 Thread Yonik Seeley
On Sun, Feb 26, 2012 at 3:32 PM, Erick Erickson wrote: > Would you hypothesize that lazy field loading could be that much > slower if a large fraction of fields were selected? If you actually use the lazy field later, it will cause an extra read for each field. If you don't have enough free RAM f

Re: Solr Performance Improvement and degradation Help

2012-02-26 Thread Erick Erickson
I sure can't reproduce this on an 11M document Wikipedia dump. I added the "text" from the Wiki dump 49 extra times (i.e. there are 50 copies of the text field in each document), and pulled back 12000 documents from my test machine (a Macbook Pro from 3 years ago). I also debugged the code a bit an

Re: Dynamically Load Query Time Synonym File

2012-02-26 Thread Bill Bell
It would depend. If the synonyms are used on indexing, you need to reindex. Otherwise, you could reload and use the synonyms on "query". On 2/26/12 4:05 AM, "Ahmet Arslan" wrote: > >> Is there a way to dynamically load a synonym file without >> restarting solr core ? > >There is an open jira fo

Re: distributed deletes working?

2012-02-26 Thread Mark Miller
> > > > Are there any outstanding issues that I should be aware of? > > Not that I know of - we where trying to track down an issue around peer sync recovery that our ChaosMonkey* tests were tripping, but looks like Yonik may have tracked that down last night. * The ChaosMonkey tests randomly sta

Time Stats

2012-02-26 Thread Raimon Bosch
Hi, Today I was playing with StatsComponent just to extract some statistics from my index. I'm using a solr index to store user searches. Basically what I did is to aggregate data from accesslog into my solr index. So now I can see average bounce rate for a group of user searches and see which one

Re: Dynamically Load Query Time Synonym File

2012-02-26 Thread Ahmet Arslan
> Is there a way to dynamically load a synonym file without > restarting solr core ? There is an open jira for this : https://issues.apache.org/jira/browse/SOLR-1307

Dynamically Load Query Time Synonym File

2012-02-26 Thread Sphene Software
Hello, Is there a way to dynamically load a synonym file without restarting solr core ? If this is possible, what are the negative side effects of these ? Example: If Synonym file is loaded dynamically, what happens to the results of the queries stored in the queryresultscache ? Thanks in adva

Solr Analyzed or "parsedquery" Terms

2012-02-26 Thread Charlie Maroto
Hi all, *Question* Is there a way to get the analyzed or parsedquery terms for a given search? Preferably along with the search results *What are we looking for?* Something that would tell us what are the actual terms searched by Solr after all tokenization, analysis and filtering have been appl