Re: Standard request with functional query

2008-12-18 Thread Chris Hostetter
: Thanks for the response, but how would I make recency a factor in : scoring documents with the standard request handler? : The query (title:iphone OR bodytext:iphone OR title:firmware OR : bodytext:firmware) AND _val_:"ord(dateCreated)"^0.1 : seems to do something very similar to just sorting

Re: [ANNOUNCE] Solr Logo Contest Results

2008-12-18 Thread Lukáš Vlček
Congratulations Michiel. Lukas On Thu, Dec 18, 2008 at 3:44 AM, Matt Mitchell wrote: > Love it! Congratulations Michiel. > > Matt > > On Wed, Dec 17, 2008 at 9:15 PM, Chris Hostetter > wrote: > > > (replies to solr-user please) > > > > On behalf of the Solr Committers, I'm happy to announce that

Solrj - Exception in thread "main" java.lang.ClassCastException: java.lang.Long cannot be cast to org.apache.solr.common.util.NamedList

2008-12-18 Thread Sajith Vimukthi
Hi all, I used the sample code given below and tried to run with all the relevant jars. I receive the exception written below. package test.general; import org.apache.solr.client.solrj.SolrServer; import org.apache.solr.client.solrj.SolrServerException; import org.apache.solr.client.s

Solrj - Exception in thread "main" java.lang.ClassCastException: java.lang.Long cannot be cast to org.apache.solr.common.util.NamedList

2008-12-18 Thread Sajith Vimukthi
Hi all, I used the sample code given below and tried to run with all the relevant jars. I receive the exception written below. package test.general; import org.apache.solr.client.solrj.SolrServer; import org.apache.solr.client.solrj.SolrServerException; import org.apache.solr.clien

Re: date facets doubt

2008-12-18 Thread Marc Sturlese
Has anyone experienced this problem? I can't find an explanation... Thanks in advance Marc Sturlese wrote: > > Hey there, > > 1.- I am trying to use date facets but I am running into trouble. I want to > use the same field to do 2 facet classifications. I want to show the count > of the docs of the

Re: Solrj - Exception in thread "main" java.lang.ClassCastException: java.lang.Long cannot be cast to org.apache.solr.common.util.NamedList

2008-12-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
Which version of the server are you using? The SolrJ documentation says that the binary format works only with Solr 1.3 On Thu, Dec 18, 2008 at 2:49 PM, Sajith Vimukthi wrote: > > > Hi all, > > > > I used the sample code given below and tried to run with all the relevant > jars. I receive the exceptio

[SolrJ] SolrException: missing content stream

2008-12-18 Thread Gunnar Wagenknecht
Hi, I'm using SolrJ to index a couple of documents. I do this in batches of 50 docs to save some machine memory. I call SolrServer#add(Collection) for each batch. For some reason, I get the following exception: org.apache.solr.common.SolrException: missing content stream at org.apache.sol

Multi language search help

2008-12-18 Thread Sujatha Arun
Hi, I am prototyping language search using Solr 1.3. I have 3 fields in the schema: id, content and language. I am indexing 3 PDF files; the languages are foroyo, Chinese and Japanese. I use xpdf to convert the content of the PDFs to text and push the text to Solr in the content field. What is the anal

Re: [SolrJ] SolrException: missing content stream

2008-12-18 Thread Ryan McKinley
Are you sure the Collection is not empty? What version are you running? What do the server logs say when you get this error on the client? On Dec 18, 2008, at 6:42 AM, Gunnar Wagenknecht wrote: Hi, I'm using SolrJ to index a couple of documents. I do this in batches of 50 docs to save some

Change in config file (synonym.txt) requires container restart?

2008-12-18 Thread Sagar Khetkade
Hi, I am using the SolrJ client to connect to the Solr 1.3 server, and the whole POC (a feasibility study) resides in a Tomcat web server. For any change I make to the synonym.txt file (adding a synonym) to take effect, I have to restart the Tomcat server. The synonym filter

Re: Change in config file (synonym.txt) requires container restart?

2008-12-18 Thread Mark Miller
Sagar Khetkade wrote: Hi, I am using the SolrJ client to connect to the Solr 1.3 server, and the whole POC (a feasibility study) resides in a Tomcat web server. For any change I make to the synonym.txt file (adding a synonym) to take effect, I have to restart the Tomcat server

Re: Get All terms from all documents

2008-12-18 Thread Erick Erickson
I think I'd pin the user down and have him give me the real-world use-cases that require this, then see if there's a more reasonable way to satisfy that use-case. Do they want type-ahead? What is the user of the system going to see? Because, for instance, a drop-down of 10,000 terms is totally use

Re: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Erick Erickson
See the CJKAnalyzer for a start, StandardAnalyzer won't help you much. Also, tell us a little more about your requirements. For instance, if a user submits a query in Japanese, do you want to search across documents in the other languages too? And will you want to associate different analyzers wit

Highlighting broken? String index out of range: 35

2008-12-18 Thread Steffen B.
Hi everyone, it seems that I've run into another problem with my Solr setup. :/ The highlighter just won't highlight anything, no matter which fragmenter or config params I use. Here's an example, taken straight out of the example solrconfig.xml: dismax explicit 0.01

Re: Highlighting broken? String index out of range: 35

2008-12-18 Thread Steffen B.
Alright, I pinned it down, I think... The cause of the error seems to be the "features" field, which has termVectors="true", termPositions="true" and termOffsets="true". The other 2 fields ("name" and "text") work, they have the same type but lack the term*-attributes. When you overwrite the defau

Solr openning many threads

2008-12-18 Thread Alexander Ramos Jardim
Hello, I can see from a thread dump that Solr opens a lot of threads. How does Solr use these threads? Is there more than one thread for search in Solr? Does Solr use any type of workManager, or are the threads simple java.lang.Thread instances? How many concurrent threads does Solr create? How does it m

Re: Highlighting broken? String index out of range: 35

2008-12-18 Thread Koji Sekiguchi
I think you are facing this problem: https://issues.apache.org/jira/browse/SOLR-925 I'm looking into the issue now, though I'm not sure that I will have time to fix it... Koji Steffen B. wrote: Hi everyone, it seems that I've run into another problem with my Solr setup. :/ The highlig

Problem in Date Format in Solr 1.3

2008-12-18 Thread rohit arora
Hi, I have upgraded from Solr Lucene 1.2 to Solr Lucene 1.3. I copied all the "" tags of "schema.xml" from Solr 1.2 to Solr 1.3, and it gives an error: SEVERE: org.apache.solr.common.SolrException: Invalid Date in Date Math String:'2006-Oct-10T10:06:13Z' Can you help me with this problem?
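The rejected value in the error above spells the month as a name ("Oct"), whereas Solr date fields expect the numeric form, e.g. 2006-10-10T10:06:13Z. A minimal sketch of normalizing such values on the client before indexing, assuming the input always has the shape shown in the error message; the names `MONTHS` and `to_solr_date` are hypothetical and not part of Solr or SolrJ:

```python
import re

# English month abbreviations mapped to two-digit month numbers.
# (Hypothetical helper table, not part of any Solr API.)
MONTHS = {
    name: f"{number:02d}"
    for number, name in enumerate(
        ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
         "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"],
        start=1,
    )
}

# Matches values shaped like 2006-Oct-10T10:06:13Z.
_PATTERN = re.compile(r"(\d{4})-([A-Za-z]{3})-(\d{2})T(\d{2}:\d{2}:\d{2})Z")


def to_solr_date(value):
    """Rewrite 'YYYY-Mon-DDTHH:MM:SSZ' as 'YYYY-MM-DDTHH:MM:SSZ'."""
    match = _PATTERN.fullmatch(value)
    if match is None:
        raise ValueError(f"unexpected date shape: {value!r}")
    year, month_name, day, time_part = match.groups()
    month = MONTHS.get(month_name.capitalize())
    if month is None:
        raise ValueError(f"unknown month abbreviation: {month_name!r}")
    return f"{year}-{month}-{day}T{time_part}Z"


if __name__ == "__main__":
    print(to_solr_date("2006-Oct-10T10:06:13Z"))
```

The explicit month table avoids depending on the process locale, which `datetime.strptime` with `%b` would do.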

TermVectorComponent and SolrJ

2008-12-18 Thread Aleksander M. Stensby
Hello everyone, I've started to look at TermVectorComponent and I'm experimenting with the use of the component in a sort of "top terms" setting for a given query... Was also looking at mlt and the interestingTerms, but I would like to do a query, get say 10k results, and from those results r

Re: Solr openning many threads

2008-12-18 Thread Yonik Seeley
On Thu, Dec 18, 2008 at 9:03 AM, Alexander Ramos Jardim wrote: > I can see from a thread dump that Solr opens a lot of threads. > > How does Solr use these threads? Is there more than one thread for search > in Solr? Does Solr use any type of workManager or are the threads simple > java.lang.Thr

Solr and Autocompletion

2008-12-18 Thread Kashyap, Raghu
Hi, One of the things we are looking for is to autofill the keywords when people start typing (e.g. Google autofill). Currently we are using the RangeQuery. I read about the PrefixQuery and feel that it might be appropriate for this kind of implementation. Has anyone implemented the autofill fea

RE: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Daniel Alheiros
Hi Sujatha. I've developed a search system for 6 different languages and as it was implemented on Solr 1.2 all those languages are part of the same index, using different fields for each so I can have different analyzers for each one. Like: content_chinese content_english content_russian content_

Re: Solr and Autocompletion

2008-12-18 Thread Ryan McKinley
Lots of options out there. Rather than doing a slow query like Prefix, I think it's best to index the ngrams so the autocomplete is a fast query. http://www.mail-archive.com/solr-user@lucene.apache.org/msg06776.html On Dec 18, 2008, at 11:56 AM, Kashyap, Raghu wrote: Hi, One of thing
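The n-gram suggestion above can be illustrated outside Solr with a small sketch. It assumes edge (prefix) n-grams, which is one common reading of "index the ngrams" for autocomplete; the function names `edge_ngrams`, `build_index` and `suggest` are hypothetical, and the in-memory dict only stands in for a real index:

```python
def edge_ngrams(term, min_len=1, max_len=10):
    """Return the prefixes of `term` from min_len up to max_len characters."""
    term = term.lower()
    upper = min(len(term), max_len)
    return [term[:n] for n in range(min_len, upper + 1)]


def build_index(terms):
    """Map every edge n-gram to the set of terms that start with it."""
    index = {}
    for term in terms:
        for gram in edge_ngrams(term):
            index.setdefault(gram, set()).add(term)
    return index


def suggest(index, prefix):
    """Autocomplete is a single exact lookup; no prefix scan over terms."""
    return sorted(index.get(prefix.lower(), ()))


if __name__ == "__main__":
    index = build_index(["iphone", "ipod", "firmware"])
    print(suggest(index, "ip"))
```

The trade-off is the usual one: the prefixes are computed once at index time, which makes the index larger, so that each keystroke at query time is an exact-match lookup instead of a prefix expansion.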

Re: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Chris Hostetter
: Subject: looking for multilanguage indexing best practice/hint : References: <49483388.8030...@drun.net> : <502b8706-828b-4eaa-886d-af0dccf37...@stylesight.com> : <8c0c601f0812170825j766cf005i9546b2604a19f...@mail.gmail.com> : In-Reply-To: <8c0c601f0812170825j766cf005i9546b2604a19

Re: Solr and Autocompletion

2008-12-18 Thread Chris Hostetter
: Subject: Solr and Autocompletion : References: <49483388.8030...@drun.net> : <502b8706-828b-4eaa-886d-af0dccf37...@stylesight.com> : <8c0c601f0812170825j766cf005i9546b2604a19f...@mail.gmail.com> : <4949537a.3050...@drun.net> : <8599f2e4e80ecc44aee81fa2974ce2bd0c31d...@mail-sd

Re: [ANNOUNCE] Solr Logo Contest Results

2008-12-18 Thread Mathijs Homminga
Good choice! Mathijs Homminga Chris Hostetter wrote: (replies to solr-user please) On behalf of the Solr Committers, I'm happy to announce that the Solr Logo Contest is officially concluded. (Woot!) And the Winner Is... https://issues.apache.org/jira/secure/attachment/12394264/apache_sol

Re: Get All terms from all documents

2008-12-18 Thread roberto
Erick, Thanks for the answer. Let me clarify: we would like to have a combobox with the terms to guide the user in the search. I mean, if I have thousands of documents and want to tell them how many documents in the base have a particular word, how can I do that? Thanks On Thu, Dec 18

Approximate release date for 1.4

2008-12-18 Thread Kay Kay
Just curious - if we have an approximate target release date for 1.4 / list of milestones / feature sets for the same.

Re: [ANNOUNCE] Solr Logo Contest Results

2008-12-18 Thread Jeryl Cook
looks cool :), how about a talking mascot as Jeryl Cook twoenc...@gmail.com On Thu, Dec 18, 2008 at 1:38 PM, Mathijs Homminga wrote: > Good choice! > > Mathijs Homminga > > Chris Hostetter wrote: >> >> (replies to solr-user please) >> >> On behalf of the Solr Committers, I'm happy to announce t

Re: Approximate release date for 1.4

2008-12-18 Thread Yonik Seeley
On Thu, Dec 18, 2008 at 2:43 PM, Kay Kay wrote: > Just curious - if we have an approximate target release date for 1.4 / list > of milestones / feature sets for the same. Mid January. Issues included: case-by-case analysis of how ready they are (and obviously affected by committers "scratching th

Re: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Julian Davchev
Thanks Erick, I think I will go with different language fields as I want to give different stop words, analyzers etc. I might also consider scheme per language so scaling is more flexible as I was already advised but this will really make sense if I have more than one server I guess, else just all

does this break Solr? dynamicField name="*" type="ignored"

2008-12-18 Thread Peter Wolanin
I'm seeing a weird effect with a '*' field. In the example schema.xml, there is a commented out sample: We have this un-commented, and in the schema browser via the admin interface I see that all non-dynamic fields get a type of "ignored". I see this in the Solr admin interface: Field:

Re: does this break Solr? dynamicField name="*" type="ignored"

2008-12-18 Thread Yonik Seeley
Looks like it's a bug in the schema browser (i.e. just this display, not the inner workings of Solr). Could you open a JIRA issue for this? -Yonik On Thu, Dec 18, 2008 at 3:20 PM, Peter Wolanin wrote: > I'm seeing a weird effect with a '*' field. In the example > schema.xml, there is a commente

Re: Partitioning the index

2008-12-18 Thread Yonik Seeley
It's more related to how much memory you have on your boxes, how resource intensive your queries are, how many fields you are trying to facet on, what acceptable response times are, etc. Anyway... a single box is normally good for between 5M and 50M docs, but can fall out of that range (both up an

Re: Get All terms from all documents

2008-12-18 Thread Erick Erickson
How do you get the word in the first place? If the combobox is for all words in your index, it's probably completely useless to provide this information because there is too much data to guide the user at all. I mean a list of 10,000 words with some sort of document frequency seems to me to require

Data Import Request Handler problem: Odd performance behaviour for large number of records

2008-12-18 Thread Glen Newton
Hello, I am using Solr 1.4 (solr-2008-11-19) with Lucene 2.4 dropped in instead of 2.9 I am indexing 500k records using the JDBC Data Import Request Handler. Config: Linux openSUSE 10.2 (X86-64) Dual core dual core 64bit Xeon 3GHz Dell blade 8GB RAM java version "1.6.0_07" Java(TM) SE Runtim

Re: does this break Solr? dynamicField name="*" type="ignored"

2008-12-18 Thread Peter Wolanin
Created issue: https://issues.apache.org/jira/browse/SOLR-929 -Peter On Thu, Dec 18, 2008 at 3:32 PM, Yonik Seeley wrote: > Looks like it's a bug in the schema browser (i.e. just this display, > not the inner workings of Solr). > Could you open a JIRA issue for this? > > -Yonik > > > On Thu, Dec

Full reindex needed if termVectors added to fields in schema?

2008-12-18 Thread Eric Kilby
hi, I've successfully added fields to my schema.xml before, and been able to incrementally keep indexing documents with just the new ones picking up the fields. This appears to be similar to the case of not including certain fields in certain documents, as the other documents simply don't have t

Re: Get All terms from all documents

2008-12-18 Thread Mike Klaas
On 18-Dec-08, at 10:53 AM, roberto wrote: Erick, Thanks for the answer. Let me clarify: we would like to have a combobox with the terms to guide the user in the search. I mean, if I have thousands of documents and want to tell them how many documents in the base have the partic

RE: Change in config file (synonym.txt) requires container restart?

2008-12-18 Thread Sagar Khetkade
But I am using CommonsHttpSolrServer for Solr server configuration, as it accepts the URL. So how can I reload the core here? -Sagar > Date: Thu, 18 Dec 2008 07:55:02 -0500 > From: markrmil...@gmail.com > To: solr-user@lucene.apache.org > Subject: Re: Change in config file (synonym.txt) requires

Re: TermVectorComponent and SolrJ

2008-12-18 Thread Grant Ingersoll
On Dec 18, 2008, at 10:06 AM, Aleksander M. Stensby wrote: Hello everyone, I've started to look at TermVectorComponent and I'm experimenting with the use of the component in a sort of "top terms" setting for a given query... Was also looking at mlt and the interestingTerms, but I would like

Re: Multi language search help

2008-12-18 Thread Grant Ingersoll
On Dec 18, 2008, at 6:25 AM, Sujatha Arun wrote: Hi, I am prototyping lanuage search using solr 1.3 .I have 3 fields in the schema -id,content and language. I am indexing 3 pdf files ,the languages are foroyo,chinese and japanese. I use xpdf to convert the content of pdf to text and pu

Re: Data Import Request Handler problem: Odd performance behaviour for large number of records

2008-12-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
DIH does not maintain any state between two runs. So if there is a perf degradation it could be because - Solr Indexing is taking longer after you do a delete *:* - Your RAM is insufficient (your machine is swapping) On Fri, Dec 19, 2008 at 2:51 AM, Glen Newton wrote: > Hello, > > I am using Solr

Re: Change in config file (synonym.txt) requires container restart?

2008-12-18 Thread Shalin Shekhar Mangar
Please note that a core reload will also stop Solr from serving any search requests while it reloads. On Fri, Dec 19, 2008 at 8:24 AM, Sagar Khetkade wrote: > > But I am using CommonsHttpSolrServer for Solr server configuration, as it > accepts the URL. So how can I reload the core here? >

Re: Precisions on solr.xml about cross context forwarding.

2008-12-18 Thread Chris Hostetter
: This bothers me too. I find it really strange that Solr's entry-point : is a servlet filter instead of a servlet. it traces back to the need for it to decide when to handle a request and when to let it pass through (to a later filter, a servlet or a JSP) this is the only way legacy support

Fwd: Distributed Searching - Limitations?

2008-12-18 Thread Pooja Verlani
Hi, I am planning to use Solr's distributed searching for my project. But while going through http://wiki.apache.org/solr/DistributedSearch, I found a few limitations with it. Can anyone please explain the 2nd and 3rd points in the limitations section on the page? The points are: - When du