Re: highlighting performance

2009-07-28 Thread ravi.gidwani
Hey Matt: I have been facing the same issue. I have a text field that I highlight along with other fields (may be 10 others fields). But If I enable highlighting on this text field that contains large number of characters/words ( > 100 000 characters) , highlighting suffers performanc

HTTP Status 500 - java.lang.RuntimeException: Can't find resource 'solrconfig.xml'

2009-07-28 Thread huenzhao
Hi all, I used ubuntu 8.10 as the solr server OS, and set the solr.solr.home=home/huenzhao/search/tomcat6/bin/solr. When I run the tomcat(The tomcat and the solr that I used running on windows XP has no problem), there has error as : HTTP Status 500 - Severe errors in solr configuration. Check

Re: query in solr lucene

2009-07-28 Thread Sushan Rungta
I tried using AND, but it even provided me doc 3 which was not required. Hence my problem still persists... regards, Sushan At 06:59 AM 7/29/2009, Avlesh Singh wrote: > > No, phrase query would match docs 2 and 3. Sushan only wantsdoc 2 as I read > it. > Sorry, my bad. I did not read properly

Re: Solr replication and spellcheck data

2009-07-28 Thread Noble Paul നോബിള്‍ नोब्ळ्
This is not supported by the Java Replication . but planned for later https://issues.apache.org/jira/browse/SOLR-866 On Wed, Jul 29, 2009 at 4:04 AM, Ian Sugar wrote: > Hi > > I would like to make use of the "new" replication mechanism [1] to set up a > master-slaves configuration, but from quick

Re: query in solr lucene

2009-07-28 Thread Avlesh Singh
> > No, phrase query would match docs 2 and 3. Sushan only wantsdoc 2 as I read > it. > Sorry, my bad. I did not read properly before replying. Cheers Avlesh On Wed, Jul 29, 2009 at 3:23 AM, Erick Erickson wrote: > No, phrase query would match docs 2 and 3. Sushan only wantsdoc 2 as I read > it.

Re: Indexing TIKA extracted text. Are there some issues?

2009-07-28 Thread Grant Ingersoll
Hmm, looks very much like an encoding problem. Can you post a sample showing it, along with the commands you invoked? Thanks, Grant On Jul 28, 2009, at 6:14 PM, ashokc wrote: I am finding that the search results based on indexing Tika extracted text are very different from results based

Re: Storing string field in solr.ExternalFieldFile type

2009-07-28 Thread Chris Hostetter
First off: it seems like you are massively missunderstanding the point of ExternalFileField ... the reason it only supports "float" as a base type is because it's purpose is to allow people to have an external file provide float values in ValueSource calculations (ie: you can have a popularity

Solr replication and spellcheck data

2009-07-28 Thread Ian Sugar
Hi I would like to make use of the "new" replication mechanism [1] to set up a master-slaves configuration, but from quick reading and searching around, I can't seem to find a way to replicate the spelling index in addition to the main search index. (We use the spellcheck component) Is there a wa

Indexing TIKA extracted text. Are there some issues?

2009-07-28 Thread ashokc
I am finding that the search results based on indexing Tika extracted text are very different from results based on indexing the text extracted via other means. This shows up for example with a chinese web site that I am trying to index. I created the documents (for posting to SOLR) in two ways.

Re: query in solr lucene

2009-07-28 Thread Erick Erickson
No, phrase query would match docs 2 and 3. Sushan only wantsdoc 2 as I read it. You might have some joy with KeywordAnalyzer, which does not break the incoming stream up into tokens. You have to be careful, though, because it also won't fold the case, so 'Hello' would not match 'hello'. Best Eric

Rotating the primary shard in /solr/select

2009-07-28 Thread Phillip Farber
Is there any value in a round-robin scheme to cycle through the Solr instances supporting a multi-shard index over several machines when sending queries or is it better to just pick one instance and stick with it. I'm assuming all machines in the cluster have the same hardware specs. So sce

Re: µTorrent indexed as µTorrent

2009-07-28 Thread Robert Muir
Bill, somewhere in the process I think you might be treating your UTF-8 text as ISO-8859-1. Your character: 00B5 (µ) Bits: 10110101 UTF8-encoded: 1110 10110101 If you were to treat these bytes as ISO-8859-1 (i.e. reading from a file or wrong url encoding) then it looks like: 0xC2 (Å) followe

Is there a multi-shard optimize message?

2009-07-28 Thread Phillip Farber
Normally to optimize an index you POST to /solr/update. Is there any way to POST an optimize message to one instance and have it propagate to all shards sort of like the select? /solr-shard-1/select?q=dog... shards=shard-1,shard2 Thanks, Phil

Re: Anybody reformatted the "explain" output to be more visual?

2009-07-28 Thread Chris Hostetter
: So to be clear, you'd be in favor of adding some type of .toXml() to the : explain classes, vs. trying to parse the string output. ...no i mean a helper method like... public static NamedList convert(Explanation exp) { ... } ...that walks the Explanation structure and converts it to a Nmaed

Re: Anybody reformatted the "explain" output to be more visual?

2009-07-28 Thread Mark Bennett
Thanks Hos, So to be clear, you'd be in favor of adding some type of .toXml() to the explain classes, vs. trying to parse the string output. And then I guess a default implementation nesting a string in XML for operators that are complex and where the author hasn't added a .toXml() to their stuff

Re: multiple languages in result set

2009-07-28 Thread Kevin Osborn
BTW, the search will always be in a single language. From: Kevin Osborn To: Solr Sent: Tuesday, July 28, 2009 12:23:43 PM Subject: multiple languages in result set As of Solr 1.3, it looks like my choices for searching in multiple languages are either one la

Re: Anybody reformatted the "explain" output to be more visual?

2009-07-28 Thread Chris Hostetter
once upon a time, pre Apache, i had some really simple code to walk an Explanation tree and convert it into nested NamedLists ... but back then there was a bug in the XmlresponseWRiter code that made it break on structures deeper then some magic number so i tossed the code and just used the e

µTorrent indexed as µTorrent

2009-07-28 Thread Bill Au
I am using SolrJ to index the word µTorrent. After a commit I was not able to query for it. It turns out that the document in my Solr index contains the word µTorrent instead of µTorrent. Any one has any idea what's going on??? Bill

multiple languages in result set

2009-07-28 Thread Kevin Osborn
As of Solr 1.3, it looks like my choices for searching in multiple languages are either one language per core or using different fields per language (productTitle_en, productTitle_de, etc.). However, I may want my results back in multiple languages as well. For example, I could search for a term

Re: Confusion around Binary/XML in SolrJ

2009-07-28 Thread Chris Hostetter
: My code and solr were running as 2 different processes! ( Weird part is that : when I run the code using EmbeddedSolrServer, it did not throw any exception : that there was already a server running on that port. ) EmeddedSolrServer doesn't start up any port, or attempt to communicate with any

Re: Question re SOLR-920 Cache and reuse schema

2009-07-28 Thread Chris Hostetter
: Subject: Question re SOLR-920 Cache and reuse schema : In-Reply-To: <69de18140907230244r79010dfcp5468cc855703a...@mail.gmail.com> : References: <3b910d890907230236q4c9b6d67o4d484b5433f8...@mail.gmail.com> : <69de18140907230244r79010dfcp5468cc855703a...@mail.gmail.com> http://people.apache.o

Re: THIS WEEK: PNW Hadoop, HBase / Apache Cloud Stack Users' Meeting, Wed Jul 29th, Seattle

2009-07-28 Thread Bradford Stephens
Hey everyone, SLIGHT change of plans. A few people have asked me to move to a place with Air Conditioning, since the temperature's in the 90's this week. So, here we go: Big Time Brewing Company 4133 University Way NE Seattle, WA 98105 Call me at 904-415-3009 if you have any questions. On Mon

Re: THIS WEEK: PNW Hadoop, HBase / Apache Cloud Stack Users' Meeting, Wed Jul 29th, Seattle

2009-07-28 Thread Bradford Stephens
On Mon, Jul 27, 2009 at 12:16 PM, Bradford Stephens wrote: > Hello again! > > Yes, I know some of us are still recovering from OSCON. It's time for > another delicious meetup to chat about Hadoop, HBase, Solr, Lucene, > and more! > > UW is quite a pain for us to access until August, so we're changi

Anybody reformatted the "explain" output to be more visual?

2009-07-28 Thread Mark Bennett
I realize Solr just pulls the .toString from Lucene's explain. It's helpful to a point, but quickly scanning over a results list doesn't convey "the big picture". Looking at Lucene's code, I don't think moving to .toHtml() would be much better, as I read it you'd just get the same dense text in a

search suggest

2009-07-28 Thread Jack Bates
how can i use solr to make search suggestions? i'm thinking google-style suggestions, which suggests more refined queries - vs. freebase-style suggestions, which suggests top hits. i've been looking at the query params, http://wiki.apache.org/solr/StandardRequestHandler - and searching for "solr

RE: debugQuery=true issue

2009-07-28 Thread Robert Petersen
I had something similar happen where optimize fixed an odd sorting/scoring problem, and as I understand it the optimize will clear out index 'lint' from old schemas/documents and so thus could affect result scores since all the term vectors or something similar are refreshed etc etc -Original

Re: query in solr lucene

2009-07-28 Thread Avlesh Singh
You should perform a PhraseQuery on the required field. Meaning, http://your-solr-host:port:/your-core-path/select?q=fieldName:"Hello how are you sushan" would work for you. Cheers Avlesh 2009/7/28 Gérard Dupont > Hi Sushan, > > I'm not an expert of Solr, just beginner, but it appears to me tha

Re: query in solr lucene

2009-07-28 Thread Gérard Dupont
Hi Sushan, I'm not an expert of Solr, just beginner, but it appears to me that you may have default 'OR' combinaison fo keywords so that will explain this behavior. Try to modify the configuration for an 'AND' combinaison. cheers On Tue, Jul 28, 2009 at 16:49, Sushan Rungta wrote: > I am extr

Re: query in solr lucene

2009-07-28 Thread Sushan Rungta
I am extremely sorry for responding late as I was ill from past few days. My problem is explained below with an example: I am having three documents with following list: 1. Hello how are you 2. Hello how are you sushan 3. Hello how are you sushan. I am fine. When I search for a query "Hello ho

RE: Storing string field in solr.ExternalFieldFile type

2009-07-28 Thread Daniel Alheiros
Hi John, Have you considered buying an existing commercial product that delivers what you want (searching over log files / maybe monitoring)? It may be cheaper than developing it... http://www.splunk.com/product Just a disclaimer: I'm not related to the company or product so if you need any infor

Re: update some index documents after indexing process is done with DIH

2009-07-28 Thread Marc Sturlese
That really sounds the best way to reach my goal. How could I invoque a listener from the newSearcher?Would be something like: solr 0 10 rocks 0 10 static newSearcher warming query from solrconfig.xml And MyCustomListener would be the class

Has this issue been fixed on 1.3 (SOL-613)?

2009-07-28 Thread Daniel Alheiros
Hi I reported this issue a long time ago and if I remember it correctly someone told me this issue no longer happens on 1.3 onwards. But as the Jira issue hasn't been commented or changed states I'm writing to confirm. Regards, Daniel http://www.bbc.co.uk/ This e-mail (and any attachments) is co

Re: update some index documents after indexing process is done with DIH

2009-07-28 Thread Noble Paul നോബിള്‍ नोब्ळ्
when a core is reloaded the event fired is firstSearcher. newSearcher is fired when a commit happens On Tue, Jul 28, 2009 at 4:19 PM, Marc Sturlese wrote: > > Ok, but if I handle it in a newSearcher listener it will be executed every > time I reload a core, isn't it? The thing is that I want to u

Re: update some index documents after indexing process is done with DIH

2009-07-28 Thread Marc Sturlese
Ok, but if I handle it in a newSearcher listener it will be executed every time I reload a core, isn't it? The thing is that I want to use an IndexReader to load in a HashMap some doc fields of the index and depending of the values of some field docs modify other docs. Its very memory consuming (I

Re: debugQuery=true issue

2009-07-28 Thread gwk
Hi, Hoping this was completely my fault I changed my solr to a nightly build from june (I run Solr patched with SOLR-1240) but the same problems occur. After reindexing a single always_on_top document it suddenly appeared in far down the resultset with score around 5.311 (where it would be if

Re: update some index documents after indexing process is done with DIH

2009-07-28 Thread Noble Paul നോബിള്‍ नोब्ळ्
It is best handled as a 'newSearcher' listener in solrconfig.xml. onImportEnd is invoked before committing On Tue, Jul 28, 2009 at 3:13 PM, Marc Sturlese wrote: > > Hey there, > I would like to be able to do something like: After the indexing process is > done with DIH I would like to open an inde

update some index documents after indexing process is done with DIH

2009-07-28 Thread Marc Sturlese
Hey there, I would like to be able to do something like: After the indexing process is done with DIH I would like to open an indexreader, iterate over all docs, modify some of them depending on others and delete some others. I can easy do this directly coding with lucene but would like to know if

Create new (sub)index from existing one

2009-07-28 Thread Chantal Ackermann
Hi all, I was wondering whether it is possible to create a new index out of existing ones. Say, you have multiple indexes (multicore) that are based on language. There are client applications that should see only a subset of the data contained in the indexes. Instead of constantly having to

Re: debugQuery=true issue

2009-07-28 Thread gwk
Grant Ingersoll wrote: What's the line number that is giving the NPE? Can you paste in a stack trace? Here it is: java.lang.NullPointerException: value cannot be null java.lang.RuntimeException: java.lang.NullPointerException: value cannot be null at org.apache.solr.search.QueryPars

Re: FieldCollapsing: Two response elements returned?

2009-07-28 Thread Marc Sturlese
That's provably because you are using both the CollpaseComponent and the QueryComponent. I think the 2 or 3 last patches allow full replacement of QueryComponent.You shoud just replace: for: This will sort your problem and make response times faster. Jay Hill wrote: > > I'm doing some test