Re: complex multi valued fields

2010-01-18 Thread Shalin Shekhar Mangar
On Tue, Jan 12, 2010 at 7:55 PM, Adamsky, Robert wrote: > > I have a document that has a multi-valued field where each value in > the field itself is comprised of two values itself. Think of an invoice > doc > with multi value line items - each line item having quantity and product > name. > > On

Re: schema question

2010-01-18 Thread Uri Boness
Yeah, probably the SignatureUpdateProcessorFactory can do the trick, but you still need to write a custom Signature. (we should really offer a simple "ConcatSignature" implementation for generating predictable combination keys) +1 Cheers, Uri Chris Hostetter wrote: : TemplateTranformer. Other

Re: NullPointerException in ReplicationHandler.postCommit + question about compression

2010-01-18 Thread Shalin Shekhar Mangar
On Wed, Jan 13, 2010 at 12:51 AM, Stephen Weiss wrote: > Hi Solr List, > > We're trying to set up java-based replication with Solr 1.4 (dist tarball). > We are running this to start with on a pair of test servers just to see how > things go. > > There's one major problem we can't seem to get past

Updating a single field in a Solr document

2010-01-18 Thread Raghuveer Kancherla
Hi, I have 2 fields one with captures the category of the documents and an other which is a pre processed text of the document. Text of the document is fairly large. The category of the document changes often while the text remains the same. Search happens on both fields. The problem is, I have to

TermsComponent, multiple fields, total count

2010-01-18 Thread Lukas Kahwe Smith
Hi, I want to use TermsComponent for both auto complete suggestions but also showing a search "quality" meter. As in indicate the total number of matches (doesnt need to be accurate, just a ballpark figure especially if there are a lot of matches). I also want to match multiple fields at once.

Re: NullPointerException in ReplicationHandler.postCommit + question about compression

2010-01-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
When you copy paste config from wiki, just copy what you need. excluding documentation and comments On Wed, Jan 13, 2010 at 12:51 AM, Stephen Weiss wrote: > Hi Solr List, > > We're trying to set up java-based replication with Solr 1.4 (dist tarball). >  We are running this to start with on a pair

Re: Does specifying a smaller number of rows in search improve efficiency?

2010-01-18 Thread Erick Erickson
Nope. The problem is that SOLR needs to create a ranked list. It has to search the entire corpus every time. There's always the possibility that the very last document examined would rank highest. So the search times should be unchanged, no matter how many rows you return, but the time to assemble

multi field search

2010-01-18 Thread Lukas Kahwe Smith
Hi, I realize that I can copy all fields together into one multiValue field and set that as the defaultSearchField. However in that case I cannot leverage the various custom analyzers I want to apply to the fields separately (name should use doublemetaphone, street should use the world splitter

Re: multi field search

2010-01-18 Thread Sven Maurmann
Hi, you might want to use the Dismax-Handler. Sven --On Monday, January 18, 2010 02:58:09 PM +0100 Lukas Kahwe Smith wrote: Hi, I realize that I can copy all fields together into one multiValue field and set that as the defaultSearchField. However in that case I cannot leverage the variou

Re: Does specifying a smaller number of rows in search improve efficiency?

2010-01-18 Thread Yonik Seeley
On Mon, Jan 18, 2010 at 8:57 AM, Erick Erickson wrote: > Nope. The problem is that SOLR needs to create a ranked > list. It has to search the entire corpus every time. There's > always the possibility that the very last document examined > would rank highest. There's also the priority queue used

Re: analyzer type="query" with NGramTokenFilterFactory forces phrase query

2010-01-18 Thread Wangsheng Mei
I faced a similar problem when I was dealing with Chinese words search. By simply adding a PositionFilter at the end of analyzer, the damn phrase query disappeared and replaced by term queries which is what I've expected. That's very nice, thank you very much! Note that Chinese words segmentation

Re: analyzer type="query" with NGramTokenFilterFactory forces phrase query

2010-01-18 Thread Robert Muir
the way that queryparser treats whitespace is also a problem for languages that have words that contain spaces, like vietnamese. i think it also causes grief for multi-word synonyms, such that they don't work correctly at querytime: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#Synony

Re: Multi-word Terms

2010-01-18 Thread shamrockstores
Thank you. While interesting what I'm really after is a programmatic way to get at multi-word terms and their frequencies from a given document. Is this possible? Ahmet Arslan wrote: > >> What is the best way to essentially get a term frequency >> vector for >> multi-word terms? > > To us

Re: Does specifying a smaller number of rows in search improve efficiency?

2010-01-18 Thread Walter Underwood
"Search the entire corpus" makes it sound like Solr is grepping the documents. The corpus has already been converted to an inverted index before the search, so only the terms in the query are retrieved. For basic, relevance-sorted search, there are two kinds of work done by Solr: work per query

Re: Multi-word Terms

2010-01-18 Thread Ahmet Arslan
> Thank you. > > While interesting what I'm really after is a programmatic > way to get at > multi-word terms and their frequencies from a given > document.  > > Is this possible? > What do you mean by programmatic way? You mean without indexing? Multi-word terms means phrases right? Like "ta

filter query parsing problem

2010-01-18 Thread John Thorhauer
I am submitting a query and it seems to be parsing incorrectly. Here is the query with the debug output. Any ideas what the problem is: ((VLog:814124 || VLog:12342) && (PublisherType:U || PublisherType:A)) +(VLog:814124 VLog:12342) +PublisherType:u I would have thought that th

Re: filter query parsing problem

2010-01-18 Thread Ahmet Arslan
> I am submitting a query and it seems > to be parsing incorrectly.  Here > is the query with the debug output.  Any ideas what > the problem is: > > >   >     ((VLog:814124 || VLog:12342) && > (PublisherType:U || PublisherType:A)) >   > > >     +(VLog:814124 VLog:12342) > +PublisherType:u >

Specify logging options from command line in Solr 1.4?

2010-01-18 Thread Mat Brown
Hi all, Wondering if anyone can point me at a simple way to specify basic logging options (log level, log file location) when starting the Solr example jar from the command line. As a bit of background, I maintain a Ruby library for Solr called Sunspot that ships with a Solr installation for ease

Re: Specify logging options from command line in Solr 1.4?

2010-01-18 Thread Mark Miller
Mat Brown wrote: > Hi all, > > Wondering if anyone can point me at a simple way to specify basic > logging options (log level, log file location) when starting the Solr > example jar from the command line. > > As a bit of background, I maintain a Ruby library for Solr called > Sunspot that ships wi

Re: Does specifying a smaller number of rows in search improve efficiency?

2010-01-18 Thread Gora Mohanty
On Mon, 18 Jan 2010 13:21:27 -0800 Walter Underwood wrote: [...] > For basic, relevance-sorted search, there are two kinds of work > done by Solr: work per query term and work per document. The work > per query term depends on the number of query terms. The work per > document depends on the numbe

Tokenization and wild card search

2010-01-18 Thread johnmunir
Hi, I have an issue and I'm not sure how to address it, so I hope someone can help me. I have the following text in one of my fields: "ABC_Expedition_ERROR". When I search on it like: "MyField:SDD_Expedition_PCB" (without quotes) it will fail to find me only this word “ABC_Expedition_ERRO

How can I boost bq in FieldQParserPlugin?

2010-01-18 Thread Wangsheng Mei
Hi, ALL. My original query is: http://myhost:8080/solr/select?q=ipod&*bq=userId:12345^0.5* &fq=&start=0&rows=10&fl=*%2Cscore&qt=dismax&wt=standard&debugQuery=on&explainOther=&hl.fl= It works this way. But I would like to place bq phrase in the default solrconfig.xml configuration to make the quer

build path

2010-01-18 Thread Siv Anette Fjellkårstad
Hi! I try to run the tests of Solr 1.4 in Eclipse, but a most of them fails. The error messages indicate that I miss some config files in my build path. Is there any documentation of how to get Solr up and running in Eclipse? If not; How did you set up (build path for) Solr in Eclipse? Another

Re: build path

2010-01-18 Thread Wangsheng Mei
maybe you should add "-Dsolr.solr.home=" to your JAVA_OPTS before your servlet container starts. 2010/1/19 Siv Anette Fjellkårstad > Hi! > I try to run the tests of Solr 1.4 in Eclipse, but a most of them fails. > The error messages indicate that I miss some config files in my build path. > Is

Fastest way to use solrj

2010-01-18 Thread Tim Terlegård
There are a few ways to use solrj. I just learned that I can use the javabin format to get some performance gain. But when I try the binary format nothing is added to the index. This is how I try to use this: server = new CommonsHttpSolrServer("http://localhost:8983/solr";) server.setReque

SV: build path

2010-01-18 Thread Siv Anette Fjellkårstad
I apologize for the newbie questions :| Do I need a servlet container to run the tests? Kind regards, Siv Fra: Wangsheng Mei [mailto:hairr...@gmail.com] Sendt: ti 19.01.2010 08:49 Til: solr-user@lucene.apache.org Emne: Re: build path maybe you should add "-Ds