Re: Searching for two terms together in a multiValued TextField

2007-12-05 Thread Chris Hostetter
: that must appear together on a page. I have a multiValued TextField called : "page" in a document with uniqueId called "id" that represents a OCR'd book. : My default operator is AND. My default field is "page". My query is: : : q=adhesion+ring&fl=id,score&fq=id:(1+44)&version=2.2 : : But thi

RE: Solr Highlighting, word index

2007-12-05 Thread Binkley, Peter
We're doing a similar process using term vectors to look up the bounding-box data in a custom response writer for a specific project, but we're trying to get this packaged up in a more generally usable way along with handling paging: see https://issues.apache.org/jira/browse/SOLR-380. We're looking

Re: Tomcat6 env-entry

2007-12-05 Thread Chris Hostetter
Erik has already made some good fllowup comments, but to address some specific points... : What was the original thinking behind not having solr/home set in the : web.xml -- seems like an easier way to deal with this. because then people would *have* to unpack the war to change it ... some co

Re: Solr Highlighting, word index

2007-12-05 Thread Mike Klaas
On 5-Dec-07, at 1:02 PM, Owens, Martin wrote: Thanks Mike, So in essence I need to write a new RequestHandler plugin which takes the query string, tokenises it then perform a some kind of action against the index to return results which I should then be able to get the termVectors from?

RE: Solr Highlighting, word index

2007-12-05 Thread Owens, Martin
You do not necessarily need two requests; instead, you can override or modify the request handler you are using (StandardRequestHandler, DisMaxREquestHandler) to return the information. You'll have to process the Query to extract the terms (like HighlighingUtils does), then get the TermVe

Re: Solr Highlighting, word index

2007-12-05 Thread Mike Klaas
On 3-Dec-07, at 10:58 AM, Owens, Martin wrote: You can tell lucene to store token offsets using TermVectors (configurable via schema.xml). Then you can customize the request handler to return the token offsets (and/or positions) by retrieving the TVs. I think that is the best plan of actio

Re: Solr Highlighting, word index

2007-12-05 Thread Ryan McKinley
Owens, Martin wrote: Hello everyone, We're working to replace the old Linux version of dtSearch with Lucene/Solr, using the http requests for our perl side and java for the indexing. The functionality that is causing the most problems is the highlighting since we're not storing the text in so

Re: Solr result offsets

2007-12-05 Thread Yonik Seeley
On Dec 5, 2007 3:06 PM, Owens, Martin <[EMAIL PROTECTED]> wrote: > surely the term offsets are returned when a search is done on a field with > that data available? Nope. That data isn't even stored in the index unless you store termvectors with that info... and even in that case the info is more

Solr result offsets

2007-12-05 Thread Owens, Martin
Hello again, So I've been concentrating on hacking the Util/Highlighting.java to see if I could get it to output the results offsets I need to do the highlighting I need. It turns out that this method requires that the field be stored as well as indexed. I would like to be able to just set term

Re: Tomcat6 env-entry

2007-12-05 Thread Sami Siren
I don't recall any differences in tomcat 6 configuration compared to tomcat 5, did you try to follow the information on wiki for tomcat 5 on your installation? -- Sami Siren Matthew Runo wrote: > Ok, I updated it. I hope it makes sense =\ > > I'm not really familiar enough with the Context cha

Re: Tomcat6 env-entry

2007-12-05 Thread Erik Hatcher
Thanks Matthew! I tidied up a couple of minor things in there. Erik On Dec 5, 2007, at 2:00 PM, Matthew Runo wrote: Ok, I updated it. I hope it makes sense =\ I'm not really familiar enough with the Context changes to add those. If someone else would be so kind as to add "the othe

Re: Tomcat6 env-entry

2007-12-05 Thread Matthew Runo
Ok, I updated it. I hope it makes sense =\ I'm not really familiar enough with the Context changes to add those. If someone else would be so kind as to add "the other way", it'd be much appreciated. http://wiki.apache.org/solr/SolrTomcat --Matthew On Dec 5, 2007, at 9:31 AM, Erick Erickso

Re: 1.2 commit script chokes on 1.2 response format

2007-12-05 Thread Charles Hornberger
https://issues.apache.org/jira/browse/SOLR-426 On Dec 5, 2007 9:57 AM, Ryan McKinley <[EMAIL PROTECTED]> wrote: > > Charles Hornberger wrote: > > On Dec 4, 2007 6:25 PM, Chris Hostetter <[EMAIL PROTECTED]> wrote: > >> i'm not sure what you mean by "applied to 1.2" ... releases are static: > >> onc

Re: solr + maven?

2007-12-05 Thread Ryan McKinley
It would be cool if you'd end up with something similar to what Lucene has recently experienced (fully usable maven artifacts for releases and nightly builds). Agreed. Hopefully solr could have a similar setup to lucene. If it has any value I could at least put some poms together. that w

Re: 1.2 commit script chokes on 1.2 response format

2007-12-05 Thread Ryan McKinley
Charles Hornberger wrote: On Dec 4, 2007 6:25 PM, Chris Hostetter <[EMAIL PROTECTED]> wrote: i'm not sure what you mean by "applied to 1.2" ... releases are static: once published they are never changed. in the event of serious bugs (ie: security holes or crash related bugs) then point releases

Re: 1.2 commit script chokes on 1.2 response format

2007-12-05 Thread Charles Hornberger
On Dec 4, 2007 6:25 PM, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > i'm not sure what you mean by "applied to 1.2" ... releases are static: > once published they are never changed. in the event of serious bugs (ie: > security holes or crash related bugs) then point releases may be published > (

Re: possible to set mincount on facetquery?

2007-12-05 Thread Britske
It seemed handy in the mentioned case where its not certain if there are products in each of the budgetcategories so you simply ask them all, and only get back the categories which contain at least 1 product. >From a functional perspective to me that's kind of on par with doing facet.mincount=1

Re: Tomcat6 env-entry

2007-12-05 Thread Erick Erickson
The beautiful thing about a wiki is that *anybody* can update them. It's especially useful if someone who's just struggled through the issues can write something up since the pain is still fresh . Especially if you're better than I am about writing things down All of which leads me to ask if y

Re: Tomcat6 env-entry

2007-12-05 Thread Matthew Runo
I found that the JNDI settings for Tomcat6 were hard to figure out. Would someone be willing to write it up for the wiki? Since I think most people getting started with SOLR will be using Tomcat6 (or Jetty), it would make sense to update the docs a bit to make it easier to figure out the pr

Re: solr + maven?

2007-12-05 Thread Sami Siren
Ryan McKinley wrote: > Is anyone managing solr projects with maven? I see: > https://issues.apache.org/jira/browse/SOLR-19 > but that is >1 year old > > If someone has a current pom.xml, can you post it on SOLR-19? > > I just started messing with maven, so I don't really know what I am > doing y

Searching for two terms together in a multiValued TextField

2007-12-05 Thread Phillip Farber
Hello, I'm still new to Solr/Lucene. I want to search documents for 2 or more terms that must appear together on a page. I have a multiValued TextField called "page" in a document with uniqueId called "id" that represents a OCR'd book. My default operator is AND. My default field is "page".

Re: possible to set mincount on facetquery?

2007-12-05 Thread Erik Hatcher
On Dec 5, 2007, at 8:33 AM, Yonik Seeley wrote: On Dec 5, 2007 7:45 AM, Erik Hatcher <[EMAIL PROTECTED]> wrote: On Dec 5, 2007, at 5:12 AM, Erik Hatcher wrote: In my perusal of the code (SimpleFacets#getFacetQueryCounts), I'm not seeing that facet.query respects facet.limit even. If you aske

Re: solr + maven?

2007-12-05 Thread climbingrose
Hi Ryan, I'm using solr with Maven 2 in our project. Here is how my pom.xml looks like: org.apache.solr solr-solrj 1.3.0 Since I have all solrj dependencies declared by other artifacts, I don't need to declare any of solrj dependenci

Re: possible to set mincount on facetquery?

2007-12-05 Thread Yonik Seeley
On Dec 5, 2007 7:45 AM, Erik Hatcher <[EMAIL PROTECTED]> wrote: > On Dec 5, 2007, at 5:12 AM, Erik Hatcher wrote: > > In my perusal of the code (SimpleFacets#getFacetQueryCounts), I'm > > not seeing that facet.query respects facet.limit even. If you > > asked for a count for a query, you get it re

Re: Invalid character in search results

2007-12-05 Thread Yonik Seeley
On Dec 5, 2007 4:27 AM, Maciej Szczytowski <[EMAIL PROTECTED]> wrote: > I think I have everything correct with encoding because: > - there are about 15000 documents with word "иго", but only some of them > are broken ("и��о") > - when I reindex broken document invalid characters will be fixed. > >

Re: possible to set mincount on facetquery?

2007-12-05 Thread Erik Hatcher
On Dec 5, 2007, at 5:12 AM, Erik Hatcher wrote: In my perusal of the code (SimpleFacets#getFacetQueryCounts), I'm not seeing that facet.query respects facet.limit even. If you asked for a count for a query, you get it regardless of any other parameters such as mincount or limit. sorry, f

RE: single word Vs multiple word search

2007-12-05 Thread Dilip.TS
Hi, I am using the text and string mostly for the field types. And the defaultSearchField has the field type as text. Thanks in advance. Regards, Dilip TS -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, December 05, 2007 3:45 PM To: solr-user@lucene.

Re: single word Vs multiple word search

2007-12-05 Thread Erik Hatcher
This all depends on the schema setting for that particular field. What is the field type? Sounds like you've got it set to a non- analyzed/tokenized field. Erik On Dec 5, 2007, at 2:04 AM, Dilip.TS wrote: Hi, This is in continuation with my previous mail. Iam using the SOLRInp

Re: possible to set mincount on facetquery?

2007-12-05 Thread Erik Hatcher
In my perusal of the code (SimpleFacets#getFacetQueryCounts), I'm not seeing that facet.query respects facet.limit even. If you asked for a count for a query, you get it regardless of any other parameters such as mincount or limit. Erik p.s. I also notice that facet.query always u

Re: possible to set mincount on facetquery?

2007-12-05 Thread Erik Hatcher
No, facet.query does not support limiting currently. You'll need to strip those client-side for now. Erik On Dec 5, 2007, at 4:49 AM, Britske wrote: is it possible to set a mincount on a facetquery as well as on a facetfield? I have a situation in which I want to group facetquer

RE: possible to set mincount on facetquery?

2007-12-05 Thread Dilip.TS
Hi, Ideally the params.put("facet.mincount","1"); used for the facet search should do the same. It works fine in my case. Regards Dilip. -Original Message- From: Britske [mailto:[EMAIL PROTECTED] Sent: Wednesday, December 05, 2007 3:19 PM To: solr-user@lucene.apache.org Subject: possible

Re: Tomcat6 env-entry

2007-12-05 Thread Erik Hatcher
Or, instead of messing around with the JNDI setting, simply set - Dsolr.solr.home=/opt/solr with the JVM startup parameters for Tomcat. Hardcoding a path in web.xml is definitely _not_ what we want to do. Not all containers unpack the WAR file onto disk. Also, consider the case of up

possible to set mincount on facetquery?

2007-12-05 Thread Britske
is it possible to set a mincount on a facetquery as well as on a facetfield? I have a situation in which I want to group facetqueries (price-ranges) but I obviously dont want to show ranges with 0 results. I tried things like: f.price:[0 TO 50].facet.mincount=1 and f.price:[0 TO 50].query.mincou

Re: Invalid character in search results

2007-12-05 Thread Maciej Szczytowski
I think I have everything correct with encoding because: - there are about 15000 documents with word "иго", but only some of them are broken ("и��о") - when I reindex broken document invalid characters will be fixed. Any ideas? Maciek Yonik Seeley wrote: On Dec 4, 2007 5:02 AM, Maciej Szczyt