help refactoring from 3.x to 4.x

2010-08-22 Thread Ryan McKinley
I have a function that works well in 3.x, but when I tried to re-implement it in 4.x it runs very, very slowly (~20ms vs 45s on an index with ~100K items). Big picture, I am trying to calculate a bounding box for items that match the query. To calculate this, I have two fields, bboxNS and bboxEW, that get

Re: Solr Admin Schema Browser and field named "keywords"

2010-08-22 Thread Shawn Heisey
On 8/23/2010 12:07 AM, Shawn Heisey wrote: I have a field named "keywords" in my index. The schema browser page is not able to deal with this, so I have trouble getting statistical information on this field. When I click on the field, Firefox hangs for a minute and then gives the "unrespons

Re: How to do Spatial Search with Solr?

2010-08-22 Thread Mattmann, Chris A (388J)
Hi Savannah, Check out the patches I just threw up for SOLR-2073, SOLR-2074, SOLR-2075, SOLR-2076 and SOLR-2077. There's code in there to deal with Geonames.org data. There's more patches coming so hopefully it will be clearer as I add them. Thanks to W. Quach for leading the charge on these p

How to do Spatial Search with Solr?

2010-08-22 Thread Savannah Beckett
Hi, I am using Nutch to do the crawling and Solr to do the searching. The index has City and State. I want to be able to get all nearby cities by entering a city name. E.g., when I type New York, I want to get the following as facets: New York, NY (1905) Brooklyn, NY (89) Jersey City, NJ (55) N

Solr Admin Schema Browser and field named "keywords"

2010-08-22 Thread Shawn Heisey
I have a field named "keywords" in my index. The schema browser page is not able to deal with this, so I have trouble getting statistical information on this field. When I click on the field, Firefox hangs for a minute and then gives the "unresponsive script" warning. I assume (without actu

Re: spellcheck index blown away during rebuild

2010-08-22 Thread Shawn Heisey
On 8/20/2010 8:56 PM, Lance Norskog wrote: The first question is about your use cases. How many words are in the eventual 3GB spelling index? Do you really need that many? Spell-checking is a more controllable UI if you make it from a dictionary. It's built from an index-only field that combin

Re: Duplicate docs when merging indices?

2010-08-22 Thread Shalin Shekhar Mangar
On Sat, Aug 21, 2010 at 5:56 PM, Andrew Clegg wrote: > > Hi, > > First off, sorry about previous accidental post, had a sausage-fingered > moment. > > Anyway... > > If I merge two indices with CoreAdmin, as detailed here... > > http://wiki.apache.org/solr/MergingSolrIndexes > > What happens to dup

Re: How to get most indexed keyword from SOLR

2010-08-22 Thread Pawan Darira
Thanks. It worked out fine. On Fri, Aug 20, 2010 at 7:08 PM, Jan Høydahl / Cominvent < jan@cominvent.com> wrote: > Check out the luke request handler: > > http://localhost:8983/solr/admin/luke?fl=my_ad_field&numTerms=100 - you'll > find topTerms for the fields specified > > -- > Jan Høydahl,
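
The Luke request handler URL quoted in this reply can also be assembled programmatically. The following is a minimal sketch, not part of the original thread: the function name `luke_top_terms_url` is hypothetical, the host, port, and field name are taken from the quoted URL, and nothing is sent over the network.

```python
from urllib.parse import urlencode


def luke_top_terms_url(field, num_terms=100, base="http://localhost:8983/solr"):
    """Build a Luke request handler URL asking for the top terms of `field`.

    `base` is an assumption copied from the URL quoted in the thread.
    """
    params = urlencode({"fl": field, "numTerms": num_terms})
    return f"{base}/admin/luke?{params}"


print(luke_top_terms_url("my_ad_field"))
```

Fetching the resulting URL with any HTTP client should return the response in which, per the reply, the topTerms for the requested field can be found.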

Re: Doing Shingle but also keep special single word

2010-08-22 Thread 朱炎詹
I think I didn't state my problem very well, so allow me to rephrase my case here: 1. We have over ten million news articles to build into a Solr index. 2. We copy several fields, such as title, author, body, and captions of attached photos, into a new field for default search. 3. We then want to use shingle fi

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread stockii
Okay, thx, but it won't work =( I checked out Solr 1.4.1 as a dynamic web project in Eclipse and started Jetty with XDebug. In Eclipse I added WebLogic exactly as the tutorial shows, but Eclipse cannot connect =( Any idea what I'm doing wrong? -- View this message in context: http://lucene.472066.n3.na

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread Drew Farris
After a build of what you have checked out, there's an instance you can run in the examples directory within the solr directory. 1.4.1 can be checked out from http://svn.apache.org/repos/asf/lucene/solr/tags/release-1.4.1/ On Sun, Aug 22, 2010 at 6:01 PM, stockii wrote: > > im stupid. XD > > how

Re: Autocomplete and Sorting on multiple multi-value/single-value fields

2010-08-22 Thread Erick Erickson
Hmmm, is it then really acceptable for the document display to look like title crew cast cvalue mcast value ? Presuming that each document has a title, I know of no built-in way to say "only sort on the title if there was a title match

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread stockii
I'm stupid. XD How do I get the revision of Solr 1.4.1, or do I need to build a WAR from my SVN checkout?

Re: How to limit rows to which highlighting applies

2010-08-22 Thread Otis Gospodnetic
I didn't look at them closely now, but look at: https://issues.apache.org/jira/browse/SOLR-1093 https://issues.apache.org/jira/browse/SOLR-2026 Incidentally, I found them with: http://search-lucene.com/?q=multiple+queries&fc_project=Solr&fc_type=jira Otis Sematext :: http://sematext.com/ ::

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread stockii
Which revision is the revision of 1.4.1?

Re: Autocomplete and Sorting on multiple multi-value/single-value fields

2010-08-22 Thread Erick Erickson
Could you fill us in a little more on the behavior you're after? Because I'm having trouble understanding what "sort across title and multi-valued fields" means... If every document has a title, and title is unique, then there's no need to sort by anything else. Sub-sorts only make sense if you ha

Re: solr

2010-08-22 Thread Erick Erickson
Make sure you're clear on the concepts though. "Unique ID" is kind of fuzzy. Unique Key is a field YOU define in your schema. This will cause SOLR to overwrite any document already in the index with a particular unique key when another document is added. Examples would be, say, "inventory_number" o
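
The overwrite behavior described here can be pictured with a toy in-memory model. This is only a sketch under stated assumptions, not Solr's implementation: the index is modeled as a plain dictionary, `add_document` is a hypothetical helper, and `inventory_number` is the example unique key field named in the message.

```python
def add_document(index, doc, unique_key="inventory_number"):
    """Add `doc` to the toy index, replacing any document with the same key."""
    index[doc[unique_key]] = doc
    return index


index = {}
add_document(index, {"inventory_number": "A-1", "name": "widget"})
# Same unique key: the second document replaces the first one.
add_document(index, {"inventory_number": "A-1", "name": "widget v2"})
print(len(index), index["A-1"]["name"])
```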

Re: Autocomplete and Sorting on multiple multi-value/single-value fields

2010-08-22 Thread Neil Lott
Hi Eric, I think this query explains what I'm trying to do to an extent minus the sorting: > http://localhost:8983/solr/core/select/?q=(titleac:dr or castac:dr or > crewac:dr)&version=2.2&start=0&rows=100&indent=on&fl=title,cast,crew If I get a match in the title field or the cast field

Re: Tokenising on Each Letter

2010-08-22 Thread Erick Erickson
I suspect (though I can't say for sure since you didn't include your schema definition, both type and actual field def) that your problem stems from WordDelimiterFilterFactory options. The default in the schema usually has catenateAll=0, in which case you have the tokens "ads" and "12" but not "ads
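
To illustrate the effect of the catenate-all option mentioned here, the sketch below uses a deliberately simplified stand-in for a word-delimiter filter. It is an assumption-laden toy, not the behavior of WordDelimiterFilterFactory itself (which has many more options): `word_delimiter` is a hypothetical function, and the input string `ads-12` is invented for the example.

```python
import re


def word_delimiter(text, catenate_all=False):
    """Split `text` into runs of letters and runs of digits.

    With catenate_all=True, also emit all parts joined into one token.
    Simplified model only; the real filter handles case changes and more.
    """
    parts = re.findall(r"[A-Za-z]+|[0-9]+", text)
    tokens = list(parts)
    if catenate_all and len(parts) > 1:
        tokens.append("".join(parts))
    return tokens


print(word_delimiter("ads-12"))
print(word_delimiter("ads-12", catenate_all=True))
```

In this toy model a search for the joined form can only match when the catenated token was emitted at index time, which is the kind of mismatch the message points at.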

Re: How to limit rows to which highlighting applies

2010-08-22 Thread Alex Baranau
Hello Mitch, Agreed. Basically, you described the same context/needs. Your suggestion about adding the possibility to create more than one result set for a single Solr request is exactly what I meant in the last paragraph of my initial message (I called it "specifying extra group(s) of rows" and "we could ex

Re: Wild card based filter queries

2010-08-22 Thread Erick Erickson
Well, you can't really do a *blah* without arranging your index appropriately (n-grams come to mind). Queries of this form are really, really expensive. You can do leading and trailing wildcards (i.e. *blah or blah*), but even leading wildcards require some changes to your schema... The short ans
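
The n-gram idea mentioned for *blah* style queries can be sketched as follows. This is a minimal illustration under assumptions, not a Solr configuration: `ngrams` is a hypothetical helper, and the gram sizes are arbitrary. Indexing every substring of a term within a size range turns an infix match into an exact lookup of one gram, at the cost of a larger index.

```python
def ngrams(term, min_n=2, max_n=4):
    """Return all character n-grams of `term` with min_n <= n <= max_n."""
    grams = set()
    for n in range(min_n, max_n + 1):
        for i in range(len(term) - n + 1):
            grams.add(term[i:i + n])
    return grams


indexed = ngrams("blahs", min_n=3, max_n=4)
print(sorted(indexed))
# An infix query such as *lah* becomes a plain membership test on the grams.
print("lah" in indexed)
```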

Re: Scoring of documents, boost partial and exact hits in one field

2010-08-22 Thread Marc Harding
Hi, thanks for your suggestion (so simple, why didn't I think of that). When using AND I didn't get the desired result, probably because "work" matches work* and work. I changed the query to something like q=title:work OR title:work?* which works. I'll have to try that with real documents/examp

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread stockii
Oha, thx, I will try it =)

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread Drew Farris
On Sun, Aug 22, 2010 at 8:22 AM, stockii wrote: > > thx for you reply. > > i dont want to test my own classes in unittest. i try to understand how solr > works , because i write a little text about solr and lucene. so i want go > through the code, step by step and find out on which places is solr

Re: Doing Shingle but also keep special single word

2010-08-22 Thread MitchK
Hi, the keepword filter is no solution for this problem, since it would mean that one has to manage a word dictionary. As explained, this would take too much effort. You can easily add outputUnigrams=true and check out the analysis.jsp for this field. So you can see how much
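
What the outputUnigrams setting changes can be shown with a simplified model of shingling. This is a sketch under assumptions, not Solr's shingle filter: `shingles` is a hypothetical function, and the sample tokens are invented. It only counts emitted tokens; actual index size depends on many other factors.

```python
def shingles(tokens, max_size=2, output_unigrams=True):
    """Emit word n-grams (shingles) up to max_size, optionally with unigrams."""
    out = []
    for i in range(len(tokens)):
        if output_unigrams:
            out.append(tokens[i])
        for n in range(2, max_size + 1):
            if i + n <= len(tokens):
                out.append(" ".join(tokens[i:i + n]))
    return out


words = ["solr", "shingle", "filter"]
print(shingles(words, output_unigrams=True))
print(shingles(words, output_unigrams=False))
```

With unigrams enabled, the single words are kept alongside the two-word shingles, so the number of emitted tokens grows accordingly.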

Re: Doing Shingle but also keep special single word

2010-08-22 Thread Ahmet Arslan
> Isn't set outputUnigrams="true" will > make index size about twice than when it's set to false? Sure index will be bigger. I didn't know that this is problem for you. But if you have a list of special single words that you want to keep, keepwordfilter can eliminate other tokens. So index size

Re: How to limit rows to which highlighting applies

2010-08-22 Thread MitchK
Alex, it sounds like it would make sense. Use cases could be, e.g., clustering or similar techniques. However, in my opinion the point of view for such a modification is not the right one. I.e., one wants to get several result sets. I could imagine that one does a primary query (the query for the d

Re: Scoring of documents, boost partial and exact hits in one field

2010-08-22 Thread Jan Høydahl / Cominvent
Hi, Try a wildcard term with lower score: q=title:work AND title:work*&debugQuery=true You will now see from the debug printout that you get an extra boost for workload. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solrtraining.com On 22.
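
The query suggested in this reply needs URL encoding before it is sent. The sketch below builds such a request URL; it is an illustration under assumptions: `debug_query_url` is a hypothetical helper, and the base URL `http://localhost:8983/solr/select` is assumed rather than taken from this message. No request is actually made.

```python
from urllib.parse import urlencode


def debug_query_url(query, base="http://localhost:8983/solr/select"):
    """Build a select URL for `query` with debugQuery=true (base is assumed)."""
    params = urlencode({"q": query, "debugQuery": "true"})
    return f"{base}?{params}"


url = debug_query_url("title:work AND title:work*")
print(url)
```

The same helper can be used to compare alternatives, for example passing `title:work OR title:work?*` and inspecting the debug section of each response.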

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread stockii
1. Did you mean the Solr WAR, or all libs in /trunk/solr/lib/? 2. What is the revision of the actual 1.4.1? 3. EmbeddedSolrServer? What should this little program do?

Re: Doing Shingle but also keep special single word

2010-08-22 Thread scott chu
Won't setting outputUnigrams="true" make the index about twice as large as when it's set to false? Scott - Original Message - From: "Ahmet Arslan" To: Sent: Saturday, August 21, 2010 1:15 AM Subject: Re: Doing Shingle but also keep special single word I am building index with Shingle

Scoring of documents, boost partial and exact hits in one field

2010-08-22 Thread Marc Harding
Hello, I need some help adjusting the scoring of documents. Assume I have two documents with only one field "title"; the fieldtype is text (nothing changed), and indexed and stored are true. Doc1's title is "work", doc2's title is "work and workload". A search for "work" will now return the following order

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread Geert-Jan Brits
1. Download the Solr libs and import them into your project. 2. Download the Solr source code of the same version and attach it to the libraries. (I haven't got Eclipse open, but it is something like project -> settings -> jre/libraries?) 3. Write a small program yourself which calls EmbeddedSolrServer and step

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread stockii
Thx for your reply. I don't want to test my own classes in unit tests. I'm trying to understand how Solr works, because I'm writing a little text about Solr and Lucene. So I want to go through the code, step by step, and find out in which places Solr uses Lucene. When I can debug the code it's easier ;-)

How to limit rows to which highlighting applies

2010-08-22 Thread Alex Baranau
Hello Solr users and devs! Is there a way to limit the number of rows to which highlighting applies? I don't see any "hl.rows" or similar parameter description, so it looks like I need to enhance HighlightComponent to enable that. If it is not possible currently, do you think it's worth adding such po