Scoring algorithm?

2009-10-31 Thread Paul Tomblin
Am I right in thinking that a document whose sortable field is only two sentences long and contains the search term once will score higher than one that is 50 sentences long and contains the search term 4 times? Is there a way to change it to score higher based only on the number of hits? -- ht

Re: Scoring algorithm?

2009-10-31 Thread Yonik Seeley
On Sat, Oct 31, 2009 at 8:48 AM, Paul Tomblin wrote: > Am I right in thinking that a document whose sortable field is only > two sentences long and contains the search term once will score higher > than one that is 50 sentences long that contains the search term 4 > times? Yep. Assuming 15 to
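The effect Paul describes can be illustrated with a rough sketch. It assumes the classic Lucene default similarity, where the term-frequency factor is sqrt(freq) and the length norm is 1/sqrt(number of terms in the field). It ignores idf and query normalization (identical for both documents here) and the lossy one-byte encoding of norms. The term counts (30 and 750) are hypothetical figures chosen for illustration, not numbers from the thread.

```python
import math


def tf(freq: int) -> float:
    """Term-frequency factor of the classic default similarity: sqrt(freq)."""
    return math.sqrt(freq)


def length_norm(num_terms: int) -> float:
    """Length normalization: shorter fields get a larger factor."""
    return 1.0 / math.sqrt(num_terms)


def field_score(freq: int, num_terms: int, omit_norms: bool = False) -> float:
    """Simplified per-field score: tf * lengthNorm (norm is 1.0 if omitted)."""
    norm = 1.0 if omit_norms else length_norm(num_terms)
    return tf(freq) * norm


# Hypothetical documents: a short field with 1 hit, a long field with 4 hits.
short_with_norms = field_score(freq=1, num_terms=30)
long_with_norms = field_score(freq=4, num_terms=750)

# Same documents with omitNorms="true" on the field.
short_no_norms = field_score(freq=1, num_terms=30, omit_norms=True)
long_no_norms = field_score(freq=4, num_terms=750, omit_norms=True)

print(short_with_norms > long_with_norms)  # short field wins with norms
print(long_no_norms > short_no_norms)      # more hits win without norms
```

Even with norms omitted, the score in this model grows with the square root of the hit count rather than linearly, so it ranks by number of hits without being proportional to it.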

Re: Scoring algorithm?

2009-10-31 Thread Paul Tomblin
If I change the schema this way, do I need to re-submit all the documents to Solr? And if I have them all sitting on disk as XML files that look like ... ... is there a quick way to submit them all to Solr? On Sat, Oct 31, 2009 at 10:04 AM, Yonik Seeley wrote: > On Sat, Oct 31, 2009 at 8:48 A

Re: Scoring algorithm?

2009-10-31 Thread Yonik Seeley
On Sat, Oct 31, 2009 at 10:22 AM, Paul Tomblin wrote: > If I change the schema this way, do I need to re-submit all the > documents to Solr? Yep. And you should delete the index first before doing so (some field properties are contagious... merging a segment w/o norms and a segment with norms wi

Ok, that didn't work

2009-10-31 Thread Paul Tomblin
I was looking at the script in example/exampledocs to feed documents to the server. Just to see if it was possible, I took one of the documents that I've previously indexed using SolrJ, and I tried to feed it directly to the Solr server using the following command: curl http://localhost:8697/solr

Re: Ok, that didn't work

2009-10-31 Thread Yonik Seeley
Hmmm... perhaps you're missing the <add> tag around the <doc>? -Yonik http://www.lucidimagination.com On Sat, Oct 31, 2009 at 10:37 AM, Paul Tomblin wrote: > I was looking at the script in example/exampledocs to feed documents > to the server. > > Just to see if it was possible, I took one of the docume

Re: Ok, that didn't work

2009-10-31 Thread Paul Tomblin
The add tag isn't part of the document. Is there a way to feed the actual documents without adding tags that aren't part of the schema to them? On Sat, Oct 31, 2009 at 10:43 AM, Yonik Seeley wrote: > Hmmm... perhaps you're missing the <add> tag around the <doc>? > > -Yonik > http://www.lucidimagination.co

Re: Ok, that didn't work

2009-10-31 Thread Yonik Seeley
On Sat, Oct 31, 2009 at 10:45 AM, Paul Tomblin wrote: > The add tag isn't part of the document. Yeah, I know... but that's the way the current XML loader expects it. The history is that these were XML commands being posted, rather than XML docs (so one could specify other commands like commit, an
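Since the XML loader expects an add command wrapped around each document, one workaround is to add the wrapper at submission time and leave the files on disk untouched. The following is a minimal sketch, assuming each file holds a single bare doc element. The helper names wrap_in_add and build_update_request are hypothetical, and the update URL is passed in by the caller rather than assumed here.

```python
import urllib.request
import xml.etree.ElementTree as ET


def wrap_in_add(doc_xml: str) -> str:
    """Wrap a bare <doc> element in the <add> command the XML loader expects."""
    doc = ET.fromstring(doc_xml)
    if doc.tag != "doc":
        raise ValueError(f"expected a <doc> root element, got <{doc.tag}>")
    add = ET.Element("add")
    add.append(doc)
    return ET.tostring(add, encoding="unicode")


def build_update_request(update_url: str, doc_xml: str) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the wrapped document."""
    body = wrap_in_add(doc_xml).encode("utf-8")
    return urllib.request.Request(
        update_url,
        data=body,
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )


# Example with a made-up document; nothing is sent over the network here.
sample_doc = '<doc><field name="id">1</field></doc>'
wrapped = wrap_in_add(sample_doc)
print(wrapped)
```

Sending the request with urllib.request.urlopen and issuing a commit afterwards is left to the caller, because both depend on the local Solr setup.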

Re: Ok, that didn't work

2009-10-31 Thread Paul Tomblin
On Sat, Oct 31, 2009 at 11:08 AM, Yonik Seeley wrote: > I personally think it would be cleaner to allow a post of just a > (or multiple with a surrounding tag), esp now that we can put > modifiers in the URL. Exactly. The action should be in the url. > > For now, just use shell scripting I gue

Re: Solr Cell on web-based files?

2009-10-31 Thread Insight 49, LLC
markus.rietz...@rzf.fin-nrw.de wrote: curl reads from local file or stdin, so you could do something like if it is only a single file from a webserver curl http://someserver/file.html/ | curl "http://localhost:8983/solr/update/extract?extractOnly=true" -F na...@- but this way no crawling, no

Re: Solr Cell on web-based files?

2009-10-31 Thread Yonik Seeley
On Sat, Oct 31, 2009 at 12:52 PM, Insight 49, LLC wrote: > Are local file URIs a limitation of Solr Cell, or just curl; All of Solr's interfaces are currently based on HTTP and usable over a network. Curl (like wget) is simply a useful command line tool that can speak HTTP and is nice for testing.

Another question about omitNorms

2009-10-31 Thread Paul Tomblin
In an earlier message, Yonik suggested that I use omitNorms="true" if I wanted the length of the document to not be counted in the scoring. The documentation also mentions that it omits "index-time boosting". What does that mean? -- http://www.linkedin.com/in/paultomblin http://careers.stackoverf

Re: Another question about omitNorms

2009-10-31 Thread Yonik Seeley
On Sat, Oct 31, 2009 at 3:18 PM, Paul Tomblin wrote: > In an earlier message, Yonik suggested that I use omitNorms="true" if > I wanted the length of the document to not be counted in the scoring. > The documentation also mentions that it omits "index-time boosting". > What does that mean? You ca

Match all terms in doc

2009-10-31 Thread Magnus Eklund
Hi How do I restrict hits to documents containing all words (regardless of order) of a query in a particular field? Suppose I have two documents with a field called name in my index: doc1 => name: Pink doc2 => name: Pink Floyd When querying for "Pink" I want only doc1 and when querying for "P

Re: solr web ui

2009-10-31 Thread Uri Boness
If you wish to save yourself from the hassle of applying the patch, you can also download it from http://www.jteam.nl/news/solrexplorer Uri Grant Ingersoll wrote: There is also a GWT contribution in JIRA that is pretty handy and will likely be added in 1.5. See http://issues.apache.org/jira/

Re: Match all terms in doc

2009-10-31 Thread AHMET ARSLAN
> Hi > > How do I restrict hits to documents containing all words > (regardless of order) of a query in a particular field? > > Suppose I have two documents with a field called name in my > index: > > doc1 => name: Pink > doc2 => name: Pink Floyd > > When querying for "Pink" I want only doc1 and

json.wrf parameter

2009-10-31 Thread Ankit Bhatnagar
Hi Yonik, I have a question regarding the json.wrf parameter that you introduced in Solr queries. I am using the YUI DataSource widget and it accepts JSONP format. Could you tell me if I specify json.wrf in the query will Solr return the response enclosed in () which is essentially JSONP format. As of

Re: best way to model 1-N

2009-10-31 Thread Avlesh Singh
> > what am I missing? > Change your to The "splitBy" directive is understood by this transformer and in your case the attribute was simply ignored. Don't forget to re-index once you have changed. Cheers Avlesh On Fri, Oct 30, 2009 at 9:33 PM, Joel Nylund wrote: > Thanks Chantal, I will keep

Re: autocomplete

2009-10-31 Thread Avlesh Singh
> > > "q=*:*&fq=ac:*all*&wt=json&rows=15&start=0&indent=on&omitHeader=true&json.wrf=?" > Why is the json.wrf not specified? Without the callback function, the string that is returned is illegal JavaScript for the browser. You need to specify this parameter which is a wrapper or a callback funct
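To make the role of json.wrf concrete: the parameter names a function, and the JSON response comes back wrapped in a call to that function, which is the JSONP shape a browser can execute. The sketch below builds such a query string and shows the shape of the wrapped response. The base URL, the callback name handleResults, and the helper names are assumptions for illustration, and wrap_jsonp only imitates the shape of the output and is not Solr code.

```python
import json
from urllib.parse import urlencode


def build_query_url(base_url: str, callback: str) -> str:
    """Build a select URL asking for JSON wrapped in a callback function."""
    params = {"q": "*:*", "wt": "json", "json.wrf": callback}
    return base_url + "?" + urlencode(params)


def wrap_jsonp(callback: str, payload: dict) -> str:
    """Imitate the JSONP shape: callback(<json>)."""
    return f"{callback}({json.dumps(payload)})"


# Hypothetical endpoint and callback name.
url = build_query_url("http://localhost:8983/solr/select", "handleResults")
jsonp = wrap_jsonp("handleResults", {"numFound": 0})

print(url)
print(jsonp)
```

Leaving the callback as a literal ? only works when a client library substitutes a generated function name before the request is sent.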

Re: Iso accents and wildcards

2009-10-31 Thread Avlesh Singh
> > When I request with title:econ* I can have the correct answers, but if I > request with title:écon* I have no answers. > If I request with title:économ (the exact word of the index) it works, so > there might be something wrong with the wildcard. > As far as I can understand the analyser
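One common workaround, sketched here under the assumption that the index stores accent-folded terms and that wildcard queries bypass the field's analyzer, is to fold accents on the client before building the wildcard query. The helper name fold_accents is hypothetical, and Unicode decomposition is only an approximation of whatever filter the schema actually applies.

```python
import unicodedata


def fold_accents(text: str) -> str:
    """Strip combining marks after canonical decomposition (é -> e)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Fold the user's input before sending it as a wildcard query.
user_input = "écon*"
query = "title:" + fold_accents(user_input)
print(query)
```

Characters that do not decompose into a base letter plus a combining mark pass through unchanged, so the result may still differ from the index-time filter for some inputs.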

Re: Indexing multiple entities

2009-10-31 Thread Avlesh Singh
> > The use case on DocumentObjectBinder is that I could override > toSolrInputDocument, and if field = ID, I could do: setField("id", > obj.getClass().getName() + obj.getId()) or something like that. > Unless I am missing something here, can't you write the getter of id field in your solr bean as

Re: "begins with" searches

2009-10-31 Thread Avlesh Singh
> > G'day Avlesh, converting the "all" field to type "edgytext" doesn't work as > expected as the various "text" analysers etc don't get to work on that > field, so I get less results than expected. And adding the edgy filter into > the text field also yields less results. I can work around the iss