RE: field:(-null) returns records where field was not specified

2008-01-14 Thread Chris Hostetter
Several things in this thread should be clarified (note: order of quotations munged for clarity)... : I had read this page. But I'm not using the "NOT" operator, I'm using the : "-" operator. I'm assuming there is a subtle difference between them in : that NOT qualifies something else, hence

Re: Text Summarizer

2008-01-14 Thread Ycrux
Hi Mike and Otis, Mike Klaas a écrit : See http://wiki.apache.org/solr/HighlightingParameters . The default behaviour will provide snippets like google does. Note that you need to "store" the text of fields you want to highlight for this to work. Thanks for the help. Works like a charm.

RE: LNS - or - "now i know we've succeeded"

2008-01-14 Thread Lance Norskog
Now that Microsoft is buying FAST (!!) the open source world needs a matching technology :) -Original Message- From: Walter Underwood [mailto:[EMAIL PROTECTED] Sent: Monday, January 14, 2008 7:42 AM To: solr-user@lucene.apache.org Subject: Re: LNS - or - "now i know we've succeeded" Yes

RE: field:(-null) returns records where field was not specified

2008-01-14 Thread Lance Norskog
The *:* (star colon star) means "all records". The trick is to use (*:* AND -field:[* TO *]). It's silly, but there it is. A performance note: we switched from empty fields to fields with a standard 'empty' value. This way we don't have to do a range check to find records with empty fields. Lance

Re: Text Summarizer

2008-01-14 Thread Mike Klaas
See http://wiki.apache.org/solr/HighlightingParameters . The default behaviour will provide snippets like google does. Note that you need to "store" the text of fields you want to highlight for this to work. cheers, -Mike On 14-Jan-08, at 2:17 PM, Ycrux wrote: Maybe the right name is "S

Re: index out of disk space, CorruptIndexException

2008-01-14 Thread Chris Hostetter
: I've made room, restarted resin, and now solr won't start. No useful messages : in the startup, just a : : [21:01:49.105] Could not start SOLR. Check solr/home property : [21:01:49.105] java.lang.NullPointerException : [21:01:49.105] at org : .apache.solr.servlet.SolrDispatchFilter.init(SolrDi

Re: Text Summarizer

2008-01-14 Thread Ycrux
Maybe the right name is "Snippet". Like Google snippets. cheers Y. Otis Gospodnetic a écrit : Sounds like you are looking for a highlighter/KWIC, not a summarizer? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Ycrux <[EMAIL PROTECTED]>

Re: MoreLikeThis similarity field boosting

2008-01-14 Thread Ken Krugler
I'm using Solr for searching our system. Using MoreLikeThis for related content searching. Now url used for search is like this: http://localhost:8983/solr/mlt?q=nid:7280&mlt=true&mlt.fl=title,teaser,body&mlt.mindf=1&mlt.mintf=1&fl=nid,title,score Where "nid" is uniqueKey and "title,teaser,body" a

Re: index out of disk space, CorruptIndexException

2008-01-14 Thread Brian Whitman
On Jan 14, 2008, at 4:08 PM, Ryan McKinley wrote: ug -- maybe someone else has better ideas, but you can try: http://svn.apache.org/repos/asf/lucene/java/trunk/src/java/org/apache/lucene/index/CheckIndex.java thanks for the tip, i did run that, but I stopped it 30 minutes in, as it was s

Re: index out of disk space, CorruptIndexException

2008-01-14 Thread Ryan McKinley
ug -- maybe someone else has better ideas, but you can try: http://svn.apache.org/repos/asf/lucene/java/trunk/src/java/org/apache/lucene/index/CheckIndex.java I think that converts (what it can) to a 2.3 index. The NullPointerException should be gone in trunk, that is just an artifact of st

index out of disk space, CorruptIndexException

2008-01-14 Thread Brian Whitman
We had an index run out of disk space. Queries work fine but commits return 500 doc counts differ for segment _18lu: fieldsReader shows 104 but segmentInfo shows 212 org.apache.lucene.index.CorruptIndexException: doc counts differ for segment _18lu: fieldsReader shows 104 but segmentInfo

Re: unique ID question

2008-01-14 Thread Ryan McKinley
Evgeniy Strokin wrote: If I make one of my field as a unique ID, id doesn't increase/decrease performance of searching by this field. Right? For example if I have two fields, I know for sure both of them are unique, both the same type, and make one of them as a Solr Unique ID. The general perf

unique ID question

2008-01-14 Thread Evgeniy Strokin
If I make one of my field as a unique ID, id doesn't increase/decrease performance of searching by this field. Right? For example if I have two fields, I know for sure both of them are unique, both the same type, and make one of them as a Solr Unique ID. The general performance should be the sam

Re: Text Summarizer

2008-01-14 Thread Ycrux
Hi Otis, Don't know really what's the name for that. cheers Y. Otis Gospodnetic a écrit : Sounds like you are looking for a highlighter/KWIC, not a summarizer? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Ycrux <[EMAIL PROTECTED]> To:

Re: Text Summarizer

2008-01-14 Thread Otis Gospodnetic
Sounds like you are looking for a highlighter/KWIC, not a summarizer? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Ycrux <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Monday, January 14, 2008 2:45:09 PM Subject: Text Summarizer

MoreLikeThis similarity field boosting

2008-01-14 Thread Vladimir Garvardt
Hello. I'm using Solr for searching our system. Using MoreLikeThis for related content searching. Now url used for search is like this: http://localhost:8983/solr/mlt?q=nid:7280&mlt=true&mlt.fl=title,teaser,body&mlt.mindf=1&mlt.mintf=1&fl=nid,title,score Where "nid" is uniqueKey and "title,teaser

Text Summarizer

2008-01-14 Thread Ycrux
Hi! I'm looking for a good way to get a good "text summarizer" for my personal search engine based Solr. Actually, I'm using "ots" (Open Text Summurizer) but the result is far from perfection. Here's an example of usage: $ elinks "http://lucene.apache.org/solr/"; -force-html -no-numbering \ -no

Re: Spellchecker index rebuild error

2008-01-14 Thread Otis Gospodnetic
I haven't looked at the Spellchecker in a while, but it sounds like you are deleting the index files manually. Any reason for that? Shouldn't that rebuild command run smoothly even with a pre-existing index there (funny that I ask this, considering this was my doing). Otis -- Sematext -- http

Re: batch indexing takes more time than shown on SOLR output --> something to do with IO?

2008-01-14 Thread Otis Gospodnetic
Re monitoring IO activity iostat, vmstat, sar and such under Linux, for example. Yes, Solr doesn't count how long it takes to send the response back to the client, so if the response is large and/or network is slow, the actual number is going to be higher than the number that Solr logs. Ot

Re: new to solr

2008-01-14 Thread Ryan McKinley
the example.xsl is an example using XSLT to format results. Check: http://wiki.apache.org/solr/XsltResponseWriter For php, check: http://wiki.apache.org/solr/SolPHP ryan Ma, Xiaohui (NIH/NLM/LHC) [C] wrote: Thanks so much for your reply! Please tell me what example.xsl is for in conf/xslt.

Re: new to solr

2008-01-14 Thread Stuart Sierra
On Jan 14, 2008 11:55 AM, Ryan McKinley <[EMAIL PROTECTED]> wrote: > the example.xsl is an example using XSLT to format results. Check: > http://wiki.apache.org/solr/XsltResponseWriter To add to the above: I think the XsltResponseWriter is not intended for formatting results for display on your w

RE: new to solr

2008-01-14 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks very much, Ryan. I really appreciate it. I will take a look on both. Best regards, Xiaohui -Original Message- From: Ryan McKinley [mailto:[EMAIL PROTECTED] Sent: Monday, January 14, 2008 11:56 AM To: solr-user@lucene.apache.org Subject: Re: new to solr the example.xsl is an exam

Re: Documents with One-to-many

2008-01-14 Thread Stuart Sierra
On Jan 11, 2008 10:44 AM, Evgeniy Strokin <[EMAIL PROTECTED]> wrote: > Hello. If I need documents which has number of fields but also I have number > of other documents which related to the first one one-to-many. For example a > person, could have several addresses. I want to have all of them in

RE: new to solr

2008-01-14 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks so much for your reply! Please tell me what example.xsl is for in conf/xslt. Please let me know where the search result is located. I can use php or .net to display the result in web. Is it created on fly? Thanks, Xiaohui -Original Message- From: Ryan McKinley [mailto:[EMAIL PROT

Re: new to solr

2008-01-14 Thread Ryan McKinley
Ma, Xiaohui (NIH/NLM/LHC) [C] wrote: Hello, I am new to solr. Welcome! I followed solr online tutorial to get the example work. The search result is xml. I wonder if there is a way to show result in a form. I saw there is example.xsl in conf/xslt directory. I really don't know how to do it.

new to solr

2008-01-14 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Hello, I am new to solr. I followed solr online tutorial to get the example work. The search result is xml. I wonder if there is a way to show result in a form. I saw there is example.xsl in conf/xslt directory. I really don't know how to do it. Anyone has some ideas for me. I really appreciate it

batch indexing takes more time than shown on SOLR output --> something to do with IO?

2008-01-14 Thread Britske
I have a batch program which inserts items in a solr/lucene index. all is going fine and I get update messages in the console like: 14-jan-2008 16:40:52 org.apache.solr.update.processor.LogUpdateProcessor finish INFO: {add=[10485, 10488, 10489, 10490, 10491, 10495, 10497, 10498, ...(42 more) ]}

Re: field:(-null) returns records where field was not specified

2008-01-14 Thread Karen Loughran
Hi Erik, thanks for your reply, I had read this page. But I'm not using the "NOT" operator, I'm using the "-" operator. I'm assuming there is a subtle difference between them in that NOT qualifies something else, hence needs 2 terms. Isn't the "-" operator supposed to be a complement to th

Re: LNS - or - "now i know we've succeeded"

2008-01-14 Thread Walter Underwood
Yes, they are reputable. They've been doing consulting with Verity, Ultraseek, and other platforms for many years. --wunder On 1/12/08 1:22 AM, "Chris Hostetter" <[EMAIL PROTECTED]> wrote: > It is pretty cool to see a reputable > Search company (is ideaeng.com a reputable search consulting compa

Re: field:(-null) returns records where field was not specified

2008-01-14 Thread Erick Erickson
Have you seen this page? http://lucene.apache.org/java/docs/queryparsersyntax.html >From that page: Note: The NOT operator cannot be used with just one term. For example, the following search will return no results: NOT "jakarta apache" Erick On Jan 14, 2008 9:30 AM, Karen Loughran <[EMAIL PRO

field:(-null) returns records where field was not specified

2008-01-14 Thread Karen Loughran
Hi all, We are indexing different types of documents, some with certain fields set and some without, some fields sometimes in both. If a particular field is missing in a newly added record, I would have expected the query: field_name:(-null) not to return this particular record in the respo

Spellchecker index rebuild error

2008-01-14 Thread Doug Steigerwald
Lately I've been having issues with the spellchecker failing to properly rebuild my spell index. I used to be able to delete the spell directory and reload the core and build the index fine if it ever crapped out, but now I can't even build it. java.io.FileNotFoundException: /home/dsteiger/sol