Re: Solr logo poll

2007-04-06 Thread nick19701
A -- View this message in context: http://www.nabble.com/Solr-logo-poll-tf3537968.html#a9877009 Sent from the Solr - User mailing list archive at Nabble.com.

Re: Find docs close to a date

2007-04-05 Thread nick19701
Chris Hostetter wrote: > > off the top of my head, I can't think of any better way to do what you are > doing "out of the box" with Solr ... if you wanted to write a bit of > custom Java code, a FunctionQuery ValueSource that made a "bell curve" > around a particular value would be a very cool/
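The "bell curve" idea in the reply above can be sketched outside Solr. This is plain Python, not a FunctionQuery or ValueSource implementation; the function name `bell_curve_score` and the `width` parameter are assumptions made for illustration only.

```python
import math


def bell_curve_score(value, center, width):
    """Gaussian weight: 1.0 when value == center, falling toward 0 with distance.

    `width` plays the role of a standard deviation (hypothetical parameter).
    """
    if width <= 0:
        raise ValueError("width must be positive")
    return math.exp(-((value - center) ** 2) / (2.0 * width ** 2))


# Made-up sample values, e.g. dates expressed as days since some epoch.
center_day = 100.0
for day in (90.0, 99.0, 100.0, 101.0, 110.0):
    print(day, bell_curve_score(day, center_day, width=5.0))
```

Sorting by such a score would rank documents by closeness to the chosen value on either side, which is the effect the reply describes.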

Find docs close to a date

2007-04-02 Thread nick19701
Let me use this date as an example: 2007-03-25T17:22:00Z. My docs have a date field and I need to find the two docs whose dates are closest to 2007-03-25T17:22:00Z. I use the following two queries to accomplish the task: date:{* TO 2007-03-25T17:22:00Z};date desc&start=0&rows=1 date:{2007-03-
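What the two-query approach computes can be illustrated in memory. The second query is cut off in the archive, so this sketch only assumes it mirrors the first (the earliest doc after the target); that is an assumption, not something the message confirms. The helper names and the sample dates are hypothetical.

```python
from datetime import datetime, timezone


def parse_solr_date(text):
    """Parse a date written like 2007-03-25T17:22:00Z into an aware datetime."""
    return datetime.strptime(text, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def nearest_on_each_side(dates, target):
    """Return (latest date before target, earliest date after target).

    Either element is None when no date exists on that side.
    The first element matches an exclusive range up to the target,
    sorted descending, limited to one row.
    """
    before = [d for d in dates if d < target]
    after = [d for d in dates if d > target]
    return (max(before) if before else None, min(after) if after else None)


target = parse_solr_date("2007-03-25T17:22:00Z")
# Made-up document dates for illustration.
sample = [parse_solr_date(s) for s in (
    "2007-03-20T00:00:00Z",
    "2007-03-25T10:00:00Z",
    "2007-03-26T09:00:00Z",
)]
print(nearest_on_each_side(sample, target))
```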

Re: which one will save hard disk space?

2007-03-26 Thread nick19701
Mike Klaas wrote: > > Storing and indexing are completely disjoint: indexing is a lossy > operation, so if you want to be able to retrieve the original contents, > they must be stored separately (ie., the first option uses the least > space). > > -Mike > > But here the "signature" field has fie

which one will save hard disk space?

2007-03-26 Thread nick19701
I don't need to search the "signature" field. But my intuition tells me that if I index this field, I will use less hard disk space since a lot of docs may have the same signature. Am I right?

XmlPullParserException

2007-03-19 Thread nick19701
I'm trying to put this html page into my solr instance. http://creditboards.com/forums/index.php?showtopic=30893 I surrounded the html with the following: Then I got the XmlPullParserException. If you take a close look at the source of

Re: Returning xx number of each group in a single query?

2007-03-18 Thread nick19701
How about returning at most 1 result of each group in a single query? For example, a website may have a lot of pages. When Google returns search results, it only shows at most one result for each website. I have a similar situation. Is there an easy way to handle this kind of problem?
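One way to picture "at most one result per group" is as a client-side pass over an already ranked result list. This is a sketch in plain Python, not a Solr feature; the function name `first_per_group` and the `site` field are hypothetical.

```python
def first_per_group(results, key):
    """Keep only the first (highest-ranked) result for each value of `key`."""
    seen = set()
    kept = []
    for doc in results:
        group = doc[key]
        if group not in seen:
            seen.add(group)
            kept.append(doc)
    return kept


# Made-up ranked results; "site" stands in for whatever field defines a group.
ranked = [
    {"id": 1, "site": "a.example"},
    {"id": 2, "site": "a.example"},
    {"id": 3, "site": "b.example"},
]
print(first_per_group(ranked, "site"))
```

A pass like this shrinks the page, so a client would need to request more rows than it intends to display.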

Re: [2] synonym filter fix

2007-03-07 Thread nick19701
Mike Klaas wrote: > > > Using the latest solr trunk, the error I reproduced before seems fixed > (see below). Note that if you are using term vectors with > highlighting, you'll have to reindex. > > org.apache.solr.analysis.SynonymFilterFactory {expand=true, > ignoreCase=true, synonyms=synony

synonym filter fix

2007-03-07 Thread nick19701
About this synonym filter fix: http://issues.apache.org/jira/browse/SOLR-167 I tried today's solr build. It seems the verbose analysis of the index analyzer still has the same symptom. Should I try another build? Or maybe my tomcat is not using the l

Re: [2] Highlighting problems with HTML tagged fields

2007-03-07 Thread nick19701
Chris Hostetter wrote: > > > patches for issues can't be applied until someone who cares about them > writes them and contributes them for committers to consider/apply :) > > It seems I'm one of the very few people who care about this feature :) Unfortunately my daily languages are C++ and C#

Re: [2] Highlighting problems with HTML tagged fields

2007-03-07 Thread nick19701
Chris Hostetter wrote: > > > It is tracked in http://issues.apache.org/jira/browse/SOLR-42 > > ...there are currently no patches. > > The suggested fix from Mirko seems very simple. Hopefully a patch will be applied very soon. In the meantime, I'll use my backup solution: http://fucoder.co

Re: [2] Highlighting problems with HTML tagged fields

2007-03-06 Thread nick19701
Yonik Seeley wrote: > > HTMLStripWhitespaceTokenizerFactory works in two phases... > HTMLStripReader removes the HTML and passes the result to > WhitespaceTokenizer... at that point, Tokens are generated, but the > offsets will correspond to the text after HTML removal, not before. > > I did it
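The offset mismatch described above can be reproduced in a few lines. This is a simplified Python illustration, not the HTMLStripReader or WhitespaceTokenizer code: the regex-based tag stripping and the helper names are assumptions, and the real reader may handle more cases.

```python
import re

TAG = re.compile(r"<[^>]*>")


def strip_with_offset_map(html):
    """Remove tags and return (stripped_text, offsets).

    offsets[i] is the index in `html` of the character stripped_text[i].
    """
    chars, offsets = [], []
    pos = 0
    for match in TAG.finditer(html):
        for i in range(pos, match.start()):
            chars.append(html[i])
            offsets.append(i)
        pos = match.end()
    for i in range(pos, len(html)):
        chars.append(html[i])
        offsets.append(i)
    return "".join(chars), offsets


def whitespace_tokens(text):
    """Return (token, start, end) triples, offsets relative to `text`."""
    return [(m.group(), m.start(), m.end()) for m in re.finditer(r"\S+", text)]


html = "<b>Dell</b> laptop"
stripped, offsets = strip_with_offset_map(html)
for token, start, end in whitespace_tokens(stripped):
    original_start = offsets[start]
    # Offsets from the stripped text point at the wrong place in the raw HTML.
    print(token, "stripped:", (start, end), "raw slice:", repr(html[start:end]),
          "mapped start:", original_start)
```

Carrying a map like `offsets` alongside the stripped text is one possible way to translate token positions back to the raw HTML before highlighting.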

Re: highlight search keywords on html page

2007-02-20 Thread nick19701
Chris Hostetter wrote: > > I'm not really sure that Solr can help you in this case ... it only knows > about the data you give it -- if you want it to highlight the raw html of > the entire page, then you're going to need to store the raw html of the > entire page in the index. > > you can still

Re: highlight exception

2007-02-20 Thread nick19701
Mike Klaas wrote: > > nick, > > It looks as though there is a bug in the synonym filter. Since you > are using Solr's example synonym list, perhaps it would be sufficient > to remove that from your analyzer chain (schema.xml)? At least that > would prevent crashes until the bug is fixed. > >

Re: highlight exception

2007-02-19 Thread nick19701
Mike Klaas wrote: > > That 60, 8 produced by the synonym filter is surely a sign of a bug > (and what is producing the -52). What is your list of synonyms? > > -Mike > > Here it is: # The ASF licenses this file to You under the Apache License, Version 2.0 # (the "License"); you may not use thi

Re: highlight search keywords on html page

2007-02-16 Thread nick19701
Chris Hostetter wrote: > > I'm not sure I'm understanding your question ... is it how to highlight a > stored field that has HTML in it, or how to index a chunk of HTML text? > > the first should be no different than highlighting any other bit of text > -- the second can be accomplished using

highlight search keywords on html page

2007-02-16 Thread nick19701
With solr, I can generate a list of links containing highlighted fragments. After a user clicks a link, I will fetch the stored and not-indexed html from solr and return it to the user. But I want search keywords within the html to be highlighted just like Google. I'm wondering what people are using t

Re: highlight exception

2007-02-15 Thread nick19701
Mike Klaas wrote: > > Corresponds to: > startOffset = > tokenGroup.matchStartOffset; > endOffset = > tokenGroup.matchEndOffset; > tokenText = > text.substring(startOffset, endO

highlight exception

2007-02-15 Thread nick19701
I have thousands of docs in my solr instance. The following doc (maybe others) causes an exception every time highlighting is turned on. Best buy - Acer Aspire AS5610-2273 - $599. Windows vista, 1 GB RAM The exception is like this: java.lang.StringIndexOutOfBoundsException: String index out

Re: question about synonyms

2007-02-13 Thread nick19701
Yonik Seeley wrote: > > Are you using the synonyms at index time, query time, or both? > Did you reindex if you made changes to an "index" analyzer? > It would help if you post the fieldtype for the field you are searching. > I am using the synonyms only at query time. Below is the field analy

question about synonyms

2007-02-13 Thread nick19701
Hi, I put this line in my synonyms.txt: bestbuy,bb,best buy. I expect that when bb is searched, all results including "bestbuy", "bb" or "best buy" will be returned. But in my test I only got back the results which include "bestbuy" or "best buy". The results which include "bb" are not returned.
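The expectation in this message can be written down as a small table. This is a Python sketch of what "every term in the line maps to every term" would look like, not the SynonymFilter's actual algorithm; `parse_synonym_line` and `expand_query_term` are hypothetical names.

```python
def parse_synonym_line(line):
    """Turn one comma-separated synonyms line into a term -> all-terms mapping."""
    terms = [t.strip() for t in line.split(",") if t.strip()]
    return {term: list(terms) for term in terms}


def expand_query_term(term, table):
    """Return every variant the searcher expects to match for `term`."""
    return table.get(term, [term])


table = parse_synonym_line("bestbuy,bb,best buy")
print(expand_query_term("bb", table))
print(expand_query_term("dell", table))
```

Note that "best buy" is a two-word entry, so any analysis that splits on whitespace sees it as two tokens rather than one key.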

Re: question about highlighting

2007-02-13 Thread nick19701
Hi, Andre, I tried hl=true. But it still doesn't work. Here is my request: select?indent=on&version=2.2&q=pageContent%3Adell&start=0&rows=10&fl=pageContent&qt=standard&wt=standard&explainOther=&hl=true&hl.fl=pageContent This is part of the response: standard 10 0 pageContent on pageContent t
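The highlighting-related part of the request above can be assembled programmatically. This sketch uses only Python's standard library and the parameter names visible in the request (`q`, `start`, `rows`, `fl`, `hl`, `hl.fl`); the function name `build_highlight_query` is hypothetical, and no host or port is assumed.

```python
from urllib.parse import urlencode


def build_highlight_query(field, term, rows=10):
    """Build a relative select URL that searches `field` and highlights it."""
    params = [
        ("q", f"{field}:{term}"),
        ("start", 0),
        ("rows", rows),
        ("fl", field),
        ("hl", "true"),
        ("hl.fl", field),
    ]
    # urlencode percent-encodes the colon in the q value as %3A.
    return "select?" + urlencode(params)


print(build_highlight_query("pageContent", "dell"))
```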

question about highlighting

2007-02-13 Thread nick19701
I can't locate any concrete examples of using highlighting. After checking out the following wiki, http://wiki.apache.org/solr/HighlightingParameters I sent my solr server the following request: select?indent=on&version=2.2&q=dell&start=0&rows=10&fl=*%2Cscore&qt=standard&wt=standard&explainOthe