Re: Can I exclude certain terms from MoreLikeThis query?

2009-08-24 Thread Paras Chopra
Hi Koji, Thank you for your reply. Actually, the terms I would like to exclude would be based on the document I use for MoreLikeThis Query. As I understand from StopFilter, it is a static method to exclude terms such as stop words. My problem is that I want to return theme/area specific results fo

Re: Can I exclude certain terms from MoreLikeThis query?

2009-08-24 Thread Koji Sekiguchi
Paras Chopra wrote: Hi All, I am tinkering with MoreLikeThis component of Solr and had a particular use case where I would like to exclude certain terms from consideration while MoreLikeThis makes a query vector out of a document. Is it possible with Solr? I searched for this in the documentation

Can I exclude certain terms from MoreLikeThis query?

2009-08-24 Thread Paras Chopra
Hi All, I am tinkering with MoreLikeThis component of Solr and had a particular use case where I would like to exclude certain terms from consideration while MoreLikeThis makes a query vector out of a document. Is it possible with Solr? I searched for this in the documentation but wasn't able to fi

Exact word search

2009-08-24 Thread bhaskar chandrasekar
Hi,   Can any one helpe me with the below scenario?.   Scenario 1:   Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give "BHASKAR" as input string It should give me search results pertaining to BHASKAR only.

Re: Solr Query help - sorting

2009-08-24 Thread erikea...@yahoo.com
Clever... but if more than one row adds up to the same value I may get the wrong order (like 50, 50 and 10, 90) I need a max function but the one provided only compares against a constant. Sent from my iPhone On Aug 24, 2009, at 6:15 PM, Koji Sekiguchi wrote: How about using sum() FunctionQu

Re: schema browing character encoding

2009-08-24 Thread Koji Sekiguchi
smock wrote: Hello, I'm trying to use the schema browser (/file/?file=schema.xml) to examine the schema of my solr installation, but am having problems with character encodings. Everything I have is defined in UTF-8, and I can read the file locally with that encoding without any problems. Howe

Exact pattern search in Solr

2009-08-24 Thread bhaskar chandrasekar
Hi,   Can any one helpe me with the below scenario?.   Scenario 1:   Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give "BHASKAR" as input string It should give me search results pertaining to BHASKAR only.

Re: Solr Query help - sorting

2009-08-24 Thread Koji Sekiguchi
How about using sum() FunctionQuery, via: http://localhost:8983/solr/select/?q=_val_%3A%22sum%28width%2Clength%29%22 Thanks, Koji Erik Earle wrote: Suppose I have a database of rectangle attributes that I have indexed in Solr and I want to get the top 10 widest or longest rectangles from on

Re: solr.StopFilterFactory not filtering words

2009-08-24 Thread Yonik Seeley
On Mon, Aug 24, 2009 at 8:58 PM, darniz wrote: > > HI > i dont understand whats wrong here. > i am trying to implement solr.StopFilterFactory > here is my field type definition >     >         >             >                 ignoreCase="true"/> >         >     > > and i have field which is >   req

solr.StopFilterFactory not filtering words

2009-08-24 Thread darniz
HI i dont understand whats wrong here. i am trying to implement solr.StopFilterFactory here is my field type definition and i have field which is when i try to insert a doc like an elphant is an animal it does not remove an and

Multiple field / pdf file per document

2009-08-24 Thread Joe Kessel
New to Solr, not so new to search. I have an existing data model that I am pushing into a Solr index. For example, I am indexing a product which includes product brochures in multiple locales. So this single Solr document contains multiple text fields which require linguistics analyzers. Th

how to get highlighter to only show matched term

2009-08-24 Thread Kevin Osborn
If my query is something like manufacturer:IBM OR productTitle:Thinkpad, I actually just want to printout "IBM" or "Thinkpad" in any of the highlighted fields. I don't want to parse through any HTML or other text. Basically, I just want to know which of the terms in my query matched and in which

Solr Query help - sorting

2009-08-24 Thread Erik Earle
Suppose I have a database of rectangle attributes that I have indexed in Solr and I want to get the top 10 widest or longest rectangles from one query. name | width | length A | 100| 10 B | 10 | 90 C | 80 | 10 ... My indexed fields are: name, width, length I

Re: defining qf in your own request handler

2009-08-24 Thread darniz
Continuing on this i am having a use case where i have to strip out single quote for certain fields for example for testing i added teh following fieldType in schema.xml file and then i declared a field like this. So i am assuming if i index a docum

Re: Responses getting truncated

2009-08-24 Thread Uri Boness
It can very well be an issue with the data itself. For example, if the data contains un-escaped characters which invalidates the response. I don't know much about ruby, but what do you get with wt=json? Rupert Fiasco wrote: I am seeing our responses getting truncated if and only if I search on

shingle filter

2009-08-24 Thread Joe Calderon
hello *, im currently faceting on a shingled field to obtain popular phrases and its working well, however ide like to limit the number of shingles that get created, the solr.ShingleFilterFactory supports maxShingleSize, can it be made to support a minimum as well? can someone point me in the right

Responses getting truncated

2009-08-24 Thread Rupert Fiasco
I am seeing our responses getting truncated if and only if I search on our main text field. E.g. I just do some basic like title_t:arthritis Then I get a valid document back. But if I add in our larger text field: title_t:arthritis OR text_t:arthritis then the resultant document is NOT valid X

September Hadoop Get Together

2009-08-24 Thread Isabel Drost
I would like to announce the September-2009 Hadoop Get Together in newthinking store Berlin. When: 29. September 2009 at 5:00pm Where: newthinking store, Tucholskystr. 48, Berlin, Germany As always there will be slots of 20min each for talks on your Hadoop topic. After each ta

Re: multi-language search

2009-08-24 Thread Uri Boness
I can think of ways to tackle your problem: Option 1: each document will have a field indicating its language. Then, when searching, you can simply filter the query on the language you're searching on. Advantages: everything is in one index, so if in the future you will need to do a cross lang

Re: solr nutch url indexing

2009-08-24 Thread Uri Boness
Hi, Nutch comes with support for Solr out of the box. I suggest you follow the steps as described here: http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ Cheers, Uri Fuad Efendi wrote: Is SolrIndex plugin for Nutch? Thanks! -Original Message- From: Uri Boness [mailto:ub

RE: solr nutch url indexing

2009-08-24 Thread Fuad Efendi
Is SolrIndex plugin for Nutch? Thanks! -Original Message- From: Uri Boness [mailto:ubon...@gmail.com] Sent: August-24-09 4:42 PM To: solr-user@lucene.apache.org Subject: Re: solr nutch url indexing How did you configure nutch? Make sure you have the "parse-html" and "index-basic" conf

Dismax Wildcard Queries

2009-08-24 Thread Kurt N.
Hello all. We have a situation in the requirements for our project that make it desireable to be able to perform a DisMax query with wildcard (* and ?) characters in it. We are using the standard release (not nightly) of Solr 1.3. Our first thought was to apply the SOLR-756 patch (http://issues

Re: solr nutch url indexing

2009-08-24 Thread Uri Boness
How did you configure nutch? Make sure you have the "parse-html" and "index-basic" configured. The HtmlParser should by default extract the page title and add to the parsed data, and the BasicIndexingFilter by default adds this title to the NutchDocument and stores it in the "title" filed. All

why would a search for a specific field value fail when data is present?

2009-08-24 Thread Brian Klippel
Hopefully, someone can tell me what is going wrong here. I have a field, "SearchObjectType", and a large number of the documents indexed in a give core have a value of "USER_PROFILE". When I examine the schema browser in admin, it confirms this. The term "USER_PROFILE" is the top term for

Re: Clarifications to Synonym Filter Wiki entry? (2 of 2)

2009-08-24 Thread Mark Bennett
Here's the best thread I've found so far about multi-word matching and synonyms: http://www.nabble.com/solr-synonyms-behaviour-ts15051211.html#a18476205 And an interesting workaround: http://www.nabble.com/solr-synonyms-behaviour-ts15051211.html#a18693735 Earlier on the thread repeats the claim t

multi-language search

2009-08-24 Thread Elaine Li
Hi, I have two questions. 1) Can solr be configured so all my english docs will be saved in a group, say group-en? My chinese docs will be saved in group-cn. So my search will only be conducted in the intended group, instead of everywhere. 2) I posted some chinese docs to the server. The query o

Re: Not return my expected contents

2009-08-24 Thread Elaine Li
Thanks. The fieldtype without the porter filter works for my case. I appreciate your information very much. Elaine On Sat, Aug 22, 2009 at 9:02 PM, Lance Norskog wrote: > These text types 'stem' the words arabic, arabian, and arabs to 'arab'. > > http://wiki.apache.org/solr/SchemaDesign > > The P

schema browing character encoding

2009-08-24 Thread smock
Hello, I'm trying to use the schema browser (/file/?file=schema.xml) to examine the schema of my solr installation, but am having problems with character encodings. Everything I have is defined in UTF-8, and I can read the file locally with that encoding without any problems. However, in the we

Re: defining qf in your own request handler

2009-08-24 Thread Shalin Shekhar Mangar
On Mon, Aug 24, 2009 at 11:38 PM, darniz wrote: > > 1) is there a parameter where i can flush out the browser cache. Most of > the > time i see cached content in brower and i have to delete the temp files > folder. > Solr emits headers for playing well with http caches. You can disable emitting

Re: defining qf in your own request handler

2009-08-24 Thread Avlesh Singh
> > is there a parameter where i can flush out the browser cache. Most of the > time i see cached content in brower and i have to delete the temp files > folder. > You can disable cache in your browser. I think there is already a jira issue to send no-cache headers to the client for all admin pages

Re: Overview of Query Parsing API Stack? / Dismax parsing, new 1.4 parsing, etc.

2009-08-24 Thread Mark Bennett
Thanks Hoss and Yonik. Hoss, you had a particluarly pertinent passage: > ... because the normal Lucene QueryParser uses whitespace ... > and breaks up the input on the whitespace boundaries > before it ever passes those chunks ... to the analyzers This is EXACTLY what the issue is. At first I th

Re: defining qf in your own request handler

2009-08-24 Thread darniz
Thanks it working now. Moving on the same topic i need two more answers 1) is there a parameter where i can flush out the browser cache. Most of the time i see cached content in brower and i have to delete the temp files folder. 2) the other thing i wanted to know how to do stem search. Does dis

Clarifications to Synonym Filter Wiki entry? (2 of 2)

2009-08-24 Thread Mark Bennett
There are a couple of things about the Solr Thesaurus doc that I'd like to confirm / understand. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#SynonymFilter There's a section about multi word matching, using seabiscit as an example. I've also seen references to this discussion in pos

Clarifications to Synonym Filter Wiki entry? (1 of 2)

2009-08-24 Thread Mark Bennett
There are a couple of things about the Solr Thesaurus doc that I'd like to confirm / understand. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#SynonymFilter I believe the following section is a bit misleading; I'm sure it's correct for the case it describes, but there's another case

Re: defining qf in your own request handler

2009-08-24 Thread Shalin Shekhar Mangar
On Mon, Aug 24, 2009 at 10:43 PM, darniz wrote: > > Hello > i created a custom request handler and i want it to do a search on features > and make field by default. i added the following decleration in my > solrconfig.xml file > > > > 12 > explicit > * > model d

Re: defining qf in your own request handler

2009-08-24 Thread Erik Hatcher
I think you need a space, not a comma, in the qf parameter. It's designed to allow for boosts, like qf=features^2.0 make^1.0 Erik On Aug 24, 2009, at 1:13 PM, darniz wrote: Hello i created a custom request handler and i want it to do a search on features and make field by defaul

defining qf in your own request handler

2009-08-24 Thread darniz
Hello i created a custom request handler and i want it to do a search on features and make field by default. i added the following decleration in my solrconfig.xml file 12 explicit * model desc features,make as you can see in the qf i defin

Facet prefix

2009-08-24 Thread smock
I'd like to set up case insensitive matching on a facet.prefix, but would like the facet handler to return the stored value rather than the indexed value. For instance, if a field value is 'Yes', I'd like facet.prefix to match on 'yes' but return 'Yes' - is this behavior possible to set up? Tha

solr nutch url indexing

2009-08-24 Thread Lassalle, Thibaut
Hi, I would like to crawl intranets with nutch and index them with solr. I would like to search mostly on the title of the pages (the one in This is a title) I tried to tweak the schema.xml to do that but nothing is working. I just have the content indexed. How do I index on title ?

Re: Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Marc Sturlese
It just worked. Thanks a lot! Good to know sortMissingLast works not just in sortable fields Constantijn Visinescu wrote: > > not 100% sure but the example schema has: > omitNorms="true"/> > > So i'd say give it a go and see what happens ;) > > On Mon, Aug 24, 2009 at 12:24 PM, Marc Stur

Re: Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Avlesh Singh
Ah ... I just got wiser! Thanks Erik. Cheers Avlesh On Mon, Aug 24, 2009 at 4:39 PM, Erik Hatcher wrote: > > On Aug 24, 2009, at 7:03 AM, Avlesh Singh wrote: > > Can you really sort accurately on tokenized fields? >> > > Yes, as long as there is *one and only one* term emitted from the analyzer

Re: Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Erik Hatcher
On Aug 24, 2009, at 7:03 AM, Avlesh Singh wrote: Can you really sort accurately on tokenized fields? Yes, as long as there is *one and only one* term emitted from the analyzer. KeywordTokenizer is your friend, and comes in handy to lowercase or pattern replace things. Erik

Re: Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Avlesh Singh
Can you really sort accurately on tokenized fields? Cheers Avlesh On Mon, Aug 24, 2009 at 3:54 PM, Marc Sturlese wrote: > > Yes but I thought it was just for sortable fields: > sint,sfloat,sdouble,slong. > Can I apply "sortMissingLast"to text fields analyzed with KeywordAnalyzer? > > Constantij

Re: Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Constantijn Visinescu
not 100% sure but the example schema has: So i'd say give it a go and see what happens ;) On Mon, Aug 24, 2009 at 12:24 PM, Marc Sturlese wrote: > > Yes but I thought it was just for sortable fields: > sint,sfloat,sdouble,slong. > Can I apply "sortMissingLast"to text fields analyzed with

Re: Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Marc Sturlese
Yes but I thought it was just for sortable fields: sint,sfloat,sdouble,slong. Can I apply "sortMissingLast"to text fields analyzed with KeywordAnalyzer? Constantijn Visinescu wrote: > > There's a "sortMissingLast" true/false property that you can set on your > fielType definitions in the schema

Re: Implementing a logout

2009-08-24 Thread Rahul R
*"release any SOLR resources" - no need.* My query is answered. Thank you. Regards Rahul On Mon, Aug 24, 2009 at 12:32 AM, Fuad Efendi wrote: > Truly correct: > > - SOLR does not create HttpSession for user access to Admin screens (do we > have any other screens of UI?) > - SolrCore is shared

Re: Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Constantijn Visinescu
There's a "sortMissingLast" true/false property that you can set on your fielType definitions in the schema. On Mon, Aug 24, 2009 at 11:58 AM, Marc Sturlese wrote: > > Hey there, I need to sort my query results alphabetically for a > determinated > field called "town". This field is analyzed with

Optimizing a query to sort results alphabetically for a determinated field

2009-08-24 Thread Marc Sturlese
Hey there, I need to sort my query results alphabetically for a determinated field called "town". This field is analyzed with a KeywordAnalyzer and isn't multiValued. Add that some docs doesn't doesn'h have this field. Doing just: http://localhost/solr//select/?q=whatever&version=2.2&start=0&rows