underscores are parsed only as spaces

2009-04-20 Thread sunnyfr
Hi, I don't get why and how to change this: underscores are parsed only as spaces, meaning that a search for user "ejekt_festival" will return zero results, while "ejekt festival" will return the user "ejekt_festival". Thanks for your help, -- View this message in context: http://www.nabble.c

Re: underscores are parsed only as spaces

2009-04-20 Thread Shalin Shekhar Mangar
On Mon, Apr 20, 2009 at 1:46 PM, sunnyfr wrote: > > Hi, > > I don't get why and how to change this: underscores are parsed only as > spaces, meaning that a search for user "ejekt_festival" will return zero > results, while "ejekt festival" will return the user "ejekt_festival". > I think the fie

Big Problem with special characters

2009-04-20 Thread Kraus, Ralf | pixelhouse GmbH
Hello, first some details about my SOLR installation: schema.xml positionIncrementGap="100" > words="stopwords.txt"/> language="german" /> words="stopwords.txt"/>

RE: OutofMemory on Highlightling

2009-04-20 Thread Gargate, Siddharth
Anybody facing the same issue? Following is my configuration ... ... ... explicit 500 true id,score teaser teaser 200 200 500 ... Search works fine if I

Re: Search on all fields and know in which field was the match

2009-04-20 Thread Rui Carneiro
On Tue, Apr 14, 2009 at 9:54 PM, Chris Hostetter wrote: > one option is to index each attachemnt as it's own document *in addition* > to indexing each email will all of hte attachment text in a single > "atachments" field. that way you can search for all emails where "Bob" is > mentioned in an at

Re: Customizing solr with my lucene

2009-04-20 Thread mirage1987
Hi, Here's the schema.xml i am using. id value I can't figure out the error. Do u see any probs with the current schema. The schema is defined as such because i have implemented the my own analyzer an token

ExtractingRequestHandler and SolrRequestHandler issue

2009-04-20 Thread francisco treacy
Hi all, I am unsuccessfully attempting to use the ExtractingRequestHandler (indexing documents via Tika, Solr cell). I start Solr from the example app (start.jar), but point to my own Solr conf, where I have last_modified true Using the nightly builds (2009-04-17).

Re: OutofMemory on Highlightling

2009-04-20 Thread Koji Sekiguchi
Gargate, Siddharth wrote: Anybody facing the same issue? Following is my configuration ... ... ... explicit 500 true id,score teaser teaser 200 200 500

Using Solr to index a database

2009-04-20 Thread ahammad
Hello, I've never used Solr before, but I believe that it will suit my current needs with indexing information from a database. I downloaded and extracted Solr 1.3 to play around with it. I've been looking at the following tutorials: http://www.ibm.com/developerworks/java/library/j-solr-update/i

Re: ExtractingRequestHandler and SolrRequestHandler issue

2009-04-20 Thread Grant Ingersoll
Can you give the full stack trace? On Apr 20, 2009, at 6:49 AM, francisco treacy wrote: Hi all, I am unsuccessfully attempting to use the ExtractingRequestHandler (indexing documents via Tika, Solr cell). I start Solr from the example app (start.jar), but point to my own Solr conf, where I hav

Re: Using Solr to index a database

2009-04-20 Thread Glen Newton
You have not indicated how you wish to use the index (inside Solr or not). It is possible that LuSql might be an preferable alternative to Solr/DataImportHandler, depending on your requirements. LuSql: http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/LuSql Disclaimer: I am the autho

Re: CollapseFilter with the latest Solr in trunk

2009-04-20 Thread Jeff Newburn
What are the current issues holding this back? Seems to be working with some minor bug fixes. -- Jeff Newburn Software Engineer, Zappos.com jnewb...@zappos.com - 702-943-7562 > From: Otis Gospodnetic > Reply-To: > Date: Sun, 19 Apr 2009 20:30:22 -0700 (PDT) > To: > Subject: Re: CollapseFilte

Re: Using Solr to index a database

2009-04-20 Thread ahammad
For now it's unclear, as this is sort of an "experiment" to see how much we can do with it. I am inclined to use the index within Solr though, simply for the very powerful querying (the stuff I've seen at least). I am not exactly sure how much of the querying capabilities I'll require though. I'l

Re: ExtractingRequestHandler and SolrRequestHandler issue

2009-04-20 Thread francisco treacy
Hi Grant, Here is the full stacktrace: 20-Apr-2009 12:36:39 org.apache.solr.common.SolrException log SEVERE: java.lang.ClassCastException: org.apache.solr.handler.extraction.ExtractingRequestHandler cannot be cast to org.apache.solr.request.SolrRequestHandler at org.apache.solr.core.Requ

Solr webinar

2009-04-20 Thread Erik Hatcher
(excuse the cross-post) I'm presenting a webinar on Solr. Registration is limited, so sign up soon. Looking forward to "seeing" some of you there! Thanks, Erik "Got data? You can build your own Solr-powered Search Engine!" Erik Hatcher, Lucene/Solr Committer and author, will show

Re: CollapseFilter with the latest Solr in trunk

2009-04-20 Thread Ryan McKinley
I have not looked at this in a while, but I think the biggest thing it is missing right now is a champion -- someone to get the patches (and bug fixes) to a state where it can easily be committed. "Minor bug fixes" are road blocks to getting things integrated. ryan On Apr 20, 2009, at 10

Re: Solr webinar

2009-04-20 Thread ahammad
Hello Erik, I'm interested in attending the Webinar. I just have some questions to verify whether or not I am fit to attend... 1) How will it be carried out? What software or application would I need? 2) Do I have to have any experience or can I attend for the purpose of learning about Solr? Th

Re: Solr webinar

2009-04-20 Thread Matt Mitchell
Thanks Erik! Looking forward to it. Matt On Mon, Apr 20, 2009 at 11:00 AM, ahammad wrote: > > Hello Erik, > > I'm interested in attending the Webinar. I just have some questions to > verify whether or not I am fit to attend... > > 1) How will it be carried out? What software or application woul

Re: Solr webinar

2009-04-20 Thread Erik Hatcher
I replied to this off-list, and will do so for future questions about the webinar. Please direct them to me personally rather than the list. But in short, no Solr experience is necessary, and it's purpose is to educate about Solr. If you're already developing with Solr you're likely ove

Customization of solr

2009-04-20 Thread HPN 75
Hi, I have some years of experience with lucene and I am knowing solr now. I see that many processes are encapsulated in the API. My doubts is on the level of customization of solr. Is it possible to create my units of searches in solr having: 1- Send of seed for ramdomize for my sort

Re: Customization of solr

2009-04-20 Thread Yonik Seeley
On Mon, Apr 20, 2009 at 11:46 AM, HPN 75 wrote: >   I have some years of experience with lucene and I am knowing solr now. >  I see that many processes are encapsulated in the API. My doubts is on the > level of customization of solr. >  Is it possible to create my units of searches in solr havin

Re: ebook resources - including lucene in action

2009-04-20 Thread Grant Ingersoll
Lest you think silence equals acceptance... This is not appropriate use of these lists. -Grant On Apr 19, 2009, at 11:58 PM, wu fuheng wrote: welcome to download http://www.ultraie.com/admin/flist.php

Re: Big Problem with special characters

2009-04-20 Thread Otis Gospodnetic
Try &debugQuery=true and see if the resulting query string makes sense. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: "Kraus, Ralf | pixelhouse GmbH" > To: solr-user@lucene.apache.org > Sent: Monday, April 20, 2009 4:34:36 AM > Subject:

Does solr directly call underlying lucene functions

2009-04-20 Thread mirage1987
Hi, I had made some changes to the lucene code . So i have changes to the index writer,query parser and added some new classes. Would this effect the working of solr in any way. Would i have to make any changes apart from replacing the lucene jar in the war file. I want solr to just use my luc

Using a function in a filter query

2009-04-20 Thread Pete Smith
I want to filter my result set before I search. I know the correct way to do this is by using the filter query (fq) parameter. However, I want to filter based on the output of a function performed on a field. I have a field 'rating' which is an integer in the range of 1 to ~75000. The upper limit

Re: ExtractingRequestHandler and SolrRequestHandler issue

2009-04-20 Thread francisco treacy
Additionally, here's what I've got in example/lib: apache-solr-cell-nightly.jar bcmail-jdk14-132.jar commons-lang-2.1.jar icu4j-3.8.jar log4j-1.2.14.jar poi-3.5-beta5.jar slf4j-api-1.5.5.jar xml-apis-1.0.b2.jar apache-solr-core-nightly.jar bcprov-jdk14-132.jar common

working with dynamic fields

2009-04-20 Thread mirage1987
Hi, Is it possible use a dynamic field name="*" without having to define any other field. I need this for i have many fields in my docs and i want all of them to be treated the same way. Plus i want to use a field type with no analyzer for the above. Thanx. -- View this message in context:

maxBooleanClauses implications of a high number ?

2009-04-20 Thread Rakesh Sinha
I am configuring solr locally for our apps and for some of our apps - we need to configure maxBooleanQueries in the solr configuration. Right now - we had set it to 8K ( as opposed to the default of 1K) . Our dataset document size is about 500K . We have about 6G of ram (totally) - so ignoring the

case insensitive sentence matches in text field

2009-04-20 Thread Ian Connor
If I have a field that is the default type text (from the sample schema) with the lowercase filter and so forth, is it possible to also do sentence matches in a case insensitive way? I can see the word roots are indexed in lowercase, but when I then try to match on the entire sentence, it will onl

Re: Using a function in a filter query

2009-04-20 Thread Yonik Seeley
On Mon, Apr 20, 2009 at 12:40 PM, Pete Smith wrote: > fq=rord(rating):[0 TO 500] Solr 1.4 can now do range queries on arbitrary functions: http://lucene.apache.org/solr/api/org/apache/solr/search/FunctionRangeQParserPlugin.html Note that ord() and rord() won't work properly in Solr 1.4 trunk. Lu

Re: case insensitive sentence matches in text field

2009-04-20 Thread Yonik Seeley
On Mon, Apr 20, 2009 at 1:22 PM, Ian Connor wrote: > If I have a field that is the default type text (from the sample schema) > with the lowercase filter and so forth, is it possible to also do sentence > matches in a case insensitive way? This should already work... can you add debugQuery=true t

Re: case insensitive sentence matches in text field

2009-04-20 Thread Ian Connor
Hi, Thanks for the tip - it is in fact working. It is just that the word PubMed trips it up. It splits it up to pub med but if you leave it lowercase, it removes the 'ed' and leaves the root pubm. That is tricky and not what I expected - I will need to be more careful with these filters - thanks.

Re: CollapseFilter with the latest Solr in trunk

2009-04-20 Thread Jeff Newburn
We would love to help debug the issues but we have limited knowledge in the source code. I have looked through the patch information but I am not understanding the interactions of where the component should be. The example of this is our struggle with the collapse.facet. It does not appear to do

Re: python response handler treats "unschema'd" fields differently

2009-04-20 Thread Chris Hostetter
: Seems like we could handle this 2 ways... leave out the field if it's : not defined in the schema, or include it and write it out as a string. : I think either would probably be more useful than throwing an error : (which isn't really a request error but rather a schema/indexing : error). i'd

Re: Solr Search Error

2009-04-20 Thread Chris Hostetter
: HTTP Status 500 - 13724 java.lang.ArrayIndexOutOfBoundsException: : 13724 at org.apache.lucene.search.TermScorer.score(TermScorer.java:74) An ArrayIndexOutOfBoundsException from TermScorer is a prtty serious error -- and probably indicates an interal problem of some kind, not a config issue

Re: ebook resources - including lucene in action

2009-04-20 Thread Erik Hatcher
It is not legal to share purchased e-books in this manner. Please purchase copies of the books you read, otherwise authors have very little incentive to dedicate months (14 months in the case of Lucene in Action, first edition) of their lives to writing this content. Erik On Apr 2

query on part number not matching

2009-04-20 Thread Kevin Osborn
I have a manufacturer part number: CISCO7204VXR-CH. The indexer produces: 12 3 4 cisco7204vxrch vxrch cisco7204vxrch If I query on CISCO7204VXR-CH, I get: 12 3 4 cisco7204vxrch Everyt

Re: Seattle / PNW Hadoop + Lucene User Group?

2009-04-20 Thread Bradford Stephens
Thanks for the responses, everyone. Where shall we host? My company can offer space in our building in Factoria, but it's not exactly a 'cool' or 'fun' place. I can also reserve a room at a local library. I can bring some beer and light refreshments. On Mon, Apr 20, 2009 at 7:22 AM, Matthew Hall

Re: Seattle / PNW Hadoop + Lucene User Group?

2009-04-20 Thread ristretto.rb
Beer h, I'm in New Zealand, so probably can't make it, but I sounds tempting. cheers gene On Tue, Apr 21, 2009 at 11:28 AM, Bradford Stephens wrote: > Thanks for the responses, everyone. Where shall we host? My company > can offer space in  our building in Factoria, but it's not exactly

Re: query on part number not matching

2009-04-20 Thread Yonik Seeley
On Mon, Apr 20, 2009 at 6:59 PM, Kevin Osborn wrote: > > I have a manufacturer part number: CISCO7204VXR-CH. The indexer produces: > > 1        2       3      4 > cisco    7204    vxr    ch >                        vxrch >                        cisco7204vxrch It looks like you're using catenateA

Re: query on part number not matching

2009-04-20 Thread Kevin Osborn
Looks like the format didn't come through in the email. ch, vxrch, and cisco7204xvrch are all in position 4. But, your suggestion of turning off catenateAll may work out. I'll have do some testing to make sure that it doesn't have any unintended consequences. Specifically, I am worried about a

Re: query on part number not matching

2009-04-20 Thread Yonik Seeley
On Mon, Apr 20, 2009 at 8:50 PM, Kevin Osborn wrote: > Looks like the format didn't come through in the email. ch, vxrch, and > cisco7204xvrch are all in position 4. Ah... the traditional way to "handle" that case is to use a little slop with the phrase query. -Yonik

Re: Using Solr to index a database

2009-04-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Mon, Apr 20, 2009 at 7:15 PM, ahammad wrote: > > Hello, > > I've never used Solr before, but I believe that it will suit my current > needs with indexing information from a database. > > I downloaded and extracted Solr 1.3 to play around with it. I've been > looking at the following tutorials: