RE: Ebay Kleinanzeigen and Auto Suggest

2011-05-03 Thread Charton, Andre
Hi, yes we do. If you use a limit number of categories (like 100) you can use dynamic fields with the termscomponent and by choosing a category specific prefix, like: {schema.xml} ... ... {schema.xml} And within data import handler we script prefix from given category: {data-config.xml}

Re: fq parameter with partial value

2011-05-03 Thread elisabeth benoit
Ok, thanks a lot. After making a few tests, I finally understood what you meant. Best regards, Elisabeth 2011/5/2 Jonathan Rochkind > So if you have a field that IS tokenized, regardless of what it's called, > then when you send "My Great Restaurant" to it for _indexing_, it gets > _tokenized

Re: Indexing multiple languages

2011-05-03 Thread Stefan Matheis
Peter, is there a specific need to split these entities? why not just fetch both columns in one entity? like this: two additional hints: 1) if you 'alias' your fields in your select query (select title_en as categories_en) then it's no longer nee

stemming for English

2011-05-03 Thread Dmitry Kan
Dear list, In SOLR schema on the index side we use no stemming to support favor wildcard search. On the query side of the index we use Porter stemming. I have noticed the following issue: the term "pretty" gets stemmed to "pretti" and thus not found. What would be the approach to handle such sit

Re: stemming for English

2011-05-03 Thread lboutros
Hi, I think you have to use stemming on both side (index and query) if you really want to use stemming. Ludovic 2011/5/3 Dmitry Kan [via Lucene] < ml-node+2893599-894006307-383...@n3.nabble.com> > Dear list, > > In SOLR schema on the index side we use no stemming to support favor > wildcard sea

Re: stemming for English

2011-05-03 Thread Dmitry Kan
Hi Ludovic, That's an option we had before we decided to go for a full-blown support of wildcards. Do you know of a way to keep both stemming and consistent wildcard support in the same field?` Dmitry On Tue, May 3, 2011 at 12:56 PM, lboutros wrote: > Hi, > > I think you have to use stemming

Re: stemming for English

2011-05-03 Thread lboutros
Dmitry, I don't know any way to keep both stemming and consistent wildcard support in the same field. To me, you have to create 2 different fields. Ludovic. 2011/5/3 Dmitry Kan [via Lucene] < ml-node+2893628-993677979-383...@n3.nabble.com> > Hi Ludovic, > > That's an option we had before we dec

Re: Nutch Web Interface - not anymore in 1.3

2011-05-03 Thread Gabriele Kahlout
Hello, I'm also in favor of maintaing a web interface that ships with nutch. As has been mentioned it say well be a bridge to Solr. If I find the time to contribute my solution (and make it general enough), I'll happily do it. Earlier I was wondering of actually using the previous nutch web inter

Re: stemming for English

2011-05-03 Thread Dmitry Kan
Yes, Ludovic. Thus effectively we get index doubled. Given the volume of data we store, we very carefully consider such cases, where the doubling of index is must. Dmitry On Tue, May 3, 2011 at 1:08 PM, lboutros wrote: > Dmitry, > > I don't know any way to keep both stemming and consistent wild

Dismax scoring multiple fields TIE

2011-05-03 Thread roySolr
Hello, I have a question about scoring when i use the dismax handler. I will give some examples: name category related category 1. Chelsea best club everChelseaSport 2. Chelsea

Re: Dismax scoring multiple fields TIE

2011-05-03 Thread Erick Erickson
I'm not sure you can. very short fields aren't differentiated on the basis of field length due to rounding errors. Here's a cut-n-paste from Jay Hill: So the values are not pre-set for the lengthNorm, but for some counts the fieldLength value winds up being the sam

Re: Dismax scoring multiple fields TIE

2011-05-03 Thread roySolr
No, but i think the difference between fieldlength is large and the score is still the same. Same score for this results(q=chelsea): 1. Chelsea is a very very big club in london, england Chelsea Sport 2. Chelsea Ch

facet search and UnInverted multi-valued field?

2011-05-03 Thread Bernd Fehling
Dear list, we use solr 3.1.0. my logs have the following entry: May 3, 2011 2:01:39 PM org.apache.solr.request.UnInvertedField uninvert INFO: UnInverted multi-valued field {field=f_dcperson,memSize=1966237,tindexSize=35730,time=849,phase1=782,nTerms=12,bigTerms=0,termInstances=368008,uses=0}

Re: Dismax scoring multiple fields TIE

2011-05-03 Thread elisabeth benoit
for category:chelsea, you have a fieldNorm=1.0, so your category field must have a type with omitNorms=true. if you don't have omitNorms=true, then shorter field will score higher. I'm new to Solr, but from what I've experienced, this is the cause. Regards, Elisabeth 2011/5/3 roySolr > Hello,

full-import called simultaneously for multiple core.

2011-05-03 Thread Kannan
Hi I am running one instance with multiple core. If call full-import URI simultanesously for multiple core few of the fields are not getting indexed.If i do full-import one by one.Its works fine. Thanks in advance -- View this message in context: http://lucene.472066.n3.nabble.com/full-import

Unable to use DataImportHandler

2011-05-03 Thread serenity keningston
Hello Friends, I am new to Solr and experiencing issue while trying to use DataImportHandler. I added the required fields to schema.xml file and here is my data-config.xml file : I am getting the following errors : org.apache.solr.common.SolrExcepti

Re: Unable to use DataImportHandler

2011-05-03 Thread Stefan Matheis
Serenity, there is no field fileNo in your SELECT-Query? you've defined it in the pk-Attribut of the Tag, but it's also required in the Query itself. just to note it: you can skip your definition, if the query returns the field with exactly the same name as the solr-schema requires it. Regards

Re: Unable to use DataImportHandler

2011-05-03 Thread serenity keningston
Dear Stefan, Am still getting the following error message even after including the pk-Attribute to the query : WARNING: Error creating document : SolrInputDocument[{lname=lname(1.0)={cindy}, file=file(1.0)={ http://localhost:8084/Access/UploadFiles/laura.mp3}, fname=fname(1.0)={troutman}}] org.ap

Re: full-import called simultaneously for multiple core.

2011-05-03 Thread Erick Erickson
Do you log files show any errors? Erick On Tue, May 3, 2011 at 11:06 AM, Kannan wrote: > Hi >   I am running one instance with multiple core. If call full-import URI > simultanesously for multiple core few of the fields are not getting > indexed.If i do full-import one by one.Its works fine. Tha

Getting field information inside a Tokenizer

2011-05-03 Thread FatMan Corp
Hi, I would like to get another's field information for the same document within a Tekonizer class. How can this be achieved? Thanks

Re: Unable to use DataImportHandler

2011-05-03 Thread Erick Erickson
The next thing to check is if your select statement returns the fileNo for every field. Wait.. You took out the bit of your entity definition, is that a cut/paste error? You might get some joy from the DIH debug page at: solr/admin/dataimport.jsp it's not very well known, but it's a debug consol

Re: Unable to use DataImportHandler

2011-05-03 Thread serenity keningston
Dear Erick, I am using Solr 1.4 version. Yes, for each row , we will get one fileNo which is the primary key for the table "file". No, I intentionally removed the from the data-config.xml I tried opening the dataimport.jsp to debug but I don't know, it doesn't show any result but it updates the

getLuceneVersion parsing xml node on every request

2011-05-03 Thread Stephane Bailliez
I' m using Solr 3.1 right now. I was looking at a threadump trying to figure out why queries were not exactly fast and noticed that it keeps parsing xml over and over from the schema to get the lucene version. SolrQueryParser are created for each request and in the constructor there is a call sim

UIMA analysisEngine path

2011-05-03 Thread Barry Hathaway
I'm new to Solr and trying to get it call a UIMA aggregate analysis engine and not having much luck. The null pointer exception indicates that it can't find the xml file associated with the engine. I have tried a number of combinations of a path in the element, but nothing seems to work. In ad

Re: facet search and UnInverted multi-valued field?

2011-05-03 Thread Jay Hill
UnInvertedField is similar to Lucene's FieldCache, except, while the FieldCache cannot work with multivalued fields, UnInvertedField is designed for that very purpose. So since your f_dcperson field is multivalued, by default you use UnInvertedField. You're not doing anything wrong, that's default

RE: stemming for English

2011-05-03 Thread Robert Petersen
From what I have seen, adding a second field with the same terms as the first does *not* double your index size at all. -Original Message- From: Dmitry Kan [mailto:dmitry@gmail.com] Sent: Tuesday, May 03, 2011 4:06 AM To: solr-user@lucene.apache.org Subject: Re: stemming for English

RE: Getting field information inside a Tokenizer

2011-05-03 Thread Steven A Rowe
Hi FMC, On 5/3/2011 at 12:37 PM, FatMan Corp wrote: > Hi, I would like to get another's field information for the same document > within a Tekonizer class. > How can this be achieved? Use s in your schema , and associate different analysis pipe

Re: Unable to use DataImportHandler

2011-05-03 Thread Erick Erickson
OK, put it back According to this page: http://wiki.apache.org/solr/DataImportHandler#Configuration_in_data-config.xml-1 the pk is used for delta imports and "has no relation to uniqueKey defined in the schema.xml...". The error you're getting is because your schema.xml defines the "id" fiel

Re: How to debug if termsComponent is used

2011-05-03 Thread cyang2010
I tried it. It just does not work. the debug component only works when query component is there, and it is just showing debugging information for query result, not term match result. -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-debug-if-termsComponent-is-used-tp28

Re: How to debug if termsComponent is used

2011-05-03 Thread Erick Erickson
Saying "it does not work" doesn't give us much to go on. Can you describe what you've tried? *How* it fails? Have you looked in the log for any clues? You might review this page: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Tue, May 3, 2011 at 3:35 PM, cyang2010 wrote: > I tried

How to know which value matched for a multi-value field

2011-05-03 Thread cyang2010
Hi, I have a use case where i need to know for a paritcular multivalue field, which particular value match when a query is run on that field. For example, for a movie document, movie title name is single-value field, movie actors is multi-value field. When user search "colin", i know it is "col

Re: How to debug if termsComponent is used

2011-05-03 Thread cyang2010
Sorry i didn't mean to give random reply. It is just today my solr configuration/schema is different now and can't get the error message. Anyway, i rerun the test. Basically by specifying such searchcomponent and requesthandler, you won't get any error. When you query this it is fine without an

An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-03 Thread Gavin Engel
Hello all, I've been trying to add the Spatial Search Plugin to my Solr 1.4.1 setup, and I get this error: > java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin > at java.lang.ClassLoader.defineClass1(Native Method) > at java.lang.ClassLoader.defineClass(ClassLoader.java:634) >

Re: An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-03 Thread Markus Jelsma
Where did you store the jar? Is it in a directory Solr looks for libs? Depending on your distro or set up there can be different places to store the jar. The easiest solution is to put it in a dir where other Solr libs are found or in a dir that you configured in a directive in solrconfig. > H

Re: An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-03 Thread Gavin Engel
Oh, I apparently figured out how to get the jar file to load, so problem is solved I suppose. The fix seems very odd to me, but I got it from a comment on the SSP 2 blog page ( http://blog.jteam.nl/2009/08/03/geo-location-search-with-solr-and-lucene/comment-page-1/#comment-4774 ): The solution, f

RE: Ebay Kleinanzeigen and Auto Suggest

2011-05-03 Thread Andy
--- On Tue, 5/3/11, Charton, Andre wrote: > > yes we do. > > If you use a limit number of categories (like 100) you can > use dynamic fields with the termscomponent and by choosing a > category specific prefix, like: > > {schema.xml} > ... > indexed="true" stored="false" multiValued="true" >

Re: Has NRT been abandoned?

2011-05-03 Thread Nagendra Nagarajayya
Thanks Andy! Solr-RA is the same as Solr, except that the underlying search library is now RankingAlgorithm library instead of Lucene. BoostQParserPlugin works at the Solr level, so this should still work as before. A query of the form q={!boost b=log(x)}abcde comes back with results but am n

Re: getLuceneVersion parsing xml node on every request

2011-05-03 Thread Stephane Bailliez
I went ahead and patched locally the SolrQueryParser in current 3_x branch. Doing a quick test, baring any obvious mistake due to sleep deprivation I get close to a 10X performance boost from 200qps to 2000qps. I opened https://issues.apache.org/jira/browse/SOLR-2493 cheers, -- stephane On Tue

RE: How to take differential backup of Solr Index

2011-05-03 Thread Gaurav Shingala
how we can configure query server in solr using replication feature? Thanks, Gaurav > Date: Mon, 2 May 2011 22:05:33 -0700 > Subject: Re: How to take differential backup of Solr Index > From: goks...@gmail.com > To: solr-user@lucene.apache.org > > The Replication feature does this. If you confi

Re: Replicaiton Fails with Unreachable error when master host is responding.

2011-05-03 Thread Jed Glazner
So it turns out that it's the host names. According the DNS RFC underscores are not valid in host names. Most DNS servers now support them, but it's not in the rfc strictly speaking. So there must be something in the underlying java classes that bork when using underscores in host names, thou

Using lowercase as field type

2011-05-03 Thread Isan Fulia
Hi , My schema consists of a field of type lowercase(for applying the lowercase filter factory) and is the unique key . But its no longer behaving as unique key. Multiple documents with same value for the unique key are getting indexed. Does anyone know why this is happening or is it that the fi

How to preserve filelist / commit-points after master restart

2011-05-03 Thread Maduranga Kannangara
Hi All, We use Solr 1.4.1. Single core setup with a repeater (for QA) and a few slaves (for Production). Master will index many sources and make data ready. Once all data is "ready-for-production", optimization will take place. On master "replicateAfter" is set to "optimize". (Subsequently on rep

Re: An error I can't manage to fix: java.lang.NoClassDefFoundError: org/apache/solr/search/QParserPlugin

2011-05-03 Thread Markus Jelsma
So, you're using Jetty. That's indeed a place to store the file when using Jetty. > Oh, I apparently figured out how to get the jar file to load, so problem is > solved I suppose. > > The fix seems very odd to me, but I got it from a comment on the SSP 2 blog > page ( > http://blog.jteam.nl/200

Re: Using lowercase as field type

2011-05-03 Thread Markus Jelsma
So those multiple documents overwrite eachother? In that case, your data is not suited for a lowercased docID. I'd recommend not doing any analysis on the docID to prevent such headaches. > Hi , > > My schema consists of a field of type lowercase(for applying the lowercase > filter factory) an

Solr Terms and Date field issues

2011-05-03 Thread Viswa S
Hello, The terms query for a date field seems to get populated with some weird dates, many of these dates (1970,2009,2011-04-23) are not present in the indexed data. Please see sample data below I also notice that a delete and optimize does not remove the relevant terms for date fields, the

Re: Using lowercase as field type

2011-05-03 Thread Isan Fulia
I want multiple documents with same unique key to overwrite each other but they are not overwriting because of lowercase field type as unique key On 4 May 2011 11:45, Markus Jelsma wrote: > So those multiple documents overwrite eachother? In that case, your data is > not suited for a lowercased