Re: Setting many properties for a multivalued field. Schema.xml ? External file?

2010-06-25 Thread Otis Gospodnetic
Saïd, dynamic fields could help here. For example, imagine a doc with: id, pic_url_*, pic_caption_*, pic_description_*. See http://wiki.apache.org/solr/SchemaXml#Dynamic_fields So, for you: Then you can add docs with an unlimited number of pic_(url|caption|description)_* fields, e.g. id pic_ur
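
As a rough illustration of the dynamic-field idea above, the following Python sketch flattens a list of pictures into one flat document whose field names match the pic_(url|caption|description)_* patterns. The function name, the numeric suffix scheme, and the sample values are assumptions made for illustration; the original message does not say how the suffixes should be chosen.

```python
def flatten_pictures(doc_id, pictures):
    """Flatten picture dicts into one flat document with pic_<prop>_<n> fields.

    Assumption: a 1-based counter is used as the dynamic-field suffix so the
    url, caption and description of the same picture share the same suffix.
    """
    doc = {"id": doc_id}
    for index, picture in enumerate(pictures, start=1):
        for prop in ("url", "caption", "description"):
            if prop in picture:
                doc[f"pic_{prop}_{index}"] = picture[prop]
    return doc


if __name__ == "__main__":
    # Hypothetical sample data, not taken from the thread.
    print(flatten_pictures("doc-1", [
        {"url": "http://www.example.com/a.jpg", "caption": "First"},
        {"url": "http://www.example.com/b.jpg", "description": "Second picture"},
    ]))
```

Sharing one suffix per picture keeps the three properties of a picture associated with each other, which a set of independent multivalued fields would not guarantee on its own.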

indexing xml document with literals

2010-06-25 Thread Kyle Langan
Does anyone know how to read in data from one or more of the example XML docs and ALSO store the filename and path from which it came? i.e. exampledocs/vidcard.xml contains: EN7800GTX/2DHTV/256M ASUS Extreme N7800GTX/2DHTV (256 MB) 100-435805 ATI Radeon X1900 XTX 512 MB PCIE Video Card

Setting many properties for a multivalued field. Schema.xml ? External file?

2010-06-25 Thread Saïd Radhouani
Hi, I'm trying to index data containing a multivalued field "picture" that has three properties: url, caption, and description. Thus, each indexed document might have many pictures, each of which has a url, a caption, and a description. I wonder whether it's possibl

Re: dataimport.properties is not updated on delta-import

2010-06-25 Thread Alexey Serba
Please note that Oracle (or the Oracle JDBC driver) converts column names to upper case even though you state them in lower case. If this is the case, try rewriting your query in the following form: select id as "id", name as "name" from table On Thursday, June 24, 2010, warb wrote: > > Hello ag
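
The aliasing trick above can be generated mechanically. This is a minimal Python sketch, assuming plain column names that need no escaping; the helper name is hypothetical and the table and column names simply mirror the example query in the message.

```python
def aliased_select(columns, table):
    """Build a SELECT that aliases each column to a quoted lower-case name.

    Quoting the alias asks the database to preserve its case, which is the
    point of the rewritten query in the message. Assumption: column and table
    names are trusted identifiers, so no escaping is attempted here.
    """
    select_list = ", ".join(f'{col} as "{col.lower()}"' for col in columns)
    return f"select {select_list} from {table}"


if __name__ == "__main__":
    print(aliased_select(["id", "name"], "table"))
```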

Re: XML DataImportHandler copy + resize pictures in localhost?

2010-06-25 Thread Otis Gospodnetic
Marc, Why not use http://www.imagemagick.org/script/index.php to generate thumbnails separately from document indexing? Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: "scr...@asia.com
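
One way to generate thumbnails outside of indexing, as suggested above, is to call ImageMagick from a small script. The sketch below assumes the ImageMagick `convert` program is installed and on the PATH; the function names, the 120x120 size, and the `_thumb` naming scheme are illustrative choices, not something stated in the thread.

```python
import subprocess
from pathlib import Path


def thumbnail_command(source, thumb_dir, size="120x120"):
    """Return the ImageMagick command line that writes a thumbnail of `source`."""
    source = Path(source)
    target = Path(thumb_dir) / f"{source.stem}_thumb{source.suffix}"
    # -thumbnail resizes the image to fit within `size`.
    return ["convert", str(source), "-thumbnail", size, str(target)]


def make_thumbnail(source, thumb_dir, size="120x120"):
    """Run the command; raises CalledProcessError if ImageMagick reports failure."""
    subprocess.run(thumbnail_command(source, thumb_dir, size), check=True)
```

Downloading the remote picture first and storing the thumbnail path in the indexed document would be separate steps that this sketch does not cover.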

Re: Debugging Queries

2010-06-25 Thread Otis Gospodnetic
Frank: http://www.getopt.org/luke/ Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Frank A > To: solr-user@lucene.apache.org > Sent: Fri, June 25, 2010 1:23:37 PM > Subject: Debuggin

RE: solr indexing takes a long time and is not responsive to abort command

2010-06-25 Thread Ya-Wen Hsu
Thanks for the response. I double-checked that we don't have the core open multiple times. The complete index size is about 200M (around 1,060,000 documents). During the indexing process, 26 files were created. The core admin interface indicated that no queries or processes were running after roughly 5

Re: SweetSpotSimilarity

2010-06-25 Thread Ahmet Arslan
> Thanks. I'm guessing this is all or nothing, i.e. you can't use one similarity class for one request handler and another for a separate request handler. Is that correct? Correct. A re-index is also required: length norms are calculated and stored at index time.

Re: SweetSpotSimilarity

2010-06-25 Thread Blargy
iorixxx wrote: > > it is in schema.xml: > > > Thanks. I'm guessing this is all or nothing, i.e. you can't use one similarity class for one request handler and another for a separate request handler. Is that correct? -- View this message in context: http://lucene.472066.n3.nabble.com/Sweet

Debugging Queries

2010-06-25 Thread Frank A
I have a query that is not returning the results I expect: some results are missing. Given the ID of a document, is there a way to dive into how that entity is stored in the index? Thanks.

Re: SweetSpotSimilarity

2010-06-25 Thread Ahmet Arslan
> Would someone mind explaining how this differs from the DefaultSimilarity? The difference is length normalization. The default one penalizes long documents. The sweet one "computes to a constant norm for all lengths in the [min,max] range (the "sweet spot"), and smaller norm values for lengths out of
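
To make the contrast concrete, here is a small Python sketch of the two length norms. It follows the formulas as documented for Lucene's DefaultSimilarity (1/sqrt(numTerms)) and SweetSpotSimilarity; the parameter names and the steepness value of 0.5 are assumptions for illustration, and this is not the Lucene code itself.

```python
import math


def default_length_norm(num_terms):
    """Default-style norm: shrinks steadily as the field gets longer."""
    return 1.0 / math.sqrt(num_terms)


def sweet_spot_length_norm(num_terms, min_len, max_len, steepness=0.5):
    """Plateau-style norm: constant inside [min_len, max_len], smaller outside.

    The distance term is zero anywhere inside the range and grows linearly
    with how far num_terms lies outside it.
    """
    distance = abs(num_terms - min_len) + abs(num_terms - max_len) - (max_len - min_len)
    return 1.0 / math.sqrt(steepness * distance + 1.0)


if __name__ == "__main__":
    for length in (2, 5, 10, 50):
        print(length, default_length_norm(length), sweet_spot_length_norm(length, 3, 10))
```

Because these norms are stored at index time, changing the similarity only takes effect for documents indexed afterwards, which is why a re-index is needed.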

SweetSpotSimilarity

2010-06-25 Thread Blargy
Would someone mind explaining how this differs from the DefaultSimilarity? Also how would one replace the use of the DefaultSimilarity class with this one? I can't seem to find any such configuration in solrconfig.xml. Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/S

Re: [ANN] Solr 1.4.1 Released

2010-06-25 Thread Mark Miller
Can a solr/maven dude look at this? I simply used the copy command on the release to-do wiki (sounds like it should be updated). If no one steps up, I'll try and straighten it out later. On 6/25/10 10:28 AM, Stevo Slavić wrote: > Congrats on the release! > > Something seems to be wrong with solr

Re: [ANN] Solr 1.4.1 Released

2010-06-25 Thread Stevo Slavić
Congrats on the release! Something seems to be wrong with the solr 1.4.1 maven artifacts: there is an extra solr in the path. E.g. solr-parent-1.4.1.pom is at http://repo1.maven.org/maven2/org/apache/solr/solr/solr-parent/1.4.1/solr-parent-1.4.1.pom while it should be at http://repo1.maven.org/maven2/o

Re: Recommended MySQL JDBC driver

2010-06-25 Thread Lukas Kahwe Smith
On 18.05.2010, at 17:22, Shawn Heisey wrote: > On 5/14/2010 12:40 PM, Shawn Heisey wrote: >> I downgraded to 5.0.8 for testing. Initially, I thought it was going to be >> faster, but it slows down as it gets further into the index. It now looks >> like it's probably going to take the same amou

[ANN] Solr 1.4.1 Released

2010-06-25 Thread Mark Miller
Apache Solr 1.4.1 has been released and is now available for public download! http://www.apache.org/dyn/closer.cgi/lucene/solr/ Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major feature

Re: MoreLikeThis (mlt) : use the match's maxScore for result score normalization

2010-06-25 Thread MitchK
Hi Chantal, Munich? Germany seems to be so small :-). Chantal Ackermann wrote: > > I only want a way to show to the user a kind of relevancy or similarity indicator (for example using a range of 10 stars) that would give a hint on how similar the mlt hit is to the input (match) item.
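
The star indicator described in the quote could be computed along these lines. This Python sketch assumes that dividing an mlt hit's score by a reference score (for example the match's maxScore, as in the subject line) gives a usable similarity ratio; whether Solr scores are meaningfully comparable in that way is exactly what the thread is discussing, so treat it as an illustration only. The function name is hypothetical.

```python
def similarity_stars(hit_score, reference_score, max_stars=10):
    """Map a hit score to 0..max_stars relative to a reference score."""
    if reference_score <= 0:
        return 0  # no meaningful reference to normalize against
    # Clamp the ratio so hits scoring above the reference still get max_stars.
    ratio = min(max(hit_score / reference_score, 0.0), 1.0)
    return round(ratio * max_stars)


if __name__ == "__main__":
    for score in (8.0, 3.2, 0.0):
        print(score, similarity_stars(score, 8.0))
```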

DIH - $deleteDocById

2010-06-25 Thread Ingar Hov
I seem to be having a hard time getting $deleteDocById to work with the XPathEntityProcessor. Has anyone tested it and got it to work? Here's a snippet of the code: -- .. .. .. Whenever I try to run a delta-import with a document that should be deleted from the index, it only updates the document in the ind

Re: performance sorting multivalued field

2010-06-25 Thread Marc Sturlese
>>*There are lots of docs with the same value. I mention that because I suppose that the same value has nothing to do with the number of un-inverted term instances. It does have to do with it: I've been able to reproduce the error by setting different values for each field: HTTP Status 500 - there are more terms th

Re: MoreLikeThis (mlt) : use the match's maxScore for result score normalization

2010-06-25 Thread Chantal Ackermann
Hi Mitch, thanks for the answer and the link. The use case is to provide content based recommendations for a single item no matter where that came from. So, this input (match) item is "the best match", all "more like this" items compare to it, and the ones that are the most alike would have the h

XML DataImportHandler copy + resize pictures in localhost?

2010-06-25 Thread scrapy
Hi, I'm adding documents to Solr via XML files and DataImportHandler. In the XML file I've got some product picture links: http://www.example.com/pic.jpg I would like to keep a local thumbnail of these pictures on the local server in order to avoid long external loading times. Example:

Re: SOLR-236 Patch

2010-06-25 Thread Martijn v Groningen
Hi Sam, It seems that the patch is out of sync again with the trunk. Can you try patching with revision 955615? I'll update the patch shortly. Martijn On 24 June 2010 09:49, Amdebirhan, Samson, VF-Group wrote: > Hi > > > > Trying to apply the SOLR-236 patch to a trunk i get what follows. Can >