Re: performance jetty (jetty.xml)

2011-10-19 Thread Gastone Penzo
ok Thanx ;) 2011/10/19 Otis Gospodnetic > Gastone, > > Those numbers are probably OK. Let us know if you have any actual problems > with Solr 3.4. Oh, and use the solr-user mailing list instead please. > > Otis > > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Lucene ecosyst

Re: add thumbnail image for search result

2011-10-19 Thread Paul Libbrecht
Hadi, I do not think Solr or SolrJ does this. Are your documents HTML documents? I would look in the crawler resources, but I note that rendering is a rather server-unfriendly task and it bears some security risk if the documents are not fully trusted. In i2geo.net, we finally gave up on automate

Re: score based on unique words matching???

2011-10-19 Thread Ahmet Arslan
> >> Heres my problem : > >> > >> field1 (text) - subject > >> q=david bowie changes > >> > >> Problem : If a record mentions david bowie a lot, > it beats out something more relevant (more unique matches) > ... > >> > >> A. (now appearing david bowie at the cineplex 7pm > david bowie goes on st

Re: IndexBasedSpellChecker on multiple fields

2011-10-19 Thread Simone Tripodi
Hi James! terrific suggestion, thanks a lot!!! And sorry for the delay (due to my timezone ;) ) I'll let you know how things will go, thanks once again and have a nice day! Simo http://people.apache.org/~simonetripodi/ http://simonetripodi.livejournal.com/ http://twitter.com/simonetripodi http://w

Re: Dismax boost + payload boost

2011-10-19 Thread Jean-Claude Dauphin
Hello Milan, You may also be interested in the following article: "Using Payloads with DisMaxQParser in SOLR" http://digitalpebble.blogspot.com/2010/08/using-payloads-with-dismaxqparser-in.html I have implemen

Dismax and phrases

2011-10-19 Thread Hyttinen Lauri
Hello, I've inherited a solr-lucene project which I continue to develop. This particular SOLR (1.4.1) uses dismax for the queries but I am getting some results that I do not understand. Mainly when I search for two terms I get some results however when I put quotes around the two terms I get

Optimization /Commit memory

2011-10-19 Thread Sujatha Arun
Do we require 2 or 3 times the OS RAM or hard disk space while performing a commit, an optimize, or both? What is the requirement in terms of RAM and hard disk size for commit and optimize? Regards Sujatha

Re: solr/lucene and its database (a silly question)

2011-10-19 Thread lorenlai
Hello Alireza, thank you for the link again ;-) Cheers Loren -- View this message in context: http://lucene.472066.n3.nabble.com/solr-lucene-and-its-database-a-silly-question-tp3431436p3433803.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: solr/lucene and its database (a silly question)

2011-10-19 Thread lorenlai
Hi Robert, many thanks to you as well; your short descriptions/explanations in answer to my questions were again really helpful. Cheers & have a nice day Loren

Re: Solr MultiValue Fields and adding values

2011-10-19 Thread Tiernan OToole
I was hoping that wasn't going to be the case... I ended up querying for all unique IDs in the DB, and then querying for each unique ID and getting all names, and then inserting them that way... Seems a lot slower than in theory it really should be..

Re: Solr scraping: Nutch and other alternatives.

2011-10-19 Thread Luis Cappa Banda
Hello Marco, Markus and Óscar. Thank you very much for your answers. What you suggest, Óscar, sounds very interesting. I mean the alternative that covers data mining with any 'popular searcher'. Do you know any tutorial or book that can teach me the first steps? Bye!

Re: Find Documents with field = maxValue

2011-10-19 Thread Alireza Salimi
What I'm looking for is to do everything in single shot in Solr. I'm not even sure if it's possible or not. Finding the max value and then running another query is NOT my ideal solution. Thanks everybody On Tue, Oct 18, 2011 at 6:28 PM, Sujit Pal wrote: > Hi Alireza, > > Would this work? Sort

Re: Dismax and phrases

2011-10-19 Thread Otis Gospodnetic
Lauri, Start with adding &debugQuery=true to your URL calls to Solr and look at how the queries are getting rewritten to understand what is going on.  What you are seeing is actually expected, so if you want your phrase query to be a strict phrase query, just use standard request handler, not d
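The debugQuery suggestion above can be sketched as a small URL builder. This is a minimal sketch, not taken from the thread: the host, port, and /solr/select path are assumed defaults, and the quoted query string "foo bar" is a hypothetical stand-in for the two search terms.

```python
from urllib.parse import urlencode


def build_debug_url(base_url, query):
    """Return a Solr select URL for `query` with debugQuery=true appended."""
    # debugQuery=true asks Solr to include the parsed/rewritten query
    # in the response, which shows how the phrase was handled.
    params = {"q": query, "debugQuery": "true"}
    return base_url + "?" + urlencode(params)


# Assumed local example instance; adjust host, port, and path as needed.
url = build_debug_url("http://localhost:8983/solr/select", '"foo bar"')
print(url)
```

Comparing the debug output for the quoted and unquoted forms of the same query should make the rewriting difference visible.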

RE: Filter Question

2011-10-19 Thread Monica Skidmore
Thanks Steven, that's just the kind of feedback I needed. And thanks also to Jan. I'll do a little clean-up on my filter and submit it... -Monica -Original Message- From: Steven A Rowe [mailto:sar...@syr.edu] Sent: Friday, October 14, 2011 3:18 AM To: solr-user@lucene.apache.org Sub

Painfully slow indexing

2011-10-19 Thread Pranav Prakash
Hi guys, I have set up a Solr instance and upon attempting to index documents, the whole process is painfully slow. I will try to put as much info as I can in this mail. Please feel free to ask me anything else that might be required. I am sending documents in batches not exceeding 2,000. The size of

Merging Remote Solr Indexes?

2011-10-19 Thread darren
Hi, I thought of a useful capability if it doesn't already exist. Is it possible to do an index merge between two remote Solr instances? To handle massive index-time scalability, wouldn't it be useful to have distributed indexes accepting local input, then merge them into one central index after? Dar

RE: Solr MultiValue Fields and adding values

2011-10-19 Thread Dyer, James
While Solr/Lucene can't support true document updates, there are 2 ways you might be able to work around this in your situation. 1. If you store all of the fields, you can write something that will read back everything already indexed to the document, append whatever data you want, then write i

Re: Solr scraping: Nutch and other alternatives.

2011-10-19 Thread Igor MILOVANOVIC
Try this if you haven't used Python before: http://gun.io/blog/python-for-the-web/ Keep in mind that the usage of "some very known search engine" is usually not in line with their ToS, so they will sooner or later block you, at least. Be gentle and polite, and you might even make it work... ;)

Re: Solr MultiValue Fields and adding values

2011-10-19 Thread Tiernan OToole
Thanks for the comment. Sounds like too much of a change in all fairness... I have actually made a tweak to my DB to allow multiple names, and storing them off the main table. my query then only needs to query the IDs, and then the second table to ge

Re: Solr MultiValue Fields and adding values

2011-10-19 Thread Tiernan OToole
That's what I thought too... we'll see what the speed difference actually is... running some tests now... Thanks for the info! --Tiernan On 19/10/2011 16:07, Dyer, James wrote: > > Not that I am doing this with any of my indexes, but I'm pretty su

stemEnglishPossessive and contractions

2011-10-19 Thread Herman Kiefus
We utilize a comprehensive dictionary of English words, place names, surnames, male and female first names, ... you get the point. As such, the possessive plural forms of these words are recognized as 'misspelled'. I simply thought that 'turning on' this option for the WordDelimiterFactory wou

Re: Merging Remote Solr Indexes?

2011-10-19 Thread Otis Gospodnetic
Hi Darren, http://search-lucene.com/?q=solr+merge&fc_project=Solr Check hit #1 Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ > >From: "dar...@ontrenet.com" >To: solr-user@lucene.apache

java.lang.NoSuchMethodError: org.slf4j.spi.LocationAwareLogger.log

2011-10-19 Thread Tod
I'm working on upgrading to Solr 3.4.0 and am seeing this error in my tomcat log. I'm using the following slf jars: slf4j-api-1.6.1.jar slf4j-jdk14-1.6.1.jar Has anybody run into this? I can reproduce it doing curl calls to the Solr ExtractingRequestHandler ala /solr/update/extract. TIA -

Re: stemEnglishPossessive and contractions

2011-10-19 Thread Robert Muir
The word delimiter filter also does other things: it treats ' as punctuation by default. So it normally splits on ', except if it's 's (in this case it removes the 's completely if you use stemEnglishPossessive). There are a couple of approaches you can use: 1. you can keep worddelimiterfilter wi

How to make UnInvertedField faster?

2011-10-19 Thread Michael Ryan
I was wondering if anyone has any ideas for making UnInvertedField.uninvert() faster, or other alternatives for generating facets quickly. The vast majority of the CPU time for our Solr instances is spent generating UnInvertedFields after each commit. Here's an example of one of our slower fields

dataimport indexing fails: where are my log files ? ;-)

2011-10-19 Thread Fred Zimmerman
dumb question ... today I set up solr3.4/example, indexing to 8983 via post is working, so is search, solr/dataimport reports 0 0 0 2011-10-19 18:13:57 Indexing failed. Rolled back all changes. Google tells me to look at the exception logs to find out what's happening ... but, I can't find the l

Re: dataimport indexing fails: where are my log files ? ;-)

2011-10-19 Thread Shawn Heisey
On 10/19/2011 12:42 PM, Fred Zimmerman wrote: dumb question ... today I set up solr3.4/example, indexing to 8983 via post is working, so is search, solr/dataimport reports 0 0 0 2011-10-19 18:13:57 Indexing failed. Rolled back all changes. Google tells me to look at the exception logs to find

Re: java.lang.NoSuchMethodError: org.slf4j.spi.LocationAwareLogger.log

2011-10-19 Thread Tim Terlegård
Hi Tod, I had similar issue with slf4j, but it was NoClassDefFound. Do you have some other dependencies in your application that use some other version of slf4j? You can use mvn dependency:tree to get all dependencies in your application. Or maybe there's some other version already in your tomcat

RE: stemEnglishPossessive and contractions

2011-10-19 Thread Herman Kiefus
Thanks Robert, exactly what I was looking for. -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, October 19, 2011 1:15 PM To: solr-user@lucene.apache.org Subject: Re: stemEnglishPossessive and contractions The word delimiter filter also does other things, it

where is solr data import handler looking for my file?

2011-10-19 Thread Fred Zimmerman
Solr dataimport is reporting file not found when it looks for foo.xml. Where is it looking for /data? Is this a URL off the apache2/htdocs on the server, or is it a URL within example/solr/...?

Re: Merging Remote Solr Indexes?

2011-10-19 Thread Darren Govoni
Hi Otis, Yeah, I saw the page, but it says it is for merging cores, which I presume must reside locally to the Solr instance doing the merging? What I'm interested in doing is merging across Solr instances running on different machines into a single Solr running on another machine (programmatically). I

RE: how was developed solr admin page and the UI part?

2011-10-19 Thread Jaeger, Jay - DOT
I believe that if you have the Solr distribution, you have the source for the web UI already: it is just .jsp pages. They are inside the solr .war file. JRJ -Original Message- From: nagarjuna [mailto:nagarjuna.avul...@gmail.com] Sent: Wednesday, October 19, 2011 12:07 AM To: solr-user@

RE: OS Cache - Solr

2011-10-19 Thread Jaeger, Jay - DOT
200 instances of what? The Solr application with lucene, etc. per usual? Solr cores? ??? Either way, 200 seems to be very very very many: unusually so. Why so many? If you have 200 instances of Solr in a 20 GB JVM, that would only be 100MB per Solr instance. If you have 200 instances of S
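The per-instance figure quoted above is simple division. As a worked check, assuming the 20 GB heap is split evenly across all 200 instances (real usage would not be uniform) and taking 1 GB as 1024 MB:

```python
def heap_per_instance_mb(total_heap_gb, instances):
    """Evenly divided heap share per instance, in MB (1 GB = 1024 MB)."""
    return total_heap_gb * 1024 / instances


# 20 GB shared by 200 instances comes to roughly 100 MB each.
share = heap_per_instance_mb(20, 200)
print(f"{share:.1f} MB per instance")
```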

RE: How to update document with solrj?

2011-10-19 Thread Jaeger, Jay - DOT
Solr does not have an "update" per se: you have to re-add the document. A document with the same value for the field defined as the uniqueKey will replace any existing document with that key (you do not have to query and explicitly delete it first). JRJ -Original Message- From: hadi
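To illustrate the re-add approach described above, here is a minimal sketch that builds a Solr XML `<add>` message for a complete document. The field names `id` and `name` are hypothetical; it assumes `id` is the field declared as uniqueKey in the schema. Posting such a message to the update handler for an `id` that already exists should replace the stored document rather than create a second one.

```python
import xml.etree.ElementTree as ET


def build_add_message(doc):
    """Serialize a dict as a Solr XML <add> message.

    List values become repeated <field> elements, which is how
    multi-valued fields are expressed in the XML update format.
    """
    add = ET.Element("add")
    doc_el = ET.SubElement(add, "doc")
    for name, value in doc.items():
        values = value if isinstance(value, list) else [value]
        for v in values:
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(v)
    return ET.tostring(add, encoding="unicode")


# Hypothetical document: the full set of fields must be sent every time,
# because the new version replaces the old one wholesale.
message = build_add_message({"id": "42", "name": ["first name", "second name"]})
print(message)
```

Because the replacement is wholesale, any field left out of the new message is lost, so the caller needs the complete document at hand (from the source database or from stored fields).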

RE: add thumnail image for search result

2011-10-19 Thread Jaeger, Jay - DOT
It won't do it for you automatically. I suppose you might create the thumbnail image beforehand, Base64 encode it, and add it as a stored, non-indexed, binary field (see schema: solr.BinaryField) when you index the document. JRJ -Original Message- From: hadi [mailto:md.anb...@gmail.com
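The Base64 step suggested above might look like the following sketch. The thumbnail bytes here are a placeholder rather than a real image, and the field names `id` and `thumbnail` are hypothetical; the schema would need a matching stored, non-indexed binary field.

```python
import base64


def encode_thumbnail(image_bytes):
    """Base64-encode raw image bytes into an ASCII string for a binary field."""
    return base64.b64encode(image_bytes).decode("ascii")


# Placeholder bytes standing in for a thumbnail generated beforehand.
thumbnail_bytes = b"\x89PNG\r\n\x1a\n" + b"\x00" * 8

# Hypothetical document to index; "thumbnail" is an assumed field name.
doc = {"id": "doc-1", "thumbnail": encode_thumbnail(thumbnail_bytes)}
print(doc["thumbnail"])
```

On the search side, the client would decode the stored value back to bytes before rendering it.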

RE: Optimization /Commit memory

2011-10-19 Thread Jaeger, Jay - DOT
Commit does not particularly spike disk or memory usage, unless you are adding a very large number of documents between commits. A commit can cause a need to merge indexes, which can increase disk space temporarily. An optimize is *likely* to merge indexes, which will usually increase disk spa

Re: Merging Remote Solr Indexes?

2011-10-19 Thread Otis Gospodnetic
Darren, No, that is not possible without one copying an index/shard to a single machine on which you would then merge indices as described on the Wiki. H, wouldn't it be nice to make use of existing replication code to make it possible to move shards around the cluster? Otis Sematext

Re: Merging Remote Solr Indexes?

2011-10-19 Thread Darren Govoni
Actually, yeah. If you think about it a remote merge is like the inverse of replication. Where replication is a one to many away from an index, the inverse would be merging many back to the one. Sorta like a recall. I think it would be a great analog to replication. On 10/19/2011 06:18 PM, Oti

RE: how was developed solr admin page and the UI part?

2011-10-19 Thread nagarjuna
Thank you for your reply, Jaeger. I saw that, and I would like to use that JSP code and modify the Solr UI a little bit for user convenience. Now my question is: is it possible to develop that using a Spring MVC architecture?

Re: Optimization /Commit memory

2011-10-19 Thread Sujatha Arun
Thanks Jay, I was trying to compute the *OS RAM requirement*, *not JVM RAM*, for a 14 GB index [cumulative index size of all instances]. And I put it thus - Requirement of Operating System RAM for an Index of 14GB is - Index Size + 3 Times the maximum Index Size of Individual Instance for Opt

Re: OS Cache - Solr

2011-10-19 Thread Sujatha Arun
Yes, 200 individual Solr instances, not Solr cores. We get an average response time of below 1 sec. The number of documents is not large for most of the instances; some of the instances have about 5 lakh (500,000) documents on average. Regards Sujatha On Thu, Oct 20, 2011 at 3:35 AM, Jaeger, Jay - DOT wrote: > 200

Re: OS Cache - Solr

2011-10-19 Thread GR
I wonder how you manage 200 instances. On 20 Oct 2011, at 09:21, Sujatha Arun wrote: Yes, 200 individual Solr instances, not Solr cores. We get an average response time of below 1 sec. The number of documents is not large for most of the instances; some of the instances have about 5 lakh documents