add thumnail image for search result

2011-10-18 Thread hadi
I want to know how can i add thumbnail image for my files when i am indexing files with solrj? thanks -- View this message in context: http://lucene.472066.n3.nabble.com/add-thumnail-image-for-search-result-tp3433440p3433440.html Sent from the Solr - User mailing list archive at Nabble.com.

How to update document with solrj?

2011-10-18 Thread hadi
I have indexed some files that do not have any tag or description and i want to add some field without deleting them,how can i update or add info to my index files with solrj? my idea for this issue is query on specific file and delete it and add some info and re index it but i think it is not a go

Re: How to change default operator in velocity?

2011-10-18 Thread hadi
thanks a lot,your answer is great On 10/18/11, Jan Høydahl / Cominvent [via Lucene] wrote: > > > Rather than deleting the dismax config, I would recommend adding a new entry > inside your "/browse" request handler config's tag: > > 0 > > This will go "OR" mode, and you will still benefit from al

Re: OS Cache - Solr

2011-10-18 Thread Sujatha Arun
Thanks ,Otis, This is our Solr Cache Allocation.We have the same Cache allocation for all our *200+ instances* in the single Server.Is this too high? *Query Result Cache*:LRU Cache(maxSize=16384, initialSize=4096, autowarmCount=1024, ) *Document Cache *:LRU Cache(maxSize=16384, initialSize=1638

use lucene to create index(with synonym) and solr query index

2011-10-18 Thread cmd
1.use lucene to create index(with synonym) 2.config solr open synonym functionality 3.user solr to query lucene index but the result missing the synonym word why? and how can i do with each other. thanks! -- View this message in context: http://lucene.472066.n3.nabble.com/use-lucene-to-create-in

Re: changing base URLs in indexes

2011-10-18 Thread Markus Jelsma
Is this a crawler indexing the pages? If so, i would point it to whatever you need. If, for some reason, you cannot, you can modifiy the host/domain in your index using pattern char filters or maybe the stored (returned) values using a custom update processor. > Hi, > > I am getting ready to i

Re: Question about near query order

2011-10-18 Thread Jason, Kim
Thanks a ton iorixxx. Jason. -- View this message in context: http://lucene.472066.n3.nabble.com/Question-about-near-query-order-tp3427312p3432922.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Instructions for Multiple Server Webapps Configuring with JNDI

2011-10-18 Thread Shawn Heisey
On 10/18/2011 6:59 AM, Tod wrote: One more question, is there a particular advantage of multiple solr instances vs. multiple solr cores? One way of doing multiple instances is running more than one copy of your container (tomcat/jetty/whatever). I've never tried to put more than one .war fil

changing base URLs in indexes

2011-10-18 Thread Fred Zimmerman
Hi, I am getting ready to index a recent copy of Wikipedia's pages-articles dump. I have two servers, foo and bar. On foo.com/mediawiki I have a Mediawiki install serving up the pages. On bar.com/solr I have my solr install. I have the pages-articles.xml file from Wikipedia and the solr instruct

Hit search-lucene.com a little harder

2011-10-18 Thread Otis Gospodnetic
Hello folks, Do you ever use http://search-lucene.com (SL) or http://search-hadoop.com (SH)? If you do, I'd like to ask you for a small favour: We are at Lucene Eurocon in Barcelona and we are about to show the Search Analytics [1] and Performance Monitoring [2] tools/services we've built and t

score based on unique words matching???

2011-10-18 Thread Craig Stadler
Heres my problem : field1 (text) - subject q=david bowie changes Problem : If a record mentions david bowie a lot, it beats out something more relevant (more unique matches) ... A. (now appearing david bowie at the cineplex 7pm david bowie goes on stage, then mr. bowie will sign autographs)

Re: OS Cache - Solr

2011-10-18 Thread Otis Gospodnetic
Maybe your Solr Document cache is big and that's consuming a big part of that JVM heap? If you want to be able to run with a smaller heap, consider making your caches smaller. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/

Re: How to retreive multiple documents using one unique field?

2011-10-18 Thread Otis Gospodnetic
This won't work.  But you could add all 4 docs with the same "123" value in their id fields, just comment out uniqueKey field.  Don't ask me what will or will not happen when you later try updating a document with id:123... Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Luce

Re: Find Documents with field = maxValue

2011-10-18 Thread Sujit Pal
Hi Alireza, Would this work? Sort the results by age desc, then loop through the results as long as age == age[0]. -sujit On Tue, 2011-10-18 at 15:23 -0700, Otis Gospodnetic wrote: > Hi, > > Are you just looking for: > > age: > > This will return all documents/records where age field is equal

Re: performace jetty (jetty.xml)

2011-10-18 Thread Otis Gospodnetic
Gastone, Those numbers are probably OK.  Let us know if you have any actual problems with Solr 3.4.  Oh, and use the solr-user mailing list instead please. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ >___

Re: Find Documents with field = maxValue

2011-10-18 Thread Otis Gospodnetic
Hi, Are you just looking for: age: This will return all documents/records where age field is equal to target age. But maybe you want age:[0 TO ] This will include people aged from 0 to target age. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search ::

Dismax boost + payload boost

2011-10-18 Thread Milan Dobrota
Is it possible to combine dismax boost (query time) and payload boost (index time)? I've done something very similar to this post http://sujitpal.blogspot.com/2011/01/payloads-with-solr.html but it seems that query time boosts get ignored.

Term Frequency - tf() ?

2011-10-18 Thread Hung Huynh
I've revised the tf() function to always return 1, regardless of the number of terms it finds. However, I run into a problem when a stemming words and root words appear together. These documents get a higher boost than documents with just the root. For example: "woman walking fast" gets tf(woman)

RE: How to retreive multiple documents using one unique field?

2011-10-18 Thread Jaeger, Jay - DOT
I do not believe that it will work as you have written it, unless you put an application in between to read that XML and then call Solr with what it expects. See http://wiki.apache.org/solr/UpdateXmlMessages You need to have: unique-value-if-any-1 abc 123 un

RE: Getting errors thrown from sun.nio.ch.FileDispatcher with native or simple or single lock .Please , i need help in resolving the issue.

2011-10-18 Thread Jaeger, Jay - DOT
As others have reported, I also did not get your image. I am interested in your situation because we will deploy to WAS 7 in production, and have tested there. One thing I noted that might point to a possible problem you might have: 1. "The owner of the files created in the 2 environment

Access Document Score in Custom Function Query (ValueSource)

2011-10-18 Thread sangrish
Hi, I use the following 2 components in ranking documents: "Normal Query" : myField^2 Custom Function Query(ValueSource): myFunc() In this value source I compute another score for every document using some features. I want to access the score of the query myField^2 (for

Re: Query with star returns double type values equal 0

2011-10-18 Thread romain
Hi iorixxx, I am using lucene On Monday, October 17, 2011 5:58:31 PM, iorixxx [via Lucene] wrote: > > I am experiencing an unexpected behavior using solr 3.4.0. > > > > if my query includes a star, all the properties of type > > 'long' or 'LatLon' > > have 0 as value > > (ex: select/?start=0&q=wa

Re: How to change default operator in velocity?

2011-10-18 Thread Jan Høydahl
Rather than deleting the dismax config, I would recommend adding a new entry inside your "/browse" request handler config's tag: 0 This will go "OR" mode, and you will still benefit from all the advantages that DisMax gives you for weighted search across different fields. See http://wiki.apac

Re: Question about near query order

2011-10-18 Thread Ahmet Arslan
> Is it possible only defType=lucnee in your second > suggestion? > I'm using ComplexPhraseQueryParser. > So my defType is "complexphrase". Oh, then life is easy. Just setting the inOrder parameter to false in solrconfig.xml should do the trick. false

RE: IndexBasedSpellChecker on multiple fields

2011-10-18 Thread Dyer, James
Simone, You can set up a "master" dictionary but with a few caveats. What you'll need to do is all of the fields you want to include in your "master" dictionary into one field and base your IndexBasedSpellChecker dictionary on that. In addition, I would recommend you use the "collate" featur

Re: solr/lucene and its database (a silly question)

2011-10-18 Thread Robert Stewart
SOLR stores all data in the directory you specify in solrconfig.xml in dataDir setting. SOLR uses Lucene to store all the data in one or more proprietary binary files called segment files. As a SOLR user typically you should not be too concerned with binary index structure. You can see detail

Re: Question about near query order

2011-10-18 Thread Jason, Kim
Thank you for your kind reply. Is it possible only defType=lucnee in your second suggestion? I'm using ComplexPhraseQueryParser. So my defType is "complexphrase". -- View this message in context: http://lucene.472066.n3.nabble.com/Question-about-near-query-order-tp3427312p3431465.html Sent fro

Re: solr/lucene and its database (a silly question)

2011-10-18 Thread Alireza Salimi
In here: http://wiki.apache.org/solr/SolrConfigXml#dataDir_parameter On Tue, Oct 18, 2011 at 10:38 AM, lorenlai wrote: > Hello expert, > > I have just a silly question regarding to Solr/Lucene, pls. > > Where are the importing data stored ? In Lucene or Solr ? > Here is a picture of the architec

solr/lucene and its database (a silly question)

2011-10-18 Thread lorenlai
Hello expert, I have just a silly question regarding to Solr/Lucene, pls. Where are the importing data stored ? In Lucene or Solr ? Here is a picture of the architecture. http://3.bp.blogspot.com/-rTZPN3sm9e0/TjAdqciXHgI/Cs0/N_W_iSAI8cY/s1600/solr_arch.jpg I mean when importing the data

RE: Find Documents with field = maxValue

2011-10-18 Thread Brandon Ramirez
I don't know anything about your environment, so maybe this doesn't make sense, but maybe you can check your source system (database or whatnot) to get the max_age, then search for the max_age in your Solr index. It's not as elegant, but may be a lot easier. To reduce the risk of interacting wi

Re: performace jetty (jetty.xml)

2011-10-18 Thread Alireza Salimi
Can't you use some profilers to find out about your new performance? I'm new to Solr, but I think 200,000 req/day is not that many. On Tue, Oct 18, 2011 at 10:03 AM, Gastone Penzo wrote: > Hi, > i just change my solr installation from 1.4 to 3.4.. > i can notice that also jetty configuration file

Re: Find Documents with field = maxValue

2011-10-18 Thread Alireza Salimi
Hi Ahmet, Thanks for your reply, but I want ALL documents with age = max_age. On Tue, Oct 18, 2011 at 9:59 AM, Ahmet Arslan wrote: > > > --- On Tue, 10/18/11, Alireza Salimi wrote: > > > From: Alireza Salimi > > Subject: Find Documents with field = maxValue > > To: solr-user@lucene.apache.or

performace jetty (jetty.xml)

2011-10-18 Thread Gastone Penzo
Hi, i just change my solr installation from 1.4 to 3.4.. i can notice that also jetty configuration file (jetty.xml) is changed. default threads number is higher, theadpool is higher and other default value are higher. is it normal?? what number of these value do you seems are correct for me? i ha

Re: Find Documents with field = maxValue

2011-10-18 Thread Ahmet Arslan
--- On Tue, 10/18/11, Alireza Salimi wrote: > From: Alireza Salimi > Subject: Find Documents with field = maxValue > To: solr-user@lucene.apache.org > Date: Tuesday, October 18, 2011, 4:10 PM > Hi, > > It might be a naive question. > Assume we have a list of Document, each Document contains >

Re: millions of records problem

2011-10-18 Thread Tom Gullo
Getting a solid-state drive might help -- View this message in context: http://lucene.472066.n3.nabble.com/millions-of-records-problem-tp3427796p3431309.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to change default operator in velocity?

2011-10-18 Thread hadi
thanks for your reply,i delete the dismax conf from solrconf.xml and it works,is it any side effect? On 10/18/11, Jan Høydahl / Cominvent [via Lucene] wrote: > > > Hi, > > The reason why AND is default with /browse is that it uses the "dismax" > query parser, which does not currently respect defa

Find Documents with field = maxValue

2011-10-18 Thread Alireza Salimi
Hi, It might be a naive question. Assume we have a list of Document, each Document contains the information of a person, there is a numeric field named 'age', how can we find those Documents whose *age* field is *max(age) *in one query. So far I've found that function queries don't support aggreg

solr fullpledged

2011-10-18 Thread nagarjuna
Hi everybody i just downloaded solr application and modified the config files as per my requirement and i successfully got the results and i also developed a sample client application using javascript and i used my solr url there to retrieve the results evrything is fine

Re: How to change default operator in velocity?

2011-10-18 Thread Jan Høydahl
Hi, The reason why AND is default with /browse is that it uses the "dismax" query parser, which does not currently respect defaultOperator. If you want an OR like behaviour, try to add at the end of the url: &mm=0 (which means minumum number of terms that should match=0), e.g. http://localhost:8

Re: Instructions for Multiple Server Webapps Configuring with JNDI

2011-10-18 Thread Tod
On 10/14/2011 2:44 PM, Chris Hostetter wrote: : modified the solr/home accordingly. I have an empty directory under : tomcat/webapps named after the solr home directory in the context fragment. if that empty directory has the same base name as your context fragment (ie: "tomcat/webapps/solr0"

IndexBasedSpellChecker on multiple fields

2011-10-18 Thread Simone Tripodi
Hi all guys, I need to configure the IndexBasedSpellChecker that uses more than just one field as a spelling dictionary, is it possible to achieve? In the meanwhile I configured two spellcheckers and let users switch from a checkeer to another via params on GET request, but looks like people are no

Re: help with phrase query

2011-10-18 Thread elisabeth benoit
I think you can use pf2 and pf3 in your requestHandler. Best regards, Elisabeth 2011/10/16 Vijay Ramachandran > Hello. I have an application where I try to match longer queries > (sentences) > to short documents (search phrases). Typically, the documents are 3-5 terms > in length. I am facing a

How to retreive multiple documents using one unique field?

2011-10-18 Thread kiran.bodigam
I have four different documents in single xml file(to be indexed), i don't want inject the unique field for each and every document .when i search with with the unique field all the four documents should come in result.i.e can common unique field should be applied to the all documents? My xml form

How to change default operator in velocity?

2011-10-18 Thread hadi
in solr schema the defaultOperator value is OR but when i use browse(http://localhost:8983/solr/browse)for searching AND is a defaultOperator,and that config in solr is not affect on velocity how can i change the velocity template engine default operators? -- View this message in context: http:/

Re: upgrading 1.4 to 3.x

2011-10-18 Thread deniz
well i made a little diggin on web... so the problem is also described here too https://issues.apache.org/bugzilla/show_bug.cgi?id=40719 basically there was no details in the tomcat logs (maybe in some other logs but well i dont know) i came up with the same problem while implementing something

Re: Solr scraping: Nutch and other alternatives.

2011-10-18 Thread Óscar Marín Miró
Hi Luis, just an opinion (worked with Nutch intensively, 2005-2008). Web crawling is a bitch, and Nutch won't make it any easier. Some problems you'll find along the way: 1. Spidering tunnels/traps 2. Duplicate and near-duplicate content removal 3. GET parameter explosion in dynamic page

Re: Question about near query order

2011-10-18 Thread Ahmet Arslan
> "analyze term"~2 > "term analyze"~2 > > In my case, two queries return different result set. > Isn't that in your case? Hmm you are right, I tested with a trunk instance using lucene query parser. Results sets were different. If I am not wrong they were same at some version. I can suggest y

Re: Solr scraping: Nutch and other alternatives.

2011-10-18 Thread Markus Jelsma
I'm a bit biased but i would certainly use Nutch as it's the right tool for the job, it seems. Developing custom plugins is actually easier than you might think. Solr, with it's extracting request handling, can only help in a very limited way. > Hello everyone. > > I've been thinking about a

Re: Controlling the order of partial matches based on the position

2011-10-18 Thread Marco Martinez
Hi, I would use a custom function query that uses termPositions to calculate the order of the values in the field to accomplished your requirements. Marco Martínez Bautista http://www.paradigmatecnologico.com Avenida de Europa, 26. Ática 5. 3ª Planta 28224 Pozuelo de Alarcón Tel.: 91 352 59 42

Re: Solr scraping: Nutch and other alternatives.

2011-10-18 Thread Marco Martinez
Hi Luis, Have you tried the copyField function with custom analyzers and tokenizers? bye, Marco Martínez Bautista http://www.paradigmatecnologico.com Avenida de Europa, 26. Ática 5. 3ª Planta 28224 Pozuelo de Alarcón Tel.: 91 352 59 42 2011/10/18 Luis Cappa Banda > Hello everyone. > > I've b

Re: Controlling the order of partial matches based on the position

2011-10-18 Thread lee carroll
this link is on he mailing list recently. http://www.lucidimagination.com/search/document/dfa18d52e7e8197c/getting_answers_starting_with_a_requested_string_first#b18e9f922c1e4149 On 18 October 2011 00:59, aronitin wrote: > Guys, > > It's been almost a week but there are no replies to the questi