Re: feeding while solr is running ?

2011-10-17 Thread lorenlai
Hello Robert, also many thanks to you for the LINKS and the short explanation. ;-) *hug* & cheers Loren -- View this message in context: http://lucene.472066.n3.nabble.com/feeding-while-solr-is-running-tp3428500p3430483.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: feeding while solr is running ?

2011-10-17 Thread lorenlai
Hello Alireza, thank you for your reply. I will read the solr tutorial ;-) Cheers Loren

Solr scraping: Nutch and other alternatives.

2011-10-17 Thread Luis Cappa Banda
Hello everyone. I've been thinking about a way to retrieve information from a domain (for example, http://www.ign.com) to process and index. My idea is to use Solr as the search engine. I'm familiar with Apache Nutch and I know that the latest version has a gateway to Solr to retrieve and index infor

Re: OS Cache - Solr

2011-10-17 Thread Sujatha Arun
Hello Jan, Thanks for your response and clarification. We are monitoring the JVM cache utilization and we are currently using about 18 GB of the 20 GB assigned to the JVM. Our total index size is about 14 GB. Regards Sujatha On Tue, Oct 18, 2011 at 1:19 AM, Jan Høydahl wrote: > Hi Sujatha, > > Ar

Re: Question about near query order

2011-10-17 Thread Jason, Kim
"analyze term"~2 "term analyze"~2 In my case, the two queries return different result sets. Is that not so in your case?

Word de-compounding using the terms in the index?

2011-10-17 Thread mtraynham
Say, for instance, that at query time I would like to use the terms within my index to de-compound query terms. The current solution I'm aiming for is to build a "suggester"-like component into the query pipeline using TSTLookups. Since all Lookups are required to be SolrCoreAware, that is, rebuilt when com

Re: Controlling the order of partial matches based on the position

2011-10-17 Thread aronitin
Guys, It's been almost a week but there are no replies to the question that I posted. If it's a small problem and already answered somewhere, please point me to that post. Otherwise please suggest any pointers for handling the requirement mentioned in the question. Nitin

Re: Selective Result Grouping

2011-10-17 Thread entdeveloper
Not necessarily collapse.type=adjacent. That is only when two docs with the same field value appear next to each other. I'm more concerned with the case where we only want a group of a certain type (no matter where the subsequent docs may be), leaving the rest of the documents ungrouped. The curre

score based on unique words matching

2011-10-17 Thread Craig Stadler
Here's my problem: field1 (text) - subject q=david bowie changes Problem: If a record mentions david bowie a lot, it beats out something more relevant (more unique matches) ... A. (now appearing david bowie at the cineplex 7pm david bowie goes on stage, then mr. bowie will sign autographs)

Re: Query with star returns double type values equal 0

2011-10-17 Thread Ahmet Arslan
> I am experiencing an unexpected behavior using solr 3.4.0. > > if my query includes a star, all the properties of type > 'long' or 'LatLon' > have 0 as value > (ex: select/?start=0&q=way*&rows=10&version=2) > > Though the same request without stars returns correct > values > (ex: select/?start=

Can you please guide me through step-by-step installation of Solr Cell ?

2011-10-17 Thread Sina Fakhraee
Dear Sir/Madam, I am trying to use curl "http://localhost:8080/solr/update/extract?literal.id=doc1&commit=true" -F "myfile=@somefile.pdf" from the wiki site... but I get the error: Caused by: org.apache.solr.common.SolrException: Error loading class 'solr.extraction.ExtractingRequestHa

Transformer is applied on deltaQuery rather than deltaImportQuery

2011-10-17 Thread Jeff Zhang
Hi all, I have a custom transformer, but it confuses me that the custom transformer is applied on deltaQuery rather than deltaImportQuery. My understanding is that Solr first executes deltaQuery and then deltaImportQuery. So I think the output format of deltaImportQuery should be the same a

Re: Callback on starting solr?

2011-10-17 Thread Jan Høydahl
Check your solrconfig.xml to see whether your ping handler is configured: http://wiki.apache.org/solr/SolrConfigXml#The_Admin.2BAC8-GUI_Section -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. okt. 2011, at 20:07, Jithin wrote: > How

Re: OS Cache - Solr

2011-10-17 Thread Jan Høydahl
Hi Sujatha, Are you sure you need 20Gb for Tomcat? Have you profiled using JConsole or similar? Try with 15Gb and see how it goes. The reason why this is beneficial is that you WANT your OS to have available memory for disk caching. If you have 17Gb free after starting Solr, your OS will be abl
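A quick sketch of the arithmetic behind this advice, using the figures from the original post in this thread (32 GB RAM, about 14 GB of combined index) and comparing the current 20 GB heap with the suggested 15 GB. It assumes, as a simplification, that nothing else on the machine uses significant memory; the function name is purely illustrative.

```python
def free_for_os_cache(total_ram_gb, jvm_heap_gb):
    """RAM left over for the OS disk cache, ignoring other processes (a simplification)."""
    return total_ram_gb - jvm_heap_gb

TOTAL_RAM_GB = 32   # from the original post
INDEX_SIZE_GB = 14  # combined size of all indexes, from the original post

for heap_gb in (20, 15):
    free_gb = free_for_os_cache(TOTAL_RAM_GB, heap_gb)
    fits = free_gb >= INDEX_SIZE_GB
    print(f"heap={heap_gb} GB -> {free_gb} GB free, whole index cacheable: {fits}")
```

With a 20 GB heap only 12 GB remain, which is less than the 14 GB index; with 15 GB the remaining 17 GB could hold the whole index in the OS cache.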

RE: Xsl for query output

2011-10-17 Thread Jaeger, Jay - DOT
It depends upon whether you want Solr to do the XSL processing, or the browser. After fussing a bit, and doing some reading and thinking, we decided it was best to let the browser do the work, at least in our case. If the browser is doing the processing, you don't need to modify solrconfig.xml

RE: Error loading class 'solr.extraction.ExtractingRequestHandler'

2011-10-17 Thread Jaeger, Jay - DOT
It sounds like maybe you either have not told Solr where the Solr home directory is or, more likely, have not copied the jar files for this particular class into the right directory (typically a "lib" directory), so Tomcat cannot find that class. There is other correspondence on this list that

Re: Callback on starting solr?

2011-10-17 Thread Jithin
How do I configure solr with a ping request? http://localhost:8983/solr/admin/ping/ gives HTTP 404. On Mon, Oct 17, 2011 at 1:06 AM, Jan Høydahl / Cominvent [via Lucene] < ml-node+s472066n3426539...@n3.nabble.com> wrote: > Your app-server will start listening to the port some time before the Solr

OS Cache - Solr

2011-10-17 Thread Sujatha Arun
Hello, I am trying to understand the OS cache utilization of Solr. Our server has several Solr instances. The total combined index size of all instances is about 14 GB and the size of the largest single index is about 2.5 GB. Our server has a quad processor with 32 GB RAM, out of which 20

Re: NRT and replication

2011-10-17 Thread Esteban Donato
thanks Yonik. Any idea of when this should be completed? In the meantime I think I will have to add docs to every replica, possibly implementing an update processor. Something similar to SOLR-2355? On Fri, Oct 14, 2011 at 7:31 PM, Yonik Seeley wrote: > On Fri, Oct 14, 2011 at 5:49 PM, Esteban

Re: millions of records problem

2011-10-17 Thread Vadim Kisselmann
Hi, a number of relevant questions have already been asked. I have another one: which type of docs do you have? Do you add some new docs every day? Or is it a stable number of docs (500 million)? What about replication? Regards Vadim 2011/10/17 Otis Gospodnetic > Hi Jesús, > > Others have already asked a number

Re: feeding while solr is running ?

2011-10-17 Thread Robert Stewart
See below... On Oct 17, 2011, at 11:15 AM, lorenlai wrote: > 1) I would like to know if it is possible to import data (feeding) while > Solr is still running ? Yes. You can search and index new content at the same time. But typically in production systems you may have one or more "master" SOL

Query with star returns double type values equal 0

2011-10-17 Thread romain
Hello, I am experiencing an unexpected behavior using solr 3.4.0. if my query includes a star, all the properties of type 'long' or 'LatLon' have 0 as value (ex: select/?start=0&q=way*&rows=10&version=2) Though the same request without stars returns correct values (ex: select/?start=0&q=way&rows

Re: feeding while solr is running ?

2011-10-17 Thread Alireza Salimi
Well, I'm not a Solr expert, but the first thing that you should start reading is the Solr tutorial and then Solr wiki. It won't take long to read both of them. Regarding your questions: 1) It's possible 2 and 3) There are different ways to update (HTTP or Java). You can create a CRON job to send
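As a minimal sketch of the HTTP route mentioned here: Solr's XML update format wraps documents in `<add><doc>` elements, and the message can be posted while Solr keeps serving searches. The URL (`http://localhost:8983/solr/update`, the port of the example setup mentioned elsewhere in this archive) and the field names `id` and `title` are assumptions that must match your own installation and schema; the helper names are hypothetical.

```python
import urllib.request
import xml.etree.ElementTree as ET

def build_add_message(docs):
    """Build a Solr XML <add> message from a list of {field: value} dicts."""
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")

def post_update(xml_message, url="http://localhost:8983/solr/update?commit=true"):
    """POST the message to a running Solr instance (URL is an assumption)."""
    request = urllib.request.Request(
        url,
        data=xml_message.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status

# Only build the message here; call post_update(message) against a live server.
message = build_add_message([{"id": "doc1", "title": "hello"}])
print(message)
```

A cron job could run a script like this periodically to feed new records.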

feeding while solr is running ?

2011-10-17 Thread lorenlai
Hello Solr experts, I'm a newbie regarding Solr. 1) I would like to know if it is possible to import data (feeding) while Solr is still running? Is that possible, or should Solr be "shut down" before I start my feeding process? Any LINKS on this topic? :-) 2) How can I import my data

Re: Question about near query order

2011-10-17 Thread Ahmet Arslan
> I have some near query like "analyze term"~2. > That is matched in that order. > But I want to search regardless of order. > So far, I just queried "analyze term"~2 OR "term > analyze"~2. > Is there a better way than what i did? I think PhraseQuery should be unordered with slop values greater th
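The workaround quoted above (OR-ing the phrase in both term orders) can be generated mechanically; the sketch below does that for any number of terms. The function name is hypothetical and the terms are assumed to need no escaping. For background, Lucene's PhraseQuery documentation describes slop as an edit distance in which swapping two adjacent terms costs two moves, so a slop of at least 2 is what permits a reversed two-term match.

```python
from itertools import permutations

def unordered_near_query(terms, slop):
    """OR together sloppy phrase queries for every ordering of the terms."""
    clauses = []
    for ordering in permutations(terms):
        phrase = " ".join(ordering)
        clauses.append('"%s"~%d' % (phrase, slop))
    return " OR ".join(clauses)

print(unordered_near_query(["analyze", "term"], 2))
```

The number of clauses grows factorially with the number of terms, so this is only practical for short phrases.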

Re: Multi CPU Cores

2011-10-17 Thread Robert Brown
Thanks Otis, I certainly won't be copying & pasting - Good to know such options are available tho. On Mon, 17 Oct 2011 07:01:24 -0700 (PDT), Otis Gospodnetic wrote: > Robert, > > You have to add (some of) that stuff to the command for starting > Java/Tomcat.  Likely in a catalina.sh script. >

Re: millions of records problem

2011-10-17 Thread Otis Gospodnetic
Hi Jesús, Others have already asked a number of relevant questions. If I had to guess, I'd guess this is simply a disk IO issue, but of course there may be room for improvement without getting more RAM or SSDs, so tell us more about your queries, about disk IO you are seeing, etc. Otis Se

Re: Multi CPU Cores

2011-10-17 Thread Otis Gospodnetic
Robert, You have to add (some of) that stuff to the command for starting Java/Tomcat.   Likely in a catalina.sh script. That said, I do NOT recommend you use those parameters at all because they may be completely unneeded or even unsuitable for your environment. Otis Sematext :: http://sem

Re: Getting errors thrown from sun.nio.ch.FileDispatcher with native or simple or single lock .Please , i need help in resolving the issue.

2011-10-17 Thread Otis Gospodnetic
Anitha, I don't know about others, but your image didn't come through.  Try describing and pasting the text of the error instead. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ > >From: An

Getting errors thrown from sun.nio.ch.FileDispatcher with native or simple or single lock .Please , i need help in resolving the issue.

2011-10-17 Thread Anitha Muppural (amuppura)
Hi, My name is Anitha Muppural and I work as a software engineer at Cisco. I have Solr/Lucene 3.3.0 installed, with a single core. The Solr web application is deployed to WebSphere Application Server 6.1 in a cluster JVM (2). I do delta imports programmatically using SolrJ and applic

Re: millions of records problem

2011-10-17 Thread Nick Veenhof
You could use this technique? I'm currently reading up on it http://khaidoan.wikidot.com/solr-common-gram-filter On 17 October 2011 12:57, Jan Høydahl wrote: > Hi, > > What exactly do you mean by "slow" search? 1s? 10s? > Which operating system, how many CPUs, which servlet container and how muc

RE: document update / nested documents / document join

2011-10-17 Thread Kai Gülzau
I just found another feature/ticket to be able to update fields: https://issues.apache.org/jira/browse/SOLR-2753 https://issues.apache.org/jira/browse/LUCENE-1231 -> CSF Column Stride Fields This should work well with simple fields like category/date/...!? So I have 2 options: 1.) Introduce a ra

Re: Multi CPU Cores

2011-10-17 Thread Johannes Goll
Yes, same thing. This was for the jetty servlet container not tomcat. I would refer to the tomcat documentation on how to modify/configure the java runtime environment (JRE) arguments for your running instance. Johannes On Oct 17, 2011, at 4:01 AM, Robert Brown wrote: > Where exactly do you se

Re: document update / nested documents / document join

2011-10-17 Thread Thijs
Hi, First, I'm not sure you know, but the join isn't like a join in a database; it's more like select * from (set of documents that match query) where exists (set of documents that match join query). I have some complex (multiple join fq) in one call and that is fine, so I think this query ma

Re: millions of records problem

2011-10-17 Thread Jan Høydahl
Hi, What exactly do you mean by "slow" search? 1s? 10s? Which operating system, how many CPUs, which servlet container and how much RAM have you allocated to your JVM? (-Xmx) What kind and size of docs? Your numbers indicate about 100bytes per doc? What kind of searches? Facets? Sorting? Wildcard

Re: Performance issue: Frange with geodist()

2011-10-17 Thread roySolr
Hi Yonik, I have used your suggestion to implement a better radius searcher: &facet.query={!geofilt d=10 key=d10} &facet.query={!geofilt d=20 key=d20} &facet.query={!geofilt d=50 key=d50} It is a little bit faster than with geodist() but still a bottleneck, I think.
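For readers reconstructing the full request, here is a small sketch that builds the repeated `facet.query` parameters shown above into a URL-encoded query string. `geofilt` also needs the spatial field and centre point, passed here through Solr's `sfield` and `pt` parameters; the field name `location`, the coordinates, and the `q=*:*` main query are assumptions, not values from the post.

```python
from urllib.parse import urlencode, parse_qs

def radius_facet_params(sfield, pt, distances_km):
    """Return a query string with one geofilt facet.query per distance."""
    params = [("q", "*:*"), ("facet", "true"), ("sfield", sfield), ("pt", pt)]
    for d in distances_km:
        # Doubled braces produce literal { and } in the f-string.
        params.append(("facet.query", f"{{!geofilt d={d} key=d{d}}}"))
    return urlencode(params)

query_string = radius_facet_params("location", "52.37,4.89", [10, 20, 50])
# Decode again to show the facet queries in readable form.
print(parse_qs(query_string)["facet.query"])
```

Passing a list of tuples to `urlencode` is what allows the same parameter name to appear several times.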

millions of records problem

2011-10-17 Thread Jesús Martín García
Hi, I've got 500 million documents in Solr, each with the same number of fields and a similar width. The version of Solr I use is 1.4.1 with Lucene 2.9.3. I don't have the option to use shards, so the whole index has to be on one machine... The size of the index is about 50 GB and t

RE: document update / nested documents / document join

2011-10-17 Thread Kai Gülzau
Nobody? SOLR-139 seems to be the most popular issue but I don’t think this will be resolved in the near future (this year). Right? So I will try SOLR-2272 as a workaround, split up my documents into "static" and "frequently updated" and join them at query time. What is the exact join query to do a

Re: multiple document types in a core

2011-10-17 Thread lee carroll
Just as a follow-up, it looks like stored fields are stored verbatim for every doc. Hotel index, index and store dest attributes: index size 131M, number of records 49147. Hotel index, index only dest attributes: index size 111M, number of records 49147. ~400 chars (bytes) of destination data * 49147 (number of
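The figures in this post can be checked with a few lines of arithmetic. The sketch below multiplies the approximate stored size per document by the record count and compares it with the observed growth in index size; it assumes "M" means roughly 10^6 bytes, and the variable names are illustrative only.

```python
RECORDS = 49147
DEST_BYTES_PER_DOC = 400  # approximate, from the post

SIZE_STORED_MB = 131      # index size with dest attributes indexed and stored
SIZE_INDEX_ONLY_MB = 111  # index size with dest attributes indexed only

expected_mb = RECORDS * DEST_BYTES_PER_DOC / 1e6
observed_mb = SIZE_STORED_MB - SIZE_INDEX_ONLY_MB

print(f"expected extra storage: {expected_mb:.1f} MB")
print(f"observed difference:    {observed_mb} MB")
```

The two numbers agree closely (about 19.7 MB versus 20 MB), which is consistent with the stored values being written once per document rather than shared.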

Solr indexing plugin: skip single faulty document?

2011-10-17 Thread samuele.mattiuzzo
Hi all, as far as I know, when Solr finds a faulty document (inside an XML file containing, let's say, 1000 docs) it skips the whole file and the indexing process exits with an exception (am I correct?) I'm using a custom indexing plugin, and I can trap the exception. Instead of using "default" values if that

Re: Multi CPU Cores

2011-10-17 Thread Robert Brown
Where exactly do you set this up? We're running Solr 3.4 under Tomcat, OpenJDK 1.6.0.20. BTW, is the JRE just a different name for the VM? Apologies for such a newbie Java question. On Sun, 16 Oct 2011 12:51:44 -0400, Johannes Goll wrote: > we use the following in production > > java -ser