Re: And results before Or results

2012-05-11 Thread Jack Krupansky
Pass the &debugQuery=true request option and then look at the "explain" section of the response to see how a three-term result score turned out to be less than a two-term result score. -- Jack Krupansky -Original Message- From: Jack Krupansky Sent: Friday, May 11, 2012 10:54 PM To: s
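A minimal sketch of the request described above, in Python. Only the debugQuery=true option comes from the message; the host, port, handler path and query terms below are placeholder assumptions.

```python
from urllib.parse import urlencode


def build_debug_url(base_url, query):
    """Return a select URL that asks Solr to include score explanations."""
    # debugQuery=true is the option named in the message; everything else
    # passed in by the caller is an illustrative placeholder.
    params = {"q": query, "debugQuery": "true"}
    return base_url + "?" + urlencode(params)


# Hypothetical local Solr instance and three-term query.
url = build_debug_url("http://localhost:8983/solr/select", "apple banana cherry")
print(url)
```

The "explain" section of the response to such a request is where the per-term score contributions can be compared.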

Re: Replication issues after machine failure

2012-05-11 Thread Mark Miller
So it's easy to reproduce? What do you mean restored from a prior state? What snapshot are you on these days for future ref? You have double checked to make sure that shard is listed as ACTIVE right? On May 11, 2012, at 4:55 PM, Jamie Johnson wrote: > I've had a few instances where a machine ha

Re: Invalid version (expected 2, but 60) on CentOS in production please Help!!!

2012-05-11 Thread Mark Miller
Yeah, 9 times out of 10, this error is a 404 - which wouldn't be logged anywhere. On May 11, 2012, at 6:12 PM, Ravi Solr wrote: > Guys, just to give you an update, we think we "might" have found the > issue. iptables was enabled on one query server and disabled on the > other. The server where i

Re: And results before Or results

2012-05-11 Thread Jack Krupansky
I vaguely recall seeing this situation myself a couple of years ago. I think it was because there were multiple occurrences of the pair of terms in a single document vs. a lesser number of occurrences of all three of the terms in a single document. -- Jack Krupansky -Original Message-

Re: Join Query syntax

2012-05-11 Thread Sohail Aboobaker
Is it available in Solr 3.5, or is there a way to do something similar in Solr 3.5?

Re: Invalid version (expected 2, but 60) on CentOS in production please Help!!!

2012-05-11 Thread Ravi Solr
Guys, just to give you an update, we think we "might" have found the issue. iptables was enabled on one query server and disabled on the other. The server where iptables is enabled is the one having issues, we disabled the iptables today to test out the theory that the iptables might be causing thi

searching when in a solr-component?

2012-05-11 Thread Paul Libbrecht
Hello SOLR experts, can I see the same index while responding to another query? If yes, how? Thanks in advance, Paul

Re: ConcurrentUpdateSolrServer and unable to override default http settings

2012-05-11 Thread Gopal Patwa
Is it possible to make this improvement? It would save a lot of time and code when using ConcurrentUpdateSolrServer, by allowing the default http settings to be overridden. On Sun, Apr 29, 2012 at 8:56 PM, Gopal Patwa wrote: > In Solr4j client trunk build for 4.0, ConcurrentUpdateSolrServer class > does

Re: How to change data subdirectory in Solr

2012-05-11 Thread Erik Hatcher
It isn't possible to point at just the index directory like this. Solr uses a "data dir" and requires the main index be in index/ under that. There are other things that can be put into the data directory besides just the main Lucene index, such as side car spell check indexes and thus there i

How to change data subdirectory in Solr

2012-05-11 Thread Vitor M. Barbosa
I'm trying to set up Solr to work with some existing Lucene indexes, which are under this folder structure: D:\indexes\core_name\ But Solr always tries to look for D:\indexes\core_name\index, even after changing the dataDir in solrconfig.xml and in solr.xml. I know I can create symlinks in t

Re: And results before Or results

2012-05-11 Thread Ahmet Arslan
> I want to have a strict enforcement > that in case of a 3 word search, those > results that match all 3 terms should be presented ahead of > those that match > 2 terms when I set mm=2. > > I have seen quite some cases where, those results that match > 2 out of 3 > words appear ahead of those matc

Re: Problems with Memory

2012-05-11 Thread Carlos Alberto Schneider
Good afternoon, It may be a problem in your app. If your crawler is a Java app, try to limit the amount of memory it uses (JVM options must come before -jar, otherwise they are passed to the application as arguments), e.g.: java -Xms64m -Xmx128m -XX:NewSize=64m -XX:MaxNewSize=64m -XX:PermSize=128m -XX:MaxPermSize=128m -jar my-app-with-dependencies.jar ; Look for these parameters in the sc

Problems with Memory

2012-05-11 Thread Thiago
I'm having problems with memory when I'm using Solr. I have an application that crawls the web for some documents. It does a lot of consecutive indexing. But after some days of crawling, I'm having problems with memory. My Java process is consuming a lot of memory and it doesn't seem OK. My compu

Is it possible to index pdfs and database into single document?

2012-05-11 Thread anarchos78
Hello again, I can index pdf using: *data-config.xml* I c

Re: Question about cache

2012-05-11 Thread Shawn Heisey
On 5/11/2012 9:30 AM, Anderson vasconcelos wrote: HI Kuli The free -m command gives me

             total       used       free     shared    buffers     cached
Mem:          9991       9934         57          0         75       5759
-/+ buffers/cache:       4099       5892
Swap: 81

RE: Indexing data from pdf

2012-05-11 Thread Dyer, James
The document you tried to index has an "id" but not a "fake_id". Because "fake_id" is your index uniqueKey, you have to include it in every document you index. Your most likely fix for this is to use a Transformer to generate a "fake_id". You might get away with changing this: to this:

RE: Indexing data from pdf

2012-05-11 Thread anarchos78
I have included the extras and I am getting the following: *From Solr:* 0 2 data-config.xml full-import idle 0 2 0 2012-05-11 20:21:50 Indexing completed. Added/Updated: 0 documents. Deleted 0 documents. 2012-05-11 20:21:51 01 0:0:1.284This response format is experimental. It is likely to

RE: Indexing data from pdf

2012-05-11 Thread Dyer, James
It looks like maybe you do not have "apache-solr-dataimporthandler-extras.jar" in your classpath. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: anarchos78 [mailto:rigasathanasio...@hotmail.com] Sent: Friday, May 11, 2012 11:00 AM To: solr-use

Re: Populating 'multivalue' fields (m:1 relationships)

2012-05-11 Thread Mike Sokolov
You can specify a solr field as "multi-valued", and then supply multiple values for it. What that really does is concatenate all the values with a positional gap between them to prevent phrases and other positional queries from traversing the boundary between the distinct values. -Mike On 05
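A toy model of the positional gap described above, in Python. This is not Lucene code: the tokenizer (lowercase, split on whitespace), the gap size of 100 and the sample values are assumptions chosen only to show why a phrase cannot match across two values of a multi-valued field.

```python
def index_positions(values, gap=100):
    """Assign token positions, leaving a gap between distinct values."""
    positions = {}  # term -> list of positions
    pos = 0
    for i, value in enumerate(values):
        if i > 0:
            pos += gap  # the positional gap between consecutive values
        for term in value.lower().split():
            positions.setdefault(term, []).append(pos)
            pos += 1
    return positions


def phrase_match(positions, first, second):
    """True if `second` occurs at the position right after `first`."""
    return any(p + 1 in positions.get(second, []) for p in positions.get(first, []))


# Hypothetical two-valued field.
with_gap = index_positions(["red apple", "green pear"])
no_gap = index_positions(["red apple", "green pear"], gap=0)

print(phrase_match(with_gap, "red", "apple"))    # phrase inside one value
print(phrase_match(with_gap, "apple", "green"))  # phrase across the boundary
print(phrase_match(no_gap, "apple", "green"))    # same phrase with no gap
```

With the gap in place, "apple green" is not adjacent in position space, so the phrase only matches when the gap is removed.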

Re: Indexing data from pdf

2012-05-11 Thread anarchos78
Now I am getting the following: *From Solr:* 0 1 data-config.xml full-import idle 0:0:4.231 0 1 0 0 2012-05-11 18:43:30 Indexing failed. Rolled back all changes. 2012-05-11 18:43:30This response format is experimental. It is likely to change in the future. *The log file:* org.apache.sol

Re: And results before Or results

2012-05-11 Thread Karthick Duraisamy Soundararaj
I want strict enforcement: in a 3-word search, results that match all 3 terms should be presented ahead of those that match 2 terms when I set mm=2. I have seen quite a few cases where results that match 2 out of 3 words appear ahead of those matching all 3 words.

Re: Question about cache

2012-05-11 Thread Anderson vasconcelos
HI Kuli The free -m command gives me

             total       used       free     shared    buffers     cached
Mem:          9991       9934         57          0         75       5759
-/+ buffers/cache:       4099       5892
Swap:         8189       3395       4793

You can see that has only 5

Solr 3.6 fails when using XSLT

2012-05-11 Thread pramila_tha...@ontla.ola.org
Hi Everyone, I have recently upgraded to Solr 3.6 from Solr 1.4. My XSL was working fine in Solr 1.4, but now with Solr 3.6 I keep getting the following error: getTransformer fails in getContentType java.lang.RuntimeException: getTransformer fails in getContentType But instead of results

Re: And results before Or results

2012-05-11 Thread Jack Krupansky
Strict enforcement? Of what? Your query rule seems rather loose, and compatible with simple OR of the terms. -- Jack Krupansky -Original Message- From: Karthick Duraisamy Soundararaj Sent: Friday, May 11, 2012 11:03 AM To: solr-user@lucene.apache.org Subject: Re: And results before Or

Re: And results before Or results

2012-05-11 Thread Karthick Duraisamy Soundararaj
Sure, but it doesn't seem to be doing a strict enforcement. On Fri, May 11, 2012 at 10:56 AM, Jack Krupansky wrote: > If you simply "OR" the terms (or specify no operator and make sure that > the default operator is "OR"), normal query scoring will rank results with > more terms matching higher. >

Re: how to use multiple query operators?

2012-05-11 Thread Jack Krupansky
Please clarify the question. You certainly can write queries as you have suggested (at least using the lucene/solr and edismax query parsers), so what is the problem or issue or concern that you have? The Dismax query parser doesn't support field specification in the query (only in the qf param

Re: {!term f)xy OR device:0 in fq has strange results

2012-05-11 Thread abhayd
reformatted the same. Hi, I am having some issues in using {!term} in fq with OR. Following query returns 6 results and it is working as expected: q=navigation&fq={!term f=model}Vivid(PH39100) And debug output is also as expected. Debug: "QParser":"LuceneQParser", "filter_queries":["{!term f=model

Re: Editing long Solr URLs - Chrome Extension

2012-05-11 Thread Jan Høydahl
I've been testing https://chrome.google.com/webstore/detail/mbnigpeabbgkmbcbhkkbnlidcobbapff?hl=en but I don't think it's great. Great work on this one. Simple and straight forward. A few wishes: * Sticky mode? This tool would make sense in a sidebar, to do rapid refinements * If you edit a valu

Re: And results before Or results

2012-05-11 Thread Jack Krupansky
If you simply "OR" the terms (or specify no operator and make sure that the default operator is "OR"), normal query scoring will rank results with more terms matching higher. -- Jack Krupansky -Original Message- From: Karthick Duraisamy Soundararaj Sent: Friday, May 11, 2012 10:44 AM

{!term f)xy OR device:0 in fq has strange results

2012-05-11 Thread abhayd
Hi, I am having some issues in using {!term} in fq with OR. Following query returns 6 results and it is working as expected: q=navigation&fq={!term f=model}Vivid(PH39100) And debug output is also as expected. Debug: "QParser":"LuceneQParser", "filter_queries":["{!term f=model}Vivid(PH39100)"], "par

Re: And results before Or results

2012-05-11 Thread Jack Krupansky
With the edismax query parser you can specify "phrase boosting" using the pf, pf2, and pf3 (and ps, ps2, ps3) request parameters, and you can set the boost factor for each. pf, pf2, and pf3 have the same format as qf. See: http://wiki.apache.org/solr/ExtendedDisMax You can also simulate that

Re: solr.WordDelimiterFilterFactory query time

2012-05-11 Thread abhayd
hi jack, It worked with dismax. I was using a our search partner provided wrapper around dismax and it seems like it has a bug. I switched to dismax and all is working fine now. Thanks for help -- View this message in context: http://lucene.472066.n3.nabble.com/solr-WordDelimiterFilterFactory-

how to use multiple query operators?

2012-05-11 Thread G.Long
Hi :) I can't find how to write a query like: field1:value1 AND (field2:value2 OR field2:value3). I read the documentation about local parameters, which allow defining the query operator, but it seems to apply to the entire query. Gary

RE: SOLR Security

2012-05-11 Thread Welty, Richard
in fact, there's a sample proxy.php on the ajax-solr web page which can easily be modified into a security layer. my solr servers only listen to requests issued by a narrow list of systems, and everything gets routed through a modified copy of the proxy.php file, which checks whether the user is

Re: Question about cache

2012-05-11 Thread Michael Kuhlmann
On 11.05.2012 15:48, Anderson vasconcelos wrote: Hi, analysing the Solr server in GlassFish with JConsole, the heap memory usage doesn't exceed 4 GB. But when the top command was executed, the free memory in the operating system was only 200 MB. The physical memory is only 10GB. Why machine us

How detect slave replication termination

2012-05-11 Thread Jamel ESSOUSSI
Hi, I have an indexer that indexes Solr documents. At the end of the indexing I will initiate replication by activating it on the master and on all slaves. My question is: how will I know when replication between the master and slave1 has finished, so that I can replicate with slave2? Best Rega

Re: Slow indexing in solr 3.6

2012-05-11 Thread not interesting
Are you using DIH and CachedSqlEntityProcessor? I have a similar issue; the 3.6.1 jars of DIH might help you, see: http://www.mail-archive.com/solr-user@lucene.apache.org/msg65912.html Kellen

Re: SOLR Security

2012-05-11 Thread Jan Høydahl
Hi, There is nothing stopping you from pointing Ajax-SOLR to a URL on your app-server, which acts as a security insulation layer between the Solr backend and the world. In this (thin) layer you can analyze the input and choose carefully what to let through and not. -- Jan Høydahl, search solut
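The thin insulation layer described above could look roughly like this parameter filter, sketched in Python. The whitelist, the row cap and the fallback value are all hypothetical choices, not something stated in the thread.

```python
ALLOWED_PARAMS = {"q", "start", "rows", "fq"}  # hypothetical whitelist
MAX_ROWS = 100                                 # hypothetical cap on page size
DEFAULT_ROWS = 10                              # hypothetical fallback


def sanitize_params(incoming):
    """Keep only whitelisted request parameters and bound the page size."""
    safe = {key: value for key, value in incoming.items() if key in ALLOWED_PARAMS}
    if "rows" in safe:
        try:
            rows = int(safe["rows"])
        except ValueError:
            rows = DEFAULT_ROWS
        safe["rows"] = str(max(0, min(rows, MAX_ROWS)))
    return safe


# A request that tries to reach another handler and pull a remote stream.
request = {"q": "solr", "rows": "100000", "qt": "/update", "stream.url": "http://x"}
print(sanitize_params(request))
```

Only the filtered parameters would then be forwarded to the Solr backend, which itself stays unreachable from the outside world.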

Re: Identify indexed terms of document

2012-05-11 Thread Anderson vasconcelos
Thanks 2012/5/11 Michael Kuhlmann > On 10.05.2012 22:27, Ahmet Arslan wrote: > >>> It's possible to see what terms are indexed for a field of >>> document that >>> stored=false? >> >> One way is to use >> http://wiki.apache.org/solr/LukeRequestHandler

Re: Indexing data from pdf

2012-05-11 Thread Ahmet Arslan
> org.apache.solr.common.SolrException log > SEVERE: Full Import failed:java.lang.RuntimeException: > java.lang.RuntimeException: > org.apache.solr.handler.dataimport.DataImportHandlerException: > java.lang.NoClassDefFoundError: > org/apache/tika/parser/AutoDetectParser Did you put all of the jar

Slow indexing in solr 3.6

2012-05-11 Thread mechravi25
Hi, I am migrating from Solr 1.4 to Solr 3.6. I have used the latest 3.6 jars. After indexing a small amount of data, I noticed that the indexing is taking a lot of time and the statistics are shown below 1737 1133174 0 2012-05-11 00:16:03 Indexing completed. Added/Updated: 434 doc

DataImportHandler - Custom EventListener

2012-05-11 Thread andre.schneider
Hi there, I want to register a custom EventListener to the DataImportHandler, but I get a NoClassDefFoundError. My configuration: Gentoo Linux. Solr home is /opt/solr. The solr.war file is deployed in an existing tomcat at /opt/tomcat/webapps The solr version is 3.6, tomcat version is 6.0.35, orac

Re: Indexing data from database

2012-05-11 Thread anarchos78
Thank you thank you thank you! -- View this message in context: http://lucene.472066.n3.nabble.com/Indexing-data-from-database-tp3979692p3979778.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Indexing data from database

2012-05-11 Thread Jack Krupansky
Maybe the id values overlap between your database tables. Solr needs unique values for the id, so if a document is indexed from a different database table but the same id value, Solr will replace the existing document with that id. You need to make sure that the id values are unique across all o
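A toy illustration of the overwrite described above, in Python. The dictionary stands in for an index keyed by the unique id; the table-name prefix scheme and the sample rows are assumptions, shown as one possible way to keep ids unique across tables.

```python
def make_unique_id(table, row_id):
    """Prefix the row id with its table name (hypothetical scheme)."""
    return f"{table}-{row_id}"


def index_rows(rows, id_builder):
    """Simulate indexing: a later document with the same id replaces the earlier one."""
    index = {}
    for table, row_id, title in rows:
        index[id_builder(table, row_id)] = {"table": table, "title": title}
    return index


# Invented rows: two tables that both contain a row with id 1.
rows = [("members", 1, "Alice"), ("new_members", 1, "Bob")]

colliding = index_rows(rows, lambda table, row_id: str(row_id))
unique = index_rows(rows, make_unique_id)

print(len(colliding), len(unique))
```

With bare row ids only the last document survives; with the prefixed ids both documents are kept.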

With solr.MappingCharFilterFactory, highlighting doesn't work with transformed characters

2012-05-11 Thread remus
Hi, In my schema.xml I have for my text field type: <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/> (See below for complete fieldType definition.) This correctly transforms all accented characters, umlauts, etc. to their "normal" form. The problem is this: When I search for any word with such a chara

Indexing data from database

2012-05-11 Thread anarchos78
Hello friends, I am trying to index data from database. I am doing that successfully. But I have a problem. I want to use one index for whole database. All the db tables have at least 3 columns with the same name (I want to be like this). For instance I have these tables: members, new_members, book

Re: Suddenly OOM

2012-05-11 Thread Jasper Floor
Our rambuffer is the default. The Xmx is 75% of the available memory on the machine which is 4GB. We've tried increasing it to 85% and even gave the machine 10GB of memory. So we more than doubled the memory. The amount of data wasn't double but where it used to be enough now it seems to never be

Re: slave index not cleaned

2012-05-11 Thread Jasper Floor
Hi, On Thu, May 10, 2012 at 5:59 PM, Otis Gospodnetic wrote: > Hi Jasper, Sorry, I should've added more technical info without being prompted. > Solr does handle that for you.  Some more stuff to share: > > * Solr version? 1.4 > * JVM version? 1.7 update 2 > * OS? Debian (2.6.32-5-xen-amd64)

Merging two DocSets in solr

2012-05-11 Thread Ramprakash Ramamoorthy
Dear all, I get two different DocSets from two different searchers. I need to merge them into one and get the facet counts from the merged docSets. How do I do it? Any pointers would be appreciated. -- With Thanks and Regards, Ramprakash Ramamoorthy, Project Trainee, Zoho Corporation. +91

Lucene FieldCache doesn't get cleaned up and OOM occurs

2012-05-11 Thread Mathias Hodler
Hi, sorting on a field increases the Lucene FieldCache. If I'm starting 10 queries and each query sorting on a different field, 9 queries could be executed but then the Lucene FieldCache exceeds max memory and OOM occurs. In my opinion Lucene Field Cache should be cleaned up if there is not enough

Re: Fwd: Delete documents

2012-05-11 Thread Tolga
That worked, thanks a lot Jack :) On 5/11/12 7:44 AM, Jack Krupansky wrote: Try using the actual id of the document rather than the shell substitution variable - if you're trying to delete one document. To delete all documents, use delete by query: *:* See: http://wiki.apache.org/solr/FAQ#Ho
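A short Python sketch of the two delete messages mentioned above, built as XML update bodies. The document id is a made-up example; sending the body to the update handler and issuing a commit afterwards are left out of this sketch.

```python
import xml.etree.ElementTree as ET


def delete_by_id(doc_id):
    """XML body that deletes a single document by its actual id."""
    root = ET.Element("delete")
    ET.SubElement(root, "id").text = doc_id
    return ET.tostring(root, encoding="unicode")


def delete_by_query(query):
    """XML body that deletes every document matching a query."""
    root = ET.Element("delete")
    ET.SubElement(root, "query").text = query
    return ET.tostring(root, encoding="unicode")


print(delete_by_id("doc-42"))   # "doc-42" is a hypothetical id
print(delete_by_query("*:*"))   # *:* matches all documents
```

Using a literal id, rather than an unexpanded shell variable, is the point made in the quoted reply.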

Re: Identify indexed terms of document

2012-05-11 Thread Michael Kuhlmann
On 10.05.2012 22:27, Ahmet Arslan wrote: It's possible to see what terms are indexed for a field of document that stored=false? One way is to use http://wiki.apache.org/solr/LukeRequestHandler Another approach is this: - Query for exactly this document, e.g. by using the unique field -