Re: Integrating Solr with Database

2012-11-12 Thread Gora Mohanty
On 12 November 2012 13:17, 122jxgcn wrote: [...] >> If you still want to do what you had in mind, you should handle >> that as part of your indexing process, i.e., update both Solr and >> the database at the same time > > I have thought about that, but I could not figure out how to update database

Re: Integrating Solr with Database

2012-11-12 Thread Gora Mohanty
On 12 November 2012 13:54, 122jxgcn wrote: > I was thinking of using RESTful API instead of client application accessing > database directly. > Something like, client application opens url named http://(server > url)/updatedb?(id of document), > or something similar, then Java logic inside the Sol

Re: SolrZKClient changed interface

2012-11-12 Thread Trym R. Møller
Can anyone verify that the jira has been created sensible? Thanks in advance. https://issues.apache.org/jira/browse/SOLR-4066 Best regards Trym Den 10-11-2012 00:54, Mark Miller skrev: Please file a JIRA issue for this change. - Mark On Nov 9, 2012, at 8:41 AM, Trym R. Møller wrote: Hi T

RE: How Index word document in solr.

2012-11-12 Thread Markus Jelsma
hi - Check the Extracting Request Handler manual: http://wiki.apache.org/solr/ExtractingRequestHandler -Original message- > From:veena rani > Sent: Mon 12-Nov-2012 10:09 > To: solr-user@lucene.apache.org > Subject: How Index word document in solr. > > Hi, > > Please help me out in i

RE: DIH nested entities don't work

2012-11-12 Thread mroosendaal
Hi, I've created a jar with 5 files: 4 files with DIHCache*.java 1 file named BerkleyBackedCache.java I've changed the data-config based on your input. What i see it doing is that it is building a cache at the given location. However the first testrun took almost *3* hours before i got a message:

Unable to run two multicore Solr instances under Tomcat

2012-11-12 Thread Adam Neal
Hi, I have been running two multicore Solr instances under Tomcat using a nightly build of 4.0 from September 2011. This has been running fine but when I try to update these instances to the release version of 4.0 I'm hitting problems when the second instance starts up. If I have one instance o

Re: Handle Queries which return 1000s of records

2012-11-12 Thread Rafał Kuć
Hello! By pieces you mean by paging the results ? If yes, please look at http://wiki.apache.org/solr/CommonQueryParameters - start and rows parameters. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch > Hi, > I am integrating solr search on my w

RE: Skewed IDF in multi lingual index

2012-11-12 Thread Markus Jelsma
I'd like to add that multiplicative boosting on very scarce properties, e.g. you want to boost on a boolean value of which there are only very few, causes a problem in scoring when using docCount instead of maxDoc. If docCount is one IDF will be ~0.3, with the fieldWeight you'll end up with a sc

Re: Apache Nutch 1.5.1 + Apache Solr 4.0

2012-11-12 Thread Antony Steiner
Hello guys thank you for your input. I now took the schema from the trunk. This helped me: I had another version which has the version number 1.5.1. I changed EnglishPorterFilter to PorterStemFilterFactory. But I still keep failing at starting up solr. Following error: Nov 12, 2012 1:55:58 PM or

Re: Apache Nutch 1.5.1 + Apache Solr 4.0

2012-11-12 Thread Rafał Kuć
Hello! Add the following field to your schema.xml file to the fields sections: -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch > Hello guys > thank you for your input. I now took the schema from the trunk. This helped > me: I had another ver

Re: Error with SolrCloud

2012-11-12 Thread Carlos Alexandro Becker
Any ideas? On Mon, Nov 12, 2012 at 9:03 AM, Carlos Alexandro Becker wrote: > Good Morning Sirs, > > Following your suggestions, I ran: > > JAVA_OPTS="-DzkRun -DnumShards=2 -Dbootstrap_conf=true > -Dcollection.configName=myconf -Xmx2048m -XX:MaxPermSize=512m" > ./standalone.sh > > And tried to c

RE: Apache Nutch 1.5.1 + Apache Solr 4.0

2012-11-12 Thread Markus Jelsma
You're in SolrCloud mode, it needs that field. Just put it as explained in the error somewhere within your element. -Original message- > From:Antony Steiner > Sent: Mon 12-Nov-2012 14:25 > To: solr-user@lucene.apache.org > Subject: Re: Apache Nutch 1.5.1 + Apache Solr 4.0 > > Hello

Re: Error with SolrCloud

2012-11-12 Thread Tomás Fernández Löbbe
I'm not sure what could be the issue here, maybe there is a problem with finding the name of your machine? can you manually find ' http://carlos-OptiPlex-790:8080/solr' ? Maybe if you set the "host" parameter in solr.xml ( http://wiki.apache.org/solr/SolrCloud#SolrCloud_Instance_Params ) to 127.0.0

Re: Error with SolrCloud

2012-11-12 Thread Carlos Alexandro Becker
It's strange, my solrconfig.xml is inside the war file. Just in case, here is the actual content: https://gist.github.com/9f0ea455fd6a0d92a196 Thanks in advance On Mon, Nov 12, 2012 at 1:33 PM, Carlos Alexandro Becker wrote: > Hm, I still getting nullpointers accessing 127.0.0.1:8080/solr, so

RE: DIH nested entities don't work

2012-11-12 Thread Dyer, James
Here's what I'd do next: - double check you're only caching the child entity, not the parent. - Replace the "SELECT *" queries with a list of actual fields you want. - specify the persistCacheFieldNames and persistCacheFieldTypes parameters (see the doc-comment for DIHCachePersistProperties) - Tr

Re: Error with SolrCloud

2012-11-12 Thread Tomás Fernández Löbbe
In the last stacktrace that you sent, it looks like there is no configuration directory in zk called "myconf". Make sure you use a configuration that was already uploaded to zk. You can use zkCli.sh (in the zookeeper bin directory) to see what's inside of the "configs" zk node. If you started Solr

Re: Error with SolrCloud

2012-11-12 Thread Carlos Alexandro Becker
yeah, I changed to collection1, but I not understand why I have to pass a configName if it use collection1 by default? also, I tried with collection1 config, and got another error: 13:49:02,226 ERROR [org.apache.solr.core.SolrCore] (http--0.0.0.0-8080-2) org.apache.solr.common.SolrException: Error

Re: Error with SolrCloud

2012-11-12 Thread Tomás Fernández Löbbe
Once the configuration is uploaded to zk, Solr will stop using the directory you specify in the solr.xml to start using the configuration from zk (to make sure all the nodes of the collection have the same configuration). Maybe for some reason the solrconfig file was not uploaded properly? Could yo

Re: Error with SolrCloud

2012-11-12 Thread Carlos Alexandro Becker
Actually, that's really strange, I cant even see the dump button, and got this error: SolrCore Initialization Failures - *collection1:* org.apache.solr.common.cloud.ZooKeeperException:org.apache.solr.common.cloud.ZooKeeperException: Please check your logs for more information On Mon

Re: Apache Nutch 1.5.1 + Apache Solr 4.0

2012-11-12 Thread Antony Steiner
Thank you very much. Everything is working fine now. Best regards Antony

java.io.IOException: Map failed :: OutOfMemory

2012-11-12 Thread uwe72
While adding lucene document we got this problem: What can we do here? Nov 12, 2012 3:25:09 PM org.apache.solr.update.DirectUpdateHandler2 commit INFO: start commit(optimize=false,waitFlush=true,waitSearcher=true,expungeDeletes=false) Exception in thread "Lucene Merge Thread #0" org.apache.lucene

Admin Permissions

2012-11-12 Thread Michael Long
I really like the new admin in solr 4.0, but specifically I don't want developers to be able to unload, rename, swap, reload, optimize, or add core. Any ideas on how I could still give access to the rest of the admin without giving access to these? It is very helpful for them to have access t

SOLR - Search Issue

2012-11-12 Thread ravi.n
Hello Guys, We had a requirement to search existing lucene indexes and configure to indexing new data. 1. Installed apache-solr-3.6.1 and setup multicore for the existing indexes. 2. Solr is running on windows platform with tomcat 7 64 bit 3. Coll1 and Coll2 has old lucene index files and new ind

Solr 3.5 DIH Delta Import hanging

2012-11-12 Thread Arcadius Ahouansou
Hello. We have a Solr 3.5 master randomly hanging during delta import. It does not happen very often. When it does, the output of dataimport?command=status is shown below. After each page refresh, the only thing that changes is the "Time Elapsed". I have also done a thread dump (see below). It

RE: DIH nested entities don't work

2012-11-12 Thread mroosendaal
Thanks, i'll give that a try tomorrow. Here's the data-config.xml which i will try when i get to work tomorrow: :1521/ENDDEV" user="un" password="pw"/> the view

How to speed up Facet count (Big index) ??!!!!

2012-11-12 Thread Aeroox Aeroox
Hi folks, I have a solr index with up to 50M documents. A document contain 62 fields (docid, name, location). The facet count took 1 to 2 minutes with this params : http://.../select/?q=solr&; version=2.2&start=0&rows=0&facet=true&facet.limit=6&facet.mincount=1&mm=3<-1&facet.field=school

Re: sort by function error

2012-11-12 Thread Yonik Seeley
On Mon, Nov 12, 2012 at 5:24 AM, Kuai, Ben wrote: > more information, problem only happends when I have both sort by function > and grouping in query. I haven't been able to duplicate this with a few ad-hoc queries. Could you give your complete request (or at least all of the relevant grouping

Re: php client for Solr 4.0.0

2012-11-12 Thread Jorge Luis Betancourt Gonzalez
I'm currently using solarium with solr 3.6, perhaps you can tweak solarium as needed? I suppose that pull requests are welcome into solarium for solr 4. Greetings! On Nov 12, 2012, at 2:56 PM, Bill Au wrote: > Anyone know of a PHP client that is compatible with Solr 4.0.0? I am using > an old

Re: customize solr search/scoring for performance

2012-11-12 Thread jchen2000
The following was generated from jvisualvm. Seems like the perf is related to scoring a lot. Any idea/pointer on how to customize that part? -- View this message in context: http://lucene.472066.n3.nabble.com/customize-sol

RE: sort by function error

2012-11-12 Thread Kuai, Ben
Hi Yonik Thanks for the reply. My sample query, q="cafe"&sort=geodist(geoLocation,-32.123323,108.123323)+asc&group.field=familyId as long as I remove the group field the query working. BTW, I just find out that the version of solr we are using is an old copy of 4.0 snapshot before the alpha

Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
Hi, I'm migrating from Solr 1.2 to 3.6.1. I used the same analyzer as I was, and re-indexed my data. I did not add solr.ReversedWildcardFilterFactory to my index analyzer, but yet leading wild cards are working!! Does this mean it's turned on by default? If so, how do I turn it off, and

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread François Schiettecatte
John You can still use leading wildcards even if you dont have the ReversedWildcardFilterFactory in your analysis but it means you will be scanning the entire dictionary when the search is run which can be a performance issue. If you do use ReversedWildcardFilterFactory you wont have that perf

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
Thanks for the quick response. So, I do not want to use ReversedWildcardFilterFactory, but leading wildcard is working and thus is ON by default. How do I disable it to prevent the use of it and the issues that come with it? -- MJ -Original Message- From: François Schiettecat te

Re: How to speed up Facet count (Big index) ??!!!!

2012-11-12 Thread Otis Gospodnetic
Hi, Have you tried the other facet method or newer Solr? Otis -- Performance Monitoring - http://sematext.com/spm On Nov 12, 2012 2:40 PM, "Aeroox Aeroox" wrote: > Hi folks, > > I have a solr index with up to 50M documents. A document contain 62 fields > (docid, name, location). > > The fac

Re: How to speed up Facet count (Big index) ??!!!!

2012-11-12 Thread Yonik Seeley
On Mon, Nov 12, 2012 at 8:39 PM, Aeroox Aeroox wrote: > Hi folks, > > I have a solr index with up to 50M documents. A document contain 62 fields > (docid, name, location). > > The facet count took 1 to 2 minutes with this params : > > http://.../select/?q=solr&; > version=2.2&start=0&rows=

Example for Scheduling Solr Indexing - Hadoop

2012-11-12 Thread Britt
Background I have a file that gets dropped into a new directory every 10 minutes. Examples: /2012/11/05/HH/10/bigfile.txt /2012/11/05/HH/20/bigfile.txt /2012/11/05/HH/30/bigfile.txt /2012/11/05/HH/40/bigfile.txt I need to schedule a job to index these files every 10 minutes. Examples: /2012/11/05/

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread François Schiettecatte
I suspect it is just part of the wildcard handling, maybe someone can chime in here, you may need to catch this before it gets to SOLR. François On Nov 12, 2012, at 5:44 PM, johnmu...@aol.com wrote: > Thanks for the quick response. > > > So, I do not want to use ReversedWildcardFilterFactory,

RE: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
At one point, in some version of Solr, it was OFF by default, and you had to enable it via a setting (either in solrconfig.xml or schema.xml, I don't remember). It looks like this is no longer the case. Even worse, and if this is true, disabling it no longer seems to be possible to disable it

Re: 4.0 query question

2012-11-12 Thread dm_tim
That is terrific. Thanks. Regards, Tim -- View this message in context: http://lucene.472066.n3.nabble.com/4-0-query-question-tp4019397p4019890.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread Michael Ryan
Yeah, the situation is kind of a pain right now. In https://issues.apache.org/jira/browse/SOLR-2438, it was enabled by default and there is no way to disable without patching SolrQueryParser. There's also the edismax parser which doesn't have a setting for this, which I've made a jira for at ht

RE: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
I'm surprised that this has not been logged as adefect. The fact that this is ON bydefault, means someone can bring down a server; this is bad enough to categorizethis as a security issue. --MJ -Original Message- From: Michael Ryan [mailto:mr...@moreover.com] Sent: Monday, November

4.0 query question

2012-11-12 Thread dm_tim
I just discovered the idf scoring component and it's biting me on the butt. In the particular case I have all terms are equally relevant. Is there any way to turn off idf scoring? Regards, Tim -- View this message in context: http://lucene.472066.n3.nabble.com/4-0-query-question-tp4019898.htm

Re: 4.0 query question

2012-11-12 Thread Otis Gospodnetic
Hi, Yes, via a custom Similarity, which you then reference in solrconfig.xml, that always returns 1 from the idf method. Was mentioned on this list a few days back - see search-lucene.com. Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://

Re: Solr GC issues - Too many BooleanQuery & BooleanClause objects in heap

2012-11-12 Thread Otis Gospodnetic
Hi, I've never seen this. You don't have a custom query parser or anything else custom, do you? Have you tried dumping and analyzing heap? YourKit has a 7 day eval, or you can use things like jhat, which may be included on your machine already (see http://docs.oracle.com/javase/6/docs/technotes/

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread Yonik Seeley
On Tue, Nov 13, 2012 at 2:27 AM, wrote: > I'm surprised that this has not been logged as adefect. The fact that this > is ON bydefault, means someone can bring down a server; this is bad enough to > categorizethis as a security issue. It's all relative. There are tons of queries that can tak

Re: Example for Scheduling Solr Indexing - Hadoop

2012-11-12 Thread Otis Gospodnetic
Hi, This could be as simple as writing an app that periodically checks the appropriate directories, looks for any new files added since last checks, and then reads/parses them (presumably there are data for records/documents that need to be indexed in those files), constructs SolrInputDocuments, a

Re: SOLR - Search Issue

2012-11-12 Thread Otis Gospodnetic
Hi, Have you tried optimizing your indices? That should upgrade their format to whatever Solr/Lucene version you are using. You could also try IndexUpgrader - http://search-lucene.com/jd/lucene/core/org/apache/lucene/index/IndexUpgrader.html Otis -- Search Analytics - http://sematext.com/search

Re: customize solr search/scoring for performance

2012-11-12 Thread Robert Muir
Whenever I look at solr users' stacktraces for disjunctions, I always notice they get BooleanScorer2. Is there some reason for this or is it not intentional (e.g. maybe a in-order collector is always being used when its possible at least in simple cases to allow for out-of-order hits?) When I exa

Re: Solr 4.0 - distributed updates without zookeeper?

2012-11-12 Thread Otis Gospodnetic
Hi Peter, Not sure I have the answer for you, but are you looking to avoid using ZK for some reason? Or are you OK with ZK per se, but just don't want any leader re-election and any other dynamic/cloudy behaviour? Could you not simply treat 1 node as the "master" to which you send all your update

Re: sort by function error

2012-11-12 Thread Yonik Seeley
I can't reproduce this with the example data. Here's an example of what I tried: http://localhost:8983/solr/query?q=*:*&sort=geodist(store,-32.123323,108.123323)+asc&group.field=inStock&group=true Perhaps this is an issue that's since been fixed. -Yonik http://lucidworks.com On Mon, Nov 12, 2

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread Jack Krupansky
Be sure to realize that even with reverse wildcard support, the user can add a trailing wildcard as well (double-ended wildcard) and then you are back in the same boat. The overall idea is that: 1) Hardware is much faster than just 3 or 4 years ago, and 2) even though document counts are getti

Re: SOLR - Search Issue

2012-11-12 Thread ravi.n
Thanks. Can be this done by Solr or should we write Java class? Regards, Ravi -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-Search-Issue-tp4019816p4019939.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: solr4.0 problem zkHost with multiple hosts throws out of range exception

2012-11-12 Thread deniz
so do we need to add one of the servers from the -DzkHost string to -DzkRun? should it look like -DzkRun=host1:port -DzkHost=host:port, host1:port, host2:port in the start up command? and will wiki page be updated? because the example there is still letting into the error that was mention

Re: SOLR - Search Issue

2012-11-12 Thread Otis Gospodnetic
You can issue an optimize request to Solr, just peek at the Wiki. IndexUpgrader is a command-line tool. No coding needed (tm) Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Mon, Nov 12, 2012 at 11:59 PM,