Re: Custom update handler? Some progress, new issue

2013-03-11 Thread Jack Park
Further progress now hampered by configuring an update log. When I follow instructions found around the web, I get this: SEVERE: Unable to create core: collection1 caused by Caused by: java.lang.NullPointerException at org.apache.solr.common.params.SolrParams.toSolrParams(SolrParams.java:

Re: question about syntax for multiple terms in filter query

2013-03-11 Thread geeky2
otis and jack - thank you VERY much for the feedback - jack - >> use a single fq containing two mandatory clauses if those clauses appear together often << this is the use case i have to account for - eg, right now i have this in my request handler ... itemType:1 ...

Re: Custom update handler?

2013-03-11 Thread Jack Park
Many thanks. Let me record here what I have tried. I have viewed: http://wiki.apache.org/solr/UpdateXmlMessages and this github project which is suggestive: https://github.com/industria/solrprocessors I now have two UpdateRequestChains: hello and the new

Solr _docid_ parameter

2013-03-11 Thread mark12345
In Solr, I noticed that I can sort by the internal Lucene _docid_. -> http://wiki.apache.org/solr/CommonQueryParameters > You can sort by index id using sort=_docid_ asc or sort=_docid_ desc * I have also read the docid is represented by a
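As a minimal sketch of the `sort=_docid_ asc` parameter quoted from the wiki above, the query string can be built like this. The match-all query `*:*` is an assumption for illustration, not something taken from the message.

```python
from urllib.parse import urlencode

# "*:*" (match all documents) is an assumed query for illustration.
# The sort value is the one quoted from the CommonQueryParameters wiki page.
docid_params = [("q", "*:*"), ("sort", "_docid_ asc")]

# urlencode escapes the reserved characters and turns the space into "+".
docid_query_string = urlencode(docid_params)
print(docid_query_string)
```

The resulting string would be appended to a core's select URL, whose host and path depend on the installation.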

SolrException: Error opening new searcher

2013-03-11 Thread mark12345
I am running into issues where my Solr instance is behaving weirdly. After I get the SolrException "Error opening new searcher", my Solr instance fails to handle even the simplest of update requests. > http://lucene.472066.n3.nabble.com/exceeded-limit-of-maxWarmingSearchers-td494732.html I have

Re: Some nodes have all the load

2013-03-11 Thread Mark Miller
On Mar 11, 2013, at 7:47 PM, Shawn Heisey wrote: >> > > I've just located a previous message on this list from Mark Miller saying that > in Solr 4, commitWithin is a soft commit. Yes, that's true. > > You should definitely wait for Mark or another committer to verify what I'm > saying in t

Re: Some nodes have all the load

2013-03-11 Thread Mark Miller
On Mar 11, 2013, at 5:52 PM, jimtronic wrote: > Should I omit commitWithin and set DIH to commit=false and just let soft and > autocommit do their jobs? Yeah, that's one valid option. You def are not able to keep up with the current commit / open searcher level. It looks like DIH will do a har

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread Shawn Heisey
On 3/11/2013 5:59 PM, feroz_kh wrote: One more question related to backward compatibility. Previously we had upgraded our solr master/slaves from 1.4 version to 3.5 version - We didn't reformat the whole index then. So i believe there will be some files with 1.4 format present in our index. Now w

RE: DataDirectory: relative path doesn't work

2013-03-11 Thread Patrick Mi
Thanks for fixing the wiki page http://wiki.apache.org/solr/SolrConfigXml now it says this: 'If this directory is not absolute, then it is relative to the directory you're in when you start SOLR.' It will be nice if you drop me a line here after you make the change on the document ... -Origin

Re: Dynamic schema design: feedback requested

2013-03-11 Thread Yonik Seeley
On Mon, Mar 11, 2013 at 5:51 PM, Chris Hostetter wrote: > : I guess my main point is, we shouldn't decide a priori that using the > : API means you can no longer hand edit. > > and my point is we should build a feature where solr has the ability to > read/write some piece of information, we should

[ANNOUNCE] Apache Solr 4.2 released

2013-03-11 Thread Robert Muir
March 2013, Apache Solr™ 4.2 available The Lucene PMC is pleased to announce the release of Apache Solr 4.2 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, d

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread feroz_kh
Thanks Tomas/Shawn! One more question related to backward compatibility. Previously we had upgraded our solr master/slaves from 1.4 version to 3.5 version - We didn't reformat the whole index then. So i believe there will be some files with 1.4 format present in our index. Now when we upgrade from

Re: Some nodes have all the load

2013-03-11 Thread Shawn Heisey
On 3/11/2013 3:52 PM, jimtronic wrote: The load test was fairly heavy (ie lots of users) and designed to mimic a fully operational system with lots of users doing normal things. There were two things I gleaned from the logs: PERFORMANCE WARNING: Overlapping onDeckSearchers=2 appeared for severa

Re: How to Integrate Solr With Hbase

2013-03-11 Thread Bharat Mallampati
We do have same kind of scenario in our application also. The way we are achieving it is we have a batch process to read the data from Hbase using Hbase API and write it to SOLR using SOLRJ API. Thanks Bharat On Mon, Mar 11, 2013 at 5:38 AM, kamaci wrote: > I have crawled data into Hbase w

Re: [Beginner] wants to contribute in open source project

2013-03-11 Thread Chris Hostetter
: This article I wrote about getting started contributing to projects may give you some ideas. : : http://blog.smartbear.com/software-quality/bid/167051/14-Ways-to-Contribute-to-Open-Source-without-Being-a-Programming-Genius-or-a-Rock-Star Or perhaps even the followup i did of Andy's article la

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread Shawn Heisey
On 3/11/2013 3:43 PM, feroz_kh wrote: Thanks Tomas! I see the latest available version is 4.1 - but you have suggested a 4.2 version, where can i grab 4.2 version from? It is already accessible from many mirrors. Because it is not yet accessible from a large enough percentage of mirrors, the

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread Shawn Heisey
On 3/11/2013 3:39 PM, feroz_kh wrote: Thanks Shawn. So if we have new segments in 4.1 format and all old files in 3.5 format at the same time, then will it cause any performance degradation on slaves while reading index files ( which will contain both 3.5 formatted and 4.1 formatted files)? The

Re: Some nodes have all the load

2013-03-11 Thread jimtronic
The load test was fairly heavy (ie lots of users) and designed to mimic a fully operational system with lots of users doing normal things. There were two things I gleaned from the logs: PERFORMANCE WARNING: Overlapping onDeckSearchers=2 appeared for several of my more active cores and The non-l

Re: Dynamic schema design: feedback requested

2013-03-11 Thread Chris Hostetter
: we needed to, we could just assert that the schema file is the : persistence mechanism, as opposed to the system of record, hence if : you hand edit it and then use the API to change it, your hand edit may : be lost. Or we may decide to do away with "local FS" mode altogether. presuming that i

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread feroz_kh
Thanks Tomas! I see the latest available version is 4.1 - but you have suggested a 4.2 version, where can i grab 4.2 version from?

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread feroz_kh
Thanks Shawn. So if we have new segments in 4.1 format and all old files in 3.5 format at the same time, then will it cause any performance degradation on slaves while reading index files ( which will contain both 3.5 formatted and 4.1 formatted files)?

Re: How to set Configuration setting for search

2013-03-11 Thread Otis Gospodnetic
Hello Deepshikha, No need for regular expressions. Once you index some data, try using keywords... like Google. :) Otis -- Solr & ElasticSearch Support http://sematext.com/ On Mon, Mar 11, 2013 at 6:05 AM, Deepshikha Raghav wrote: > Hi Team , > > In Solr how to set"FREE TEXT SEARCH" confi

Re: question about syntax for multiple terms in filter query

2013-03-11 Thread Jack Krupansky
Point number 3 from that wiki says it all: 3.The document sets from each filter query are cached independently. Thus, concerning the previous examples: use a single fq containing two mandatory clauses if those clauses appear together often, and use two separate fq params if they are relatively
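The two forms discussed in this thread can be sketched as query strings. This is only an illustration: the helper name `select_query` and the match-all query `*:*` are assumptions, while the `term1:foo` and `term2:bar` clauses come from the original question.

```python
from urllib.parse import urlencode

def select_query(q, filters):
    """Build a query string with one fq parameter per filter (hypothetical helper)."""
    params = [("q", q)] + [("fq", f) for f in filters]
    return urlencode(params)

# One fq holding two mandatory clauses: a single filter, cached as one entry.
combined = select_query("*:*", ["+term1:foo +term2:bar"])

# Two fq parameters: each document set is cached independently.
separate = select_query("*:*", ["term1:foo", "term2:bar"])

print(combined)
print(separate)
```

Following the wiki advice quoted above, the combined form suits clauses that usually appear together, and the separate form suits clauses that are reused independently.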

Re: question about syntax for multiple terms in filter query

2013-03-11 Thread Otis Gospodnetic
Hello Mark, I think fq=+term1:foo +term2:bar doesn't actually result in 2 filters being created/used, while &fq=term1:foo&fq=term2:bar does Otis -- Solr & ElasticSearch Support http://sematext.com/ On Mon, Mar 11, 2013 at 4:41 PM, geeky2 wrote: > hello everyone, > > i have a question on t

PostingsHighlighter and analysis

2013-03-11 Thread Trey Hyde
debug=timing has told me for a very long time that 99% of my query time for slow queries is in the highlighting component so I've been eagerly awaiting the postingshighlighter for quite some time. Mean query times 50ms or less, with certain queries able to generate > 30s worth of highlighting.

question about syntax for multiple terms in filter query

2013-03-11 Thread geeky2
hello everyone, i have a question on the filter query syntax for multiple terms, after reading this: http://wiki.apache.org/solr/CommonQueryParameters#fq i see from the above that two (2) syntax constructs are supported fq=term1:foo & fq=term2:bar and fq=+term1:foo +term2:bar is there a reas

RE: Need help with delta import

2013-03-11 Thread Xavier Pell
This is absolutely a syntax error. I had the same problem, and using dih.delta.id solved all my problems. Thanks to God and the special person who posted the answer on this page. You have to revise the syntax of your delta import queries and watch the catalina (I use Tomcat) log file for any error

Re: Solr replication takes long time

2013-03-11 Thread Victor Ruiz
Thanks for your answer Mark. I think I'll try to update to 4.2. I'll keep you updated. Anyway, I'd not say that the full index is replicated, I've been monitoring the replication process in the Solr admin console and there I see that usually not more than 50-100 files are transferred, the total s

Re: Dynamic schema design: feedback requested

2013-03-11 Thread Yonik Seeley
On Mon, Mar 11, 2013 at 2:50 PM, Chris Hostetter wrote: > > : > 2) If you wish to use the /schema REST API for read and write operations, > : > then schema information will be persisted under the covers in a data store > : > whose format is an implementation detail just like the index file format.

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread Tomás Fernández Löbbe
Hi Feroz, due to Lucene's backward compatibility policy ( http://wiki.apache.org/lucene-java/BackwardsCompatibility ), a Solr 4.1 instance should be able to read an index generated by a Solr 3.5 instance. This would not be true if you need to change the schema. Also, be careful because Solr 4.1 cou

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread Shawn Heisey
On 3/11/2013 11:56 AM, feroz_kh wrote: We are planning to upgrade our solr servers from version 3.5 to 4.1. We have master slave configuration and the index size is quite big (i.e. around 14 GB ). 1. Do we really need to re-format the whole index , when we upgrade to 4.1 ? 2. What will be the con

Re: Dynamic schema design: feedback requested

2013-03-11 Thread Chris Hostetter
To revisit sarowe's comment about how/when to decide if we are using the "config file" version of schema info (and the API is read only) vs "internal managed state data" version of schema info (and the API is read/write)... On Wed, 6 Mar 2013, Steve Rowe wrote: : Two possible approaches: : :

Re: Dynamic schema design: feedback requested

2013-03-11 Thread Chris Hostetter
: > 2) If you wish to use the /schema REST API for read and write operations, : > then schema information will be persisted under the covers in a data store : > whose format is an implementation detail just like the index file format. : : This really needs to be driven by costs and benefits... :

Re: Dynamic schema design: feedback requested

2013-03-11 Thread Yonik Seeley
On Wed, Mar 6, 2013 at 7:50 PM, Chris Hostetter wrote: > 2) If you wish to use the /schema REST API for read and write operations, > then schema information will be persisted under the covers in a data store > whose format is an implementation detail just like the index file format. This really n

Re: Solr replication takes long time

2013-03-11 Thread Mark Miller
Okay - yes, 4.0 is a better choice for replication than 4.1. It almost sounds like you may be replicating the full index rather than just changes or something. 4.0 had a couple issues as well - a couple things that were discovered while writing stronger tests for 4.2. 4.2 is spreading onto mirr

Re: Solr replication takes long time

2013-03-11 Thread Victor Ruiz
no, Solr 4.0.0, I wanted to update to Solr 4.1 but I read that there was an issue with the replication, so I decided not to try it for now Mark Miller-3 wrote > Are you using Solr 4.1? > > - Mark > > On Mar 11, 2013, at 1:53 PM, Victor Ruiz < > bik1979@ > > wrote: > >> Hi guys, >> >> I have

Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread feroz_kh
Hello, We are planning to upgrade our solr servers from version 3.5 to 4.1. We have master slave configuration and the index size is quite big (i.e. around 14 GB ). 1. Do we really need to re-format the whole index , when we upgrade to 4.1 ? 2. What will be the consequences - if we do not re-forma

Re: Solr replication takes long time

2013-03-11 Thread Mark Miller
Are you using Solr 4.1? - Mark On Mar 11, 2013, at 1:53 PM, Victor Ruiz wrote: > Hi guys, > > I have a problem with Solr replication. I have 2 solr servers (Solr 4.0.0) 1 > master and 1 slave (8 processors,16GB RAM ,Ubuntu 11, ext3, each). In > every server, there are 2 independent instances

Solr replication takes long time

2013-03-11 Thread Victor Ruiz
Hi guys, I have a problem with Solr replication. I have 2 solr servers (Solr 4.0.0) 1 master and 1 slave (8 processors,16GB RAM ,Ubuntu 11, ext3, each). In every server, there are 2 independent instances of solr running (I tried also multicore config, but having independent instances has for me

Re: SolrCloud index timeout

2013-03-11 Thread yriveiro
Hi, The version is 4.1. I'm not mixing deletes and adds; there are only adds. I have 4 nodes on 2 physical machines, 2 instances of Tomcat on each machine. In this case the leader is located on a different physical machine than the replica. The collection has all shards in different nodes, I have

Re: Memory Guidance

2013-03-11 Thread Shawn Heisey
On 3/11/2013 11:14 AM, Shawn Heisey wrote: On 3/10/2013 8:00 PM, jimtronic wrote: I'm having trouble finding some problems while load testing my setup. If you saw these numbers on your dashboard, would they worry you? Physical Memory 97.6% 14.64 GB of 15.01 GB File Descriptor Count 19.1% 19

Re: Memory Guidance

2013-03-11 Thread Shawn Heisey
On 3/10/2013 8:00 PM, jimtronic wrote: I'm having trouble finding some problems while load testing my setup. If you saw these numbers on your dashboard, would they worry you? Physical Memory 97.6% 14.64 GB of 15.01 GB File Descriptor Count 19.1% 196 of 1024 JVM-Memory 95% 1.67 GB (dark gra

Re: [Beginner] wants to contribute in open source project

2013-03-11 Thread Tomás Fernández Löbbe
You can also take a look at http://wiki.apache.org/solr/HowToContribute Tomás On Mon, Mar 11, 2013 at 9:20 AM, Andy Lester wrote: > > On Mar 11, 2013, at 11:14 AM, chandresh pancholi < > chandreshpancholi...@gmail.com> wrote: > > > I am beginner in this field. It would be great if you help me

Re: [Beginner] wants to contribute in open source project

2013-03-11 Thread Andy Lester
On Mar 11, 2013, at 11:14 AM, chandresh pancholi wrote: > I am beginner in this field. It would be great if you help me out. I love > to code in java. > can you guys share some link so that i can start contributing in > solr/lucene project. This article I wrote about getting started contribut

Re: Some nodes have all the load

2013-03-11 Thread Mark Miller
There is an open JIRA issue about trying to "spread the leader load" during elections. Was waiting to get reports that it was really a problem for someone though. How much load were you putting on? How long were the nodes unresponsive? Unresponsive to everything? Just updates? Searches? What ve

Re: writing doc to another collection from UpdateRequestProcessor

2013-03-11 Thread Mark Miller
Sure, seems reasonable. - Mark On Mar 11, 2013, at 11:52 AM, mike st. john wrote: > Whats the best approach in writing the current doc inside an > UpdateRequestProcessor to another collection ? > > > Would i just call up CloudSolrServer and process it as i normally would in > solrj? > > >

Re: SolrCloud index timeout

2013-03-11 Thread Mark Miller
What Solr version? Are you mixing deletes and adds? Do you have more than one shard for a collection per machine? ie are you oversharding? Can you post the stack traces (using jstack, or jconsole, or visualvm, or…)? - Mark On Mar 11, 2013, at 11:39 AM, yriveiro wrote: > Hi, > > I have th

writing doc to another collection from UpdateRequestProcessor

2013-03-11 Thread mike st. john
What's the best approach to writing the current doc inside an UpdateRequestProcessor to another collection? Would I just call up CloudSolrServer and process it as I normally would in solrj? Thanks msj

Some nodes have all the load

2013-03-11 Thread jimtronic
I was doing some rolling updates of my cluster ( 12 cores, 4 servers ) and I ended up in a situation where one node was elected leader by all the cores. This seemed very taxing to that one node. It was also still trying to serve query requests so it slowed everything down. I'm trying to do a lot of

SolrCloud index timeout

2013-03-11 Thread yriveiro
Hi, I have the next issue: I have a collection with a leader and a replica, both are synchronized. When I try to index data to this collection I have a timeout error (the output is python): (, Timeout(TimeoutError("HTTPConnectionPool(host='192.168.20.50', port=8983): Request timed out. (timeout

Re: Boost maximum match in a field

2013-03-11 Thread Jack Krupansky
The length normalization "factor" is a very coarse value, so it may not be fine-grained enough to distinguish these particular field lengths. Normally, it is a short vs. long distinction rather than actual length. In any case, add &debugQuery=true to your query and look at the "explain" sectio
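A minimal sketch of adding `debugQuery=true` to a request, as suggested above. The query `title:solr` is a placeholder assumption and does not come from the thread.

```python
from urllib.parse import urlencode

# "title:solr" is a placeholder query, not taken from the thread.
# debugQuery=true asks Solr to include the "explain" section in the response.
debug_params = [("q", "title:solr"), ("debugQuery", "true")]
debug_query_string = urlencode(debug_params)
print(debug_query_string)
```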

Re: Boost maximum match in a field

2013-03-11 Thread Timothy Potter
I'm curious if the default ranking doesn't already return these in 3,2,1 order. Doc 3 should get an implicit boost with norms enabled for your title field, so make sure the title field has omitNorms="false", ie. in schema.xml: Tim On Mon, Mar 11, 2013 at 8:02 AM, Nicholas Ding wrote: > Hello,

RE: Boosting based on filter query

2013-03-11 Thread Van Tassell, Kristian
Thank you! -Original Message- From: Erik Hatcher [mailto:erik.hatc...@gmail.com] Sent: Monday, March 11, 2013 8:50 AM To: solr-user@lucene.apache.org Subject: Re: Boosting based on filter query Definitely can do this, but how depends on the query parser you're using. With dismax/edism

Re: "abc.def@gmail*" not retrieved but without double quotes retrieved

2013-03-11 Thread Jack Krupansky
The simple rule is that a wildcard suppresses any analysis steps that are not "multi-term aware". Unfortunately, the word delimiter filter is not "multi-term aware" (the lower case filter is). So, the query tries to find "abc.def@gmail" as a single (wildcard) term, which it won't find, since the

Re: Boosting based on filter query

2013-03-11 Thread Erik Hatcher
Definitely can do this, but how depends on the query parser you're using. With dismax/edismax you can use bq=filetype:pdf^10 (where "filetype:pdf" is a valid Lucene query parser expression for your documents). Erik On Mar 11, 2013, at 09:31 , Van Tassell, Kristian wrote: > I want to
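The boost query suggested above can be sketched as request parameters. The query `foobar` comes from the original question and `bq=filetype:pdf^10` from this reply; choosing `defType=edismax` is an assumption about which query parser is in use.

```python
from urllib.parse import urlencode

# defType=edismax is an assumption; bq also applies to the dismax parser.
boost_params = {
    "defType": "edismax",
    "q": "foobar",              # search term from the original question
    "bq": "filetype:pdf^10",    # boost documents whose filetype is pdf
}
boost_query_string = urlencode(boost_params)
print(boost_query_string)
```

Unlike a filter query, `bq` does not exclude non-PDF documents; it only raises the score of the matching ones.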

Boosting based on filter query

2013-03-11 Thread Van Tassell, Kristian
I want to be able to boost results where the filetype is a pdf: Here is some pseudo code so I don't misrepresent/misinterpret via a URL: search("foobar") foreach result (where filetype==pdf) { boost^10 } Is there a way to do this? Thanks in advance!

Re: AW: Highlighting problems

2013-03-11 Thread Dirk Wintergruen
Hi Andre, thanks this did the job. I also had to enable edismax and set the default parameter there - otherwise no highlighting at all. Best Dirk Am 11.03.2013 um 13:59 schrieb André Widhani : > Hi Dirk, > > please check > http://wiki.apache.org/solr/HighlightingParameters#hl.requireFieldMatc

AW: Highlighting problems

2013-03-11 Thread André Widhani
Hi Dirk, please check http://wiki.apache.org/solr/HighlightingParameters#hl.requireFieldMatch - this may help you. Regards, André Von: Dirk Wintergruen [dwin...@mpiwg-berlin.mpg.de] Gesendet: Montag, 11. März 2013 13:56 An: solr-user@lucene.apache.org B

How to Integrate Solr With Hbase

2013-03-11 Thread kamaci
I have crawled data into Hbase with my Nutch. How can I use Solr to index the data at Hbase? (Is there any solution from Nutch side, you are welcome) PS: I am new to such kind of technologies and I run Solr from under example folder as startup.jar

Highlighting problems

2013-03-11 Thread Dirk Wintergruen
Hi all, I have problems with the highlighting mechanism: The query is: http://127.0.0.1:8983/solr/mpiwgweb/select?facet=true&facet.field=description&facet.field=lang&facet.field=main_content&start=0&q=meier+AND+%28description:member+OR+description:project%29 after that: In the field "main_cont

How to set Configuration setting for search

2013-03-11 Thread Deepshikha Raghav
Hi Team, In Solr, how do I set the "FREE TEXT SEARCH" configuration? Is there any regular expression setting that I can configure to obtain search results? With Warm Regards Deepshikha Raghav IBM, Gurgaon

Re: SolrCloud: port out of range:-1

2013-03-11 Thread roySolr
In the end I want 3 servers, this was only a test. I know that a majority of servers is needed to provide service. I read some tutorials about zookeeper and looked at the wiki. I installed zookeeper separately on the server and connected them with each other (zoo.cfg). In the log i see the zookeeper know

Re: Zookeeper and DataImportHandler properties

2013-03-11 Thread adfel70
I use zookeeper eclipse plugin: http://www.massedynamic.org/mediawiki/index.php?title=Eclipse_Plug-in_for_ZooKeeper

Zookeeper and DataImportHandler properties

2013-03-11 Thread Nathan Findley
I realize this is not a zookeeper specific mailing list, but I am wondering if anybody has a simple process for updating zookeeper files other than restarting a solr instance? Specifically the data-import.properties value, which doesn't appear to be written to disk, but, rather, only exists in

"abc.def@gmail*" not retrieved but without double quotes retrieved

2013-03-11 Thread adfel70
I have the following field type: the following field: I add the value "abc@gmail.com" to this email field. When I search : 1. "abc.def@gmail*" - I get the result. 2. abc.def@gmail* (without double quotes) - I don't get the result. Am I m

Re: Custom update handler?

2013-03-11 Thread Upayavira
You need to refer to your chain in a RequestHandler config. Search for /update, duplicate that, and change the chain it points to. Upayavira On Mon, Mar 11, 2013, at 05:22 AM, Jack Park wrote: > With 4.1, not in cloud configuration, I have a custom response handler > chain which injects an additi