Authentication Issue in Shards Query

2012-06-20 Thread tosenthu
Hi, I have a Solr server with 5 cores. I have modified the web.xml of solr.war to enable basic authentication for all the web resources. I have also written my own login module to perform the login check. Now when I query a single core it asks for the user name and password, with prope
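
For readers following this thread, here is a minimal sketch of what a client request with HTTP Basic credentials to one core could look like. The host, port, core name, and credentials are hypothetical, and nothing here says how Solr itself passes credentials on to shards; the snippet only builds the request and does not send it.

```python
import base64
import urllib.parse
import urllib.request


def build_authenticated_query(core_url, user, password, query="*:*"):
    """Build (but do not send) a select request carrying HTTP Basic credentials."""
    # Basic auth is "user:password", base64-encoded, in the Authorization header.
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    url = f"{core_url}/select?{urllib.parse.urlencode({'q': query})}"
    request = urllib.request.Request(url)
    request.add_header("Authorization", f"Basic {token}")
    return request


# Hypothetical core URL and credentials, for illustration only.
example_request = build_authenticated_query(
    "http://localhost:8983/solr/core0", "solr", "secret"
)
```

Passing the result to urllib.request.urlopen would send it; a 401 response would mean the server rejected the credentials.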

Re: Indexation Speed?

2012-06-20 Thread Bruno Mannina
OK, thanks for this information. On 20/06/2012 05:44, Lance Norskog wrote: M. Della Bitta is right - we're not talking about post.jar, but starting Solr: java -Xmx300m -jar start.jar On Tue, Jun 19, 2012 at 10:05 AM, Erick Erickson wrote: Well, it _used_ to be defaulted in the code, but on

Solr with Tomcat on VPS

2012-06-20 Thread Hill Michael (NHQ-AC)
I am running Solr in a shared Tomcat v5.5.28 (I have access to all instances) on a Linux VPS server. When I set it all up, Tomcat starts properly and I can see that it has accessed my Solr config directory properly. I can access the JSP pages if I reference them directly (http://mysite.com/solr/

Solr Autosuggest

2012-06-20 Thread Shri Kanish
Hi, I have a question regarding Solr Autosuggest. (If this is not the correct place to post, please suggest one.) I have implemented Solr Autosuggest with the Suggester component. I have read in a blog, "Currently implemented Lookups keep their data in memory, so unlike spellchecker data, this d

Re: parameters to decide solr memory consumption

2012-06-20 Thread Erick Erickson
This is really difficult to answer because there are so many variables; the number of unique terms, whether you store fields or not (which is really unrelated to memory consumption during searching), etc, etc, etc. So even trying the index and just looking at the index directory won't tell you much

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread Erick Erickson
Internal Lucene document IDs are signed 32 bit numbers, so having 2.5B docs seems to be just _asking_ for trouble. Which could explain the fact that this just came out of thin air. If you kept adding docs to the problem instance, you wouldn't have changed configs etc, just added more docs I re

Re: 3 Way Solr Join . . ?

2012-06-20 Thread Sabeer Hussain
I have a similar situation in my application. I have five different entities. The relationships among the entities are as follows: Protocol --> (zero or more) Study --> (zero or more) Patient Protocol --> (zero or more) Drug Patient --> (zero or more) Study Form --> (zero or many) Study Moreover, all

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread avenka
Erick, thanks for pointing that out. I was going to say in my original post that it is almost like some limit on max documents got violated all of a sudden, but the rest of the symptoms didn't seem to quite match. But now that I think about it, the problem probably happened at 2B (corresponding exa

Re: Schema / Config Error?

2012-06-20 Thread Jan Høydahl
As I understand, James is not upgrading, but trying to start a fresh downloaded 3.6.0. James, can you provide some more details, especially, which AppServer are you using, how did you start Solr... Can you copy/paste the error msg from your log files? -- Jan Høydahl, search solution architect

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread Erick Erickson
Let's make sure we're talking about the same thing. Solr happily indexes and stores long (64-bit) values, no problem. What it doesn't do is assign _internal_ document IDs as longs; those are ints. On admin/statistics, look at maxDocs and numDocs. maxDocs +1 will be the next _internal_ lucene doc

Re: Indexation Speed?

2012-06-20 Thread Bruno Mannina
A quick question, please: I have directories with around 30 files of 40 MB each, with around 17,000 docs in each file. Is it better to index: - file by file with java -jar 1.xml, java -jar 2.xml, etc. or - all at the same time with java -jar *.xml All files are verified, so my question is just con

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread avenka
Yes, wonky indeed. numDocs : -2006905329 maxDoc : -1993357870 And yes, I meant that the holes are in the database auto-increment ID space, nothing to do with lucene IDs. I will set up sharding. But is there any way to retrieve most of the current index? Currently, all select queries even in
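
A worked check of the numbers above, as a sketch: it assumes the negative values on the statistics page are plain signed 32-bit wraparound of the real counters, which the thread suggests but does not confirm. The helper name unwrap_int32 is made up for this illustration.

```python
INT32_MAX = 2**31 - 1  # largest signed 32-bit value: 2,147,483,647


def unwrap_int32(value):
    """Map a negative signed 32-bit reading back to the unsigned count it wrapped from."""
    return value + 2**32 if value < 0 else value


# Values reported in the message above.
num_docs = unwrap_int32(-2006905329)
max_doc = unwrap_int32(-1993357870)

print(num_docs)              # 2288061967
print(max_doc)               # 2301609426
print(max_doc - INT32_MAX)   # 154125779 past the signed 32-bit limit
```

Under that assumption the index holds roughly 2.29 billion documents, which is past the 2,147,483,647 ceiling for internal document IDs.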

Malay Language Detection

2012-06-20 Thread Rohit
Hi, We are using http://code.google.com/p/language-detection/ along with Solr for language detection, but it seems that the following jar doesn't have support for Malay detection. So, I created the profile for Malay which is used by the jar, this works in local test environment, but I don'

How to import this Json-line by DIH?

2012-06-20 Thread jueljust
-- View this message in context: http://lucene.472066.n3.nabble.com/How-to-import-this-Json-line-by-DIH-tp3990544.html Sent from the Solr - User mailing list archive at Nabble.com.

solrj and replication

2012-06-20 Thread tom
Hi, I was just wondering if I need to do something special if I want to have an embedded slave to get replication working? My setup is like so: - in my clustered application that uses embedded solr(j) (for performance). the cores are configured as slaves that should connect to a master which run

Re: Indexation Speed?

2012-06-20 Thread Erick Erickson
I doubt you'll find any significant difference in indexing speed. But the post.jar file is really intended as a demo program to quickly get the examples working. It was never intended to be a production-ready program. I'd think about using something like SolrJ etc. to index the docs. And I'm assum

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread Erick Erickson
That indeed sucks. But I don't personally know of a good way to try to split apart an existing index into shards. I'm afraid you're going to be stuck with re-indexing. Wish I had a better solution. Erick On Wed, Jun 20, 2012 at 10:45 AM, avenka wrote: > Yes, wonky indeed. >  numDocs : -2006905

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread avenka
Thanks. Do you know if the tons of index files with names like '_zxt.tis' in the index/data/ directory have the lucene IDs embedded in the binaries? The files look good to me and are partly readable even if in binary. I am wondering if I could just set up a new solr instance and move these index fi

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread Erick Erickson
Don't even try to do that. First of all, you have to have a reliable way to index the same docs to the same shards. The docs are all mixed up in the segment files and would lead to chaos. Solr/Lucene report the same doc multiple times if it's in different shards, so if you ever updated a document, y
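
One common way to get "the same docs to the same shards" is to route on a stable hash of the unique key in the indexing client. The sketch below is a client-side illustration only, not a Solr feature described in this thread; shard_for and the shard count are hypothetical.

```python
import hashlib


def shard_for(doc_id, num_shards):
    """Return a stable shard index in [0, num_shards) for a document's unique key."""
    # Python's built-in hash() is randomized per process for strings, so a
    # cryptographic digest is used here purely for its stability across runs.
    digest = hashlib.md5(str(doc_id).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


# The same key always lands on the same shard, so an update replaces the old
# copy instead of creating a duplicate on another shard.
first = shard_for("doc-42", 4)
second = shard_for("doc-42", 4)
```

Note that changing the number of shards changes the mapping, so this scheme still implies re-indexing when the shard count changes.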

write.lock

2012-06-20 Thread Christopher Gross
I'm running Solr 3.4. The past 2 months I've been getting a lot of write.lock errors. I switched to the "simple" lockType (and made it clear the lock on restart), but my index is still locking up a few times a week. I can't seem to determine what is causing the locks -- does anyone out there hav

Help with Solr File Based spell check

2012-06-20 Thread Sanjay Dua - Network
Hi, We are trying to implement file based search in our application using Solr 1.4. This is the code we have written - - default solr.Fi

Re: LanguageDetection inside of ExtractingRequestHandler

2012-06-20 Thread Jan Høydahl
Hi, In my opinion, instead of hardcoding such functionality into multiple request handlers, we should go the opposite direction -> modularization, factoring out Tika extraction into its own UpdateProcessor (https://issues.apache.org/jira/browse/SOLR-1763). Then the ExtractingRequestHandler wou

Exception using distributed field-collapsing

2012-06-20 Thread Bryan Loofbourrow
I am doing a search on three shards with identical schemas (I double-checked!), using the group feature, and Solr/Lucene 3.5. Solr is giving me back the exception listed at the bottom of this email: Other information: My schema uses the following field types: StrField, DateField, TrieDateFiel

Re: Exception using distributed field-collapsing

2012-06-20 Thread Martijn v Groningen
Hi Bryan, What is the fieldtype of the groupField? You can only group by a field that is of type string, as is described in the wiki: http://wiki.apache.org/solr/FieldCollapsing#Request_Parameters When you group by another field type an HTTP 400 should be returned instead of this error. At least that
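
For context, a distributed grouping request of the kind discussed here might be assembled as below. This is a sketch: the shard hosts and the field name manu_exact are hypothetical, and the field is assumed to be a string type in every shard's schema, as the reply above requires.

```python
from urllib.parse import urlencode


def grouped_query_url(base_url, shards, group_field, query="*:*"):
    """Build a select URL that groups results by one field across several shards."""
    params = {
        "q": query,
        "shards": ",".join(shards),   # comma-separated host:port/path entries
        "group": "true",              # turn result grouping on
        "group.field": group_field,   # assumed to be a string-typed field
    }
    return f"{base_url}/select?{urlencode(params)}"


# Hypothetical three-shard setup, for illustration only.
example_url = grouped_query_url(
    "http://shard1:8983/solr",
    ["shard1:8983/solr", "shard2:8983/solr", "shard3:8983/solr"],
    "manu_exact",
)
```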

RE: Exception using distributed field-collapsing

2012-06-20 Thread Bryan Loofbourrow
> Hi Bryan, > > What is the fieldtype of the groupField? You can only group by a field > that is of type string, as is described in the wiki: > http://wiki.apache.org/solr/FieldCollapsing#Request_Parameters > > When you group by another field type an HTTP 400 should be returned > instead of this error.

Re: Indexation Speed?

2012-06-20 Thread Bruno Mannina
Hi Erick, I doubt you'll find any significant difference in indexing speed. But the post.jar file is really intended as a demo program to quickly get the examples working. It was never intended to be a production-ready program. I'd think about using something like SolrJ etc. to index the docs.

RE: How to import this Json-line by DIH?

2012-06-20 Thread Steven A Rowe
Hi jueljust, Nabble removed the entire content of your email before sending it to the mailing list. Maybe use a different service that doesn't throw away your message? Steve From: jueljust [juelj...@gmail.com] Sent: Wednesday, June 20, 2012 10:56 AM To:

Re: Indexation Speed?

2012-06-20 Thread Erik Hatcher
I think it's a bit of an "it depends" on whether post.jar is the Right choice for production. It -is- SolrJ inside after all, Erick :) and it's pretty much the same as using curl. Just be sure you control commits as needed. Erik On Jun 20, 2012, at 15:18, Bruno Mannina wrote: > Hi Eric
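
To illustrate "control commits as needed", here is a rough sketch that posts several XML update payloads and issues a single commit at the end rather than one per file. The update URL is an assumption (the usual example-server address), and the helper names are invented for this illustration.

```python
import urllib.request

UPDATE_URL = "http://localhost:8983/solr/update"  # assumed example-server URL


def build_update_requests(xml_payloads, update_url=UPDATE_URL):
    """Return one POST request per payload, followed by a single commit request."""
    headers = {"Content-Type": "text/xml; charset=utf-8"}
    requests = [
        urllib.request.Request(update_url, data=payload, headers=headers)
        for payload in xml_payloads
    ]
    # Commit once after all documents have been sent.
    requests.append(
        urllib.request.Request(update_url, data=b"<commit/>", headers=headers)
    )
    return requests


def send_all(requests):
    """Send the requests in order; raises urllib.error.URLError on failure."""
    for request in requests:
        with urllib.request.urlopen(request) as response:
            response.read()


# Two tiny hypothetical payloads; real files would be read from disk as bytes.
example_requests = build_update_requests([
    b"<add><doc><field name='id'>1</field></doc></add>",
    b"<add><doc><field name='id'>2</field></doc></add>",
])
```

Calling send_all(example_requests) would need a running Solr instance at the assumed URL.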

Re: solr java.lang.NullPointerException on select queries

2012-06-20 Thread avenka
Erick, thanks for the advice, but let me make sure you haven't misunderstood what I was asking. I am not trying to split the huge existing index in install1 into shards. I am also not trying to make the huge install1 index as one shard of a sharded solr setup. I plan to use a sharded setup only fo

Re: Apache Lucene Eurocon 2012

2012-06-20 Thread Lance Norskog
Hello Mikhail- Your mail did not come through. Hope things are well, Lance Norskog Lucid Imagination On Wed, Jun 20, 2012 at 11:16 AM, Mikhail Khludnev wrote: > up > > -- > Sincerely yours > Mikhail Khludnev > Tech Lead > Grid Dynamics > > >   -- Lance Norskog

Re: Editing solr update handler sub class

2012-06-20 Thread Shameema Umer
Can anybody tell me where are the lucene jar files org.apache.lucene.index and org.apache.lucene.search located? Thanks Shameema On Wed, Jun 20, 2012 at 4:44 PM, Shameema Umer wrote: > Hi, > > I decompiled DirectUpdateHandler2.class to .java file and edited it to > suit my requirement to stop ov

Re: Editing solr update handler sub class

2012-06-20 Thread irshad siddiqui
Hi, The jar files are located in the dist folder. Check your dist folder, or check your solrconfig.xml file, where you will find the jar location path. On Thu, Jun 21, 2012 at 9:47 AM, Shameema Umer wrote: > Can anybody tell me where are the lucene jar files > org.apache.lucene.index and org.apache.

Re: parameters to decide solr memory consumption

2012-06-20 Thread Sachin Aggarwal
Thanks for the help. I tried an exercise: I am storing the schema (uuid, key, userlocation). uuid and key are unique, and userlocation has a cardinality of 150. uuid and key are stored and indexed, while userlocation is indexed but not stored. Still, the index directory size is 51 MB for just 200,000 records do
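
The figures above work out to a per-record cost as follows. This is a sketch that assumes "MB" means 2^20 bytes and that the size scales linearly with record count, which is a crude assumption given how many variables affect index size.

```python
index_bytes = 51 * 1024**2   # 51 MB as reported, taken as 2**20-byte units
records = 200_000

bytes_per_record = index_bytes / records
print(round(bytes_per_record))  # 267


def estimate_index_bytes(record_count, per_record=bytes_per_record):
    """Naive linear extrapolation of on-disk index size."""
    return record_count * per_record
```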

Re: solr limits

2012-06-20 Thread Sachin Aggarwal
Hello, please clarify: does "documents" mean unique IDs or something else? Let's say I have files indexed and each file number is unique, so the file count limit will be 2.14 billion. Assume I have content in a database as records and each record has a unique ID, so the record count limit will be 2.14 billion. Am I right? -- Thanks & Regar

Re: Apache Lucene Eurocon 2012

2012-06-20 Thread Mikhail Khludnev
Ok. Do you know when and where Lucene Eurocon 2012 gonna happen? On Wed, Jun 20, 2012 at 10:16 PM, Mikhail Khludnev < mkhlud...@griddynamics.com> wrote: > up > > -- > Sincerely yours > Mikhail Khludnev > Tech Lead > Grid Dynamics > > > > > -- Sincerely yours Mi

Re: solr limits

2012-06-20 Thread irshad siddiqui
Hi, One index record is one document, with one unique ID. One row in a database corresponds to one document in Solr. On Thu, Jun 21, 2012 at 11:39 AM, Sachin Aggarwal < different.sac...@gmail.com> wrote: > hello, > > plz clarify documents means unique id's or something else > > lets say i hav

Re: solr limits

2012-06-20 Thread Sachin Aggarwal
Thanks. On Thu, Jun 21, 2012 at 11:51 AM, irshad siddiqui wrote: > Hi, > > One index record is one document, with one unique ID. One row in a > database corresponds to one document in Solr. > > > > > > On Thu, Jun 21, 2012 at 11:39 AM, Sachin Aggarwal < > different.sac...@gmail.com> wrote: > > >