Re: Out of memory while creating indexes

2011-03-03 Thread Upayavira
post.jar is intended for demo purposes, not production use, so it doesn't surprise me you've managed to break it. Have you tried using curl to do the post? Upayavira On Thu, 03 Mar 2011 17:02 -0500, "Solr User" wrote: > Hi All, > > I am trying to create indexes out of a 400MB XML file using th

Re: Content-Type of XMLResponseWriter / QueryResponseWriter

2011-03-03 Thread Bernd Fehling
Hi Walter, many thanks! Bernd Am 03.03.2011 17:01, schrieb Walter Underwood: > Never use text/xml, that overrides any encoding declaration inside the XML > file. > > http://ln.hixie.ch/?start=1037398795&count=1 > http://www.grauw.nl/blog/entry/489 > > wunder > == > Lead Engineer, MarkLogic >

Problem using solr 4.0 in java environment

2011-03-03 Thread Isha Garg
Hi, I am using the facet.pivot feature of Solr 4.0. It works well and shows results in the browser. But when I used Solr 4.0 in Java I got the following error: Exception in thread "main" java.lang.NoSuchMethodError: org.slf4j.spi.LocationAwareLogger.log(Lorg/slf4j/Marker;Ljava/lang/String;ILjava/lang/St

Re: Model foreign key type of search?

2011-03-03 Thread Alex Dong
Gora, thanks for the quick reply. Yes, I'm aware of the differences between Solr and a DBMS. We've actually written a C++ analytical engine that can process a billion tweets with multi-facet drill-down. We may end up cooking our own in the end, but so far Solr suits our needs quite well.

Re: Model foreign key type of search?

2011-03-03 Thread Gora Mohanty
On Fri, Mar 4, 2011 at 10:24 AM, Alex Dong wrote: > Hi there,  I need some advice on how to implement this using solr: > > We have two tables: urls and bookmarks. > - Each url has four fields:  {guid, title, text, url} > - One url will have one or more bookmarks associated with it. Each bookmark >

Re: SolrJ Tutorial

2011-03-03 Thread Grijesh
It comes with every solr source code download directory under src/test - Thanx: Grijesh http://lucidimagination.com -- View this message in context: http://lucene.472066.n3.nabble.com/SolrJ-Tutorial-tp2307113p2631223.html Sent from the Solr - User mailing list archive at Nabble.com.

Model foreign key type of search?

2011-03-03 Thread Alex Dong
Hi there, I need some advice on how to implement this using solr: We have two tables: urls and bookmarks. - Each url has four fields: {guid, title, text, url} - One url will have one or more bookmarks associated with it. Each bookmark has these: {link.guid, user, tags, comment} I'd like to retu

Re: Out of memory while creating indexes

2011-03-03 Thread Gora Mohanty
On Fri, Mar 4, 2011 at 3:32 AM, Solr User wrote: > Hi All, > > I am trying to create indexes out of a 400MB XML file using the following > command and I am running into out of memory exception. Is this a single record in the XML file? If it is more than one, breaking it up into separate XML files
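The suggestion above, breaking one large update file into several smaller ones, can be sketched roughly as follows. This is a minimal sketch, assuming the input is a standard Solr `<add>` file whose records are top-level `<doc>` elements; the function name `split_add_file` and the batch size are illustrative and do not come from the thread.

```python
import io
import xml.etree.ElementTree as ET


def split_add_file(source, docs_per_batch):
    """Yield serialized <add> documents holding at most docs_per_batch <doc> elements."""
    # iterparse reads the input incrementally instead of loading it in one call
    context = ET.iterparse(source, events=("start", "end"))
    _event, root = next(context)  # first event is the start of the outer <add>
    batch = ET.Element("add")
    for event, elem in context:
        if event != "end" or elem.tag != "doc":
            continue
        batch.append(elem)
        if len(batch) == docs_per_batch:
            yield ET.tostring(batch)
            batch = ET.Element("add")
            root.clear()  # release docs that were already emitted
    if len(batch):
        yield ET.tostring(batch)  # final, possibly smaller, batch


# Tiny in-memory stand-in for the large file (hypothetical data)
sample = b"<add>" + b"".join(
    b'<doc><field name="id">%d</field></doc>' % i for i in range(5)
) + b"</add>"
batches = list(split_add_file(io.BytesIO(sample), 2))
```

Each yielded batch could then be written to its own file or posted to the update handler on its own, for example with curl.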

Max Document Size

2011-03-03 Thread Sean Todd
Is there a maximum document size that Solr can handle? I'm trying to index documents greater than 15MB, but every time I do I get a random error. One of the other problems with what I'm documenting is that they are not in a human language. They are EDI documents (EDI is a B2B communication syste

Out of memory while creating indexes

2011-03-03 Thread Solr User
Hi All, I am trying to create indexes out of a 400MB XML file using the following command and I am running into out of memory exception. $JAVA_HOME/bin/java -Xms768m -Xmx1024m -Durl=http://$SOLR_HOST:$SOLR_PORT/solr/customercarecore/update -jar $SOLRBASEDIR/dataconvertor/common/lib/post.jar $S

Re: Limiting on dates in Solr

2011-03-03 Thread Steve Lewis
Ugh. Of course. I fixed that a couple weeks ago, something must have crept back in! Thanks a mil! From: Andreas Kemkes To: solr-user@lucene.apache.org Sent: Thu, March 3, 2011 4:12:02 PM Subject: Re: Limiting on dates in Solr 2011-03-03T59:59:99.999Z - should

Re: Limiting on dates in Solr

2011-03-03 Thread Andreas Kemkes
2011-03-03T59:59:99.999Z - shouldn't that be 2011-03-03T23:59:59.999Z From: Steve Lewis To: solr-user@lucene.apache.org Sent: Thu, March 3, 2011 11:21:53 AM Subject: Limiting on dates in Solr I am treating Solr as a NoSQL db that has great search capabilities.

Fwd: [Announce] Now Open: Call for Participation for ApacheCon North America

2011-03-03 Thread Grant Ingersoll
Begin forwarded message: > From: Grant Ingersoll > Date: March 3, 2011 3:52:05 PM EST > To: u...@mahout.apache.org, solr-user@lucene.apache.org, > java-u...@lucene.apache.org, opennlp-u...@incubator.apache.org > Subject: Fwd: [Announce] Now Open: Call for Participation for ApacheCon North > A

Fwd: [Announce] Now Open: Call for Participation for ApacheCon North America

2011-03-03 Thread Grant Ingersoll
Begin forwarded message: > From: Sally Khudairi > Date: March 3, 2011 3:10:17 PM EST > To: annou...@apachecon.com > Subject: [Announce] Now Open: Call for Participation for ApacheCon North > America > Reply-To: s...@apache.org > > Call for Participation > ApacheCon North America 2011 > 7-11

Re: mixing version of solr

2011-03-03 Thread Ofer Fort
we've been running like this for almost six months now and it's working ok. We have a post-commit event on the "master" that executes a commit call on the "slave", this forces the slave to reload the index. We started with a "standard" master/slave replication, but we had a few times that the slav

Re: mixing version of solr

2011-03-03 Thread Jonathan Rochkind
In general, no. I think there are index format changes between 1.4.1 and 4.0. If the two versions of Solr have the exact same index formats, it would theoretically work, but you'd need to figure that out and be sure of it, any two arbitrary versions of Solr/lucene may or may not have the exact

Re: FilterQuery OR statement

2011-03-03 Thread Jonathan Rochkind
You might also consider splitting your two separate "AND" clauses into two separate fq's: &fq=field1:(1 OR 2 OR 3 OR 4) &fq=field2:(4 OR 5 OR 6 OR 7) That will cache the two separate clauses separately in the filter cache, which is probably preferable in general, without knowing more about your
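A minimal sketch of sending the two clauses as separate fq parameters, using only the Python standard library. The helper name `build_select_query` and the `*:*` main query are assumptions for illustration; the field names and values are the ones quoted in the message.

```python
from urllib.parse import urlencode


def build_select_query(q, filter_queries):
    """Return a URL-encoded query string with one fq parameter per filter clause."""
    # A list of pairs (not a dict) lets the fq key repeat
    params = [("q", q)] + [("fq", fq) for fq in filter_queries]
    return urlencode(params)


query_string = build_select_query(
    "*:*",
    ["field1:(1 OR 2 OR 3 OR 4)", "field2:(4 OR 5 OR 6 OR 7)"],
)
```

The resulting string would be appended to the select handler URL after a `?`. Solr ANDs multiple fq parameters together, so the result set matches the original single-clause form.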

Limiting on dates in Solr

2011-03-03 Thread Steve Lewis
I am treating Solr as a NoSQL db that has great search capabilities. I am querying on a few fields: 1. text (default) 2. type (my own string field) 3. calibration (my own date field) I'd like to limit the results to only show the calibration using this query: calibration:[2011-03-03T00:00:00.00

Re: mixing version of solr

2011-03-03 Thread Frederik Kraus
No, that won't work as the index format has changed. On Donnerstag, 3. März 2011 at 20:03, Ofer Fort wrote: > Hey all, > I have a master slave using the same index folder, the master only writes, > and the slave only reads. > Is it possible to use different versions of solr for those two servers?

mixing version of solr

2011-03-03 Thread Ofer Fort
Hey all, I have a master slave using the same index folder, the master only writes, and the slave only reads. Is it possible to use different versions of solr for those two servers? Let's say i want to gain from the improved search speed of solr4.0 but since it's my production system, am not willin

Re: Solr Admin Interface, reworked - Go on? Go away?

2011-03-03 Thread Stefan Matheis
Am 02.03.2011 23:48, schrieb Robert Muir: On Wed, Mar 2, 2011 at 5:34 PM, Stefan Matheis wrote: Robert, even in this WIP-State? if so .. i'll try one tomorrow evening after work Its totally up to you, sometimes it can be useful to upload a partial or WIP solution to an issue: as Hoss menti

Location of Main Class in Solr?

2011-03-03 Thread Anurag
I searched SolrIndexSearcher.java file but there is no main class. I wanted to know as to where this class resides. Can i call this main class (if it exists) using command line options in terminal , rather than through war file? - Kumar Anurag -- View this message in context: http://lucene

Re: FilterQuery OR statement

2011-03-03 Thread Tanner Postert
That worked, thought I tried it before, not sure why it didn't before. Also, is there a way to query without a q parameter? I'm just trying to pull back all of the field results where field1:(1 OR 2 OR 3) etc. so I figured I'd use the FQ param for caching purposes because those queries will likel

Re: FilterQuery OR statement

2011-03-03 Thread Ahmet Arslan
--- On Thu, 3/3/11, Ahmet Arslan wrote: > From: Ahmet Arslan > Subject: Re: FilterQuery OR statement > To: solr-user@lucene.apache.org > Date: Thursday, March 3, 2011, 8:05 PM > > Trying to figure out how I can > run > > something similar to this for the fq > > parameter > > > > Field1 in ( 1,

Re: uniqueKey merge documents on commit

2011-03-03 Thread Jonathan Rochkind
Nope, there is not. On 3/3/2011 10:55 AM, Tim Gilbert wrote: Hi, I have a unique key within my index, but rather than the default behaviour of overwriting I am wondering if there is a method to "merge" the two different documents on commit of the second document. I have a testcase which expla

Re: FilterQuery OR statement

2011-03-03 Thread Ahmet Arslan
> Trying to figure out how I can run > something similar to this for the fq > parameter > > Field1 in ( 1, 2, 3 4 ) > AND > Field2 in ( 4, 5, 6, 7 ) > > I found some examples on the net that looked like this: > &fq=+field1:(1 2 3 > 4) +field2(4 5 6 7) but that yields no results. May be your defa

Re: Dismax, q, q.alt, and defaultSearchField?

2011-03-03 Thread mrw
Thanks, Jan. It looks like what we need to do is use both q and q.alt, such that q.alt is always "*:*" and q is either empty for filter-only queries, or has the user text. That seems to work. Jan Høydahl / Cominvent wrote: > > Hi, > > Try > q.alt={!dismax}banana > > -- > Jan Høydahl, search solut
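The q / q.alt arrangement described in this reply might be assembled along these lines. This is only a sketch: the function name `dismax_params` is hypothetical, and passing `defType=dismax` as a request parameter is an assumption about how the handler is selected in this setup.

```python
def dismax_params(user_text, filters):
    """Build request parameters: q.alt is always *:*, q carries the user text or is empty."""
    params = [
        ("defType", "dismax"),     # assumption: dismax chosen per request
        ("q.alt", "*:*"),          # fallback used when q is blank
        ("q", user_text.strip()),  # empty string for filter-only queries
    ]
    params.extend(("fq", fq) for fq in filters)
    return params


filter_only = dismax_params("", ["type:book"])
with_text = dismax_params("banana", ["type:book"])
```

The list of pairs can be passed to `urllib.parse.urlencode` to produce the query string.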

FilterQuery OR statement

2011-03-03 Thread Tanner Postert
Trying to figure out how I can run something similar to this for the fq parameter Field1 in ( 1, 2, 3 4 ) AND Field2 in ( 4, 5, 6, 7 ) I found some examples on the net that looked like this: &fq=+field1:(1 2 3 4) +field2(4 5 6 7) but that yields no results.

Re: Omit hour-min-sec in search?

2011-03-03 Thread Shane Perry
Not sure if there is a means of doing explicitly what you ask, but you could do a date range: +mydate:[YYYY-MM-DD 0:0:0 TO YYYY-MM-DD 23:59:59] On Thu, Mar 3, 2011 at 9:14 AM, bbarani wrote: > Hi, > > Is there a way to omit hour-min-sec in SOLR date field during search? > > I have indexed a fiel
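A date-range query covering one whole day could be generated as sketched below. The sketch assumes the field uses Solr's UTC date format (`YYYY-MM-DDThh:mm:ssZ`); the helper name `day_range_query` is illustrative, and `mydate` is the field name used in the reply.

```python
from datetime import date


def day_range_query(field, day):
    """Return an inclusive range query spanning the whole UTC day."""
    start = f"{day.isoformat()}T00:00:00Z"
    end = f"{day.isoformat()}T23:59:59.999Z"
    return f"{field}:[{start} TO {end}]"


query = day_range_query("mydate", date(2008, 2, 26))
# query == "mydate:[2008-02-26T00:00:00Z TO 2008-02-26T23:59:59.999Z]"
```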

Re: SolrJ Tutorial

2011-03-03 Thread Bing Li
Dear Lance, Could you tell me where I can find the unit tests code? I appreciate so much for your help! Best regards, LB On Sat, Jan 22, 2011 at 3:58 PM, Lance Norskog wrote: > The unit tests are simple and show the steps. > > Lance > > On Fri, Jan 21, 2011 at 10:41 PM, Bing Li wrote: > > Hi

Re: Solr Admin Interface, reworked - Go on? Go away?

2011-03-03 Thread Stefan Matheis
Hey Jan, On Thu, Mar 3, 2011 at 11:37 AM, Jan Høydahl wrote: > This alone is worthy including, but I also (of course) have some > comments/ideas: [...] Really nice! i'll try to make a list of open todos / missing items and attach it to the JIRA-Ticket. Especially for the dismax- & spatial-query

Omit hour-min-sec in search?

2011-03-03 Thread bbarani
Hi, Is there a way to omit hour-min-sec in SOLR date field during search? I have indexed a field using TrieDateField and seems like it uses UTC format. The dates get stored as below, lastupdateddate">2008-02-26T20:40:30.94Z I want to do a search based on just YYYY-MM-DD and omit T20:40:30.94Z..

Re: Content-Type of XMLResponseWriter / QueryResponseWriter

2011-03-03 Thread Walter Underwood
Never use text/xml, that overrides any encoding declaration inside the XML file. http://ln.hixie.ch/?start=1037398795&count=1 http://www.grauw.nl/blog/entry/489 wunder == Lead Engineer, MarkLogic On Mar 3, 2011, at 7:30 AM, Bernd Fehling wrote: > Dear list, > > is there any deeper logic behind

uniqueKey merge documents on commit

2011-03-03 Thread Tim Gilbert
Hi, I have a unique key within my index, but rather than the default behaviour of overwriting I am wondering if there is a method to "merge" the two different documents on commit of the second document. I have a testcase which explains what I'd like to happen: @Test public void testM

Why is SolrDispatchFilter using 90% of the Time?

2011-03-03 Thread Stijn Vanhoorelbeke
Hi, I'm working with a recent NightlyBuild of Solr and I'm doing some serious ZooKeeper testing. I've NewRelic monitoring enabled on my solr machines. When I look at the distribution of the Response-time I notice 'SolrDispatchFilter.doFilter()' is taking up 90% of the time. The other 10% is used

deletedPKQuery does not perform with compound PK

2011-03-03 Thread Jérôme Droz
Hello, I'm using a DIH to import documents from a database. Documents in the index represent a relationship between two entities, units and dealpoints ("unit has dealpoint"); thus document keys in the index refer to a compound SQL key. Full import works fine. In order to optimize the import p

Content-Type of XMLResponseWriter / QueryResponseWriter

2011-03-03 Thread Bernd Fehling
Dear list, is there any deeper logic behind the fact that XMLResponseWriter is sending CONTENT_TYPE_XML_UTF8="application/xml; charset=UTF-8" ? I would assume (and so would most browsers) that XML output is sent as "text/xml" and not "application/xml". Or do you want the browser to call an XML-E

error in log INFO org.apache.solr.core.SolrCore - webapp=/solr path=/admin/ping params={} status=0 QTime=1

2011-03-03 Thread Mike Franon
I am using solr under jboss, so this might be more of a jboss config issue, not really sure. But my logs keep getting spammed, because solr sends it as ERROR [STDERR] INFO org.apache.solr.core.SolrCore - webapp=/solr path=/admin/ping params={} status=0 QTime=1 Has anyone seen this and found a wor

Re: adding a document using curl

2011-03-03 Thread Jayendra Patil
If you are using the ExtractingRequestHandler, you can also try using the stream.file or stream.url. e.g. curl "http://localhost:8080/solr/core0/update/extract?stream.file=C:/777045.zip&literal.id=777045&literal.title=Test&commit=true" More detailed explanation @ http://www.lucidimagination.co

Re: adding a document using curl

2011-03-03 Thread Gora Mohanty
On Thu, Mar 3, 2011 at 5:31 PM, pankaj bhatt wrote: > Hi All, >       is there any Custom open source SOLR ADMIN application like what > lucid imagination provides in its distribution. >       I am trying to create thing, however thinking it would be a > reinventing of wheel. > >       Request you

Re: adding a document using curl

2011-03-03 Thread Gora Mohanty
On Thu, Mar 3, 2011 at 5:15 PM, Ken Foskey wrote: > On Thu, 2011-03-03 at 12:36 +0100, Markus Jelsma wrote: >> Here's a complete example >> http://wiki.apache.org/solr/UpdateXmlMessages#Passing_commit_parameters_as_part_of_the_URL > > I should have been clearer.   A rich text document,  XML I can

Re: Solr TermsComponent: space in term

2011-03-03 Thread Ahmet Arslan
You need to remove EdgeNGramFilterFactory from your analyzer chain. --- On Thu, 3/3/11, shrinath.m wrote: > From: shrinath.m > Subject: Re: Solr TermsComponent: space in term > To: solr-user@lucene.apache.org > Date: Thursday, March 3, 2011, 1:41 PM > > Markus Jelsma-2 wrote: > > > > http

RE: Understanding multi-field queries with q and fq

2011-03-03 Thread mrw
Yes, we're investigating dismax (with the qf param), but we're not sure it supports our syntax needs. The users want to put AND/OR/NOT in their queries, and we don't want to write a lot of code converting those queries into dismax (+/-/mm) format. So, until 3.1 (edismax) ships, we're also try

Re: Solr Admin Interface, reworked - Go on? Go away?

2011-03-03 Thread mrw
Picture the URI field above the response field, only half-screen. This facilitates breaking the query apart on different lines in order to debug it. When you have a lot of shards, fq clauses, etc., you end up with a very long URI that is difficult to get your head around and manipulate. We ta

Re: adding a document using curl

2011-03-03 Thread Gary Taylor
As an example, I run this in the same directory as the msword1.doc file: curl "http://localhost:8983/solr/core0/update/extract?literal.docid=74&literal.type=5" -F "file=@msword1.doc" The "type" literal is just part of my schema. Gary. On 03/03/2011 11:45, Ken Foskey wrote: On Thu, 2011-0

Re: AlternateDistributedMLT.patch not working

2011-03-03 Thread Edoardo Tosca
Hi all, I am currently working on this AlternateDistributedMLT patch. I've applied it manually on Solr 1.4 and solved some NullPointerException issues. It's now working properly. But I'm not sure about its behaviour so I'll ask you, list: I saw that every MLT query for a doc that is in the resul

Custom SOLR ADMIN Application

2011-03-03 Thread pankaj bhatt
Hi All, is there any Custom open source SOLR ADMIN application like what lucid imagination provides in its distribution. I am trying to create thing, however thinking it would be a reinventing of wheel. Request you to please redirect me, if there is any open source application

Re: adding a document using curl

2011-03-03 Thread pankaj bhatt
Hi All, is there any Custom open source SOLR ADMIN application like what lucid imagination provides in its distribution. I am trying to create thing, however thinking it would be a reinventing of wheel. Request you to please redirect me, if there is any open source application

Re: adding a document using curl

2011-03-03 Thread Ken Foskey
On Thu, 2011-03-03 at 12:36 +0100, Markus Jelsma wrote: > Here's a complete example > http://wiki.apache.org/solr/UpdateXmlMessages#Passing_commit_parameters_as_part_of_the_URL I should have been clearer. A rich text document, XML I can make work and a script is in the example docs folder http

Re: Looking for help with Solr implementation

2011-03-03 Thread Anurag
What is the problem that you are facing in the working of solr? we have done a project on it and it would be good if you send the details on what to implement in the project - Kumar Anurag -- View this message in context: http://lucene.472066.n3.nabble.com/Looking-for-help-with-Solr-impl

Re: Solr TermsComponent: space in term

2011-03-03 Thread shrinath.m
Markus Jelsma-2 wrote: > > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory > well, thank you Markus, Now My schema has the following :

Re: adding a document using curl

2011-03-03 Thread Markus Jelsma
Here's a complete example http://wiki.apache.org/solr/UpdateXmlMessages#Passing_commit_parameters_as_part_of_the_URL On Thursday 03 March 2011 12:31:11 Ken Foskey wrote: > I have read the various pages and used Curl a lot but i cannot figure out > the correct command line to add a document to the

adding a document using curl

2011-03-03 Thread Ken Foskey
I have read the various pages and used Curl a lot but i cannot figure out the correct command line to add a document to the example Solr instance. I have tried a few things however they seem to be for the file on the same server as solr, in my case I am pushing the document from a windows ma

Re: Solr TermsComponent: space in term

2011-03-03 Thread Markus Jelsma
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory On Thursday 03 March 2011 12:15:07 shrinath.m wrote: > iorixxx wrote: > > TermsComponent operates on indexed terms. One way to achieve multi-word > > suggestions is to use ShingleFilterFactory at index time. > >

Re: Solr TermsComponent: space in term

2011-03-03 Thread shrinath.m
iorixxx wrote: > > TermsComponent operates on indexed terms. One way to achieve multi-word > suggestions is to use ShingleFilterFactory at index time. > Thank you @iorixxx. Could you point me where I can find a good docs on how to do this ? -- View this message in context: http://lucene.472

Re: Solr TermsComponent: space in term

2011-03-03 Thread Ahmet Arslan
> Is there no way to achieve what the Op > had to say ? > TermsComponent operates on indexed terms. One way to achieve multi-word suggestions is to use ShingleFilterFactory at index time.
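To illustrate what shingling means here, the sketch below produces word n-grams from a piece of text. It shows the concept only and is not Solr's ShingleFilterFactory implementation; the function name `shingles` and the `max_size` parameter are hypothetical, and emitting single words alongside two-word shingles is an assumption about typical settings.

```python
def shingles(text, max_size=2):
    """Return all word n-grams of length 1..max_size, in position order."""
    words = text.split()
    result = []
    for start in range(len(words)):
        for size in range(1, max_size + 1):
            if start + size <= len(words):
                result.append(" ".join(words[start:start + size]))
    return result


terms = shingles("new york city")
# terms == ["new", "new york", "york", "york city", "city"]
```

Because multi-word strings such as "new york" become single indexed terms, a component that works on indexed terms can return them as suggestions.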

Date range query with mixed inclusive/exclusive

2011-03-03 Thread Tim Terlegård
Is there any chance that https://issues.apache.org/jira/browse/LUCENE-996 will be backported to the 3x branch? I see that it's fixed in trunk, but it will be a while until it's in a release. How do people generally search for documents from lets say year 2009? I thought it would be convenient to d

Re: perfect match in dismax search

2011-03-03 Thread Jan Høydahl
Hi, I'm working on a Filter which enables boundary match using syntax title:"^hello I love you$" which will make sure that the match is exact. See SOLR-1980 (no working patch yet) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com On 3. mars 2011, at 11.07, Markus Jelsm

Re: Solr Admin Interface, reworked - Go on? Go away?

2011-03-03 Thread Jan Høydahl
Hi, This is simply great! Bravo! This alone is worthy including, but I also (of course) have some comments/ideas: The links section on top: * Move the links on top to bottom, reserving the top for navigation. * The "send email" could be changed to "Community forum" and instead of linking to

Re: Solr TermsComponent: space in term

2011-03-03 Thread shrinath.m
why was this thread left unanswered ? Is there no way to achieve what the Op had to say ? -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-TermsComponent-space-in-term-tp1898889p2624203.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Selection Between Solr and Relational Database

2011-03-03 Thread Markus Jelsma
Well, an RDBMS can be very fast but Solr using fq can be very fast as well. Just try fq=group:sports&fq=createdtime: > Dear all, > > I have started to learn Solr for two months. At least right now, my system > runs good in a Solr cluster. > > I have a question when implementing one feature in m

Re: perfect match in dismax search

2011-03-03 Thread Markus Jelsma
Use either the string fieldType or a field with very little analysis (KeywordTokenizer + LowercaseFilter). > How to obtain perfect match with dismax query?? > > es: > > i want to search "hello i love you" with deftype=dismax in the title field > and i want to obtain results which title is exact

perfect match in dismax search

2011-03-03 Thread Gastone Penzo
How can I obtain a perfect match with a dismax query? E.g.: I want to search "hello i love you" with defType=dismax in the title field and I want to obtain results whose title is exactly "hello i love you", with all these terms in this order, no fewer words and no others. How is it possible? I tried with +(h

Re: Boost function problem with disquerymax

2011-03-03 Thread Gastone Penzo
You are right. It was not an indexed field, just stored. Thanx 2011/3/2 Yonik Seeley > On Wed, Mar 2, 2011 at 11:34 AM, Gastone Penzo > wrote: > > HI, > > for search i use disquery max > > and a i want to boost a field with bf parameter like: > > ...&bf=boost_has_img^5& > > the boost_has_img

Selection Between Solr and Relational Database

2011-03-03 Thread Bing Li
Dear all, I have started to learn Solr for two months. At least right now, my system runs good in a Solr cluster. I have a question when implementing one feature in my system. When retrieving documents by keyword, I believe Solr is faster than relational database. However, if doing the following

Re: Dismax, q, q.alt, and defaultSearchField?

2011-03-03 Thread Jan Høydahl
Hi, Try q.alt={!dismax}banana -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com On 2. mars 2011, at 23.06, mrw wrote: > We have two banks of Solr nodes with identical schemas. The data I'm > searching for is in both banks. > > One has defaultSearchField set to field1,

Re: Solr Admin Interface, reworked - Go on? Go away?

2011-03-03 Thread Stefan Matheis
Hey Guys, you're completely right :) Will clean up the existing Code a little bit, and create a JIRA-Ticket. On Wed, Mar 2, 2011 at 11:32 PM, Chris Hostetter wrote: > If you run into any issues where you can't replicate something > in the existing JSPs (or accomplish some new desirable functional