date:20100811

Problem instantiating CommonsHttpSolrServer using solrj

2010-08-11 Thread bijeet singh

Hi all, I'm trying to use solrj for indexing in solr, but when I try to instantiate the server, using: SolrServer server = new CommonsHttpSolrServer("http://localhost:8080/solr"); I get the following runtime error: Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/solr/cl

Delta-import with solrj client

2010-08-11 Thread Hando420

Greetings. I have a solrj client for fetching data from a database, and I am using delta-import to fetch the data. If a column is changed in the database (tracked by timestamp), delta-import indexes the latest value, but the index also contains duplicate entries that match the column and hold the older data.

Re: solr query result not read the latest xml file

2010-08-11 Thread Jan Høydahl / Cominvent

Hi, Yes, this is normal behavior. This is because Solr is *document* based, it does not know about *files*. What happens here is that your source database (or whatever) has had deletions within this category in addition to updates, and you need to relay those to Solr. The best way to integrat

timestamp field

2010-08-11 Thread Frederico Azeiteiro

Hi, I have on my schema This field is returned as 2010-08-11T10:11:03.354Z For an article added at 2010-08-11T11:11:03.354Z! And the server has the time of 2010-08-11T11:11:03.354Z... This is a w2003 server using solr 1.4. Any guess of what could be wrong here? Tha

Re: timestamp field

2010-08-11 Thread Jan Høydahl / Cominvent

Hi, Which time zone are you located in? Do you have DST? Solr uses UTC internally for dates, which means that "NOW" will be the time in London right now :) Does that appear to be right 4 u? Also see this thread: http://search-lucene.com/m/hqBed2jhu2e2/ -- Jan Høydahl, search solution architect

RE: timestamp field

2010-08-11 Thread Frederico Azeiteiro

Hi Jan, Dah, I didn't know that :( I always thought it used the server time. Anyway, just out of curiosity, the hour is UTC but NOT the time in London right now. London is UTC+1 (same as here in Portugal) :). So, London solr users should have the same "problem". Well, I must be careful when us

Re: Delta-import with solrj client

2010-08-11 Thread kenf_nc

Short answer is no, there isn't a way. Solr doesn't have the concept of 'Update' to an indexed document. You need to add the full document (all 'columns') each time any one field changes. If doing that in your DataImportHandler logic is difficult you may need to write a separate Update Service tha

Re: timestamp field

2010-08-11 Thread Mark Allan

For what it's worth, London and the rest of the UK is currently observing British Summer Time (called Daylight Savings Time in other parts of the world) which is why we appear to be UTC+1 between the last Sunday in March and the last Sunday in October. Mark On 11 Aug 2010, at 12:36 pm, Fre
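To illustrate the point made in this thread, here is a minimal sketch of converting a Solr UTC date string to local summer time. It is not taken from the thread: the helper names `parse_solr_date` and `to_local` are hypothetical, and a fixed UTC+1 offset is assumed to stand in for summer time in the UK and Portugal in August 2010.

```python
from datetime import datetime, timedelta, timezone

# Assumption: a fixed UTC+1 offset stands in for British Summer Time /
# Portuguese summer time in August 2010, as described in the thread.
SUMMER_OFFSET = timezone(timedelta(hours=1), "UTC+1")


def parse_solr_date(value: str) -> datetime:
    """Parse a Solr date string such as 2010-08-11T10:11:03.354Z as UTC."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


def to_local(value: str, tz: timezone = SUMMER_OFFSET) -> datetime:
    """Convert a Solr UTC date string to the given local offset."""
    return parse_solr_date(value).astimezone(tz)


# The value returned by Solr in the original question, shown in local time.
local = to_local("2010-08-11T10:11:03.354Z")
print(local.isoformat())
```

A real application would probably use a named time zone rather than a fixed offset, so that the winter and summer offsets are both handled.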

Re: Delta-import with solrj client

2010-08-11 Thread Jan Høydahl / Cominvent

Hi, Make sure you use a proper "ID" field, which does *not* change even if the content in the database changes. In this way, when your delta-import fetches changed rows to index, they will update the existing rows in your index. -- Jan Høydahl, search solution architect Cominvent AS - www.comin
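The effect of a stable ID can be sketched with a toy model. This is not Solr or solrj code: `ToyIndex` is a hypothetical stand-in that only mimics the "re-adding a document with the same unique key replaces the old one" behavior described above, and the field names and values are invented for illustration.

```python
class ToyIndex:
    """Toy stand-in for an index keyed on a unique ID field (not Solr itself)."""

    def __init__(self, unique_key: str):
        self.unique_key = unique_key
        self.docs = {}

    def add(self, doc: dict) -> None:
        # Adding a document whose key already exists replaces the old one.
        # The whole document is stored again; there is no per-field update.
        self.docs[doc[self.unique_key]] = dict(doc)


# Stable ID: the changed row overwrites the earlier version.
stable = ToyIndex("id")
stable.add({"id": "row-1", "title": "old title"})
stable.add({"id": "row-1", "title": "new title"})

# ID derived from the content: the changed row becomes a second document,
# which is one way the duplicates described in this thread could arise.
unstable = ToyIndex("id")
unstable.add({"id": "old title", "title": "old title"})
unstable.add({"id": "new title", "title": "new title"})

print(len(stable.docs), len(unstable.docs))
```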

Re: Solr 1.4 - stats page slow

2010-08-11 Thread Yonik Seeley

FYI, I opened https://issues.apache.org/jira/browse/SOLR-2036 for this. -Yonik http://www.lucidimagination.com On Tue, Aug 10, 2010 at 8:35 PM, entdeveloper wrote: > > Apologies if this was resolved, but we just deployed Solr 1.4.1 and the stats > page takes over a minute to load for us as well

DataImportHandler in Solr 1.4.1: exception handling in FileListEntityProcessor

2010-08-11 Thread Sascha Szott

Hi folks, why does FileListEntityProcessor ignore onError="continue" and abort indexing if a directory or a file does not exist? I'm using both XPathEntityProcessor and FileListEntityProcessor with onError set to continue. In case a directory or file is not present an Exception is thrown an

Re: DataImportHandler in Solr 1.4.1: exception handling in FileListEntityProcessor

2010-08-11 Thread Sascha Szott

Sorry, there was a mistake in the stack trace. The correct one is: SEVERE: Full Import failed org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' value: /home/doe/foo is not a directory Processing Document # 3 at org.apache.solr.handler.dataimport.FileListEntityProc

Re: Solr Doc Lucene Doc !?

2010-08-11 Thread stockii

I have a question about the Solr indexing mechanism with DIH ... I am trying to understand how Solr indexes a doc, and in which code elements Solr uses Lucene. This is where I stand: DIH uses the SolrWriter to add a doc. To create a SolrInputDocument, SolrWriter uses the addUpdateCommand. This Command an

RE: PDF file

2010-08-11 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]

Thanks so much for your help! I got "Remote Streaming is disabled" error. Would you please tell me if I miss something? Thanks, -Original Message- From: Jayendra Patil [mailto:jayendra.patil@gmail.com] Sent: Tuesday, August 10, 2010 8:51 PM To: solr-user@lucene.apache.org Subject:

Re: Solr Doc Lucene Doc !?

2010-08-11 Thread stockii

oh, i see that i mixed DIH classes with other Solr classes ^^ -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Doc-Lucene-Doc-tp995922p1088738.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: PDF file

2010-08-11 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]

Thanks, I knew how to enable Streaming. But I got another error, ERROR:unknown field 'metadata_trapped'. Does anyone know how to match up with SolrCell metadata? I found the following in schema.xml. I don't know how to make changes for PDF. I really appreciate your help! Thanks, -Origin

SolrException log

2010-08-11 Thread Bastian Spitzer

Hi, we are using solr 1.4.1 in a master-slave setup with replication, requests are load-balanced to both instances. This is working fine, but the slave sometimes behaves strangely with a "SolrException log" (trace below). We have been using 1.4.1 for weeks now, and this has happened only a few times s

RE: Improve Query Time For Large Index

2010-08-11 Thread Burton-West, Tom

Hi Peter, Can you give a few more examples of slow queries? Are they phrase queries? Boolean queries? prefix or wildcard queries? If one word queries are your slow queries, then CommonGrams won't help. CommonGrams will only help with phrase queries. How are you using termvectors? That may be

Re: how to support "implicit trailing wildcards"

2010-08-11 Thread yandong yao

Hi Jan, It seems q=mount OR mount* produces a different sort order than q=mount for documents containing mount. I changed it to q=mount^100 OR (mount?* -mount)^1.0, and it tests well. Thanks very much! 2010/8/10 Jan Høydahl / Cominvent > Hi, > > You don't need to duplicate the content into two fields to

Re: Improve Query Time For Large Index

2010-08-11 Thread Robert Muir

On Wed, Aug 11, 2010 at 11:47 AM, Burton-West, Tom wrote: > Hi Peter, > > Can you give a few more examples of slow queries? > Are they phrase queries? Boolean queries? prefix or wildcard queries? > If one word queries are your slow queries, than CommonGrams won't help. > CommonGrams will only hel

Re: Need help with facets

2010-08-11 Thread Moazzam Khan

That's awesome. Thanks Ahmet! On Wed, Aug 11, 2010 at 1:50 AM, Ahmet Arslan wrote: > > > --- On Wed, 8/11/10, Moazzam Khan wrote: > >> From: Moazzam Khan >> Subject: Re: Need help with facets >> To: solr-user@lucene.apache.org >> Date: Wednesday, August 11, 2010, 1:32 AM >> Thanks Ahmet that w

Analysing SOLR logfiles

2010-08-11 Thread Jay Flattery

Hi there, Just wondering what tools people use to analyse SOLR log files. We're looking to do things like extracting common queries, calculating average QTime and hits, returning particularly slow/expensive queries, etc. Would prefer not to code something (completely) from scratch. Thanks
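As a rough idea of what such an analysis involves, here is a small sketch. The log line layout (`params={...} hits=N status=N QTime=N`) is an assumption about the request log format and should be checked against real logs; the sample lines, the `summarize` helper and the 100 ms threshold are all hypothetical.

```python
import re
from collections import Counter
from urllib.parse import parse_qs

# Assumed request-log layout: ... params={q=...&rows=...} hits=N status=N QTime=N
LINE_RE = re.compile(
    r"params=\{(?P<params>.*)\} hits=(?P<hits>\d+) status=\d+ QTime=(?P<qtime>\d+)"
)


def summarize(lines, slow_ms=100):
    """Collect common queries, average QTime/hits and slow queries."""
    queries = Counter()
    qtimes, hits, slow = [], [], []
    for line in lines:
        match = LINE_RE.search(line)
        if match is None:
            continue  # not a request line
        q = parse_qs(match.group("params")).get("q", [""])[0]
        qtime = int(match.group("qtime"))
        queries[q] += 1
        qtimes.append(qtime)
        hits.append(int(match.group("hits")))
        if qtime >= slow_ms:
            slow.append((q, qtime))
    count = len(qtimes)
    return {
        "average_qtime": sum(qtimes) / count if count else 0.0,
        "average_hits": sum(hits) / count if count else 0.0,
        "common_queries": queries.most_common(3),
        "slow_queries": slow,
    }


# Hypothetical sample lines, made up for this sketch.
sample = [
    "INFO: [] webapp=/solr path=/select params={q=mount&rows=10} hits=42 status=0 QTime=15",
    "INFO: [] webapp=/solr path=/select params={q=mount&rows=10} hits=42 status=0 QTime=5",
    "INFO: [] webapp=/solr path=/select params={q=title:janitor} hits=7 status=0 QTime=250",
    "INFO: Opening Searcher@1a2b3c main",
]
report = summarize(sample)
print(report)
```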

Filter Performance in Solr 1.3

2010-08-11 Thread Bargar, Matthew B

Hi there, I have a question about filter (fq) performance in Solr 1.3. After doing some testing it seems as though adding a filter increases search time. From what I've read here http://www.derivante.com/2009/06/23/solr-filtering-performance-increase/ and here http://www.lucidimagination.com/blog/

Re: Filter Performance in Solr 1.3

2010-08-11 Thread Geert-Jan Brits

fq's are the preferred way to filter when the same filter is used often (since the filter set can be cached separately). As to your direct question: > My question is whether there is anything that can be done in 1.3 to help alleviate the problem, before upgrading to 1.4? I don't thi

Data Import Handler Query

2010-08-11 Thread Manali Joshi

Hi, I have installed solr 1.4 and am trying to use the Data Import Handler to import data from a database. I have 2 tables which share a 1 to many relation (1 Story to Many Images). I want my index to contain attributes regarding “Story” and also all “Images” that it has. Based on the DIH doc

RE: Filter Performance in Solr 1.3

2010-08-11 Thread Bargar, Matthew B

The search with the filter takes longer than a search for the same term but no filter after repeated searches, after the cache should have come into play. To be more specific, this happens on filters that exclude very few results from the overall set. For instance, type:video returns few results

Re: how to support "implicit trailing wildcards"

2010-08-11 Thread Jan Høydahl / Cominvent

I guess q=mount OR (mount*)^0.01 would work equally well, i.e. diminishing the effect of wildcard matches. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solrtraining.com On 11. aug. 2010, at 17.53, yandong yao wrote: > Hi Jan, > > Seems q
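The query shape suggested here can be generated on the client side. The following is a minimal sketch, assuming a single bare search term with no characters that need escaping; the helper name `trailing_wildcard_query` is hypothetical.

```python
from urllib.parse import urlencode


def trailing_wildcard_query(term: str, wildcard_boost: float = 0.01) -> str:
    """Match the exact term at full weight and prefix matches at a low boost."""
    # Assumption: term is a single word without query-syntax characters.
    return f"{term} OR ({term}*)^{wildcard_boost}"


q = trailing_wildcard_query("mount")
params = urlencode({"q": q})  # URL-encoded form for a select request
print(q)
print(params)
```

Keeping the boost as a parameter makes it easy to experiment with how strongly exact matches should outrank wildcard matches.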

Re: Data Import Handler Query

2010-08-11 Thread kenf_nc

It may not be the data config. Are the schema.xml fields that receive the image data set to multiValued="true"? Although, I would think the last image would be stored, not the first, but I haven't really tested this. -- View this message in context: http://lucene.472066.n3.n

bug or feature???

2010-08-11 Thread Jean-Sebastien Vachon

Hi, Can someone tell me why the two following queries do not return the same results? Is that a bug or a feature? http://localhost:8983/jobs/select?fq=title:(NOT janitor)&fq=description:(NOT janitor)&q=*:* http://localhost:8983/jobs/select?q=title:(NOT janitor) AND description:(NOT janitor)

General questions about distributed solr shards

2010-08-11 Thread JohnRodey

1) Is there any information on preferred maximum sizes for a single solr index. I've read some people say 10 million, some say 80 million, etc... Is there any official recommendation or has anyone experimented with large datasets into the tens of billions? 2) Is there any down side to running m

Indexing and ExtractingRequestHandler

2010-08-11 Thread Harry Hochheiser

I'm trying to use Solr to index the contents of an Excel file, using the ExtractingRequestHandler (CSV handler won't work for me - I need to consider the whole spreadsheet as one document), and I'm running into some trouble. Is there any way to see what's going on during the indexing process? I'm

Re: Analysing SOLR logfiles

2010-08-11 Thread Jan Høydahl / Cominvent

Have a look at www.splunk.com -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Training in Europe - www.solrtraining.com On 11. aug. 2010, at 19.34, Jay Flattery wrote: > Hi there, > > > Just wondering what tools people use to analyse SOLR log files. > > We're lookin

Re: bug or feature???

2010-08-11 Thread Jan Høydahl / Cominvent

Your syntax looks a bit funny. Which version of Solr are you using? Pure negative queries are not supported, try q=(*:* -title:janitor) instead. Also, for debugging what's going on, please add &debugQuery=true and share the parsed query for both cases with us. -- Jan Høydahl, search solution a
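A small sketch of building the suggested form programmatically follows. The helper names are hypothetical, and combining both exclusions into one query is an assumption about what the original poster intended (documents with "janitor" in neither field).

```python
def exclude_term(field: str, term: str) -> str:
    """Match all documents, then subtract those where field contains term."""
    return f"(*:* -{field}:{term})"


def exclude_all(clauses: dict) -> str:
    """Subtract several field:term pairs from the match-all query."""
    negatives = " ".join(f"-{field}:{term}" for field, term in clauses.items())
    return f"(*:* {negatives})"


single = exclude_term("title", "janitor")
both = exclude_all({"title": "janitor", "description": "janitor"})
print(single)
print(both)
```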

Re: Data Import Handler Query

2010-08-11 Thread Manali Joshi

I tried setting the schema fields that get the image data to multiValued="true". But it still gets only the first image data. It doesn't have information about all the images. On Wed, Aug 11, 2010 at 1:15 PM, kenf_nc wrote: > > It may not be the data config. Do you have the fields in the schem

Re: Indexing and ExtractingRequestHandler

2010-08-11 Thread Jan Høydahl / Cominvent

Hi, You can try the Tika command line to parse your Excel file, then you will see the exact textual output from it, which will be indexed into Solr, and thus inspect whether something is missing. Are you sure you use a version of Luke which supports your version of Lucene? -- Jan Høydahl, search so

Re: DIH transformer script size limitations with Jetty?

2010-08-11 Thread harrysmith

To follow up on my own question, it appears this is only an issue when using the DataImport console debugging tools. It looks like when submitting the debugging request, the data-config.xml is sent via a GET request, which would fail. However, using the exact same data-config.xml via a full-impor

DIH - Insert another record After first load

2010-08-11 Thread Girish

Hi, I loaded the data with DIH, and now that the data is loaded I want to add records dynamically as and when I receive them. Use cases: 1. I loaded 7MM records and now everything is working fine. 2. A new record is received, now I want to add this new record into the indexed dat

How to "OR" facet queries

2010-08-11 Thread Frank A

Hi, I have 3 facet fields (A,B,C) the values of each facet field will be shown as check boxes to users: Field A [x] Val1a [x] Val2a [] Val3a Field B [x] Val1b [] Val2b [] Val3b Within a field if the user selects two items I want the queries to be an "OR" query. Currently I'm generating some
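One possible way to get OR within a field and AND across fields is to emit one filter query per field, since separate fq parameters are intersected. The sketch below is only an illustration of that idea, not the approach from the truncated message; the helper name `facet_filters` is hypothetical and the field and value names are taken from the example above.

```python
def facet_filters(selected: dict) -> list:
    """Build one fq string per facet field, OR-ing the values ticked within it."""
    filters = []
    for field, values in selected.items():
        if not values:
            continue  # nothing ticked for this field, so no filter
        quoted = " OR ".join(f'"{value}"' for value in values)
        filters.append(f"{field}:({quoted})")
    return filters


# Checkbox state from the example: Val1a and Val2a in field A, Val1b in field B.
fqs = facet_filters({"A": ["Val1a", "Val2a"], "B": ["Val1b"], "C": []})
print(fqs)
```

Each string in the returned list would be sent as its own fq parameter.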

Re: How to "OR" facet queries

2010-08-11 Thread Geek Gamer

On Thu, Aug 12, 2010 at 7:12 AM, Frank A wrote: > Hi, I have 3 facet fields (A,B,C) the values of each facet field will > be shown as check boxes to users: > > Field A > [x] Val1a > [x] Val2a > [] Val3a > > Field B > [x] Val1b > [] Val2b > [] Val3b > > Within a field if the user selects two i

Re: DIH transformer script size limitations with Jetty?

2010-08-11 Thread Girish Pandit

Have you tried changing the -Xmx value to bump to -Xmx1300m? I had some problem with DIH loading the data and when I bumped the memory everything worked fine! harrysmith wrote: To follow up on my own question, it appears this is only an issue when using the DataImport console debugging tools.

Re: Indexing and ExtractingRequestHandler

2010-08-11 Thread Harry Hochheiser

Thanks. I've done Tika command line to parse the Excel file, and I see contents in it that don't appear to be indexed. I've tried the path of using Tika to parse the Excel and then using extracting request handler to index the resulting text, and that doesn't work either. As far as Luke goes, I'v

Re: Schema Definition Question

2010-08-11 Thread Lance Norskog

Can you do a DB join on OurID? That makes the association in the database, before it gets to the DataImportHandler. On Sun, Aug 8, 2010 at 6:17 PM, Frank A wrote: > Hi, > > I have a db handler with the following definition: > > query="select OurID,Name,City,State,lat,ln

In multicore env, can I make it access core0 by default

2010-08-11 Thread Chengyang

So that accessing http://localhost/solr/select?q=*:* is equivalent to accessing http://localhost/solr/core0/select?q=*:*.

Re: Schema Definition Question

2010-08-11 Thread harrysmith

I think I know where you're headed, I was struggling with the same issue. In my case, using results from Solr I link to a detailed profile using an ID, but I am displaying the String value. I was looking for something like: 12345 Feature 1 label 1 Feature 2 label 2 ...or so

44 matches
