Hi all,
I'm trying to use solrj for indexing in solr, but when I try to instantiate
the server, using :
SolrServer server = new CommonsHttpSolrServer("http://localhost:8080/solr";);
I get the following runtime error:
Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/solr/cl
Greetings. I have a solrj client for fetching data from database. I am using
delta-import for fetching data. If a column is changed in database using
timestamp with delta-import i get the latest column indexed but there are
duplicate values in the index similar to the column but the data is older.
Hi,
Yes, this is normal behavior. This is because Solr is *document* based, it does
not know about *files*.
What happens here is that your source database (or whatever) has had deletinons
within this category in addition to updates, and you need to relay those to
Solr.
The best way to integrat
Hi,
I have on my schema
This field is returned as
2010-08-11T10:11:03.354Z
For an article added at 2010-08-11T11:11:03.354Z!
And the server has the time of 2010-08-11T11:11:03.354Z...
This is a w2003 server using solr 1.4.
Any guess of what could be wrong here?
Tha
Hi,
Which time zone are you located in? Do you have DST?
Solr uses UTC internally for dates, which means that "NOW" will be the time in
London right now :) Does that appear to be right 4 u?
Also see this thread: http://search-lucene.com/m/hqBed2jhu2e2/
--
Jan Høydahl, search solution architect
Hi Jan,
Dah, I didn't know that :(
I always thought it used the servertime.
Anyway,just out of curiosity, the hour is UTC but NOT the time in London right
now.
London is UTC+1 (same as here in Portugal) :).
So, London solr users should have the same "problem".
Well, I must be careful when us
Short answer is no, there isn't a way. Solr doesn't have the concept of
'Update' to an indexed document. You need to add the full document (all
'columns') each time any one field changes. If doing that in your
DataImportHandler logic is difficult you may need to write a separate Update
Service tha
For what it's worth, London and the rest of the UK is currently
observing British Summer Time (called Daylight Savings Time in other
parts of the world) which is why we appear to be UTC+1 between the
last Sunday in March and the last Sunday in October.
Mark
On 11 Aug 2010, at 12:36 pm, Fre
Hi,
Make sure you use a proper "ID" field, which does *not* change even if the
content in the database changes. In this way, when your delta-import fetches
changed rows to index, they will update the existing rows in your index.
--
Jan Høydahl, search solution architect
Cominvent AS - www.comin
FYI, I opened https://issues.apache.org/jira/browse/SOLR-2036
for this.
-Yonik
http://www.lucidimagination.com
On Tue, Aug 10, 2010 at 8:35 PM, entdeveloper
wrote:
>
> Apologies if this was resolved, but we just deployed Solr 1.4.1 and the stats
> page takes over a minute to load for us as well
Hi folks,
why does FileListEntityProcessor ignores onError="continue" and abort
indexing if a directory or a file does not exist?
I'm using both XPathEntityProcessor and FileListEntityProcessor with
onError set to continue. In case a directory or file is not present an
Exception is thrown an
Sorry, there was a mistake in the stack trace. The correct one is:
SEVERE: Full Import failed
org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir'
value: /home/doe/foo is not a directory Processing Document # 3
at
org.apache.solr.handler.dataimport.FileListEntityProc
i have a question about the solr index mechanism with DIH ...
i try to understand how solr index an doc, and on wich code-elements, solr
using lucene.
thats my stand:
DIH is using the SolrWriter to add an doc.
To create an SolrInoputDocument SolrWriter uses the addUpdateCommand,
This Command an
Thanks so much for your help! I got "Remote Streaming is disabled" error. Would
you please tell me if I miss something?
Thanks,
-Original Message-
From: Jayendra Patil [mailto:jayendra.patil@gmail.com]
Sent: Tuesday, August 10, 2010 8:51 PM
To: solr-user@lucene.apache.org
Subject:
oh, i see that i mixed DIH classes with other Solr classes ^^
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-Doc-Lucene-Doc-tp995922p1088738.html
Sent from the Solr - User mailing list archive at Nabble.com.
Thanks, I knew how to enable Streaming. But I got another error, ERROR:unknown
field 'metadata_trapped'.
Does anyone know how to match up with SolrCell metadata? I found the following
in schema.xml. I don't know how to make changes for PDF.
I really appreciate your help!
Thanks,
-Origin
Hi,
we are using solr 1.4.1 in a master-slave setup with replication,
requests are loadbalanced to both instances. this is just working fine,
but the slave
behaves strange sometimes with a "SolrException log" (trace below). We
are using 1.4.1 for weeks now, and this has happened only a few times
s
Hi Peter,
Can you give a few more examples of slow queries?
Are they phrase queries? Boolean queries? prefix or wildcard queries?
If one word queries are your slow queries, than CommonGrams won't help.
CommonGrams will only help with phrase queries.
How are you using termvectors? That may be
Hi Jan,
Seems q=mount OR mount* have different sorting order with q=mount for those
documents including mount.
Change to q=mount^100 OR (mount?* -mount)^1.0, and test well.
Thanks very much!
2010/8/10 Jan Høydahl / Cominvent
> Hi,
>
> You don't need to duplicate the content into two fields to
On Wed, Aug 11, 2010 at 11:47 AM, Burton-West, Tom wrote:
> Hi Peter,
>
> Can you give a few more examples of slow queries?
> Are they phrase queries? Boolean queries? prefix or wildcard queries?
> If one word queries are your slow queries, than CommonGrams won't help.
> CommonGrams will only hel
That's awesome.
Thanks Ahmet!
On Wed, Aug 11, 2010 at 1:50 AM, Ahmet Arslan wrote:
>
>
> --- On Wed, 8/11/10, Moazzam Khan wrote:
>
>> From: Moazzam Khan
>> Subject: Re: Need help with facets
>> To: solr-user@lucene.apache.org
>> Date: Wednesday, August 11, 2010, 1:32 AM
>> Thanks Ahmet that w
Hi there,
Just wondering what tools people use to analyse SOLR log files.
We're looking to do things like extracting common queries, calculating
averaging
Qtime and hits, returning particularly slow/expensive queries, etc.
Would prefer not to code something (completely) from scratch.
Thanks
Hi there, I have a question about filter (fq) performance in Solr 1.3.
After doing some testing it seems as though adding a filter increases
search time. From what I've read here
http://www.derivante.com/2009/06/23/solr-filtering-performance-increase/
and here
http://www.lucidimagination.com/blog/
fq's are the preferred way to use for filtering when the same filter is
often used. (since the filter-set can be cached seperately) .
as to your direct question:
> My question is whether there is anything that can be done in 1.3 to
help alleviate the problem, before upgrading to 1.4?
I don't thi
Hi,
I have installed solr 1.4 and am trying to use the Data Import Handler to
import data from a database. I have 2 tables which share a 1 to many
relation (1 Story to Many Images).
I want my index to contain attributes regarding “Story” and also all
“Images” that it has. Based on the DIH doc
The search with the filter takes longer than a search for the same term
but no filter after repeated searches, after the cache should have come
into play. To be more specific, this happens on filters that exclude
very few results from the overall set.
For instance, type:video returns few results
I guess q=mount OR (mount*)^0.01 would work equally as well, i.e. diminishing
the effect of wildcard matches.
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Training in Europe - www.solrtraining.com
On 11. aug. 2010, at 17.53, yandong yao wrote:
> Hi Jan,
>
> Seems q
It may not be the data config. Do you have the fields in the schema.xml that
the image data is going to set to be multiValued="true"?
Although, I would think the last image would be stored, not the first, but
haven't really tested this.
--
View this message in context:
http://lucene.472066.n3.n
Hi,
Can someone tell me why the two following queries do not return the same
results?
Is that a bug or a feature?
http://localhost:8983/jobs/select?fq=title:(NOT janitor)&fq=description:(NOT
janitor)&q=*:*
http://localhost:8983/jobs/select?q=title:(NOT janitor) AND description:(NOT
janitor)
1) Is there any information on preferred maximum sizes for a single solr
index. I've read some people say 10 million, some say 80 million, etc...
Is there any official recommendation or has anyone experimented with large
datasets into the tens of billions?
2) Is there any down side to running m
I'm trying to use Solr to index the contents of an Excel file, using
the ExtractingRequestHandler (CSV handler won't work for me - I need
to consider the whole spreadsheet as one document), and I'm running
into some trouble.
Is there any way to see what's going on during the indexing process?
I'm
Have a look at www.splunk.com
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Training in Europe - www.solrtraining.com
On 11. aug. 2010, at 19.34, Jay Flattery wrote:
> Hi there,
>
>
> Just wondering what tools people use to analyse SOLR log files.
>
> We're lookin
Your syntax looks a bit funny.
Which version of Solr are you using? Pure negative queries are not supported,
try q=(*:* -title:janitor) instead.
Also, for debugging what's going on, please add &debugQuery=true and share the
parsed query for both cases with us.
--
Jan Høydahl, search solution a
I tried making the schema fields that get the image data to
multiValued="true". But it still gets only the first image data. It doesn't
have information about all the images.
On Wed, Aug 11, 2010 at 1:15 PM, kenf_nc wrote:
>
> It may not be the data config. Do you have the fields in the schem
Hi,
You can try Tika command line to parse your Excel file, then you will se the
exact textual output from it, which will be indexed into Solr, and thus inspect
whether something is missing.
Are you sure you use a version of Luke which supports your version of Lucene?
--
Jan Høydahl, search so
To follow up on my own question, it appears this is only an issue when using
the DataImport console debugging tools. It looks like when submitting the
debugging request, the data-config.xml is sent via a GET request, which
would fail. However, using the exact same data-config.xml via a full-impor
Hi,
I did load of the data with DIH and now once the data is loaded. I want to
load the records dynamically as an when I received.
Use cases:
1. I did load of 7MM records and now everything is working fine.
2. A new record is received, now I want to add this new record into the
indexed dat
Hi, I have 3 facet fields (A,B,C) the values of each facet field will
be shown as check boxes to users:
Field A
[x] Val1a
[x] Val2a
[] Val3a
Field B
[x] Val1b
[] Val2b
[] Val3b
Within a field if the user selects two items I want the queries to be
an "OR" query. Currently I'm generating some
On Thu, Aug 12, 2010 at 7:12 AM, Frank A wrote:
> Hi, I have 3 facet fields (A,B,C) the values of each facet field will
> be shown as check boxes to users:
>
> Field A
> [x] Val1a
> [x] Val2a
> [] Val3a
>
> Field B
> [x] Val1b
> [] Val2b
> [] Val3b
>
> Within a field if the user selects two i
Have you tried changing the -Xmx value to bump to -Xmx1300m?
I had some problem with DIH loading the data and when I bumped the
memory everything worked fine!
harrysmith wrote:
To follow up on my own question, it appears this is only an issue when using
the DataImport console debugging tools.
Thanks.
I've done Tika command line to parse the Excel file, and I see
contents in it that don't appear to be indexed. I've tried the path of
using Tika to parse the Excel and then using extracting request
handler to index the resulting text, and that doesn't work either.
As far as Luke goes, I'v
Can do you a DB join on OurID? That makes the association in the
database, before it gets to the DataImportHandler.
On Sun, Aug 8, 2010 at 6:17 PM, Frank A wrote:
> Hi,
>
> I have a db handler with the following definition:
>
> query="select OurID,Name,City,State,lat,ln
Thus when I access http://localhost/solr/select?q=*:* equals
http://localhost/solr/core0/select?q=*:*.
I think I know where you're headed, I was struggling with the same issue. In
my case, using results from Solr I link to a detailed profile using an ID,
but I am displaying the String value. I was looking for something like:
12345
Feature 1 label
1
Feature 2 label
2
...or so
44 matches
Mail list logo