On Jan 8, 2008 8:13 PM, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> Perhaps consider using a copyField to copy the relevant values into
> another field - then you can get the top tokens across all these fields
> with luke.
That sounds like the best solution, thanks. Also means I'd be able to
have i
Hi, I am using the post.jar tool to post files to solr. I d like to post
everything in a folder, e.g., "myfolder". I typed in command:
java -jar post.jar c:/myfolder/*.xml.
This works perfectly when I test on a sample of 100k xml files. But when I
work on the real dataset, there are over 1m file
I have a problem - memory and performance issues for more than 10 request
(solr Search and Facet) per second.
On tomcat, it requires 4 to 5 G Bytes, but still not enough.
Does anyone have any experience regarding high volume and performance issue
on Tomcat and JBOss and resolutions share with me?
Is there a practical reason behind trying to post 1m different files
instead of several large files. If this is a unix setup can you try
post.sh instead.
-Original Message-
From: zqzuk [mailto:[EMAIL PROTECTED]
Sent: Wednesday, January 09, 2008 6:14 AM
To: solr-user@lucene.apache.org
Subj
Best practice indexing doesn't create intermediate files at all, but
constructs in memory and posts to solr via an indexing program. There
are java, ruby, python, etc clients to help you talk to Solr over
HTTP.
If you don't want to do any programming and your data is in a
database, using a CSV du
For solrj users...
The SolrDocumentList numFound and offsets just changed to support >2B
docs -- you may need to update some code to avoid loss of precision errors.
Unless you are using SOLR-303, the numFound will never be out of integer
range, so in that case a simple cast should be enough.
see the other responses to address the solr/best practices aspects of your
sitaution, but as to the specifics of why you are seeing what you are
seeing:
your shell (regardless of wether you are on a unix box or a windows box)
has toresolve the "*.xml" part of that command line and convert it t
I have a ps of 10 set in the config. so for search terms like 'work injury' ,
results that have work and injury within 10 words of one another are coming
up in the first few results which is good. But it also returns results that
have work and injury are more than 10 words apart although they are
On 9-Jan-08, at 2:16 PM, anuvenk wrote:
I have a ps of 10 set in the config. so for search terms like 'work
injury' ,
results that have work and injury within 10 words of one another
are coming
up in the first few results which is good. But it also returns
results that
have work and injury
Hello-
I've put together a little sample using wicket and solr.
http://example.solrstuff.org/solr-voyager-example/
this uses solrj to hit:
http://example.solrstuff.org/pvs/
Its still got some problems and I'm still stumbling through getting the
models/structure correct, so if anyone is intere
10 matches
Mail list logo