Re: Luke response format explained

2008-01-09 Thread Robert Young
On Jan 8, 2008 8:13 PM, Ryan McKinley <[EMAIL PROTECTED]> wrote: > Perhaps consider using a copyField to copy the relevant values into > another field - then you can get the top tokens across all these fields > with luke. That sounds like the best solution, thanks. Also means I'd be able to have i

Using the post tool - too many files in a folder?

2008-01-09 Thread zqzuk
Hi, I am using the post.jar tool to post files to solr. I d like to post everything in a folder, e.g., "myfolder". I typed in command: java -jar post.jar c:/myfolder/*.xml. This works perfectly when I test on a sample of 100k xml files. But when I work on the real dataset, there are over 1m file

Tomcat and JBOss

2008-01-09 Thread Jae Joo
I have a problem - memory and performance issues for more than 10 request (solr Search and Facet) per second. On tomcat, it requires 4 to 5 G Bytes, but still not enough. Does anyone have any experience regarding high volume and performance issue on Tomcat and JBOss and resolutions share with me?

RE: Using the post tool - too many files in a folder?

2008-01-09 Thread Gunther, Andrew
Is there a practical reason behind trying to post 1m different files instead of several large files. If this is a unix setup can you try post.sh instead. -Original Message- From: zqzuk [mailto:[EMAIL PROTECTED] Sent: Wednesday, January 09, 2008 6:14 AM To: solr-user@lucene.apache.org Subj

Re: Using the post tool - too many files in a folder?

2008-01-09 Thread Yonik Seeley
Best practice indexing doesn't create intermediate files at all, but constructs in memory and posts to solr via an indexing program. There are java, ruby, python, etc clients to help you talk to Solr over HTTP. If you don't want to do any programming and your data is in a database, using a CSV du

Solrj changes (int now long) SOLR-434

2008-01-09 Thread Ryan McKinley
For solrj users... The SolrDocumentList numFound and offsets just changed to support >2B docs -- you may need to update some code to avoid loss of precision errors. Unless you are using SOLR-303, the numFound will never be out of integer range, so in that case a simple cast should be enough.

Re: Using the post tool - too many files in a folder?

2008-01-09 Thread Chris Hostetter
see the other responses to address the solr/best practices aspects of your sitaution, but as to the specifics of why you are seeing what you are seeing: your shell (regardless of wether you are on a unix box or a windows box) has toresolve the "*.xml" part of that command line and convert it t

phrase slop

2008-01-09 Thread anuvenk
I have a ps of 10 set in the config. so for search terms like 'work injury' , results that have work and injury within 10 words of one another are coming up in the first few results which is good. But it also returns results that have work and injury are more than 10 words apart although they are

Re: phrase slop

2008-01-09 Thread Mike Klaas
On 9-Jan-08, at 2:16 PM, anuvenk wrote: I have a ps of 10 set in the config. so for search terms like 'work injury' , results that have work and injury within 10 words of one another are coming up in the first few results which is good. But it also returns results that have work and injury

wicket + solr

2008-01-09 Thread Ryan McKinley
Hello- I've put together a little sample using wicket and solr. http://example.solrstuff.org/solr-voyager-example/ this uses solrj to hit: http://example.solrstuff.org/pvs/ Its still got some problems and I'm still stumbling through getting the models/structure correct, so if anyone is intere