RE: Problems indexing very large set of documents

2011-04-11 Thread Brandon Waterloo
whether indexing was successful or not. So my question is, how can I tell from the response whether or not indexing was actually successful? ~Brandon Waterloo From: Lance Norskog [goks...@gmail.com] Sent: Sunday, April 10, 2011 5:22 PM To: solr-user@lucen

RE: Problems indexing very large set of documents

2011-04-08 Thread Brandon Waterloo
I think I've finally found the problem. The files that work are PDF version 1.6. The files that do NOT work are PDF version 1.4. I'll look into updating all the old documents to PDF 1.6. Thanks everyone! ~Brandon Waterloo From: Ezequiel Calder

RE: Problems indexing very large set of documents

2011-04-08 Thread Brandon Waterloo
. From: Brandon Waterloo [brandon.water...@matrix.msu.edu] Sent: Friday, April 08, 2011 10:40 AM To: solr-user@lucene.apache.org Subject: RE: Problems indexing very large set of documents I had some time to do some research into the problems. From what I can

RE: Problems indexing very large set of documents

2011-04-08 Thread Brandon Waterloo
their filenames. In addition they are being used by other applications. Is there something I can change in Solr configs to fix this issue or am I simply SOL until the Solr dev team can work on this? (assuming I put in a ticket) Thanks again everyone, ~Brandon Waterloo

RE: Problems indexing very large set of documents

2011-04-05 Thread Brandon Waterloo
/extract?literal.id=32-130-B00-84&commit=false HTTP/1.1" 500 4557 That is by no means all the errors, that is just a sample of a few. You can see they all threw HTTP 500 errors. What is strange is, nearly every file succeeded before about the 2200-files-mark, and nearly every file after th

RE: Problems indexing very large set of documents

2011-04-04 Thread Brandon Waterloo
Looks like I'm using Tika 0.4: apache-solr-1.4.1/contrib/extraction/lib/tika-core-0.4.jar .../tika-parsers-0.4.jar ~Brandon Waterloo From: Anuj Kumar [anujs...@gmail.com] Sent: Monday, April 04, 2011 2:12 PM To: solr-user@lucene.apache.org Cc: Br

Problems indexing very large set of documents

2011-04-04 Thread Brandon Waterloo
often than that for the sake of efficiency. Obviously it takes long enough to index 4000 documents and there's no reason to make it take longer. Thanks for your help! ~Brandon Waterloo

RE: Multiple Cores with Solr Cell for indexing documents

2011-03-25 Thread Brandon Waterloo
is to move back to single-core searching and then test it with a huge amount of documents to see whether performance is seriously impacted or not. So for now, I guess we can consider this thread resolved. Thanks for all your help guys! ~Brandon Waterloo __

Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Brandon Waterloo
logous data, there's no need to have any different library modules for each. The relevant line in my solr.xml file is . That is housed in .../example/solr/. So, does it look in .../example/lib or .../example/solr/lib? ~Brandon Waterloo From:

RE: Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Brandon Waterloo
logous data, there's no need to have any different library modules for each. The relevant line in my solr.xml file is . That is housed in .../example/solr/. So, does it look in .../example/lib or .../example/solr/lib? ~Brandon Waterloo From:

Multiple Cores with Solr Cell for indexing documents

2011-03-24 Thread Brandon Waterloo
link me to would be greatly appreciated. Thank you all for your time! ~Brandon Waterloo

Multiple Cores with Solr Cell for indexing documents

2011-03-22 Thread Brandon Waterloo
ava.lang.ClassLoader.loadClass(ClassLoader.java:248) ... 27 more RequestURI=/solr/core0/update/extracthttp://jetty.mortbay.org/";>Powered by Jetty:// Any assistance you could provide or installation guides/tutorials/etc. that you could link me to would be great