Best platform for hosting Solr

2015-02-18 Thread Ganesh.Yadav
Guys, 1. Can anyone suggest what would be the best platform to host Solr on any Unix or windows server? 2. All I will be doing is importing lots of PDF documents into Solr. I believe Solr will automatically build the schema for imported documents. 3. Can someone suggest what

PDF search functionality using Solr

2015-01-06 Thread Ganesh.Yadav
Hello Solr-users and developers, Can you please suggest, 1. What I should do to index PDF content information column wise? 2. Do I need to extract the contents using one of the Analyzer, Tokenize and Filter combination and then add it to Index? How can test the results on command pr

RE: PDF search functionality using Solr Schema.xml and SolrConfig.xml question

2015-01-06 Thread Ganesh.Yadav
Thanks Jürgen for your quick reply. Still looking for answer on Schema.xml and SolrConfig.xml 1. Do I need to tell Solr, to extract Title from PDF, go look for Title word and extract entire line after the Tag and collect all such occurrence’s from hundreds of PDFs and build the Title co

RE: Running Multiple Solr Instances

2015-01-06 Thread Ganesh.Yadav
Nishanth, 1. I understand you are implementing clustering for the web apps which is running the same application on multiple different instances on one or more machines. 2. If each of your web apps start pointing to the different index directory, how it will switch to the next web

RE: PDF search functionality using Solr Schema.xml and SolrConfig.xml question

2015-01-06 Thread Ganesh.Yadav
Still looking for answer on Schema.xml and SolrConfig.xml 1. Do I need to tell Solr, to extract Title from PDF, go look for Title word and extract entire line after the Tag and collect all such occurrence’s from hundreds of PDFs and build the Title column data and index it? 2. How

RE: .htaccess / password

2015-01-06 Thread Ganesh.Yadav
Craig, 1. What is .htaccess file meant for? 2. What are the contents inside this file? 3. How will you or how Solr knows that it needs to look for this file to bring in the needed security to this (which) area? 4. What event is causing for you to re-index the engine eve

OutOfMemoryError for PDF document upload into Solr

2015-01-14 Thread Ganesh.Yadav
Hello, Can someone pass on the hints to get around following error? Is there any Heap Size parameter I can set in Tomcat or in Solr webApp that gets deployed in Solr? I am running Solr webapp inside Tomcat on my local machine which has RAM of 12 GB. I have PDF document which is 4 GB max in size

RE: OutOfMemoryError for PDF document upload into Solr

2015-01-15 Thread Ganesh.Yadav
Siegfried and Michael Thank you for your replies and help. -Original Message- From: Siegfried Goeschl [mailto:sgoes...@gmx.at] Sent: Thursday, January 15, 2015 3:45 AM To: solr-user@lucene.apache.org Subject: Re: OutOfMemoryError for PDF document upload into Solr Hi Ganesh, you can incr