Solr on Amazon EC2

2013-05-06 Thread Rajesh Nikam
Hello, I am looking into how to do document classification for categorization of html documents. I see Solr/Lucene + MoreLikeThis that suits to find similar documents for given document. I am able to do classification using Lucene + MoreLikeThis example. Then I was looking for how to host Solr o

Install Solr on EC2

2013-05-10 Thread Rajesh Nikam
Hi All, I am looking for steps to run Solr 3.6.2 or latest stable version on Amazon EC2. I want this image to saved once created. Could you please help with the steps that needs to be followed. I have tried steps from https://github.com/sunspot/sunspot/wiki/Configure-Solr-on-Ubuntu,-the-quickes

Re: [ANNOUNCE] Web Crawler

2013-05-22 Thread Rajesh Nikam
Hi, crawl anywhere seems to using old versions of java, tomcat, etc. http://www.crawl-anywhere.com/installation-v300/ Will it work with new versions of these required software ? Is there updated installation guide available ? Thanks Rajesh On Wed, May 22, 2013 at 6:48 PM, Dominique Bejean

using solr for web page classification

2013-05-27 Thread Rajesh Nikam
Hello, I am working on implementation of system to categorize URLs/Web Pages. I would have categories like ... Adult Health Business Arts Home Science I am looking at how Lucence/Solr could help me out to achive this. I came across links that mention MoreLik

Re: [blog post] Automatically Acquiring Synonym Knowledge from Wikipedia

2013-05-27 Thread Rajesh Nikam
Hello Koji, This is seems pretty useful post on how to create synonyms file. Thanks a lot for sharing this ! Have you shared source code / jar for the same so at it could be used ? Thanks, Rajesh On Mon, May 27, 2013 at 8:44 PM, Koji Sekiguchi wrote: > Hello, > > Sorry for cross post. I jus

Re: [blog post] Automatically Acquiring Synonym Knowledge from Wikipedia

2013-05-28 Thread Rajesh Nikam
llowing synonym library. > > koji > > > (13/05/28 14:12), Rajesh Nikam wrote: > >> Hello Koji, >> >> This is seems pretty useful post on how to create synonyms file. >> Thanks a lot for sharing this ! >> >> Have you shared source code / jar for the sam