Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-11-10 Thread Israel Ekpo
On Tue, Nov 10, 2009 at 8:26 AM, Eugene Dzhurinsky wrote: > On Tue, Nov 03, 2009 at 05:49:23PM -0800, Lance Norskog wrote: > > The DIH has improved a great deal from Solr 1.3 to 1.4. You will be > > much better off using the DIH from this. > > > > This is the current Solr release candidate binary

Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-11-10 Thread Eugene Dzhurinsky
On Tue, Nov 03, 2009 at 05:49:23PM -0800, Lance Norskog wrote: > The DIH has improved a great deal from Solr 1.3 to 1.4. You will be > much better off using the DIH from this. > > This is the current Solr release candidate binary: > http://people.apache.org/~gsingers/solr/1.4.0/ In fact we are pr

Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-11-03 Thread Lance Norskog
The DIH has improved a great deal from Solr 1.3 to 1.4. You will be much better off using the DIH from this. This is the current Solr release candidate binary: http://people.apache.org/~gsingers/solr/1.4.0/ On Tue, Nov 3, 2009 at 8:08 AM, Eugene Dzhurinsky wrote: > On Mon, Nov 02, 2009 at 05:45:

Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-11-03 Thread Eugene Dzhurinsky
On Mon, Nov 02, 2009 at 05:45:37PM -0800, Lance Norskog wrote: > About large XML files and http overhead: you can tell solr to load the > file directly from a file system. This will stream thousands of > documents in one XML file without loading everything in memory at > once. > > This is a new bo

Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-11-02 Thread Lance Norskog
About large XML files and http overhead: you can tell solr to load the file directly from a file system. This will stream thousands of documents in one XML file without loading everything in memory at once. This is a new book on Solr. It will help you through this early learning phase. http://www

Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-11-02 Thread Alexey Serba
Hi Eugene, > - ability to iterate over all documents, returned in search, as Lucene does >  provide within a HitCollector instance. We would need to extract and >  aggregate various fields, stored in index, to group results and aggregate > them >  in some way. > > Also I did not find any way

Re: adding and updating a lot of document to Solr, metadata extraction etc

2009-10-30 Thread Israel Ekpo
On Fri, Oct 30, 2009 at 11:23 AM, Eugene Dzhurinsky wrote: > Hi there! > > We are trying to evaluate Apache Solr for our custom search implementation, > which > includes the following requirements: > > - ability to add/update/delete a lot of documents at once > > - ability to iterate over all docu