Re: Embedded about 50% faster for indexing

Erik Hatcher Sun, 26 Aug 2007 13:11:16 -0700


On Aug 24, 2007, at 5:29 PM, Wu, Daniel wrote:

Theoretically and practically, embedded solution will be faster than
going through http/xml.  I would like to see solr has some sort of
document source adapter architecture which will iterate through allthedocuments available in the document source. This way, if thedocumentscome from database for example, it can be simply a sql query in thesolr
configuration file.


In Ruby land we already have this sort of mechanism:

  source = YourCustomDataSource.new
  mapping = {
    :title = :custom_data_source_title,
    :author = :custom_data_source_author
    # ...
  }
  indexer = Solr::Indexer.new(source, mapping)
  indexer.index

The only requirements are that YourCustomDataSource returns documentssuccessively via the #each method, and that each of those documentshave an attribute accessor #[] method.This already works with CSV files, another Solr server, or an arrayof objects with the current solr-ruby library.

It is my dream to see JRuby blend into Solr to facilitate this sortof thing "in process".


        Erik

Re: Embedded about 50% faster for indexing

Reply via email to