On Jan 29, 2007, at 8:49 PM, Antonio Eggberg wrote:
After doing quite a bit of searching what I understand is that the medicine to my problem of word count is in docTermFreq and TermEnum ... as Chris Hostetter points out clearly for statistical purpose in the post below. (Please note I am not so familer with java)

http://www.mail-archive.com/solr-dev@lucene.apache.org/msg02347.html

Based on the discussion in SOLR-116 it seems like it is there somewhere .. Can I access it somehow view solrb i.e ruby gem.. oh it would save me so much trouble trying to get it right in Java.

I would appreciate a clarification if its possible to access it via ruby/solrb..


solrb provides this kind of capability now, via the &qt=indexinfo request handler:

> require 'solr'
> require 'pp'  # pretty print
> connection = Solr::Connection.new("http://localhost:8983/solr";)
> pp connection.send(Solr::Request::IndexInfo.new)
#<Solr::Response::IndexInfo:0x7dd46c
@data=
  {"NOTICE"=>"This interface is experimental and may be changing",
   "fields"=>
    {"author_text"=>{"type"=>"text"},
     "subject_genre_facet"=>{"type"=>"string"},
     "text"=>{"type"=>"text"},
     "subject_geographic_facet"=>{"type"=>"string"},
     "subject_format_facet"=>{"type"=>"string"},
     "id"=>{"type"=>"string"},
     "subject_era_facet"=>{"type"=>"string"},
     "subject_topic_facet"=>{"type"=>"string"},
     "title_text"=>{"type"=>"text"}},
"index"=>{"numDocs"=>50000, "version"=>1168970065801, "maxDoc"=>50000},
   "responseHeader"=>{"status"=>0, "QTime"=>0}},
@header={"status"=>0, "QTime"=>0},
@raw_response="{'responseHeader'=>{'status'=>0,'QTime'=>0},'fields'=> {'title_text'=>{'type'=>'text'},'subject_format_facet'=> {'type'=>'string'},'subject_geographic_facet'=> {'type'=>'string'},'subject_topic_facet'=> {'type'=>'string'},'subject_genre_facet'=>{'type'=>'string'},'text'=> {'type'=>'text'},'author_text'=>{'type'=>'text'},'subject_era_facet'=> {'type'=>'string'},'id'=>{'type'=>'string'}},'index'=> {'maxDoc'=>50000,'numDocs'=>50000,'version'=>1168970065801},'NOTICE'=>'T his interface is experimental and may be changing'}">

So, no, the current information provided by this handler does not contain frequency information. I'd be happy to consider patches that allow it to provide more information, though I'd like to keep the basic index information request as succinct as possible, using additional parameters to output more details if requested.

        Erik

Reply via email to