On Jan 29, 2007, at 8:49 PM, Antonio Eggberg wrote:
After quite a bit of searching, my understanding is that the
remedy for my word count problem lies in docTermFreq and
TermEnum, as Chris Hostetter points out clearly for statistical
purposes in the post below. (Please note I am not very familiar with Java.)
http://www.mail-archive.com/solr-dev@lucene.apache.org/msg02347.html
Based on the discussion in SOLR-116, it seems like it is there
somewhere. Can I access it somehow via solrb, i.e. the Ruby gem? It
would save me so much trouble trying to get it right in Java.
I would appreciate a clarification on whether it's possible to access
it via Ruby/solrb.
solrb exposes index-level information now, via the &qt=indexinfo
request handler:
> require 'solr'
> require 'pp' # pretty print
> connection = Solr::Connection.new("http://localhost:8983/solr")
> pp connection.send(Solr::Request::IndexInfo.new)
#<Solr::Response::IndexInfo:0x7dd46c
@data=
{"NOTICE"=>"This interface is experimental and may be changing",
"fields"=>
{"author_text"=>{"type"=>"text"},
"subject_genre_facet"=>{"type"=>"string"},
"text"=>{"type"=>"text"},
"subject_geographic_facet"=>{"type"=>"string"},
"subject_format_facet"=>{"type"=>"string"},
"id"=>{"type"=>"string"},
"subject_era_facet"=>{"type"=>"string"},
"subject_topic_facet"=>{"type"=>"string"},
"title_text"=>{"type"=>"text"}},
"index"=>{"numDocs"=>50000, "version"=>1168970065801,
"maxDoc"=>50000},
"responseHeader"=>{"status"=>0, "QTime"=>0}},
@header={"status"=>0, "QTime"=>0},
@raw_response="{'responseHeader'=>{'status'=>0,'QTime'=>0},'fields'=>
{'title_text'=>{'type'=>'text'},'subject_format_facet'=>
{'type'=>'string'},'subject_geographic_facet'=>
{'type'=>'string'},'subject_topic_facet'=>
{'type'=>'string'},'subject_genre_facet'=>{'type'=>'string'},'text'=>
{'type'=>'text'},'author_text'=>{'type'=>'text'},'subject_era_facet'=>
{'type'=>'string'},'id'=>{'type'=>'string'}},'index'=>
{'maxDoc'=>50000,'numDocs'=>50000,'version'=>1168970065801},'NOTICE'=>
'This interface is experimental and may be changing'}">
So, no, the current information provided by this handler does not
contain frequency information. I'd be happy to consider patches that
allow it to provide more information, though I'd like to keep the
basic index information request as succinct as possible, using
additional parameters to output more details if requested.
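
For what it's worth, here is a minimal sketch of inspecting that
response on the client side. It assumes the hash shown under @data
above has been copied by hand into a local variable (abbreviated here,
not fetched from a live server); the names data, fields_by_type and
field_keys are made up for illustration and are not part of solrb. It
groups the fields by type and lists which keys each field entry
carries, confirming that only "type" is reported:

```ruby
require 'pp'

# Hand-copied, abbreviated version of the @data hash printed above.
# Assumption: a real response would be read from the IndexInfo object instead.
data = {
  "fields" => {
    "author_text"         => {"type" => "text"},
    "subject_genre_facet" => {"type" => "string"},
    "text"                => {"type" => "text"},
    "id"                  => {"type" => "string"},
    "title_text"          => {"type" => "text"}
  },
  "index" => {"numDocs" => 50000, "version" => 1168970065801, "maxDoc" => 50000}
}

# Group field names by their declared type, sorted for stable output.
fields_by_type = Hash.new { |hash, type| hash[type] = [] }
data["fields"].each { |name, info| fields_by_type[info["type"]] << name }
fields_by_type.each_value(&:sort!)

# Collect every key that appears in any field entry.
# Only "type" shows up, so there is no term or document frequency to read.
field_keys = data["fields"].values.flat_map(&:keys).uniq

pp fields_by_type
pp field_keys
```

Anything beyond this, such as per-term counts, would have to come from
additional output added to the handler on the Solr side.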
Erik