Re: Wikipedia or reuters like index for testing facets?

Grant Ingersoll Tue, 14 Jul 2009 18:22:15 -0700

Probably not as generated by the EnwikiDocMaker, but theWikipediaTokenizer in Lucene can pull out richer syntax which couldthen be Teed/Sinked to other fields. Things like categories, relatedlinks, etc. Mostly, though, I was just commenting on the fact that itisn't hard to at least use it for getting docs into Solr.


-Grant
On Jul 14, 2009, at 7:38 PM, Jason Rutherglen wrote:

You think enwiki has enough data for faceting?
On Tue, Jul 14, 2009 at 2:56 PM, GrantIngersoll<gsing...@apache.org> wrote:
At a min, it is trivial to use the EnWikiDocMaker and then send thedoc over
SolrJ...

On Jul 14, 2009, at 4:07 PM, Mark Miller wrote:
On Tue, Jul 14, 2009 at 3:36 PM, Jason Rutherglen <
jason.rutherg...@gmail.com> wrote:
Is there a standard index like what Lucene uses for contrib/benchmark forexecuting faceted queries over? Or maybe we can randomly generateone
that
works in conjunction with wikipedia? That way we can execute realworldqueries against faceted data. Or we could use the Lucene/Solrmailing
lists
and other data (ala Lucid's faceted site) as a standard index?
I don't think there is any standard set of docs for solr testing -there
is
not a real benchmark contrib - though I know more than a few of ushavehacked up pieces of Lucene benchmark to work with Solr - I thinkI've done
it twice now ;)
Would be nice to get things going. I was thinking the other day: Iwonderhow hard it would be to make Lucene Benchmark generic enough toaccept
Solr
impls and Solr algs?

It does a lot that would suck to duplicate.

--
--
- Mark

http://www.lucidimagination.com
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)using
Solr/Lucene:
http://www.lucidimagination.com/search


--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)using Solr/Lucene:

http://www.lucidimagination.com/search

Re: Wikipedia or reuters like index for testing facets?

Reply via email to