One reason to consider separate indexes is in terms of relevance. Do
you want content from classifieds effecting the rankings of your news
searches? May not be an issue for you depending on your term
distributions, but might be something to consider. As you suspect,
though, having multiple indexes will require more management of the
various instances. Perhaps you can logically group things to only
have a couple of indexes? For instance, maybe home, auto, classifieds
are similar in content and structure and news and community-generated
content are similar?
-Grant
On Nov 5, 2007, at 9:34 AM, Tim Archambault wrote:
Typical newspaper site with: news, jobs, homes, autos, classifieds,
community-generated content, guestimate of .5 million documents
Do I really need to create a different solr index for each vertical?
How
ineffecient is it to add a few additional fields for each content
type?
Thinking of having a string field name "vertical" that would be used
to
segment by verticals above.
My intuition is that most of the additional fields would be numbers:
integers, prices, decimals.
Thanks,
Tim
--
True innovation is not just about changing a product, a service or
even a
marketplace; its also about recognizing and relishing the need to
change
yourself.
--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com
Lucene Boot Camp Training:
ApacheCon Atlanta, Nov. 12, 2007. Sign up now! http://www.apachecon.com
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ