Hello,
I'm yet another new solr user and I'll confess that I haven't read the documentation in great depth but hope someone can at least point me in the right direction.
I have an application that manages documents in real-time into collections where a given document can live in more than one collection and multiple users can create collections on the fly.
I get from reading that it's better to have a single index over all documents than to have one per collection or per user. However, to limit searches to one collection or to the collections owned by a single user it seems that I'd need to store collection id and user id metadata fields in each document. Every time a user adds a document to one of their collections, if that document has been added by other users to their collections, I'd have to update the metadata fields in that document and reindex it. This seems like trouble if I scale to hundreds of users over thousands of collections over millions of documents. At a minimum it sounds like indexing could be slow and my query cache would never be of much value since I'd be doing a lot of reindexing.
Am I completely off base here? What's a solr/lucene best practice scenario for this sort of application?
Regards, Phil ____________________________ DLPS, University of Michigan