Requesting pivot facets should presumably return what you are asking for. However, it takes time. Overall, the problem seems more suitable for Mahout or (really sorry for mentioning it) Hadoop.
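To make the pivot facet idea concrete, here is a rough sketch of the request parameters and of folding the response into one term map per author. The field name `text`, the helper names and the limit value are assumptions for illustration; only `authors_id` comes from the original question. Note that facet counts are numbers of matching documents per term, not in-document term frequencies, so this only approximates a true term vector.

```python
from urllib.parse import urlencode


def pivot_facet_query(author_field="authors_id", text_field="text", limit=100):
    """Build the query string for a Solr pivot facet request.

    The text field name and the limit are assumptions; adjust to the schema.
    """
    params = {
        "q": "*:*",
        "rows": 0,          # only facet counts are needed, not documents
        "wt": "json",
        "facet": "true",
        "facet.pivot": f"{author_field},{text_field}",
        "facet.limit": limit,  # applies to both pivot levels
    }
    return urlencode(params)


def author_vectors(response, author_field="authors_id", text_field="text"):
    """Turn a parsed JSON pivot facet response into {author: {term: count}}."""
    key = f"{author_field},{text_field}"
    pivots = response.get("facet_counts", {}).get("facet_pivot", {}).get(key, [])
    vectors = {}
    for author in pivots:
        # Each top-level entry is one author; its nested "pivot" lists the terms.
        terms = author.get("pivot", [])
        vectors[author["value"]] = {t["value"]: t["count"] for t in terms}
    return vectors


if __name__ == "__main__":
    # Hand-written sample in the shape of a pivot facet response (not real data).
    sample = {
        "facet_counts": {
            "facet_pivot": {
                "authors_id,text": [
                    {"field": "authors_id", "value": "a1", "count": 3,
                     "pivot": [
                         {"field": "text", "value": "solr", "count": 2},
                         {"field": "text", "value": "lucene", "count": 1},
                     ]},
                ]
            }
        }
    }
    print(pivot_facet_query())
    print(author_vectors(sample))
```

The query string would be appended to the collection's select handler URL. With many authors and a large vocabulary the response gets big and slow, which is why an offline job may fit better.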
On Thu, Nov 27, 2014 at 3:01 PM, Norgorn <lsunnyd...@mail.ru> wrote:

> I'm working with social media data.
> We have blog posts in our index - text + authors_id.
> Now we need to clusterize authors by their texts. We need to get term vector
> not for documents, but one vector per one author (for all authors documents).
>
> We can't get all documents and then unite 'em cause It'll take ages.
>
> And we can't just concat all posts in one mega-post per author (to have one
> document per author), cause our data grows every day and we receive new
> posts for authors.
>
> Can u suggest any solution?
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Terms-vector-for-multiple-documents-tp4171297.html
> Sent from the Solr - User mailing list archive at Nabble.com.

--
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics

<http://www.griddynamics.com>
<mkhlud...@griddynamics.com>