Presumably, requesting pivot facets will return what you are asking for.
However, it takes time. Overall, the problem seems better suited to
Mahout or (really sorry for mentioning it) Hadoop.
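
For illustration, here is a rough sketch of what a pivot facet request and the
per-author aggregation could look like. The field names (authors_id, text) and
the helper names are assumptions, not something taken from your schema. Note
that pivot counts are the number of an author's documents containing a term,
not raw term frequencies, and that an unlimited facet.limit can be slow on a
large index.

```python
from urllib.parse import urlencode


def build_pivot_query(author_field="authors_id", text_field="text", limit=-1):
    """Build a Solr query string that pivots on author, then on terms.

    Field names are hypothetical; limit=-1 asks for all terms per author.
    """
    params = {
        "q": "*:*",
        "rows": 0,            # only facets are needed, not the documents
        "wt": "json",
        "facet": "true",
        "facet.pivot": f"{author_field},{text_field}",
        "facet.limit": limit,
        "facet.mincount": 1,  # skip terms the author never used
    }
    return urlencode(params)


def author_term_vectors(response, author_field="authors_id", text_field="text"):
    """Turn a parsed JSON pivot response into {author: {term: doc_count}}."""
    key = f"{author_field},{text_field}"
    pivots = response["facet_counts"]["facet_pivot"][key]
    vectors = {}
    for author in pivots:
        # Each author entry carries a nested "pivot" list of term entries.
        terms = author.get("pivot", [])
        vectors[author["value"]] = {t["value"]: t["count"] for t in terms}
    return vectors
```

The query string would be appended to the collection's /select URL, and the
parsed JSON response passed to author_term_vectors.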

On Thu, Nov 27, 2014 at 3:01 PM, Norgorn <lsunnyd...@mail.ru> wrote:

> I'm working with social media data.
> We have blog posts in our index - text + authors_id.
> Now we need to cluster authors by their texts. We need to get a term
> vector not per document, but one vector per author (covering all of
> that author's documents).
>
> We can't fetch all the documents and then merge them, because it would
> take ages.
>
> And we can't just concatenate all posts into one mega-post per author
> (to have one document per author), because our data grows every day and
> we keep receiving new posts for existing authors.
>
> Can you suggest any solution?
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Terms-vector-for-multiple-documents-tp4171297.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>



-- 
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics

<http://www.griddynamics.com>
<mkhlud...@griddynamics.com>
