Maybe I misunderstood the question, but why do you need to create the
full-size vectors? Can't you just compute the cosine using the sparse
vectors?
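
Terms missing from either document contribute zero to the dot product, so
only the terms the two vectors share need to be visited. Here is a rough
sketch of what I mean, assuming you have already copied each term vector
into a Map<String, Double> of term to weight (raw tf, tf-idf, whatever you
use). The class name SparseCosine and its methods are made up for
illustration; this is plain Java, not Lucene API:

```java
import java.util.HashMap;
import java.util.Map;

public class SparseCosine {

    // Cosine similarity of two sparse vectors stored as term -> weight maps.
    // Terms absent from a map count as weight 0, so they are never materialized.
    public static double cosine(Map<String, Double> a, Map<String, Double> b) {
        // Walk the smaller map; only terms present in both add to the dot product.
        Map<String, Double> small = a.size() <= b.size() ? a : b;
        Map<String, Double> large = (small == a) ? b : a;

        double dot = 0.0;
        for (Map.Entry<String, Double> e : small.entrySet()) {
            Double w = large.get(e.getKey());
            if (w != null) {
                dot += e.getValue() * w;
            }
        }

        double normA = norm(a);
        double normB = norm(b);
        if (normA == 0.0 || normB == 0.0) {
            return 0.0; // convention chosen here for empty vectors
        }
        return dot / (normA * normB);
    }

    // Euclidean length, computed over the stored (non-zero) entries only.
    private static double norm(Map<String, Double> v) {
        double sum = 0.0;
        for (double w : v.values()) {
            sum += w * w;
        }
        return Math.sqrt(sum);
    }

    public static void main(String[] args) {
        Map<String, Double> profile = new HashMap<>();
        profile.put("lucene", 2.0);
        profile.put("solr", 1.0);

        Map<String, Double> doc = new HashMap<>();
        doc.put("solr", 3.0);
        doc.put("index", 4.0);

        System.out.println(cosine(profile, doc));
    }
}
```

If the user profile is compared against many documents, its norm could be
computed once and reused instead of being recomputed on every call.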

On Fri, Jan 5, 2018 at 10:09 PM, marco <marco.mutt...@gmail.com> wrote:
> At the moment I have another problem: is there an efficient way to calculate
> the cosine similarity between documents?
> I'm following (with the required modifications) THIS
> <https://stackoverflow.com/a/16991131> code that calculates the cosine
> similarity between 2 documents, but it doesn't look too efficient when it
> comes to repeating everything between the user profile and every document
> retrieved by the query.
> This is because the term vectors returned by the IndexSearcher function
> getTermVector(...) only contain the terms present in the associated
> document, forcing you to manually create the full vectors.
> Isn't it possible to obtain full-size vectors? (Or are they way
> too big?)
>
>
>
> --
> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
