Maybe I misunderstood the question, but why you need to create the full size vectors? can't you just compute the cosine using the sparse vectors?
On Fri, Jan 5, 2018 at 10:09 PM, marco <marco.mutt...@gmail.com> wrote: > At the moment I have another problem: is there an efficient way to calculate > the cosine similarity between documents? > I'm following (with the required modifications) THIS > <https://stackoverflow.com/a/16991131> code that calculates the cosine > similarity between 2 documents, but it doesn't look too efficient when it > comes to repeat everything between the user profile and every document > retreived by the query. > This because the termvectors returned by the IndexSearcher function > getTermVector(...) only contain the terms present in the associated > document, forcing you to create manually the full vectors. > Isn't there the possibility to obtain full size vectors? (or are they way > too big?) > > > > -- > Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html