On Nov 26, 2007 6:58 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
LSA (http://en.wikipedia.org/wiki/Latent_semantic_indexing) is
patented, so it is not likely to happen unless the authors donate the
patent to the ASF.
-Grant
There are many ways to catch a bird... LSA reduces to SVD on the TF
graph. I have had limited success using JAMA's SVD, which is PD. It's
pure java; for something serious you'd want to wrap the hard bits in
MKL/Accelerate.
A more interesting solr related question is where a very heavy
process like SVD would operate. You'd want to run the 'training' half
of it separate from a indexing or querying. It'd almost be like an
optimize. Is there any hook right now to give Solr a "command" like
<updateModels/> and map it to the class in the solrconfig? The
classify half of the SVD can happen at query or index time, very
quickly, I imagine that could even be a custom field type.