Consider the following use case.

Certain words are extracted from a document and indexed. The exact sentence
containing the word cannot be stored alongside the extracted word because
of the volume at which the documents grow; How can the index and, lets call
it doc servers be separated ?

An option is to store the sentences in MongoDB or a RDBMS. But there seems
to be a schema level design issue. Assuming 'word' to be a multivalued
field, how do we associate to it a reference to the corresponding entry in
the doc server.

May create (word_1, ref_1) tuples. Is there any other in-built feature ?

Any related project which separates index & doc servers ?

Thanks,
Sourajit

Reply via email to