Consider the following use case. Certain words are extracted from a document and indexed. The exact sentence containing the word cannot be stored alongside the extracted word because of the volume at which the documents grow; How can the index and, lets call it doc servers be separated ?
An option is to store the sentences in MongoDB or a RDBMS. But there seems to be a schema level design issue. Assuming 'word' to be a multivalued field, how do we associate to it a reference to the corresponding entry in the doc server. May create (word_1, ref_1) tuples. Is there any other in-built feature ? Any related project which separates index & doc servers ? Thanks, Sourajit