Hello,

I think a good place to start is http://www.lucenerevolution.org/2013/What-is-in-a-lucene-index
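In case a toy example helps before you watch the talk, here is a rough sketch of the inverted index idea the question mentions. It is not Lucene's code, and the class and method names (`ToyInvertedIndex`, `add`, `search`, `get`) are made up for illustration. It assumes a sorted term dictionary (a `TreeMap` here, not a hash table) that maps each term to a postings list of integer document numbers handed out in insertion order. Lucene's real on-disk structures are far more compact, and as far as I know its internal document numbers are per-segment integers that can change when segments merge, so they are not a replacement for your own id field.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Toy inverted index for illustration only; this is NOT how Lucene is implemented.
class ToyInvertedIndex {
    // Sorted term dictionary: term -> ascending list of internal document numbers.
    private final Map<String, List<Integer>> postings = new TreeMap<>();
    // Stored documents; the list position doubles as the internal document number.
    private final List<String> stored = new ArrayList<>();

    // Adds a document and returns the internal number assigned to it.
    int add(String text) {
        int docId = stored.size();
        stored.add(text);
        for (String term : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (term.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            List<Integer> list = postings.computeIfAbsent(term, t -> new ArrayList<>());
            // Record each document at most once per term.
            if (list.isEmpty() || list.get(list.size() - 1) != docId) {
                list.add(docId);
            }
        }
        return docId;
    }

    // Returns the document numbers containing the term, in ascending order.
    List<Integer> search(String term) {
        List<Integer> hits = postings.get(term.toLowerCase(Locale.ROOT));
        return hits == null ? new ArrayList<>() : new ArrayList<>(hits);
    }

    // Fetches the stored text for an internal document number.
    String get(int docId) {
        return stored.get(docId);
    }

    public static void main(String[] args) {
        ToyInvertedIndex index = new ToyInvertedIndex();
        index.add("Lucene is fast");
        index.add("Solr builds on search");
        index.add("Lucene stores postings");
        for (int docId : index.search("lucene")) {
            System.out.println(docId + ": " + index.get(docId));
        }
    }
}
```

A lookup here is one search in the sorted dictionary followed by a walk over the postings list, which is the general shape of a term query. Everything else (compression, skip data, segments, scoring) is left out.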

On Thu, Feb 13, 2014 at 12:56 PM, Harshvardhan Ojha <ojha.harshvard...@gmail.com> wrote:

> Hi All,
>
> I have a question regarding how Lucene retrieves documents.
> I know Lucene uses many files on disk to keep documents, each comprising
> fields, and uses many IR algorithms and an inverted index to match
> documents.
>
> My questions are:
> 1. How does Lucene store these documents in the file system and retrieve
> them so fast?
> 2. Does Lucene use any hashing algorithm to get docs in O(1)? If not,
> which data structure does Lucene use?
> 3. Apart from the id we provide at indexing time, is there any other
> unique identifier that Lucene assigns to its documents?
>
> I would appreciate it if someone could point me to the source file names
> so that I can study these algorithms in detail.
>
> Regards
> Harshvardhan Ojha

--
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics

<http://www.griddynamics.com>
<mkhlud...@griddynamics.com>