[ https://issues.apache.org/jira/browse/LUCENE-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
wuda updated LUCENE-10035: -------------------------- Description: Simple text codec add skip list data( include impact) to help understand index format,For debugging, curiosity, transparency only!! When term's docFreq greater than or equal to SimpleTextSkipWriter.BLOCK_SIZE (default value is 8), Simple text codec will write skip list, the *.pst (simple text term dictionary file)* file will looks like this {code:java} field title term args doc 2 freq 2 pos 7 pos 10 ## we omit docs for better view ...... doc 98 freq 2 pos 2 pos 6 skipList ? level 1 skipDoc 65 skipDocFP 949 impacts impact freq 1 norm 2 impact freq 2 norm 12 impact freq 3 norm 13 impacts_end ? level 0 skipDoc 17 skipDocFP 284 impacts impact freq 1 norm 2 impact freq 2 norm 12 impacts_end skipDoc 34 skipDocFP 624 impacts impact freq 1 norm 2 impact freq 2 norm 12 impact freq 3 norm 14 impacts_end skipDoc 65 skipDocFP 949 impacts impact freq 1 norm 2 impact freq 2 norm 12 impact freq 3 norm 13 impacts_end skipDoc 90 skipDocFP 1311 impacts impact freq 1 norm 2 impact freq 2 norm 10 impact freq 3 norm 13 impact freq 4 norm 14 impacts_end END checksum 00000000000829315543 {code} compare with previous,we add *skipList,level, skipDoc, skipDocFP, impacts, impact, freq, norm* nodes, at the same, simple text codec can support advanceShallow when search time. was: Simple text codec add skip list data( include impact) to help understand index format,For debugging, curiosity, transparency only!! When term's docFreq greater than or equal to SimpleTextSkipWriter.BLOCK_SIZE (default value is 8), Simple text codec will write skip list, the *.pst (simple text term dictionary file)* file will looks like this {code:java} field title term args doc 2 freq 2 pos 7 pos 10 ## we omit docs for better view ...... doc 98 freq 2 pos 2 pos 6 skipList ? level 1 skipDoc 65 skipDocFP 949 impacts impact freq 1 norm 2 impact freq 2 norm 12 impact freq 3 norm 13 impacts_end ? level 0 skipDoc 17 skipDocFP 284 impacts impact freq 1 norm 2 impact freq 2 norm 12 impacts_end skipDoc 34 skipDocFP 624 impacts impact freq 1 norm 2 impact freq 2 norm 12 impact freq 3 norm 14 impacts_end skipDoc 65 skipDocFP 949 impacts impact freq 1 norm 2 impact freq 2 norm 12 impact freq 3 norm 13 impacts_end skipDoc 90 skipDocFP 1311 impacts impact freq 1 norm 2 impact freq 2 norm 10 impact freq 3 norm 13 impact freq 4 norm 14 impacts_end END checksum 00000000000829315543 {code} compare with previous,we add *skipList,level, skipDoc, skipDocFP, impacts, impact, freq, norm* nodes, at the same, simple text codec can support advanceShallow when search time. > Simple text codec add multi level skip list data > -------------------------------------------------- > > Key: LUCENE-10035 > URL: https://issues.apache.org/jira/browse/LUCENE-10035 > Project: Lucene - Core > Issue Type: New Feature > Components: core/codecs > Affects Versions: main (9.0) > Reporter: wuda > Priority: Major > Labels: Impact, MultiLevelSkipList, SimpleTextCodec > Time Spent: 10m > Remaining Estimate: 0h > > Simple text codec add skip list data( include impact) to help understand > index format,For debugging, curiosity, transparency only!! When term's > docFreq greater than or equal to SimpleTextSkipWriter.BLOCK_SIZE (default > value is 8), Simple text codec will write skip list, the *.pst (simple text > term dictionary file)* file will looks like this > {code:java} > field title > term args > doc 2 > freq 2 > pos 7 > pos 10 > ## we omit docs for better view ...... > doc 98 > freq 2 > pos 2 > pos 6 > skipList > ? > level 1 > skipDoc 65 > skipDocFP 949 > impacts > impact > freq 1 > norm 2 > impact > freq 2 > norm 12 > impact > freq 3 > norm 13 > impacts_end > ? > level 0 > skipDoc 17 > skipDocFP 284 > impacts > impact > freq 1 > norm 2 > impact > freq 2 > norm 12 > impacts_end > skipDoc 34 > skipDocFP 624 > impacts > impact > freq 1 > norm 2 > impact > freq 2 > norm 12 > impact > freq 3 > norm 14 > impacts_end > skipDoc 65 > skipDocFP 949 > impacts > impact > freq 1 > norm 2 > impact > freq 2 > norm 12 > impact > freq 3 > norm 13 > impacts_end > skipDoc 90 > skipDocFP 1311 > impacts > impact > freq 1 > norm 2 > impact > freq 2 > norm 10 > impact > freq 3 > norm 13 > impact > freq 4 > norm 14 > impacts_end > END > checksum 00000000000829315543 > {code} > compare with previous,we add *skipList,level, skipDoc, skipDocFP, impacts, > impact, freq, norm* nodes, at the same, simple text codec can support > advanceShallow when search time. > > -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org