[ 
https://issues.apache.org/jira/browse/LUCENE-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wuda updated LUCENE-10035:
--------------------------
    Description: 
Simple text codec add skip list data( include impact) to help understand index 
format,For debugging, curiosity, transparency only!! When term's docFreq 
greater than or equal to SimpleTextSkipWriter.BLOCK_SIZE (default value is 8), 
Simple text codec will write skip list, the *.pst (simple text term dictionary 
file)* file will looks like this
{code:java}
field title
  term args
    doc 2
      freq 2
      pos 7
      pos 10
    ## we omit docs for better view ......
    doc 98
      freq 2
      pos 2
      pos 6
    skipList 
?
      level 1
        skipDoc 65
        skipDocFP 949
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 13
        impacts_end 
?
      level 0
        skipDoc 17
        skipDocFP 284
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
        impacts_end         
        skipDoc 34
        skipDocFP 624
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 14
        impacts_end         
        skipDoc 65
        skipDocFP 949
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 13
        impacts_end         
        skipDoc 90
        skipDocFP 1311
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 10
          impact 
            freq 3
            norm 13
          impact 
            freq 4
            norm 14
        impacts_end 
END
checksum 00000000000829315543

{code}
compare with previous,we add *skipList,level, skipDoc, skipDocFP, impacts, 
impact, freq, norm* nodes, at the same, simple text codec can support 
advanceShallow when search time.

 

 

  was:
Simple text codec add skip list data( include impact) to help understand index 
format,For debugging, curiosity, transparency only!! When term's docFreq 
greater than or equal to SimpleTextSkipWriter.BLOCK_SIZE (default value is 8), 
Simple text codec will write skip list, the *.pst (simple text term dictionary 
file)* file will looks like this
{code:java}
field title
  term args
    doc 2
      freq 2
      pos 7
      pos 10
    ## we omit docs for better view ......
    doc 98
      freq 2
      pos 2
      pos 6
    skipList 
?
      level 1
        skipDoc 65
        skipDocFP 949
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 13
        impacts_end 
?
      level 0
        skipDoc 17
        skipDocFP 284
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
        impacts_end         
        skipDoc 34
        skipDocFP 624
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 14
        impacts_end         
        skipDoc 65
        skipDocFP 949
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 12
          impact 
            freq 3
            norm 13
        impacts_end         
        skipDoc 90
        skipDocFP 1311
        impacts 
          impact 
            freq 1
            norm 2
          impact 
            freq 2
            norm 10
          impact 
            freq 3
            norm 13
          impact 
            freq 4
            norm 14
        impacts_end 
END
checksum 00000000000829315543

{code}
compare with previous,we add *skipList,level, skipDoc, skipDocFP, impacts, 
impact, freq, norm* nodes, at the same, simple text codec can support 
advanceShallow when search time.


> Simple text codec add  multi level skip list data 
> --------------------------------------------------
>
>                 Key: LUCENE-10035
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10035
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: core/codecs
>    Affects Versions: main (9.0)
>            Reporter: wuda
>            Priority: Major
>              Labels: Impact, MultiLevelSkipList, SimpleTextCodec
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Simple text codec add skip list data( include impact) to help understand 
> index format,For debugging, curiosity, transparency only!! When term's 
> docFreq greater than or equal to SimpleTextSkipWriter.BLOCK_SIZE (default 
> value is 8), Simple text codec will write skip list, the *.pst (simple text 
> term dictionary file)* file will looks like this
> {code:java}
> field title
>   term args
>     doc 2
>       freq 2
>       pos 7
>       pos 10
>     ## we omit docs for better view ......
>     doc 98
>       freq 2
>       pos 2
>       pos 6
>     skipList 
> ?
>       level 1
>         skipDoc 65
>         skipDocFP 949
>         impacts 
>           impact 
>             freq 1
>             norm 2
>           impact 
>             freq 2
>             norm 12
>           impact 
>             freq 3
>             norm 13
>         impacts_end 
> ?
>       level 0
>         skipDoc 17
>         skipDocFP 284
>         impacts 
>           impact 
>             freq 1
>             norm 2
>           impact 
>             freq 2
>             norm 12
>         impacts_end         
>         skipDoc 34
>         skipDocFP 624
>         impacts 
>           impact 
>             freq 1
>             norm 2
>           impact 
>             freq 2
>             norm 12
>           impact 
>             freq 3
>             norm 14
>         impacts_end         
>         skipDoc 65
>         skipDocFP 949
>         impacts 
>           impact 
>             freq 1
>             norm 2
>           impact 
>             freq 2
>             norm 12
>           impact 
>             freq 3
>             norm 13
>         impacts_end         
>         skipDoc 90
>         skipDocFP 1311
>         impacts 
>           impact 
>             freq 1
>             norm 2
>           impact 
>             freq 2
>             norm 10
>           impact 
>             freq 3
>             norm 13
>           impact 
>             freq 4
>             norm 14
>         impacts_end 
> END
> checksum 00000000000829315543
> {code}
> compare with previous,we add *skipList,level, skipDoc, skipDocFP, impacts, 
> impact, freq, norm* nodes, at the same, simple text codec can support 
> advanceShallow when search time.
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to