zacharymorn commented on pull request #128:
URL: https://github.com/apache/lucene/pull/128#issuecomment-851169333
Some test results with a corrupted index (the corrupted file is `_gx_Lucene90_0.dvd`):
### Full check
```
> Task :lucene:core:CheckIndex.main()
NOTE: testing will be more thorough if you run java with
'-ea:org.apache.lucene...', so assertions are enabled
Opening index @
/Users/xichen/IdeaProjects/benchmarks/indices/corrupted/index/
Checking index with async threadCount: 12
0.00% total deletions; 6647577 documents; 0 deletions
Segments file=segments_2 numSegments=15 version=9.0.0
id=59c6he3dhebad46x7proh30nq userData={userData=multi}
1 of 15: name=_32 maxDoc=1197893
version=9.0.0
id=59c6he3dhebad46x7proh2zhm
codec=Lucene90
compound=false
numFiles=17
size (MB)=2,531.843
diagnostics = {timestamp=1622100146526, lucene.version=9.0.0,
java.vm.version=11.0.9+11, java.version=11.0.9, mergeMaxNumSegments=-1,
os.version=10.15.5, java.vendor=AdoptOpenJDK, source=merge, os.arch=x86_64,
mergeFactor=10, java.runtime.version=11.0.9+11, os=Mac OS X}
no deletions
test: open reader.........OK [took 0.125 sec]
test: check integrity.....OK [took 20.451 sec]
test: check live docs.....OK [took 0.000 sec]
test: field infos.........OK [17 fields] [took 0.000 sec]
test: field norms.........OK [2 fields] [took 0.044 sec]
test: terms, freq, prox...OK [20065511 terms; 450728331 terms/docs
pairs; 1175837878 tokens] [took 109.702 sec]
test: stored fields.......OK [3593679 total field count; avg 3.0 fields
per doc] [took 0.967 sec]
test: term vectors........OK [0 total term vector count; avg 0.0
term/freq vector fields per doc] [took 0.000 sec]
test: docvalues...........OK [10 docvalues fields; 3 BINARY; 0 NUMERIC;
5 SORTED; 0 SORTED_NUMERIC; 2 SORTED_SET] [took 2.575 sec]
test: points..............OK [2 fields, 2395786 points] [took 0.204 sec]
test: vectors.............OK [0 fields, 0 vectors] [took 0.000 sec]
2 of 15: name=_65 maxDoc=1197893
version=9.0.0
id=59c6he3dhebad46x7proh2zqv
codec=Lucene90
compound=false
numFiles=17
size (MB)=1,539.981
diagnostics = {timestamp=1622100810971, lucene.version=9.0.0,
java.vm.version=11.0.9+11, java.version=11.0.9, mergeMaxNumSegments=-1,
os.version=10.15.5, java.vendor=AdoptOpenJDK, source=merge, os.arch=x86_64,
mergeFactor=10, java.runtime.version=11.0.9+11, os=Mac OS X}
no deletions
test: open reader.........OK [took 0.124 sec]
test: check integrity.....OK [took 13.612 sec]
test: check live docs.....OK [took 0.000 sec]
test: field infos.........OK [17 fields] [took 0.000 sec]
test: field norms.........OK [2 fields] [took 0.042 sec]
test: terms, freq, prox...OK [15042354 terms; 274837439 terms/docs
pairs; 686566591 tokens] [took 76.072 sec]
test: stored fields.......OK [3593679 total field count; avg 3.0 fields
per doc] [took 0.982 sec]
test: term vectors........OK [0 total term vector count; avg 0.0
term/freq vector fields per doc] [took 0.000 sec]
test: docvalues...........OK [10 docvalues fields; 3 BINARY; 0 NUMERIC;
5 SORTED; 0 SORTED_NUMERIC; 2 SORTED_SET] [took 2.351 sec]
test: points..............OK [2 fields, 2395786 points] [took 0.194 sec]
test: vectors.............OK [0 fields, 0 vectors] [took 0.000 sec]
...
...
10 of 15: name=_gx maxDoc=119789
version=9.0.0
id=59c6he3dhebad46x7proh30n7
codec=Lucene90
compound=false
numFiles=17
size (MB)=129.046
diagnostics = {timestamp=1622102767300, lucene.version=9.0.0,
java.vm.version=11.0.9+11, java.version=11.0.9, mergeMaxNumSegments=-1,
os.version=10.15.5, java.vendor=AdoptOpenJDK, source=merge, os.arch=x86_64,
mergeFactor=10, java.runtime.version=11.0.9+11, os=Mac OS X}
no deletions
test: open reader.........OK [took 0.125 sec]
test: check integrity.....FAILED
WARNING: exorciseIndex() would remove reference to this segment; full exception:
org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=87e2aa4 actual=7b3afcbd (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/Users/xichen/IdeaProjects/benchmarks/indices/corrupted/index/_gx_Lucene90_0.dvd")))
    at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:440)
    at org.apache.lucene.codecs.CodecUtil.checksumEntireFile(CodecUtil.java:614)
    at org.apache.lucene.codecs.lucene90.Lucene90DocValuesProducer.checkIntegrity(Lucene90DocValuesProducer.java:1656)
    at org.apache.lucene.codecs.perfield.PerFieldDocValuesFormat$FieldsReader.checkIntegrity(PerFieldDocValuesFormat.java:364)
    at org.apache.lucene.index.CodecReader.checkIntegrity(CodecReader.java:252)
    at org.apache.lucene.index.SegmentReader.checkIntegrity(SegmentReader.java:391)
    at org.apache.lucene.index.CheckIndex.testSegment(CheckIndex.java:913)
    at org.apache.lucene.index.CheckIndex.lambda$checkIndex$0(CheckIndex.java:742)
    at org.apache.lucene.index.CheckIndex.lambda$callableToSupplier$1(CheckIndex.java:842)
    at java.base/java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1700)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:834)
11 of 15: name=_gy maxDoc=11979
version=9.0.0
id=59c6he3dhebad46x7proh30na
codec=Lucene90
compound=false
numFiles=17
size (MB)=13.473
diagnostics = {os.version=10.15.5, java.runtime.version=11.0.9+11,
os.arch=x86_64, source=flush, lucene.version=9.0.0, java.vm.version=11.0.9+11,
java.version=11.0.9, timestamp=1622102778710, os=Mac OS X,
java.vendor=AdoptOpenJDK}
no deletions
test: open reader.........OK [took 0.124 sec]
test: check integrity.....OK [took 0.065 sec]
test: check live docs.....OK [took 0.000 sec]
test: field infos.........OK [17 fields] [took 0.000 sec]
test: field norms.........OK [2 fields] [took 0.005 sec]
test: terms, freq, prox...OK [295343 terms; 1930448 terms/docs pairs;
4662478 tokens] [took 1.269 sec]
test: stored fields.......OK [35937 total field count; avg 3.0 fields
per doc] [took 0.077 sec]
test: term vectors........OK [0 total term vector count; avg 0.0
term/freq vector fields per doc] [took 0.000 sec]
test: docvalues...........OK [10 docvalues fields; 3 BINARY; 0 NUMERIC;
5 SORTED; 0 SORTED_NUMERIC; 2 SORTED_SET] [took 0.130 sec]
test: points..............OK [2 fields, 23958 points] [took 0.014 sec]
test: vectors.............OK [0 fields, 0 vectors] [took 0.000 sec]
...
...
15 of 15: name=_h2 maxDoc=11248
version=9.0.0
id=59c6he3dhebad46x7proh30nm
codec=Lucene90
compound=false
numFiles=17
size (MB)=10.617
diagnostics = {os.version=10.15.5, java.runtime.version=11.0.9+11,
os.arch=x86_64, source=flush, lucene.version=9.0.0, java.vm.version=11.0.9+11,
java.version=11.0.9, timestamp=1622102791291, os=Mac OS X,
java.vendor=AdoptOpenJDK}
no deletions
test: open reader.........OK [took 0.014 sec]
test: check integrity.....OK [took 0.155 sec]
test: check live docs.....OK [took 0.000 sec]
test: field infos.........OK [17 fields] [took 0.000 sec]
test: field norms.........OK [2 fields] [took 0.001 sec]
test: terms, freq, prox...OK [253387 terms; 1570705 terms/docs pairs;
3390075 tokens] [took 0.703 sec]
test: stored fields.......OK [33744 total field count; avg 3.0 fields
per doc] [took 0.015 sec]
test: term vectors........OK [0 total term vector count; avg 0.0
term/freq vector fields per doc] [took 0.000 sec]
test: docvalues...........OK [10 docvalues fields; 3 BINARY; 0 NUMERIC;
5 SORTED; 0 SORTED_NUMERIC; 2 SORTED_SET] [took 0.074 sec]
test: points..............OK [2 fields, 22496 points] [took 0.004 sec]
test: vectors.............OK [0 fields, 0 vectors] [took 0.000 sec]
WARNING: 1 broken segments (containing 119789 documents) detected
Took 134.361 sec total.
WARNING: would write new segments file, and 119789 documents would be lost,
if -exorcise were specified
> Task :lucene:core:CheckIndex.main() FAILED
Execution failed for task ':lucene:core:CheckIndex.main()'.
> Process 'command
'/Library/Java/JavaVirtualMachines/adoptopenjdk-11.jdk/Contents/Home/bin/java''
finished with non-zero exit value 1
```
### Check with `-segment _gx` flag
```
> Task :lucene:core:CheckIndex.main() FAILED
NOTE: testing will be more thorough if you run java with
'-ea:org.apache.lucene...', so assertions are enabled
Opening index @
/Users/xichen/IdeaProjects/benchmarks/indices/corrupted/index/
Checking index with async threadCount: 12
0.00% total deletions; 6647577 documents; 0 deletions
Segments file=segments_2 numSegments=15 version=9.0.0
id=59c6he3dhebad46x7proh30nq userData={userData=multi}
Checking only these segments: _gx:
10 of 15: name=_gx maxDoc=119789
version=9.0.0
id=59c6he3dhebad46x7proh30n7
codec=Lucene90
compound=false
numFiles=17
size (MB)=129.046
diagnostics = {timestamp=1622102767300, lucene.version=9.0.0,
java.vm.version=11.0.9+11, java.version=11.0.9, mergeMaxNumSegments=-1,
os.version=10.15.5, java.vendor=AdoptOpenJDK, source=merge, os.arch=x86_64,
mergeFactor=10, java.runtime.version=11.0.9+11, os=Mac OS X}
no deletions
test: open reader.........OK [took 0.086 sec]
test: check integrity.....FAILED
WARNING: exorciseIndex() would remove reference to this segment; full exception:
org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=87e2aa4 actual=7b3afcbd (resource=BufferedChecksumIndexInput(MMapIndexInput(path="/Users/xichen/IdeaProjects/benchmarks/indices/corrupted/index/_gx_Lucene90_0.dvd")))
    at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:440)
    at org.apache.lucene.codecs.CodecUtil.checksumEntireFile(CodecUtil.java:614)
    at org.apache.lucene.codecs.lucene90.Lucene90DocValuesProducer.checkIntegrity(Lucene90DocValuesProducer.java:1656)
    at org.apache.lucene.codecs.perfield.PerFieldDocValuesFormat$FieldsReader.checkIntegrity(PerFieldDocValuesFormat.java:364)
    at org.apache.lucene.index.CodecReader.checkIntegrity(CodecReader.java:252)
    at org.apache.lucene.index.SegmentReader.checkIntegrity(SegmentReader.java:391)
    at org.apache.lucene.index.CheckIndex.testSegment(CheckIndex.java:913)
    at org.apache.lucene.index.CheckIndex.lambda$checkIndex$0(CheckIndex.java:742)
    at org.apache.lucene.index.CheckIndex.lambda$callableToSupplier$1(CheckIndex.java:842)
    at java.base/java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1700)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
    at java.base/java.lang.Thread.run(Thread.java:834)
WARNING: 1 broken segments (containing 119789 documents) detected
Took 0.734 sec total.
WARNING: would write new segments file, and 119789 documents would be lost,
if -exorcise were specified
Execution failed for task ':lucene:core:CheckIndex.main()'.
> Process 'command
'/Library/Java/JavaVirtualMachines/adoptopenjdk-11.jdk/Contents/Home/bin/java''
finished with non-zero exit value 1
```
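For context on the failure itself: both runs stop at `CodecUtil.checkFooter`, which reports that the checksum recorded for `_gx_Lucene90_0.dvd` (`expected=`) differs from the one recomputed while reading the file (`actual=`). Below is a minimal, stdlib-only sketch of that kind of comparison, assuming a CRC32 over the payload. `ChecksumSketch`, `checksum` and `verify` are hypothetical names for illustration and are not Lucene's API; the real footer layout (magic number, algorithm ID) is left out.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.CRC32;

// Hypothetical illustration only; not part of Lucene.
public class ChecksumSketch {

    // Compute a CRC32 over the whole payload, the way a checksumming
    // reader would while streaming through a file.
    static long checksum(byte[] data) {
        CRC32 crc = new CRC32();
        crc.update(data, 0, data.length);
        return crc.getValue();
    }

    // True when the recomputed checksum matches the one recorded at write time.
    static boolean verify(byte[] data, long expected) {
        return checksum(data) == expected;
    }

    public static void main(String[] args) {
        byte[] payload = "doc values payload".getBytes(StandardCharsets.UTF_8);
        long stored = checksum(payload); // what a writer would record in the footer

        System.out.println("intact:    " + verify(payload, stored));

        payload[3] ^= 0x01; // flip a single bit to simulate on-disk corruption
        System.out.println("corrupted: " + verify(payload, stored));
        System.out.println("expected=" + Long.toHexString(stored)
            + " actual=" + Long.toHexString(checksum(payload)));
    }
}
```

The first check passes and the second fails, since CRC32 detects any single-bit change. That is the same shape of mismatch as the `expected=87e2aa4 actual=7b3afcbd` line in the output above.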