https://bugs.kde.org/show_bug.cgi?id=491824
--- Comment #1 from Igor Kushnir <igor...@gmail.com> --- OBSERVED RESULT Step 2: QDEBUG : KEncodingProberTest::testProbe() Text: "Этот" QDEBUG : KEncodingProberTest::testProbe() state: 2 confidence: 0.99 encoding: "Shift_JIS" XFAIL : KEncodingProberTest::testProbe() KEncodingProber misdetects short UTF-8 text as Shift_JIS or gb18030 Loc: [kcodecs/autotests/kencodingprobertest.cpp(54)] QDEBUG : KEncodingProberTest::testProbe() state: 2 confidence: 0.277778 encoding: "gb18030" XFAIL : KEncodingProberTest::testProbe() KEncodingProber::reset() leaves behind earlier fed data, so this is detected as gb18030 now Loc: [kcodecs/autotests/kencodingprobertest.cpp(60)] QDEBUG : KEncodingProberTest::testProbe() state: 2 confidence: 0.444444 encoding: "gb18030" XFAIL : KEncodingProberTest::testProbe() KEncodingProber::reset() leaves behind earlier fed data, so the confidence is lower now Loc: [kcodecs/autotests/kencodingprobertest.cpp(66)] Step 4: same as step 2, except that "Shift_JIS" is replaced by "gb18030" at the end of the second line of the posted output. EXPECTED RESULT A. KEncodingProber detects the encoding of the short UTF-8 text as UTF-8. Or as some other encoding but with a confidence much lower than 0.99. B. The documented and actual behavior of KEncodingProber::reset() match. -- You are receiving this mail because: You are watching all bug changes.