https://bugs.kde.org/show_bug.cgi?id=491824

--- Comment #1 from Igor Kushnir <igor...@gmail.com> ---
OBSERVED RESULT
Step 2:
QDEBUG : KEncodingProberTest::testProbe() Text: "Этот"
QDEBUG : KEncodingProberTest::testProbe() state: 2 confidence: 0.99 encoding: "Shift_JIS"
XFAIL  : KEncodingProberTest::testProbe() KEncodingProber misdetects short UTF-8 text as Shift_JIS or gb18030
   Loc: [kcodecs/autotests/kencodingprobertest.cpp(54)]
QDEBUG : KEncodingProberTest::testProbe() state: 2 confidence: 0.277778 encoding: "gb18030"
XFAIL  : KEncodingProberTest::testProbe() KEncodingProber::reset() leaves behind earlier fed data, so this is detected as gb18030 now
   Loc: [kcodecs/autotests/kencodingprobertest.cpp(60)]
QDEBUG : KEncodingProberTest::testProbe() state: 2 confidence: 0.444444 encoding: "gb18030"
XFAIL  : KEncodingProberTest::testProbe() KEncodingProber::reset() leaves behind earlier fed data, so the confidence is lower now
   Loc: [kcodecs/autotests/kencodingprobertest.cpp(66)]

Step 4: same as step 2, except that "Shift_JIS" is replaced by "gb18030" at the
end of the second line of the posted output.

EXPECTED RESULT
A. KEncodingProber detects the encoding of the short UTF-8 text as UTF-8, or as
some other encoding but with a confidence much lower than 0.99.
B. The documented and actual behavior of KEncodingProber::reset() match.
