https://bugs.kde.org/show_bug.cgi?id=449076

            Bug ID: 449076
           Summary: BOM-less UTF8 cannot be detected
           Product: kdiff3
           Version: 1.9.4
          Platform: Microsoft Windows
                OS: Microsoft Windows
            Status: REPORTED
          Severity: normal
          Priority: NOR
         Component: application
          Assignee: reeves...@gmail.com
          Reporter: lapsap7+...@gmail.com
  Target Milestone: ---

Created attachment 145855
  --> https://bugs.kde.org/attachment.cgi?id=145855&action=edit
Observed result -- some gibberish characters

SUMMARY
By default the "Auto Detect Unicode" only works for UTF8 files with BOM. More
precision needs to be added in the Help for this part, telling people to either
manually change encoding or add a BOM.

PS: I haven't got the time to do test with UTF16 or UTF32 so I have no idea.

STEPS TO REPRODUCE
1. In Options > Regional Settings, make sure "Auto Detect Unicode" is checked.
2. Using files which are UTF8 but without BOM
3. Compare the files

OBSERVED RESULT
Characters which are outside 7-bit ASCII are displayed incorrectly.  Take a
look at my attached image (kdiff3-bomless-utf3-observed-result.png) in which
every ONE of those characters is displayed as TWO characters which is a sign
that UTF8 text files is not detected correctly.

EXPECTED RESULT
Correct characters are displayed.  This will be shown in my other attached
image (kdiff3-bomless-utf3-expected-result.png) IF we specifify UTF-8 instead
of relying on "Auto Detect Unicode" option.

SOFTWARE/OS VERSIONS
Windows: Windows 11 (but this is irrelevant, IMO)
KDE Frameworks Version: 5.88.0
Qt Version: 

ADDITIONAL INFORMATION
This bug was previously reported in:
https://sourceforge.net/p/kdiff3/discussion/197499/thread/78e8dcc2/?limit=25#0a95
and in:
https://sourceforge.net/p/kdiff3/bugs/197/

-- 
You are receiving this mail because:
You are watching all bug changes.

Reply via email to