https://bugs.kde.org/show_bug.cgi?id=449076
Bug ID: 449076 Summary: BOM-less UTF8 cannot be detected Product: kdiff3 Version: 1.9.4 Platform: Microsoft Windows OS: Microsoft Windows Status: REPORTED Severity: normal Priority: NOR Component: application Assignee: reeves...@gmail.com Reporter: lapsap7+...@gmail.com Target Milestone: --- Created attachment 145855 --> https://bugs.kde.org/attachment.cgi?id=145855&action=edit Observed result -- some gibberish characters SUMMARY By default the "Auto Detect Unicode" only works for UTF8 files with BOM. More precision needs to be added in the Help for this part, telling people to either manually change encoding or add a BOM. PS: I haven't got the time to do test with UTF16 or UTF32 so I have no idea. STEPS TO REPRODUCE 1. In Options > Regional Settings, make sure "Auto Detect Unicode" is checked. 2. Using files which are UTF8 but without BOM 3. Compare the files OBSERVED RESULT Characters which are outside 7-bit ASCII are displayed incorrectly. Take a look at my attached image (kdiff3-bomless-utf3-observed-result.png) in which every ONE of those characters is displayed as TWO characters which is a sign that UTF8 text files is not detected correctly. EXPECTED RESULT Correct characters are displayed. This will be shown in my other attached image (kdiff3-bomless-utf3-expected-result.png) IF we specifify UTF-8 instead of relying on "Auto Detect Unicode" option. SOFTWARE/OS VERSIONS Windows: Windows 11 (but this is irrelevant, IMO) KDE Frameworks Version: 5.88.0 Qt Version: ADDITIONAL INFORMATION This bug was previously reported in: https://sourceforge.net/p/kdiff3/discussion/197499/thread/78e8dcc2/?limit=25#0a95 and in: https://sourceforge.net/p/kdiff3/bugs/197/ -- You are receiving this mail because: You are watching all bug changes.