On 10/11/13 9:22 AM, Branko Čibej wrote:
> You'd have to extend Subversion's file type detection to detect UTF-16.
> See svn_io_detect_mimetype2 in line 3333 in this file:
>
> http://svn.apache.org/viewvc/subversion/trunk/subversion/libsvn_subr/io.c?view=markup
> Subversion currently only looks at the first 1k Bytes of a file. It may
> be enough to check that this initial part of the file contains only
> valid UTF-16 (BE or LE) codes.

Even if all we looked for were the BOM, it might be helpful enough. I suspect the development tools producing UTF-16 are including BOMs. Windows seems to be fond of including them; Notepad puts one even on UTF-8.
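
To make the BOM-only idea concrete, here is a rough sketch of what such a check might look like. The names detect_bom and bom_kind_t are made up for illustration and are not part of libsvn_subr; how this would be wired into svn_io_detect_mimetype2 is left open. It only inspects the first two or three bytes of the buffer that has already been read.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical result type for illustration; not a Subversion API. */
typedef enum {
  BOM_NONE,
  BOM_UTF8,
  BOM_UTF16_LE,
  BOM_UTF16_BE
} bom_kind_t;

/* Look for a byte order mark at the start of BUF (LEN bytes long).
 * Caveat: a UTF-32 LE file starts with FF FE 00 00 and would be
 * reported as UTF-16 LE here; this sketch does not try to tell
 * the two apart. */
static bom_kind_t
detect_bom(const unsigned char *buf, size_t len)
{
  assert(buf != NULL || len == 0);

  /* UTF-8 BOM: EF BB BF (what Notepad writes). */
  if (len >= 3 && buf[0] == 0xEF && buf[1] == 0xBB && buf[2] == 0xBF)
    return BOM_UTF8;

  /* UTF-16 little-endian BOM: FF FE. */
  if (len >= 2 && buf[0] == 0xFF && buf[1] == 0xFE)
    return BOM_UTF16_LE;

  /* UTF-16 big-endian BOM: FE FF. */
  if (len >= 2 && buf[0] == 0xFE && buf[1] == 0xFF)
    return BOM_UTF16_BE;

  return BOM_NONE;
}
```

A file without a BOM would fall through to BOM_NONE and be handled by whatever detection exists today, so this would only ever add cases, not change the current behavior for other files.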