Character encoding autodetection fails for APT source files
-----------------------------------------------------------

                 Key: DOXIA-278
                 URL: http://jira.codehaus.org/browse/DOXIA-278
             Project: Maven Doxia
          Issue Type: Bug
          Components: Module - Apt
    Affects Versions: 1.0-alpha-11
         Environment: Mac OS X 10.5.6, Java 1.6.0_07
            Reporter: Trevor Harmon
         Attachments: HelloWorld.zip

Doxia unnecessarily forces all APT source files to be encoded in ISO-8859-1. 
Files encoded in UTF-8 can have garbage characters as a result. Doxia should be 
able to autodetect the encoding of the APT file to prevent this problem, as it 
already does for XML (see DOXIA-133).

A test case is attached. It includes two APT source files, one encoded in 
ISO-8859-1 and another encoded in UTF-8. Both contain the copyright symbol. To 
reproduce the problem, simply run "mvn site" on the project and open the 
target/site/test-utf8.html and target/site/test-iso-8859-1.html. The file 
encoded with ISO-8859-1 should display the copyright symbol correctly, while 
the one encoded with UTF-8 contains a garbage character immediately before the 
symbol.


-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to