If my memory is correct, UTF-8 has been the default encoding per XML specification from a very early stage. If the XML parser is not defaulting to UTF-8 in absence of the encoding attribute, that means the XML parser has a bug, and the code should be corrected.
(I don't have an objection to add the encoding attribute for clarity, however.) -kuro > -----Original Message----- > From: Walter Underwood [mailto:[EMAIL PROTECTED] > Sent: Wednesday, May 09, 2007 4:33 PM > To: solr-user@lucene.apache.org > Subject: Re: Facet only support english? > > I didn't remember that requirement, so I looked it up. It was added > in XML 1.0 2nd edition. Originally, unspecified encodings were open > for auto-detection. > > Content type trumps encoding declarations, of course, per RFC 3023 > and allowed by the XML spec. > > wunder > > On 5/9/07 4:19 PM, "Mike Klaas" <[EMAIL PROTECTED]> wrote: > > > I thought that conformant parsers use UTF-8 as the default anyway: > > > > http://www.w3.org/TR/REC-xml/#charencoding > > > > -Mike > >