Re: charset encoding

2014-03-26 Thread Alexandre Rafalovitch
uation where you cannot control the header of the
>> request or modify the content itself to include charset information, or
>> there's some reason you would rather not take that route, there will be
>> another way with the next Solr release.
>>
>> https://issue

Re: charset encoding

2014-03-26 Thread Antoine LE FLOC'H
ira/browse/SOLR-5082
>
> Solr 4.5 will support an "ie" (input encoding) parameter for the update
> request so you can inform Solr what charset encoding to expect. The
> release process for Solr 4.5 has been started, it usually takes 2-3
> weeks to complete.
>
> Thanks,
> Shawn
>
>
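
For anyone wanting to try the "ie" parameter described in the quoted message, here is a rough client-side sketch in Python. It percent-encodes the parameter values as ISO-8859-1 and appends ie=ISO-8859-1 so the server is told which charset was used. The host, the /solr/update path, and the "title" parameter are placeholders I made up for illustration, and which handlers actually honor "ie" should be confirmed against SOLR-5082.

```python
from urllib.parse import urlencode


def build_request_url(base_url, params, charset="ISO-8859-1"):
    """Percent-encode params in `charset` and declare it via the ie parameter."""
    # urlencode() converts each value to bytes in `charset` before escaping,
    # so "ü" becomes %FC (ISO-8859-1) rather than %C3%BC (UTF-8).
    query = urlencode({**params, "ie": charset}, encoding=charset)
    return f"{base_url}?{query}"


# Hypothetical endpoint and parameter name, for illustration only.
url = build_request_url("http://localhost:8983/solr/update", {"title": "müller"})
print(url)
```

If the server ignores "ie" for a given request type, the value would still be decoded with the default charset, so it is worth checking the indexed result rather than trusting the URL alone.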

Re: charset encoding

2013-09-12 Thread Shawn Heisey
e's some reason you would rather not take that route, there will be another way with the next Solr release. https://issues.apache.org/jira/browse/SOLR-5082 Solr 4.5 will support an "ie" (input encoding) parameter for the update request so you can inform Solr what charset encodi

Re: charset encoding

2013-09-12 Thread Andreas Owen
it was the http-header, as soon as i forced an iso-8859-1 header it worked
On 12. Sep 2013, at 9:44 AM, Andreas Owen wrote:
> could it have something to do with the meta encoding tag being iso-8859-1 but
> the http-header tag being utf8, so firefox interprets it as utf8?
>
> On 12. Sep 2013, at 8:36 AM,
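
The mismatch described here is easy to reproduce outside Solr. A minimal sketch in Python, assuming nothing beyond the standard library: the same ISO-8859-1 bytes are decoded once as UTF-8 (what a client does when the HTTP header claims utf8) and once as ISO-8859-1 (what happens when the header matches the file). The sample string is just an illustration.

```python
# Sample text with the umlauts mentioned in the original question.
text = "ä, ö, ü"

# The page as stored on disk: ISO-8859-1 uses a single byte per umlaut.
raw = text.encode("iso-8859-1")

# Header says UTF-8: the single bytes are not valid UTF-8 sequences,
# so each umlaut is replaced by U+FFFD (the replacement character).
wrong = raw.decode("utf-8", errors="replace")

# Header matches the real encoding: the text round-trips unchanged.
right = raw.decode("iso-8859-1")

print(repr(wrong))
print(repr(right))
```

This is consistent with the fix reported above: once the header and the actual bytes agree, the characters come through intact.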

Re: charset encoding

2013-09-12 Thread Andreas Owen
could it have something to do with the meta encoding tag being iso-8859-1 but the http-header tag being utf8, so firefox interprets it as utf8?
On 12. Sep 2013, at 8:36 AM, Andreas Owen wrote:
> no jetty, and yes for tomcat i've seen a couple of answers
>
> On 12. Sep 2013, at 3:12 AM, Otis Gospodneti

Re: charset encoding

2013-09-11 Thread Andreas Owen
no, jetty. and yes, for tomcat i've seen a couple of answers
On 12. Sep 2013, at 3:12 AM, Otis Gospodnetic wrote:
> Using tomcat by any chance? The ML archive has the solution. May be on
> Wiki, too.
>
> Otis
> Solr & ElasticSearch Support
> http://sematext.com/
> On Sep 11, 2013 8:56 AM, "Andreas

Re: charset encoding

2013-09-11 Thread Otis Gospodnetic
Using tomcat by any chance? The ML archive has the solution. May be on Wiki, too.
Otis
Solr & ElasticSearch Support
http://sematext.com/
On Sep 11, 2013 8:56 AM, "Andreas Owen" wrote:
> i'm using solr 4.3.1 with tika to index html-pages. the html files are
> iso-8859-1 (ansi) encoded and the met

charset encoding

2013-09-11 Thread Andreas Owen
i'm using solr 4.3.1 with tika to index html-pages. the html files are iso-8859-1 (ansi) encoded and the meta tag "content-encoding" says so as well. the server-http-header says it's utf8 and firefox-webdeveloper agrees. when i index a page with special chars like ä,ö,ü solr outputs it completely forei