Re: SOLR - problems with non-english symbols when extracting HTML

2011-03-28 Thread kushti
Grijesh wrote:
> Try to send the HTML data wrapped in CDATA.

That doesn't work with
$content = "";
And my goal is not to avoid extraction, but to have no problems with non-English characters.

SOLR - problems with non-english symbols when extracting HTML

2011-03-25 Thread kushti
When I send plain UTF-8 text (non-English) to the index, everything is fine, but with HTML I get wrong characters in place of the non-ASCII symbols. So

$this->solr->extractContents($url, strip_tags($code), array("literal.url"=>$url,"fmap.content"=>"body"));

works well, but just $this->solr->extractContents($u