This is what i see in your original email...
>>> I am attempting to import documents to Solr from MySQL using DIH. One
>>> of the field contains the text - =E2=80=9CFuture of Mobile Value Added
>>> Service=s (VAS) in Australia=E2=80=9D .Notice the character =E2=80=9C
>>> and =E2=80=9D.
"E2 80
Mr Prakash,
On 27 September 2012 02:06, Pranav Prakash wrote:
>
> | Variable_name| Value |
> +--++
> | character_set_client | latin1 |
> | character_set_connection | latin1
The output of Show variables goes like this. I have verified with the hex
values and they are different in MySQL and Solr.
| Variable_name| Value |
+--++
| character_set_client | latin1 |
|
On 21 September 2012 11:19, Pranav Prakash wrote:
> I am seeing the garbage text in browser, Luke Index Toolbox and everywhere
> it is the same. My servlet container is Jetty which is the out-of-box one.
> Many other special chars are getting indexed and stored properly, only few
> characters cau
I looked at the HEX codes of the texts. The hex code in MySQL is different
from that which is stored in the index.
The hex code in index is longer than the hex code in MySQL, this leads me
to the fact that somewhere in between smething is messing up,
*Pranav Prakash*
"temet nosce"
On Fri, Sep
I am seeing the garbage text in browser, Luke Index Toolbox and everywhere
it is the same. My servlet container is Jetty which is the out-of-box one.
Many other special chars are getting indexed and stored properly, only few
characters causes pain.
*Pranav Prakash*
"temet nosce"
On Fri, Sep 14
Is your _browser_ set to handle the appropriate character set? Or whatever
you're using to inspect your data? How about your servlet container?
Best
Erick
On Mon, Sep 10, 2012 at 7:47 AM, Pranav Prakash wrote:
> Hi Folks,
>
> I am attempting to import documents to Solr from MySQL using DIH. On