Thank you. I will look into that.

> -----Original Message-----
> From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
> Sent: Sunday, February 09, 2014 9:35 AM
> To: solr-user@lucene.apache.org
> Subject: Re: Highlight results in Arabic are backword
> 
> You will most probably put your English and Arabic content into different
> fields. Mostly because you will want to apply different field type definitions
> to your English and Arabic text (tokenizers, etc).
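A minimal sketch of the one-field-per-language idea, written as client-side Python that prepares a document before it is sent to Solr. The field names (`author_ar`, `author_en`, `lang`, `dir`), the `RTL_LANGUAGES` set, and all helper names are assumptions made up for this illustration; none of them come from Solr or from the schema discussed in this thread.

```python
# Hypothetical helpers: none of these names are defined by Solr itself.
RTL_LANGUAGES = {"ar", "he", "fa", "ur"}  # assumed set of right-to-left language codes


def localized_field(base: str, lang: str) -> str:
    """Return a language-specific field name such as 'author_ar'."""
    return f"{base}_{lang}"


def text_direction(lang: str) -> str:
    """Return a value suitable for an HTML dir attribute for this language code."""
    return "rtl" if lang in RTL_LANGUAGES else "ltr"


def build_document(doc_id: str, author: str, lang: str) -> dict:
    """Build a dict with one language-specific author field plus lang/dir metadata."""
    return {
        "id": doc_id,
        "lang": lang,
        "dir": text_direction(lang),
        localized_field("author", lang): author,
    }
```

Storing the language and direction as ordinary fields means they come back with each result, and each language-specific field can be bound to its own field type in the schema.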
> 
> Also, I would search around the web for articles on multilingual approach to
> Solr, if you are doing some deliberate design now. There are some deeper
> issues. Some good questions are covered here:
> http://info.basistech.com/blog/bid/171842/Indexing-Strategies-for-Multilingual-Search-with-Solr-and-Rosette
> (even if it is talking about the commercial tool). There is also a series of
> 12 blog posts on dealing with Solr for CJK in the libraries.
> Your issues will be different, but there will be overlap.
> 
> Regards,
>    Alex.
> Personal website: http://www.outerthoughts.com/
> LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
> - Time is the quality of nature that keeps events from happening all at once.
> Lately, it doesn't seem to be working.  (Anonymous  - via GTD
> book)
> 
> 
> On Sun, Feb 9, 2014 at 12:56 PM, Fatima Issawi <issa...@qu.edu.qa> wrote:
> > Thank you both for responding.
> >
> > Is there a way to tell Solr to add those attributes to the field when it
> > returns results (e.g. language is Arabic or English, or direction is LTR
> > or RTL)?
> >
> > Right now I only have Arabic content indexed, but we plan to add English in
> > the near future. I don't want to have to re-do everything later if there is a
> > better way of designing this now.
> >
> > Regards,
> > Fatima
> >
> >> -----Original Message-----
> >> From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
> >> Sent: Friday, February 07, 2014 3:48 AM
> >> To: solr-user@lucene.apache.org
> >> Subject: Re: Highlight results in Arabic are backword
> >>
> >> Arabic is complex. Basically, don't trust anything you see until you
> >> put that content on the screen with the surrounding tag marked with
> >> attribute dir='rtl' (e.g. <p dir='rtl'>arabic test</p>).
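As a small illustration of that check, the sketch below wraps a piece of plain text in an element marked `dir='rtl'` so it can be dropped into a test page and viewed in a browser. The function name `wrap_rtl` is invented for this example. Note that it escapes the text, so a highlighted snippet containing `<em>` markers would need those handled separately.

```python
from html import escape


def wrap_rtl(text: str, tag: str = "p") -> str:
    """Escape plain text and wrap it in an element marked dir='rtl'."""
    return f"<{tag} dir='rtl'>{escape(text)}</{tag}>"


if __name__ == "__main__":
    # Write the result into an HTML file and open it in a browser to inspect it.
    print(wrap_rtl("د. فيشر السعر"))
```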
> >>
> >> Regards,
> >>    Alex.
> >> Personal website: http://www.outerthoughts.com/
> >> LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
> >> - Time is the quality of nature that keeps events from happening all at
> >> once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD
> >> book)
> >>
> >>
> >> On Thu, Feb 6, 2014 at 10:12 PM, Steve Rowe <sar...@gmail.com> wrote:
> >> > Hi Fatima,
> >> >
> >> > I don’t think there’s an actual problem, it just looks like it because
> >> > the program you’re using to look at the JSON makes a different choice
> >> > for laying out the highlighting results than it does for the field values.
> >> >
> >> > In fact, all the bytes are the same, and in the same order, for both the
> >> > “author” field text and the highlighting text, though some space
> >> > characters are ASCII space (U+0020) in one and non-breaking space
> >> > (U+00A0) in the other.
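One way to verify a claim like this without relying on how a viewer lays out the text is to compare code points directly. The following is a sketch only: the helper names are invented for this example, and the default `<em>` / `</em>` markers simply mirror the `hl.simple.pre` and `hl.simple.post` values shown in the query in this thread.

```python
def strip_markers(text: str, pre: str = "<em>", post: str = "</em>") -> str:
    """Remove the highlight markers so only the underlying text remains."""
    return text.replace(pre, "").replace(post, "")


def normalize_spaces(text: str) -> str:
    """Replace non-breaking spaces (U+00A0) with ASCII spaces (U+0020)."""
    return text.replace("\u00a0", "\u0020")


def diff_code_points(a: str, b: str) -> list:
    """List (index, code point in a, code point in b) wherever the strings differ.

    Only the overlapping prefix is compared if the lengths differ.
    """
    return [
        (i, f"U+{ord(x):04X}", f"U+{ord(y):04X}")
        for i, (x, y) in enumerate(zip(a, b))
        if x != y
    ]


def same_apart_from_spaces(field_value: str, highlight: str) -> bool:
    """True if the two texts match once markers and space variants are ignored."""
    return normalize_spaces(field_value) == normalize_spaces(strip_markers(highlight))
```

If `same_apart_from_spaces` returns true for the stored field value and the highlighting snippet, the word order in the data is identical and any visible difference comes from rendering.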
> >> >
> >> > By the way, I see the same thing as you in my email client (OS X
> >> > Mail.app).  I assume there is a rule shared by our programs about complex
> >> > layout like this, where right-to-left text is mixed with left-to-right
> >> > text, likely based on the proportion of each, that triggers a
> >> > left-to-right word sequencing instead of the expected right-to-left word
> >> > sequencing.
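For reference, one rule that renderers commonly fall back on when no direction is declared is the Unicode Bidirectional Algorithm's default, where the paragraph direction follows the first character that has a strong direction. Whether the programs mentioned in this thread apply exactly that rule is an assumption. The sketch below is a simplified version of the check (it ignores directional isolates and embeddings), and the function name `base_direction` is invented for this example.

```python
import unicodedata


def base_direction(text: str) -> str:
    """Guess paragraph direction from the first strongly directional character."""
    for ch in text:
        bidi_class = unicodedata.bidirectional(ch)
        if bidi_class == "L":  # strong left-to-right (e.g. Latin letters)
            return "ltr"
        if bidi_class in ("R", "AL"):  # strong right-to-left (Hebrew, Arabic letters)
            return "rtl"
    return "ltr"  # no strong character found; fall back to left-to-right
```

Under this rule a line that starts with a Latin key such as `"author":` is laid out left to right even when most of the line is Arabic, while the bare Arabic value on its own is laid out right to left.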
> >> >
> >> > Anyway, I pulled out the author field and highlighting texts into an HTML
> >> > document and viewed it in my browser (Safari), and both are laid out the
> >> > same (with the exception of the emphasis given the highlighted word):
> >> >
> >> > -----
> >> > <html>
> >> > <body>
> >> > <p>"author": "د. فيشر السعر",</p>
> >> > <p>"highlighting": { "1": { "author": [ "د. <em>فيشر</em> السعر" ] } }</p>
> >> > </body>
> >> > </html>
> >> > -----
> >> >
> >> > Steve
> >> >
> >> > On Feb 6, 2014, at 8:23 AM, Fatima Issawi <issa...@qu.edu.qa> wrote:
> >> >
> >> >> Hello,
> >> >>
> >> >> I am getting highlight results in Arabic, but the order of the words is
> >> >> backwards. Querying on that field gives me the correct result, though.
> >> >> Is there a setting I’m missing?
> >> >>
> >> >> An extract from an example query from my Solr Console is below:
> >> >>
> >> >> {
> >> >>  "responseHeader": {
> >> >>    "status": 0,
> >> >>    "QTime": 1,
> >> >>    "params": {
> >> >>      "indent": "true",
> >> >>      "q": "author:\"فيشر\"",
> >> >>      "_": "1391692704242",
> >> >>      "hl.simple.pre": "<em>",
> >> >>      "hl.simple.post": "</em>",
> >> >>      "hl.fl": "author",
> >> >>      "wt": "json",
> >> >>      "hl": "true"
> >> >>    }
> >> >>  },
> >> >>  "response": {
> >> >>    "numFound": 4,
> >> >>    "start": 0,
> >> >>    "docs": [
> >> >>      {
> >> >>        "pagenumber": 1,
> >> >>        "id": "1",
> >> >>        "author": "د. فيشر السعر",
> >> >>        "author_s": "د. فيشر السعر",
> >> >>        "collector": "فاطمة عيساوي",
> >> >>      },
> >> >>  "highlighting": {
> >> >>    "1": {
> >> >>      "author": [
> >> >>        "د. <em>فيشر</em> السعر"
> >> >>      ]
> >> >
