No, you'll have to index the Unicode version of the domain name. Nutch 1.x 
already handles this conversion for you. Alternatively, you could write a custom 
update processor for Solr and do the conversion there. It's quite simple: IDN is 
in the java.net package.
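
For illustration, here is a minimal sketch of the conversion using only
java.net.IDN from the JDK. The class and method names (IdnHostNormalizer,
toUnicodeHost, toAsciiHost) are hypothetical and not part of Solr or Nutch;
how you would call this from an update processor depends on your schema and
is not shown here.

```java
import java.net.IDN;

// Hypothetical helper for illustration; not part of Solr or Nutch.
class IdnHostNormalizer {

    // Converts an ASCII (Punycode) host name to its Unicode form.
    // IDN.toUnicode returns the input unchanged when a label cannot be decoded.
    static String toUnicodeHost(String asciiHost) {
        return IDN.toUnicode(asciiHost);
    }

    // Converts a Unicode host name to its ASCII (Punycode) form.
    // IDN.toASCII throws IllegalArgumentException on invalid input.
    static String toAsciiHost(String unicodeHost) {
        return IDN.toASCII(unicodeHost);
    }

    public static void main(String[] args) {
        // The host from this thread, in both directions.
        System.out.println(toUnicodeHost("xn--orba-zoa.com"));
        System.out.println(toAsciiHost("\u00e7orba.com"));
    }
}
```

Indexing the result of toUnicodeHost (for example in a separate field) should
let a query for the Unicode word match the host, assuming the field's analyzer
tokenizes it the way you expect.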
 
-----Original message-----
> From:Furkan KAMACI <furkankam...@gmail.com>
> Sent: Friday 19th July 2013 14:39
> To: solr-user@lucene.apache.org
> Subject: Re: IDNA Support For Solr
> 
> I mean this:
> 
> There is a web address: çorba.com <http://xn--orba-zoa.com>
> 
> However, its IDNA-encoded version is: xn--orba-zoa.com
> 
> You can check it here:
> http://www.whois.com.tr/?q=%C3%A7orba&sldtld=com
> 
> Let's assume that I've indexed a web page with the URL
> xn--orba-zoa.com and someone searches for the word çorba. Then I have to
> report a URL match for that search. However, because I've indexed the URL
> in its IDNA-encoded form, I will not be able to see that the URL includes
> the word çorba.
> 
> 
> 
> 2013/7/19 Markus Jelsma <markus.jel...@openindex.io>
> 
> > Hi - What kind of support would you expect Solr to provide? IDN is only
> > about conversion between Unicode in your address bar and ASCII in the DNS.
> >
> > -----Original message-----
> > > From:Furkan KAMACI <furkankam...@gmail.com>
> > > Sent: Friday 19th July 2013 11:09
> > > To: solr-user@lucene.apache.org
> > > Subject: IDNA Support For Solr
> > >
> > > Hi;
> > >
> > > Is there any support for IDNA at Solr? (IDNA:
> > > http://en.wikipedia.org/wiki/Internationalized_domain_name)
> > >
> >
> 
