On Jul 26, 2007, at 10:26 PM, Sundling, Paul wrote:
Are there any known Solr sites that are in Chinese or Japenese?
This might be the first mention of this project in the Solr
community, and I'm certainly not confident our server can handle the
load but here goes anyway :)
<http://blacklight.betech.virginia.edu/>
The bulk of the content, 3.8M documents, is not Chinese, but there
are 320 Tang dynasty poems indexed there with both English and
Chinese content. Click on the "Tang Dynasty Poems" on the top right
facet. You can search in Chinese, no problem too:
<http://blacklight.betech.virginia.edu/search?q=火+AND+水>
(hopefully that link will pass through e-mail ok)
Blacklight is an unsupported demo of library data + Solr + Ruby on
Rails. The library data comes from 3 different sources:
* MARC data from our integrated library system, converted to UTF8 -
there are non-English words in some of this data (tinker with the
language facet to stumble on Russian and other stuff)
* TEI data sample from our "Digital Library"
* HTML scrapped Tang dynasty poems
Erik