Just to let everybody know: I used DIH + template (without Tika and Solr Cell; I really don't understand that part of the Reference Guide) to achieve what I want. I still need to test it against more varied forms of HTML source.
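For anyone finding this thread later, here is a minimal sketch of what such a DIH setup might look like. It assumes that "template" means DIH's TemplateTransformer and that the HTML is stripped with HTMLStripTransformer; neither is confirmed above. The database name, table, column names, credentials, and Solr field names are all hypothetical placeholders and would have to match the real MySQL table and schema.xml.

```xml
<!-- data-config.xml: a sketch only; every name below is a placeholder -->
<dataConfig>
  <!-- batchSize="-1" asks the MySQL JDBC driver to stream rows instead of
       loading the whole result set into memory, which matters for very large tables -->
  <dataSource type="JdbcDataSource"
              driver="com.mysql.jdbc.Driver"
              url="jdbc:mysql://localhost:3306/blogdb"
              user="solr_reader"
              password="CHANGE_ME"
              batchSize="-1"/>
  <document>
    <!-- Transformers run in the order listed -->
    <entity name="article"
            transformer="HTMLStripTransformer,TemplateTransformer"
            query="SELECT id, title, body_html FROM blog_articles">
      <!-- column = MySQL column, name = Solr field declared in schema.xml -->
      <field column="id" name="id"/>
      <field column="title" name="title"/>
      <!-- stripHTML="true" removes the markup before the text is indexed -->
      <field column="body_html" name="content" stripHTML="true"/>
      <!-- TemplateTransformer builds a new field value from other columns -->
      <field column="source" template="mysql-blog-${article.id}"/>
    </entity>
  </document>
</dataConfig>
```

Each `<field>` element is where the MySQL-to-Solr mapping happens, so no Tika or Solr Cell is involved in this variant. The target fields (`content` and `source` here) still need to be declared in schema.xml.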
Scott Chu, scott....@udngroup.com
2016/5/24 (Tue)

P.S. There is a great deal of extensive, worthwhile material in Solr. If the project team could provide some kind of "dictionary" of it, that would be a "Santa Claus" for us Solr users. Ha! Just a Christmas wish. Sigh, I know it's not very feasible. I would really like to study the features one after another and learn them all. However, Internet IT moves too fast to leave time to digest all of the great stuff in Solr.

----- Original Message -----
From: scott.chu
To: solr-user
CC:
Date: 2016/5/21 (Sat) 03:39
Subject: Re: Import html data in mysql and map schemas using only SolrCELL+TIKA+DIH [scottchu]

For this project, I intend to use Solr 5.5 or Solr 6. I know how to modify the config to go back to ClassicIndexSchemaFactory, i.e. a manual schema.xml.

Scott Chu, scott....@udngroup.com
2016/5/21 (Sat)

----- Original Message -----
From: Siddhartha Singh Sandhu
To: solr-user ; scott.chu
CC:
Date: 2016/5/21 (Sat) 03:33
Subject: Re: Import html data in mysql and map schemas using only SolrCELL+TIKA+DIH [scottchu]

You will have to configure your schema.xml in Solr. What version are you using?

On Fri, May 20, 2016 at 2:17 AM, scott.chu <scott....@udngroup.com> wrote:
>
> I have a mysql table with over 300M blog articles. The records are in html
> format. Is it possible to import these records using only Solr
> CELL+TIKA+DIH into a Solr instance with a schema? I mean, when importing, can I map
> the schema in mysql to the schema in Solr?
>
> scott.chu, scott....@udngroup.com
> 2016/5/20 (Fri)
>

-----
No virus found in this message.
Checked by AVG - www.avg.com
Version: 2015.0.6201 / Virus Database: 4568/12265 - Release Date: 05/20/16