Hi Team, 

I am new to solr. 
I am trying to index html file as text file. 
http://localhost:9050/solr-4.5.1/testSolr/update/extract?stream.file=C:\Basha/47363.txt&commit=true&literal.id=$i
 
Using above query i indexed successfully. 

when i used 
http://localhost:9050/solr-4.5.1/testSolr/select?q=*%3A*&wt=json&indent=true to 
retrieve 
all tags are grouped 
{
        " id ": "1" ,
        " attr_contentrobots ": [ "index,follow" ] ,
        " attr_contentDescription ": [ "Pirtek UK are Europe's Number One 
provider of fluid transfer solutions and mobile hose replacement." ] ,
        " attr_contentstream_source_info ": [ 
"file:/C:/Basha/Instance/testSolr/47346.txt" ] ,
        " author ": "Zebedee Creations, t...@zebedeecreations.com" ,
        " author_s ": "Zebedee Creations, t...@zebedeecreations.com" ,
        " text ": [ "Zebedee Creations, t...@zebedeecreations.com" , "Pirtek UK 
: Europe's Number One provider of fluid transfer solutions. Hose replacement, 
mobile hose replacement." ] ,
        " title ": [ "Pirtek UK : Europe's Number One provider of fluid 
transfer solutions. Hose replacement, mobile hose replacement." ] 

and so on... 
html are removed 
but my requirement is i want to hold html tags as it is.(i dont wnat to remove 
html tags) 

</li> 

<li class="spacer">|</li> 

<li><a href="info.php?p=7">CONTACT</a></li> 

</ul> 

</div> 



<div id="content"> 





<div id="btns"> 

<ul> 

<li id="above_btn"></li> 

<li class="btn"><a href="info.php?p=8">Terms & Conditions</a></li> 



<li id="link_btn"><a href="links.php">Links</a></li> 

</ul> 

</div> 



<div id="info"> 

<h2>ABOUT US</h2> 

please help me how to store html file with tags in solr. 
Thanks in advance. 

with regards 
Jiyas 

Problems are only opportunities with thorns on them. 

Reply via email to