If indexing while searching is causing problems, one way to reduce the impact is to index on a master instance and then use the replication scripts to sync the index to a slave instance, where all of your searches happen.
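
To make the master/slave split concrete, here is a minimal sketch of how a client could route its requests: updates go only to the master, queries go only to the slave. The host names, port and the helper names `update_url` and `search_url` are assumptions for illustration; only the standard `/update` and `/select` paths come from Solr itself. Replication between the two instances still has to be set up separately.

```python
from urllib.parse import urlencode

# Hypothetical hosts: substitute your own master and slave instances.
MASTER_URL = "http://solr-master.example.com:8080/solr"
SLAVE_URL = "http://solr-slave.example.com:8080/solr"


def update_url(commit=False):
    """Build the URL for posting documents; indexing always targets the master."""
    url = MASTER_URL + "/update"
    if commit:
        url += "?" + urlencode({"commit": "true"})
    return url


def search_url(query, rows=10):
    """Build the URL for a search request; queries always target the slave."""
    params = urlencode({"q": query, "rows": rows})
    return SLAVE_URL + "/select?" + params
```

With this split, heavy indexing load on the master does not compete for CPU with the searches served by the slave.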

If you are specifically seeing high CPU when indexing HTML, that's probably because the HTML analyzers have to do a lot of complex work to strip out the HTML. Another option might be to parse that HTML on the client side before sending it to Solr.

: I find index html will make tomcat obtain cpu 100% . It make seach become
: slow.
:
: So how to balance index and search.
:
: web i use apache+php
: solr i use tomcat 6+java1.6

-Hoss
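
As a rough illustration of the client-side parsing option, the sketch below strips markup with Python's standard `html.parser` module so that only plain text is sent to Solr. The class name `TextExtractor` and the function `html_to_text` are hypothetical, and the original setup used PHP on the client, so treat this as one possible approach under those assumptions rather than the recommended implementation.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if self.skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Return the text content of an HTML string with whitespace collapsed.

    Text chunks are joined with a space, so a word split by inline tags
    (e.g. "wel<b>come</b>") becomes two tokens; acceptable for a sketch.
    """
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())
```

The resulting plain text can then go into an ordinary text field, so Solr no longer has to strip HTML at index time.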