> Yeah great idea, thanks. Does anyone know if there is code out there > that > will do this sort of thing? >
Perhaps a much simpler option would be to use this: http://lucene.apache.org/solr/api/org/apache/solr/analysis/PatternReplaceFilterFactory.html with a regex of "<[^>]*>" or something like that - I'm no regex expert. Of course it could get tricky to handle escaped characters and the like, but it may be a good enough poor man's solution. -Ken