You can map the metadata attributes to an ignored field. Alternatively, override SolrContentHandler's newDocument() method to skip adding them. Come to think of it, I'll put up a quick patch that breaks that method out a bit more and makes it easier to override. Longer term, a patch that adds an option to exclude metadata would be good.
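
For the first option, here is a minimal sketch of what the mapping could look like. It assumes a setup along the lines of the stock example configuration: the handler path /update/extract, the content field name "text", and the "ignored" field type and "ignored_*" dynamic field are assumptions and may be named differently in your schema.

```xml
<!-- solrconfig.xml: send any extracted field the schema does not define to ignored_* -->
<requestHandler name="/update/extract" class="solr.extraction.ExtractingRequestHandler">
  <lst name="defaults">
    <!-- body text goes to the (assumed) "text" field -->
    <str name="fmap.content">text</str>
    <str name="lowernames">true</str>
    <!-- prefix applied to fields that do not match a schema field -->
    <str name="uprefix">ignored_</str>
  </lst>
</requestHandler>

<!-- schema.xml: a field type that is neither indexed nor stored -->
<fieldtype name="ignored" stored="false" indexed="false" multiValued="true" class="solr.StrField"/>
<dynamicField name="ignored_*" type="ignored" multiValued="true"/>
```

Note that uprefix only affects fields the schema does not already define. If a metadata name happens to match a real field in your schema (title or author, say), it would still be indexed unless you also map it explicitly, e.g. with fmap.author=ignored_author.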
On May 9, 2011, at 9:47 AM, Tod wrote:

> I'm indexing content from a CMS' database of metadata. The client would
> prefer that Solr exclude the properties (metadata) of any documents being
> indexed. Is there a way to tell Tika to only index a document's text and not
> its properties?
>
> Thanks - Tod

--------------------------
Grant Ingersoll
http://www.lucenerevolution.org