bq: How do I get a list of all valid field names based on the file type bq: You don't. At least I've never found any. Plus various document formats will allow custom meta-data fields so there's no definitive list.
It would be trivial to add field counts per mime to tika-eval. If you're interested in this, please open a ticket on Tika's JIRA.