Hi Jaya,
Text extraction is a step that happens before you put data into Solr. Say you have PDF
or DOC documents: you extract the text (minus unnecessary
formatting details etc.) and store it in Solr. Later you can query it, as you
said. I have not worked in the extraction area, but look at this for an idea:
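As a rough illustration of the extract-then-store flow described above (only a sketch, not the resource being pointed to): the extraction step here is a stand-in that just normalizes whitespace in plain text, the field names "id" and "content_txt" are assumptions about the schema, and the update URL in the comment is a hypothetical local core named "memos".

```python
import json
import re
import urllib.request


def extract_text(raw: str) -> str:
    """Stand-in extraction step: collapse runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", raw).strip()


def build_solr_doc(doc_id: str, raw: str) -> dict:
    # "id" and "content_txt" are assumed field names; adjust them to your schema.
    return {"id": doc_id, "content_txt": extract_text(raw)}


def post_to_solr(docs: list, update_url: str) -> None:
    """POST the documents as a JSON array to a Solr update URL.

    update_url is an assumption, for example
    http://localhost:8983/solr/memos/update?commit=true
    """
    req = urllib.request.Request(
        update_url,
        data=json.dumps(docs).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        resp.read()
```

Building the document is kept separate from sending it, so the extraction part can be checked without a running Solr instance. For real PDF or DOC files the stand-in extract_text would have to be replaced by a proper extraction tool.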
Hi:
I am trying to ingest a few memos. They do not have any standard format (JSON,
XML, etc.), just plain text, but the memos all follow some template.
What I would like to do after ingestion is extract keywords and some values
around it. So say for instance if the text contains the k