Hi,

How do I parse the JSON response from the Solr Term Vector Component?

I got the following JSON structure in the response when testing the Solr
5.3.0 tvComponent:

{'responseHeader': {'status': 0, 'QTime': 4}, 'response': {'docs':
[{'resourcename': 'XXX.txt', 'id': 'XXX.txt', '_version_':
1511851008560463872, 'content_type': ['text/plain; charset=windows-1252'],
'content': 'XXX'}], 'numFound': 50, 'start': 0}, 'termVectors':
['uniqueKeyFieldName', 'id', 'warnings', ['noPayloads', ['content']],
*'XXX.txt',
['uniqueKey', 'XXX.txt', 'content', ['TXX', ['tf', 1, 'positions',
['position', 4], 'offsets', ['start', 81, 'end', 85], 'df', 8, 'tf-idf',
0.125], 'TXX', ['tf', 1, 'positions', ['position', 4, 'position', 10],
'offsets', ['start', 81, 'end', 92, 'start', 105, 'end', 116], 'df', 2,
'tf-idf', 0.5], ...]*}

It requires an extra transformation to extract the term vector information
(the bold part, marked with asterisks above) correctly, because the keys and
values are interleaved in flat lists.
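
For reference, here is a minimal Python sketch of the kind of transformation I mean. It assumes the termVectors value is always a flat list alternating key and value, as in the output above, and that repeated keys ('position', 'start'/'end', and the masked term names) must be preserved, so terms are kept as a list of pairs rather than a dict. The helper names pairs, parse_stats and parse_term_vectors are my own and are not part of Solr or any client library.

```python
def pairs(flat):
    """Turn a flat [k1, v1, k2, v2, ...] list into a list of (key, value) tuples."""
    return list(zip(flat[0::2], flat[1::2]))


def parse_stats(stats):
    """Convert one term's flat stats list into a dict."""
    out = {}
    for name, value in pairs(stats):
        if name == "positions":
            # ['position', 4, 'position', 10] -> [4, 10]
            out[name] = [pos for _, pos in pairs(value)]
        elif name == "offsets":
            # ['start', 81, 'end', 92, ...] -> [(81, 92), ...]
            numbers = [n for _, n in pairs(value)]
            out[name] = list(zip(numbers[0::2], numbers[1::2]))
        else:
            out[name] = value
    return out


def parse_term_vectors(term_vectors):
    """Return {doc_id: {field: [(term, stats_dict), ...]}}."""
    docs = {}
    for key, value in pairs(term_vectors):
        # Assumption: header entries are 'uniqueKeyFieldName' (string value)
        # and 'warnings'; every other list-valued entry is a document.
        if key in ("uniqueKeyFieldName", "warnings") or not isinstance(value, list):
            continue
        fields = {}
        for field, terms in pairs(value):
            if field == "uniqueKey":
                continue
            fields[field] = [(term, parse_stats(s)) for term, s in pairs(terms)]
        docs[key] = fields
    return docs


# The termVectors part of the response above, without the elided tail.
sample = [
    "uniqueKeyFieldName", "id",
    "warnings", ["noPayloads", ["content"]],
    "XXX.txt", [
        "uniqueKey", "XXX.txt",
        "content", [
            "TXX", ["tf", 1, "positions", ["position", 4],
                    "offsets", ["start", 81, "end", 85],
                    "df", 8, "tf-idf", 0.125],
            "TXX", ["tf", 1, "positions", ["position", 4, "position", 10],
                    "offsets", ["start", 81, "end", 92, "start", 105, "end", 116],
                    "df", 2, "tf-idf", 0.5],
        ],
    ],
]

parsed = parse_term_vectors(sample)
```

With this, parsed['XXX.txt']['content'] is a list of (term, stats) tuples, and each stats dict has plain lists for 'positions' and 'offsets'. A document whose id happens to be 'warnings' would be skipped by this sketch.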

Is this correct? Why does Solr make the term vector information in the JSON
response so difficult to extract on the client side? Why does it encode the
data as lists rather than dictionaries?

Is there any Python client available that can extract the JSON result?

Thanks for the clarification,
Jerry
