Hi All,

I have a file in HDFS that spans many blocks. Say the file contains the
words W1, W2, W3, ..., Wn.

I want to find the edit distance between all pairs of words. Is this
possible in MapReduce?

For example, say the file has 2 blocks, and I have an edit distance function
that returns a value between 0 and 1 (0 means two words are very close to
each other). Say the edit distance between an even-numbered word and an
odd-numbered word is 1.

Block 1 has the words W1, W2, W3, W4
Block 2 has the words W1, W2, W5, W6
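
To make the expected result concrete, here is a minimal single-machine
sketch of the output I am after. It is not a MapReduce solution. The names
toy_distance and all_pairs are made up for illustration, and treating two
words of the same parity as distance 0 is my assumption for this toy case.

```python
from itertools import combinations


def toy_distance(a, b):
    # Toy stand-in for the real edit distance function (assumption):
    # words are named "W<k>"; an even/odd pair scores 1, any other pair 0.
    return 1.0 if int(a[1:]) % 2 != int(b[1:]) % 2 else 0.0


def all_pairs(blocks, distance):
    # Collect the distinct words across all blocks, since a word such as
    # W1 can appear in more than one block.
    words = sorted({w for block in blocks for w in block})
    # Score every unordered pair of distinct words exactly once.
    return {(a, b): distance(a, b) for a, b in combinations(words, 2)}


blocks = [["W1", "W2", "W3", "W4"], ["W1", "W2", "W5", "W6"]]
pairs = all_pairs(blocks, toy_distance)
```

With the two blocks above there are 6 distinct words, so pairs holds
6 * 5 / 2 = 15 entries, including pairs such as (W3, W5) whose words live in
different blocks.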

Is there a way to find the edit distance for all pairs? If so, how?

Thanks,
Praveen
