On 11/02/2019 10:10, Alan Gauld via Tutor wrote:
>> def minhash(s, prime=4294967311):
>> vec = [float('inf') for i in range(N)]
>>
>> for val in s:
>> if not isinstance(val, int): val = hash(val)
>>
>> for perm_idx, perm_vals in enumerate(perms):
>> a, b = perm_vals
>> outp
On 11/02/2019 09:13, lokesh kumar wrote:
> i want to make a code to run few DNA seg. so that i will be able to find
> similarity in them. file are in million as well as seq. are large so i
> tried developing program but fails in it i think minhash and lsh can able
> to solve my problem.
Bear in m
Hi There,
i want to make a code to run few DNA seg. so that i will be able to find
similarity in them. file are in million as well as seq. are large so i
tried developing program but fails in it i think minhash and lsh can able
to solve my problem.
i need kind of program that will be easy to handle