Re: [Tutor] regarding minhash and lsh

2019-02-11 Thread Alan Gauld via Tutor
On 11/02/2019 10:10, Alan Gauld via Tutor wrote: >> def minhash(s, prime=4294967311): >> vec = [float('inf') for i in range(N)] >> >> for val in s: >> if not isinstance(val, int): val = hash(val) >> >> for perm_idx, perm_vals in enumerate(perms): >> a, b = perm_vals >> outp

Re: [Tutor] regarding minhash and lsh

2019-02-11 Thread Alan Gauld via Tutor
On 11/02/2019 09:13, lokesh kumar wrote: > i want to make a code to run few DNA seg. so that i will be able to find > similarity in them. file are in million as well as seq. are large so i > tried developing program but fails in it i think minhash and lsh can able > to solve my problem. Bear in m

[Tutor] regarding minhash and lsh

2019-02-11 Thread lokesh kumar
Hi There, i want to make a code to run few DNA seg. so that i will be able to find similarity in them. file are in million as well as seq. are large so i tried developing program but fails in it i think minhash and lsh can able to solve my problem. i need kind of program that will be easy to handle