[Numpy-discussion] record arrays and vectorizing

Moroney, Catherine M (388D) Wed, 02 May 2012 11:07:18 -0700

Hello,

Can somebody give me some hints as to how to code up this function
in pure python, rather than dropping down to Fortran?


I will want to compare a 7-element vector (called "element") to a large list of 
similarly-dimensioned
vectors (called "target", and pick out the vector in "target" that is the 
closest to "element"
(determined by minimizing the Euclidean distance).  

For instance, in (slow) brute force form it would look like:

element = numpy.array([1, 2, 3, 4, 5, 6, 7])
target  = numpy.array(range(0, 49)).reshape(7,7)*0.1

min_length = 9999.0
min_index  = 
for i in xrange(0, 7):
   distance = (element-target)**2
   distance = numpy.sqrt(distance.sum())
   if (distance < min_length):
      min_length = distance
      min_index  = i

Now of course, the actual problem will be of a much larger scale.  I will have
an array of elements, and a large number of potential targets.  

I was thinking of having element be an array where each element itself is
a numpy.ndarray, and then vectorizing the code above so as an output I would
have an array of the "min_index" and "min_length" values.  

I can get the following simple test to work so I may be on the right track:

import numpy

dtype = [("x", numpy.ndarray)]

def single(data):
    return data[0].min()

multiple = numpy.vectorize(single)

if __name__ == "__main__":

    a = numpy.arange(0, 16).reshape(4,4)
    b = numpy.recarray((4), dtype=dtype)
    for i in xrange(0, b.shape[0]):
        b[i]["x"] = a[i,:]

    print a
    print b

    x = multiple(b)
    print x
  
What is the best way of constructing "b" from "a"?  I tried b = 
numpy.recarray((4), dtype=dtype, buf=a)
but I get a segmentation fault when I try to print b.

Is there a way to perform this larger task efficiently with record arrays and 
vectorization, or
am I off on the wrong track completely?  How can I do this efficiently without 
dropping
down to Fortran?

Thanks for any advice,

Catherine
_______________________________________________
NumPy-Discussion mailing list
[email protected]
http://mail.scipy.org/mailman/listinfo/numpy-discussion

[Numpy-discussion] record arrays and vectorizing

Reply via email to