Re: [Numpy-discussion] Using numpy on hadoop streaming: ImportError: cannot import name multiarray

2015-02-11 Thread Kartik Kumar Perisetla
hin core package only, but it is still not found. Can anyone throw some light on it? Thanks! Kartik On Wed, Feb 11, 2015 at 7:17 AM, Daπid wrote: > On 11 February 2015 at 08:06, Kartik Kumar Perisetla > wrote: > > Thanks David. But do I need to install virtualenv on every node in had

Re: [Numpy-discussion] Using numpy on hadoop streaming: ImportError: cannot import name multiarray

2015-02-10 Thread Kartik Kumar Perisetla
: > On 11 February 2015 at 03:38, Kartik Kumar Perisetla > wrote: > > Also, I don't have root access, thus can't install numpy or any other > > package on the cluster > > You can create a virtualenv, and install packages on it without > needing root access. To minimi
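
A minimal sketch of the suggestion quoted above (creating an isolated environment in a user-writable directory, so no root access is needed). It assumes Python 3 and uses the standard-library venv module as a stand-in for the virtualenv tool named in the message; the function name create_user_env and the directory name numpy-env are hypothetical, chosen only for illustration.

```python
import os
import tempfile
import venv


def create_user_env(env_dir):
    """Create a virtual environment under env_dir and return the path of its config file.

    Everything is written below env_dir, so root access is not required
    as long as the directory is user-writable.
    """
    # with_pip=False keeps this sketch offline and fast; set it to True
    # if pip should be bootstrapped into the new environment.
    builder = venv.EnvBuilder(with_pip=False, system_site_packages=False)
    builder.create(env_dir)
    # pyvenv.cfg marks the directory as a virtual environment.
    return os.path.join(env_dir, "pyvenv.cfg")


if __name__ == "__main__":
    # Hypothetical location: a fresh temporary directory owned by the user.
    target = os.path.join(tempfile.mkdtemp(), "numpy-env")
    print(create_user_env(target))
```

Packages such as numpy would then be installed with the environment's own pip rather than the system one. Whether such an environment can be shipped to or recreated on every Hadoop node is the open question in this thread and is not addressed by the sketch.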

[Numpy-discussion] Using numpy on hadoop streaming: ImportError: cannot import name multiarray

2015-02-10 Thread Kartik Kumar Perisetla
Hi all, for one of my projects I am basically using NLTK for POS tagging, which internally uses an 'english.pickle' file. I managed to package the nltk library with these pickle files to make them available to the mapper and reducer for a hadoop streaming job using the -file option. However, when nltk