On Wed, Sep 22, 2010 at 11:25 PM, Neal Becker wrote:
> David Cournapeau wrote:
>
>> On Wed, Sep 22, 2010 at 11:10 PM, Neal Becker wrote:
>>> A colleague of mine posed the following problem. He wants to search
>>> large files of binary data for sequences.
>>>
>>
>> Is there a reason why you canno
On Wed, Sep 22, 2010 at 09:10, Neal Becker wrote:
> A colleague of mine posed the following problem. He wants to search large
> files of binary data for sequences.
>
> I thought of using mmap (to avoid reading all data into memory at once) and
> then turning this into a numpy array (using buffer=
David Cournapeau wrote:
> On Wed, Sep 22, 2010 at 11:10 PM, Neal Becker wrote:
>> A colleague of mine posed the following problem. He wants to search
>> large files of binary data for sequences.
>>
>
> Is there a reason why you cannot use one of the classic string search
> algorithms applied to
On Wed, Sep 22, 2010 at 11:10 PM, Neal Becker wrote:
> A colleague of mine posed the following problem. He wants to search large
> files of binary data for sequences.
>
Is there a reason why you cannot use one of the classic string search
algorithms applied to the bytestream?
David
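
For illustration, one way to read the suggestion above is to search the raw bytes directly, without going through numpy at all. The sketch below is only an assumption about what that could look like: it memory-maps the file and uses the standard library's `mmap.find`, which does a plain substring search. The function name `find_all` and its arguments are hypothetical and do not come from the thread.

```python
import mmap


def find_all(path, pattern):
    """Return the byte offsets of every match of `pattern` in the file.

    Overlapping matches are reported too, because the search resumes
    one byte after each hit.
    """
    offsets = []
    with open(path, "rb") as f:
        # Map the whole file read-only; the OS pages data in on demand,
        # so the file is not read into memory up front.
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            pos = mm.find(pattern)
            while pos != -1:
                offsets.append(pos)
                pos = mm.find(pattern, pos + 1)
    return offsets
```

Called as `find_all("data.bin", b"\x01\x02\x03")` (the file name is a placeholder), this returns a list of integer offsets. Note that mapping an empty file with length 0 raises `ValueError`, so a caller may want to check the file size first.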
___
A colleague of mine posed the following problem. He wants to search large
files of binary data for sequences.
I thought of using mmap (to avoid reading all data into memory at once) and
then turning this into a numpy array (using buffer=).
But, how to then efficiently find a sequence?
Note t
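
If the data does end up as a numpy array, one possible approach (a sketch under stated assumptions, not an answer given in the thread) is to locate every position where the first element of the sequence matches and then narrow those candidates one element at a time. The name `find_sequence` is hypothetical, and the sketch assumes a 1-D array whose dtype matches the sequence being searched for.

```python
import numpy as np


def find_sequence(data, seq):
    """Return the start indices at which `seq` occurs in the 1-D array `data`."""
    data = np.asarray(data)
    seq = np.asarray(seq, dtype=data.dtype)
    n, m = data.size, seq.size
    if m == 0 or m > n:
        return np.empty(0, dtype=np.intp)
    # Candidate starts: positions where the first element matches and
    # enough room remains for the rest of the sequence.
    candidates = np.flatnonzero(data[: n - m + 1] == seq[0])
    # Keep only the candidates whose k-th following element also matches.
    for k in range(1, m):
        if candidates.size == 0:
            break
        candidates = candidates[data[candidates + k] == seq[k]]
    return candidates
```

For a file on disk, the array could be obtained with something like `np.memmap(path, dtype=np.uint8, mode="r")`, which is an assumption here rather than the construction mentioned in the post. The first comparison still builds a boolean array as long as the data, so for very large files it may be necessary to process the array in overlapping chunks.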