Re: [Python-Dev] A fast startup patch (was: Python startup time)

Ryan Gonzalez Mon, 07 May 2018 19:50:42 -0700

On May 7, 2018 9:15:32 PM Steve Dower <steve.do...@python.org> wrote:

“the data shows that a focused change to address file system inefficiencies has the potential to broadly and transparently deliver benefit to users without affecting existing code or workflows.”
This is consistent with a Node.js experiment I heard about where they compiled an entire application in a single (HUGE!) .js file. Reading a single large file from disk is quicker than many small files on every significant file system I’m aware of. Is there benefit to supporting import of .tar files as we currently do .zip? Or perhaps having a special fast-path for uncompressed .zip files?
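
For reference, the existing .zip import path already accepts uncompressed archives. A minimal sketch, assuming a throwaway archive named bundle.zip and a made-up module bundled_hello (neither comes from this thread): it stores one module with ZIP_STORED and imports it through the standard zipimport machinery by putting the archive on sys.path. It does not measure anything, and it says nothing about how much a dedicated fast path would gain.

```python
import importlib
import os
import sys
import tempfile
import zipfile


def build_stored_zip(zip_path, modules):
    """Write each module into an uncompressed (ZIP_STORED) archive.

    `modules` maps a module name to its source text; both are
    illustrative inputs, not anything defined in the thread.
    """
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_STORED) as zf:
        for name, source in modules.items():
            zf.writestr(name + ".py", source)


# Hypothetical bundle with a single tiny module.
tmp_dir = tempfile.mkdtemp()
zip_path = os.path.join(tmp_dir, "bundle.zip")
build_stored_zip(zip_path, {"bundled_hello": "GREETING = 'hello from zip'\n"})

# Putting the archive on sys.path lets zipimport resolve the module.
sys.path.insert(0, zip_path)
importlib.invalidate_caches()
bundled_hello = importlib.import_module("bundled_hello")
print(bundled_hello.GREETING)
```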

I kind of built something like this, though I haven't really put in the effort to make it overly usable yet:


https://github.com/kirbyfan64/bluesnow

(Bonus points to anyone who gets the character reference in the name, though I seriously doubt it.)

Main thing I noticed was that reading compiled .pyc files is far faster than uncompiled Python code, even if you eliminate the disk access. Kind of obvious in retrospect, but still something to note.
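
A rough way to see this effect with no disk access at all is to time compile() on in-memory source against marshal.loads() on the already-compiled code object (marshaled code is what a .pyc holds after its header). This is only a sketch: the generated source and the func_N names are invented for illustration, and the timings vary by machine and Python version, so no particular ratio is claimed.

```python
import marshal
import timeit

# Hypothetical module source, kept in memory so no file I/O is timed.
source = "\n".join(
    f"def func_{i}(x):\n    return x * {i} + 1\n" for i in range(200)
)

# Compile once up front and serialize the code object with marshal.
code_obj = compile(source, "<in-memory>", "exec")
pyc_payload = marshal.dumps(code_obj)


def load_from_source():
    """Parse and compile the source text every time."""
    return compile(source, "<in-memory>", "exec")


def load_from_marshal():
    """Rebuild the code object from its marshaled bytes."""
    return marshal.loads(pyc_payload)


compile_time = timeit.timeit(load_from_source, number=20)
marshal_time = timeit.timeit(load_from_marshal, number=20)
print(f"compile: {compile_time:.4f}s  marshal.loads: {marshal_time:.4f}s")
```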

However, there are more obstacles to this in the Python world than the JS world. C extensions have a heavier prevalence here, distribution is a bit weirder (sorry, even with Pipfiles), and JavaScript already has an entire ecosystem built around packing files together from the web world.

Top-posted from my Windows phone

From: Carl Shapiro
Sent: Monday, May 7, 2018 14:36
To: Nathaniel Smith
Cc: Nick Coghlan; Python Dev
Subject: Re: [Python-Dev] A fast startup patch (was: Python startup time)

On Fri, May 4, 2018 at 6:58 PM, Nathaniel Smith <n...@pobox.com> wrote:
What are the obstacles to including "preloaded" objects in regular .pyc files, so that everyone can take advantage of this without rebuilding the interpreter?
The system we have developed can create a shared object file for each compiled Python file. However, such a representation is not directly usable. First, certain shared constants, such as interned strings, must be kept globally unique across object code files. Second, some marshaled objects, such as the hashed collections, must be initialized with randomization state that is not available until after the hosting runtime has been initialized.
We are able to work around the first issue by generating a heap image with the transitive closure of all modules that will be loaded which allows us to easily maintain uniqueness guarantees. We are able to work around the second issue with some unobservable changes to the affected data structures.
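
Both constraints can be observed from plain Python. The sketch below is only an illustration, not the system described here: sys.intern shows that equal interned strings resolve to one shared object, and running hash() in child interpreters with different PYTHONHASHSEED values shows that string hashes depend on per-process state. The helper name string_hash_with_seed and the sample strings are made up.

```python
import os
import subprocess
import sys

# Interning: two equal strings built at run time map to one shared object.
first = sys.intern("".join(["pre", "loaded"]))
second = sys.intern("".join(["pre", "loaded"]))
print("interned strings share identity:", first is second)


def string_hash_with_seed(text, seed):
    """Return hash(text) as computed by a child interpreter with a fixed seed."""
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    result = subprocess.run(
        [sys.executable, "-c", f"print(hash({text!r}))"],
        env=env,
        stdout=subprocess.PIPE,
        check=True,
        universal_newlines=True,
    )
    return int(result.stdout.strip())


# The same string hashes differently under different randomization seeds,
# so a hashed collection laid out under one seed does not fit another.
hash_seed_1 = string_hash_with_seed("spam", 1)
hash_seed_2 = string_hash_with_seed("spam", 2)
hash_seed_1_again = string_hash_with_seed("spam", 1)
print(hash_seed_1, hash_seed_2, hash_seed_1_again)
```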
 
Based on our numbers, it appears there should be some hesitancy--at this time--to changing the format of compiled Python files for the sake of load-time performance. In contrast, the data shows that a focused change to address file system inefficiencies has the potential to broadly and transparently deliver benefit to users without affecting existing code or workflows.
----------
_______________________________________________
Python-Dev mailing list
Python-Dev@python.org
https://mail.python.org/mailman/listinfo/python-dev
Unsubscribe:https://mail.python.org/mailman/options/python-dev/rymg19%40gmail.com



--
Ryan (ライアン)
Yoko Shimomura, ryo (supercell/EGOIST), Hiroyuki Sawano >> everyone else
https://refi64.com/

