Wes McKinney gave the keynote presentation at PyCon APAC 2016 in Seoul. He discussed his work on Python data analysis tools like pandas, Apache Arrow, and Feather. He also talked about open source sustainability and governance. McKinney is working on the second edition of his book Python for Data Analysis, which is scheduled for release in 2017.
34. High Performance Sharing & Interchange
Today
• Each system has its own internal memory format
• 70-80% CPU wasted on serialization and deserialization
• Similar functionality implemented in multiple projects
With Arrow
• All systems utilize the same memory format
• No overhead for cross-system communication
• Projects can share functionality (e.g., Parquet-to-Arrow reader)
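The contrast on this slide can be sketched with the Python standard library alone. This is only an analogy, not Arrow's API: `pickle` stands in for the serialize/deserialize step between systems, and a `memoryview` over an `array` of float64 values stands in for two systems agreeing on one in-memory layout. The function names `consume_via_serialization` and `consume_via_shared_buffer` are hypothetical and were chosen for illustration.

```python
import pickle
from array import array


def consume_via_serialization(values):
    """'Today': the producer serializes and the consumer deserializes.

    The consumer ends up with an independent copy of the data, and both
    sides spend CPU time converting to and from the wire format.
    """
    payload = pickle.dumps(values)   # producer side: encode to bytes
    return pickle.loads(payload)     # consumer side: decode into a new object


def consume_via_shared_buffer(values):
    """'With a shared format' (analogy): the consumer reads the producer's
    buffer directly because both agree on the layout (contiguous float64).

    No bytes are copied and no encode/decode step is needed.
    """
    return memoryview(values)


data = array("d", [1.0, 2.0, 3.0])

copied = consume_via_serialization(data)
shared = consume_via_shared_buffer(data)

# Change the producer's data after handing it over.
data[0] = 10.0

print(copied[0])  # the deserialized copy does not see the change
print(shared[0])  # the shared view reads the producer's memory directly
```

The serialized path pays for an encode, a decode, and a second copy of the data; the shared-buffer path pays for none of these, which is the property the slide attributes to a common memory format.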