Essential Open-Source Library pandas Awarded CZI Grant to Further Development


We’re pleased to announce that pandas, the open-source library providing high-performance data structures for tabular data analysis, has received grant funding from the Chan Zuckerberg Initiative (CZI) as part of their Essential Open Source Software for Science program. This funding will help the pandas project continue to thrive. Anaconda has supported pandas development over the years, most recently through contributions from employees Brock Mendel and Tom Augspurger.

We were happy to see that the CZI request for applications explicitly acknowledged the importance of maintenance for foundational open-source projects. While pandas is fortunate to have some institutional partners, including Anaconda, securing funding for development is difficult, and maintaining a project and community as large and active as pandas is a time-consuming endeavor.

Basic project maintenance is often one of the first things that over-worked maintainers have to cut. We’re explicitly dedicating a portion of the funding to increased project maintenance. This will mean fewer open issues, higher average quality and clarity of open issues, faster responses to new issues, and faster and better reviews on pull requests. All of this adds up to a better contributing experience and a more stable pandas.

We also plan to fund development time on some of our larger roadmap items. Specifically, we’ll work on:

  • Improved Extension Array Interface: We recently introduced an interface for storing custom array-like objects inside pandas’ data structures. This is a large change to pandas and would benefit from dedicated time to improve the interface and implementation.
  • Native String Refactor: Change how we store and process strings, resulting in lower memory usage and higher performance on text datasets.
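As a rough illustration of where these two items start from, the sketch below uses only functionality that pandas already ships. It assumes a pandas version that provides `pd.array` and the nullable `Int64` dtype, which is one existing array type built on the extension interface. The variable names are made up for the example, and nothing here represents the planned improvements themselves.

```python
import pandas as pd
from pandas.api.extensions import ExtensionArray

# A nullable integer array: an existing example of an array type that
# plugs into pandas through the extension array interface.
ints = pd.array([1, 2, None], dtype="Int64")
is_extension = isinstance(ints, ExtensionArray)
missing_count = int(ints.isna().sum())

# Strings stored as generic Python objects (dtype=object is requested
# explicitly so the example does not depend on version defaults).
words = pd.Series(["pandas", "numpy", "dask"] * 1000, dtype=object)

# The shallow figure counts only the array of object pointers; the deep
# figure also counts the Python string objects those pointers refer to.
shallow_bytes = words.memory_usage(index=False)
deep_bytes = words.memory_usage(index=False, deep=True)

print("extension-backed:", is_extension, "| missing values:", missing_count)
print("object-dtype strings, shallow vs. deep bytes:", shallow_bytes, deep_bytes)
```

The gap between the shallow and deep figures is the per-string object overhead that a dedicated string representation would aim to reduce.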

We’re excited to put these funds to use to ensure the continued health of the pandas project and community.

More information about the Essential Open Source Software for Science program is available in the CZI announcement.
