background

Strata Data Conference

Find the Anaconda booth at Strata Data Conference in New York, September 25-28! We’ll have exclusive swag, new demos, and plenty of surprises, so be sure to stop by. 

Dask core developer Matt Rocklin and Data Scientist Ben Zaitlen will be hosting their tutorial ‘Scaling Python Data Analysis‘ on Tuesday, September 26, from 9AM-12:30PM. Matt Rocklin will also give a talk ‘Dask: Flexible Parallelism in Python for Advanced Analytics‘ on Wednesday, September 27, from 2:55-3:35PM. Co-founder and CTO Peter Wang will also be giving a talk ‘Data Science Beyond the Sandbox’ on Wednesday, September 27, from 4:35-5:15PM. See the full Strata Data Conference schedule here.  See the full Strata Data Conference schedule here. 

TUTORIAL: Scaling Python Data Analysis, Matt Rocklin (@mrocklin) & Ben Zaitlen (@quasiben)

This tutorial teaches you to parallelize and scale your Python data science workloads to multi-core machines and multi-machine clusters. We cover a variety of tools including the standard library, Spark, and Dask. This comparative approach will help students understand how to think broadly about parallel applications, and choosing the right tool for the job.

See the full tutorial abstract here. 

TALK: Dask: Flexible Parallelism in Python for Advanced Analytics, Matt Rocklin (@mrocklin)

The data science Python ecosystem (NumPy, Pandas, and Scikit-learn) are efficient and intuitive for advanced analytics workloads. Unfortunately, these tools are restricted to data that fits into memory and runs on a single core. Dask is a parallel computing library that complements the Python ecosystem by providing a distributed parallel framework for high-performance task scheduling. Matthew Rocklin discusses the basic architecture of Dask, classes of applications in which it is commonly useful, and how it fits into the broader big data ecosystem.

See the full talk abstract here. 

TALK: Data Science Beyond the Sandbox, Peter Wang (@pwang

Peter Wang explores the typical problems data science teams experience when working with other teams and explains how these issues can be overcome through cohesive collaborative efforts among data scientists, business analysts, IT teams, and more.

See the full talk abstract here. 

Follow us @ContinuumIO and the #StrataData hashtag to stay up to date on the conference!

Find the Anaconda booth at Strata Data Conference in New York, September 25-28! We’ll have exclusive swag, new demos, and plenty of surprises, so be sure to stop by. 

Dask core developer Matt Rocklin and Data Scientist Ben Zaitlen will be hosting their tutorial ‘Scaling Python Data Analysis‘ on Tuesday, September 26, from 9AM-12:30PM. Matt Rocklin will also give a talk ‘Dask: Flexible Parallelism in Python for Advanced Analytics‘ on Wednesday, September 27, from 2:55-3:35PM. See the full Strata Data Conference schedule here.