Anaconda Parcel Brings Open Source Anaconda to Hadoop to Power Data Science Analytics

AUSTIN, TX – February 17, 2016 – Continuum Analytics, the creator and driving force behind Anaconda, the leading modern open source analytics platform powered by Python, today announced the release of Anaconda for C​loudera. This n​ew solution​ makes it easy to use Anaconda within a Cloudera­-managed Hadoop cluster to power data science analytics.

Anaconda Parcel Brings Open Source Anaconda to Hadoop to Power Data Science Analytics

AUSTIN, TX – February 17, 2016 – Continuum Analytics, the creator and driving force behind Anaconda, the leading modern open source analytics platform powered by Python, today announced the release of Anaconda for C​loudera. This n​ew solution​ makes it easy to use Anaconda within a Cloudera­-managed Hadoop cluster to power data science analytics.

Previously, Cloudera users had to manually install a complete Python data science stack on a Hadoop cluster and manage runtime dependencies themselves. With the A​naconda parcel,​ they now have a simple and compatible way to install Python on a Hadoop cluster. The Anaconda parcel enables users to easily build and run Python based solutions across a Cloudera cluster and alongside Spark jobs. Now, data scientists using Python and PySpark on a Hadoop cluster can exploit the full power of Anaconda analytic libraries to easily and effectively create powerful, high impact data science solutions.

“The recent certification of Anaconda with Cloudera Enterprise makes Python much more accessible to customers and allows data scientists to easily scale out their data science solutions and realize benefits faster,” said Tim Stevens, vice president of Corporate and Business Development at Cloudera.

Continuum worked closely with Cloudera to improve the process of using Python packages for data science and data analysis in a Hadoop cluster with Spark. The Anaconda parcel is installed via Cloudera Manager, which makes it easy to have the most popular open source Python packages available across a Hadoop cluster.

“Spark has clearly demonstrated that Python is one of the most important technologies in modern Open Data Science. Nearly half of all Spark users are using Python for their data science needs – including data exploration and predictive modeling – in their Hadoop cluster,” said Peter Wang, Continuum CTO and co-­founder. “We’re excited about the low level technology advancements in Hadoop, such as Parquet, as well as the pioneering advancements by Cloudera on Impala and Kudu. These advancements have set the foundation for our next generation Hadoop innovations, which extend Python from an interface for data science on Hadoop to a full­-fledged native analytic computational platform for Hadoop.”

About Cloudera

Cloudera delivers the modern data management and analytics platform built on Apache Hadoop and the latest open source technologies. The world’s leading organizations trust Cloudera to help solve their most challenging business problems with Cloudera Enterprise, the fastest, easiest and most secure data platform available for the modern world. Our customers efficiently capture, store, process and analyze vast amounts of data, empowering them to use advanced analytics to drive business decisions quickly, flexibly and at lower cost than has been possible before. To ensure our customers are successful, we offer comprehensive support, training and professional services. Learn more at ​h​ttp://cloudera.com.​

Connect with Cloudera

Read our blogs:​ cloudera.com/engblog ​and ​vision.cloudera.com
Follow us on Twitter: ​twitter.com/cloudera
Visit us on Facebook: ​facebook.com/cloudera
Join the Cloudera Community: ​cloudera.com/community
Cloudera, Cloudera’s Platform for Big Data, Cloudera Enterprise Data Hub Edition, Cloudera Enterprise Flex Edition, Cloudera Enterprise Basic Edition, Cloudera Navigator Optimizer ​and CDH ​are trademarks or registered trademarks of Cloudera Inc. in the United States, and in jurisdictions throughout the world. All other company and product names may be trademarks of their respective owners.

About Continuum Analytics

Continuum Analytics is the creator and driving force behind Anaconda, the leading, modern open source analytics platform powered by Python. We put superpowers into the hands of people who are changing the world.

With more than 2.25M downloads annually and growing, Anaconda is trusted by the world’s leading businesses across industries – financial services, government, health & life sciences, technology, retail & CPG, oil & gas – to solve the world’s most challenging problems. Anaconda does this by helping everyone in the data science team discover, analyze, and collaborate by connecting their curiosity and experience with data. With Anaconda, teams manage their open data science environments without any hassles to harness the power of the latest open source analytic and technology innovations.

Our community loves Anaconda because it empowers the entire data science team – data scientists, developers, DevOps, architects, and business analysts – to connect the dots in their data and accelerate the time­-to­-value that is required in today’s world. To ensure our customers are successful, we offer comprehensive support, training and professional services.

Continuum Analytics’ founders and developers have created or contribute to some of the most popular open data science technologies, including NumPy, SciPy, Matplotlib, pandas, Jupyter/IPython, Bokeh, Numba and many others. Continuum Analytics is venture­-backed by General Catalyst and BuildGroup.

To learn more about Continuum Analytics, visit w​w​w.continuum.io.​

###

Media Contacts:

Treble ​for Continuum Analytics

Aaron DeLucia

(512) 960­8222

[email protected]

Deborah Wiltshire

Cloudera

[email protected]

+1 (650) 644­3900