Big Data, advanced analytics and scientific computing bring exciting opportunities for businesses to leverage more and richer types of data to create meaningful business value and impact. However, it also creates serious computational challenges. To effectively manage this increasingly complex landscape, Data Science teams need technologies that will easily scale and take advantage of the available processing power from their desktop to high performance clusters with the latest advances in chipsets.

The Anaconda platform easily delivers high performance analysis to Data Science teams by leveraging investments in all types of infrastructure to both Scale Up and Scale Out workloads.

In this paper, you’ll discover why Anaconda, the leading Open Data Science platform, is the right solution to deliver that range of flexibility and performance.

Big Data, advanced analytics and scientific computing bring exciting opportunities for businesses to leverage more and richer types of data to create meaningful business value and impact. However, it also creates serious computational challenges. To effectively manage this increasingly complex landscape, Data Science teams need technologies that will easily scale and take advantage of the available processing power from their desktop to high performance clusters with the latest advances in chipsets.

Python is the fastest growing Open Data Science programming language with an incredibly rich open source ecosystem. It is the go-to language for Data Science teams because of the easy learning curve and simplicity in programming and readability, which results in faster development time compared with compiled languages. However, because it’s an interpreted language, it is not associated with high performance. But with today’s increasingly large data sets and complex analytics, scaling analytics has become an issue for many Data Science teams.

The Anaconda platform easily delivers high performance analysis to Data Science teams by leveraging investments in all types of infrastructure to both Scale Up and Scale Out workloads. The Anaconda Python distribution is a high performance distribution of Python that has been compiled with the Intel® Math Kernel Library (MKL) that delivers faster throughput on numerically intensive calculations. Additionally, there are a multitude of Scale Up options available in Anaconda that allow Data Science workloads to maximize the investment in single machine infrastructure.


In this paper, you’ll discover why Anaconda, the leading Open Data Science platform, is the right solution to deliver that range of flexibility and performance. We will:

  • Define the computing constraints
  • Identify techniques for resolving the computing constraints
  • Describe ways to deliver high performance on today’s modern architectures
  • Showcase the flexibility of the Anaconda platform to scale with any modern infrastructure