Machine Learning with PySpark

  • flag Datacamp
  • student All Levels
  • database Video
  • earth English
  • clock 4h


Learn how to make predictions with Apache Spark.

Covered topics:

  • Machine Learning for Everyone
  • Big Data with PySpark
  • Machine Learning Scientist with Python


Spark is a powerful^ general purpose tool for working with Big Data. Spark transparently handles the distribution of compute tasks across a cluster. This means that operations are fast^ but it also allows you to focus on the analysis rather than worry about technical details. In this course you ll learn how to get data into Spark and then delve into the three fundamental Spark Machine Learning algorithms: Linear Regression^ Logistic Regression/Classifiers^ and creating pipelines. Along the way you ll analyse a large dataset of flight delays and spam text messages. With this background you ll be ready to harness the power of Spark and apply it on your own Machine Learning projects!