Machine Learning with PySpark: Data Analysis using SQL

Highlights

This Guided Project is for beginning Python developers. In this 1-hour, project-based course, you will learn how to describe PySpark and machine learning, use PySpark to capture data, use PySpark SQL to inspect the data, use PySpark MLlib to prepare training data, and use PySpark MLlib to predict an outcome. To achieve this, we will read a heart disease data set into a PySpark DataFrame, view the data using PySpark SQL, prepare the test and training data, and attempt to predict heart disease from the independent variables.

About the Course Provider

Coursera provides access to more than 3,000 courses across a wide variety of subjects in partnership with various universities and organizations.

Course by

  • Self paced
  • Duration: 3 hours
  • Domain: IT & Computer Science
  • Monthly subscription option: not available
  • Fee: Free
  • Language: English