Training, evaluating, and deploying machine learning models are now routine in many organizations, and having the right environment around this process is often what sets a company apart from its competitors. The Databricks Machine Learning Runtime, along with MLFlow, manages your experiment’s runs, and models make training and hyperparameter tuning of your models simple and intuitive.
In this course, Building Machine Learning Models on Databricks, you will learn to build and train regression and classification models using the scikit-learn framework.
First, you will load, explore, and process your data using Databricks notebooks and you will use Bamboolib for no-code data analysis and transformations.
Next, you will create experiments and track your model’s parameters and metrics using runs, and compare runs using the MLflow UI.
After that, you will build and train regression and classification models using gradient boosting algorithms which are part of the XGBoost framework. You will also productionize and serve your models using Classic MLFlow Model Serving and perform real-time inference using your deployed models.
Finally, you will learn how you can use the Hyperopt tool for hyperparameter tuning of your models, as well as running hyperparameter tuning in a distributed fashion on a Spark cluster using the SparkTrials class.
When you are finished with this course, you will have the skills and knowledge to build and train traditional machine learning models on Databricks using MLflow to manage your machine learning workflow.