Introduction to Machine Learning#
Overview#
Welcome to Intro to Machine Learning, a course designed to introduce you to core machine learning concepts. You will explore the machine learning landscape, and the main stages of the machine learning pipeline. We will cover model selection, error, evaluation and validation, and put all of these into practice using Python. We will introduce Scikit-learn, and use it to pre-process data, train models, and build pipelines.
Course objectives#
Explain the main theoretical concepts of machine learning (at a high level), the machine learning landscape, and be able to provide some examples of machine learning applications.
Train basic linear models using
scikit-learn, building understanding of model-based learning.Understand concepts such as model selection, error, evaluation, validation, and be able to put these into practice with Python.
Explain the main stages of the machine learning pipeline, and be able to create a pipeline running on real-world data.
Prerequisites#
Attendees should have taken Introduction To Python, and Python for Data Analysis. Attendees are also encouraged to have some familiarity with Python virtual environments. Those who would like to build or refresh this knowledge can self-study the short course on the topic: Python Environments
Session content#
This course will be delivered via a mix of slides and programming tutorials. Materials can be found at the following links:
Session 1#
Part 1: What is machine learning? - Click for slides
Part 2: Linear regression - Click for Jupyter Notebook
Part 3: Model selection and evaluation - Click for slides
Session 2#
Part 1: Model selection and evaluation - Click for Jupyter Notebook
Part 2: The machine learning pipeline - Click for slides
Part 3: Machine learning pipeline task - Click for Jupyter Notebook
Session 3#
Continued: Machine learning pipeline task - Click for Jupyter Notebook
Part 2: Unsupervised learning - Click for Jupyter Notebook