Machine Learning with Python

  • 2 Days
  • Intermediate
  • Virtual | Classroom
  • £ On Request

A two-day intensive course on the standard machine learning analytics pipeline

Book For My Team

Your team will learn...

How to build and quantitatively assess a variety of models suitable for a range of problems

The importance of data preprocessing and regularisation

Confidently compare the efficacy of their models using a rigorous training and testing framework

How various types of models operate

Modern, state of the art machine learning techniques


Python (along with R) has become the dominant language in machine learning and data science. It is now commonly used to fit complex models to messy datasets. This two-day intensive course will equip you with the knowledge and tools to undertake a variety of tasks in a standard machine learning analytics pipeline. We stress the importance of data preparation, both in terms of data standardisation and feature selection, before tackling model building. The course covers regression and classification models, including, tree-based methods, clustering and sparse regression models. Model selection is introduced using cross-validation and bootstrapping.

This workshop is delivered by our training partner Jumping Rivers


Introducing Machine Learning (ML)

An introduction to machine learning and the associated packages in Python, such as Numpy, Scipy, andSciKit-Learn.

Data Reprocessing

Learn the why and how about preprocessing your data with scaling transformations and one hot encoding. We cover typical standardisation and normalisation procedures.

Introduction to Modelling

Introductory modelling techniques such as linear regression and how we move from a statistical model to a machine learning model.

Model Assessment

Quantify the effectiveness of your models using training, validation and test sets plus techniques such as cross-validation. We discuss the different metrics that can be used to judge a model and which are appropriate.


Techniques to avoid overfitting and to perform feature selection, such as lasso, ridge and elastic net regression.


An unsupervised learning technique for uncovering patterns and structure within data.

Advanced Techniques

Some more advanced model fitting using algorithms such as gradient boosted trees and support vector machines.


It is expected that participants are comfortable using the Python programming language and common data structures. Some exposure to common statistical terms would be an advantage, but not essential. Attendance of the Introduction to Python course or equivalent experience should be sufficient.

Ryan Adams

Used to make software for learning as a developer, now helping software makers learn.

Follow Ryan
Andrew Paul

Was a teacher, then a lecturer and now a trainer at Instil. Has been completed the circle.

Gemma Esler

Software developer in the semiconductor industry before switching to lecturing and then Instil as a trainer.

For a breakdown of what to expect in our training, check out our training overview page.
Deloitte logo
Atlassian logo
Workday logo
BMW logo
Amex logo
McAfee logo
PWC logo