Introducing Regression Models
Last updated on 2024-03-12 | Edit this page
Overview
Questions
- What is a Regression Model?
Objectives
- Explain what a Regression Model is.
- Recognize a Simple Linear Regression Model.
- Understand its usages and interpret it in real life context.
What is a “Regression”?
Regression analysis is a core concept in machine learning, specifically supervised learning where our dataset has both input features and output labels. The goal is to determine a relationship amongst variables by analyzing how one affects the others.
The example that we’ll work with in this lesson is related to movies. Imagine if you are a producer who wants to know the success of a movie before it is released. You have a dataset of around 5000 films, containing information on their ratings, popularity, etc… How can we predict the revenue based on the ratings or popularity of a certain film? Understanding regression models may allow you to answer such questions.
What is an SLR?
The most basic regression model and well-known algorithm is the Simple Linear Regression Model (SLR), which we will focus on in this introductory modual.
A linear regression line has an equation of the form \(Y = \beta_1x + \beta_0 + \epsilon\)
\(\beta_1\): slope of line
\(\beta_0\): intercept
\(X\): explanatory variable
\(Y\): dependent variable
\(\epsilon\): difference between our predicted value from the regression line and the actual value
- Predict film revenue based on their ratings
- Predict wine price by its date
How does it apply to a real life scenario?
Now that you know about the components of a simple linear regression model, take some time to think about how it may apply to our movies dataset.
Ratings might be seen as \(X\), our explanatory variable, and revenue as \(Y\), the dependent variable. We could construct a linear relationships where ratings are used to predict revenue in the form of \(\hat{Revenue} = \beta_1 ratings + \beta_0 + \epsilon\).