Linear Regression

This is a short interactive tutorial on linear regression.

How It Works

So if you have a dataset of points, we can plot that dataset.
Example: What if we have a dataset where the x-axis is temperature and the y-axis is ice cream sales:

So you know, for example, it was 24 degrees, and there were only 3 ice creams sold, and on another day, it was 90 degrees, and there were 18 ice creams sold, so you get the idea, you have a hole lot of points.
What linear regression seas is "Ok, tomorrow it's going to be 50 degrees. Could you gess how many ice creams you are going to sale?".
So the idea of a linear regression is to figure out: "how could we fit a line to this data?"

The Algorithm

So the formula for a line is typically written as: y = mx + b.
In statistics textbooks, you will see instead: y = b0 + b1 * x,
This is the same exact formula, m refers to b1 and it represents the "slope" and b refers to b0 and it represents the "y-intercept", which is where the line intersects the y-axis.
So the slope determines: "which way does the line point", and the y-intercept determines: "where is that line relative to the y-axis".
Now, here is the formula:

X Bar is the mean or the average, or the sum of all of the x values divided by the total number of x values. And oviously, same thing for y bar.
This formula actually makes sense, because, let's say the formula for the line is just y = x. Then the slope would be 1. Right? So you can see why this makes sense.


Now let's take a look at a simple example of linear regression.