Linear Regression using Matrices

Introduction

For points (x₁, y₁), (x₂, y₂), ... (x_n, y_n), the least square regression line can be given by:

f(x) = b + mx

which will minimize the sum of the squared error, which are the error in using the regression function f(x) to estimate the true y values

Linear Regression using Matrices

where e_i = y_i - f(x_i) is the error approximating y_i .

Let's see if we can set this up as a system of equations and then solve using matrices.

Using our points (x₁, y₁), (x₂, y₂), ... (x_n, y_n) we would have the following system of equations:

Linear Regression using Matrices

Now let's set up an matrix equation. Let:

Linear Regression using Matrices

This gives us the matrix equation: Y = XA + E.

We now just need to solve this for A.

The solution to least square regression equation Y = XA + E is:

Linear Regression using Matrices

The sum of the squared errors (SSE) is:

Linear Regression using Matrices

Example

Example: Determine the least squares regression line using a matrics. The price is $x and y is the monthly sales. Then find the sum of the squared errors.

Linear Regression using Matrices

The solution is to work out A using:

Linear Regression using Matrices

Step 1: Get the matrices Y and X.

Linear Regression using Matrices

Step 2: Work out the X^TX

Linear Regression using Matrices

Step 3: Find the inverse of X^TX. (The tutorial about the Inverse is here)

Linear Regression using Matrices

Step 4: Find the X^TY

Linear Regression using Matrices

Step 5: Finally, the result

Linear Regression using Matrices

and the Squared Error is

Linear Regression using Matrices

References & Resources

http://www.youtube.com/watch?v=Qa_FI92_qo8

Latest Post