Linear Regression Formulas

The Model Equation & Error Metrics (SSE, MSE, RMSE)
$$ y = mx + b $$
Linear Regression Equation

What it is: The basic formula for a straight line, used to make predictions.

  • \(x\): Input (feature)
  • \(y\): Output (prediction)
  • \(m\): Slope (often called \(w\) or "weight" in ML)
  • \(b\): Bias / Intercept
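The line above can be sketched in a few lines of Python. The slope and bias values here are arbitrary, chosen only for illustration, not fitted to any data:

```python
# Predict with a line y = m*x + b.
m = 2.5   # slope ("weight") -- illustrative value
b = 10.0  # bias / intercept -- illustrative value

def predict(x):
    """Return the model's prediction for input x."""
    return m * x + b

print(predict(4.0))  # 2.5 * 4.0 + 10.0 = 20.0
```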
$$ SSE = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 $$
SSE (Sum of Squared Errors)

What it is: The sum of the squares of all prediction errors (residuals). The fundamental metric that MSE and RMSE build on.

Why it's useful:

  • Mathematically ideal for optimization (graph is a smooth convex bowl).
  • Heavily "punishes" large errors (error squared grows very fast).

Downside: The value grows with the number of data points, so it cannot be compared across datasets of different sizes.
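The SSE formula translates directly to code. A minimal sketch with made-up sample values:

```python
def sse(y_true, y_pred):
    """Sum of Squared Errors: sum of (y_i - yhat_i)^2 over all points."""
    return sum((y - yh) ** 2 for y, yh in zip(y_true, y_pred))

# Illustrative data, not from a real model.
y_true = [3.0, 5.0, 7.0]
y_pred = [2.5, 5.0, 8.0]
print(sse(y_true, y_pred))  # 0.25 + 0.0 + 1.0 = 1.25
```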

$$ MSE = \frac{SSE}{n} = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 $$
MSE (Mean Squared Error)

What it is: The average value of the squared error.

Why it's useful:

  • Independent of the number of data points (\(n\)).
  • Allows comparison of model quality across datasets of different sizes.

Downside: Units are "squared dollars" or "squared meters", which are hard to interpret.
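Since MSE is just SSE divided by \(n\), the sketch is nearly identical (same illustrative numbers as before):

```python
def mse(y_true, y_pred):
    """Mean Squared Error: SSE / n."""
    n = len(y_true)
    return sum((y - yh) ** 2 for y, yh in zip(y_true, y_pred)) / n

print(mse([3.0, 5.0, 7.0], [2.5, 5.0, 8.0]))  # 1.25 / 3 ≈ 0.4167
```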

$$ RMSE = \sqrt{MSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2} $$
RMSE (Root Mean Squared Error)

What it is: The square root of the mean squared error.

Why it's useful:

  • Returns the error to the original units (e.g., dollars instead of squared dollars).
  • Easy to explain to business stakeholders: "Our model's typical error is about X dollars".
  • The de facto standard for evaluating accuracy in real tasks.
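Taking the square root of the MSE completes the picture. The same illustrative numbers, now yielding an error in the original units:

```python
import math

def rmse(y_true, y_pred):
    """Root Mean Squared Error: sqrt(MSE), expressed in the original units."""
    n = len(y_true)
    mean_sq = sum((y - yh) ** 2 for y, yh in zip(y_true, y_pred)) / n
    return math.sqrt(mean_sq)

print(rmse([3.0, 5.0, 7.0], [2.5, 5.0, 8.0]))  # sqrt(1.25 / 3) ≈ 0.645
```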