Why least squares is used
An overdetermined system has more equations than unknowns, so an exact solution usually does not exist. Least squares finds the vector that makes the total residual as small as possible.
The standard formula is , built from the normal equations.