The Least Squares method

Introduction
The relation between two variables may be defined by:

a table
a graph in a coordinate system
an equation

Each (x,y) pair may be plotted as a point in a coordinate system.
We may then search for the best-fitting polynomial.

Note: a polynomial of degree n is: y = c₀ + c₁x + c₂x² + ... + c_nxⁿ

Please look at the figure above:

the points (x₁,y₁), (x₂,y₂), ... are plotted, and we ask for the best-fitting polynomial of degree 1 through these points.

"Best fitting" means that the sum of the squared differences over all points is minimal.
These differences are drawn as dotted lines in the figure.
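The quantity being minimized can be sketched in a few lines of Python. This is only an illustration: the function names `poly_eval` and `sum_squared_differences` are made up here, and the sample points are borrowed from the example at the end of this article.

```python
def poly_eval(coeffs, x):
    """Evaluate c0 + c1*x + ... + cn*x**n (Horner's scheme)."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result


def sum_squared_differences(coeffs, points):
    """Sum over all points of (y - p(x))**2, the quantity to be minimized."""
    return sum((y - poly_eval(coeffs, x)) ** 2 for x, y in points)


points = [(0, 1), (1, 3), (2, 4), (3, 4)]
print(sum_squared_differences([1.5, 1.0], points))  # line y = 1.5 + x -> 1.0
print(sum_squared_differences([1.0, 1.0], points))  # line y = 1 + x   -> 2.0
```

The first line has the smaller sum, so it fits these four points better than the second one.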

The "Least Squares" method used to find the best-fitting polynomial is a nice application of linear algebra.

My equation grapher Graphics-Explorer uses this method;
the degree may be 0 to 7.

The Least Squares method
Given are the points (x₁,y₁), (x₂,y₂), ..., (x_n,y_n).

Requested:
a polynomial of degree m, y = c₀ + c₁x + c₂x² + ... + c_mx^m, that fits these points with minimal deviation.

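If all points lie exactly on the polynomial, as is possible when m+1 = n, we have:

$$
\begin{aligned}
y_1 &= c_0 + c_1 x_1 + c_2 x_1^2 + \dots + c_m x_1^m \\
y_2 &= c_0 + c_1 x_2 + c_2 x_2^2 + \dots + c_m x_2^m \\
&\;\;\vdots \\
y_n &= c_0 + c_1 x_n + c_2 x_n^2 + \dots + c_m x_n^m
\end{aligned}
$$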

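Written in matrix form:

$$
\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}
=
\begin{pmatrix}
1 & x_1 & x_1^2 & \cdots & x_1^m \\
1 & x_2 & x_2^2 & \cdots & x_2^m \\
\vdots & \vdots & \vdots & & \vdots \\
1 & x_n & x_n^2 & \cdots & x_n^m
\end{pmatrix}
\begin{pmatrix} c_0 \\ c_1 \\ \vdots \\ c_m \end{pmatrix}
$$

or, in short:

y = M . c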
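As a small illustration of the matrix form, the sketch below builds M for given x values and multiplies it by a coefficient vector. The helper names `design_matrix` and `mat_vec` and the sample polynomial y = 1 + 2x + 3x² are assumptions chosen for this example only.

```python
def design_matrix(xs, m):
    """Row i is [1, x_i, x_i**2, ..., x_i**m]."""
    return [[x ** k for k in range(m + 1)] for x in xs]


def mat_vec(M, c):
    """Product M . c of a matrix (list of rows) and a vector."""
    return [sum(a * b for a, b in zip(row, c)) for row in M]


# Three points on y = 1 + 2x + 3x^2, so n = 3 and m = 2 (m + 1 = n).
xs = [0, 1, 2]
c = [1, 2, 3]
M = design_matrix(xs, 2)
print(M)              # [[1, 0, 0], [1, 1, 1], [1, 2, 4]]
print(mat_vec(M, c))  # [1, 6, 17], the y values of the three points
```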

If the points are not exactly on the polynomial, there will be a difference vector:

y - M . c

The squared norm (squared length) of this difference vector is the sum of the squared differences.

So, we look for c , where || y - M . c || is minimal.

This will be the case if the difference vector is perpendicular to the column space of M.
The inner product of M c, which lies in that column space, and the difference vector is then zero:

(M c )^t ( y - M c) = 0

(c^t M^t) (y - M c) = 0

c^t ( M^t ( y - M c)) = 0

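c^t ( M^t y - M^t M c ) = 0

Perpendicularity to the whole column space means that this must still hold when the left factor c^t is replaced by any other vector, so the expression in parentheses must itself be zero:

M^t y - M^t M c = 0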

M^t M c = M^t y

(M^t M)^-1 M^t M c = (M^t M)^-1 M^t y

c = (M^t M)^-1 M^t y
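The formula can be tried out with a short, standard-library-only Python sketch. Note the assumptions: instead of forming the inverse (M^t M)^-1 explicitly, the sketch solves M^t M c = M^t y by Gaussian elimination, which gives the same c whenever M^t M is invertible. All function names are invented for this illustration, and the data are the points of the example at the end of this article.

```python
def transpose(A):
    """Write the rows of A as columns."""
    return [list(col) for col in zip(*A)]


def mat_mul(A, B):
    """Matrix product A B for matrices stored as lists of rows."""
    Bt = transpose(B)
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    aug = [[float(v) for v in A[i]] + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("M^t M is singular; too few distinct x values?")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= factor * aug[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        known = sum(aug[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (aug[i][n] - known) / aug[i][i]
    return x


def least_squares_poly(points, m):
    """Coefficients [c0, ..., cm] satisfying M^t M c = M^t y."""
    M = [[x ** k for k in range(m + 1)] for x, _ in points]
    y = [[yv] for _, yv in points]  # y as a column vector
    Mt = transpose(M)
    MtM = mat_mul(Mt, M)
    Mty = [row[0] for row in mat_mul(Mt, y)]
    return solve(MtM, Mty)


coeffs = least_squares_poly([(0, 1), (1, 3), (2, 4), (3, 4)], 1)
print([round(c, 6) for c in coeffs])  # [1.5, 1.0]
```

Solving the linear system directly is a common design choice because it avoids computing an inverse that is used only once.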

Remarks
1.
M^t means matrix M transposed: mirrored in its diagonal, so the rows are written as columns.

If

$$
M = \begin{pmatrix} 2 & 0 & 3 \\ 1 & 5 & 8 \end{pmatrix}
$$

then

$$
M^t = \begin{pmatrix} 2 & 1 \\ 0 & 5 \\ 3 & 8 \end{pmatrix}
$$

2.
Rule: (A B)^t = B^t A^t

3.
The inner product of two vectors a and b may be written as a^t . b

4.
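In the case of linear regression, where m = 1 and c = [b,a] (because the line has the equation y = b + ax),
this is true:

$$
y = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix},
\qquad
M = \begin{pmatrix} 1 & x_1 \\ 1 & x_2 \\ \vdots & \vdots \\ 1 & x_n \end{pmatrix},
\qquad
M^t = \begin{pmatrix} 1 & 1 & \cdots & 1 \\ x_1 & x_2 & \cdots & x_n \end{pmatrix}
$$

Starting with M^t M c = M^t y:

$$
\begin{pmatrix} 1 & 1 & \cdots & 1 \\ x_1 & x_2 & \cdots & x_n \end{pmatrix}
\begin{pmatrix} 1 & x_1 \\ 1 & x_2 \\ \vdots & \vdots \\ 1 & x_n \end{pmatrix}
\begin{pmatrix} b \\ a \end{pmatrix}
=
\begin{pmatrix} 1 & 1 & \cdots & 1 \\ x_1 & x_2 & \cdots & x_n \end{pmatrix}
\begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}
$$

$$
\begin{pmatrix} n & \sum x_i \\ \sum x_i & \sum x_i^2 \end{pmatrix}
\begin{pmatrix} b \\ a \end{pmatrix}
=
\begin{pmatrix} \sum y_i \\ \sum x_i y_i \end{pmatrix}
$$

$$
\begin{pmatrix} b\,n + a \sum x_i \\ b \sum x_i + a \sum x_i^2 \end{pmatrix}
=
\begin{pmatrix} \sum y_i \\ \sum x_i y_i \end{pmatrix}
$$

The following system of linear equations has to be solved:

$$
\begin{aligned}
b\,n + a \sum x_i &= \sum y_i \\
b \sum x_i + a \sum x_i^2 &= \sum x_i y_i
\end{aligned}
$$

or, with everything moved to one side:

$$
\begin{aligned}
\sum x_i y_i - a \sum x_i^2 - b \sum x_i &= 0 \\
\sum y_i - a \sum x_i - b\,n &= 0
\end{aligned}
$$

$$
\begin{aligned}
\sum \left( x_i y_i - a x_i^2 - b x_i \right) &= 0 \\
\sum \left( y_i - a x_i - b \right) &= 0
\end{aligned}
$$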

For the solution, please refer to my article Linear Regression.
See formulas 1) and 2) there.
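For readers who want to experiment right away, the two equations in b and a from remark 4 can be solved with Cramer's rule. This is a sketch under that assumption and is not necessarily the form of the formulas in the article referred to; the function name `fit_line` is hypothetical, and the sample points are those of the example below.

```python
def fit_line(points):
    """Return (b, a) of the least squares line y = b + a*x.

    Solves   b*n     + a*sum(x)   = sum(y)
             b*sum(x) + a*sum(x^2) = sum(x*y)
    with Cramer's rule.
    """
    n = len(points)
    sx = sum(x for x, _ in points)
    sy = sum(y for _, y in points)
    sxx = sum(x * x for x, _ in points)
    sxy = sum(x * y for x, y in points)
    det = n * sxx - sx * sx  # determinant of M^t M
    if det == 0:
        raise ValueError("all x values are equal; the line is not determined")
    b = (sy * sxx - sx * sxy) / det
    a = (n * sxy - sx * sy) / det
    return b, a


print(fit_line([(0, 1), (1, 3), (2, 4), (3, 4)]))  # (1.5, 1.0)
```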

Example
Find the least squares straight line through the points (0,1), (1,3), (2,4) and (3,4).

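$$
M = \begin{pmatrix} 1 & 0 \\ 1 & 1 \\ 1 & 2 \\ 1 & 3 \end{pmatrix}
$$

$$
M^t = \begin{pmatrix} 1 & 1 & 1 & 1 \\ 0 & 1 & 2 & 3 \end{pmatrix}
$$

$$
M^t M = \begin{pmatrix} 4 & 6 \\ 6 & 14 \end{pmatrix}
$$

$$
(M^t M)^{-1} = 0.1 \begin{pmatrix} 7 & -3 \\ -3 & 2 \end{pmatrix}
$$

$$
c = 0.1
\begin{pmatrix} 7 & -3 \\ -3 & 2 \end{pmatrix}
\begin{pmatrix} 1 & 1 & 1 & 1 \\ 0 & 1 & 2 & 3 \end{pmatrix}
\begin{pmatrix} 1 \\ 3 \\ 4 \\ 4 \end{pmatrix}
=
\begin{pmatrix} 1.5 \\ 1 \end{pmatrix}
$$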

So, the line is y = 1.5 + x
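The intermediate results of this example can be checked with exact fractions. The sketch below uses only Python's standard `fractions` module; the variable names are chosen for this illustration, and the explicit 2x2 inverse formula is used because M^t M is 2x2 here.

```python
from fractions import Fraction

points = [(0, 1), (1, 3), (2, 4), (3, 4)]
M = [[Fraction(1), Fraction(x)] for x, _ in points]
y = [Fraction(yv) for _, yv in points]

Mt = [list(col) for col in zip(*M)]  # transpose of M
# Entry (i, j) of M^t M is the inner product of rows i and j of M^t.
MtM = [[sum(p * q for p, q in zip(ri, rj)) for rj in Mt] for ri in Mt]
Mty = [sum(p * q for p, q in zip(ri, y)) for ri in Mt]

# Inverse of [[p, q], [r, s]] is 1/(p*s - q*r) * [[s, -q], [-r, p]].
(p, q), (r, s) = MtM
det = p * s - q * r
inv = [[s / det, -q / det], [-r / det, p / det]]

c = [sum(u * v for u, v in zip(row, Mty)) for row in inv]

print([[str(v) for v in row] for row in MtM])  # [['4', '6'], ['6', '14']]
print([[str(v) for v in row] for row in inv])  # [['7/10', '-3/10'], ['-3/10', '1/5']]
print([str(v) for v in c])                     # ['3/2', '1']
```

The inverse printed here equals 0.1 times the matrix with rows (7, −3) and (−3, 2), and c = (3/2, 1) gives the line y = 1.5 + x.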