Correction and Generalization of the Least Square Method

 

by

© Ph. D. & Dr. Sc. Lev Gelimson

Academic Institute for Creating Fundamental Sciences (Munich, Germany)



Mathematical Journal

of the "Collegium" All World Academy of Sciences

Munich (Germany)

 

8 (2008), 1

 

 

For any data processing, the least square method [1] by Legendre and Gauss is practically the unique known one applicable to contradictory problems. But this method is based on the absolute error not invariant by equivalent transformations of a problem and ignores the possibly noncoinciding physical dimensions (units) of relations in a problem. The method does not correlate the deviations of the desired approximation from the approximated objects with these objects themselves and simply mixes those deviations without their adequate weighing. The method considers equal changes of the squares of those deviations with relatively less and greater moduli (absolute values) as equivalent ones. It foresees no iterating, is based on a fixed algorithm accepting no a priori flexibility, and provides no own a posteriori adapting. These defects in the essence of the least square method result in many fundamental shortcomings in its applicability. The method loses any sense and is not applicable at all to problems simulated by a set of equations with different physical dimensions (units), e.g., to a mechanics problem with one equation via the impulse conservation law and another equation via the energy conservation law. There is no result invariance by equivalent transformations of a problem and no objective sense of the result along with possibly ignoring some part of a problem and paradoxical approximation. And the method presents each result as the highest truth without any justification.

For extending the least square method via setting a general problem, let Z ⊆ X × Y be a point set to be approximated by a function (curve) in X × Y with an expansion y = Σk=0 m ak fk(x). Here x ∈ X, y ∈ Y, a∈ R, m ∈ N = {0, 1, 2, ...} with using certain given ”standard” functions fk(x) defined on X with ranges in Y and taken with any real factors (coefficients) to be chosen. The problem consists in finding (in this class) functions with graphs nearest to Z in a certain reasonable sense. To exactly fit this, Z ⊆ { (x, y) | x ∈ X } should hold. Otherwise,

(2S = ΣxZ/XRx (rxk=0  m ak fk(x), y) ), 2S = ΣxZ/Xr2k=0 m ak fk(x), y)

where Rx and rx (which can be similar to distances in Y × Y) are some nonnegative functions generally individual for different x ∈ Z/X; R and r are (independently from x) the second power function and the distance, respectively. 2S has to be minimized as a measure of nearness. Let Z/X be a finite point set

 

{i=0  n  (xi , yi) } = { (x0 , y0),  (x1 , y1), (x2 , y2), ... , (xn , yn) }.

 

To provide belonging all the n + 1 points to the above graph, the set of n + 1 equations

Σk=0  m ak fk(xi) = a0 f0(xi) + a1 f1(xi) + a2f2(xi) + ... + am fm(xi)  = yi , i {0, 1, 2, ... , n}

with m +1 unknown factors (coefficients)a0 , a1 , a2 , ... , am to be found, has to hold. Keeping X arbitrary, consider Y multidimensional and receive the set of m + 1 linear equations

Y Rp , yi = (yi(1), yi(2), ... , yi(p)), fk(xi) = (fk(1)(xi), fk(2)(xi), ... , fk(p)(xi)), p N+ = {1, 2, ... },

Σk=0  m akΣi=0  n Σl=1  p fk(l)(xi) fs(l)(xi) = Σi=0  n Σl=1  p fs(l)(xi) yi(l), s {0, 1, 2, ... , m}.

By the proposed iteration method of the least normed powers [2], instead of absolute errors, unierrors are minimized, and both the power and the unierror parameters may be freely chosen. The known formulae in the least square method can be used also here by m = 2 to determine all the currently unknown variables in the next approximation in the numerators after replacing all the variables in the prior approximation with their values already known in the denominators. Replacing the unierrors with the reserves that naturally have to be maximized brings a further method generalization also applicable to determining the supersolution [2] to a set of relations.

By a unierror equalizing iteration method [2], order the quantiset of the unierrors δλ(k) of each (λth) equation (λ∈Λ). The left-hand sides of two equations with the greatest absolute value of the difference of their unierrors δλ(k) with corresponding indexes λ are setting equal to each other. Repeat this for the remaining equations, and so on. By every step of designing such an equalizing quantiset of equations, each initial equation is used at most once if possible. If this quantiset contains all the possible independent new equations and nevertheless has more than one solution, the required amount of additional new equations that express equalizing the greatest unierrors δλ(k) to zero must be also designed. Stabilizing the equalizing quantiset, its quasisolution approximation, and the unierrors of the separate initial equations and of their quantiset can be a criterion for finishing the iterations.

A direct-solution method [2] proposes the approximation (Q the multiquantity)

 

x= Q(A’)//Q(A)(... + αa/||a||n+ ... + βb/||b||n+ …)//(… + α/||a||n+ … + β/||b||n+ …)

 

to any quantiset A using also A’ after excluding exactly all quantielements with zero elements:

 

A= {... , αa, ... , βb, ... , γ0, ...}°,  A’ = {... , αa, ... , βb, ... }°

 

whose elements ... , a, ... , b, ... , 0, ... belong to a common normed vector space and their quantities ... , α, ... , β, ... , γ, ... are nonnegative real numbers.

These methods are very effective by solving problems simulated by overdetermined quantisets of equations, which is always the case by best approximating experimental data.

 

[1] Encyclopaedia of Mathematics. Ed. M. Hazewinkel. Volumes 1 to 10. Kluwer Academic Publishers, Dordrecht, 1988-1994

[2] Lev Gelimson: Elastic Mathematics. General Strength Theory. The "Collegium" All World Academy of Sciences Publishers, Munich, 2004