Quadratic Mean Theories for Two Dimensions in Fundamental Sciences of Estimation, Approximation, Data Modeling and Processing

© Ph. D. & Dr. Sc. Lev Gelimson

Academic Institute for Creating Fundamental Sciences (Munich, Germany)

Mathematical Journal of the "Collegium" All World Academy of Sciences, Munich (Germany), 10 (2010), 4

To solve contradictory (e.g., overdetermined) problems in approximation and data processing, usually only the least squares method (LSM) [1] of Legendre and Gauss is applied. Overmathematics [2, 3] and the fundamental sciences of estimation [4], approximation [5], data modeling [6] and processing [7] have revealed many principal shortcomings [2-8] of this method. In addition, the LSM minimizes the sum of the squared differences of only one preselected coordinate (e.g., the ordinate in a two-dimensional problem) between the graph of the desired approximation function and each of the given data points. The result therefore depends on this preselection, ignores the remaining coordinates, and has no objective meaning. Moreover, the method is correct only in the single case of a constant approximation and gives systematic errors that grow with the slope of the approximation function.

In the fundamental sciences of estimation [4], approximation [5], data modeling [6] and processing [7], quadratic mean theories (QMT) are valid under invariance of the given data with respect to linear transformations of the coordinate system. Let us show the essence of these theories by a linear approximation in the two-dimensional case.

Let n (n ∈ N⁺ = {1, 2, ...}, n > 2) points [_j=1ⁿ (x'_j , y'_j)] = {(x'₁ , y'₁), (x'₂ , y'₂), ... , (x'_n , y'_n)} with arbitrary real coordinates be given. Apply the clearly invariant centralization transformation x = x' - Σ_j=1ⁿ x'_j / n , y = y' - Σ_j=1ⁿ y'_j / n to obtain the coordinate system xOy centered at the mean of the given data, and work further in this system with the points [_j=1ⁿ (x_j , y_j)], which are to be approximated by a straight line y = ax through the origin O(0, 0).
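The centralization step can be illustrated with a minimal Python sketch. The function name `centralize` and the sample points are assumptions made for illustration only; they do not come from the paper.

```python
from statistics import fmean

def centralize(points):
    """Shift the points so that the data center (mean point) becomes the origin."""
    x_mean = fmean(x for x, _ in points)
    y_mean = fmean(y for _, y in points)
    centered = [(x - x_mean, y - y_mean) for x, y in points]
    return centered, (x_mean, y_mean)

# Hypothetical sample data (n = 3 > 2), not taken from the paper.
sample = [(1.0, 2.0), (2.0, 5.0), (6.0, 5.0)]
centered, center = centralize(sample)
print(center)    # data center in the original coordinates
print(centered)  # coordinates in the central system xOy
```

After this shift the sums of the x_j and of the y_j both vanish, which is why the fitted line can be sought in the form y = ax without an intercept.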

First, apply the least squares method [1] in its common form, minimizing the sum of the squared y-coordinate differences between this line and each of the n data points [_j=1ⁿ (x_j , y_j)]:

²_yS(a) = Σ_j=1ⁿ (ax_j - y_j)², ²_yS'_a = 2Σ_j=1ⁿ (ax_j - y_j)x_j = 0, a Σ_j=1ⁿ x_j² = Σ_j=1ⁿ x_jy_j , a_y = Σ_j=1ⁿ x_jy_j / Σ_j=1ⁿ x_j² , ²_yS''_aa = 2Σ_j=1ⁿ x_j² > 0

(in any nontrivial case), so that ²_yS(a) indeed has its minimum at a = a_y , the value of a that minimizes the sum of the squared y-coordinate differences.

Secondly, minimize the sum of the squared x-coordinate differences:

x = (1/a) y , 1/a = a', ²_xS(a') = Σ_j=1ⁿ (a'y_j - x_j)², ²_xS'_a' = 2Σ_j=1ⁿ (a'y_j - x_j)y_j = 0,

a' Σ_j=1ⁿ y_j² = Σ_j=1ⁿ x_jy_j , a'_x = Σ_j=1ⁿ x_jy_j / Σ_j=1ⁿ y_j² , a_x = 1/a'_x = Σ_j=1ⁿ y_j² / Σ_j=1ⁿ x_jy_j , ²_xS''_a'a' = 2Σ_j=1ⁿ y_j² > 0

(in any nontrivial case), so that ²_xS(a') indeed has its minimum at a' = a'_x , which gives a_x as the value of a that minimizes the sum of the squared x-coordinate differences.
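To make the dependence on the preselected coordinate concrete, the two slopes a_y and a_x can be computed side by side. This is a minimal sketch; the name `lsm_slopes` and the centralized sample points are hypothetical, and the sketch assumes Σ x_j² > 0 and Σ x_jy_j ≠ 0.

```python
def lsm_slopes(centered):
    """Return (a_y, a_x) for centralized points approximated by y = a*x."""
    sxx = sum(x * x for x, _ in centered)
    syy = sum(y * y for _, y in centered)
    sxy = sum(x * y for x, y in centered)
    a_y = sxy / sxx  # minimizes the squared y-coordinate differences
    a_x = syy / sxy  # minimizes the squared x-coordinate differences (1 / a'_x)
    return a_y, a_x

# Hypothetical centralized data, not taken from the paper.
centered = [(-2.0, -2.0), (-1.0, 1.0), (3.0, 1.0)]
a_y, a_x = lsm_slopes(centered)
print(a_y, a_x)  # the two slopes generally differ
```

For this sample the two slopes differ by more than a factor of two, although both come from the same data and the same method.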

Now (similarly to the direct solution method in the fundamental science of solving general problems [9]) immediately take the geometric mean of a_x and a_y with the sign of Σ_j=1ⁿ x_jy_j :

a = (a_xa_y)^1/2 sign Σ_j=1ⁿ x_jy_j = (Σ_j=1ⁿ y_j² / Σ_j=1ⁿ x_j²)^1/2 sign Σ_j=1ⁿ x_jy_j ,

y = sign Σ_j=1ⁿ x_jy_j (Σ_j=1ⁿ y_j² / Σ_j=1ⁿ x_j²)^1/2 x

for the centralized data, whereas for the initial noncentralized data we obtain

y' = sign[Σ_j=1ⁿ (x'_j - Σ_j=1ⁿ x'_j / n)(y'_j - Σ_j=1ⁿ y'_j / n)] [Σ_j=1ⁿ (y'_j - Σ_j=1ⁿ y'_j / n)² / Σ_j=1ⁿ (x'_j - Σ_j=1ⁿ x'_j / n)²]^1/2 (x' - Σ_j=1ⁿ x'_j / n) + Σ_j=1ⁿ y'_j / n .
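The formula for the initial noncentralized data can be sketched as follows. The function name `qmt_line` and the sample points are assumptions for illustration (the points are chosen to be symmetric about y = x + 2, but they are not the data of the paper's figures), and the sketch assumes Σ x_j² > 0 after centralization.

```python
import math

def qmt_line(points):
    """Return (slope, intercept) of the quadratic mean line y' = slope*x' + intercept."""
    n = len(points)
    x_mean = sum(x for x, _ in points) / n
    y_mean = sum(y for _, y in points) / n
    sxx = sum((x - x_mean) ** 2 for x, _ in points)
    syy = sum((y - y_mean) ** 2 for _, y in points)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in points)
    sign = (sxy > 0) - (sxy < 0)           # sign of the mixed sum: -1, 0 or 1
    slope = sign * math.sqrt(syy / sxx)    # a = sign * (sum y^2 / sum x^2)^(1/2)
    intercept = y_mean - slope * x_mean    # the line passes through the data center
    return slope, intercept

# Hypothetical data symmetric about y = x + 2, not taken from the paper.
sample = [(2.0, 5.0), (3.0, 4.0), (5.0, 8.0), (6.0, 7.0)]
slope, intercept = qmt_line(sample)
print(slope, intercept)
```

For the same sample, the slope a_y of the ordinary least squares line is 8/10 = 0.8, that is, smaller in absolute value than the slope of the symmetry line.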

If the divisor Σ_j=1ⁿ x_jy_j in a_x is nonzero but has a relatively very small absolute value, namely if |Σ_j=1ⁿ x_jy_j| << (Σ_j=1ⁿ x_j² Σ_j=1ⁿ y_j²)^1/2, the sign used above can become oversensitive to small data variations. In such a case, use either the horizontal straight line y = 0 (y' = Σ_j=1ⁿ y'_j / n) if ²_yS = Σ_j=1ⁿ y_j² - (Σ_j=1ⁿ y_j)²/n < ²_xS = Σ_j=1ⁿ x_j² - (Σ_j=1ⁿ x_j)²/n , or the vertical straight line x = 0 (x' = Σ_j=1ⁿ x'_j / n) if ²_yS > ²_xS . The vertical line cannot be obtained from the general equation y = ax and has to be considered separately.
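The case distinction above might be sketched as follows. The text gives no numeric threshold for "<<", so the parameter `tol` is purely an assumption, as are the function name `choose_line`, the "undecided" result for the tie ²_yS = ²_xS (which the text does not cover), and the sample data.

```python
import math

def choose_line(centered, tol=1e-3):
    """Classify the fit for centralized points.

    Returns ("sloped", a), ("horizontal", None), ("vertical", None)
    or ("undecided", None). `tol` is a hypothetical stand-in for "<<".
    """
    # For centralized data the sums of x_j and y_j vanish, so the two
    # quantities compared in the text reduce to sxx and syy.
    sxx = sum(x * x for x, _ in centered)
    syy = sum(y * y for _, y in centered)
    sxy = sum(x * y for x, y in centered)
    if abs(sxy) <= tol * math.sqrt(sxx * syy):
        if syy < sxx:
            return ("horizontal", None)  # y = 0
        if syy > sxx:
            return ("vertical", None)    # x = 0, not representable as y = a*x
        return ("undecided", None)       # tie: not specified in the text
    sign = 1.0 if sxy > 0 else -1.0
    return ("sloped", sign * math.sqrt(syy / sxx))

# Hypothetical centralized data with a vanishing mixed sum.
flat = [(-2.0, 0.0), (0.0, 0.1), (2.0, 0.0), (0.0, -0.1)]
print(choose_line(flat))
```

Returning the kind of line separately from the slope keeps the vertical case explicit instead of encoding it as an infinite slope.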

There is an even more direct and natural deductive way to obtain the above formula for a . After centralization, additionally introduce the normalization transformation

X = x/(Σ_j=1ⁿ x_j²)^1/2 ,

Y = y/(Σ_j=1ⁿ y_j²)^1/2

to obtain the coordinate system XOY, which is central and normalized for the given data, and work further in this system with the points [_j=1ⁿ (X_j , Y_j)], which are to be approximated by a straight line Y = AX through the origin O(0, 0). Note that y = ax gives

Y(Σ_j=1ⁿ y_j²)^1/2 = a(Σ_j=1ⁿ x_j²)^1/2 X

and hence

A = (Σ_j=1ⁿ x_j² / Σ_j=1ⁿ y_j²)^1/2 a ,

a = (Σ_j=1ⁿ y_j² / Σ_j=1ⁿ x_j²)^1/2 A .
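The normalization and the conversion between the slopes a and A can be checked numerically with a short sketch. The function names and the centralized sample points are hypothetical; the sketch assumes Σ x_j² > 0 and Σ y_j² > 0.

```python
import math

def normalize(centered):
    """Scale centralized points so that sum X_j^2 = sum Y_j^2 = 1."""
    sx = math.sqrt(sum(x * x for x, _ in centered))
    sy = math.sqrt(sum(y * y for _, y in centered))
    return [(x / sx, y / sy) for x, y in centered], sx, sy

def slope_to_normalized(a, sx, sy):
    """A = (sum x^2 / sum y^2)^(1/2) * a."""
    return a * sx / sy

def slope_from_normalized(A, sx, sy):
    """a = (sum y^2 / sum x^2)^(1/2) * A."""
    return A * sy / sx

# Hypothetical centralized data, not taken from the paper.
centered = [(-2.0, -2.0), (-1.0, 1.0), (3.0, 1.0)]
normalized, sx, sy = normalize(centered)
a = math.sqrt(6.0 / 14.0)              # (sum y^2 / sum x^2)^(1/2) for this sample
print(slope_to_normalized(a, sx, sy))  # close to 1
```

In the normalized system the slope (Σ y_j² / Σ x_j²)^1/2 of the centralized system corresponds to A = 1.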

Instead of considering the squared differences of the y-coordinates and of the x-coordinates separately as before, now consider the sum of the squared differences of the Y-coordinates and of the X-coordinates at once (for a ≠ 0 and A ≠ 0), which is reasonable because normalization equalizes the weights of both data point coordinates:

²_YXS(A) = Σ_j=1ⁿ [(AX_j - Y_j)² + (Y_j/A - X_j)²],

²_YXS'_A = 2Σ_j=1ⁿ [(AX_j - Y_j)X_j + (Y_j/A - X_j)Y_j(-1/A²)] = 0,

Σ_j=1ⁿ X_j² A - Σ_j=1ⁿ X_jY_j - Σ_j=1ⁿ Y_j² / A³ + Σ_j=1ⁿ X_jY_j / A² = 0,

Σ_j=1ⁿ X_j² A⁴ - Σ_j=1ⁿ X_jY_j A³ + Σ_j=1ⁿ X_jY_j A - Σ_j=1ⁿ Y_j² = 0.

Note that, due to normalization,

Σ_j=1ⁿ X_j² = Σ_j=1ⁿ [x_j / (Σ_k=1ⁿ x_k²)^1/2]² = 1,

Σ_j=1ⁿ Y_j² = Σ_j=1ⁿ [y_j / (Σ_k=1ⁿ y_k²)^1/2]² = 1.

Then we obtain

A⁴ - Σ_j=1ⁿ X_jY_j A³ + Σ_j=1ⁿ X_jY_j A - 1 = 0,

(A² - 1)(A² - Σ_j=1ⁿ X_jY_j A + 1) = 0.

This representation of the fourth-degree equation in the single unknown A immediately gives two of its solutions:

A² - 1 = 0,

A₁ = 1,

A₂ = - 1.

Note that for any real X_j and Y_j , the Cauchy-Schwarz inequality

(Σ_j=1ⁿ X_jY_j)² ≤ Σ_j=1ⁿ X_j² Σ_j=1ⁿ Y_j²

holds. Here, due to normalization, we have

(Σ_j=1ⁿ X_jY_j)² ≤ Σ_j=1ⁿ X_j² Σ_j=1ⁿ Y_j² = 1 .

Hence the discriminant

(Σ_j=1ⁿ X_jY_j)² - 4

of the remaining quadratic equation

A² - Σ_j=1ⁿ X_jY_j A + 1 = 0

is negative, both solutions to this equation are complex (non-real), and the fourth-degree equation in the unknown A has no further real solutions.

Compare the values of

²_YXS(A) = Σ_j=1ⁿ X_j² A² - 2Σ_j=1ⁿ X_jY_j A + Σ_j=1ⁿ Y_j² + Σ_j=1ⁿ Y_j² / A² - 2Σ_j=1ⁿ X_jY_j / A + Σ_j=1ⁿ X_j²,

or, due to normalization,

²_YXS(A) = A² - 2Σ_j=1ⁿ X_jY_j A + 2 - 2Σ_j=1ⁿ X_jY_j / A + 1/A²,

at A = A₁ and A = A₂ , which provide ²_YXS_min(A) and ²_YXS_max(A), or simply ²S_min(A) and ²S_max(A).

Note that if, theoretically,

Σ_j=1ⁿ X_jY_j = 0,

or, practically,

|Σ_j=1ⁿ X_jY_j| << 1,

we have to investigate the pair of straight lines Y = 0 and X = 0.

Otherwise, we have the lines Y = X and Y = - X obtained above for a ≠ 0 and A ≠ 0.

Namely

A = sign(Σ_j=1ⁿ X_jY_j)

provides ²S_min(A), whereas

A = - sign(Σ_j=1ⁿ X_jY_j)

provides ²S_max(A).
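The assignment of the two stationary values can be illustrated by evaluating the normalized expression for ²_YXS(A) at A = 1 and A = -1. In this sketch, c again stands for Σ_j=1ⁿ X_jY_j ; the value 0.8 and the function name `s_yx` are assumptions.

```python
def s_yx(A, c):
    """Normalized sum A^2 - 2cA + 2 - 2c/A + 1/A^2 with c = sum of X_j*Y_j, A != 0."""
    return A * A - 2 * c * A + 2 - 2 * c / A + 1 / (A * A)

c = 0.8  # hypothetical positive value of the mixed sum
s_plus = s_yx(1.0, c)    # value at A = sign(c)
s_minus = s_yx(-1.0, c)  # value at A = -sign(c)
print(s_plus, s_minus)   # approximately 0.8 and 7.2
```

For this c the value at A = sign(c) is the smaller of the two, and nearby slopes such as A = 0.9 or A = 1.1 give larger sums than A = 1.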

Determine ²S_min(A), ²S_max(A), and then define and determine

S_L = [²S_min(A) / ²S_max(A)]^1/2

as a measure of data scatter with respect to linear approximation.

This is an upper estimate of the data scatter with respect to approximation in general, because nonlinear approximation is also possible.

Denote a measure of data scatter with respect to approximation in general by S . Then S_L ≥ S .

Also introduce a measure of data trend with respect to linear approximation

T_L = 1 - S_L = 1 - [²S_min(A) / ²S_max(A)]^1/2

and a measure of data trend with respect to approximation in general

T = 1 - S .

Then, naturally, T_L ≤ T .
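Substituting A = ± 1 into the normalized expression for ²_YXS(A) gives the two values 4(1 - |c|) and 4(1 + |c|) with c = Σ_j=1ⁿ X_jY_j , so S_L and T_L depend on c only. The sketch below uses this closed form; the function name is hypothetical. The sample values c = 0.909 and c = 0.591 are an inference rather than data from the paper: they assume Σ_j=1ⁿ y_j² = Σ_j=1ⁿ x_j² , in which case c coincides with the LSM slope a_y reported for Figures 1 and 2.

```python
import math

def scatter_and_trend(c):
    """Return (S_L, T_L) for the normalized mixed sum c, with |c| <= 1."""
    s_min = 4.0 * (1.0 - abs(c))  # value of the sum at A = sign(c)
    s_max = 4.0 * (1.0 + abs(c))  # value of the sum at A = -sign(c)
    s_l = math.sqrt(s_min / s_max)
    return s_l, 1.0 - s_l

# Values of c inferred from the reported LSM slopes (an assumption, see above).
for c in (0.909, 0.591):
    s_l, t_l = scatter_and_trend(c)
    print(round(s_l, 3), round(t_l, 3))
```

Under the stated assumption, the rounded results agree with the values S_L and T_L given in the captions of Figures 1 and 2.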

It is possible to give still more universal (but much more complicated) formulae for a and y . Namely, denote

t = |Σ_j=1ⁿ x_jy_j|/(Σ_j=1ⁿ x_j² Σ_j=1ⁿ y_j²)^1/2 ,

r = 1/2 - t + |1/2 - t|.

Then

a = (Σ_j=1ⁿ y_j² / Σ_j=1ⁿ x_j²)^1/2 t^r sign Σ_j=1ⁿ x_jy_j ,

y = sign Σ_j=1ⁿ x_jy_j (Σ_j=1ⁿ y_j² / Σ_j=1ⁿ x_j²)^1/2 t^r x

for the centralized data, whereas for the initial noncentralized data we obtain

y' = sign[Σ_j=1ⁿ (x'_j - Σ_j=1ⁿ x'_j / n)(y'_j - Σ_j=1ⁿ y'_j / n)] [Σ_j=1ⁿ (y'_j - Σ_j=1ⁿ y'_j / n)² / Σ_j=1ⁿ (x'_j - Σ_j=1ⁿ x'_j / n)²]^1/2 t^r (x' - Σ_j=1ⁿ x'_j / n) + Σ_j=1ⁿ y'_j / n .
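The role of the factor t^r becomes visible in a short sketch: for t ≥ 1/2 the exponent r vanishes and the earlier formula for a is recovered, whereas for t < 1/2 the slope is damped continuously down to 0 at t = 0. The function names and sample data are assumptions; the sketch assumes Σ x_j² > 0 and Σ y_j² > 0.

```python
import math

def damping(t):
    """Return t**r with r = 1/2 - t + |1/2 - t| (r = 0 for t >= 1/2)."""
    r = 0.5 - t + abs(0.5 - t)
    return t ** r

def qmt_slope_general(centered):
    """Slope a of the more universal formula for centralized points."""
    sxx = sum(x * x for x, _ in centered)
    syy = sum(y * y for _, y in centered)
    sxy = sum(x * y for x, y in centered)
    t = abs(sxy) / math.sqrt(sxx * syy)
    sign = (sxy > 0) - (sxy < 0)
    return sign * math.sqrt(syy / sxx) * damping(t)

# Hypothetical centralized data: a strong trend and no trend at all.
strong = [(-2.0, -1.0), (-1.0, -2.0), (1.0, 2.0), (2.0, 1.0)]  # t = 0.8
none = [(-2.0, -1.0), (-1.0, 2.0), (1.0, -2.0), (2.0, 1.0)]    # t = 0
print(qmt_slope_general(strong), qmt_slope_general(none))
print(damping(0.25))  # r = 0.5, so the factor is 0.25**0.5
```

Note that in the weak-trend range this formula always yields a finite slope, so the vertical line x = 0 is still not produced by it.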

Unlike the LSM, QMT provide the best linear approximation to the given data, e.g., in the numeric tests in Figures 1 and 2, where (x', y') is replaced with (x , y):

Figure 1. S_L = 0.218. T_L = 0.782

Figure 2. S_L = 0.507. T_L = 0.493

Nota bene: For linear approximation, the results of distance quadrat theories (DQT) and general theories of moments of inertia (GTMI) [4, 5] coincide. If Σ_j=1ⁿ y_j² = Σ_j=1ⁿ x_j² (with the best linear approximation y = ± x + C), the same also holds for QMT; here y = x + 2 (Figures 1, 2). If Σ_j=1ⁿ y_j² ≠ Σ_j=1ⁿ x_j² , QMT give results different from those of DQT and GTMI, but QMT are valid under another invariance type than DQT and GTMI. The data symmetry straight line y = x + 2 is the best linear approximation in both tests above. The LSM gives y = 0.909x + 2.364 (Figure 1) and even y = 0.591x + 3.636 (Figure 2) with the same data center (4, 6). It thus underestimates the absolute value of the slope with respect to the x-axis (which is typical), because it considers y-coordinate differences instead of distances and ignores the inclination of the approximating straight line to the x-axis.

Quadratic mean theories are very efficient in data estimation, approximation, and processing and are reliable even under large data scatter.

The author thanks Anatolij Gelimson for constructive discussions on coordinate system transformation invariances and for his very useful remarks.

References

[1] Encyclopaedia of Mathematics. Ed. M. Hazewinkel. Volumes 1 to 10. Kluwer Academic Publ., Dordrecht, 1988-1994

[2] Lev Gelimson. Providing Helicopter Fatigue Strength: Flight Conditions. In: Structural Integrity of Advanced Aircraft and Life Extension for Current Fleets – Lessons Learned in 50 Years After the Comet Accidents, Proceedings of the 23rd ICAF Symposium, Dalle Donne, C. (Ed.), 2005, Hamburg, Vol. II, 405-416

[3] Lev Gelimson. Overmathematics: Fundamental Principles, Theories, Methods, and Laws of Science. The "Collegium" All World Academy of Sciences Publishers, Munich, 2010

[4] Lev Gelimson. Fundamental Science of Estimation. The "Collegium" All World Academy of Sciences Publishers, Munich, 2010

[5] Lev Gelimson. Fundamental Science of Approximation. The "Collegium" All World Academy of Sciences Publishers, Munich, 2010

[6] Lev Gelimson. Fundamental Science of Data Modeling. The "Collegium" All World Academy of Sciences Publishers, Munich, 2010

[7] Lev Gelimson. Fundamental Science of Data Processing. The "Collegium" All World Academy of Sciences Publishers, Munich, 2010

[8] Lev Gelimson. Corrections and Generalizations of the Least Square Method. In: Review of Aeronautical Fatigue Investigations in Germany during the Period May 2007 to April 2009, Ed. Dr. Claudio Dalle Donne, Pascal Vermeer, CTO/IW/MS-2009-076 Technical Report, International Committee on Aeronautical Fatigue, ICAF 2009, EADS Innovation Works Germany, 2009, 59-60

[9] Lev Gelimson. Fundamental Science of Solving General Problems. The "Collegium" All World Academy of Sciences Publishers, Munich, 2010