Ph. D. & Dr. Sc. Lev Gelimson's Distance Biquadrat Theories in Fundamental Sciences of Estimation, Approximation, Data Modeling and Processing

Distance Biquadrat Theories in Fundamental Sciences of Estimation, Approximation, Data Modeling and Processing

© Ph. D. & Dr. Sc. Lev Gelimson

Academic Institute for Creating Fundamental Sciences (Munich, Germany)

Mathematical Journal

of the "Collegium" All World Academy of Sciences

Munich (Germany)

11 (2011), 11

By estimation, approximation, and data processing, the least square method (LSM) [1] by Legendre and Gauss only usually applies to contradictory (e.g. overdetermined) problems, by methods of finite elements, points, etc. Overmathematics [2, 3] and fundamental sciences of estimation [4], approximation [5], data modeling and processing [6] have discovered a lot of principal shortcomings [7] of the least square method. Additionally consider its simplest approach which is typical. Minimizing the sum of the squared differences of the alone preselected coordinates (e.g., ordinates in a two-dimensional problem) of the graph of the desired approximation function and of everyone among the given data depends on this preselection, ignores the remaining coordinates, and provides no coordinate system rotation invariance and hence no objective sense of the result. Moreover, the method is correct by constant approximation or no data scatter only and gives systematic errors increasing together with data scatter and the deviation (namely declination) of an approximation from a constant. And the second power can be not sufficient, especially by great data point numbers.

In fundamental sciences of estimation [4], approximation [5], data modeling and processing [6], distance biquadrat theories (DBQT), as well as distance quadrat theories (DQT), are valid in the case of coordinate system rotation invariance. By coordinate system translation invariance of the given data, centralize them by subtracting every coordinate of the data center from the corresponding coordinate of every data point. Show the essence of least biquadratic distance theories in the simplest but most important linear approximation in the two-dimensional case in which it is possible to explicitly algebraically provide the unique best quasisolution [2, 3] to the following problem setting.

Given n (n ∈ N⁺ = {1, 2, ...}, n > 2) points [_j=1ⁿ (x'_j , y'_j )] = {(x'₁ , y'₁), (x'₂ , y'₂), ... , (x'_n , y'_n)] with any real coordinates. Use centralization transformation x = x' - Σ_j=1ⁿ x'_j / n , y = y' - Σ_j=1ⁿ y'_j / n to provide coordinate system xOy central for the given data and further work in this system with points [_j=1ⁿ (x_j , y_j)] to be approximated with a straight line y = ax (a is any real constant) containing origin O(0, 0). Case x = 0 (a vertical straight line) should be considered separately if necessary. The distance between this line and the jth data point (x_j , y_j) and further the sum of the biquadratic distances between this line and everyone of the n data points [_j=1ⁿ (x_j , y_j)] are, respectively,

d_j = |ax_j - y_j|/(a² + 1)^1/2,

⁴S(a) = Σ_j=1ⁿ d_j⁴ = Σ_j=1ⁿ(ax_j - y_j)⁴/(a² + 1)² =

[Σ_j=1ⁿ x_j⁴ a⁴ - 4Σ_j=1ⁿ x_j³y_j a³ + 6Σ_j=1ⁿ x_j²y_j² a² - 4Σ_j=1ⁿ x_jy_j³ a + Σ_j=1ⁿ y_j⁴]/(a² + 1)² .

This nonnegative differentiable function takes its least value at a point with vanishing the first order derivative of this function by a :

⁴S'_a = - 4a/(a² + 1)³ Σ_j=1ⁿ (ax_j - y_j)⁴ + 4/(a² + 1)² Σ_j=1ⁿ (ax_j - y_j)³x_j = 0,

Σ_j=1ⁿ x_j³y_j a⁴ + (Σ_j=1ⁿ x_j⁴ - 3Σ_j=1ⁿ x_j²y_j²) a³ + 3(Σ_j=1ⁿ x_jy_j³ - Σ_j=1ⁿ x_j³y_j) a² + (3Σ_j=1ⁿ x_j²y_j² - Σ_j=1ⁿ y_j⁴) a- Σ_j=1ⁿ x_jy_j³ = 0,

a⁴ + (Σ_j=1ⁿ x_j⁴ - 3Σ_j=1ⁿ x_j²y_j²)/Σ_j=1ⁿ x_j³y_j a³ + 3(Σ_j=1ⁿ x_jy_j³/Σ_j=1ⁿ x_j³y_j - 1) a² + (3Σ_j=1ⁿ x_j²y_j² - Σ_j=1ⁿ y_j⁴)/Σ_j=1ⁿ x_j³y_j a- Σ_j=1ⁿ x_jy_j³/Σ_j=1ⁿ x_j³y_j = 0,

a⁴ + A₃ a³ + A₂ a² + A₁ a + A₀ = 0

where

A₃ = (Σ_j=1ⁿ x_j⁴ - 3Σ_j=1ⁿ x_j²y_j²)/Σ_j=1ⁿ x_j³y_j ,

A₂ = 3(Σ_j=1ⁿ x_jy_j³/Σ_j=1ⁿ x_j³y_j - 1),

A₁ = (3Σ_j=1ⁿ x_j²y_j² - Σ_j=1ⁿ y_j⁴)/Σ_j=1ⁿ x_j³y_j ,

A₀ = - Σ_j=1ⁿ x_jy_j³/Σ_j=1ⁿ x_j³y_j ,

To provide vanishing the factor by a³ in this quartic equation in a , following the Ferrari method [1], introduce such h that

a = h - A₃ / 4

and obtain equation (in h)

h⁴ + (A₂ - 3/8 A₃²) h² + (A₁ - 1/2 A₂A₃+ 1/8 A₃³) h + A₀ - 1/4 A₁A₃+ 1/16 A₂A₃²- 3/256 A₃⁴ = 0,

h⁴ + Ph² + Qh + R = 0

where

P = A₂ - 3/8 A₃² ,

Q = A₁ - 1/2 A₂A₃+ 1/8 A₃³ ,

R = A₀ - 1/4 A₁A₃+ 1/16 A₂A₃²- 3/256 A₃⁴ .

Now introduce real-number parameter A , consider

h⁴ + Ph² + Qh + R = (h² + 1/2 P + A)² - S

where

S = 2Ah² - Qh + A² + PA + 1/4 P² - R ,

and find such value A that S is the square of some linear polynomial in h . Then and only then the discriminant of S vanishes:

Q² - 8A(A² + PA + 1/4 P² - R) = 0 ,

A³ + PA² + (1/4 P² - R)A - 1/8 Q² = 0 .

Introducing such B that

A = B - 1/3 P ,

we obtain a reduced cubic equation [1]

B³ + pB + q = 0,

namely

B³ + (- 1/12 P² - R)B + (- 1/108 P³ + 1/3 PR - 1/8 Q²) = 0

with

p = - 1/12 P² - R ,

q = - 1/108 P³ + 1/3 PR - 1/8 Q².

Due to the Cardano formulae [1], we obtain by H = (p/3)³ + (q/2)² > 0 one real solution and two conjugated imaginary solutions, by H = 0 one real solution and another doubled real solution (a triple real solution by p = q = 0), by H < 0 three different real solutions:

B₁ = F + G ,

B_{2 ,
3} = - (F + G)/2 ± 3^1/2/2 i (F - G) where i² = -1,

F = (- q/2 + H^1/2)^1/3,

G = (- q/2 - H^1/2)^1/3,

for each value F , take value G with FG = - p/3; for real equations (which is here the case), take real values of F and G . We consider exclusively real values of B and A .

By such value A , we have

S = 2Ah² - Qh + A² + PA + 1/4 P² - R = 2A[h - Q/(4A)]² ,

h⁴ + Ph² + Qh + R = (h² + 1/2 P + A)² - S = (h² + 1/2 P + A)² - 2A[h - Q/(4A)]² ,

(h² + 1/2 P + A)² - 2A[h - Q/(4A)]² = 0.

Check whether A ≥ 0. If it is the case, then we obtain two quadratic equations in h :

{h² + (2A)^1/2h - Q/[2(2A)^1/2]+ 1/2 P + A}{h² - (2A)^1/2h + Q/[2(2A)^1/2]+ 1/2 P + A} = 0;

h² + (2A)^1/2h - Q/[2(2A)^1/2]+ 1/2 P + A = 0,

h_{1 ,
2} = - (A/2)^1/2 ± {A/2 + Q/[2(2A)^1/2]- 1/2 P - A}^1/2 ;

h² - (2A)^1/2h + Q/[2(2A)^1/2]+ 1/2 P + A = 0,

h_{3 ,
4} = (A/2)^1/2 ± {A/2 - Q/[2(2A)^1/2]- 1/2 P - A}^1/2 .

By each solution h , determine the corresponding solution

a = h - A₃ / 4

and

⁴S''_aa = d² ⁴S(a) / dA² = (20a² - 4)/(a² + 1)⁴ Σ_j=1ⁿ(ax_j - y_j)⁴ - 32a/(a² + 1)³ Σ_j=1ⁿ (ax_j - y_j)³x_j + 12/(a² + 1)² Σ_j=1ⁿ (ax_j - y_j)²x_j² ,

⁴S''_aa = 1/(a² + 1)⁴ [Σ_j=1ⁿ x_j⁴ (- 12a⁴ + 12a²) + Σ_j=1ⁿ x_j³y_j (- 8a⁵ + 64a³ - 24a) + Σ_j=1ⁿ x_j²y_j² (36a⁴ - 96a² + 12) + Σ_j=1ⁿ xy_j³ (- 48a³ + 48a) + Σ_j=1ⁿ y_j⁴ (20a² - 4)]

providing the required minimum of ⁴S(a) by d² ⁴S / dA² > 0 and its maximum by d² ⁴S / dA² < 0, whereas the case d² ⁴S / dA² = 0 needs further investigations.

Compare

⁴S(a) = [Σ_j=1ⁿ x_j⁴ a⁴ - 4Σ_j=1ⁿ x_j³y_j a³ + 6Σ_j=1ⁿ x_j²y_j² a² - 4Σ_j=1ⁿ x_jy_j³ a + Σ_j=1ⁿ y_j⁴]/(a² + 1)²

by a = a_{1 , 2, 3, 4} providing ⁴S_min(a) and ⁴S_max(a).

Determine ⁴S_min(a), ⁴S_max(a), and then

^1|4S = [⁴S_min(a) / ⁴S_max(a)]^1/4

as a measure of data scatter with respect to linear (1st power) approximation and the 4th power of a distance.

This is an upper estimation of data scatter with respect to approximation at all because nonlinear approximation is also possible.

Denote a measure of data scatter with respect to approximation at all with S . Then ^1|4S ≥ S .

Also introduce a measure of data trend with respect to linear approximation and the 4th power of a distance

^1|4T = 1 - ^1|4S = 1 - [⁴S_min(a) / ⁴S_max(a)]^1/4

and a measure of data trend with respect to approximation at all

T = 1 - S .

Then, naturally, ^1|4T ≤ T .

Unlike the least square method, least biquadratic distance theories provide best linear approximation to the given data by rotation invariance, e.g. in the following numeric tests, see Figures 1, 2 with replacing (x’, y’) via (x , y):

Figure 1. ^1|4S = 0.189. ^1|4T = 0.811

Figure 2. ^1|4S = 0.487. ^1|4T = 0.513

Nota bene: By linear approximation, as well as data symmetry axis (and the best linear approximation) y = ± x + C , the results of least biquadratic distance theories and biquadratic mean theories [4-6] coincide. Here y = x + 2. The LSM gives y = 0.909x + 2.364 (Figure 1) and even y = 0.591x + 3.636 (Figure 2) with the same data center (x , y) = (4, 6) and underestimating the modulus (absolute value) of the declination to the x-axis (which is typical) due to considering y-coordinate differences instead of distances with ignoring the declination of the approximation straight line to the x-axis.

Distance biquadrat theories (DBQT) are very efficient in data estimation, approximation, and processing by coordinate system rotation invariance.

Acknowledgements to Anatolij Gelimson for our constructive discussions on coordinate system transformation invariances and his very useful remarks.

References

[1] Encyclopaedia of Mathematics. Ed. M. Hazewinkel. Volumes 1 to 10. Kluwer Academic Publ., Dordrecht, 1988-1994

[2] Lev Gelimson. Providing Helicopter Fatigue Strength: Flight Conditions. In: Structural Integrity of Advanced Aircraft and Life Extension for Current Fleets – Lessons Learned in 50 Years After the Comet Accidents, Proceedings of the 23rd ICAF Symposium, Dalle Donne, C. (Ed.), 2005, Hamburg, Vol. II, 405-416

[3] Lev Gelimson. Overmathematics: Fundamental Principles, Theories, Methods, and Laws of Science. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[4] Lev Gelimson. Fundamental Science of Estimation. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[5] Lev Gelimson. Fundamental Science of Approximation. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[6] Lev Gelimson. Fundamental Science of Data Modeling and Processing. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[7] Lev Gelimson. Corrections and Generalizations of the Least Square Method. In: Review of Aeronautical Fatigue Investigations in Germany during the Period May 2007 to April 2009, Ed. Dr. Claudio Dalle Donne, Pascal Vermeer, CTO/IW/MS-2009-076 Technical Report, International Committee on Aeronautical Fatigue, ICAF 2009, EADS Innovation Works Germany, 2009, 59-60