Ph. D. & Dr. Sc. Lev Gelimson's Distance Quadrat Theories for Three Dimensions in Fundamental Sciences of Estimation, Approximation, Data Modeling and Processing

Distance Quadrat Theories for Three Dimensions in Fundamental Sciences of Estimation, Approximation, Data Modeling and Processing

© Ph. D. & Dr. Sc. Lev Gelimson

Academic Institute for Creating Fundamental Sciences (Munich, Germany)

Mathematical Journal

of the "Collegium" All World Academy of Sciences

Munich (Germany)

11 (2011), 19

By estimation, approximation, and data processing, the least square method (LSM) [1] by Legendre and Gauss only usually applies to contradictory (e.g. overdetermined) problems, by methods of finite elements, points, etc. Overmathematics [2, 3] and fundamental sciences of estimation [4], approximation [5], and data processing [6] have discovered a lot of principal shortcomings [7] of the least square method. Additionally consider its simplest approach which is typical. Minimizing the sum of the squared differences of the alone preselected coordinates (e.g., ordinates in a two-dimensional problem) of the graph of the desired approximation function and of everyone among the given data depends on this preselection, ignores the remaining coordinates, and provides no coordinate system rotation invariance and hence no objective sense of the result. Moreover, the method is correct by constant approximation or no data scatter only and gives systematic errors increasing together with data scatter and the deviation (namely declination) of an approximation from a constant.

In fundamental sciences of estimation [4], approximation [5], and data processing [6], distance quadrat theories (DQT) are valid in the case of coordinate system rotation invariance. By coordinate system translation invariance of the given data, centralize them by subtracting every coordinate of the data center from the corresponding coordinate of every data point.

Additionally to showing the essence of least squared distance theories in the simplest but most important linear approximation in the two-dimensional case in which it is possible to explicitly algebraically provide the unique best quasisolution [2, 3] to the corresponding problem setting, now consider the two-dimensional case, too.

Given n (n ∈ N⁺ = {1, 2, ...}, n > 3) points [_j=1ⁿ (x'_j , y'_j , z'_j)] = {(x'₁ , y'₁ , z'₁), (x'₂ , y'₂ , z'₂), ... , (x'_n , y'_n , z'_n)] with any real coordinates. Use clearly invariant centralization transformation x = x' - Σ_j=1ⁿ x'_j / n , y = y' - Σ_j=1ⁿ y'_j / n , z = z' - Σ_j=1ⁿ z'_j / n to provide coordinate system Oxyz central for the given data and further work in this system with points [_j=1ⁿ (x_j , y_j , z_j)] to be approximated with a plane ax + by + cz = 0 containing origin O(0, 0, 0).

Case c = 0 bringing clear simplification should be considered separately. Now regard the general case c ≠ 0. It is possible to equivalently transform plane equation ax + by + cz = 0 via simultaneously multiplicating (or dividing) all the three factors a , b , and c by any nonzero number, in particular - 1/c (or - c , respectively). Therefore, we can consider case c = -1 and the plane equation z = ax + by only.

The distance between this plane and the jth data point (x_j , y_j , z_j) and further the sum of the squared distances between this plane and everyone of the n data points [_j=1ⁿ (x_j , y_j , z_j)] are, respectively,

d_j = |ax_j + by_j - z_j|/(1 + a² + b²)^1/2,

²S(a , b) = Σ_j=1ⁿ d_j² = Σ_j=1ⁿ(ax_j + by_j - z_j)²/(1 + a² + b²).

This nonnegative differentiable function takes its minimum at a point with vanishing the both first order derivatives of this function by a and b:

²S'_a = - 2a/(1 + a² + b²)² Σ_j=1ⁿ (x_j²a² + y_j²b² + z_j² + 2x_jy_jab - 2x_jz_ja - 2y_jz_jb) + 2/(1 + a² + b²) Σ_j=1ⁿ (x_j²a + x_jy_jb - x_jz_j) = 0,

²S'_b = - 2b/(1 + a² + b²)² Σ_j=1ⁿ (x_j²a² + y_j²b² + z_j² + 2x_jy_jab - 2x_jz_ja - 2y_jz_jb) + 2/(1 + a² + b²) Σ_j=1ⁿ (x_jy_ja + y_j²b - y_jz_j) = 0;

- Σ_j=1ⁿ x_jy_ja²b +Σ_j=1ⁿ x_jz_ja²+ (Σ_j=1ⁿ x_j² - Σ_j=1ⁿ y_j²)ab² + 2Σ_j=1ⁿ y_jz_jab+ (Σ_j=1ⁿ x_j² - Σ_j=1ⁿ z_j²)a + Σ_j=1ⁿ x_jy_jb³ - Σ_j=1ⁿ x_jz_jb² + Σ_j=1ⁿ x_jy_jb - Σ_j=1ⁿ x_jz_j = 0,

Σ_j=1ⁿ x_jy_ja³+ (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ x_j²)a²b - Σ_j=1ⁿ x_jy_jab² - Σ_j=1ⁿ y_jz_ja²+ 2Σ_j=1ⁿ x_jz_jab + Σ_j=1ⁿ y_jz_jb² + Σ_j=1ⁿ x_jy_ja + (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ z_j²)b - Σ_j=1ⁿ y_jz_j = 0;

(Σ_j=1ⁿ x_j² - Σ_j=1ⁿ z_j²)a + Σ_j=1ⁿ x_jy_jb = Σ_j=1ⁿ x_jz_j - Σ_j=1ⁿ x_jz_ja²- 2Σ_j=1ⁿ y_jz_jab + Σ_j=1ⁿ x_jz_jb²+ Σ_j=1ⁿ x_jy_ja²b + (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ x_j²)ab² - Σ_j=1ⁿ x_jy_jb³ ,

Σ_j=1ⁿ x_jy_ja + (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ z_j²)b = Σ_j=1ⁿ y_jz_j + Σ_j=1ⁿ y_jz_ja²- 2Σ_j=1ⁿ x_jz_jab - Σ_j=1ⁿ y_jz_jb² - Σ_j=1ⁿ x_jy_ja³+ (Σ_j=1ⁿ x_j² - Σ_j=1ⁿ y_j²)a²b + Σ_j=1ⁿ x_jy_jab² .

Nota bene: The last two equations in a and b (namely in these forms) are very suitable for using iteration approach. Let us use the next approximations a_k+1 and b_k+1 (k = 0, 1, 2, ...) to a and b , respectively, instead of a and b themselves in the left-hand parts of these equations, as well as the previous approximations a_k and b_k to a and b , respectively, instead of a and b themselves in the right-hand parts of these equations:

(Σ_j=1ⁿ x_j² - Σ_j=1ⁿ z_j²)a_k+1 + Σ_j=1ⁿ x_jy_jb_k+1 = Σ_j=1ⁿ x_jz_j - Σ_j=1ⁿ x_jz_ja_k²- 2Σ_j=1ⁿ y_jz_ja_kb_k + Σ_j=1ⁿ x_jz_jb_k²+ Σ_j=1ⁿ x_jy_ja_k²b_k + (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ x_j²)a_kb_k² - Σ_j=1ⁿ x_jy_jb_k³ ,

Σ_j=1ⁿ x_jy_ja_k+1 + (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ z_j²)b_k+1 = Σ_j=1ⁿ y_jz_j + Σ_j=1ⁿ y_jz_ja_k²- 2Σ_j=1ⁿ x_jz_ja_kb_k - Σ_j=1ⁿ y_jz_jb_k²- Σ_j=1ⁿ x_jy_ja_k³+ (Σ_j=1ⁿ x_j² - Σ_j=1ⁿ y_j²)a_k²b_k + Σ_j=1ⁿ x_jy_ja_kb_k².

The last two formulae provide simply determining every next approximations a_k+1 and b_k+1 (k = 0, 1, 2, ...) to a and b , respectively, via the previous approximations a_k and b_k to a and b , respectively.

To begin with, there are many reasonable possibilities for the initial approximations a₀ and b₀ to a and b , respectively, e.g.:

1) zero approximations a₀ = 0 and b₀ = 0;

2) the solution via the least square method (LSM) [1]

²S(a , b) = Σ_j=1ⁿ(ax_j + by_j - z_j)²;

²S'_a = 2Σ_j=1ⁿ (x_j²a + x_jy_jb - x_jz_j) = 0,

²S'_b = 2Σ_j=1ⁿ (x_jy_ja + y_j²b - y_jz_j) = 0

via solving equation set

Σ_j=1ⁿ x_j² a + Σ_j=1ⁿ x_jy_jb = Σ_j=1ⁿ x_jz_j ,

Σ_j=1ⁿ x_jy_ja + Σ_j=1ⁿ y_j² b = Σ_j=1ⁿ y_jz_j

in a and b .

Nota bene: Zero approximations a₀ = 0 and b₀ = 0 lead to similar but another equation set in a and b for determining a₁ and b₁ :

(Σ_j=1ⁿ x_j² - Σ_j=1ⁿ z_j²)a + Σ_j=1ⁿ x_jy_jb = Σ_j=1ⁿ x_jz_j ,

Σ_j=1ⁿ x_jy_ja + (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ z_j²)b = Σ_j=1ⁿ y_jz_j .

By linear approximation in the two-dimensional case in which it is possible to explicitly algebraically provide the unique best quasisolution [2, 3] to the corresponding problem setting but iteration approach is also possible, the corresponding two sets of equations coincide. Namely, for straight line z = ax , the least square method (LSM) [1] gives

²S(a , b) = Σ_j=1ⁿ(ax_j - z_j)²;

²S'_a = 2Σ_j=1ⁿ (x_j²a - x_jz_j) = 0

and equation set consisting of one equation

Σ_j=1ⁿ x_j² a = Σ_j=1ⁿ x_jz_j

in a . Distance quadrat theories (DQT) give

²S'_a = - 2a/(1 + a²)² Σ_j=1ⁿ (x_j²a²+ z_j² - 2x_jz_ja) + 2/(1 + a²) Σ_j=1ⁿ (x_j²a - x_jz_j) = 0

and for zero approximation a₀ = 0, the same equation set consisting of one equation

Σ_j=1ⁿ x_j² a = Σ_j=1ⁿ x_jz_j

in a .

This is an interesting phenomenon of adding the third dimension.

Now we also see that it is possible to introduce another iteration approach than above. Namely, at once substitute every next approximations a_k+1 and b_k+1 (k = 0, 1, 2, ...) to a and b , respectively, for a and b in expressions

x_j²a + x_jy_jb - x_jz_j ,

x_jy_ja + y_j²b - y_jz_j

only of the initial formulae for ²S'_a and ²S'_b because these two expressions only correspond to ignoring the plane declinations, e.g. via the least square method (LSM) [1]. At all the remaining occurences of a and b in the initial formulae for ²S'_a and ²S'_b , substitute every previous approximations a_k and b_k to a and b , respectively. Then we obtain:

(²S'_a =) - 2a_k/(1 + a_k² + b_k²)² Σ_j=1ⁿ (x_j²a_k² + y_j²b_k² + z_j² + 2x_jy_ja_kb_k - 2x_jz_ja_k - 2y_jz_jb_k) + 2/(1 + a_k² + b_k²) Σ_j=1ⁿ (x_j²a_k+1 + x_jy_jb_k+1 - x_jz_j) = 0,

(²S'_b =) - 2b_k/(1 + a_k² + b_k²)² Σ_j=1ⁿ (x_j²a_k² + y_j²b_k² + z_j² + 2x_jy_ja_kb_k - 2x_jz_ja_k - 2y_jz_jb_k) + 2/(1 + a_k² + b_k²) Σ_j=1ⁿ (x_jy_ja_k+1 + y_j²b_k+1 - y_jz_j) = 0;

Σ_j=1ⁿ (x_j²a_k+1 + x_jy_jb_k+1 - x_jz_j) = a_k/(1 + a_k² + b_k²) Σ_j=1ⁿ (x_j²a_k² + y_j²b_k² + z_j² + 2x_jy_ja_kb_k - 2x_jz_ja_k - 2y_jz_jb_k),

Σ_j=1ⁿ (x_jy_ja_k+1 + y_j²b_k+1 - y_jz_j)= b_k/(1 + a_k² + b_k²) Σ_j=1ⁿ (x_j²a_k² + y_j²b_k² + z_j² + 2x_jy_ja_kb_k - 2x_jz_ja_k - 2y_jz_jb_k);

Σ_j=1ⁿ x_j²a_k+1 + Σ_j=1ⁿ x_jy_j b_k+1 = Σ_j=1ⁿ x_jz_j + a_k/(1 + a_k² + b_k²) (Σ_j=1ⁿ x_j²a_k² + Σ_j=1ⁿ y_j²b_k² + Σ_j=1ⁿ z_j² + 2Σ_j=1ⁿ x_jy_j a_kb_k - 2Σ_j=1ⁿ x_jz_j a_k - 2Σ_j=1ⁿy_jz_j b_k),

Σ_j=1ⁿ x_jy_j a_k+1 + Σ_j=1ⁿ y_j²b_k+1 = Σ_j=1ⁿ y_jz_j + b_k/(1 + a_k² + b_k²) (Σ_j=1ⁿ x_j²a_k² + Σ_j=1ⁿ y_j²b_k² + Σ_j=1ⁿ z_j² + 2Σ_j=1ⁿ x_jy_ja_kb_k - 2Σ_j=1ⁿ x_jz_j a_k - 2Σ_j=1ⁿ y_jz_jb_k).

In this second approach, zero approximations a₀ = 0 and b₀ = 0 lead to the same equation set in a and b for determining a₁ and b₁ as the least square method (LSM) [1] does:

(Σ_j=1ⁿ x_j² - Σ_j=1ⁿ z_j²)a + Σ_j=1ⁿ x_jy_jb = Σ_j=1ⁿ x_jz_j ,

Σ_j=1ⁿ x_jy_ja + (Σ_j=1ⁿ y_j² - Σ_j=1ⁿ z_j²)b = Σ_j=1ⁿ y_jz_j .

Therefore, this second, so-called essential, approach seems to be less formal and more natural than the first, so-called formal, approach with formally substituting every next approximations a_k+1 and b_k+1 (k = 0, 1, 2, ...) to a and b , respectively, for a and b in all the linear terms, the powers of a and b being added, whereasevery previous approximations a_k and b_k to a and b , respectively, are substituted for a and b , respectively, in all the remaining (higher, nonlinear) terms, which provides simple solvability of the equation set expressing every next approximations a_k+1 and b_k+1 (k = 0, 1, 2, ...) to a and b , respectively, via every previous approximations a_k and b_k to a and b , respectively.

Due to intuition, we can expect that the essential approach can give results in less iteration steps than the formal approach does.

Nota bene: Both the essential and the formal approaches principally give the same results which are solutions to their common initial equation setsproviding vanishing the both derivatives of the first order before considering iterations via substituting approximations to a and b .

Distance quadrat theories (DQT) are very efficient in data estimation, approximation, and processing by coordinate system rotation invariance.

Acknowledgements to Anatolij Gelimson for our constructive discussions on coordinate system transformation invariances and his very useful remarks.

References

[1] Encyclopaedia of Mathematics. Ed. M. Hazewinkel. Volumes 1 to 10. Kluwer Academic Publ., Dordrecht, 1988-1994

[2] Lev Gelimson. Providing Helicopter Fatigue Strength: Flight Conditions. In: Structural Integrity of Advanced Aircraft and Life Extension for Current Fleets – Lessons Learned in 50 Years After the Comet Accidents, Proceedings of the 23rd ICAF Symposium, Dalle Donne, C. (Ed.), 2005, Hamburg, Vol. II, 405-416

[3] Lev Gelimson. Overmathematics: Fundamental Principles, Theories, Methods, and Laws of Science. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[4] Lev Gelimson. Fundamental Science of Estimation. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[5]Lev Gelimson. Fundamental Science of Approximation. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[6] Lev Gelimson. Fundamental Science of Data Processing. The ”Collegium” All World Academy of Sciences Publishers, Munich, 2010

[7] Lev Gelimson. Corrections and Generalizations of the Least Square Method. In: Review of Aeronautical Fatigue Investigations in Germany during the Period May 2007 to April 2009, Ed. Dr. Claudio Dalle Donne, Pascal Vermeer, CTO/IW/MS-2009-076 Technical Report, International Committee on Aeronautical Fatigue, ICAF 2009, EADS Innovation Works Germany, 2009, 59-60