[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Smoothing noisy data with spline functions

Published: 01 December 1978 Publication History

Abstract

Smoothing splines are well known to provide nice curves which smooth discrete, noisy data. We obtain a practical, effective method for estimating the optimum amount of smoothing from the data. Derivatives can be estimated from the data by differentiating the resulting (nearly) optimally smoothed spline.
We consider the model y i ( t i )+ i , i =1, 2, ..., n , t i [0, 1], where g W 2 ( m ) ={ f : f , f , ..., f ( m 1) abs. cont., f ( m ) 2[0,1]}, and the { i } are random errors with E i =0, E i j = 2 ij . The error variance 2 may be unknown. As an estimate of g we take the solution g n, to the problem: Find f W 2 (m) to minimize $$\frac{1}{n}\sum\limits_{j = 1}^n {(f(t_j ) - y_j )^2 + \lambda \int\limits_0^1 {(f^{(m)} (u))^2 du} }$$ . The function g n, is a smoothing polynomial spline of degree 2 m 1. The parameter controls the tradeoff between the "roughness" of the solution, as measured by $$\int\limits_0^1 {[f^{(m)} (u)]^2 du}$$ , and the infidelity to the data as measured by $$\frac{1}{n}\sum\limits_{j = 1}^n {(f(t_j ) - y_j )^2 }$$ , and so governs the average square error R( ; g)=R( ) defined by $$R(\lambda ) = \frac{1}{n}\sum\limits_{j = 1}^n {(g_{n,\lambda } (t_j ) - g(t_j ))^2 }$$ . We provide an estimate $$\hat \lambda$$ , called the generalized cross-validation estimate, for the minimizer of R( ) . The estimate $$\hat \lambda$$ is the minimizer of V ( ) defined by $$V(\lambda ) = \frac{1}{n}\parallel (I - A(\lambda ))y\parallel ^2 /\left[ {\frac{1}{n}{\text{Trace(}}I - A(\lambda ))} \right]^2$$ , where y=(y 1, ..., y n)t and A ( ) is the n n matrix satisfying (g n, ( t 1), ..., g n, ( t n))t= A ( ) y . We prove that there exist a sequence of minimizers $$\tilde \lambda = \tilde \lambda (n)$$ of EV( ) , such that as the (regular) mesh {t i} i=1 n becomes finer, $$\mathop {\lim }\limits_{n \to \infty } ER(\tilde \lambda )/\mathop {\min }\limits_\lambda ER(\lambda ) \downarrow 1$$ . A Monte Carlo experiment with several smooth g 's was tried with m =2, n =50 and several values of 2, and typical values of $$R(\hat \lambda )/\mathop {\min }\limits_\lambda R(\lambda )$$ were found to be in the range 1.01---1.4. The derivative g of g can be estimated by $$g'_{n,\hat \lambda } (t)$$ . In the Monte Carlo examples tried, the minimizer of $$R_D (\lambda ) = \frac{1}{n}\sum\limits_{j = 1}^n {(g'_{n,\lambda } (t_j ) - } g'(t_j ))$$ tended to be close to the minimizer of R(¿) , so that $$\hat \lambda$$ was also a good value of the smoothing parameter for estimating the derivative.

References

[1]
Abramowitz, M., Stegun, I.: Handbook of mathematical functions with formulas, graphs, and mathematical tables. U.S. Department of Commerce, National Bureau of Standards Applied Mathematics Series No.55, pp. 803---819, 1964
[2]
Aronszajn, N.: Theory of reproducing kernels. Trans. Amer. Math. Soc.68, 337---404 (1950)
[3]
Golomb, M.: Approximation by periodic spline interpolants on uniform meshes. J. Approximation Theory1, 26---65 (1968)
[4]
Golub, G., Heath, M., Wahba, G.: Generalized cross validation as a method for choosing a good ridge parameter, to appear, Technometrics
[5]
Golub, G., Reinsch, C.: Singular value decomposition and least squares solutions. Numer. Math.14, 403---420 (1970)
[6]
Hudson, H.M.: Empirical Bayes estimation. Technical Report #58, Stanford University, Department of Statistics, Stanford, Cal., 1974
[7]
Kimeldorf, G., Wahba, G.: A correspondence between Bayesian estimation on stochastic processes and smoothing by splines. Ann. Inst. Statist. Math.41, 495---502 (1970)
[8]
Mallows, C.L.: Some comments onCp. Technometrics15, 661---675 (1973)
[9]
Reinsch, C.M.: Smoothing by spline functions. Numer. Math.10, 177---183 (1967)
[10]
Reinsch, C.M.: Smoothing by spline functions, II. Numer. Math.16, 451---454 (1971)
[11]
Schoenberg, I.J.: Spline functions and the problem of graduation. Proc. Nat. Acad. Sci. (USA)52, 947---950 (1964)
[12]
Wahba, G.: Convergence rates for certain approximate solutions to first kind integral equations. J. Approximation Theory7, 167---185 (1973)
[13]
Wahba, G.: Smoothing noisy data with spline functions. Numer. Math.24, 383---393 (1975)
[14]
Wahba, G.: Practical approximate solutions to linear operator equations when the data are noisy. SIAM J. Numer. Anal.14, 651---667 (1977)
[15]
Wahba, G., Wold, S.: A completely automatic French curve: Fitting spline functions by crossvalidation. Comm. Statist.4, 1---17 (1975)
[16]
Wahba, G., Wold, S.: Periodic splines for spectral density estimation: The use of cross-validation for determining the degree of smoothing. Comm. Statist.4, 125---141 (1975)
[17]
Wahba, G.: A survey of some smoothing problems and the method of generalized cross validation for solving them. University of Wisconsin-Madison, Statistics Dept., Technical Report #457. In: Proceedings of the Conference on Applications of Statistics, Dayton, Ohio (P.R. Krishnaiah, ed.) June 14---18, 1976
[18]
Wahba, G.: Improper priors, spline smoothing and the problem of guarding against model errors in regression. J. Roy. Statist. Soc., Ser. B. To appear

Cited By

View all
  • (2024)A new computationally efficient algorithm to solve feature selection for functional data classification in high-dimensional spacesProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692245(4383-4402)Online publication date: 21-Jul-2024
  • (2024)Functional graph convolutional networksProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/795(7188-7196)Online publication date: 3-Aug-2024
  • (2024)Estimating the first and second derivatives of discrete audio dataEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-024-00355-52024:1Online publication date: 18-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Numerische Mathematik
Numerische Mathematik  Volume 31, Issue 4
December 1978
125 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 December 1978

Author Tags

  1. CR:5.17
  2. MOS:65D10
  3. MOS:65D25

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A new computationally efficient algorithm to solve feature selection for functional data classification in high-dimensional spacesProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692245(4383-4402)Online publication date: 21-Jul-2024
  • (2024)Functional graph convolutional networksProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/795(7188-7196)Online publication date: 3-Aug-2024
  • (2024)Estimating the first and second derivatives of discrete audio dataEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-024-00355-52024:1Online publication date: 18-Jun-2024
  • (2024)Digital Twin of Rail for Defect AnalysisProceedings of the 2024 8th International Conference on Virtual and Augmented Reality Simulations10.1145/3657547.3657549(53-60)Online publication date: 14-Mar-2024
  • (2024)A Tensor Based Varying-Coefficient Model for Multi-Modal Neuroimaging Data AnalysisIEEE Transactions on Signal Processing10.1109/TSP.2024.337576872(1607-1619)Online publication date: 11-Mar-2024
  • (2024)Approximate Leave-One-Out Cross Validation for Regression With ℓ₁ RegularizersIEEE Transactions on Information Theory10.1109/TIT.2024.345000270:11(8040-8071)Online publication date: 1-Nov-2024
  • (2024)Impact of Internet and mobile communication on cyber resilienceInternational Journal of Critical Infrastructure Protection10.1016/j.ijcip.2024.10072247:COnline publication date: 1-Dec-2024
  • (2024)Hierarchical Bayesian spectral regression with shape constraints for multi-group dataComputational Statistics & Data Analysis10.1016/j.csda.2024.108036200:COnline publication date: 1-Dec-2024
  • (2024)A fuzzy nonparametric regression model based on an extended center and range methodJournal of Computational and Applied Mathematics10.1016/j.cam.2023.115377436:COnline publication date: 15-Jan-2024
  • (2024)Forecasting Electricity Demand in Greece: A Functional Data Approach in High Dimensional Hourly Time SeriesSN Computer Science10.1007/s42979-024-02926-x5:5Online publication date: 20-May-2024
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media