[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Smoothing noisy data with spline functions

Published: 01 December 1978 Publication History

Abstract

Smoothing splines are well known to provide nice curves which smooth discrete, noisy data. We obtain a practical, effective method for estimating the optimum amount of smoothing from the data. Derivatives can be estimated from the data by differentiating the resulting (nearly) optimally smoothed spline.
We consider the model y i ( t i )+ i , i =1, 2, ..., n , t i [0, 1], where g W 2 ( m ) ={ f : f , f , ..., f ( m 1) abs. cont., f ( m ) 2[0,1]}, and the { i } are random errors with E i =0, E i j = 2 ij . The error variance 2 may be unknown. As an estimate of g we take the solution g n, to the problem: Find f W 2 (m) to minimize $$\frac{1}{n}\sum\limits_{j = 1}^n {(f(t_j ) - y_j )^2 + \lambda \int\limits_0^1 {(f^{(m)} (u))^2 du} }$$ . The function g n, is a smoothing polynomial spline of degree 2 m 1. The parameter controls the tradeoff between the "roughness" of the solution, as measured by $$\int\limits_0^1 {[f^{(m)} (u)]^2 du}$$ , and the infidelity to the data as measured by $$\frac{1}{n}\sum\limits_{j = 1}^n {(f(t_j ) - y_j )^2 }$$ , and so governs the average square error R( ; g)=R( ) defined by $$R(\lambda ) = \frac{1}{n}\sum\limits_{j = 1}^n {(g_{n,\lambda } (t_j ) - g(t_j ))^2 }$$ . We provide an estimate $$\hat \lambda$$ , called the generalized cross-validation estimate, for the minimizer of R( ) . The estimate $$\hat \lambda$$ is the minimizer of V ( ) defined by $$V(\lambda ) = \frac{1}{n}\parallel (I - A(\lambda ))y\parallel ^2 /\left[ {\frac{1}{n}{\text{Trace(}}I - A(\lambda ))} \right]^2$$ , where y=(y 1, ..., y n)t and A ( ) is the n n matrix satisfying (g n, ( t 1), ..., g n, ( t n))t= A ( ) y . We prove that there exist a sequence of minimizers $$\tilde \lambda = \tilde \lambda (n)$$ of EV( ) , such that as the (regular) mesh {t i} i=1 n becomes finer, $$\mathop {\lim }\limits_{n \to \infty } ER(\tilde \lambda )/\mathop {\min }\limits_\lambda ER(\lambda ) \downarrow 1$$ . A Monte Carlo experiment with several smooth g 's was tried with m =2, n =50 and several values of 2, and typical values of $$R(\hat \lambda )/\mathop {\min }\limits_\lambda R(\lambda )$$ were found to be in the range 1.01---1.4. The derivative g of g can be estimated by $$g'_{n,\hat \lambda } (t)$$ . In the Monte Carlo examples tried, the minimizer of $$R_D (\lambda ) = \frac{1}{n}\sum\limits_{j = 1}^n {(g'_{n,\lambda } (t_j ) - } g'(t_j ))$$ tended to be close to the minimizer of R(¿) , so that $$\hat \lambda$$ was also a good value of the smoothing parameter for estimating the derivative.

References

[1]
Abramowitz, M., Stegun, I.: Handbook of mathematical functions with formulas, graphs, and mathematical tables. U.S. Department of Commerce, National Bureau of Standards Applied Mathematics Series No.55, pp. 803---819, 1964
[2]
Aronszajn, N.: Theory of reproducing kernels. Trans. Amer. Math. Soc.68, 337---404 (1950)
[3]
Golomb, M.: Approximation by periodic spline interpolants on uniform meshes. J. Approximation Theory1, 26---65 (1968)
[4]
Golub, G., Heath, M., Wahba, G.: Generalized cross validation as a method for choosing a good ridge parameter, to appear, Technometrics
[5]
Golub, G., Reinsch, C.: Singular value decomposition and least squares solutions. Numer. Math.14, 403---420 (1970)
[6]
Hudson, H.M.: Empirical Bayes estimation. Technical Report #58, Stanford University, Department of Statistics, Stanford, Cal., 1974
[7]
Kimeldorf, G., Wahba, G.: A correspondence between Bayesian estimation on stochastic processes and smoothing by splines. Ann. Inst. Statist. Math.41, 495---502 (1970)
[8]
Mallows, C.L.: Some comments onCp. Technometrics15, 661---675 (1973)
[9]
Reinsch, C.M.: Smoothing by spline functions. Numer. Math.10, 177---183 (1967)
[10]
Reinsch, C.M.: Smoothing by spline functions, II. Numer. Math.16, 451---454 (1971)
[11]
Schoenberg, I.J.: Spline functions and the problem of graduation. Proc. Nat. Acad. Sci. (USA)52, 947---950 (1964)
[12]
Wahba, G.: Convergence rates for certain approximate solutions to first kind integral equations. J. Approximation Theory7, 167---185 (1973)
[13]
Wahba, G.: Smoothing noisy data with spline functions. Numer. Math.24, 383---393 (1975)
[14]
Wahba, G.: Practical approximate solutions to linear operator equations when the data are noisy. SIAM J. Numer. Anal.14, 651---667 (1977)
[15]
Wahba, G., Wold, S.: A completely automatic French curve: Fitting spline functions by crossvalidation. Comm. Statist.4, 1---17 (1975)
[16]
Wahba, G., Wold, S.: Periodic splines for spectral density estimation: The use of cross-validation for determining the degree of smoothing. Comm. Statist.4, 125---141 (1975)
[17]
Wahba, G.: A survey of some smoothing problems and the method of generalized cross validation for solving them. University of Wisconsin-Madison, Statistics Dept., Technical Report #457. In: Proceedings of the Conference on Applications of Statistics, Dayton, Ohio (P.R. Krishnaiah, ed.) June 14---18, 1976
[18]
Wahba, G.: Improper priors, spline smoothing and the problem of guarding against model errors in regression. J. Roy. Statist. Soc., Ser. B. To appear

Cited By

View all
  • (2024)Estimating the first and second derivatives of discrete audio dataEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-024-00355-52024:1Online publication date: 18-Jun-2024
  • (2024)Digital Twin of Rail for Defect AnalysisProceedings of the 2024 8th International Conference on Virtual and Augmented Reality Simulations10.1145/3657547.3657549(53-60)Online publication date: 14-Mar-2024
  • (2024)A Tensor Based Varying-Coefficient Model for Multi-Modal Neuroimaging Data AnalysisIEEE Transactions on Signal Processing10.1109/TSP.2024.337576872(1607-1619)Online publication date: 11-Mar-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Numerische Mathematik
Numerische Mathematik  Volume 31, Issue 4
December 1978
125 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 December 1978

Author Tags

  1. CR:5.17
  2. MOS:65D10
  3. MOS:65D25

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Estimating the first and second derivatives of discrete audio dataEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-024-00355-52024:1Online publication date: 18-Jun-2024
  • (2024)Digital Twin of Rail for Defect AnalysisProceedings of the 2024 8th International Conference on Virtual and Augmented Reality Simulations10.1145/3657547.3657549(53-60)Online publication date: 14-Mar-2024
  • (2024)A Tensor Based Varying-Coefficient Model for Multi-Modal Neuroimaging Data AnalysisIEEE Transactions on Signal Processing10.1109/TSP.2024.337576872(1607-1619)Online publication date: 11-Mar-2024
  • (2024)Hierarchical Bayesian spectral regression with shape constraints for multi-group dataComputational Statistics & Data Analysis10.1016/j.csda.2024.108036200:COnline publication date: 1-Dec-2024
  • (2024)A fuzzy nonparametric regression model based on an extended center and range methodJournal of Computational and Applied Mathematics10.1016/j.cam.2023.115377436:COnline publication date: 15-Jan-2024
  • (2024)Forecasting Electricity Demand in Greece: A Functional Data Approach in High Dimensional Hourly Time SeriesSN Computer Science10.1007/s42979-024-02926-x5:5Online publication date: 20-May-2024
  • (2024)A modified EM-type algorithm to estimate semi-parametric mixtures of non-parametric regressionsStatistics and Computing10.1007/s11222-024-10435-334:4Online publication date: 29-May-2024
  • (2024)Spatio-temporal clustering analysis using generalized lasso with an application to reveal the spread of Covid-19 cases in JapanComputational Statistics10.1007/s00180-023-01331-x39:3(1513-1537)Online publication date: 1-May-2024
  • (2023)How much does initialization affect generalization?Proceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619597(28637-28655)Online publication date: 23-Jul-2023
  • (2023)A fast, well-founded approximation to the empirical neural tangent kernelProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619452(25061-25081)Online publication date: 23-Jul-2023
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media