Computer Science > Machine Learning

arXiv:2405.17508 (cs)

[Submitted on 26 May 2024 (v1), last revised 26 Nov 2024 (this version, v2)]

Title:Unveiling the Secrets: How Masking Strategies Shape Time Series Imputation

Authors:Linglong Qian, Yiyuan Yang, Wenjie Du, Jun Wang, Zina Ibrahim

Abstract:Time series imputation is a critical challenge in data mining, particularly in domains like healthcare and environmental monitoring, where missing data can compromise analytical outcomes. This study investigates the influence of diverse masking strategies, normalization timing, and missingness patterns on the performance of eleven state-of-the-art imputation models across three diverse datasets. Specifically, we evaluate the effects of pre-masking versus in-mini-batch masking, augmentation versus overlaying of artificial missingness, and pre-normalization versus post-normalization. Our findings reveal that masking strategies profoundly affect imputation accuracy, with dynamic masking providing robust augmentation benefits and overlay masking better simulating real-world missingness patterns. Sophisticated models, such as CSDI, exhibited sensitivity to preprocessing configurations, while simpler models like BRITS delivered consistent and efficient performance. We highlight the importance of aligning preprocessing pipelines and masking strategies with dataset characteristics to improve robustness under diverse conditions, including high missing rates. This study provides actionable insights for designing imputation pipelines and underscores the need for transparent and comprehensive experimental designs.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2405.17508 [cs.LG]
	(or arXiv:2405.17508v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.17508

Submission history

From: Linglong Qian [view email]
[v1] Sun, 26 May 2024 18:05:12 UTC (98 KB)
[v2] Tue, 26 Nov 2024 13:26:58 UTC (268 KB)

Computer Science > Machine Learning

Title:Unveiling the Secrets: How Masking Strategies Shape Time Series Imputation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Unveiling the Secrets: How Masking Strategies Shape Time Series Imputation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators