This review paper offers a thorough analysis of different data imputation methods that can be applied to time series data. Time series data is an essential element in various analytical and predictive models used in different domains. Time series data frequently experiences missing values as a result of diverse factors, such as system errors, human influences, or inherent gaps in data collection. The presence of these missing values significantly undermines the precision and dependability of models constructed using this data. This paper classifies imputation methods into basic and advanced techniques, providing a comprehensive examination of each. The simplicity and effectiveness of basic techniques, such as mean or median imputation and linear interpolation, are discussed in specific scenarios. The study investigates the efficacy of advanced techniques, such as ARIMA statistical models, K-Nearest Neighbors machine learning approaches, and Long Short-Term Memory networks deep learning techniques, in managing intricate and extensive time series datasets. The paper emphasizes a comparative approach, assessing each method's complexity, accuracy, and computational demands. The paper concludes by emphasizing the need for continuous innovation in imputation techniques to meet the growing complexity of time series data across various domains. It advocates for a collaborative approach that combines domain expertise with advanced data science methods to develop tailored, efficient, and accurate imputation strategies.
Primary Language | English |
---|---|
Subjects | Software Engineering (Other) |
Journal Section | Articles |
Authors | |
Early Pub Date | July 18, 2024 |
Publication Date | July 1, 2024 |
Submission Date | February 7, 2024 |
Acceptance Date | April 4, 2024 |
Published in Issue | Year 2024Volume: 27 |