Optimal Subsampling from Big Datasets in Presence of Misspecification
Contributo in Atti di convegno
Data di Pubblicazione:
2025
Citazione:
Optimal Subsampling from Big Datasets in Presence of Misspecification / L. Deldossi, C. Tommasi (ITALIAN STATISTICAL SOCIETY SERIES ON ADVANCES IN STATISTICS). - In: Methodological and Applied Statistics and Demography II / [a cura di] A. Pollice, P. Mariani. - Prima edizione. - [s.l] : Springer, 2025. - ISBN 978-3-031-64350-7. - pp. 458-464 (( 52. SIS2024 Bari 2024 [10.1007/978-3-031-64350-7_77].
Abstract:
In the era of Big Data, several design based subsampling methods are proposed to reduce costs (and time) and to help in informed decision making. Most of these approaches require the specification of a model. A wrong model assumption and/or the possible presence of outliers represent a limitation for the most commonly applied subsampling criteria.
Through a simulation study, we explore if a subsampling method, originally introduced by [1] to avoid outliers, works well to account for model uncertainty and, on the other side, if the subsampling approach introduced by [2] to account for model misspecification, is robust to the presence of outliers.
Tipologia IRIS:
03 - Contributo in volume
Keywords:
D-optimality; model misspecification; outliers; subsampling
Elenco autori:
L. Deldossi, C. Tommasi
Link alla scheda completa:
Titolo del libro:
Methodological and Applied Statistics and Demography II