Loading…
useR! 2024
Attending this event?
In Person & Virtual
8 - 11 July, 2024
Learn more and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for useR! 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Time (UTC+1)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change.
Wednesday, July 10 • 15:40 - 16:00
ML-Based Imputation Methods in R Package VIM: Performance and Considerations - Johannes Gussenbauer & Alexander Kowarik, Statistics Austria; Nina Niederhametner, Statistik Austria

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Missing data poses a pervasive issue in statistical analysis across various domains. Ignoring missing values or using incongruous imputation methods can introduce bias and decrease the validity of statistical results. To overcome the challenge of missing data imputation, we propose the use of novel machine learning algorithms: The R package VIM (Visualization and Imputation of Missing Values) has incorporated machine learning (ML)-based imputation methods, including xgboost and transformer models. This presentation will elucidate the recent advancements in VIM, with a special emphasis on the performance of these ML models in handling missing data, comparing them to more conventional imputation methods, and highlight their advantages and disadvantages. Through real-world examples, we aim to demonstrate the effectiveness of our models in improving accuracy and reliability.

Speakers
avatar for Alexander Kowarik

Alexander Kowarik

Head of Statistical methods and survey methodology, Statistics Austria
Dr. Alexander Kowarik is head of the methods unit at Statistics Austria with more than 10 years of experience working at a NSI. He is an active contributor to the R open source community with a focus on official statistics application.
avatar for Johannes Gussenbauer

Johannes Gussenbauer

Methodologist, Statistics Austria
I studied Mathematics at the Universtiy of Technology in Vienna and am working as a methodoligst at Statistics Austria since 2017. My main topics at work cover imputation, calibration and error estimation for surveys as well as text classification using R. I contribute to various... Read More →
avatar for Nina Niederhametner

Nina Niederhametner

Methodologist, Statistics Austria
Nina Niederhametner started working as a methodologist at Statistik Austria in November 2023, where her main work centers around imputation and classification using large language models. She also specializes in data privacy and anonymization with special focus on synthetic data... Read More →


Wednesday July 10, 2024 15:40 - 16:00 CEST
Pinzgau + Tennegau
Feedback form isn't open yet.