Loading…
useR! 2024
Attending this event?
In Person & Virtual
8 - 11 July, 2024
Learn more and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for useR! 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Time (UTC+1)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change.
Wednesday, July 10 • 11:50 - 12:10
Generative Modelling of Mixed Tabular Data with the R Package ‘Arf’ - Jan Kapar, Leibniz Institute for Prevention Research and Epidemiology - BIPS

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Generative machine learning has gained world-wide attention and, especially since the rise of ChatGPT and DALL-E, has started to become an integral tool both in business and everyday life. While the hype has mainly focused on text, image, audio and video synthesis so far, generative modelling of mixed tabular data with both continuous and categorical variables has great unexploited potential in many research fields and industry applications. However, recent attempts to adapt the existing, mainly deep learning-based methods to this more general setting have not shown the same overwhelming successes yet. We present the CRAN package ‘arf’, an easy-to-use implementation of adversarial random forests based on ‘ranger’, which has shown the ability to match and often outperform current deep learning approaches in terms of performance, tuning efforts and runtime, also on small or high dimensional data. ‘arf’ provides tools for both synthetic data generation and density estimation. Optional conditioning on events further extends the possible area of application, enabling for use cases like missing data imputation, data balancing and augmentation.

Speakers
avatar for Jan Kapar

Jan Kapar

M. Sc., Leibniz Institute for Prevention Research and Epidemiology - BIPS
since 2022: Doctoral Student / Research Fellow in Machine Learning, Faculty for Mathematics and Computer Science, Universität Bremen, and Leibniz Institute for Prevention Research and Epidemiology - BIPS 2011 - 2016: B.Sc Mathematics and M.Sc. Business Mathematics, Julius-Maxmilians-Universität... Read More →


Wednesday July 10, 2024 11:50 - 12:10 CEST
Wolfgangsee
Feedback form isn't open yet.