Loading…
useR! 2024
Attending this event?
In Person & Virtual
8 - 11 July, 2024
Learn more and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for useR! 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Time (UTC+1)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change.
Thursday, July 11 • 12:30 - 12:50
Neural Network-Based Text Classification for International Standardized Codes Using R - Nina Niederhametner, Statistik Austria & Johannes Gussenbauer, Statistics Austria

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

International standard classifications such as ISCO (for Occupation), ISCED (for Education) and COICOP (for Consumption) serve as pivotal statistical frameworks for the organization and classification of information. In official statistical practices, adherence to these codes is essential for thorough analysis and comparison of findings. Survey respondents typically provide information in an unstructured free textual format, requiring subsequent assignment to standardized code. This process is often done manually, resulting in time-consuming laborious tasks. In our talk, we propose an approach that automates the classification of textual data into various standardized codes using simple mathematical techniques combined with neural network-based language models, utilizing the R libraries TensorFlow and Keras. Additionally, we illustrate the development of application programming interfaces (APIs) using plumber, and the deployment of our models through posit connect, establishing accessibility to a broad user base.

Speakers
avatar for Johannes Gussenbauer

Johannes Gussenbauer

Methodologist, Statistics Austria
I studied Mathematics at the Universtiy of Technology in Vienna and am working as a methodoligst at Statistics Austria since 2017. My main topics at work cover imputation, calibration and error estimation for surveys as well as text classification using R. I contribute to various... Read More →
avatar for Nina Niederhametner

Nina Niederhametner

Methodologist, Statistics Austria
Nina Niederhametner started working as a methodologist at Statistik Austria in November 2023, where her main work centers around imputation and classification using large language models. She also specializes in data privacy and anonymization with special focus on synthetic data... Read More →


Thursday July 11, 2024 12:30 - 12:50 CEST
Pinzgau + Tennegau
Feedback form isn't open yet.