Loading…
useR! 2024
Attending this event?
In Person & Virtual
8 - 11 July, 2024
Learn more and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for useR! 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Time (UTC+1)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change.
Machine learning and AI [clear filter]
Tuesday, July 9
 

13:20 CEST

Predictors Optimization for Sensory Profiles Modelling Based on Electronic Signals - Jean-Vincent Le Bé, Nestlé Research
The sensory profiles of coffee products can be generated by using predictive models with inputs from specific electrodes immersed in the liquid coffee. The electrical signals are generated by molecules diffusing through selective bio-polymers and reaching electrodes. These signals are time-series data, and various features are calculated from them (such as transient peaks or steady-state averages). These features are then used to train models for the prediction of sensory profiles or proximity to a given reference product. Six features are calculated for each of the 15 pairs of electrodes resulting in 90 variables for the classification model (proximity to a reference) or the regression model (sensory profile). These variables can be correlated to a certain extent and lead to over-fitting or unnecessary recording. This work presents a method for reducing the number of variables to a minimum relevant set using clustering and random forest variable importance. It shows that a proper selection results in a more robust model across different experimental batches.

Tuesday July 9, 2024 13:20 - 13:25 CEST
Wolfgangsee

13:35 CEST

BayesCVI: A Bayesian Cluster Validity Index - Nathakhun Wiroonsri, King Mongkut's University of Technology Thonburi
Selecting the appropriate number of clusters is a critical step in applying clustering methods. To assist in this process, various cluster validity indices (CVIs) have been developed. These indices are designed to identify the optimal number of clusters within a dataset. However, users may not always seek the absolute optimal number of clusters but rather a secondary option that better aligns with their contexts. This realization has led us to introduce a Bayesian cluster validity index (BCVI), which builds upon existing indices. The BCVI utilizes a Dirichlet prior, resulting in the same posterior distribution. We evaluate BCVI using the Wiroonsri index for hard clustering and the WP index for soft clustering as underlying indices. We compare the performance of BCVI with that of the original underlying indices and several other existing CVIs, including DB, STR, XB, and KWON2 indices. Our BCVI offers clear advantages in situations where users can specify their desired range for the final number of clusters. Additionally, we showcase the practical applicability of our approach through MRI images. These tools are also published as a new R package `BayesCVI' available on CRAN.

Speakers
avatar for Nathakhun Wiroonsri

Nathakhun Wiroonsri

Assistant Professor, King Mongkut's University of Technology Thonburi
Nathakhun Wiroonsri earned his B.Sc. in Mathematics with first-class honors from Chulalongkorn University, Master of Financial Mathematics from North Carolina State University, and Ph.D. in Applied Mathematics from the University of Southern California in 2010, 2013, and 2018, respectively... Read More →


Tuesday July 9, 2024 13:35 - 13:40 CEST
Salzburg I
 
  • Timezone
  • Filter By Date useR! 2024 Jul 7 -11, 2024
  • Filter By Venue Salzburg, Austria
  • Filter By Type
  • Big and high-dimensional data
  • Biostatistics + epidemiology + bioinformatics
  • Breaks + Special Events
  • Community and outreach
  • Cross-industry collaboration
  • Data handling and management
  • Data science education
  • Data visualisation
  • Economics + finance + insurance + business
  • Efficient programming
  • Environmental sciences
  • Interfaces with other programming languages
  • Keynote Sessions
  • Machine learning and AI
  • Numerical methods
  • Open and reproducible science
  • Predictive modelling and forecasting
  • Public sector and NGO
  • Quarto and reporting
  • R workflow + deployment + production
  • Registration
  • Research software engineering
  • Shiny + dashboards + web apps
  • Social sciences
  • Spatial data and maps
  • Sponsor Showcase
  • Statistical modelling
  • Text data and NLP
  • Level

Filter sessions
Apply filters to sessions.