Loading…
useR! 2024
Attending this event?
In Person
8 - 11 July, 2024
Learn more and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for useR! 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Summer Time (UTC+02:00)To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

The virtual program will take place on 2 July. Please see the virtual schedule page for more information.
Data handling and management [clear filter]
Wednesday, July 10
 

11:30 CEST

{Admiral} – the {Dplyr} of the Pharmaceutical Industry? - Stefan Pascal Thoma, Roche & Edoardo Mancini, Roche Products Ltd
{admiral} is a package developed across the pharmaceutical industry to derive datasets that comply with industry specific data standards. In this presentation, we'd like to give a brief exposition of the {admiral} package. The talk commences by introducing our problem statement, how we solve it in {admiral} by compartmentalizing domain specific functionalities, and how the package and its family expanded to a wide cross-industry collaboration. We conclude showcasing a case-study where pandemic-driven interests led to an industry effort to create a vaccine-specific {admiral} toolset.

Speakers
avatar for Edoardo Mancini

Edoardo Mancini

Data Scientist, Roche
Edoardo is a Data Scientist at Roche with 3+ years of experience in pharmaceuticals. He specializes in statistical programming, leading studies in ophthalmology and immunology. Edoardo promotes R for clinical reporting and holds degrees in Mathematics and Applied Mathematics, and... Read More →
avatar for Stefan Pascal Thoma

Stefan Pascal Thoma

Data Scientist, Roche
Stefan Thoma is a statistical programmer, statistician, and core {admiral} developer at Roche, joining in November 2022.He has a Masters degree in Statistics from ETH Zurich and a Masters degree in Psychology from the University of Bern.


Wednesday July 10, 2024 11:30 - 11:50 CEST
Attersee

11:50 CEST

Tackling Formatted Tabular Data from Excel - Jeremy Selva, National Heart Centre Singapore
Reading tabular data with formatted cell in Microsoft Excel can be really tricky. Unexpected things may happen if I read it blindly in R using readxl::read_excel. I have tried to use the col_types argument but it was not enough for me. Unfortunately, there are limited resources to deal with reading tabular data with formatted cells in Excel. In my presentation, I will share some problematic formatted columns that I have encountered during my work with clinical data. Examples are Date in General (Text) and Date number format Numeric column with different colour font representing different units of the same measurements Numeric columns with some numbers provided in text More importantly, I will share how I managed to handle them in R using these three R packages collateral (https://collateral.jamesgoldie.dev/), pointblank (https://rstudio.github.io/pointblank/index.html) and tidyxl (https://nacnudus.github.io/tidyxl/index.html). For more details, I have written a blog post on https://jeremy-selva.netlify.app/blog/2024-02-15-tackling-formatted-cell-data/

Speakers
avatar for Jeremy Selva

Jeremy Selva

Jeremy John Selva, National Heart Centre Singapore
Jeremy is a Research Officer at the National Heart Centre Singapore. His job involves cleaning and harmonisation of clinical data from multiple labs related to cardiology such as cardiac medication, coronary artery calcium score and stenosis severity. He is curious to find ways to... Read More →


Wednesday July 10, 2024 11:50 - 12:10 CEST
Attersee
 
  • Timezone
  • Filter By Date useR! 2024 Jul 7 -11, 2024
  • Filter By Venue Salzburg, Austria
  • Filter By Type
  • Big and high-dimensional data
  • Biostatistics + epidemiology + bioinformatics
  • Breaks + Special Events
  • Community and outreach
  • Cross-industry collaboration
  • Data handling and management
  • Data science education
  • Data visualisation
  • Economics + finance + insurance + business
  • Efficient programming
  • Environmental sciences
  • Interfaces with other programming languages
  • Keynote Sessions
  • Machine learning and AI
  • Numerical methods
  • Open and reproducible science
  • Predictive modelling and forecasting
  • Public sector and NGO
  • Quarto and reporting
  • R workflow + deployment + production
  • Registration
  • Research software engineering
  • Shiny + dashboards + web apps
  • Social sciences
  • Spatial data and maps
  • Sponsor Showcase
  • Statistical modelling
  • Text data and NLP
  • Level

Filter sessions
Apply filters to sessions.