useR! 2024
In Person & Virtual
8 - 11 July, 2024

Please note: Times are shown in Central European Summer Time (CEST, UTC+2). The schedule is subject to change.
Thursday, July 11 • 11:30 - 11:50
Designing a Drop-in Replacement for Dplyr - Kirill Müller, cynkra GmbH

The dplyr package is a powerful tool for data manipulation in R. It provides a consistent grammar for manipulating data frames and is widely used by data scientists and analysts. However, dplyr requires that the entire dataset fit into memory, and it can be slow for large datasets. The duckdb package provides an in-process database designed to be fast and efficient for analytical workloads. It offers a relational frontend, modeled after Codd's relational algebra, alongside an SQL interface. The new duckplyr package uses this relational frontend: unlike dbplyr, which translates dplyr commands into SQL, duckplyr translates dplyr commands into relational algebra. The package has been designed from day one to be a fully compatible drop-in replacement for dplyr. Operations run in duckdb when possible and fall back to dplyr when not. The project's goal is to speed up more and more dplyr verbs, R functions, and data types, working towards becoming the primary implementation of the dplyr grammar of data manipulation. In this talk, I will present duckdb and duckplyr, and discuss the design of duckplyr and its supporting tools.

Speakers
Kirill Müller

Founding partner, cynkra GmbH
Kirill Müller has been working on the boundary between data and computer science for more than 25 years. He has been awarded five R Consortium projects to improve database connectivity and performance in R. Kirill is a core contributor to several tidyverse packages, including dplyr...


Thursday July 11, 2024 11:30 - 11:50 CEST
Attersee