Analysis Results Datasets Using Open-Source Tools from the {pharmaverse}
A Hands-on Workshop
🗓️ Tuesday March 18, 2025 | 04:00 - 05:30PM EDT
🏢 Renaissance Orlando at SeaWorld, Oceans 1
💥 FREE for PHUSE US Connect Attendees!
Description
The CDISC Analysis Results Data (ARD) Model is an emerging standard for encoding statistical analysis summaries in a machine-readable format. Its primary objective is to streamline automation and to improve the reproducibility, reusability, and traceability of analysis results.
The newly released {cards} R package, a collaborative effort in the Pharmaverse including Roche, GSK, and Novartis, offers a variety of functions for ARD generation. These range from basic univariate summaries like means and tabulations to complex multivariable summaries encompassing regression models and statistical tests.
In this interactive workshop, attendees will learn about ARDs and how they fit into the larger CDISC Analysis Results Standard. They will get hands-on experience using {cards} to build ARDs for both simple and complex statistical summaries. Lastly, attendees will be introduced to open-source tools for converting their ARDs into tables using the {gtsummary} and {tfrmt} R packages. Attendees will leave with a solid understanding of ARDs and know how to integrate them into their daily work using open-source tools.
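As a taste of what building an ARD looks like, here is a minimal sketch rather than the workshop's own material. It assumes the `ADSL` example dataset shipped with {cards} and the `ard_continuous()`, `ard_categorical()`, and `bind_ard()` functions as they were named around the time of the workshop; later {cards} releases may rename some of these. The choice of `AGE`, `AGEGR1`, and `ARM` is for illustration only.

```r
library(cards)

# ADSL is an example subject-level dataset bundled with {cards}
# (assumption: the variables AGE, AGEGR1, and ARM are used for illustration).

# Continuous summary of age within each treatment arm,
# using the package's default set of summary statistics
ard_age <- ard_continuous(ADSL, by = ARM, variables = AGE)

# Counts and percentages for age group within each treatment arm
ard_agegr <- ard_categorical(ADSL, by = ARM, variables = AGEGR1)

# Stack the two ARDs into a single long-format ARD
ard <- bind_ard(ard_age, ard_agegr)

# Each row is one statistic for one variable in one group
print(ard)
```

The result is a long data frame with one statistic per row, which is the shape that downstream table tools such as {gtsummary} and {tfrmt} consume.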
Pre-work
If you are planning to participate in the hands-on portion of the workshop, please come prepared with the following packages installed on your local machine:
install.packages(c("cards", "cardx", "gtsummary", "tfrmt", "tfrmtbuilder", "pharmaverseadam"))
Or, you can make a copy of the following Posit Cloud (formerly RStudio Cloud) instance, which has these packages pre-installed. A (free) Posit Cloud account is required.
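If you set up a local machine, you may want to confirm the installation before the session. The following is a small sketch using only base R. The variable names `workshop_pkgs` and `missing_pkgs` are made up for this example.

```r
# Packages listed in the pre-work instructions
workshop_pkgs <- c("cards", "cardx", "gtsummary", "tfrmt",
                   "tfrmtbuilder", "pharmaverseadam")

# requireNamespace() returns TRUE when a package can be loaded
is_installed <- vapply(workshop_pkgs, requireNamespace,
                       logical(1), quietly = TRUE)

# Names of any packages that still need to be installed
missing_pkgs <- workshop_pkgs[!is_installed]

if (length(missing_pkgs) == 0) {
  message("All workshop packages are installed.")
} else {
  message("Still missing: ", paste(missing_pkgs, collapse = ", "))
}
```

Any names reported as missing can be passed directly to `install.packages()`.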
Slides
Introduction to the Analysis Results Standard and {cards}
Introduction to the {cardx} Package and ARD Extras
Instructors
Daniel D. Sjoberg (he/him) is a Senior Principal Data Scientist at Genentech. Previously, he was a Lead Data Science Manager at the Prostate Cancer Clinical Trials Consortium, and a Senior Biostatistician at Memorial Sloan Kettering Cancer Center in New York City. He enjoys R package development, creating many packages available on CRAN, R-Universe, and GitHub. He’s a co-organizer of rainbowR (a community that supports, promotes and connects LGBTQ+ people who code in the R language) and of the R Medicine Conference. His research interests include adaptive methods in clinical trials, precision medicine, and predictive modeling. Daniel is the winner of the 2021 American Statistical Association (ASA) Innovation in Statistical Programming and Analytics award.
Becca Krouse is a data scientist in GSK’s Statistics and Data Science Innovation Hub. A biostatistician by training, she has experience spanning 14+ years in the field of clinical research and specializes in developing R-based tools.