Tidying and manipulating data using the tidyverse in R
This beginner-level session will teach essential data wrangling skills in R using the tidyverse (dplyr, tidyr, pivot_* etc.). It will not delve into advanced statistical models or coding techniques, but a base understanding of the R programming language is necessary.
The focus will be on solidifying foundational best practices for programmatically shaping ‘messy’ data into tidy data fit for analysis. This will involve learning to format and manipulate data, covering skills such as pivoting, recoding, joining, filtering, summarising, and reshaping. These skills are distinct from coding and statistics, but are often overlooked. Yet, in real-world data analysis, they constitute the majority of the work.
The session will include a series of short lectures followed by hands-on practical activities using real-world data. Attendees will need to bring a laptop with R and RStudio pre-installed on their system. They will also be provided with a link to a Github repository containing the practical exercises and data that will be used throughout the training, which should be downloaded in advance.
Session outcomes:
Students will learn to format and manipulate data to support analysis, covering skills such as pivoting, recoding, joining, filtering, summarising, and reshaping.
This training session will be delivered in person at the University of Sheffield.
This training session will be recorded, and the recording will be made available on the WRDTP website.
Places are limited, so please only book a ticket if you can guarantee your attendance.
Bookings will close at 9am on Tuesday 25th February.