
Scraping Digital Media Platforms: Theory, Ethics and Data Analysis
UPDATE (05/11/25): Please note that this event has now been cancelled due to localised industrial action. We hope to rearrange it, and announce a new date, in the near future.
************************
Over the last five years, scraping has become increasingly popular amongst academic researchers, becoming a vital tool for digital methods research. In this three-part workshop, we will explain the theoretical underpinnings of scraping, explore high quality digital tools for academic research and introduce options for data analysis.
In the first part, we will explain scraping’s rise in importance, covering both the theoretical underpinnings of digital methods research – notably, its reliance on repurposed data – and a media environment in which access to large data sets via APIs (application programming interfaces) is increasingly restricted by technology companies. We will then explore the methodological implications of scraping, how it shifts the research perspective from platform overview to user experience and ethical considerations. In the second part, we will introduce some digital methods tools used in academic research to scrape platforms such as Google, Instagram, TikTok and X, and some options for data analysis. Finally, in the third part, we will provide time to experiment using these tools, with advice and supervision from the workshop leader.
Outcomes
Participants in this workshop will:
- Understand the broader contexts for the increasing importance of scraping in academic research;
- Understand the methodological implications of using data scraped from digital platforms;
- Become familiar with a range of scraping tools designed for academic research;
- Gain hands-on experience of using scraping tools relevant to their research.
Contributors
Dr Warren Pearce is Senior Lecturer in Digital Methods, Science and Society at University of Sheffield. Warren’s research is engaged with ‘science in public’: how science is organised, presented and reported by government, experts and media. His research has been published in a range of leading academic journals, such as Nature Climate Change; Information, Communication & Society; Public Understanding of Science; WIREs Climate Change; Environmental Communication.

Dr Warren Pearce
Director of Digital Technologies, Communication and Artificial Intelligence (DCA), WRDTP
Please note: The WRDTP is committed to sustainability and to reducing the waste from excess catering at events. A key challenge here is non-attendance at events. From October 1st 2025, the WRDTP will be changing the way we manage the non-attendance of PGR students who have booked place/s at WRDTP Training events. Any PGR student who does not inform the WRDTP (via training@wrdtp.ac.uk) that they will not be able to attend a WRDTP event at least 3 working days before the event takes place will have the cost of their place deducted from their RTSG (if a WRDTP-funded student), or have this charged to their department (if not funded by the WRDTP). This will allow us to better plan for events and to avoid catering waste. Thank you in advance for your cooperation on this matter.
This is an in-person only event at the University of Sheffield.
Bookings will close at 9am on Tuesday 25th November.
When booking your place, we ask that you use your institutional (.ac.uk) email address and complete all fields of the booking form. Thank you for your understanding.






