Introduction to Data Cleaning with OpenRefine
OpenRefine is an open source tool to explore, clean, organise, combine and transform data. OpenRefine is particularly powerful when working with large datasets.
Learn basic data cleaning techniques in this self-paced online workshop such as:
- Exploring tabular data through facets and filters
- Implementing ‘tidy data’ principles
- Cleaning, organising and preparing data for analysis
- Extracting and using a script to automate wrangling on similar data
Download the software and dataset, do activities and watch videos to guide you through the lessons. Give yourself around 2 1/2 hours to complete the workshop.
Adapted from Data Carpentry & Library Carpentry lessons.
Hosted by Griffith University Library, 2022.
Griffith University acknowledges the people who are the traditional custodians of the land and pays respect to the Elders, past and present, and extends that respect to all Aboriginal and Torres Strait Islander peoples.
Theme: workshop-template-b by evanwill is built using Jekyll on GitHub Pages. The site is styled using Bootstrap with FontAwesome icons.
Copyright: © 2022 Griffith University. Apart from Griffith logos or 3rd party material used with permission or under another license, this material is licensed under a CC BY-NC 4.0 license. Portions of this work are adapted from ‘Data Carpentry’, © 2022 The Carpentries, licensed under a CC BY 4.0 license.
Contributors: Sharron Stapleton
Get source code for this online workshop.
Griffith University - CRICOS Provider Number 00233E.