Transformations
LO3
- Students should demonstrate the ability to adapt a given Jupyter Notebook to transform a novel dataset
LO5
- Students should be able to identify appropriate tools for ‘data wrangling’ and explain the purpose of the tools
Once you have a handle on your data and are starting to understand the changes that need to be made, you can start transforming the data.
🛠 Tools
- Python / Jupyter
📚 Resources
- Alakh Sethi, March 4, 2020. Learn how to use the Transform Function in Pandas (with Python Code). Analytics Vidhya
- @Kartikaybhutani, August 23, 2019. Filtering Data with Pandas .query() Method. GeeksforGeeks
- Hassan Syyid, August 10, 2019. Python Data Transformation Tools for ETL. Towards Data Science
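As a quick orientation to the two pandas features named in the first two resources, here is a minimal sketch. The DataFrame, its column names (`region`, `sales`), and the values are made up for illustration and are not taken from any of the linked articles.

```python
import pandas as pd

# Hypothetical sample data; column names and values are illustrative only.
df = pd.DataFrame({
    "region": ["North", "North", "South", "South"],
    "sales": [100, 300, 50, 150],
})

# .query() filters rows using a boolean expression written as a string.
high_sales = df.query("sales >= 100")

# .transform() returns a result aligned to the original rows, so a
# per-group statistic can be stored as a new column beside each row.
df["region_mean"] = df.groupby("region")["sales"].transform("mean")
df["share_of_region"] = df["sales"] / df.groupby("region")["sales"].transform("sum")

print(high_sales)
print(df)
```

The difference from a plain `groupby(...).mean()` is that `transform` keeps one value per original row rather than one per group, which is what makes the per-row share calculation possible.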
☑️ Tasks
- Adapt the transformations Jupyter Notebook to your dataset to clean the data and create summary tables:
- From Richard Peterson’s GitHub:
- https://github.com/rchardptrsn/Pandas-Data-Transformation/blob/master/Pandas%20Data%20Transformation.ipynb
- Review building a Jupyter Notebook for data transformations:
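For the first task above, a cleaning step followed by a summary table might look roughly like the sketch below. It is not taken from the linked notebook: the data, the column names (`City `, `Temp C`), and the `clean` helper are hypothetical, and your own dataset will need different rules.

```python
import pandas as pd

# Hypothetical messy data: inconsistent headers, stray whitespace,
# mixed capitalisation, a non-numeric placeholder, and a missing value.
raw = pd.DataFrame({
    "City ": [" leeds", "Leeds", "york", "York ", None],
    "Temp C": ["12.5", "13.5", "n/a", "10.0", "9.0"],
})


def clean(df):
    """Return a tidied copy of df (illustrative helper, not a library function)."""
    out = df.copy()
    # Normalise column names: trim, lower-case, spaces to underscores.
    out.columns = out.columns.str.strip().str.lower().str.replace(" ", "_")
    # Normalise text values so that " leeds" and "Leeds" match.
    out["city"] = out["city"].str.strip().str.title()
    # Convert to numbers; anything unparseable (such as "n/a") becomes NaN.
    out["temp_c"] = pd.to_numeric(out["temp_c"], errors="coerce")
    # Drop rows that are missing either field.
    return out.dropna(subset=["city", "temp_c"]).reset_index(drop=True)


tidy = clean(raw)

# Summary table: one row per city with the row count and mean temperature.
summary = tidy.groupby("city")["temp_c"].agg(["count", "mean"]).reset_index()
print(summary)
```

Keeping the cleaning rules in one function makes it easier to rerun the whole notebook from the raw file whenever a rule changes.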
Optional Submissions
- Submit your data transformation Jupyter Notebook to GitHub Classroom: Moodle Submission - Data Wrangling GitHub Classroom
- Indicate whether this submission is for feedback or automated grading