Appendix A — Reading List

Note

Effective Data Science is still a work-in-progress. This chapter is largely complete and just needs final proof reading.

If you would like to contribute to the development of EDS, you may do so at https://github.com/zakvarty/data_science_notes.

This reading list is organised by topic, according to each week of the course. These are split into several categories.

A.1 Effective Data Science Workflows

Core Materials

Reference Materials

Materials of Interest

  • Broman et al (2017). Recommendations to Funding Agencies for Supporting Reproducible Research. American Statistical Association.

A.2 Aquiring and Sharing Data

Core Materials

Reference Materials

Materials of Interest

This covers importing data from spreadsheets, databases, using Apache Arrow and importing hierarchical data as well as web scraping.

A.3 Data Exploration and Visualisation

Core Materials

Chapters 3 and 4 are core reading, respectively introducing data frame manipulation with {dplyr} and an example workflow for exploratory data analysis. Other chapters may be useful as references.

Referene Materials

Materials of Interest

A.4 Preparing for Production

Core Materials

Reference Materials

Materials of Interest

A.5 Data Science Ethics

Core Materials

Reference Materials

Materials of Interest