Teaching


15.003 ― Data Science Tools
Star


Welcome to my teaching page! My twin brother Shervine and I created this set of illustrated study guides covering the content of the Data Science part of the 15.003 Software Tools class that I have been teaching at MIT since 2020. They can (hopefully!) be useful to all students of this course as well as to anyone interested in Data Science.


Data retrieval


SQL
  • • Filtering, conditions and data types
  • • Types of joins (inner, full, left, right, cross)
  • • Aggregations, window functions
  • • Table manipulation

Data manipulation


R
  • • Filtering, conditions and data types
  • • Types of joins (inner, full, left, right, cross)
  • • Aggregations, window functions
  • • Data frame transformations
Python
  • • Filtering, conditions and data types
  • • Types of joins (inner, full, left, right, cross)
  • • Aggregations, window functions
  • • Data frame transformations
R-Python conversion
  • Conversion made easy between R (tidyr, dplyr, lubridate) and Python (pandas, numpy, datetime)

Data visualization


R
  • • Scatterplots, line plots, histograms
  • • Boxplots, maps
  • • Customized legend
Python
  • • Scatterplots, line plots, histograms
  • • Boxplots, maps
  • • Customized legend
R-Python conversion
  • Conversion made easy between R (ggplot2) and Python (matplotlib, seaborn)

Engineering tips


Git, Bash
  • • Version control with Git
  • • Working with the terminal with Bash
  • • Mastering editors with Vim