Welcome to my teaching page! My twin brother Shervine and I created this set of illustrated study guides covering the content of the Data Science part of the 15.003 Software Tools class that I have been teaching at MIT since 2020. They can (hopefully!) be useful to all students of this course as well as to anyone interested in Data Science.
Data retrieval
SQL
• Filtering, conditions and data types
• Types of joins (inner, full, left, right, cross)
• Aggregations, window functions
• Table manipulation
Data manipulation
R
• Filtering, conditions and data types
• Types of joins (inner, full, left, right, cross)
• Aggregations, window functions
• Data frame transformations
Python
• Filtering, conditions and data types
• Types of joins (inner, full, left, right, cross)
• Aggregations, window functions
• Data frame transformations
R-Python conversion
Conversion made easy between R (tidyr, dplyr, lubridate) and Python (pandas, numpy, datetime)
Data visualization
R
• Scatterplots, line plots, histograms
• Boxplots, maps
• Customized legend
Python
• Scatterplots, line plots, histograms
• Boxplots, maps
• Customized legend
R-Python conversion
Conversion made easy between R (ggplot2) and Python (matplotlib, seaborn)