Programming for Data Analysis
Instructor: Amir Mahdi Sadeghzadeh | Certificate: Official (bilingual) |
Term: Summer 2025 | Prerequisite: Python Programming |
Schedule: Sundays, 17:00–20:00 | Online Class: Online Class |
General Objective
The goal of this course is to master the Python programming language and use it for storing, analyzing, and visualizing data.
Topics
- Review of Python Programming (2 sessions)
- Working with Python in IPython and Jupyter interactive environments
- Modular programming and use of libraries
- Generating random numbers and Monte Carlo simulation
- Data Storage and Handling (5 sessions)
- Data storage structures
- Organizing data using dataframes
- Relational and non-relational databases, and data warehouses
- Data manipulation with Pandas
- Data Preparation (2 sessions)
- Data formatting, normalization, and binning
- Filling in missing data
- Data Analysis (5 sessions)
- Understanding data distribution
- Creating data pipelines
- Applying analysis techniques to real datasets using Numpy and Scipy libraries
- Data Visualization and Charting (5 sessions)
- Exploratory data analysis
- Plotting in Python using Matplotlib, Seaborn, and Plotly libraries
- Various data visualization techniques
- Important considerations for effective data visualization
Assessment
- Exercises: 20%
- Quizzes: 20%
- Final Exam: 60%
References
- Wes McKinney. Python for Data Analysis: Data Wrangling with pandas. NumPy, and Jupyter, 3rd Edition, 2022.