Intro to Data Science

DSCI 101 Introduction to Data Science

Instructor: Lorenzo Luzi

Course Description:

This is an introductory level course where students learn about fundamentals and principles in data science by completing data analysis projects using Python. Each student will select a real-world dataset and data science challenge aligned with their interests.

During the semester, class will meet for three times each week. Lectures will cover course material and introduce tools students need to complete the weekly assignments which allow students to apply the knowledge and techniques to their project. One lecture each week will be devoted to students working on these assignments as a team under the guidance of the instructor. These assignments are designed to assess students’ understanding and check their progress as they move forward along the data science pipeline. As they complete certain project milestones they will receive feedback and guidance for the next step.

At the end of the semester, each student will produce a final project report and give a formal presentation on their work. Course content includes foundations in managing and analyzing data; data mining techniques and tools; exploratory data analysis and data visualization; applied statistical methods and inference; machine learning algorithms and predictive models.  

This course will use Python and also teach fundamentals of Python programming

Course Objectives:

Students completing this course will be able to:

- Define and explain key concepts in the data science pipeline and work to complete data science life cycle and analyze real-world data.
- Gain fluency in basic programming skills in Python with a focus on statistical modeling and machine learning.
- Use applied statistical knowledge to analyze data, derive data summaries, build predictive models, and make scientific inferences.
- Interpret modeling results and communicate their findings to both a general and a technical audience.

Prerequisites:


This is a non-calculus based course with no prior background in statistics or programming required.

Meeting Times:

Class time: MW 4:00PM - 5:15PM @ DCH Sym II Lab
Lab sessions: T 4:00PM - 6:00PM @ DCH Sym II Lab

Questions?

Contact Lorenzo Luzi at luzi@rice.edu.