INFO 5001: Computing for Information Science

Modified

August 20, 2025

This page outlines the topics, content, and assignments for the semester. The schedule, including the timeline of topics and assignments, may be updated as the semester progresses.

WEEK DATE TOPIC PREPARE MATERIALS DUE
1 Tue, Aug 26 Welcome to INFO 5001 👩‍💻 Log in to Cornell’s GitHub server



Thu, Aug 28 Grammar of graphics 📗 r4ds - intro
📗 r4ds - ch 1.1-.3, 1.7
📘 ims - ch 1



Fri, Aug 29 Hello data science!


2 Tue, Sep 2 Visualizing various types of data 📘 ims - ch 4
📘 ims - ch 5



Wed, Sep 3


HW 00 at 11:59pm

Thu, Sep 4 Grammar of data wrangling 📗 r4ds - ch 3


Fri, Sep 5 Data visualization


3 Tue, Sep 9 Working with relational data 📗 r4ds - ch 19


Thu, Sep 11 Tidying data To be posted


Fri, Sep 12 Git workflows (basics + merge conflicts) To be posted

4 Tue, Sep 16 Data types and classes To be posted


Thu, Sep 18 Importing and recoding data To be posted


Fri, Sep 19
To be posted

5 Tue, Sep 23 Databases + SQL To be posted


Thu, Sep 25 Getting data from the web: Scraping To be posted


Fri, Sep 26 Quiz 01 To be posted

6 Tue, Sep 30 Functions To be posted


Thu, Oct 2 Iteration To be posted


Fri, Oct 3 Develop project proposals To be posted

7 Tue, Oct 7 Getting data from the web: APIs To be posted


Thu, Oct 9 No class (out-of-town) To be posted


Fri, Oct 10 No class (out-of-town) To be posted

8 Tue, Oct 14 No class (Fall Break) To be posted


Thu, Oct 16 Rectangling data To be posted


Fri, Oct 17 Functions + iteration To be posted

9 Tue, Oct 21
To be posted


Thu, Oct 23 Reproducible project-based workflows To be posted


Fri, Oct 24 Quiz 02 To be posted

10 Tue, Oct 28 Introduction to machine learning To be posted


Thu, Oct 30 Build better training data To be posted


Fri, Oct 31 Git workflows (branches + PRs) To be posted

11 Tue, Nov 4 Tree-based inference and hyperparameter optimization To be posted


Thu, Nov 6 Interactive web applications using Shiny To be posted


Fri, Nov 7
To be posted

12 Tue, Nov 11 An introduction to LLMs To be posted


Thu, Nov 13 Prompt design To be posted


Fri, Nov 14
To be posted

13 Tue, Nov 18 Structured data To be posted


Thu, Nov 20 Tool/function calling To be posted


Fri, Nov 21 Quiz 03 To be posted

14 Tue, Nov 25 Project peer review To be posted


Thu, Nov 27 No class (Thanksgiving Break) To be posted


Fri, Nov 28 No class (Thanksgiving Break) To be posted

15 Tue, Dec 2 Improving data communication To be posted


Thu, Dec 4 Wrap-up: Where to go from here To be posted


Fri, Dec 5 Project presentations To be posted