INFO 5001: Computing for Information Science

Modified

September 26, 2025

This page contains an outline of the topics, content, and assignments for the semester. Note that this schedule will be updated as the semester progresses and the timeline of topics and assignments might be updated throughout the semester.

WEEK DATE TOPIC PREPARE MATERIALS DUE
1 Tue, Aug 26 Welcome to INFO 5001 πŸ‘©β€πŸ’» Login to Cornell’s GitHub server
πŸ“½οΈ slides 01
⌨️ ae 00


Thu, Aug 28 Grammar of graphics πŸ“— r4ds - intro
πŸ“— r4ds - ch 1.1-.3, 1.7
πŸ“˜ ims - ch 1
πŸ“½οΈ slides 02
⌨️ ae 00
βœ… ae 00


Fri, Aug 29 Hello data science!
⌨️ hw 00
2 Tue, Sep 2 Visualizing various types of data πŸ“˜ ims - ch 4
πŸ“˜ ims - ch 5
πŸ“½οΈ slides 03
⌨️ ae 01
βœ… ae 01


Wed, Sep 3

βœ… hw 00 HW 00 at 11:59pm

Thu, Sep 4 Grammar of data wrangling πŸ“— r4ds - ch 3 πŸ“½οΈ slides 04
⌨️ ae 02
βœ… ae 02


Fri, Sep 5 Data visualization
⌨️ hw 01
3 Tue, Sep 9 Working with relational data πŸ“— r4ds - ch 19 πŸ“½οΈ slides 05
⌨️ ae 03
βœ… ae 03


Wed, Sep 10

βœ… hw 01 HW 01 at 11:59pm

Thu, Sep 11 Tidying data πŸ“— r4ds - ch 5 πŸ“½οΈ slides 06
⌨️ ae 04
βœ… ae 04


Fri, Sep 12

⌨️ hw 02
4 Tue, Sep 16 Data types and classes πŸ“— r4ds - ch 4
πŸ“— r4ds - ch 12 (read 12.1-12.2, skim the rest)
πŸ“— r4ds - ch 14.1-.3
πŸ“— r4ds - ch 16
πŸ“½οΈ slides 07
⌨️ ae 05
βœ… ae 05


Wed, Sep 17

βœ… hw 02 HW 02 at 11:59pm

Thu, Sep 18 Importing and recoding data πŸ“— r4ds - ch 7
πŸ“— r4ds - ch 17.1 - 17.3
πŸ“½οΈ slides 08
⌨️ ae 06
βœ… ae 06


Fri, Sep 19 No lab meeting
⌨️ hw 03
5 Tue, Sep 23 Databases + SQL πŸ“— r4ds - ch 21 πŸ“½οΈ slides 09
⌨️ ae 07
βœ… ae 07


Wed, Sep 24

βœ… hw 03 HW 03 at 11:59pm

Thu, Sep 25 Getting data from the web: Scraping πŸ“— r4ds - ch 24
πŸ‘©β€πŸ’» Install SelectorGadget
πŸ“½οΈ slides 10
⌨️ ae 08
βœ… ae 08


Fri, Sep 26 Quiz 01
πŸ“œ quiz 01
6 Tue, Sep 30 Functions πŸ“— r4ds - ch 25.2-.3


Thu, Oct 2 Iteration πŸ“— r4ds - ch 26
πŸ“— r4ds - ch 27 (skim for familiarity with base R syntax)



Fri, Oct 3 Git workflows (basics + merge conflicts) πŸ“„ Happy Git with R - ch 1

7 Tue, Oct 7 Getting data from the web: APIs πŸ“„ Application Programming Interface
πŸ“„ Obtaining World Bank indicators
πŸ“„ Securely storing API keys



Thu, Oct 9 No class (out-of-town)



Fri, Oct 10 No class (out-of-town)


8 Tue, Oct 14 No class (Fall Break)



Thu, Oct 16 Rectangling data πŸ“— r4ds - ch 23


Fri, Oct 17 Functions + iteration


9 Tue, Oct 21 Reproducible reporting with Quarto πŸ“— r4ds - ch 28


Thu, Oct 23 Reproducible project-based workflows πŸ“™ wtfR - ch 2-4, 7


Fri, Oct 24 Quiz 02


10 Tue, Oct 28 Introduction to machine learning πŸ“• tmwr - ch 4-6, 10


Thu, Oct 30 Build better training data πŸ“• tmwr - ch 7-9


Fri, Oct 31 Git workflows (branches + PRs)


11 Tue, Nov 4 Tree-based inference and hyperparameter optimization πŸ“• tmwr - ch 12-14


Thu, Nov 6 Interactive web applications using Shiny To be posted


Fri, Nov 7
To be posted

12 Tue, Nov 11 An introduction to LLMs To be posted


Thu, Nov 13 Prompt design To be posted


Fri, Nov 14
To be posted

13 Tue, Nov 18 Structured data To be posted


Thu, Nov 20 Tool/function calling To be posted


Fri, Nov 21 Quiz 03 To be posted

14 Tue, Nov 25 Project peer review To be posted


Thu, Nov 27 No class (Thanksgiving Break) To be posted


Fri, Nov 28 No class (Thanksgiving Break) To be posted

15 Tue, Dec 2 Improving data communication To be posted


Thu, Dec 4 Wrap-up: Where to go from here To be posted


Fri, Dec 5 Project presentations To be posted