📜 Course Description

Curating raw data raises practical issues of technology, speed, and efficiency, as well as broader ethical questions about what it means to represent groups and individuals through their data and to draw conclusions about them. This course covers databases, semi-structured data, and unstructured data. Students will gain familiarity with underlying data structures; techniques and tools, including acquisition, augmentation, and restructuring; data storage and aggregation; access to parallel and distributed computing; high-volume data, disparate sources, and performance; and streaming data, real-time processing, and dynamic queries.

🏆 Grading

Engagement (Canvas and in-class discussions): 10%
Weekly assignments (10 total): 50%
Final assignment: 40%
Total: 100%

📖 Required and Recommended Reading

"Raw Data" Is an Oxymoron

NoSQL and SQL Data Modeling: Bringing Together Data, Semantics, and Software

Designing Data-Intensive Applications

Database Aesthetics: Art in the Age of Information Overflow (available on JSTOR)

The Stuff of Bits

📆 Course Outline
