Curating raw data presents issues of technology, speed, and efficiency, as well as broader ethical considerations of what it means to represent and make conclusions about groups and individuals from their data. This course covers the database, semi-structured data, and unstructured data. Students will gain familiarity with underlying data structures; techniques and tools, including acquisition, augmentation, and restructuring; data storage and aggregation; access to parallel and distributed computing; high-volume data, disparate sources, and performance; and streaming data, real time, and dynamic queries.
Engagement (Canvas and in-class discussions) | 10% |
---|---|
Weekly Assignments (10 total) | 50% |
Final assignment | 40% |
Total | 100% |
NoSQL and SQL Data Modeling: Bringing Together Data, Semantics, and Software
Designing Data-Intensive Applications
Database Aesthetics: Art in the Age of Information Overflow on JSTOR