Datasets
Conferences
- ConfSearch
- Ranking: Tsinghua, CCF, by Osmar Zaiane, CORE
A curated list of computer science courses
- UMASS COMPSCI 514: Algorithms for Data Science by Cameron Musco
- CMU Database Systems (15-445/645) by Andy Pavlo
- CMU Advanced Database Systems (15-721) by Andy Pavlo
- UIUC Algorithms for Big Data (CS498ABD) by Chandra Chekuri
- CMU Algorithms for Big Data (15-859) by David Woodruff
- Harvard Algorithms for Big Data (CS 229r) by Jelani Nelson
- Columbia Algorithms for Massive Data (COMS E6998-9) by Alexandr Andoni
- Utah Models of Computation for Massive Data (CS 7960) by Jeff Phillips
- NUS Algorithms at Scale (CS5234) by Seth Gilbert
- Dartmouth Data Stream Algorithms (CS 35/135) by Amit Chakrabarti
- MIT Advanced Data Structures (6.851) by Erik Demaine
- NTU, Taiwan Machine Learning (2021, Spring) by Hung-yi Lee
- UCR Algorithm Engineering (CS 260) by Yan Gu
- CMU 15-451/651: Algorithms by Anupam Gupta and David Woodruff
A curated list of technical books
Mathematics, Algorithms and Data Structures
- Mathematical Foundations for Data Analysis
- Mathematics for Machine Learning
- Foundations of Data Science
- Foundations of Machine Learning
- Introduction to Linear Algebra
- Randomized Algorithms
- Sketching as a Tool for Numerical Linear Algebra
- High-Dimensional Probability: An Introduction with Applications in Data Science
Database, Data Management, Parallel and Distributed Computing
- Designing Data-Intensive Applications
- Shared-Memory Parallelism Can Be Simple, Fast, and Scalable
- Small Summaries for Big Data
Data Mining and Machine Learning
- Mining of Massive Datasets
- Probabilistic Machine Learning: An Introduction
- Probabilistic Machine Learning: Advanced Topics
- The Elements of Statistical Learning
- Pattern Recognition and Machine Learning
A curated list of data management papers
- DBMS Indexology
- Papers on Graph Analytics
- Awesome Database Learning
- Quals Reading List in Databases
- Paper Reading List by Luna