Big Data and Education


A Massive Online Open Textbook (MOOT)
4th Edition
by Ryan Baker
in cooperation between the University of Pennsylvania, Teachers College, Columbia University, and the Columbia Center for New Media Teaching and Learning

As seen on Coursera (2013) and EdX (2015, 2017, 2018)

Chapter 1: Prediction Modeling
Video 1: Introduction [YouTube] [pdf]
Video 2: Regressors [YouTube] [pdf]
Video 3: Classifiers part 1 [YouTube] [pdf]
Video 4: Classifiers part 2 [YouTube] [pdf]
Video 5: Case study in classification [YouTube] [pdf]
Video 6: Advanced Classifiers [YouTube] [pdf]

Chapter 2: Model Goodness and Validation
Video 1: Detector confidence [YouTube] [pdf]
Video 2: Diagnostic metrics: part 1 [YouTube] [pdf]
Video 3: Diagnostic metrics: part 2 [YouTube] [pdf]
Video 4: Diagnostic metrics: part 3 [YouTube] [pdf]
Video 5: Cross-validation and over-fitting [YouTube] [pdf]
Video 6: Types of validity [YouTube] [pdf]

Chapter 3: Behavior Detection
Video 1: Ground Truth [YouTube] [pdf]
Video 2: Data synchronization [YouTube] [pdf]
Video 3: Feature engineering [YouTube] [pdf]
Video 4: Automated feature generation and selection [YouTube] [pdf]
Video 5: Knowledge engineering and data mining [YouTube] [pdf]

Chapter 4: Knowledge Inference
Video 1: Knowledge Inference [YouTube] [pdf]
Video 2: Bayesian Knowledge Tracing [YouTube] [pdf]
Video 3: Performance Factors Analysis [YouTube] [pdf]
Video 4: Item Response Theory [YouTube] [pdf]
Video 5: Advanced Bayesian Knowledge Tracing [YouTube] [pdf]
Video 6: KT-IDEM and DKT [YouTube] [pdf]
Video 7: Memory Algorithms [YouTube] [pdf]

Chapter 5: Relationship Mining
Video 1: Correlation Mining [YouTube] [pdf]
Video 2: Causal Mining [YouTube] [pdf]
Video 3: Association Rule Mining [YouTube] [pdf]
Video 4: Sequential Pattern Mining [YouTube] [pdf]
Video 5: Network Analysis [YouTube] [pdf]

Chapter 6: Visualization
Video 1: Introduction to Educational Visualization and Learning Curves [YouTube] [pdf]
Video 2: Scatter Plots, Heat Maps, and Parameter Space Maps [YouTube] [pdf]
Video 3: State Space Networks [YouTube] [pdf]
Video 4: Other Visualizations [YouTube] [pdf]

Chapter 7: Structure Discovery
Video 1: Clustering [YouTube] [pdf]
Video 2: Cluster Validation [YouTube] [pdf]
Video 3: Advanced Clustering Algorithms [YouTube] [pdf]
Video 4: Applications of Clustering in EDM [YouTube] [pdf]
Video 5: Factor Analysis [YouTube] [pdf]
Video 6: Knowledge Structure: Q-Matrixes [YouTube] [pdf]
Video 7: Knowledge Structures: Other Approaches [YouTube] [pdf]

Chapter 8: Advanced Topics
Video 1: Discovery with Models [YouTube] [pptx]
Video 2: Discovery with Models Case Study [YouTube] [pptx]
Video 3: Text Mining [YouTube] [pptx]
Video 4: Hidden Markov Models [YouTube] [pptx]
Video 5: Conclusions and Future Directions [YouTube] [pptx]

Acknowledgements: Sincerest thanks to Elle Wang, Miggy Andres, Michael Cennamo, Stephanie Ogden, Luc Paquette, Jose Diaz, Michael de Leon, Therese Condit, Megan Carr, students who have recommended additions or corrections, and others.

These materials were created with generous support from the Army Research Laboratory, the National Science Foundation (#DRL-1418378), and the Provost and President of Teachers College, Columbia University. The content represents the views of the author, and does not necessarily represent the views of the National Science Foundation.

Bugs? Errors? Email Ryan Baker.

Please cite this MOOT and MOOC as Baker, R.S. (2018) Big Data and Education. 4th Edition. Philadelphia, PA: University of Pennsylvania.

All materials here copyright Teachers College, Columbia University, the University of Pennsylvania, and Columbia University, 2013-2018.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.