Mining Massive Datasets
About this courseSkip About this course
The course is based on the text Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, and Jeff Ullman, who by coincidence are also the instructors for the course.
The book is published by Cambridge Univ. Press, but by arrangement with the publisher, you can download a free copy Here. The material in this on-line course closely matches the content of the Stanford course CS246.
The major topics covered include: MapReduce systems and algorithms, Locality-sensitive hashing, Algorithms for data streams, PageRank and Web-link analysis, Frequent itemset analysis, Clustering, Computational advertising, Recommendation systems, Social-network graphs, Dimensionality reduction, and Machine-learning algorithms.
At a glance
- Institution: StanfordOnline
- Subject: Computer Science
- Level: Advanced
The course is intended for graduate students and advanced undergraduates in Computer Science. At a minimum, you should have had courses in Data structures, Algorithms, Database systems, Linear algebra, Multivariable calculus, and Statistics.
- Language: English
- Video Transcript: English
What you'll learnSkip What you'll learn
- MapReduce systems and algorithms
- Locality-sensitive hashing
- Algorithms for data streams
- PageRank and Web-link analysis
- Frequent itemset analysis
- Computational advertising
- Recommendation systems
- Social-network graphs
- Dimensionality reduction
- Machine-learning algorithms
About the instructors
Frequently Asked QuestionsSkip Frequently Asked Questions
How much work is expected?
The amount of work will vary, depending on your background and the ease with which you follow mathematical and algorithmic ideas. However, 10 hours per week is a good guess.