Professional Certificate in Data Science

Harvard University (HarvardX)
9 Courses for $441.90 USD
Career-oriented learning to develop in-demand skills
Enrolling Now

Learn key data science essentials, including R and machine learning, through real-world case studies to jumpstart your career as a data scientist.

What You Will Learn

  • Fundamental R programming skills
  • Statistical concepts such as probability, inference, and modeling, and how to apply them in practice
  • Experience with the tidyverse, including data visualization with ggplot2 and data wrangling with dplyr
  • Familiarity with essential tools for practicing data scientists, such as Unix/Linux, git and GitHub, and RStudio
  • How to implement machine learning algorithms
  • In-depth knowledge of fundamental data science concepts through motivating real-world case studies

Courses in this Program

Courses are introductory.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Build a foundation in R and learn how to wrangle, analyze, and visualize data. This course covers common programming commands, how to operate on vectors, and when to use advanced functions such as sorting.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Learn basic data visualization principles and how to apply them using ggplot2.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Gain important foundational knowledge in probability theory, essential for a data scientist, as you learn key concepts using a case study on the financial crisis of 2007–2008.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Learn inference and modeling: two of the most widely used statistical tools in data analysis.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Data science projects involve keeping track of many data files and analysis scripts. Learn GitHub, git, Unix/Linux, and RStudio to keep your projects organized and produce reproducible reports.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Learn an indispensable part of data science known as data wrangling, a process that involves converting raw data to formats needed for further analysis.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Learn how to use R to implement linear regression, one of the most common statistical modeling approaches used in data science.

Cost: $49
Effort: 2–4 hours per week, for 4 weeks
Learn the basics of machine learning, the science behind the most popular and successful data science techniques, to build a movie recommendation system.

Cost: $99
Effort: 15–20 hours per week, for 2 weeks
In this capstone course, show what you’ve learned from the Professional Certificate Program in Data Science. You will have the opportunity to create a long project of your own and have it assessed.

Rafael Irizarry
Professor of Biostatistics
Harvard University

The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. The HarvardX Data Science program prepares you with the necessary knowledge base and useful skills to tackle real-world data analysis challenges. The program covers concepts such as probability, inference, regression, and machine learning and helps you develop an essential skill set that includes R programming, data wrangling with dplyr, data visualization with ggplot2, file organization with Unix/Linux, version control with git and GitHub, and reproducible document preparation with RStudio.

In each course, we use motivating case studies, ask specific questions, and learn by answering them through data analysis. Case studies include: Trends in World Health and Economics, US Crime Rates, The Financial Crisis of 2007–2008, Election Forecasting, Building a Baseball Team (inspired by Moneyball), and Movie Recommendation Systems.

Throughout the program, we will be using the R software environment. You will learn R, statistical concepts, and data analysis techniques simultaneously. We believe that you can better retain R knowledge when you learn it by solving a specific problem. Furthermore, HarvardX has partnered with DataCamp for all assignments, which use code-checking technology so that you get hands-on practice during the courses.

  • R is listed as a required skill in 64% of data science job postings, and data scientist was Glassdoor’s Best Job in America in 2016 and 2017. (source: Glassdoor)
  • Companies are leveraging the power of data analysis to drive innovation. Google data analysts use R to track trends in ad pricing and illuminate patterns in search data. Pfizer created customized packages for R so scientists can manipulate their own data.
  • 32% of full-time data scientists started learning machine learning or data science through a MOOC, while 27% were self-taught. (source: Kaggle, 2017)
  • Data scientists are few in number and high in demand. (source: TechRepublic)

This program was supported in part by NIH grant R25GM114818.

HarvardX requires individuals who enroll in its courses on edX to abide by the terms of the edX honor code. HarvardX will take appropriate corrective action in response to violations of the edX honor code, which may include dismissal from the HarvardX course; revocation of any certificates received for the HarvardX course; or other remedies as circumstances warrant. No refunds will be issued in the case of corrective action for such violations. Enrollees who are taking HarvardX courses as part of another program will also be governed by the academic policies of those programs.

HarvardX pursues the science of learning. By registering as an online learner in a HarvardX course, you will also participate in research about learning. Read our research statement to learn more.

Harvard University and HarvardX are committed to maintaining a safe and healthy educational and work environment in which no member of the community is excluded from participation in, denied the benefits of, or subjected to discrimination or harassment in our program. All members of the HarvardX community are expected to abide by Harvard policies on nondiscrimination, including sexual harassment, and the edX Terms of Service. If you have any questions or concerns, please contact [email protected] and/or report your experience through the edX contact form.
