R Data Science Capstone Project

Take on the role of a data scientist working with real-world data sets, and apply the data analysis and visualization skills and techniques you have learned.

There is one session available:

After a course session ends, it will be archived.
Starts Sep 20
Estimated 4 weeks
1–2 hours per week
Self-paced
Progress at your own speed
Free
Optional upgrade available

About this course

In this capstone course, you will apply the data science skills and techniques you learned in the previous courses of the IBM Data Science with R or IBM Data Analytics with Excel and R Professional Certificate programs.

In this capstone project, you will take on the role of a data scientist who has recently joined an organization and is presented with a challenge. Solving it requires data collection, analysis, basic hypothesis testing, visualization, and modeling on real-world datasets. You will collect and understand data from multiple sources, wrangle and prepare data with Tidyverse, perform exploratory data analysis with SQL, Tidyverse, and ggplot2, model data with linear regression, create charts and plots to visualize the data, and build an interactive dashboard.

The project culminates in a presentation of your data analysis report, including an executive summary for the organization's stakeholders.

What you'll learn

  • Prepare data for modeling by handling missing values, formatting and normalizing data, binning, and converting categorical values to numeric values.

  • Perform exploratory data analysis using descriptive statistics, data grouping, data analysis, and correlation statistics.
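To give a rough sense of what the two outcomes above can look like in practice, here is a minimal sketch in base R. It is not taken from the course materials: the `rentals` data frame, its column names, and its values are invented for illustration, and the course itself uses Tidyverse, which would express the same steps differently.

```r
# Hypothetical toy data; names and values are assumptions for illustration only.
rentals <- data.frame(
  temperature = c(10, NA, 20, 30),
  season      = c("winter", "spring", "summer", "summer"),
  count       = c(100, 150, 300, 400),
  stringsAsFactors = FALSE
)

# Handle missing values: replace NA with the mean of the observed values.
rentals$temperature[is.na(rentals$temperature)] <-
  mean(rentals$temperature, na.rm = TRUE)

# Normalize: min-max scaling to the range [0, 1].
rng <- range(rentals$temperature)
rentals$temperature_scaled <- (rentals$temperature - rng[1]) / (rng[2] - rng[1])

# Binning: split the numeric column into three equal-width intervals.
rentals$temperature_bin <- cut(
  rentals$temperature,
  breaks = 3,
  labels = c("low", "medium", "high")
)

# Categorical to numeric: one indicator (dummy) column per season.
season_dummies <- model.matrix(~ season - 1, data = rentals)

# Exploratory analysis: descriptive statistics, grouping, and correlation.
count_summary  <- summary(rentals$count)
mean_by_season <- aggregate(count ~ season, data = rentals, FUN = mean)
temp_count_cor <- cor(rentals$temperature, rentals$count)

print(rentals)
print(season_dummies)
print(mean_by_season)
print(temp_count_cor)
```

Mean imputation and min-max scaling are only one choice each; depending on the data, median imputation or z-score standardization may be more appropriate.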
