Skip to main content

Analyzing Data with R

R is the key that opens the door between the problems you want to solve with data and the answers you need. This course walks you through the process of answering questions through data.

Analyzing Data with R

There is one session available:

After a course session ends, it will be archived.
Starts Aug 23
Estimated 6 weeks
2–3 hours per week
Progress at your own speed
Optional upgrade available

About this course

Skip About this course

The R programming language is purpose-built for data analysis. R is the key that opens the door between the problems you want to solve with data and the answers you need to meet your objectives. This course starts with a question, and then walks you through the process of answering it through data. You will first learn important techniques for preparing (or wrangling) your data for analysis. Then you will learn how to gain a better understanding of your data through exploratory data analysis, helping you to summarize your data and identify relevant relationships between variables that can lead to insights. ****

Once your data is ready to analyze, you will learn how to develop your model, evaluate it and tune its performance. By following this process, you can be sure that your data analysis performs to the standards that you have set, so that you can have confidence in the results. ****

By playing the role of a data analyst who is analyzing airline departure and arrival data to predict flight delays, you will build hands-on experience delivering insights using data. Using an Airline Reporting Carrier On-Time Performance Dataset, you will practice reading data files, preprocessing data, creating models, improving models, and evaluating them to ultimately choose the best one to use.

Note: The prerequisite for this course is basic R programming skills. For example, ensure that you have completed a course like Introduction to R Programming for Data Science from IBM.

At a glance

What you'll learn

Skip What you'll learn
  • Prepare data for analysis by handling missing values, formatting and normalizing data, binning, and turning categorical values into numeric values.

  • Conduct exploratory data analysis using descriptive statistics, data grouping, analysis of variance (ANOVA), and correlation statistics.

  • Develop a predictive model using various regression methods.

  • Evaluate a model for overfitting and underfitting conditions and tune its performance using regularization and grid search.

About the instructors

Interested in this course for your business or team?

Train your employees in the most in-demand topics, with edX for Business.