Skip to main content

Programming for Data Science

Learn how to apply fundamental programming concepts, computational thinking and data analysis techniques to solve real-world data science problems.

Programming for Data Science

There is one session available:

After a course session ends, it will be archived.
42,579 already enrolled!
Estimated 10 weeks
8–10 hours per week
Progress at your own speed
Optional upgrade available

About this course

Skip About this course

There is a rising demand for people with the skills to work with Big Data sets and this course can start you on your journey through our Big Data MicroMasters program towards a recognised credential in this highly competitive area.

Using practical activities you will learn how digital technologies work and will develop your coding skills through engaging and collaborative assignments.

You will learn algorithm design as well as fundamental programming concepts such as data selection, iteration and functional decomposition, data abstraction and organisation. In addition to this you will learn how to perform simple data visualisations using Processing and embed your learning using problem-based assignments.

This course will test your knowledge and skills in solving small-scale data science problems working with real-world datasets and develop your understanding of big data in the world around you.

At a glance

  • Language: English
  • Associated programs:

What you'll learn

Skip What you'll learn
  • How to analyse data and perform simple data visualisations using Processing
  • Understand and apply introductory programming concepts such as sequencing, iteration and selection
  • Equip you to study computer science or other programming languages

Section 1: Creative code - Computational thinking
Understanding what you can do with Processing and apply the basics to start coding with colour; Learn how to qualify and express how algorithms work.

Section 2: Building blocks - Breaking it down and building it up
Understand how data can be represented and used as variables and learn to manipulate shape attributes and work with weights and shapes using code.

Section 3: Repetition - Creating and recognising patterns
Explain how and why using repetiton can aid in creating code and begin using repetition to manipulate and visualise data.

Section 4: Choice - Which path to follow
How to create simple and complicated choices and how to create and use decision points in code.

Section 5: Repetition - Going further
Discussing advantages of repetition for data visualisation and applying and reflecting on the power of repetitions in code. Creating curves, shapes and scale data in code.

Section 6: Testing and Debugging
Understanding why and how to comprehensively test your code and debug code examples using line tracing techniques.

Section 7: Arranging our data
Exploring how and why arrays are used to represent data and how static and dynamic arrays can be used to represent data.

Section 8: Functions - Reusable code
Understand how functions work in Processing and demonstate how to deconstruct a problem into useable functions.

Section 9: Data Science in practice
Exploring how data science is used to solve programming problems and how to solve big data problems by applying skills and knowledge learned throughout the course.

Section 10: Where next?
Understand the context of big data in programming and transform a problem description into a complete working solution using the skills and knowledge you've learned throughout the course, and explore how you can expand the skills learned in this course by participating in future courses.

About the instructors

Frequently Asked Questions

Skip Frequently Asked Questions

Question: Why does this course use Processing?
Answer: We have chosen to use Processing within ProgramX as this language gives visual feedback to the learner, is readily accessible (only requires a free install of Processing) and is suitable as a language to teach fundamental programming concepts that can be readily adapted to other languages. Other courses within the Big Data MicroMasters program build upon the programing concepts and are taught using languages selected as appropriate for the teaching and learning context.

Question: This course is self-paced, but is there a course end date?
Answer: Yes. The first course release started on May 15, 2017 and ended on December 1, 2018.
The second release of the course started on December 1, 2018 and ends on December 1, 2020.
The third release of the course starts on March 1, 2019 and ends on December 1, 2020.

Who can take this course?

Unfortunately, learners from one or more of the following countries or regions will not be able to register for this course: Iran, Cuba and the Crimea region of Ukraine. While edX has sought licenses from the U.S. Office of Foreign Assets Control (OFAC) to offer our courses to learners in these countries and regions, the licenses we have received are not broad enough to allow us to offer this course in all locations. edX truly regrets that U.S. sanctions prevent us from offering all of our courses to everyone, no matter where they live.

Interested in this course for your business or team?

Train your employees in the most in-demand topics, with edX for Business.