• Length:
    5 Weeks
  • Effort:
    2–4 hours per week
  • Price:

    FREE
    Add a Verified Certificate for $149 USD

  • Institution
  • Subject:
  • Level:
    Introductory
  • Language:
    English
  • Video Transcript:
    English
  • Course Type:
    Self-paced on your time

Associated Programs:

Prerequisites

High school mathematics and basic statistics concepts

About this course

Skip About this course

Want to learn data processing and interpreting the result you’ve got? This course is for you! Get acquainted with preparing and analyzing large amount of data, as well as data storage fundamentals.

This course is an introduction to initial data processing. We will start with data types and sources, methods of data preparation: cleaning, filling in the missing values, data smoothing and normalization. The course will familiarize you with the descriptive statistics and data visualization methods. You will also learn how to analyze time series and find trends.

Get acquainted with the fundamentals of data storage and access: databases types, relational and NoSQL databases, big data initials.

No previous programming knowledge needed.

What you'll learn

Skip What you'll learn
  • Initial data processing (data cleaning and filling in the missing values)
  • Data smoothing and normalization
  • Data visualization
  • Time series analysis
  • Descriptive statistics
  • Data storage and access by means of relational DBMS
  • NoSQL databases and Big data

Week 1: Data preprocessing. Basic concepts of data processing. Stages of data analysis (collection, sorting, transformation, building models and interpretation). Data measurements and scales. Data types and sources. Data preparing.

Week 2: Data processing tools and visualization. Digital spreadsheets. Data visualization goals. Methods and purposes of correct data visualization.

Week 3: Data processing. Descriptive statistics. Data normalization and transformation. Time-series analysis and forecasting. Types of time-series smoothing. Trends, seasonal time series modelling.

Week 4: Relational databases management systems. Introduction to relational DBMS starting from relational data model. SQL statements and queries creation. Database indexes and transactions requirements.

Week 5: NoSQL. Main characteristics of not only SQL databases. Non-structured and semi-structured data and scalability of NoSQL databases. Types of NoSQL databases: column-oriented, key-value store, document store and graph databases.

Meet your instructors

Natalia Grafeeva
Associate Professor at the Higher School of Digital Culture
ITMO University
Elena Mikhailova
Director of Higher School of Digital Culture
ITMO University
Olga Egorova
Assistant Professor at the Higher School of Digital Culture
ITMO University
Anton Boitsev
Associate Professor at the Higher School of Digital Culture
ITMO University

Pursue a Verified Certificate to highlight the knowledge and skills you gain
$149 USD

View a PDF of a sample edX certificate
  • Official and Verified

    Receive an instructor-signed certificate with the institution's logo to verify your achievement and increase your job prospects

  • Easily Shareable

    Add the certificate to your CV or resume, or post it directly on LinkedIn

  • Proven Motivator

    Give yourself an additional incentive to complete the course

  • Support our Mission

    EdX, a non-profit, relies on verified certificates to help fund free education for everyone globally