• Duración:
    5 semanas
  • Dedicación:
    2–4 horas por semana
  • Precio:

    GRATIS
    Agregar un Certificado Verificado por $149 USD

  • Institución
  • Tema:
  • Nivel:
    Introductory
  • Idioma:
    English
  • Transcripción de video:
    English
  • Tipo de curso:
    A tu ritmo

Prerrequisitos

High school math, basic knowledge of calculus

Sobre este curso

Omitir Sobre este curso

Want to learn how to analyze the huge amounts of data? In this course you will learn modern methods of machine learning to help you choose the right methods to analyze your data and interpret the results correctly.

This course is an introduction to machine learning. It will cover the modern methods of statistics and machine learning as well as mathematical prerequisites for them. We will discuss the methods used in classification and clustering problems. You will learn different regression methods.

Various examples and different software applications are considered in the course. You will get not only the theoretical prerequisites, but also practical hints on how to work with your data in MS Azure.

Lo que aprenderás

Omitir Lo que aprenderás
  • Introduction to machine learning and mathematical prerequisites
  • Regression types (linear, polynomial, multi variable regression)
  • Classification methods: Logistic regression, Naïve Bayes and K-nearest neighbours
  • Clustering methods: hierarchical and k-means clustering

Plan de estudios

Omitir Plan de estudios

Week 1: Introduction to machine learning and mathematical prerequisites. The concepts of machine and statistical learning are introduced. We discuss the main branches of ML such as supervised, unsupervised and reinforcement learning, give specific examples of problems to be solved by the described approaches. Besides, we show that ML is not as powerful as one can think. Finally, we remind you of some basic concepts of mathematics used in further lectures.

Week 2: Regression (linear, polynomial, multivariable regression). Regression problem is one of the main problems in supervised learning. We start with the heuristic approach trying to solve a very practical problem and come to rigorous mathematical construction of the simple linear regression model. We go further and describe statistical properties of the model: confidence intervals for the model's parameters, hypothesis testing of linear dependence. Finally, we come to a so-called multivariable linear and polynomial regressions and show some examples and applications.

Week 3: Logistic regression. The second branch of supervised learning is a classification problem. We deal with a two-class logistic regression and emphasise that it is not a regression at all. Then why is it called so? It's construction is closely connected with linear regression described in the 2nd lecture. We remind you a maximum likelihood estimation method and its applications to logistic regression. Finally, we discuss some applications of the logistic regression to a football game predictions and describe ROC analysis or a quality testing approach for the described model.

Week 4: Naïve Bayes and K-nearest neighbors. In this lecture we continue with classification problem. We introduce a so-called naive Bayes approach to classification widely used in e-mail spam recognition until 2010. Then we come to a multi-class classification using K-nearest neighbors method. What are the metrics that we will use? How does a particular metric influence the result? What is K and how do you choose it solving a particular problem? These are the questions that are rigorously discussed in the lecture.

Week 5: Clustering methods: hierarchical and k-means clustering. Clusterization problem is at the heart of unsupervised learning. We have a lot of data and nothing else: we don't know the amount of classes, similarities in objects, we know almost nothing. We show how to establish some order in the given chaotic data using hierarchical clustering method and k-means approach. How to establish the initial clusters, what metric to choose, what actually means "close and far" objects? These questions are discussed in the lecture.

Conoce a tus instructores

Anton Boitsev
Assistant Professor at the Higher School of Digital Culture
ITMO University
Aleksei Romanov
Assistant Professor at the Higher School of Digital Culture
ITMO University
Dmitry Volchek
Assistant Professor at the Higher School of Digital Culture
ITMO University
Elena Mikhailova
Director of Higher School of Digital Culture
ITMO University

Obtén un Certificado Verificado para destacar los conocimientos y las habilidades que adquieras
$149 USD

Ver un modelo de certificado de edX en PDF
  • Oficial y verificado

    Obtén un certificado con la firma del instructor y el logotipo de la institución para demostrar tus logros y aumentar las posibilidades de conseguir trabajo

  • Fácil de compartir

    Agrega el certificado a tu currículum o publícalo directamente en LinkedIn

  • Incentivo comprobado

    El certificado te da un motivo más para completar el curso

  • Apoya nuestra labor

    edX, una organización sin fines de lucro, se sustenta con los certificados verificados para financiar la educación gratuita para todo el mundo

¿Quién puede hacer este curso?

Lamentablemente, las personas de uno o más de los siguientes países o regiones no podrán registrarse para este curso: Irán, Cuba y la región de Crimea en Ucrania. Si bien edX consiguió licencias de la Oficina de Control de Activos Extranjeros de los EE. UU. (U.S. Office of Foreign Assets Control, OFAC) para ofrecer nuestros cursos a personas en estos países y regiones, las licencias que hemos recibido no son lo suficientemente amplias como para permitirnos dictar este curso en todas las ubicaciones. edX lamenta profundamente que las sanciones estadounidenses impidan que ofrezcamos todos nuestros cursos a cualquier persona, sin importar dónde viva.