Ir al contenido principal

StanfordOnline: Statistical Learning with Python

Learn some of the main tools used in statistical modeling and data science. We cover both traditional as well as exciting new methods, and how to use them in Python.

Statistical Learning with Python
11 semanas
3–5 horas por semana
A tu ritmo
Avanza a tu ritmo
Gratis
Verificación opcional disponible

Hay una sesión disponible:

¡Ya se inscribieron 10,083! Una vez finalizada la sesión del curso, será archivadoAbre en una pestaña nueva.
Comienza el 17 may

Sobre este curso

Omitir Sobre este curso

This is an introductory-level course in supervised learning, with a focus on regression and classification methods. The syllabus includes: linear and polynomial regression, logistic regression and linear discriminant analysis; cross-validation and the bootstrap, model selection and regularization methods (ridge and lasso); nonlinear models, splines and generalized additive models; tree-based methods, random forests and boosting; support-vector machines; neural networks and deep learning; survival models; multiple testing. Some unsupervised learning methods are discussed: principal components and clustering (k-means and hierarchical).

This is not a math-heavy class, so we try and describe the methods without heavy reliance on formulas and complex mathematics. We focus on what we consider to be the important elements of modern data science. Computing in this course is done in Python. There are lectures devoted to Python, giving tutorials from the ground up, and progressing with more detailed sessions that implement the techniques in each chatper. We also offer the separate and original version of this course called Statistical Learning with R – the chapter lectures are the same, but the lab lectures and computing are done using R.

The lectures cover all the material in An Introduction to Statistical Learning, with Applications in Python by James, Witten, Hastie, Tibshirani, and Taylor (Springer, 2023. The pdf for this book is available for free on the book website.

De un vistazo

  • Language English
  • Video Transcript English
  • Associated skillsSupport Vector Machine, Statistics, Unsupervised Learning, Basic Math, Principal Component Analysis, Lasso (Programming Language), Polynomial Regression, Data Science, Supervised Learning, Logistic Regression, Deep Learning, K-Means Clustering, Boosting, Linear Discriminant Analysis, Statistical Learning Theory, Python (Programming Language), Bootstrap (Front-End Framework), Lecturing, Random Forest Algorithm, Statistical Modeling, Artificial Neural Networks

Lo que aprenderás

Omitir Lo que aprenderás
  • Overview of statistical learning
  • Linear regression
  • Classificaiton
  • Resampling methods
  • Linear model selection and regularization
  • Moving beyond linearity
  • Tree-based methods
  • Support vector machines
  • Deep learning
  • Survival modeling
  • Unsupervised learning
  • Multiple testing

Preguntas frecuentes

Omitir Preguntas frecuentes

Do I need to buy a textbook?

The book is nice to own, and is available from Amazon and other booksellers. However a free online version of An Introduction to Statistical Learning, with Applications in Python by James, Witten, Hastie, Tibshirani, and Taylor (Springer, 2023) is available from that website. Springer has agreed to this, so no need to worry about copyright. Of course you may not distribute printed versions of that pdf file.

Are Python and Jupyter Notebook available for free?

Yes. You get Python for free from https://python.org/downloads/. Typically it installs with a click. You get Jupyter Notebook from https://jupyter.org/install, also for free, and a similarly easy install.

How many hours of effort are expected per week?

We anticipate it will take approximately 3-5 hours per week to go through the materials and exercises in each section.

¿Te interesa este curso para tu negocio o equipo?

Capacita a tus empleados en los temas más solicitados con edX para Negocios.