Skip to main content

HarvardX: Digital Humanities in Practice: From Research Questions to Results

Combine literary research with data science to find answers in unexpected ways. Learn basic coding tools to help save time and draw insights from thousands of digital documents at once.

10 weeks
2–3 hours per week
Progress at your own speed
Optional upgrade available

There is one session available:

After a course session ends, it will be archivedOpens in a new tab.
Starts Apr 17
Ends Aug 21

About this course

Skip About this course

From the printing press to the typewriter, there is a long history of scholars adapting to new technologies. In the last forty or fifty years, the most significant advance has been the digitization of books. We now have whole libraries—centuries of history, literature, and philosophy—available instantaneously. This new access is a wonderful benefit, but it can also be overwhelming. If you have hundreds of thousands of books available to you in an instant, where do you even start? With a bit of elementary code, you can study all of these books at once, and derive new sorts of insights.

Computation is changing the very nature of how we do research in the humanities. Tools from data science can help you to explore the record of human culture in ways that just wouldn’t have been possible before. You’re more likely to reach out to others, to work across disciplines, and to assemble teams. Whether you're a student wanting to expand your skillset, a librarian supporting new modes of research, or a journalist who has just received a massive cache of leaked e-mails, this course will show you how to draw insights from thousands of documents at once. You will learn how, with a few simple lines of code, to make use of the metadata—the information about our objects of study—to zero in on what matters most, and visualize your results so that you can understand them at a glance.

In this course, you’ll work on building parts of a search engine, one tailor-made to the needs of academic research. Along the way, you'll learn the fundamentals of text analysis: a set of techniques for manipulating the written word that stand at the core of the digital humanities.

By the end of the course, you will be able to apply what you learn to what interests you most, be it contemporary speeches, journalism, caselaw, and even art objects. This course will analyze pieces of 18th-century literature, showing you how these methods can be applied to philosophical works, religious texts, political and historical records – material from across the spectrum of humanistic inquiry.

Combine your traditional research skills with data science to find answers you never might have expected.

At a glance

  • Language: English
  • Video Transcript: English
  • Associated skills:Research, Printing Press, Digital Humanities, Typewriters, Public Speaking, Text Mining, Journalism, Electronic Documents, Data Science

What you'll learn

Skip What you'll learn

By the end of this course, learners will:

  • Understand which digital methods are most suitable to meaningfully analyze large databases of text
  • Identify the resources needed to complete complex digital projects and learn about their possible limitations
  • Download existing datasets and create new ones by scraping websites and using APIs
  • Enrich metadata and tag text to optimize the results of your analysis
  • Analyze thousands of books with digital methods such as topic modeling, vector models, and concept search
  • Test your knowledge by writing and editing code in Python, and use these skills to explore new methods of search

Interested in this course for your business or team?

Train your employees in the most in-demand topics, with edX For Business.