About the course

In the era of data-driven decision-making, economic analysis increasingly relies on computational tools and statistical techniques. This course introduces economists to essential concepts in data science, with a focus on databases, basic bayesian, machine learning and deep learning methods relevant to economic research and policy analysis. The objective is to develop proficiency in key data science techniques while fostering a critical understanding of their application in economic contexts.

While few organizations require specialized machine learning engineers, all benefit from having economists with a strong foundation in data science. This enables economists to engage with interdisciplinary teams and effectively apply quantitative methods within a Fourth Industrial Revolution (4IR) framework.

The course is divided into two parts. In Nico Katzke's half, you'll develop a solid foundation in programming concepts and data exploration techniques. Building on this, the second half shifts focus analysing data. We will explore various modeling approaches, with a special emphasis on statistical learning methods.

We kick off the course with SQL fundamentals, then move on to an introduction to Bayesian principles. This is followed by modules on both unsupervised and supervised machine learning, natural language processing (NLP), and finally, deep learning - with a particular focus on sequential models and recurrent neural networks (RNNs). Throughout the course, you'll engage in strong, hands-on practical applications to reinforce your learning.

Presenter

With over 12 years experience working throughout Africa, Hanjo leads our analytics, statistics and engineer aspects on projects. His expertise focuses on how data can be employed to answer impactful and meaningful questions that always aim to put people at the center. He has worked in banking, insurance, housing, agriculture, telecommunications and credit; focusing mostly on desigining and managing the implementation of analytical solutions to deliver real-time insights for the client. Hanjo also has a passion for research and teaching and regularly lectures on the applications of big data to drive changes in traditional economic thinking. Hanjo holds a PhD in Economics from the University of Stellenbosch. His thesis focused on the application of natural language processing and statistical learning applied within traditional economic frameworks.

Skills

  • Databases

    Excel sheets aren't databases. We help you make the transition

  • SQL

    Skill for life

  • Basic Bayesian

    What is MCMC and how to {brms}

  • Machine Learning

    Unsupervised & Supervised

  • Deep Learning

    Focus on sequential modeling

Assessment

The assessment for this part of the module takes the form of a semester project. Students are expected to craft their own research question using four unique data sets. The submission date for the project is the 20th of June at midnight. Details of the project will be provided at the start of the semester. To pass the module, a final mark of at least 50% has to be obtained. To obtain a distinction in this module, a minimum final mark of 75% is required

Summary

  • ~8 Sections
  • 4 Months
  • 3 Languages

  • Fun