Course schedule
| Topic | Estimated Date | Time |
|---|---|---|
| π Introduction + Texevier Test | 2025-03-14 (Friday) | 11:30-12:30 |
| β Databases and SQL | 2025-03-17 (Monday) | 16:30-18:30 |
| β° Recess | 2025-03-21 to 2025-04-07 | |
| π² Basic Bayesian | 2025-04-07 (Monday) | 11:30-13:30 |
| π NLP: Text Analysis | 2025-04-14 (Monday) | 16:30-18:30 |
| π§βπ« ML: Supervised | 2025-04-25 (Friday) | 11:30-13:30 |
| β ML: Unsupervised | 2025-05-02 (Friday) | 11:30-13:30 |
| π€ Deep Learning: RNN | 2025-05-05 (Monday) | 16:30-18:30 |
| π Exam | 2025-06-17 to 2025-06-19 | |
| π Project | 2025-06-20 |
Materials
| Session | Learning Outcomes |
|---|---|
| β Databases and SQL | |
| π² Basic Bayesian |
|
| π NLP: Text Analysis |
|
| π§βπ« ML: Supervised |
|
| β ML: Unsupervised | |
| π€ Deep Learning: RNN | |
| π― Targets |
|
Project
Please fill in your slot: HERE
The assessment for this part of the module takes the form of a semester project. Students are expected to craft their own research question using one of four unique data sets. The submission date for the project is the 20th of June at midnight. Details of the project will be provided at the start of the semester. To pass the module, a final mark of at least 50% has to be obtained. To obtain a distinction in this module, a minimum final mark of 75% is required.
β οΈ All projects need to use `R Projects` and {texevier} (OR {elsevier}) to knit to a final PDF in order to qualify for a mark greater than 50%. So make sure to practice this skill throughout! Please ask for help earlier than later.
π€ What about using AI? You are welcome to utilize this new technology, BUT be aware that writing style will still matter. All generative models are very verbose, so they are easy to spot (especially for someone like me who use them often). Make sure to use them as a tool to bounce ideas off of, but write your in your own words and understanding! I will be checking to see how much of the paper was AI generated.
π ProjectThe final project is a write-up of 2000 words (Think Economic Letters). The project must contain:
- Motivation and economic question
- A literature review section
- Exploratory data description and analysis
- Statistical Modeling
- Conclusion
- Wine Reviews
- Dataset of Vivino reviews for Calitzdorp, South Africa for the years 2014 to 2016.
- Data in your Database in table "wine"
- CLICK HERE FOR RDS FILE
- Job Ads
- Large dataset of variables that contain job ad data in South Africa over 4 years.
- CLICK HERE FOR ZIP FILE
- Global Macro Database
- This dataset complements the paper from MΓΌller, Xu, Lehbib, and Chen (2025), which introduces a panel dataset of 46 macroeconomic variables across 243 countries from historical records beginning in the year 1086 to projections through the year 2030.
- CLICK HERE FOR LINK TO DATA
- CBS Speeches
- The dataset features 35,487 unique speeches from 131 central banks, for the period going from the beginning of January 1986 (date of the first online speech by a central bank) to the end of December 2023.
- CLICK HERE FOR RDS FILE
