
- 4 days
- Online via Teams
Join us for the 2025 Data Science Winter School, a dynamic, four-day online training series designed to equip researchers, data analysts, and students with the skills to manage, visualise, and analyse data effectively using Stata.
How It Works
Flexible Learning for All Levels
The Winter School is made up of three distinct courses. Participants can choose to attend the entire series or select the specific course(s) most relevant to their research or career goals:
- Course 1: An Introduction to working in Stata - Master data processing and management fundamentals in Stata
- Course 2: Stata and Python Integration - Unlock new capabilities by learning how to integrate Python with Stata for advanced data analysis
- Course 3: Introduction to Machine Learning with Stata - Dive into machine learning techniques and generative AI applications for data-driven economic and policy decisions
Expert Instruction and Practical Skills
Courses are led by experienced instructors with a focus on:
- Good research practices and efficient workflows.
- Hands-on learning using real-world examples from medical statistics.
- Emphasis on reproducibility, effective data management, and clear communication of results.
What You’ll Gain
- Practical experience through worked examples, take-home materials, and Q&A sessions.
- Skills that are transferable across disciplines.
Who Should Attend?
Whether you're looking to start your journey with Stata or sharpen your analytical toolkit, the 2025 Data Science Winter School offers an engaging and practical way to advance your data skills this winter.
Register now to secure your place and take the next step in your data journey!
Agenda
Course 1: 8 December 2025
An Introduction to Stata for Exploratory Analysis and Essential Data Management
Date: 8 December 2025
Delivered by: Tim Collier, LSHTM
Prerequisites: none
This one-day introductory course is for people interested in using Stata effectively in their research. This course requires no prior knowledge of Stata but assumes an interest in research and in learning to use Stata efficiently.
Throughout the course there will be an emphasis on good practice for research, understanding workflow, good data management, efficiency, and reproducibility of results.
This course introduces the Stata working environment and explains and compares the two main ways of working in Stata. We then introduce some essential tools for data management – getting data into Stata from Excel, merging datasets and creating new variables. The afternoon sessions focus on essential tools for descriptive statistics, common hypothesis tests, and a quick introduction to regression modelling (linear and logistic regression).
The examples used throughout will be from the field of medical statistics. However, the underlying principles will have application across all areas of research. Participants will be able to take away a comprehensive set of course notes and data used on the course as well as files created throughout the day. There will be a Q&A session at the end of the course, but also opportunities to ask questions throughout the day.
Course Outline
We will cover:
- Introduction to the key Stata windows
- Working interactively in Stata via the Graphical User Interface
- Exploratory data analysis: getting to know your data and identifying errors using statistical and graphical summaries
- Saving commands and results
- Essential data management tools: importing and saving data, generating new variables, correcting errors, keeping data tidy
Learning Objectives
By the end of this course you will:
- be familiar with the main Stata windows;
- understand how to work in Stata via the Graphical User Interface and do-files and the advantages and disadvantages of the two approaches;
- be able to use a small toolkit of commands to carry out exploratory data analyses;
- be able to create and save Stata datasets;
- be able to create new variables and correct errors;
- understand how Stata handles data and the importance of good practice for data management;
- know how to use Stata’s online help facilities so that you will be able to continue learning beyond the course.
Course 2: 9 December 2025
Stata and Python Integration
Date: 9 December 2025
Delivered by: Thomas Pical, Equancy
Prerequisites: none
This course introduces the basics of the Stata and Python languages and how to use them together.
Python is a widely used and powerful programming language. Many free libraries are available for fields including statistics, econometrics, web scraping, and machine learning.
Python can be called from a running Stata session so that the extended functionality of the Python language can be exploited from within Stata. We call this Python integration, which was introduced in Stata 16. With this integration you can embed and run Python code interactively or in do-files and ado-files.
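As a rough illustration of the idea (a sketch, not taken from the course materials), the script below could be saved under a hypothetical name such as `summarise.py` and run from Stata with `python script summarise.py`. It assumes Stata 16 or later and a dataset in memory with a numeric variable called `price`; the variable name, the scalar name `py_mean`, and the helper `summarise` are all made up for this example. The `sfi` module is only importable inside Stata, so the script falls back to sample values when run as ordinary Python.

```python
import math
from statistics import mean

try:
    # sfi (Stata Function Interface) only exists inside a Stata session.
    from sfi import Data, Scalar
    IN_STATA = True
except ImportError:
    IN_STATA = False


def summarise(values):
    """Return the count and mean of the non-missing values."""
    clean = [
        v for v in values
        if v is not None and not (isinstance(v, float) and math.isnan(v))
    ]
    return {"n": len(clean), "mean": mean(clean) if clean else None}


if IN_STATA:
    # Assumption: a dataset with a numeric variable `price` is in memory.
    # Stata missing values are mapped to NaN so they can be filtered out.
    prices = Data.get("price", missingval=float("nan"))
    result = summarise(prices)
    if result["mean"] is not None:
        # Pass the result back to Stata as a scalar (hypothetical name).
        Scalar.setValue("py_mean", result["mean"])
else:
    # Outside Stata, use a few sample values instead.
    result = summarise([10.0, None, 14.0])

print(result)
```

After running the script inside Stata, `display py_mean` would show the value computed in Python, which is one simple way to move results between the two languages.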
Course Outline
We will cover:
- Introduction to Python and a reminder of the basics
- Various ways to call Python from Stata, store information and load variables into Python
- Calling Stata from a Jupyter Notebook
Course 3: 10 - 11 December 2025
Introduction to Machine Learning with Stata
Date: 10-11 December 2025
Delivered by: Sebastian Laurent, Lancaster University
Prerequisites: Some familiarity with Stata is desirable
The aim of this two-day course is to introduce participants to machine learning, a relatively new approach to data analytics at the intersection between statistics, computer science, and artificial intelligence.
Students will learn the theory and techniques for turning information into knowledge and value by 'letting the data speak'. The teaching approach relies on graphical language and intuition more than on algebra. The course uses both instructional and real-world examples, balancing theory with practical sessions in Stata.
Learning Objectives
By the end of this course you will have knowledge and understanding of:
- Implementing and optimising machine learning approaches
- Assessing model performance
- Selecting key features
- Using standard machine learning libraries
Day 2:
Session 1
- Comparing estimators for static panel models for your research question
- Testing for serial correlation
Session 2: Dynamic Panel Models
- The Arellano-Bond estimator and post-estimation diagnostic tests
- The Blundell-Bond estimator and post-estimation diagnostic tests
- Case study: the determinants of bank risk-taking in European banks.
Prerequisites
Course 1:
- A Gentle Introduction to Stata, Fifth Edition - Alan C. Acock
- An Introduction to Stata for Health Researchers, Fourth Edition - Morten Frydenberg, Svend Juul
Course Timetable
Terms
- Additional discounts are available for multiple registrations.
- Delegates are provided with temporary licences for the principal software package(s) used in the delivery of the course. It is essential that these temporary training licences are installed on your computer prior to the start of the course.
- Payment of course fees is required prior to the course start date.
Cancellations
- 100% fee returned for cancellations made more than 28 calendar days prior to the start of the course.
- 50% fee returned for cancellations made between 14 and 28 calendar days prior to the start of the course.
- No fee returned for cancellations made less than 14 calendar days prior to the start of the course.
The number of attendees is restricted. Please register early to guarantee your place.