Machine learning regionalisation of input data for microsimulation models: An application of a hybrid GBM/IPF method to build a tax-benefit model for the Essex region in the UK

23 June 2026 (Online)

Machine learning regionalisation of input data for microsimulation models: An application of a hybrid GBM/IPF method to build a tax-benefit model for the Essex region in the UK

Speaker Dr Rejoice M Frimpong (University of Essex) presents on Machine learning regionalisation of input data for microsimulation models: An application of a hybrid GBM/IPF method to build a tax-benefit model for the Essex region in the UK. This seminar is part of the Microsimulation Seminar Series for young researchers.

Details

  • Title: Machine learning regionalisation of input data for microsimulation models: An application of a hybrid GBM/IPF method to build a tax-benefit model for the Essex region in the UK
  • Presenter: Speaker Dr Rejoice M Frimpong
  • Affiliation: University of Essex
  • Date: 23 June 2026, 10:00 CET
  • Location: Online

Registration

Abstract

Development of microsimulation models often requires reweighting some input dataset to reflect the characteristics of a different population of interest. In this paper we explore a machine learning approach where a variant of decision trees (Gradient Boosted Machine) is used to replicate the joint distribution of target variables observed in a large commercially available but slightly biased dataset, with an additional raking step to remove the bias and ensure consistency of relevant marginal distributions with official statistics. The method is applied to build a regional variant of UKMOD, an open-source static tax-benefit model for the UK belonging to the EUROMOD family, with an application to the Greater Essex region in the UK.