Seminars

Statistics seminars are held on Wednesdays 14:00 – 15:00. Everyone is welcome!  We gather for coffee/tea and biscuits around 15 minutes before the seminar begins.

The organiser is Ben Baer.  Please contact Ben to find out more about the seminars, to suggest a future seminar speaker, or to request joining seminars online.

Most of the seminars this year will be held in-person and few online. The in-person seminars will be held at the Observatory seminar room (except for the JJ Valletta memorial seminar, which is in the Mathematical Institute). Please see below for more details.

Forthcoming statistics seminars 2024-25

  •  
    • 5th November: Fiona Seaton, Centre for Ecology & Hydrology

Title: TBA 

Abstract: TBA

    • 12th November: Jere Koskela, Newcastle University

Title: TBA 

Abstract: TBA

    • 19th November: Alex Aylward, University of Oxford

Title: TBA 

Abstract: TBA

    • 26th November: Hannah Wauchope, University of Edinburgh

Title: TBA 

Abstract: TBA

    • 3rd December: Janine Illian, University of Glasgow

Title: TBA 

Abstract: TBA

Past seminars

This academic year

    • 10th September: Devin Johnson, NOAA

Title: A Computationally Flexible Approach to Population-Level Inference and Data Integration

Abstract: We propose a multistage method for making inference at all levels of a Bayesian hierarchical model (BHM) using natural data partitions to increase efficiency by allowing computations to take place in parallel form using software that is most appropriate for each data partition. The full hierarchical model is then approximated by the product of independent normal distributions for the data component of the model. In the second stage, the Bayesian maximum a posteriori (MAP) estimator is found by maximizing the approximated posterior density with respect to the parameters. If the parameters of the model can be represented as normally distributed random effects, then the second-stage optimization is equivalent to fitting a multivariate normal linear mixed model. We consider a third stage that updates the estimates of distinct parameters for each data partition based on the results of the second stage. The method is demonstrated with two ecological data sets and models, a generalized linear mixed effects model (GLMM) and an integrated population model (IPM). The multistage results were compared to estimates from models fit in single stages to the entire data set. In both cases, multistageresults were very similar to a full MCMC analysis.

    • 24th September: Cornelia Oedekoven, CREEM

Title: TBA

Abstract: TBA

    • 1st October: Dan Kowal, Cornell University

Title: Facilitating heterogeneous effect estimation via statistically efficient categorical modifiers 

Abstract: Categorical covariates such as race, sex, or group are ubiquitous in regression analysis. While main-only (or ANCOVA) linear models are predominant, linear models that include categorical-continuous or categorical-categorical interactions are increasingly important and allow heterogeneous, group-specific effects. However, with standard approaches, the addition of categorical interactions fundamentally alters the estimates and interpretations of the main effects, often inflates their standard errors, and introduces significant concerns about group (e.g., racial) biases. We advocate an alternative parametrization and estimation scheme using abundance-based constraints (ABCs). ABCs induce a model parametrization that is both interpretable and equitable. Crucially, we show that with ABCs, the addition of categorical interactions 1) leaves main effect estimates unchanged and 2) enhances their statistical power, under reasonable conditions. Thus, analysts can, and arguably should include categorical interactions in linear models to discover potential heterogeneous effects—without compromising estimation, inference, and interpretability for the main effects. Using simulated data, weverify these invariance properties for estimation and inference and showcase the capabilities of ABCs to increase statistical power. We apply these tools to study demographic heterogeneities among the effects of social and environmental factors on STEM educational outcomes for children in North Carolina. An R package lmabc is available.

    • 29th October: Karla Diaz Ordaz, University College London (JJ Valletta Memorial Lecture)

Title: From causal inference to machine learning and back: a two-way street towards better science 

Abstract: Machine learning methods have become established for prediction problems, but there is increasing interest in using these algorithms for causal inference. However, causal effect estimation often involves counterfactuals, and prediction tools from the machine learning literature cannot be used “out-of-the-box” for causal inference. 

At the same time, there is an increasing interest in using causal reasoning when building and interpreting machine learning algorithms. Doing so can help reduce unfairness and other algorithmic biases stemming from the training data not being representative of the target population. Causality can also help with interpretability and explainability of machine learning outputs.  

In this talk, I will review Causal Machine learning,  a framework to `de-bias’ standard machine learning algorithms so they perform well for causal tasks. I will also discuss the role causal inference can play in machine learning to improve fairness and explainability of so-called “black-box” models. 

 This two-way street opens the way to making better use of the data and obtaining reliable answers to real-life scientific problems, while maintaining good statistical principles.  

Previous academic years

Seminars from previous academic years (since 2022) are listed here.

Recent Posts

    Recent Comments

    No comments to show.