Conference Agenda

Session

Infectious disease and longitudinal modelling

Time:

Wednesday, 27/Aug/2025:

9:00am - 10:30am

Session Chair: Hein Putter

Location: ETH E23

D-BSSE, ETH, 84 seats

Presentations

35-1 Infectious disease: 1

Flexible parametric additive hazards regression for modeling excess mortality in pandemics

Liesbeth C de Wreede¹, Marina T Dietrich², Ilaria Prosepe¹, Hein Putter¹, Mar Rodriguez-Girondo¹

¹LUMC (the Netherlands); ²University of Augsburg (Germany)

Effective decision-making during a pandemic requires balancing hospital capacity, protecting vulnerable groups, and minimizing societal burden. To make optimal policy decisions, it is crucial to quantify the impact of the pandemic on (excess) mortality and to assess the effects of public health interventions across various groups, accounting for infection and vaccination dynamics. However, a major challenge in the analysis of pandemic data is the absence of reliable cause-of-death information, which complicates the assessment of the pandemic’s specific impact.

Relative survival techniques are often used to assess excess mortality in a specific patient population by splitting observed mortality into background and excess mortality. These methods have been widely used to estimate cancer-specific mortality without the need for precise cause-of-death data. However, applying these techniques to estimate excess mortality in pandemic settings presents special challenges: standard relative survival methods assume (1) excess hazards are always positive, (2) background mortality is specified externally rather than derived from the data and based solely on demographic factors, and (3) a single time scale suffices. Pandemics, however, can involve negative excess hazards due to the protective effects of public health measures, large variations in background mortality across groups, and multiple time scales, such as time since vaccination and infection, that impact mortality risk beyond the main calendar time scale. Moreover, unlike in settings where the study population is a small subset of the reference population, in pandemic settings the background and study populations refer to the same group observed at different time points (pre-pandemic vs. pandemic).

To address these challenges, we propose a novel flexible parametric additive hazards model for pre-pandemic and pandemic data, using B-splines to estimate baseline hazards and time-dependent covariate effects. This new perspective on excess mortality estimation through a single additive hazards model accommodates negative excess hazards and integrates relevant risk factors into background mortality, equal to pre-pandemic mortality and estimated from the data. Moreover, this approach can handle multiple time scales and is less prone to overfitting than the classical non-parametric Aalen’s method. We investigated two estimation procedures: one based on the least-squares approach used for Aalen’s method, and the other exploiting the equivalence between additive hazards models with piecewise constant hazards and Poisson models with an identity link. The performance of the approach is demonstrated through a simulation study based on the COVID-19 pandemic and scenarios mimicking plausible pandemic conditions.

35-1 Infectious disease: 2

Investigating the Association between Risky Sexual Behaviors and HIV Risk Using Multivariate Joint Models

Nobuhle Nokubonga Mchunu¹, Henry Mwambi², Tarylee Reddy¹, Nonhlanhla Yende-Zuma¹, Dimitris Rizopoulos³

¹Biostatistics Research Unit, South African Medical Research Council, Durban, South Africa; ²University of KwaZulu-Natal, School of Mathematics, Statistics and Computer Science, Pietermaritzburg, South Africa; ³Department of Biostatistics, Erasmus University Medical Center, Rotterdam, The Netherlands

Background
HIV remains a major public health challenge in South Africa, particularly among young women and adolescents. Research indicates that sexual health behaviors—including contraceptive use, sexually transmitted infections (STIs), and condom use—do not operate independently but interact in complex ways that influence HIV risk. For example, inconsistent condom use, high STI prevalence, and certain contraceptive methods may increase susceptibility to HIV acquisition. However, traditional statistical approaches often fail to capture these interdependencies over time. Multivariate joint modeling provides an advanced analytical framework for examining these associations, offering a more comprehensive understanding of how sexual behaviors contribute to HIV transmission risk.

Methods
Inspired by the CAP004 clinical trial conducted by CAPRISA which enrolled sexually active, HIV-uninfected women aged 18 to 40 years in South Africa, this study employs multivariate joint modeling to analyze longitudinal data on sexual behaviors (contraceptive use, STIs, and condom use) alongside time-to-event outcomes (HIV infection and pregnancy, used as a proxy for HIV risk). By simultaneously accounting for correlated processes, this approach enables more accurate estimation of how these behaviors evolve and influence HIV acquisition over time. Data will be drawn from a cohort of individuals at high risk for HIV, with repeated measures of sexual health indicators and HIV outcomes. This modeling framework allows for the identification of trends and causal pathways that conventional regression techniques may overlook.

Results
We anticipate that inconsistent condom use, STI presence, and specific contraceptive methods—such as depot medroxyprogesterone acetate—will be associated with an increased risk of HIV acquisition. Multivariate joint modeling is expected to provide stronger evidence of how these factors interact over time, uncovering key risk patterns. The results will offer deeper insights into the longitudinal dynamics of sexual health behaviors and their contribution to HIV transmission, demonstrating the advantages of this modeling approach over traditional methods.

Conclusion
While the application of joint models in South African HIV research remains limited, studies in other contexts have shown their potential to uncover critical risk factor interactions. This study will contribute novel insights into the complex interplay of sexual health behaviors and HIV risk in South Africa, informing the development of more effective, evidence-based HIV prevention strategies tailored to high-risk populations.

35-1 Infectious disease: 3

Revealing Platelet Aggregation Dynamics: A Functional Data Analysis Approach Using Penalized Splines and Linear Mixed Models.

Souvik Kumar Bandyopadhyay

Cytel, India

The study of platelet aggregation kinetics often requires methods that can capture the full complexity of dynamic processes. Traditional approaches frequently rely on summary statistics, which can obscure critical information embedded within the temporal evolution of the data. This work presents a powerful alternative: a Functional Data Analysis (FDA) approach that models aggregation curves using penalized splines within a Linear Mixed Model (LMM) framework.

Our method leverages the flexibility of Truncated Polynomial Splines (TPS) to represent complex curve shapes. TPS combines polynomial terms and truncated power functions, enabling smooth curve estimation while penalizing roughness through the LMM structure. By treating polynomial coefficients as fixed effects and truncated terms as random effects, we can utilize Best Linear Unbiased Prediction (BLUP) for robust estimation and analytical derivation of derivatives.

A key innovation of this approach lies in the estimation and interpretation of derivatives, such as velocity and acceleration, which provide critical insights into the underlying kinetics of the system. Unlike traditional methods that focus on static endpoints, these derivatives allow us to model the dynamic system using differential equations, revealing phenomena such as bistability and phase transitions.

We demonstrate the utility of this method through an application to platelet aggregation kinetics, where we analyze the effects of ADP and gold nanoparticles on aggregation profiles. This application showcases how the method can capture the nuances of complex biological interactions, revealing the interplay between ADP concentration, purinergic receptor activation, and nanoparticle-induced effects. Specifically, the method allowed for discerning the underlying kinetics, as well as to study the effects of ADP dosage and perturbation with gold nanoparticles.

This FDA approach, implemented using R with the nlme package, offers significant advantages over traditional methods. By capturing full temporal profiles and identifying metastable states, it provides a more comprehensive understanding of platelet aggregation dynamics. However, it's crucial to address the computational challenges associated with high-dimensional random effects and the sensitivity of derivatives to noise.

This method's generality makes it applicable to other temporal biomedical datasets requiring kinetic analysis. By providing a flexible and statistically rigorous framework for modeling dynamic processes, this work contributes to a deeper understanding of complex biological systems.

35-1 Infectious disease: 4

Source Data Mapping Approach to CDMV5.4: Innovations in Longitudinal Data Integration for Machine Learning Application

Bylhah Mugotitsa^1,2, Michael Ochola¹, Pauline Andeso¹, David Amadi³, Reinpeter Momanyi¹, Evans Omondi¹, Jim Todd⁴, Agnes Kiragga¹

¹African Population and Health Research Center, Kenya; ²Strathmore Business School, Strathmore University, Nairobi, Kenya.; ³Department of Population Health, London School of Hygiene and Tropical Medicine, London, United Kingdom; ⁴Department of Epidemiology, Catholic University for Health and Allied Sciences Mwanza, Tanzania.

Longitudinal studies offer critical insights into health conditions but face challenges in integrating survey and psychometric data into standardized frameworks like the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). With the cancellation of the survey_conduct table in CDM 6.0, this study proposes a novel methodology to map standardized instruments such as PHQ-9 and GAD-7 into the OMOP CDM, addressing critical gaps in the model.

The methodology leverages a structured process for data integration, starting with the creation of measurements using vocabularies like LOINC and SNOMED. Instrument scores and individual panel items are mapped as observations, with relationships like “Has Answer” ensuring consistency. Tying measurements to visit occurrences through observation_event_id maintains data integrity across longitudinal encounters. Conditions such as "anxiety disorder" are identified and mapped using SNOMED concepts, linked back to corresponding measurements. Insights from OHDSI forums informed iterative refinements, ensuring compatibility with existing vocabularies.

Results show that the methodology effectively integrates psychometric instruments into the OMOP CDM, enabling advanced analysis of mental health outcomes. By addressing the absence of survey_conduct capabilities, the framework facilitates machine learning applications, such as predictive modeling and clustering of longitudinal data, while preserving the semantic integrity of health data.

This work provides a scalable solution for mapping survey data into the OMOP CDM, bridging a key gap in longitudinal data management and advancing global health research. The framework enhances data standardization and usability, contributing to evidence-based policy-making and interventions. Future efforts will extend this methodology to emerging instruments and vocabularies, supporting the evolution of data science frameworks in health research.