Article Text

Download PDFPDF

P15 Latent class regression modelling: a novel approach to predict survival in patients with chronic heart failure (CHF)
Free
  1. JL Mbotwa1,2,3,
  2. M de Kamps4,
  3. PD Baxter1,2,
  4. R Cubbon2,
  5. MS Gilthorpe1,2
  1. 1Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
  2. 2School of Medicine, University of Leeds, Leeds, UK
  3. 3Department of Applied Department, Malawi University of Science and Technology, Malawi
  4. 4Institute of Artificial and Biological Intelligence, School of Computing, University of Leeds, Leeds, UK

Abstract

Background Chronic Heart Failure (CHF) is one of the leading cause of hospitalizations and deaths, more especially in old people, and this causes a substantial clinical and economic burden to the government. Using risk prediction models to accurately understand the dynamics of survival patterns amongst patients with CHF conditions would provide guidance to health care professionals in decision making on how to improve delivery of care. However, prediction models used in medical research often fail to accurately predict health outcomes due to methodological limitations. These models particularly perform poorly when predicting narrowly targeted subgroups of patients. We explore the role of latent class regression (LCR) analysis to model the survival of patients with CHF. We seek to show that using LCR improves the modelling of health outcomes as it accounts for unobserved heterogeneity that exists naturally within the patient data.

Methods LCR generally involves identifying hidden latent classes within data and uses patient’s demographic characteristics and other covariates to predict class membership and separate regression models for each class. These latent classes may correspond to subgroups of patients with specific characteristics that affect their survival. The rationale is that one class will be more susceptible to deaths compared to another. The United Kingdom Heart Failure Evaluation and Assessment of Risk Trial (UK-HEART) recruited patients with signs and symptoms of CHF between July 2006 and December 2014. A total of 1802 records were available on patient characteristics as well as medications. We used some of these variables to model survival of patients within a latent class framework by estimating a single regression model for both latent classes. We increased complexity of our model by allowing each class to have a separate survival model.

Results We used the area under the receiver operating characteristic (ROC) curve to assess the performance of these two class models. Overall, our novel approach performed better than the traditional one-model-fits-all approach. Our model gave an area under the curve (AUC) of 0.87 while the traditional model yielded an AUC of 0.68.

Conclusion Ignoring the natural heterogeneity that exists within the patient data affects the accuracy of estimates in prediction models. Researchers can utilise the available data to identify hidden latent classes within the data. Fitting a regression model to each latent class improves the accuracy of the prediction estimates.

  • prediction
  • big data
  • methods

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.