Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Appl. Math. Stat.

Sec. Statistics and Probability

Measuring the performance of LPA, LCGA, LGCM, and GMM in identifying the homogenous subgroups (Latent Classes) within the wider heterogeneous population of patients on DTG

Provisionally accepted
  • Malawi University of Business and Applied Sciences (MUBAS), Blantyre, Malawi

The final, formatted version of the article will be published soon.

Background Identifying heterogeneity in longitudinal data is critical for understanding diverse trajectories in clinical and epidemiological research. Traditional analytical methods often fail to distinguish latent subpopulations. More advanced statistical models such as Latent Profile Analysis (LPA), Latent Class Growth Analysis (LCGA), Latent Growth Curve Modeling (LGCM), and Growth Mixture Modeling (GMM) provide a data-driven approach to uncovering the distinct patterns. This study evaluated the performance of these models in classifying longitudinal weight gain trajectories. Methods A retrospective longitudinal dataset of 3,525 HIV positive individuals on DTG based regimen with repeated weight measurements over 24 months was analysed. Models were implemented using a stepwise approach: (1) LPA was applied to identify latent subgroups based on weight gain patterns without incorporating time, (2) LCGA and LGCM modelled individual trajectories assuming class-invariant and class-specific variances, respectively, and (3) GMM incorporated within-class variability to allow flexible trajectory shapes. Model performance was assessed using Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), Deviance Statistics, and log-likelihood. Average Posterior Probability (AvePP) was used to evaluate classification certainty by measuring the mean probability of individuals being correctly classified into their assigned latent class. Clinical interpretability was also considered to assess real-world applicability. Results LCGA demonstrated the best model fit, with the lowest AIC (42,239.43) and BIC (42,301.1) and the highest log-likelihood (-21,109.71), identifying three distinct weight gain trajectories in the process. Although GMM captured greater within-class variability, LCGA demonstrated superior fit statistics, with the lowest AIC (42,239.43) and BIC (42,301.1) and the highest log-likelihood (- 21,109.71), identifying three distinct trajectories. Conclusion LCGA and GMM were the most effective models for identifying distinct latent trajectories, with LCGA demonstrating the best overall fit for our data. These findings emphasize the importance of appropriate model selection in longitudinal data analysis, as different approaches yield varying capacities to detect meaningful subpopulations. Selecting an optimal model is essential for improving trajectory classification and supporting evidence-based decision-making in clinical and epidemiological research.

Keywords: Latent class growth analysis, Growth mixture model, Model performance, trajectory analysis, longitudinal data, Statistical model comparison

Received: 11 Jul 2025; Accepted: 12 Nov 2025.

Copyright: © 2025 Nkhoma, Mulaga, Kumwenda and Kamndaya. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Harrid Bashir Nkhoma, phdaps23-hbnkhoma@mubas.ac.mw

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.