IRIS UniCatt

Machine learning approaches to enhance diagnosis and staging of patients with MASLD using routinely available clinical information

McTeer, Matthew; Applegate, Douglas; Mesenbrink, Peter; Ratziu, Vlad; Schattenberg, Jörn M; Bugianesi, Elisabetta; Geier, Andreas; Romero Gomez, Manuel; Dufour, Jean-Francois; Ekstedt, Mattias; Francque, Sven; Yki-Jarvinen, Hannele; Allison, Michael; Valenti, Luca; Miele, Luca; Pavlides, Michael; Cobbold, Jeremy; Papatheodoridis, Georgios; Holleboom, Adriaan G; Tiniakos, Dina; Brass, Clifford; Anstee, Quentin M; Missier, Paolo

2024

Abstract

Aims: Outcomes of metabolic dysfunction-associated steatotic liver disease (MASLD), such as metabolic dysfunction-associated steatohepatitis (MASH), fibrosis and cirrhosis, are ordinarily determined by resource-intensive and invasive biopsies. We aim to show that routine clinical tests offer sufficient information to predict these endpoints.

Methods: Using the LITMUS Metacohort derived from the European NAFLD Registry, the largest MASLD dataset in Europe, we create three combinations of features that vary in how difficult they are to procure, including a 19-variable feature set that can be obtained through a routine clinical appointment or blood test. These data were used to train predictive models with the supervised machine learning (ML) algorithm XGBoost, alongside the missing-data imputation technique MICE and the class-balancing algorithm SMOTE. Shapley Additive exPlanations (SHAP) were added to determine the relative importance of each clinical variable.

Results: Analysing nine biopsy-derived MASLD outcomes, with cohort sizes ranging between 5385 and 6673 subjects, we were able to predict outcomes with training set AUCs ranging from 0.719 to 0.994, including classifying individuals with At-Risk MASH at an AUC of 0.899. Using two further feature combinations of 26 and 35 variables, which included composite scores known to be good indicators for MASLD endpoints and advanced specialist tests, we found that predictive performance did not sufficiently improve. We are also able to present local and global explanations for each ML model, offering clinicians interpretability without sacrificing predictive performance.

Conclusions: This study developed a series of ML models, with accuracy ranging from 71.9% to 99.4%, that use only easily extractable and readily available information to predict MASLD outcomes that are usually determined through highly invasive means.
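The results in the abstract are reported as AUC (area under the ROC curve) values. As a minimal sketch of what that metric measures, the Python below computes AUC as the fraction of positive/negative pairs in which the positive case receives the higher score, and compares it with thresholded accuracy. The function names, the 0.5 threshold and the toy labels and scores are hypothetical illustrations; they are not data or code from the study.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def accuracy(labels: Sequence[int], scores: Sequence[float], threshold: float = 0.5) -> float:
    """Fraction of cases classified correctly when scores are cut at `threshold`."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(int(p == y) for p, y in zip(predictions, labels))
    return correct / len(labels)


# Hypothetical toy data: 1 = outcome present on biopsy, 0 = absent.
toy_labels = [0, 0, 0, 0, 1, 1, 0, 1]
toy_scores = [0.10, 0.20, 0.30, 0.35, 0.40, 0.45, 0.60, 0.90]

if __name__ == "__main__":
    print(f"AUC      = {roc_auc(toy_labels, toy_scores):.3f}")
    print(f"accuracy = {accuracy(toy_labels, toy_scores):.3f}")
```

AUC depends only on how the cases are ranked, whereas accuracy also depends on the chosen threshold, so the two numbers generally differ on the same predictions.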

Publication year: 2024
Content language: English
Journal name: PLOS ONE
DOI: https://dx.doi.org/10.1371/journal.pone.0299487
Citation: McTeer, M., Applegate, D., Mesenbrink, P., Ratziu, V., Schattenberg, J. M., Bugianesi, E., Geier, A., Romero Gomez, M., Dufour, J., Ekstedt, M., Francque, S., Yki-Jarvinen, H., Allison, M., Valenti, L., Miele, L., Pavlides, M., Cobbold, J., Papatheodoridis, G., Holleboom, A. G., Tiniakos, D., Brass, C., Anstee, Q. M., Missier, P., Machine learning approaches to enhance diagnosis and staging of patients with MASLD using routinely available clinical information, PLOS ONE, 2024; 19 (2): N/A-N/A. [doi:10.1371/journal.pone.0299487] [https://hdl.handle.net/10807/273391]
Appears in types: Journal article, Case note

Files in this item:

File	Size	Format
machine.pdf (open access; file type: publisher's version (PDF); license: Creative Commons)	1.43 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10807/273391
