AUTHOR=Dargazanli Cyril , Zub Emma , Deverdun Jeremy , Decourcelle Mathilde , de Bock Frédéric , Labreuche Julien , Lefèvre Pierre-Henri , Gascou Grégory , Derraz Imad , Riquelme Bareiro Carlos , Cagnazzo Federico , Bonafé Alain , Marin Philippe , Costalat Vincent , Marchi Nicola TITLE=Machine Learning Analysis of the Cerebrovascular Thrombi Proteome in Human Ischemic Stroke: An Exploratory Study JOURNAL=Frontiers in Neurology VOLUME=Volume 11 - 2020 YEAR=2020 URL=https://www.frontiersin.org/journals/neurology/articles/10.3389/fneur.2020.575376 DOI=10.3389/fneur.2020.575376 ISSN=1664-2295 ABSTRACT=Objective: Mechanical retrieval of thrombotic material from acute ischemic stroke subjects provides a unique entry point for translational research investigations. Here, we resolved the proteomes of cardioembolic and atherothrombotic cerebrovascular human thrombi and applied an articial intelligence routine to analyze potential protein signatures between the two selected groups. Methods: We collected n=32 cardioembolic and n=28 atherothrombotic diagnosed thrombi from patients suffering from acute stroke and treated by mechanical thrombectomy. Thrombi proteins were successfully separated by gel-electrophoresis. For each thrombi, peptide samples were analyzed by nano-flow liquid chromatography coupled to tandem mass spectrometry (nano-LC-MS/MS) to obtain specific proteomes. Relative protein quantification was performed using a label-free LFQ algorithm and all dataset were analyzed using a support-vector-machine (SVM) learning method. Clinical data were also analysed using SVM, alone or in combination with the proteomes. Results: A total of 2,455 proteins were identified by nano-LC-MS/MS in the samples analyzed, with 438 proteins commonly detected in all samples. SVM analysis of LFQ proteomic data delivered combinations of three proteins achieving a maximum of 88.3% for correct classification of the cardioembolic and atherothrombotic samples in our cohort. The coagulation factor XIII appeared in all of the SVM protein trios, associating with cardioembolic thrombi. A combined SVM analysis of the LFQ proteome and clinical data did not deliver a better discriminatory score as compared to the proteome only. Conclusion: Our results advance the characterization of the human cerebrovascular thrombi proteome. The exploratory SVM analysis identified sets of proteins to categorize our cohort cardioembolic and atherothrombotic samples. The integrated analysis here used could be further developed to better understand stroke origin and pathophysiology.