Your new experience awaits. Try the new design now and help us make it even better

OPINION article

Front. Neurol.

Sec. Artificial Intelligence in Neurology

Volume 16 - 2025 | doi: 10.3389/fneur.2025.1650968

This article is part of the Research TopicTechnology Developments and Clinical Applications of Artificial Intelligence in Neurodegenerative DiseasesView all 10 articles

Combined with multimodal medical imaging and artificial intelligence for early diagnosis of Parkinson's disease

Provisionally accepted
Sitong  LuSitong Lu1Shuang  GuiShuang Gui2*Chunyun  WangChunyun Wang3*Yu  Liu MMYu Liu MM4*
  • 1School of Medicine and Life Sciences,Chengdu University of Traditional Chinese Medicine, Sichuan, China
  • 2Department of radiology,Qianjiang Central Hospital of Chongqing,Chongqing, Chongqing, China
  • 3Medical Imaging Department, Beijing Anzhen Nanchong Hospital of Capital Medical University & Nanchong Central Hospital, Chongqing, China
  • 4Department of Radiology, Chongqing Public Health Medical Center, Chongqing, Chongqing, China

The final, formatted version of the article will be published soon.

Parkinson's disease (PD ), also known as "shaking palsy", is the second most common neurodegenerative disease in the world. Its main symptoms are divided into motor: static tremor, bradykinesia, postural balance disorder and myotonia [1]; and nonexercise: constipation, memory loss, frequent urination, urgent urination, depression, decreased sense of smell [2], etc. Epidemiological data show that both PD incidence rate and prevalence rate show a significant age-dependent growth trend [3]. Male incidence rate is 1.5 times higher than female . Although the average survival time of patients after diagnosis is long and most of them die due to complications, due to the significant difficulties in early diagnosis, most patients have irreversible loss of dopaminergic neurons in substantia nigra when they are diagnosed. Limited by the existing medical cognition and technical level, there is no radical cure for Parkinson's disease at present. Clinical treatment still takes symptomatic intervention and delaying disease progress as its core goal. With the acceleration of global human-aging process, the number of Parkinson's disease patients continues to rise, which brings severe challenges to medical resource allocation and long-term care system. Therefore, achieving early and precise diagnosis of Parkinson's disease is of great clinical significance for improving the cure rate of the disease, prolonging the survival period of patients and enhancing their quality of life.The current clinical diagnosis of Parkinson's disease mainly relies on typical symptom recognition, but this diagnosis method based on subjective symptom description of patients has significant limitations. The heterogeneity of symptoms between individuals can easily lead to missed diagnosis and misdiagnosis. In addition, the early symptoms of Parkinson's disease are hidden and some symptoms overlap with nervous system diseases such as multi-system atrophy and progressive supranuclear palsy, which further aggravates the difficulty of diagnosis. Although multimodal medical imaging techniques such as ultrasound CT、MRI have been applied to the diagnosis of Parkinson's disease, due to the lack of specific biology markers, the diagnosis based solely on conventional imaging features is still highly dependent on clinical experience. This diagnostic mode not only has insufficient sensitivity and poor stability, but also often has atypical imaging manifestations, which is difficult to meet the precise needs of early diagnosis. Finding an objective and accurate inspection method is a problem to be solved at present. In recent years, the cross-fusion of artificial intelligence and medical images has brought new breakthroughs for early diagnosis of PD. For example, rediomics can extract quantitative features such as texture and shape from medical images with highthroughput manner, which are invisible to the naked eye, and transform images into analyzable data matrices; artificial intelligence algorithms, especially deep learning models, can deeply mine these high-dimensional features to achieve accurate pattern recognition. As a high-resolution structural imaging technology, magnetic resonance imaging ( MRI ) can provide high-resolution three-dimensional images of patients' brains, which helps to enhance the interpretability of models. For example, the significance map of convolutional neural network ( CNN ) generated by Camacho [5] Parkinson's disease with the advantage of machine learning algorithm in big data integration analysis. Chougar, L. et al. [6]used MRI data to build a machine learning algorithm, and successfully distinguished Parkinson's disease ( PD ), progressive supranuclear palsy ( PSP ) and multi-system rhomboid contraction ( MSA ), especially in distinguishing PD from MSA. However, MRI misdiagnosis was also found during the study. Therefore, in the follow-up study, high-field MRI or dynamic enhanced MRI technology can be explored to further tap the potential of multimodal MRI combined with artificial intelligence in the specific diagnosis of Parkinson's disease. In addition, Ye, Q. et al. [7]used two different structural MRI sequences ( T2-FLAIR and T1WI ) to build an imaging model, and achieved AUC of 0.896 and 0.899 respectively, which confirmed the good clinical practicality of the model through decision curve analysis ( DCA ). Pahuja, G. et al. [8] focused on the optimization of model framework, and explored PD classification performance under different frameworks based on T1weighted MRI and single photon emission computed tomography ( SPECT ) images.Based on the above studies, it is not difficult to find that the clinical effect of Chougar , L [6] may not be satisfactory in the case of classification characteristics. However, when meeting the clinical effect, such as Ye , Q [7] , the data limitations are relatively large;As for the study by Camacho et al. [5], although it also has clinical limitations, thanks to the introduction of CNN, it is possible to determine high-precision markers for specific regions and enhance the interpretability of the model. Therefore, MRI, as a high-resolution structural imaging technique, can enhance the interpretability of the model, but there is a problem of heterogeneity in diagnostic criteria in retrospective studies; meanwhile, MRI technology can assist in the classification of different diseases and subtypes, but it is also limited by clinical application. With the development of imaging omics, multimodal MRI evaluation has become a clinical trend, and its fusion data combined with machine learning algorithm can effectively assist early diagnosis of Parkinson's disease. One of the core strategies for accurate diagnosis of early Parkinson's disease is to explore its characteristic differences from similar diseases. Magnetic resonance imaging ( MRI ) is often used to assist in the classification of Parkinson's disease subtypes due to its ability to image high-resolution anatomical structures. Positron emission tomography ( PET ) imaging technology can quantify the function of dopaminergic system and glucose metabolism pattern, and provide specific biology markers. The multimodal fusion of the two not only realizes the correlation analysis between function and structure, but also significantly improves the diagnostic accuracy, making PET-MRI technology an important tool to identify similar neurodegenerative diseases. Sun et al. [9]showed that the constructed multimodal model outperformed the single-modal model regardless of how PET and MRI data were combined and sequenced. The developed PET/MRI radiomics-clinical combined model achieved an area under the receiver operating characteristic curve (AUC) of 0.993, fully demonstrating the significant potential of this technology in the clinical differentiation of Parkinson's disease (PD) and multiple system atrophy (MSA). Another study focused on [ 18F ] FDG PET/MRI [10], by training an artificial intelligence model integrating metabolic, structural and functional information, confirmed that the comprehensive imaging model was significantly superior to the simple clinical diagnosis model in distinguishing PD from MSA, which mutually confirmed the conclusions of Sun et al. [9]. However, the study also points out that although automatic region of interest ( ROI ) segmentation improves efficiency, its accuracy is still not as good as manual labeling, and finding a more accurate automatic sketching algorithm is the key to achieve fully automated diagnosis. J. Silva-Rodriguez et al. [11] used [ 18F ] FDGPET/MRI technology to evaluate the effectiveness of structural MRI ( sMRI ) and diffusion magnetic resonance imaging ( dMRI ) assisted by machine learning algorithms in detecting mild cognitive impairment ( PD-MCI) ) and dementia ( PDD ) in Parkinson's disease, and found that dMRI has more advantages in revealing microstructural changes in early brain regions of patients. It is worth noting that neither of the first two studies included the gold standard of pathological results. Sun et al. [9]conducted a retrospective study, sample heterogeneity cannot be ruled out. HU, X et al. [10] also cannot guarantee the accuracy of automatic segmentation. While the third study [11]lacked control group and follow-up data, and only conducted a cross-sectional study.To sum up, in order to fully utilize the application value of the hybrid PET/MRI technology in the early diagnosis of Parkinson's disease, it is urgent to establish a largescale and long-term follow-up control database, providing solid data support for the feature extraction and optimization of artificial intelligence models. At present, it is believed that the main cause of Parkinson's disease is the loss of dopaminergic neurons in substantia nigra. For early Parkinson's patients, the new PET/CT technology has higher resolution on minor changes in small lesions, such as uptake in PD by PET/CT imaging technology, and found that DAT uptake was related to the decrease of glucose metabolism in brain region, while only some visual functions were significantly affected by DAT, and DAT uptake decreased in the order of PP AP and caudate nucleus, and then the average standard uptake ratio ( SUVR ) decreased, resulting in visuospatial cognitive dysfunction. However, this study is still retrospective, and the control of some variables and the universality of the results are not up to the expected level. We also learned that, as mentioned in the Wu, X. et al. [14]study, the 3D parameters of the new striatum of PD patients included in the study based on good consistency of quantitative parameters between 11C-CFT PET / CT planar and 3D images are more associated with disease progression than planar parameters, providing another potential evidence for future Parkinson's diagnosis. In the early diagnosis of Parkinson's disease, the multimodal imaging fusion task such as MRI, PET, CT can be systematically divided into four core steps. The first is the acquisition and preprocessing of raw data, converting the collected data into the DICOM standard format, verifying the integrity of metadata, and performing denoising processing for PET and CT data; using rotation translation, ANTs, etc. to achieve spatial registration, bridging resolution differences, and enhancing image contrast to highlight edge features. The second is the feature fusion stage, adopting multi-dimensional fusion strategies: directly superimposing MRI and PET images through image registration; combining MRI volume features with PET metabolic features and other different modal features; using deep learning algorithms to achieve more complex feature fusion.Based on the above content, further supplements are given. It is recommended that the selection parameters of the spatial registration threshold be set to rigid registration: the maximum mutual information error threshold was 0.3bits to balance accuracy and efficiency and nonlinear registration were adopted, and the original RMSE root mean square error threshold was less than 2.0mm [15]to meet the resolution requirements of substantia nigra nucleus in Parkinson's diagnosis 1 . In addition, it is recommended to use third-order B-spline interpolation to preserve image texture to help better identify specific markers. Based on existing literature [16,17]and theoretical analysis in this paper, we recommend using a deep learning-based U-Net architecture to process PET images, with input as 4D dynamic time frames, and the loss function employing Poisson and SSIM weighted (weight ratio of 0.7:0.3) to better preserve functional metabolic texture. Additionally, a multi-channel local attention module is developed for feature extraction from CT images. A five-fold stratified cross-validation strategy is adopted, along with the NAdamW optimizer configuration, to avoid training instability caused by differences in gradient magnitudes between modalities. Since clinical trials have not been carried out in this paper, the real parameters are expected to be supplemented and improved in real applications. The construction of an end-to-end intelligent early Parkinson's diagnosis model based on standardized multimodal image data can be promoted according to the following processes. First, feature extraction of multimodal data is carried out, and a combination of manual and deep learning is adopted. The CT ventricular volume and PET standard uptake values were extracted manually, and the global features of each mode were obtained by using the pre-training model. With the help of Python library, 3D Slicer, MONAI and other tools, various types of radiological features such as shape, texture and functional metabolism can be extracted. Secondly, the model architecture is designed, and 3D CNN is selected to process volume image based on its ability to stably maintain good accuracy [18], Transformer to process multimodal sequences and global features, and a multi-branch fusion network is constructed. By designing key modules, multi-level image fusion is realized to form an end-to-end network in which the original data can be input and the diagnostic results can be output. Then, the model training and optimization are carried out, and the continuous and discrete features are randomly enhanced. The loss function and optimizer are selected, and the model with the best performance in the verification set is retained under the scenario of multi-center data sharing. Finally, the model is verified and iterated, taking AUC, sensitivity and accuracy as evaluation indicators, visualizing key areas to improve interpretability, and regularly fine-tuning according to new case data to realize dynamic update of the model and ensure continuous optimization of diagnostic efficiency. The application of end-to-end intelligent diagnosis model in clinical auxiliary diagnosis of Parkinson's disease needs to be promoted from three aspects: visual integration, system docking and clinical verification. At the level of visual integration, by generating heat maps of MRI images, the key lesion areas such as iron deposition in the substantia nigra and density changes can be visually highlighted. After dimensionality reduction of multimodal image features, the distribution differences between healthy people and patients were displayed by scatter diagram, and the decision-making basis of the model was clearly presented. Moreover, by calculating the contribution values of each pattern to the final diagnosis and visual attention weights, the model's interpretability is enhanced to ensure the reliability of the diagnostic logic.For example, using GradientSHAP technology to generate heatmaps primarily focused on striatal dopamine transporter activity improves interpretability. The model shall be deeply integrated with the PACS system during system docking. On the one hand, it ensures that the model can directly analyze standard DICOM format data and unify metadata cleaning rules of different manufacturers. On the other hand, encrypted communication and patient anonymity are adopted, only necessary metadata such as age and gender are reserved, data access rights are strictly restricted to authorized medical care and development teams, and operation logs are recorded. In addition, a doctor feedback channel is established to collect cases of misdiagnosis and missed diagnosis in real time, and the model is continuously optimized through automatic sample recovery function and regular performance report of PACS system. In the clinical verification stage, case samples can be collected from several large hospitals to verify the sensitivity and accuracy of the model through independent input data; or carry out controlled trials with traditional diagnostic methods, and use statistical means to evaluate the clinical value of the model, so as to provide a solid basis for its clinical promotion.In fact, the aforementioned viewpoints still have deficiencies in clinical application, so we further improved the relevant mechanisms. First, retrospective data validation was conducted to check if the technical test imaging data can be seamlessly transmitted and to confirm its bidirectional synchronization with the RIS system.Second, prospective comparative studies were carried out, such as rolling deployment in three large hospitals each month, using real-time tracking of diagnostic decisionmaking paths through doctor group interviews. Finally, federated learning frameworks were utilized to achieve cross-institutional model optimization. Despite this, we still acknowledge the potential shortcomings of this deployment strategy.We believe that the future early Parkinson's clinical diagnosis model will realize the automation and intelligence of the whole process to build a closed-loop system from data input to decision output. This will help clinicians to efficiently use artificial intelligence models to accurately judge the condition in PACS environment, and promote the diagnosis and treatment of Parkinson's disease to a new height. At present, artificial intelligence models have shown great advantages in screening, diagnosis and treatment of early Parkinson's disease, but there are still inevitable shortcomings and challenges. Although artificial intelligence technology is widely used in the medical field at present, in order to convincingly prove the great potential of artificial intelligence in early Parkinson's diagnosis, a clearer and more visual data output interpretation system is needed, which is one of the difficulties faced by artificial intelligence at present. We can only effectively observe the input and final output results of data, how to get this result, and the unclear principle behind the result is like a " black box" [4], which also leads to people's difficulty in understanding the processing flow to a certain extent.Coupled with the small impact of changing the internal structure of the model on the algorithm performance, whether the artificial intelligence model can be widely used in the medical field depends on whether it can overcome the uninterpretability of the large algorithm model. In the early diagnosis of Parkinson's disease, standardized data acquisition and processing is the key foundation for building an efficient multi-modal artificial intelligence model. Different medical imaging technologies have significant differences in data acquisition stage. The imaging principles, equipment parameters and scanning protocols of CT, MRI, PET and SPECT are different. In addition, the operation specifications and equipment models of different medical institutions are different, resulting in the lack of uniform standards for original data in terms of format, resolution and gray value, which brings great challenges to subsequent data integration and model training. In the process, data standardization process is particularly important. First of all, the original data must be converted into DICOM standard format to ensure the standardization of data storage and transmission .At the same time, strict quality control shall be carried out to eliminate unqualified data with excessive noise and obvious artifacts. Before multimodal data fusion, spatial registration techniques, such as affine transformation and nonlinear registration based on ANTs, must be used to eliminate the spatial positional deviations between different modal images, and interpolation algorithms should be adopted to unify the resolution. In addition, the quantitative data of radioactive tracers for PET and SPECT shall be standardized to eliminate measurement errors caused by different equipment and batches of tracers. Only by establishing a perfect standardized data acquisition and processing system can the stability and diagnostic efficiency of multimodal models be effectively improved. As an emerging technology, artificial intelligence faces many ethical and legal challenges in the development process. Model training relies on the continuous updating of a large amount of data, but if the principle of informed consent is not strictly followed in data collection, it will easily lead to the risk of data abuse and leakage [19].At the same time, the lack of responsibility definition mechanism has become a prominent problem. When artificial intelligence technology generates benefits or causes damage, there is no clear legal basis for the division of responsibilities among developers, users and the system itself. In addition, at present, the regulation of artificial intelligence technology by international laws and regulations lags behind, and a perfect data tracing and supervision system has not yet been established. Under the traditional legal framework, the restraint mechanism for non-human subjects is almost blank, which leads to a large number of regulatory blind spots in artificial intelligence applications. These problems make the large-scale promotion of artificial intelligence in the medical and clinical field face great obstacles, and it is necessary to build a safe and compliant technology application environment through legal system innovation and ethical norms improvement. Unfortunately, the model proposed in this study is a conceptual framework and not yet be carried out in clinical trials, so the specific clinical application of the model is not clear. Future work will be dedicated to filling the gap between theoretical development and clinical application. The continuous iteration of artificial intelligence technology has opened up a new path for early diagnosis of Parkinson's disease and brought hope for optimizing treatment schemes. Future research can deeply integrate artificial intelligence and multimodal medical imaging technology to build an end-to-end automatic early screening system. The system will realize seamless connection with the medical system and achieve intelligent visualization of the whole process from data collection to diagnosis decision. Through this efficient and convenient diagnosis mode, it is expected to promote the early diagnosis and treatment of Parkinson's disease, reduce the proportion of patients in the middle and advanced stage, and provide a new direction for overcoming the " incurable" problem of Parkinson's disease.

Keywords: Parkinson's disease, early diagnosis, Multimodal medical imaging, artificial intelligence, MRI

Received: 20 Jun 2025; Accepted: 07 Aug 2025.

Copyright: © 2025 Lu, Gui, Wang and MM. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Shuang Gui, Department of radiology,Qianjiang Central Hospital of Chongqing,Chongqing, Chongqing, China
Chunyun Wang, Medical Imaging Department, Beijing Anzhen Nanchong Hospital of Capital Medical University & Nanchong Central Hospital, Chongqing, China
Yu Liu MM, Department of Radiology, Chongqing Public Health Medical Center, Chongqing, Chongqing, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.