- 1Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, United States
- 2Harvard Stem Cell Institute, Cambridge, MA, United States
- 3Broad Institute of MIT and Harvard, Cambridge, MA, United States
- 4Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, United States
Introduction: Spinal muscular atrophy (SMA) is a genetic disease characterized by degeneration of spinal cord motor neurons and neuromuscular junctions. Despite recent developments in therapies for SMA, treatment efficacy largely relies on the administration of drugs early in disease progression and is impacted by underlying patient genetics. Drug discovery for other diseases of the central nervous system (CNS) has also been hindered by heterogeneity in patient genetics and clinical presentations, as well as the need for early intervention.
Methods: To address these hurdles, we utilized a chemical-genetic-based screening approach to adapt the Connectivity Map (CMAP)/L1000 platform to study SMA. To do this, we differentiated moderate and severe SMA patient-specific induced pluripotent stem cells into neuronal cells utilizing a forward programming differentiation protocol, exposed each to 360 neuroactive or CNS disease-related compounds, and interrogated resulting changes in expression of >400 neural genes in a platform we term CMAPneuro.
Results: In doing so, we generated 4,559 transcriptional profiles identifying stimuli that modulate gene expression differences across SMA neurons. Finally, we make these data queryable, allowing the research community to (1) identify CNS disease-related perturbagens that mimic or reverse differentially expressed genes, or (2) explore the transcriptional response of a given perturbation in diverse SMA neuronal cells.
Discussion: Taken together, CMAPneuro represents a novel tool to identify candidate stimuli for follow-up investigation into the biology of SMA and related disorders.
Introduction
Spinal muscular atrophy (SMA) is a degenerative motor neuron disease caused by a deficiency of survival of motor neuron (SMN) protein resulting from mutations in the SMN1 gene (Crawford and Pardo, 1996). In patients, disease severity is mainly, though not exclusively, determined by the copy number of the truncated, less functional SMN2 gene, with clinical features classified by the age of detectable changes in motor symptoms and maximum motor function achieved (Crawford and Pardo, 1996). Two currently approved therapies for SMA modulate splicing of the SMN2 transcript to increase SMN levels, while a third acts via viral delivery of the SMN1 gene to patient cells (Paik, 2022; Neil and Bisaccia, 2019; Blair, 2022). For each, efficacy relies on administration of the drug early in disease progression—a problem exacerbated when considering frequently late diagnoses, particularly in milder forms of the disease (Lin et al., 2015). New approaches seek to identify additional drugs capable of rescuing phenotypic changes resulting from low SMN levels in conjunction with therapeutics to raise SMN levels (Long et al., 2019), and clinical recommendations for newborn screening for SMA now employed in the United States (Zaidman et al., 2024).
Historically, drug discovery approaches for central nervous system (CNS) diseases have proven challenging in part due to the heterogeneity of genetics and patient presentation. Adding to this complexity, a majority of these disorders have multiple traits and phenotypes with unclear linkage between genetic risk loci and functional consequences (Pihlstrøm et al., 2017; Geschwind and Flint, 2015; Hu et al., 2014). At the same time, cellular models for CNS disorders can be limited by the inability of cultured cells to replicate disease physiology, especially in the case of aging-related diseases. To address these limitations, several studies have utilized potent stressors such as the proteasome inhibitor MG132 or ER stressors such as thapsigargin or tunicamycin to exacerbate in vitro phenotypes for CNS disorders (Harhouri et al., 2017; Watts et al., 2023). Although successful in this regard, such robust stimuli may mask individual, physiologically relevant cellular and genetic variation, thereby masking more subtle differences caused by disease-associated genes. Building on these approaches, we posit that studying the response of patient-specific cells to large numbers of diverse perturbagens may uncover novel aspects of CNS disease biology.
Fortuitously, novel chemical genetic approaches for drug discovery have coincided with technological advances in high-throughput screening and gene expression profiling platforms (Subramanian et al., 2017; Lamb et al., 2006). In one notable example, the connectivity map (CMAP) allows for the discovery of functional connections between diseases, genes, and drugs by combining comprehensive perturbation databases with transcriptomic profiles of cancer cell lines treated with thousands of perturbagens (Subramanian et al., 2017). Further, the Luminex1000 (L1000), a cost-effective, high-throughput bead-based platform, is employed within the CMAP pipeline to measure the expression profile of approximately 1,000 landmark genes and impute expression of the remaining genes in the transcriptome. To date, CMAP includes 1.3 million open-access, queryable perturbation profiles and has uncovered previously unknown mechanisms of drug effects, allowing researchers to identify transcriptional responses and/or perturbagens that mimic or reverse them. Although studies have employed CMAP to identify druggable targets across multiple diseases (Musa et al., 2017), the most commonly used version of the platform utilizes perturbation profiles generated in immortalized cell lines, limiting its ability to uncover complexities of CNS disease biology, particularly those impacted by patient genetics.
Here, we adapt the CMAP/L1000 platform to study the biology of CNS diseases such as SMA, generating a tool we term CMAPneuro. To this end, we utilized a forward programming-based differentiation protocol to generate Neurogenin-2 (NGN2) cortical glutamatergic neurons from SMA patient-specific iPSCs (Nehme et al., 2018). While NGN2 neurons are distinct from motor neurons impacted in SMA, this approach allows us to generate large numbers of neuronal cells for downstream perturbational screening. To this end, we subsequently exposed each line to a novel library of 360 neuroactive CNS disease-related perturbagens and measured transcriptional changes in a curated set of 467 neural genes. In doing so, we generated a database of gene expression profiles depicting the response of SMA neurons to diverse perturbagens, and in the process, identified compounds that reversed or exacerbated differences between severe and moderate SMA. Finally, to expand the accessibility of this resource, we generated a queryable dataset as part of CMAP’s CLUE.io portal1 for researchers to: (1) identify CNS disease-relevant perturbagens that mimic or reverse a set of differentially expressed genes, or (2) explore the transcriptional response of a given perturbation in diverse SMA neuronal cells. Utilization of CMAPneuro will allow researchers to identify candidate compounds for follow-up study into SMA disease-related processes.
Results
Generation and characterization of type 0, 1, and 3 SMA patient-specific iPSC-derived NGN2 neurons
To represent SMA subtypes in our CMAPneuro platform, we took advantage of previously characterized iPSC lines derived from 3 type 0/1 and 3 type 3 SMA patients (Figure 1a; Rodriguez-Muela et al., 2018; Ng et al., 2015; Rodriguez-Muela et al., 2017; Leow et al., 2024; Heo et al., 2025). Included lines represent two female and four male individuals, generated from biopsies obtained from patients with varying disease severities as noted by Hammersmith Functional Motor Scale (HFSME) and Children’s Hospital of Philadelphia Infant Test of Neuromuscular Disorders (CHOP INTEND) scores (clinical metrics for SMA patients in children and infants, respectively) (Glanzman et al., 2010; O’Hagen et al., 2007). Notably, SMA is diagnosed in patients via genetic testing of the SMN1 locus, while type is determined based on a variety of clinical parameters (e.g., age-of-onset, maximal motor function achieved) (Arnold et al., 2015). In line with this, iPSC lines included in this study were derived from patients with a range of phenotypes and SMN2 copy numbers.
Figure 1. Development and characterization of SMA patient-derived NGN2 neurons. (a) Clinical characteristics of SMA patients from which iPSC lines were derived. (b) SMA patient fibroblast samples were reprogrammed into iPSCs and differentiated into NGN2 cortical-like neuronal cells via forward programming. 7-days post-dissociation and plating into 96-well plate format, cells were stained via IF, or protein lysates were generated for Western blot and ELISA −/+ splicing modulator C3. (c) Type 1 (138D) and Type 3 (149A) iPSCs successfully differentiate into cortical-like NGN2 cells that express TUJ1 and SMN by IF. (d) NGN2 cells differentiated from type 1 (138D) and type 3 (149A) iPSC lines exhibit equivalent neurite length as measured by TUJ1 staining. The total neurite length relative to the number of cells in each well is depicted. NS = not significant. (e) Type 0/1 NGN2 neurons display lower levels of SMN than type 3 NGN2 neurons by Western blot. (f) Quantification of SMN protein levels by Western blot, normalized to β-tubulin. N = 3 differentiations. Type 0/1 lines are noted in red, type 3 lines in blue. ANOVA for significance; F(5,12) = 42.45; p = 3.21×10−7. (g) Western blot denotes increases in SMN level upon treatment with 0.5 and 1 μM C3 for 72 h in type 1 (138D) NGN2 neurons. (h) Quantification of SMN protein signal in (g) normalized to β-tubulin in type 1 NGN2 neurons −/+ C3. Significance determined via unpaired t-test; *p-value<0.05, **p < 0.01. (i) At baseline, type 1 (138D) iPSC-derived NGN2 cells exhibit lower SMN levels compared to type 3 (149A) iPSC-derived NGN2 cells via ELISA, with detectible increases in SMN being produced by exposure to the SMN splicing modulator C3 (24h, 72 h; C3-1 = 1 μM and C3-2 = 5 μM). Each condition compared to its respective DMSO control; unpaired t-test for significance; *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001. Blue dots denote type 3 cells, red dots denote type 1 cells.
To generate a sufficient quantity of homogeneous, post-mitotic human neurons for perturbation-based screening, we employed a commonly used, rapid, and reproducible neuronal differentiation method based on combining small molecule patterning toward a cortical excitatory identity via overexpression of the lineage-specific transcription factor Neurogenin-2 (NGN2) (Nehme et al., 2018; Figure 1b). In differentiating each iPSC line into NGN2 neurons of cortical and glutamatergic identity, in type 1 and type 3 cells, we observed widespread expression of SMN and pan-neuronal marker TUJ1 by immunofluorescence (IF) (Figure 1c). NGN2 neurons derived from moderate or severe SMA types exhibited similar neurite length normalized to cell number via TUJ1 staining (Figure 1d). This observation is in contrast with SMA patient iPSC-derived motor neurons, which exhibit observable phenotypic differences such as type 1 neurons degenerating more rapidly than type 3 neurons under standard culture conditions (i.e., without added stressors) (Rodriguez-Muela et al., 2017). However, in line with clinical observations (Crawford and Pardo, 1996), type 1 NGN2 neurons exhibited substantially lower SMN protein levels than type 3 cells by Western blot and ELISA (Figures 1e,f). In both type 1 and 3 cells, exposure to a small molecule SMN splicing modulator C3 (a chemical analog of risdiplam) for 24 and 72 h increased SMN protein levels by Western blot and ELISA (Figures 1g–i). Aside from intracellular SMN levels, all other morphological characteristics examined were similar between all type 0/1 and type 3 cell lines tested. As noted, NGN2 neurons are in contrast to motor neurons, the major cell type affected in SMA. As a result, we recommend that users of CMAPneuro cross-validate predicted perturbation-induced changes in an appropriate cell type (e.g., motor neurons) depending on the biological question of interest.
A chemical genetic screening platform to measure the transcriptional responses of SMA neurons to diverse neuroactive perturbagens
After assessing the ability of patient-specific iPSC-derived NGN2 cells to recapitulate some aspects of SMA biology, we next sought to modify and expand the CMAP platform originally focused on cancer-related immortalized cell lines to SMA NGN2 neurons (Figure 2a). To accomplish this, we differentiated 3 type 0/1 and 3 type 3 patient-specific iPSC lines into NGN2 neurons, matured the cells for 7 days (a total of 14 days of differentiation), and exposed each line to a curated set of 360 compounds at 2 concentrations for 24 h. Compounds were selected based on known neuroactivity and represent diverse mechanisms of action implicated in many neurological disorders, including calcium channel blockers, glutamate receptor antagonists, acetylcholine receptor agonists, and HDAC inhibitors (Figure 2b; Supplementary Table 1).
Figure 2. A chemical genetic platform to measure transcriptional responses to neuroactive perturbagens in iPSC-derived NGN2 neurons. (a) IPSCs derived from 6 SMA patients (1 type 0, 2 type 1, and 3 type 3) were differentiated into NGN2 neurons. After dissociation and plating into 384-well plates, cells were exposed to 360 compounds at 2 concentrations (0.5 μM, 5 μM) for 24 or 72 h and subsequently lysed for targeted transcriptional profiling. (b) A mosaic plot depicting the approximate mechanism of action for each compound is included. 360 perturbagens were selected based on known activity on neurons (perturbagens noted in Supplementary Table 1). (c) Bar graph noting pathways related to genes transcriptionally profiled post-perturbation. Magnitude represents the number of genes belonging to each pathway. HS = Homo sapiens; GLUT = glutamatergic neurons; quality control = stress response (genes noted in Supplementary Table 2). (d) t-SNE projection of NDIS-D L1000. Each point corresponds to a particular compound, cell line, and dose combination, stratified by SMA cell line, with the size of the dot reflective of TAS. (e) t-SNE projection of CMAPneuro data depicting response of type 0/1 and type 3 NGN2 cells to neuroactive compounds. Each point corresponds to a particular compound, cell line, and dose combination, stratified by SMA cell line, with the size of the dot reflective of TAS.
Given the importance of cell type in mediating response to various perturbations, along with the tissue-specific nature of SMA, we sought to develop a customized Luminex gene panel tailored to measure changes in CNS-related genes. To this end, we curated a panel of 467 neuronal-specific genes to probe within CMAPneuro (Figure 2c; Supplementary Table 2). Genes were selected based on a combination of empirical analyses and literature searches using the following criteria: high expression levels in the brain or SMA-related cell types, previous implication in SMA pathogenesis, inclusion in Nanostring’s neuropathology panel, and existing L1000 landmark genes (Supplementary Table 3).
To detect gene-specific sequences, Luminex beads were coupled to DNA oligos complementary to each barcode used in the CMAPneuro probe pool. Probe and primer design and coupling were performed as described (Subramanian et al., 2017); however, because the CMAPneuro probe pool contains only 467 unique genes, we were able to couple one gene’s DNA barcode to each bead color, as opposed to the 2:1 ratio used in the standard 978-gene L1000 probe pool. As a result, we circumvented the need to perform the peak deconvolution procedure used in the standard L1000 pipeline. The CMAPneuro probes were validated by performing a signature analysis comparing their measured expression values with RNAseq data from a panel of 96 reference cell lines (see Methods), with 76% of probes being recalled successfully (Supplementary Figure 1a). Probes that appeared to fail recall were enriched for genes with generally low expression or constant expression after perturbation, suggesting that these recall failures were not due to poor probe performance (Supplementary Figure 1b). As a quality control metric, within the L1000 pipeline, expression levels of an invariant gene set are measured within all samples, allowing for normalization and exclusion of samples with potentially non-biologically relevant signal (Lamb et al., 2006). Furthermore, within the CMAP analysis, cell line and dosage are evaluated individually. Because of this, results can be deconvoluted with respect to each donor and perturbagen dosing paradigm, allowing for exclusion of individual samples from downstream analyses.
SMA NGN2 neurons exhibit changes in transcription of neurological disease-associated genes in response to CMAPneuro perturbagens
In order to visualize transcriptional differences across disease types after perturbation, we first performed t-distributed Stochastic Neighbor Embedding (t-SNE) (Maaten and Hinton, 2008) on the CMAPneuro signatures resulting from exposure of each NGN2 line to the newly curated library of neuroactive compounds. Specifically, we plotted transcriptional activity score (TAS), a metric that incorporates the magnitude and consistency of a given response. Initially, we plotted TAS post-perturbation in genes included within the original CMAP/L1000 platform. In doing so, we observed robust changes in TAS resulting from exposure to many compounds, with few outlier compounds detected (Figure 2d). Building on this, we next sought to investigate TAS performance using our novel CMAPneuro gene panel. In doing so, in contrast to the L1000 tSNE, we observed a greater number of outlier perturbagens with high tau score (Figure 2e).
The “tau” connectivity score is a measure of similarity between a query gene signature and a reference perturbagen signature, ranging from −100 (negative) to +100 (positive). A score of +100 indicates a strong positive connection (highly similar gene expression, or mimic of signature), while a score of −100 indicates a strong negative connection (opposite gene expression patterns, or reversal of signature).
Querying CMAPneuro to identify compounds that modulate SMA subtype-affected genes
Above, we show that by incorporating neuroactive compounds and assessing neurological disease-associated genes utilizing CMAPneuro, we can identify compounds that elicit strong transcriptional effects in SMA patient-specific NGN2 neurons. We next sought to employ the traditional CMAP workflow to identify compounds that mimic or reverse a transcriptional signature of interest on a novel dataset—here, differences in gene expression between moderate and severe SMA subtypes.
To test the “Query” function of CMAPneuro, we differentiated 3 type 0/1 and 3 type 3 SMA patient-specific iPSCs into NGN2 neurons, isolated RNA from each at baseline, and performed bulk RNAseq (Figure 3a). In doing so, we observed separation of moderate and severe SMA NGN2 neurons via gene expression as noted by principal component analysis (PCA) (Figure 3b; Supplementary Data File 1) (N = 383 differentially expressed genes (DEGs), unadjusted p-value < 0.05). NGN2 neurons from type 0/1 and type 3 SMA backgrounds exhibited similar expression levels of pan-neuronal markers (TUBA1A, MAP2, RBFOX3, STMN2), glutamatergic neuron markers (CAMK2A, SLC17A6, GRIA1, GRIN1), and mature neuron markers (DLG4, VAMP2, SYN1, SNAP25, SYT1, KCNQ2), with limited expression of pluripotency genes (POU5F1, NANOG) (Supplementary Figure 2). Moreover, DEGs between type 0/1 and 3 neurons were related to neuronal development, cell motility, and transcriptional regulation by RNA polymerase II (Figures 3c,d), processes found to be relevant to SMA and SMN biology (Zhang et al., 2013; Shi et al., 2025; Pellizzoni et al., 2001) by MSigDB (Supplementary Data File 2).
Figure 3. Identification of perturbagens that shift SMA-type transcriptional signatures in NGN2 neurons. (a) Type 0/1 and type 3 SMA patient-specific iPSCs were differentiated into NGN2 neurons, RNA was isolated from each, and RNAseq was performed. DEGs were subsequently queried against the CMAPneuro database to identify compounds that mimic or reverse SMA type-specific gene signatures. Samples were run in biological triplicate with one technical replicate. (b) Type 0/1 and type 3 NGN2 neurons by PCA, separated by DEGs. Samples were run in biological triplicate (single technical replicates). (c) MSigDB over enrichment analysis of DEGs (N = 383, unadjusted p-value < 0.05) from type 0/1 and type 3 SMA iPSC-derived NGN2 cells (see also Supplementary Data File 1). Overlap notes the number of genes within each pathway noted. (d) Heatmap noting gene expression differences between type 0/1 (left) and type 3 (right) SMA NGN2 neurons. Each column represents NGN2 neurons differentiated from 1 SMA iPSC line. Log2FC depicted. (e,f) DEGs between type 0/1 and type 3 cells were input into CMAPneuro, and compounds that mimic (red) or reverse (blue) the signatures were identified. Median tau scores represent the strength of the connectivity across the cell lines. Type 0/1 cells are noted in red, while type 3 cells are noted in blue text.
We subsequently mapped the Ensembl identifiers from these data to HGNC and split the DEG list into the top 137 positive and 150 negative log2 fold-change genes to input into CMAPneuro. Based on the median tau score, the perturbagen alvocidib positively connected with (i.e., “mimics”) gene expression differences between type 0/1 and 3 SMA cells (Figure 3e). Alvocidib (flavopiridol) has been shown to prevent apoptosis and has been associated with neuroprotection in cerebellar granule neurons (Jorda et al., 2003). Conversely, disulfiram negatively connected with (i.e., “reverses”) gene expression differences between types 0/1 and 3 cell lines based on median tau score (Figure 3f), and has been demonstrated to prevent pyroptosis (Xu et al., 2024) and may have potential in modulating AD processes (Reinhardt et al., 2018). Here, CMAPneuro allowed for the identification of several compounds that may impact SMA subtype-specific biology.
An open-source queryable SMA perturbational dataset to identify compounds that mimic or reverse transcriptional states and disease-associated processes
Through the CMAPneuro platform, we generated a dataset of >13,000 perturbational gene expression profiles (>4,500 signatures after collapsing biological replicates) in six SMA lines, representing the largest perturbational transcriptional dataset in SMA cells. To maximize the utility of these data for the larger research community, we have made them available for download and interactive query analysis via https://clue.io/data/ (Figure 4a). Collectively, these SMA-focused datasets enable researchers to: (1) identify compounds that mimic or reverse selected gene expression signatures, and (2) explore the transcriptional responses of SMA neurons to given compounds.
Figure 4. An open-source queryable platform to identify neuroactive perturbagens that induce or reverse disease-relevant gene expression changes in SMA patient-specific NGN2 neurons. (a) Two use cases for CMAPneuro. In the first step, users enter a list of DEGs (e.g., comparing disease to control cells) into the CMAPneuro platform and receive a list of compounds that mimic or reverse similar DEGs in SMA NGN2 neurons. In the second, users enter a compound of interest (e.g., one associated with a given biological process) and receive genes whose expression levels are modulated in NGN2 neurons. (b) DEGs were obtained from Ng et al. (2015) in an experiment in which control and SMA iPSC-derived motor neurons were produced and subjected to RNA-seq. (c) Selected DEGs from (b) were input into CMAPneuro, and signature mimetic or reversal compounds were identified. Directionality of noted DEGs was SMA/control motor neurons. Top 25 mimicking (red bar) and 25 reversing (blue bar) compounds are listed. (d) AMG-925, a compound found to induce opposite gene expression changes compared to inputted SMA/wild-type motor neurons from (b), was entered into CMAPneuro. (e) The top 30 perturbagens that elicit gene expression changes similar to AMG-925 in type 0/1 and type 3 NGN2 neurons are noted. pc_selection = perturbagen selected, pert_id = perturbagen name, cell_id = cell line data is received from, pert_idose = dose of compound eliciting given response, pert_itime = exposure time of compound eliciting given response, query_cell_id = cell line data is derived from in Touchstone Connectivity, pc = the percent of total perturbagens querying the row against the column, type = classification of perturbagen entered (here, AMG-925), CP = compound, CTRL = control, KD = knockdown, OE = overexpression. Type 0/1 cells are noted in red, while Type 3 cells are noted in blue text.
To further test the applicability of CMAPneuro and its corresponding L1000 datasets, we analyzed publicly available RNA-seq data comparing motor neurons differentiated from type 1 SMA and control iPSCs (Ng et al., 2015; Figure 4b). To do this, we sorted a total of 913 DEGs (FDR < 0.01) by absolute log2 fold change and input the top 150 up and down gene symbols into both CMAPneuro and the original CMAP/L1000 queries. Utilizing CMAPneuro, positive median tau scores (“mimic” signatures) showed overrepresentation of perturbagens such as AMG-517, BRD-K68405354, perzinfotel, AZD-2858, itopride, phenformin, LY-2835219, taranabant, and VU-10010 (Figure 4c, top). In other words, these compounds result in gene expression differences similar to those seen when comparing SMA to control motor neurons. Conversely, the negative median tau scores (reverse signatures) showed hits for nizatidine, SB-269970, AA-29504, NBXQ, CP-945598, SKF-81297, piracetam, dapiprazole, AMG-925, and BML-190 (Figure 4c, bottom). Here, these compounds result in gene expression changes that are directionally opposite to those observed between SMA and wild-type motor neurons (i.e., shifting an SMA transcriptional state toward a wild-type state). Important to note, the CMAPneuro platform requires users to enter a list of DEGs with directionality (i.e., foldchange). As a result, depending on the directionality of said foldchanges, the transcriptional consequence of a given compound may change. In other words, if foldchanges describing a disease state over control were entered, a “mimicking” compound would result in gene expression changes more similar to the disease state.
In a second use case, the “Touchstone Connectivity” function available in the L1000 dataset allows researchers to select a perturbagen, cell line, and concentration (or multiple selections thereof) and identify genes whose expression is altered in SMA NGN2 neurons. To demonstrate this function, we selected the compound AMG-925 (found in Figure 4c to reverse an SMA vs. wild-type DEG signature in CMAPneuro). After entering AMG-925 into the Touchstone Connectivity function (Figure 4d), we received perturbagens that elicit similar gene expression changes to AMG-925 in type 0/1 and type 3 NGN2 neurons (Figure 4e). For example, in all backgrounds, genes modified via exposure to AMG-925 are similar to overall expression patterns affected by knockdown of genes involved in proliferation/regulation of cell cycle (BRD2, TBX3, BMI1, ATPS5), epigenetic modifications (SENP5), and cell signaling (CDH3, EVL, CACNA1C, IL6). As demonstrated here, CMAPneuro represents a novel tool for researchers to identify CNS disease-related compounds and the neuro-related genes they impact.
Discussion
Despite advances in our understanding of SMA biology, the efficacy of current therapeutic strategies relies on increasing SMN levels. Added clinical benefit may be achieved via therapeutics that improve disease-associated phenotypes even without increasing SMN levels. Due to the genetic and clinical heterogeneity of CNS disorders such as SMA, identification of such druggable targets is highly complicated. To address this, we developed the CMAPneuro platform to employ chemical-genetic-based screening to identify gene expression changes in SMA patient-derived neurons resulting from exposure to a compendium of neuroactive compounds. In doing so, we generated more than 4,000 transcriptional profiles representing the response of moderate and severe SMA iPSC-derived neuronal cells to diverse neuroactive compounds.
In addition to allowing insight into differences in response to perturbations across SMA neurons, CMAPneuro provides a rich dataset representing transcriptional responses of cortical-like neurons to diverse, neuroactive chemical perturbagens. Similar to the existing CMAP/L1000 platform, CMAPneuro represents an open-access, readily queryable dataset. Utilizing the CLUE.io application (see text footnote 1), users can enter a set of genes and be directed to a list of perturbagens that elicit a similar or related transcriptional response in SMA 0, 1, or 3 neurons. Conversely, researchers can query a compound related to a particular pathway or mechanism of action to ask how its activation may affect the transcriptional profile of human SMA neurons.
Here, we have adapted the CMAP/L1000 platform, initially developed to study cancer cells, to investigate neurodegenerative disorders such as SMA via two approaches. First, the original platform utilized immortalized cancer cell lines (e.g., prostate cancer, breast cancer, cervical cancer, myelogenous leukemia, and acute promyelocytic leukemia cells) (Subramanian et al., 2017; Lamb et al., 2006) to generate its reference dataset. Although advantageous given the initial focus of the program, immortalized non-neuronal cell lines are often suboptimal for studying neurodegenerative disorders, neglecting the highly important genetic context of the patient and endogenous cell type. In place of immortalized cell lines, we utilized NGN2 cortical-like neurons differentiated from iPSCs derived from SMA patients. Importantly, these cells are more likely to reflect processes dysregulated in neurodegenerative disorders compared to immortalized cancer cell lines. Second, we included a targeted library of perturbagens and a panel of genes associated with neurological tissue and disease.
In this work, we highlight the ability of the CMAPneuro platform to (1) identify compounds that mimic or reverse a given gene expression signature, and (2) assess the impact of a given compound on the transcriptional profile of SMA NGN2 neurons. To achieve the first goal, we input DEGs from a previously published RNAseq analysis (Ng et al., 2015) comparing wild-type and type 1 SMA iPSC-derived motor neurons (Figure 4) into CMAPneuro (see text footnote 1), with the goal of identifying compounds that induce transcriptional changes seen between healthy and SMA motor neurons. In doing so, we identified AMG-925, a CDK4 inhibitor, as one of the top compounds phenocopying transcriptional differences between wild-type and SMA motor neurons. Building on this, we then entered AMG-925 into CMAPneuro to profile gene expression changes resulting from its exposure to moderate and severe SMA NGN2 cells. In doing so, we observed changes in the expression of genes associated with proliferation (e.g., BRD2, TBX3, BMI1, ATPS5). Future work should first validate the predicted gene expression changes in iPSC-derived motor neurons upon exposure to AMG-925 prior to hypothesis testing with the compound.
In a similar vein, researchers have been utilizing the original CMAP/L1000 platform to identify compounds that mimic or reverse gene expression signatures observed in a variety of diseases and cell types. In one example, researchers employed CMAP/L1000 to identify compounds that reverse gene expression differences observed within the muscle tissue of SMN knockout and wild-type mice (Meijboom et al., 2021). In doing so, they identified several candidate compounds that induce gene expression patterns opposite of SMN−/− compared to wild-type mice that, upon further investigation, were found to rescue SMA-related muscle damage in vitro and in vivo. Beyond SMA, Shindyapina et al. (2025) recently utilized CMAP/L1000 to identify compounds that induce gene expression patterns associated with proposed biomarkers of aging and longevity. Impressively, several compounds they identified were found to increase lifespan in male mice.
The CMAPneuro platform described here can prioritize candidate compounds for follow-up investigation in SMA disease biology. Despite this promise, there are several limitations to be considered regarding this assay. First, this platform is based on transcriptional profiles generated in NGN2 neurons. Although these cells are non-cancerous and more relevant to diseases of the CNS, non-neuronal cells (e.g., vasculature and glial cells) have also been shown to contribute to SMA disease pathology (Simone et al., 2015; Zhou et al., 2022; Hamilton and Gillingwater, 2013). Similarly, NGN2 neurons are closer to neuron populations found in the cortex of the brain, in contrast to spinal cord motor neurons more typically affected by SMA.
Here, NGN2 neurons were selected given the robustness, relative homogeneity, and large scalability of their differentiation protocol—characteristics necessary for generating a chemical-genetic-based transcriptional screening dataset. However, it is important to note that these cells did not reproduce phenotypic differences originally seen in SMA motor neurons. Despite this caveat, although NGN2 neurons are distinct from motor neurons directly impacted in SMA, several studies demonstrate the ability to detect disease-relevant phenotypes in cells not directly linked to disease onset. Several examples of this can be seen in neurological diseases, including fibroblasts in SMA (Yang et al., 2019) and Parkinson’s Disease (Schiff et al., 2022), and neural progenitor cells in schizophrenia (Tegtmeyer et al., 2025). Additionally, recent work has highlighted the involvement of cortical cells in SMA disease progression in mouse models (Wishart et al., 2010; d’Errico et al., 2013) and a limited number of patients (Mendonça et al., 2019). Importantly, however, NGN2 neurons may exhibit transcriptional responses different from those of other cell types (e.g., motor neurons) in response to perturbagens. As a result, we encourage users to validate CMAPneuro-predicted gene expression changes in their given cell type of interest prior to biological follow-up. Finally, we note that the patient iPSC lines included in the original perturbational screen used to generate CMAPneuro were not balanced for sex (2 female, 4 male lines) and do not include type 2 patients. As a result, biological follow-up and validation are recommended when investigating questions regarding sex or type 2 SMA biology.
Improvements to CMAPneuro could be made by performing bulk RNA sequencing or optical pooled screening as outputs of disease-specific neurons under conditions in which perturbagen treatment produces detectable phenotypic differences. Notably, longer exposure paradigms are more likely to replicate processes dysregulated in CNS disorders that often evolve over much greater periods of time.
Despite these limitations, comprehensive use of compound-based perturbation, coupled with transcriptional screening, has great potential for uncovering novel insights into neuronal cell biology and disease processes. Importantly, though originally designed to study SMA biology, this platform can be expanded to diseases with commonly dysregulated biological mechanisms, such as generalized neurodegenerative disorders.
Methods
Cell culture of human pluripotent stem cells
Patient-specific iPSC lines utilized were derived as previously described (Rodriguez-Muela et al., 2018; Ng et al., 2015; Rodriguez-Muela et al., 2017; Leow et al., 2024; Heo et al., 2025). Cells were maintained in feeder-free conditions on Matrigel (VWR, Cat. No. BD35427) and cultured in STEMFLEX media (Life Technologies, Cat. No. A3349401). Work with hiPSCs was determined to be Not Human Subjects Research by the Harvard University Area Institutional Review Board (IRB No. IRB25-1308). IPSC lines included were generated from fibroblasts obtained from the Pediatric Neuromuscular Clinical Research Network and generated via Sendai virus reprogramming. All lines were determined to be karyotypically normal by G-banding.
Lentivirus production and transduction of iPSCs
Lentiviral particles containing pTet-O-Ngn2-puro constructs were generated via the SuperLenti Lentiviral Packaging Mix kit (ALSTEM, Cat. No. VP100) (Nehme et al., 2018). Briefly, packaging 293 T cells were seeded at 4×106 cells per 100 mm plate. 2.5 μg tet-inducible NGN2 plasmid was added to the lentiviral packaging mix diluted in DMEM (ThermoFisher, Cat. No. 11330057) and added dropwise to cells. Virus-containing media were collected over the next 72 h and concentrated via addition of Lentivirus Precipitation Solution (ALSTEM, Cat. No. VC100) and centrifugations per manufacturer’s instructions.
NGN2 cell differentiation
NGN2 cortical-like cells were generated via a two-dimensional, feeder-free, cytokine- and transgene-driven differentiation protocol as previously described (Nehme et al., 2018). Briefly, iPSCs were removed via EDTA and plated at high density in STEMFLEX media supplemented with ROCK inhibitor. After 24 h, cells were exposed to N2 basal media supplemented with SB431542 (10 μM; R&D Systems, Cat. No. 1614), XAV939 (2 μM; ReproCell, Cat. No. 04-0046), LDN-193189 (100 nM; ReproCell, Cat. No. 04-0074), and doxycycline (2 μg/mL). The following day, cells were treated with N2 basal media containing SB431542 (5 μM), XAV939 (1 μM), LDN-193189 (50 nM), doxycycline (2 μg/mL), and puromycin (5 μg/mL; Life Technologies, Cat. No. LA1113803). On day 3, cells were exposed to N2 media supplemented with B27 (1:50), doxycycline (2 μg/mL), and puromycin (1:2,000). On day 4, cells were cultured in NBM basal media supplemented with doxycycline (2 μg/mL), BDNF (10 ng/mL; Miltenyi Biotec, Cat. No. 130-103-435), CNTF (10 ng/mL; Miltenyi Biotec, Cat. No. 130-123-659), and GDNF (10 ng/mL; Miltenyi Biotec, Cat. No. 130-108-986). On day 5, cells were exposed to NBM basal media supplemented with doxycycline (2 μg/mL) (Sigma Aldrich, Cat. No. D9891), BDNF (10 ng/mL), CNTF (10 ng/mL), GDNF (10 ng/mL), and U/FDU (10 μM).
Prior to re-plating on day 7, wells were first coated with poly-L-ornithine (1:1000) (Sigma Aldrich, Cat. No. P4957) and poly-D-lysine (1:100) (ThermoFisher, Cat. No. A-003-E) in borate buffer overnight and then coated with fibronectin (1:200) (Corning, Cat. No. 356008) and laminin (1:200) (Life Technologies, Cat. No. 23017015) for 3 h. On day 7, cells were removed from their original plates with trypsin (0.25%) supplemented with DNaseI and plated in media containing ROCKi in 6-, 12-, or 96-well plate format. Day 7 media is composed of NBM basal media supplemented with B27, BDNF (10 ng/mL), CNTF (10 ng/mL), and GDNF (10 ng/mL). On day 8, the media was completely removed and replaced with day 7 media minus ROCKi. Half-media changes were performed every 3–4 days after plating until downstream application.
N2 basal media consists of DMEM/F12 supplemented with glutamax (1:100), dextrose (0.3%), and N2 supplement (1:500). NBM basal media consists of neurobasal media supplemented with glutamax (1:100), dextrose (0.3%), MEM NEAA (1:100), and B27 (1:50).
Western blot
To generate protein lysates, cells were gently washed with ice cold PBS. RIPA buffer containing phosphatase inhibitor (1:100) (Avantor, Cat. No. 78420) and protease inhibitor cocktail (1:100) (Avantor, Cat. No. 87786) was added to each plate. Cells were removed using a cell scraper and incubated in a microcentrifuge tube on ice for 30 min. Samples were then centrifuged at 17,000×g for 20 min at 4 °C. Supernatant was then transferred to a fresh microcentrifuge tube. Protein lysates were quantified via Pierce BCA Protein Assay Kit (ThermoFisher, Cat. No. 23225), and 25 μg of each lysate was loaded onto an SDS-PAGE gel. Blots were subsequently probed for SMN (1:2,000) (BD Biosciences, Cat. No. 610647) or Beta tubulin (1:10,000) (Abcam, Cat. No. 6046). Resulting bands were quantified via Fiji software (version 2.3.0).
SMN ELISA
Protein was isolated from each well via Enzo SMN ELISA kit (Enzo, Cat. No. ADI-900-209) per manufacturer’s instructions. Briefly, lysis buffer supplemented with PMSF (1 mM), phosphatase inhibitor (Avantor, Cat. No. 78420), and protease inhibitor cocktail (Avantor, Cat. No. 87786) was added to each well and cells were incubated on ice for 30 min and subsequently centrifuged at 14,000×g for 20 min. Supernatant was stored at −80 °C until used. SMN concentration was determined via ELISA per manufacturer’s protocol.
Immunofluorescence
Cells were fixed in 4% paraformaldehyde (VWR, Cat. No. 100503-917) for 20 min at room temperature. Fixed cells were washed and incubated for 1 h at room temperature in blocking/permeabilization solution (5% FBS, 2% BSA, 0.3% Triton X-100). Following blocking/permeabilization, cells were incubated with primary antibody [TUJ1 (Sigma Aldrich, Cat. No. T2200), 1:200; SMN (BD Biosciences, Cat. No. 610647), 1:150] diluted in blocking/permeabilization solution overnight at 4 °C. Cells were washed and incubated with respective secondary antibody for 1 h at room temperature. Cells were subsequently stained with Hoechst (1:2,000) for 15 min at room temperature and imaged via the ImageXpress Micro Confocal High-Content Imaging System (Molecular Devices).
RNA sequencing
Bulk RNA sequencing was performed using an Illumina NextSeq (v 2.1.0) sequencer, with library preparation via Illumina TruSeq RNA Library Prep Kit v2. Base calls were quality filtered and converted to FASTQ files using bcl2fastq2 (v 2.20.0.422, Illumina) using the default parameters, and resulted in an average of 25.1 M paired-end reads per cell line (average of 88% of reads passing filter). Pseudo-counts were generated from the FASTQ files using kallisto (v 0.45.0, Bray et al., 2016). Kallisto transcript abundance was imported and reduced to Ensembl gene identifiers via tximport (v 1.34.0). Genes were analyzed via DESeq2 (v 1.46.0), first filtering out genes with less than three counts in three samples. Apeglm shrinkage estimates were obtained (v 1.28.0). PCA and heatmap analysis (pheatmap v 1.0.12) was conducted on the DEGs (unadjusted p-value ≤ 0.05, no log2 foldchange cutoff). Genes were converted to HGNC via org.Hs.eg.db (v 3.20.0). The DEGs were sorted by log2 foldchange, and highest positive (n = 137) and negative (n = 150) genes were used as input to the Clue.io “Query” function against the SMA and L1000 datasets. Over enrichment analysis of the significant DEG (unadjusted p-value ≤ 0.05, no log2 foldchange cutoff). was performed using MSigDB, querying the Hallmark, KEGG, Reactome, and GO Biological Processes gene sets, with an FDR cutoff of 0.05. Significant hits were visualized using ggplot2 (v3.5.1).
For Ng et al. (2015) SMA versus control dataset, the Cuffdiff DEGs were extracted from the supplementary data. The FDR ≤ 0.01 DEGs were sorted by log2 fold change, and the highest positive and negative 150 genes were used as input to the Clue.io “Query” function against the SMA and L1000 datasets.
Neuro probe pool validation
To assess the fidelity of the L1000-neuro probe pool, we used the probes to generate baseline L1000 gene expression profiles for 96 unique cancer cell lines. We then performed a gene recall analysis as follows:
1. We computed the gene-wise Spearman correlation between the L1000 neuro measurements and the equivalent RNAseq log2 RPKM values for the same cell lines, obtained from the cancer cell line encyclopedia (CCLE).
2. We converted the correlations to a recall rank by computing the percentage of the 22,225 genes from the CCLE dataset with a higher correlation coefficient than the matched gene.
We observed that 309 of the 409 L1000-neuro probes (76%) included in the analysis had recall ranks less than 5%, suggesting that the majority of probes’ L1000 measurements match their corresponding expression pattern in the RNAseq data. We also observed that 65% (58 of 89) L1000-neuro probes with poor recall corresponded to genes with low average expression across the 96 cancer cell lines (Supplementary Figure 2), suggesting that poor recall performance was generally not due to failure of the L1000 probes.
L1000 profiling
L1000 data were generated as described in Subramanian et al. (2017). Briefly, differentiated NGN2 neurons were plated into 384-well plates, treated with compound or vehicle control for 24 h, and then lysed. mRNA was captured from the lysates, reverse-transcribed into cDNA, and subjected to ligation-mediated amplification (LMA) with the L1000-neuro probe pool, resulting in barcoded, biotinylated PCR amplicons. The amplicon was then hybridized to Luminex beads with complementary barcodes, stained with streptavidin-phycoerythrin (SAPE), and detected on a Luminex FlexMap 3D flow cytometer, capturing bead color (i.e., transcript identity) and the fluorescent intensity of biotinylated probe.
L1000 data processing
L1000 data were processed as described in Subramanian et al. (2017), generating 5 levels of data:
• Level 1: Raw fluorescent intensities (FI) were captured from the Luminex FlexMAP 3D scanner for each measured gene (either neuro or standard L1000 panel).
• Level 2: To account for two genes measured by each bead barcode, for the original L1000 panel, FI data were deconvoluted, extracting the median FI (MFI) for the two genes. For the neuro panel, which measures only one gene per bead barcode, the median FI value was computed across all measured beads for the given barcode to generate MFI values.
• Level 3: MFI values were loess-normalized to the 10 L1000 invariant gene sets within each well (same for neuro and standard L1000), and all wells on the same detection plate were then quantile-normalized, resulting in each sample having the same empirical distribution.
• Level 4: Gene-wise robust z-scores were then computed for each sample, with reference distribution noted as all samples on the same plate.
• Level 5: Biological replicates were collapsed using a weighted average (each replicate weighted by its average correlation to the others).
Signature recall analysis
We extracted the level 5 signatures of 228 compounds common to both the SMA L1000 dataset and the CMap LINCS2020 release. The LINCS signatures were restricted to exemplar signatures in the core LINCS cell lines. This resulted in 1,886 signatures. For each SMA L1000 compound, we computed the Spearman correlation between its signatures and each LINCS signature using the standard L1000 gene space. We then computed the rank of the matched compound signature in the LINCS data, with lower ranks corresponding to higher correlations. We applied thresholds of TAS ≥ 0.212 to identify active SMA L1000 signatures and a recall rank of ≤190 (in the top ~10% of all LINCS signatures) to signify successful recall.
Transcriptional activity score and constituent metrics
The replicate correlation, signature strength, and transcriptional activity score metrics have been previously described (Subramanian et al., 2017). We have included brief summaries of each metric below.
Replicate correlation
To assess reproducibility across replicates included in the L1000 data, we calculate the 75th quantile of Spearman correlation values obtained from all possible pairwise combinations of replicates across level 4 data.
Signature strength
Signature strength is calculated via the following equations, where SS = signature strength, Z = z-scores, and N = number of replicates.
Transcriptional activity score (TAS)
Transcriptional activity score (TAS) for a given perturbagen is calculated as follows, where SS = signature strength, CC = replicate correlation.
Data availability statement
The datasets presented in this study can be found in online repositories. CMAP data is available for download and interactive query analysis via clue.io/data. RNAseq data generated in this study are available in processed counts matrix form via Figshare at https://doi.org/10.6084/m9.figshare.30753266.
Ethics statement
The use of human material was determined to be Not Human Subjects Research by the Harvard University Area IRB. The research was conducted in accordance with local legislation and institutional requirements. Written informed consent for participation in this study was obtained by the provider of the material from the participants’ legal guardians/next of kin.
Author contributions
RG: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. KH: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing. XH: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. TN: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. SaG: Data curation, Formal Analysis, Investigation, Methodology, Writing – review & editing. StG: Data curation, Formal Analysis, Investigation, Writing – review & editing. NL: Data curation, Formal analysis, Investigation, Writing – review & editing. AS: Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing. LR: Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article.This work was supported by the National Institute on Aging of the NIH (1F32AG079593-01 to R.M.G., 1R01AG072086 to L.L.R.), the National Institute of Neurological Disease and Stroke of the NIH (1R01NS117407 to L.L.R.), the American Federation for Aging Research (to R.M.G.), the Simons Foundation Collaboration on Plasticity in Brain Aging (to L.L.R.), the Stanley Center for Psychiatric Research of the Broad Institute of Harvard and MIT (to L.L.R.), and a generous gift to the Harvard Stem Cell Institute from the Vranos Family Foundation (to L.L.R.).
Acknowledgments
We thank Marek Orzechowski for aiding in the exploratory analysis of the CMAPneuro dataset, Max Macaluso for assisting with project management and organization, and John Davis and Jacob Asiedu for assistance with maintaining the CMAP portal. Additionally, we thank Jane Lalonde and Isaac Adatto for administrative support. Cartoon schematics were generated via BioRender.com.
Conflict of interest
LR is a founder of Vesalius Therapeutics and VALID Tx, a member of their scientific advisory boards, and a private equity shareholder. Both are interested in formulating approaches intended to treat diseases of the nervous system and other tissues. He is also on the advisory boards of Etiome, Myrobalan Therapeutics, ProjenX, and Corsalex.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnins.2025.1695359/full#supplementary-material
SUPPLEMENTARY FIGURE 1 | L1000neuro probe pool validation. (a) Scatter plot depicting the Spearman correlation vs. the corresponding recall percentile rank for each of the 409 common genes. (b) Scatter plots of the 409 genes’ variance vs. mean expression, derived from the 96 CCLE cell lines’ log2 RPKM values, and stratified by whether the gene had good, moderate, or poor recall, corresponding to recall percentile ranks ≤5%, >5% & ≤10%, and >10%, respectively.
SUPPLEMENTARY FIGURE 2 | Expression of neuronal cell type genes in SMA patient-specific iPSC-derived NGN2 neurons. NGN2 neurons from either SMA type exhibit high expression of pan-neuronal (TUBA1A, MAP2, RBFOX3, STMN2), glutamatergic (CAMK2A, SLC17A6, GRIA1, GRIN1), and synaptic/functional markers (DLG4, VAMP2, SYN1, SNAP25, SYT1, KCNQ2) with minimal expression of pluripotency genes (POU5F1, NANOG). No differences in expression were observed across SMA types (Wilcoxon Rank Sum test, p < 0.05). Dots depict individual cell lines; CPM, counts per million.
Footnotes
References
Arnold, W. D., Kassar, D., and Kissel, J. T. (2015). Spinal muscular atrophy: diagnosis and management in a new therapeutic era. Muscle Nerve 51, 157–167. doi: 10.1002/mus.24497,
Blair, H. A. (2022). Onasemnogene Abeparvovec: a review in spinal muscular atrophy. CNS Drugs 36, 995–1005. doi: 10.1007/s40263-022-00941-1,
Bray, N. L., Pimentel, H., Melsted, P., and Pachter, L. (2016). Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527. doi: 10.1038/nbt.3519,
Crawford, T. O., and Pardo, C. A. (1996). The neurobiology of childhood spinal muscular atrophy. Neurobiol. Dis. 3, 97–110. doi: 10.1006/nbdi.1996.0010,
d’Errico, P., Boido, M., Piras, A., Valsecchi, V., de Amicis, E., Locatelli, D., et al. (2013). Selective vulnerability of spinal and cortical motor neuron subpopulations in delta7 SMA mice. PLoS One 8:e82654. doi: 10.1371/journal.pone.0082654,
Geschwind, D. H., and Flint, J. (2015). Genetics and genomics of psychiatric disease. Science 349, 1489–1494. doi: 10.1126/science.aaa8954,
Glanzman, A. M., Mazzone, E., Main, M., Pelliccioni, M., Wood, J., Swoboda, K. J., et al. (2010). The children’s hospital of Philadelphia infant test of neuromuscular disorders (CHOP INTEND): test development and reliability. Neuromuscul. Disord. 20, 155–161. doi: 10.1016/j.nmd.2009.11.014,
Hamilton, G., and Gillingwater, T. H. (2013). Spinal muscular atrophy: going beyond the motor neuron. Trends Mol. Med. 19, 40–50. doi: 10.1016/j.molmed.2012.11.002,
Harhouri, K., Navarro, C., Depetris, D., Mattei, M. G., Nissan, X., Cau, P., et al. (2017). MG132-induced progerin clearance is mediated by autophagy activation and splicing regulation. EMBO Mol. Med. 9, 1294–1313. doi: 10.15252/emmm.201607315,
Heo, K., Zeng, X., Zhang, K., Chen, K., Zhen, S., Naveen, A., et al. (2025). A human iPSC-derived motor neuron-myogenic cell coculture platform to evaluate neuromuscular junction innervation after axon injury and in Spinal Muscular Atrophy. bioRxiv. 2025-10.
Hu, W. F., Chahrour, M. H., and Walsh, C. A. (2014). The diverse genetic landscape of neurodevelopmental disorders. Annu. Rev. Genomics Hum. Genet. 15, 195–213. doi: 10.1146/annurev-genom-090413-025600,
Jorda, E. G., Verdaguer, E., Canudas, A. M., Jiménez, A., Bruna, A., Caelles, C., et al. (2003). Neuroprotective action of flavopiridol, a cyclin-dependent kinase inhibitor, in colchicine-induced apoptosis. Neuropharmacology 45, 672–683. doi: 10.1016/S0028-3908(03)00204-1,
Lamb, J., Crawford, E. D., Peck, D., Modell, J. W., Blat, I. C., Wrobel, M. J., et al. (2006). The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313, 1929–1935. doi: 10.1126/science.1132939,
Leow, D. M.-K., Ng, Y. K., Wang, L. C., Koh, H. W. L., Zhao, T., Khong, Z. J., et al. (2024). Hepatocyte-intrinsic SMN deficiency drives metabolic dysfunction and liver steatosis in spinal muscular atrophy. J. Clin. Invest. 134:e173702. doi: 10.1172/JCI173702,
Lin, C.-W., Kalb, S. J., and Yeh, W.-S. (2015). Delay in diagnosis of spinal muscular atrophy: a systematic literature review. Pediatr. Neurol. 53, 293–300. doi: 10.1016/j.pediatrneurol.2015.06.002,
Long, K. K., O’Shea, K. M., Khairallah, R. J., Howell, K., Paushkin, S., Chen, K. S., et al. (2019). Specific inhibition of myostatin activation is beneficial in mouse models of SMA therapy. Hum. Mol. Genet. 28, 1076–1089. doi: 10.1093/hmg/ddy382,
Maaten, L. V. D., and Hinton, G. (2008). Visualizing data using t-SNE. Journal of machine learning research. 9, 2579–2605.
Meijboom, K. E., Volpato, V., Monzón-Sandoval, J., Hoolachan, J. M., Hammond, S. M., Abendroth, F., et al. (2021). Combining multiomics and drug perturbation profiles to identify muscle-specific treatments for spinal muscular atrophy. JCI Insight 6:e149446. doi: 10.1172/jci.insight.149446,
Mendonça, R. H., Rocha, A. J., Lozano-Arango, A., Diaz, A. B., Castiglioni, C., Silva, A. M. S., et al. (2019). Severe brain involvement in 5q spinal muscular atrophy type 0. Ann. Neurol. 86, 458–462. doi: 10.1002/ana.25549,
Musa, A., Ghoraie, LS., Zhang, SD., Glazko, G., Yli-Harja, O., Dehmer, M., et al. (2017). A review of connectivity map and computational approaches in pharmacogenomics. Brief. Bioinform. 19, 506–523. doi: 10.1093/bib/bbw112
Nehme, R., Zuccaro, E., Ghosh, S. D., Li, C., Sherwood, J. L., Pietilainen, O., et al. (2018). Combining NGN2 programming with developmental patterning generates human excitatory neurons with NMDAR-mediated synaptic transmission. Cell Rep. 23, 2509–2523. doi: 10.1016/j.celrep.2018.04.066,
Neil, E. E., and Bisaccia, E. K. (2019). Nusinersen: a novel antisense oligonucleotide for the treatment of spinal muscular atrophy. J. Pediatr. Pharmacol. Ther. 24, 194–203. doi: 10.5863/1551-6776-24.3.194,
Ng, S.-Y., Soh, B. S., Rodriguez-Muela, N., Hendrickson, D. G., Price, F., Rinn, J. L., et al. (2015). Genome-wide RNA-Seq of human motor neurons implicates selective ER stress activation in spinal muscular atrophy. Cell Stem Cell 17, 569–584. doi: 10.1016/j.stem.2015.08.003,
O’Hagen, J. M., Glanzman, A. M., McDermott, M. P., Ryan, P. A., Flickinger, J., Quigley, J., et al. (2007). An expanded version of the Hammersmith functional motor scale for SMA II and III patients. Neuromuscul. Disord. 17, 693–697. doi: 10.1016/j.nmd.2007.05.009,
Paik, J. (2022). Risdiplam: a review in spinal muscular atrophy. CNS Drugs 36, 401–410. doi: 10.1007/s40263-022-00910-8,
Pellizzoni, L., Baccon, J., Charroux, B., and Dreyfuss, G. (2001). The survival of motor neurons (SMN) protein interacts with the snoRNP proteins fibrillarin and GAR1. Curr. Biol. 11, 1079–1088. doi: 10.1016/S0960-9822(01)00316-5,
Pihlstrøm, L., Wiethoff, S., and Houlden, H. (2017). Genetics of neurodegenerative diseases: an overview. Handb. Clin. Neurol. 145, 309–323. doi: 10.1016/B978-0-12-802395-2.00022-5,
Reinhardt, S., Stoye, N., Luderer, M., Kiefer, F., Schmitt, U., Lieb, K., et al. (2018). Identification of disulfiram as a secretase-modulating compound with beneficial effects on Alzheimer’s disease hallmarks. Sci. Rep. 8:1329. doi: 10.1038/s41598-018-19577-7,
Rodriguez-Muela, N., Litterman, N. K., Norabuena, E. M., Mull, J. L., Galazo, M. J., Sun, C., et al. (2017). Single-cell analysis of SMN reveals its broader role in neuromuscular disease. Cell Rep. 18, 1484–1498. doi: 10.1016/j.celrep.2017.01.035,
Rodriguez-Muela, N., Parkhitko, A., Grass, T., Gibbs, R. M., Norabuena, E. M., Perrimon, N., et al. (2018). Blocking p62-dependent SMN degradation ameliorates spinal muscular atrophy disease phenotypes. J. Clin. Invest. 128, 3008–3023. doi: 10.1172/JCI95231,
Schiff, L., Migliori, B., Chen, Y., Carter, D., Bonilla, C., Hall, J., et al. (2022). Integrating deep learning and unbiased automated high-content screening to identify complex disease signatures in human fibroblasts. Nat. Commun. 13:1590. doi: 10.1038/s41467-022-28423-4,
Shi, T., Zhou, Z., Xiang, T., Suo, Y., Shi, X., Li, Y., et al. (2025). Cytoskeleton dysfunction of motor neuron in spinal muscular atrophy. J. Neurol. 272:19. doi: 10.1007/s00415-024-12724-3,
Shindyapina, A. V., Tyshkovskiy, A., Bozaykut, P., Castro, J. P., Gerashchenko, M. V., Trapp, A., et al. (2025). Molecular signatures of longevity identify compounds that extend mouse lifespan and healthspan. bioRxiv. 2025-06.
Simone, C., Ramirez, A., Bucchia, M., Rinchetti, P., Rideout, H., Papadimitriou, D., et al. (2015). Is spinal muscular atrophy a disease of the motor neurons only: pathogenesis and therapeutic implications? Cell. Mol. Life Sci. 73, 1003–1020. doi: 10.1007/s00018-015-2106-9,
Subramanian, A., Narayan, R., Corsello, S. M., Peck, D. D., Natoli, T. E., Lu, X., et al. (2017). A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell 171, 1437–1452.e17. doi: 10.1016/j.cell.2017.10.049
Tegtmeyer, M., Liyanage, D., Han, Y., Hebert, K. B., Pei, R., Way, G. P., et al. (2025). Combining phenomics with transcriptomics reveals cell-type-specific morphological and molecular signatures of the 22q11.2 deletion. Nat. Commun. 16:6332. doi: 10.1038/s41467-025-61547-x,
Watts, M. E., Giadone, R. M., Ordureau, A., Holton, K. M., Harper, J. W., and Rubin, L. L. (2023). Analyzing the ER stress response in ALS patient derived motor neurons identifies druggable neuroprotective targets. Front. Cell. Neurosci. 17:1327361. doi: 10.3389/fncel.2023.1327361,
Wishart, T. M., Huang, J. P. W., Murray, L. M., Lamont, D. J., Mutsaers, C. A., Ross, J., et al. (2010). SMN deficiency disrupts brain development in a mouse model of severe spinal muscular atrophy. Hum. Mol. Genet. 19, 4216–4228. doi: 10.1093/hmg/ddq340,
Xu, C., Jiang, F., Mao, Y., Wei, W., Song, J., Jia, F., et al. (2024). Disulfiram attenuates cell and tissue damage and blood–brain barrier dysfunction after intracranial haemorrhage by inhibiting the classical pyroptosis pathway. Sci. Rep. 14:21860. doi: 10.1038/s41598-024-67118-2,
Yang, S. J., Lipnick, S. L., Makhortova, N. R., Venugopalan, S., Fan, M., Armstrong, Z., et al. (2019). Applying deep neural network analysis to high-content image-based assays. SLAS Discov. 24, 829–841. doi: 10.1177/2472555219857715,
Zaidman, C. M., Crockett, C. D., Wedge, E., Tabatabai, G., and Goedeker, N. (2024). Newborn screening for spinal muscular atrophy: variations in practice and early management of infants with spinal muscular atrophy in the United States. Int. J. Neonatal Screen 10:58. doi: 10.3390/ijns10030058,
Zhang, Z., Pinto, A. M., Wan, L., Wang, W., Berg, M. G., Oliva, I., et al. (2013). Dysregulation of synaptogenesis genes antecedes motor neuron pathology in spinal muscular atrophy. Proc. Natl. Acad. Sci. 110, 19348–19353. doi: 10.1073/pnas.1319280110,
Keywords: spinal muscular atrophy, connectivity map, L1000, iPSCs, chemical genetic screening
Citation: Giadone RM, Holton KM, Hu X, Natoli T, Ghosh S, Gill SP, Lyons N, Subramanian A and Rubin LL (2026) An induced pluripotent stem cell-based chemical genetic approach for studying spinal muscular atrophy. Front. Neurosci. 19:1695359. doi: 10.3389/fnins.2025.1695359
Edited by:
Katherine Roe, People for the Ethical Treatment of Animals, United StatesReviewed by:
Emily Welby, Medical College of Wisconsin, United StatesBarrington G. Burnett, Uniformed Services University of the Health Sciences, United States
Julieth A. Sierra-Delgado, The University of Missouri, United States
Copyright © 2026 Giadone, Holton, Hu, Natoli, Ghosh, Gill, Lyons, Subramanian and Rubin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lee L. Rubin, bGVlX3J1YmluQGhhcnZhcmQuZWR1; Kristina M. Holton, a21ob2x0b25AZy5oYXJ2YXJkLmVkdQ==
‡ORCID: Richard M. Giadone, https://orcid.org/0000-0003-4523-3062
Xiaoyu Hu, https://orcid.org/0009-0006-6045-1251
†These authors have contributed equally to this work
Kristina M. Holton1,2,3,4*†