The Single-Cell Transcriptomic Analysis of Prefrontal Pyramidal Cells and Interneurons Reveals the Neuronal Expression of Genes Encoding Antimicrobial Peptides and Immune Proteins

The investigation of the molecular background of direct communication of neurons and immune cells in the brain is an important issue for understanding physiological and pathological processes in the nervous system. Direct contacts between brain-infiltrating immune cells and neurons, and the neuromodulatory effect of immune cell-derived regulatory peptides are well established. Several aspects of the role of immune and glial cells in the direct neuro-immune communication are also well known; however, there remain many questions regarding the molecular details of signaling from neurons to immune cells. Thus, we report here on the neuronal expression of genes encoding antimicrobial and immunomodulatory peptides, as well as proteins of immune cell-specific activation and communication mechanisms. In the present study, we analyzed the single-cell sequencing data of our previous transcriptomic work, obtained from electrophysiologically identified pyramidal cells and interneurons of the murine prefrontal cortex. We filtered out the genes that may be associated with the direct communication between immune cells and neurons and examined their expression pattern in the neuronal transcriptome. The expression of some of these genes by cortical neurons has not yet been reported. The vast majority of antimicrobial (~53%) and immune cell protein (~94%) transcripts was identified in the transcriptome of the 84 cells, owing to the high sensitivity of ultra-deep sequencing. Several of the antimicrobial and immune process-related protein transcripts showed cell type-specific or enriched expression. Individual neurons transcribed only a fraction of the investigated genes with low copy numbers probably due to the bursting kinetics of gene expression; however, the comparison of our data with available transcriptomic datasets from immune cells and neurons suggests the functional relevance of the reported findings. Accordingly, we propose further experimental and in silico studies on the neuronal expression of immune system-related genes and the potential role of the encoded proteins in neuroimmunological processes.


INTRODUCTION
In the last decades, our knowledge about neuro-immune communication progressed and several complex molecular interactions between the immune system and the central nervous system (CNS) have been revealed. It became clear that the main processes of neuro-immune crosstalk in the brain are performed by microglial cells and astrocytes (1). Modulatory effects of immune system-derived cytokines were described in psychiatric and mental illnesses and also in the healthy brain (2,3). In brain injury or inflammation, the elevated level of cytokines induces behavioral effects as sleepiness, depression, etc. (4). In Alzheimer's disease, the inflammatory processes are enhanced in the brain in parallel with the upregulation of antiinflammatory b-defensins (5). However, the neuronal origin of defensins was not established. Defensins are antimicrobial peptides which are part of the innate immune system and have immunomodulatory effects; besides eliminating bacteria, fungi, and viruses, b-defensins were shown to be chemotactic for T cells through C-C chemokine receptor type 6 (CCR6 receptor) (6,7). Expression of b-defensins and some other antimicrobial peptides has been found in microglial cells and astrocytes (8), but not in neurons.
The action of pro-inflammatory cytokines (e.g., interleukin-1 family, tumor necrosis factor) on neurons is well-known based on the expression of their receptors on nerve cells (9)(10)(11). Indeed, neuromodulatory influence of immune cell-derived regulatory peptides is properly established and supports the existence of a communication channel from CNS infiltrating immune cells, microglial cells, and astrocytes to neurons. However, the communication from neurons to adaptive immune cells is less established. Immune cells express the receptors of neurotransmitters (12) and a direct regulatory action of neurotransmitters on T cell maturation is demonstrated (13). T and B cells can enter the CNS under both physiological and pathological conditions (14)(15)(16) and some of them can come into close contact with neurons (17,18). The number of activated T cells increases under brain injury and inflammation (19,20). The regulation and control of inflammatory processes in the brain may be influenced by the activity of antimicrobial and immunomodulatory peptides (AMPs), similarly to the control of inflammation in peripheral tissues (21). The modulation of T and B cell functions by AMPs is a rapidly growing field in immunology and it is well established that AMPs are expressed by microglial cells and astrocytes as well (22).
Increasing amount of evidence supports that neurons could directly communicate with the cellular components of the innate and/or adaptive immune system; although, it is poorly understood. Direct physical connections of T cells and neurons were uncovered by two-photon microscopy (23). However, the activation of T and B cells by neurons is not completely elucidated in spite of the fact that T and B cells are frequently in active state in the brain. Thus, the question may be raised whether it is a glial cell-mediated indirect action of neurons on adaptive immune cells or there is a direct way of interaction. Therefore, the neuronal expression of proteins -participating in the complex cooperative regulation of T and B cell activation mechanisms and antigen presentation in neurons -is an important issue in understanding the direct bidirectional character of neuro-immune communication, resulting in the complex integration of the immune system and nervous system.
In public databases, as the in situ hybridization (ISH) and single-cell sequencing datasets of Allen Institute for Brain Science (24, 25), the expression of many of the genes encoding proteins of the above listed mechanisms is not reported, which may have biological and/or technical reasons. The generally used methods of proteomics and transcriptomics are not able to reveal all low-abundance proteins and their mRNAs, since their sensitivity is not high enough. The protein composition of a cell canonically contains~20,000 different proteins; in turn, the number of mRNA species transcribed from DNA in a cell should also be~20,000 (26). In contrast, the average number of detected proteins in a proteomics sample rarely exceeds 3,000 (27) and the number of identified mRNAs in single-cell samples is rarely higher than 2,000 due to the applied sequencing method limited by the number of cells and the concomitant budget requirements. The cellular heterogeneity of the brain tissue evidently does not allow the search for low-abundance proteins. In the last decade a highly sensitive single-cell sequencing method was developed by combination of singlecell harvesting after patch clamping and ultra-deep mRNA sequencing (28). Ultra-deep sequencing allows the detection of extremely low mRNA copy numbers in biological samples, so the results are rich in low-abundance mRNAs (29). In the present work, we analyzed the data of our earlier single-cell sequencing study of prefrontal neurons (30) and, using bioinformatic tools, we identified transcripts encoding proteins of immune cellspecific processes and AMPs. The aim was to provide evidence on the possibility of neuron to adaptive immune cell communication in the brain. In silico analysis of the data revealed high transcriptomic coverage of T cell activation, B cell activation, and antigen presentation processes, as well as the expression of several AMP genes in pyramidal cells and fast spiking interneurons of the murine prefrontal cortex. Some of the uncovered mRNAs were unknown in cortical neurons before as established by the comparison of our data with available datasets; however, the transcriptomic similarities found between immune cells, the neurons we sequenced, and other CNS neurons suggest the functional relevance of the neuronal expression of immune system-related genes.

Cell Harvesting From Acute Brain Slices
The single-cell transcriptomic data analyzed in the present in silico study are from our previous work (30; GEO accession number: GSE135060), which includes the detailed methodological description of brain slice preparation, cell harvesting, single-cell sequencing, and data preprocessing.
Here, we refer only briefly to these previously performed experimental parts of our work and discuss the bioinformatics analyzes that form the basis of the present study. We used patch-seq to obtain complex electrophysiological and transcriptomic data from single prefrontal cortical pyramidal (Pyr) cells and fast-spiking (FS) interneurons. This technique is an improvement of the classical whole-cell patch clamp method that allows analyzing the mRNA content of the neuronal cytoplasm harvested by a patch pipette after electrophysiological recordings. Single cortical neurons were harvested from acute brain slices prepared from male C57BL/ 6N mice (n=73) of age between 27 and 40 days (Innovo Kft., Isaszeg, Hungary). All procedures of animal care and the minimization of the suffering and pain of the animals were done under the ethical rules of Eötvös Lorańd University in accordance with the EU Ethical Rules of Using Animals for Research Purposes (2010/63/EU revising Directive 86/609/EEC) and the Hungarian Act of Animal Care and Experimentation (1998, XXVIII).
We used standard brain slice preparation technology, paying special attention to sterile conditions to avoid bacterial contaminations and degradation of mRNAs by RNases. Mice were quickly anesthetized in isoflurane and decapitated. Brains were removed and 300 µm coronal slices were cut by a vibratome (Leica VT1000 S, Leica Biosystems, Wetzlar, Germany) in ice cold artificial cerebrospinal fluid (ACSF). Slices were then incubated in ACSF at room temperature and permanently supplied with carbogen for at least 1 hour before recording and cell harvesting. We pulled 4-10 MW resistance patch pipettes just before use by a vertical pipette puller (Model 720, David Kopf Instruments, Tujunga, CA, USA) and filled with sterile, RNase-free intracellular solution (see 30 for the exact composition of the solutions).
Whole-cell patch clamp recordings were performed using a standard ex vivo electrophysiological setup built in a Faraday cage and equipped with a DIC microscope (Leica DM6000 FS, Leica Microsystems, Wetzlar, Germany). A Sutter MP-285 micromanipulator (Sutter Instrument, Novato, CA, USA) was used for fine movement of patch pipettes. The whole equipment was placed on a Gibraltar platform and X-Y stage (Burleigh Instruments, New York, NY, USA). During the patch clamp recordings, we applied a step-gradient depolarization protocol in bridge mode. Electrophysiological signals were sampled at 10 kHz, amplified by an AxoClamp 2B amplifier (Axon Instruments, Foster City, CA, USA), and digitized by a CED 1401 MK II (Cambridge Electronic Design, Cambridge, UK) data capture device using CED Signal 4.11 software.
For cell type identification, we used morphological and electrophysiological features. Pyr cells were identified on the basis of their larger, pyramid-like cell body, visible apical dendrite, and regular firing pattern. FS interneurons were recognized based on their small, spherical cell body, characteristically high firing rate, and narrow action potentials. We harvested at most 2 cells from each type from one animal and sequenced altogether 59 Pyr and 25 FS cells.

Amplification and Sequencing
After whole-cell patch clamp recording, the cytoplasm of the recorded cell was harvested by aspiration into the patch pipette. The mRNA content of the harvested cytoplasm was reverse transcribed into cDNA, amplified through aRNA amplification (31,32), and libraries were constructed for deep sequencing. Neuronal mRNA content was amplified individually for each cell through three aRNA amplification rounds. ERCC RNA Spike-In controls (Life Technologies, Carlsbad, CA, USA) were added in 1:4,000,000 dilutions to each sample to control for technical variability between samples. During the first amplification round, a synthesized oligo(dT)-T7 primer was annealed to the poly-A tail of the mRNA. cDNA was synthesized using SuperScript III reverse Transcriptase (Invitrogen, Carlsbad, CA, USA), and DNA polymerase I (Invitrogen) following the instructions of the manufacturer. The cDNA served as template during aRNA synthesis using the MEGAscript T7 kit (Invitrogen). For the second and third RNA amplification rounds, aRNA was converted into cDNA using random hexanucleotide primers for the first strand and oligo(dT)-T7 primer for the second strand synthesis. After amplification the aRNA was purified with AGENCOURT RNACLEAN X beads (Beckman Coulter, Brea, CA, USA) and controlled for quality and quantity using Bioanalyzer RNA Picochip and Nanochip (Agilent, Santa Clara, CA, USA).
Libraries were constructed from the aRNA applying the TruSeq Stranded mRNA Library Prep Kit (Illumina, San Diego, CA, USA) without performing the initial fragmentation incubation step. The libraries were quantified using Bioanalyzer DNA1000 chip (Agilent) and sequenced either on HiSeq2500 (producing 100-base paired-end reads), or NextSeq500 (producing 75-base paired-end reads).

Normalization of Raw Sequencing Data
Because of the low sample volumes, the technical difficulties of cell harvesting, and the differences in the actual molecular state of individual cells, the raw single-cell sequencing data shows high variability in the number of expressed genes per cell and in their copy numbers. Therefore, the normalization of the raw sequencing data is a critical issue in transcriptomic analysis of a heterogeneous neuron population. We believe that after adequate normalization, the sum of normalized copy numbers should correlate positively with the number of recovered transcripts. The sequencing data was first normalized using the DESeq R program package (developed for bulk sequencing experiments), which resulted in a poor correlation. The raw copy number data was then normalized on housekeeping genes. However, this normalization method resulted also in a rather weak correlation. We observed that also these genes were expressed only in subsets of neurons and none of the cells expressed all of the genes encoding housekeeping proteins. This incomplete transcription pattern of housekeeping genes may ensue from technical uncertainties or, more likely, from the bursting kinetics of the mammalian gene transcription. As it is well established, gene expression is performed in bursts (33,34) and several genes could be in OFF state in the time point of cytoplasm harvesting. We found that several genes are transcribed in large copy numbers but only in a few cells, and probably gene OFF state is responsible for the zero transcription values. Thus, reference genes suitable for normalization should be transcribed in the majority of cells and their copy numbers should be high and stable. Based on these considerations, we searched for reference genes in an unbiased way, without knowing the function of the encoded proteins.
First, we sorted out the 1,000 most frequently expressed genes. To find the most constantly expressed genes, we carried out a "prenormalization" on the sequencing data using these 1,000 genes as reference (for further details please see 30). Then, from the "prenormalized" data, we sorted out the 500 most stable genes showing the lowest standard deviation between cells. In the next step we further searched for transcripts that show less than tenfold copy number differences between Pyr and FS cells. The selected 409 genes were used as reference genes for copy number normalization. These genes, showing frequent and stable expression in both types of neurons represent the "equi-phenotype" of the sequenced neuronal transcriptomes. It is important to note, that for the majority of genes, sequencing data showed zero copy numbers in several neurons. The high ratio of zero copy numbers considerably lowers the average transcription values, and the calculation of median for single-cell transcriptomic data could be misleading as well. Thus, the average copy numbers were calculated only on the basis of non-zero values (i.e., based on the data of cells in which the genes were in ON state). The rationale of this approach is that the average of non-zero copy numbers may reliably represent the average transcription level in the gene ON state. Together with the ON state frequency, this estimation can provide a more real picture of single-cell transcriptomics than the overall average or median including zero values (gene OFF states). The selected 409 genes were used to normalize the raw data of the individual cells for transcript copy number analyses. The normalization factor for a cell was the multiplication-factor that provided the lowest root-mean-square deviation on the 409 reference genes compared to the averages of the non-zero values. The remarkable positive correlation (Pearson correlation coefficient: r = 0.806) between the number of detected transcripts and the sum of copy numbers indicates that our normalization method can be reliably applied during the preprocessing of single-cell transcriptomic data. The raw sequencing data of Pyr and FS cells were compared on the 409 genes of "equi-phenotype" and on all the genes sequenced. As an average, we detected twice higher amount of mRNA molecules in Pyr cells which may be explained by their larger cell body and higher volume of cytoplasm.
We intended to compare the transcription levels of the investigated genes between FS and Pyr cells to identify cell typespecific or enriched transcripts. However, it was challenging to find an appropriate statistical method to verify the reliability of comparison and reduce false discovery rate, because relatively few cells were sequenced, and the majority of genes were transcribed in only a fraction of cells resulting in a lot of zero copy numbers. Thus, we performed a randomized group comparison analysis to demonstrate the reliability of identifying differentially expressed genes between Pyr and FS cells (for the exact details of the analysis please see 30).

Investigation of the Expression of Genes Encoding AMPs and Proteins of Immunological Processes
To better understand the possible molecular basics of direct neuro-immune interactions and the immunomodulatory potential of cortical neurons, we analyzed the expression pattern of genes encoding AMPs and proteins playing role in immunological processes (T cell dependent B cell activation, T cell independent B cell activation, CD8+ T cell activation, MHC class I antigen presentation, and MHC class II antigen presentation). The list of AMPs was taken from a recently published study (35), which presents a unified comprehensive human antimicrobial and immunomodulatory protein database constructed by collecting and integrating data from online databases and literature search. Based on this dataset, we searched for AMP transcripts in our normalized sequencing data. The reference list of proteins involved in the studied immunological processes was accessed in the Curated Pathways library of Elsevier Pathway Studio v11.0 (36), and based on this, corresponding mRNAs were searched in the sequencing data of Pyr and FS cells.
For each investigated immunological gene, ON and OFF state frequencies were calculated, i.e. the percentage of neurons expressing and not expressing a particular gene. To reveal transcriptomic differences between Pyr and FS cells, the copy numbers of AMP and immune process protein transcripts were compared and cell type-specific (expressed only in Pyr or FS cells) and enriched (having more than tenfold higher copy numbers in one of the cell types) transcripts were identified. We compared the average copy numbers per cell calculated on the basis of ON states (i.e., non-zero values), as described earlier.
The differential distribution of transcripts by cell type may suggest that the neuronal expression of immunological genes is a functionally relevant phenomenon. Before performing statistical comparison or correlation analysis, the normal distribution of data was checked with Shapiro-Wilk test using the Statistics toolbox of OriginPro 8.5 software (OriginLab Corporation, Northampton, MA, USA). Data are presented as average ± standard deviation (SD).
For visualization of transcriptomic data, we built interaction networks of the encoded AMPs and proteins of immune cell processes. Although it is not evident how protein abundances can be inferred from mRNA data, this method may be still suitable for demonstrating the coverage of the studied immunological processes and molecular networks at the transcriptomic level. Interactions of AMPs and proteins from each immunological process were searched with STRING database (37); interactions were excluded below the combined score of 0.7. Then, results were exported to Cytoscape v3.7.2 (38), and protein-protein interaction networks were supplemented with additional interactions from Pathway Studio v11.0. Nodes were colored based on the percentage of neurons in which the corresponding transcripts were detected (ON state frequency). Then, interactions between the AMPs and immunological processes were also searched in STRING; based on their interactions, edges were introduced between AMPs and proteins involved in the examined immune cell processes.
To compare our results to earlier transcriptomic data from the mouse brain, we reviewed the ISH data of Allen Mouse Brain Atlas (24) and the single-cell sequencing data (whole cortex and hippocampus, SMART-seq (2019), Transcriptomics Explorer) of Allen Cell Types Database (25) for the transcripts we have identified. In the ISH data, we searched for genes that showed nonzero raw expression value in any brain structure. Genes with non-zero trimmed mean expression values in at least one cell of the neuronal cell types were searched in the single-cell transcriptomic data.
To further elucidate the possible functional significance of the neuronal expression of immunological genes, we searched the identified AMP and immune process transcripts in online available single-cell sequencing data. The following datasets were reviewed, and the ON state frequencies of the investigated genes were calculated: GSE100337 (39), GSE108989 (40), GSE119373 (41), GSE120575 (42), GSE124675 (43), GSE126030 (44), GSE98638 (45), and GSE75386 (46). These datasets were obtained from human immune cells, with the exception of the latter, which was derived from mouse hippocampal neurons. No quantitative analysis or statistical comparison was performed due to the different experimental and data analysis methods.

RESULTS
The raw sequencing data of the 84 prefrontal cortical (PFC) neurons contained more than 19,000 transcripts, which is comparable to the estimated number of proteins in a mammalian cell (26). However, the number of different transcripts found in single neurons was between 498 and 9,680 (on average 3,420). Thus, each neuron contained a fraction of all identified transcripts at the time of cell harvesting, as due to the bursting kinetics of transcription, genes may often be in OFF state even for longer periods (47). Using published databases of AMPs and immune cell protein networks as reference lists, we were able to identify transcripts encoding AMPs with antimicrobial and immunomodulatory functions, as well as proteins of antigen presentation and T and B cell activation pathways.

Examination of the Neuronal Transcriptomic Pattern of Genes Encoding AMPs
The list of human AMPs published by Kumar et al. contains 186 peptides and proteins with known antimicrobial and/or immunomodulatory effect (35). One hundred of them (53.8%) were found in the transcriptomes of the sequenced PFC neurons (Supplementary Table 1). The protein network of AMPs encoded by the identified transcripts could be composed from proteinprotein interaction databases using STRING and Cytoscape platforms. Figure 1A shows the functional network of AMPs and indicates the ON state frequency of their genes in our sequencing data. Based on network analysis, the AMPs encoded by the identified transcripts are strongly linked to the investigated immunological processes ( Figure 1B) demonstrating their role in the modulation of immune cell functions. The individual cells expressed averagely 10.8 ± 6.4 different transcripts encoding AMPs; in this regard we found no significant differences between Pyr (11.2 ± 6.5) and FS (9.9 ± 6.1) cells (P = 0.39, two-sample ttest) (Figure 2A).
The ON state frequency of the identified AMP transcripts was rather low (Figures 1, 2B). Out of the 100 genes encoding AMPs only 5 (App, Atp5f1, Ccl27a, Fau, and Gapdh) were expressed in at least 50% of the neurons, 19 (e.g., Cxcl12, Furin, Ifih1, Pigr, and Romo1) were expressed in 10-49% of the cells, and 76 (e.g., several b-defensins, CC and CXC motif chemokines) were transcribed in less than 10% of the cells (Supplementary Table 1 and Figure 2B). The average ON state frequency of AMP genes is similar in Pyr (11.2±17.6%) and FS (9.9±15.7%) cells. For the majority of transcripts, it can be observed that higher ON state frequencies are significantly (P = 2.6*10 -13 ) associated with higher average copy numbers (Spearman correlation coefficient: r S = 0.65, Supplementary Figure 1), suggesting that the detection of AMP transcripts does not result from methodological bias or random gene expression noise, but possibly from regulated biological processes. The average normalized copy numbers of AMP transcripts ranged from 0.1 to 1063.9 (on average 75.3, see Supplementary Table 1 for each gene). For comparison, we present here the expression data of some genes encoding proteins with well-known neuronal functions: Syt4 (synaptotagmin-4) was expressed by 66.7% of the neurons with 519.3 average copy number, Scn3a (encoding a sodium channel subunit) was expressed by 51.2% of the neurons with 335.6 average copy number, Gria1 (encoding a glutamate receptor subunit) was expressed by 67.9% of the cells with 864.5 average copy number, and Dlg4 (encoding postsynaptic density protein 95 (PSD-95), a postsynaptic scaffolding protein) was expressed by 44.1% of the cells with 149.9 average copy number. Table 1 shows the distribution by cell type of the 100 identified AMP transcripts. Thirty-four of them were evenly expressed in both cell types (i.e., showing less than tenfold copy number differences). Among these there are some that, in addition to its presumable antimicrobial and/or immunomodulatory role, also have more general cellular functions (e.g., Atp5f1, Gapdh, H2aj, and Hist1h2bc). Fifteen of the AMP transcripts showed cell typeenriched expression (i.e., having more than tenfold higher copy numbers in one of the cell types). These encode mainly proteins with well-documented antimicrobial or other immunological functions (e.g., Lyz2 and Cxcl12 enriched in Pyr cells or Mx1 and Ccl4 enriched in FS cells). Interestingly, 51 AMP genes were expressed in a cell type-specific manner. Forty of them were identified only in Pyr cells (e.g., 8 of the 11 b-defensin genes found in the neurons, Lyz1, Prf1, Cxcl14, and Rnase6), and the remaining 11 were transcribed only in FS cells (e.g., Ccl22, Galp, and Ifnl3). Cxcl12, Mx1, and Cxcl14 also showed high difference in their ON state frequencies between cell types ( Figure 2C). The ON state frequency of Cxcl12 was 8.0% and 30.5% in FS and Pyr cells, respectively; Mx1 was detected in 16.0% of FS and 3.4% of Pyr cells. Finally, Cxcl14 was expressed in 13.6% of Pyr cells and was not detected in FS cells.
Thus, based on the sequencing data, it can be established that the Pyr and FS cells of the murine PFC are able to express several AMP genes. The majority of transcripts could be identified in few cells (low ON state frequency) with low average copy numbers; however, several of them show cell type-specific or enriched expression, suggesting functional relevance. Genes that are expressed in high copy numbers by many cells encode proteins with a more general cell biological role in addition to antimicrobial and/or immunomodulatory functions.

Investigation of the Expression of Genes Encoding Proteins of Immune Cell Processes
Based on the reference list of immunological processes (accessible in the Curated Pathways of Pathway Studio), we identified several mRNAs in the transcriptome of PFC neurons that encode proteins playing role in immune cell communication and activation mechanisms (Supplementary Table 2). On Figures 3, 4 we show the functional protein networks and label the ON state frequency of the corresponding genes in our dataset by color scale.
In the case of T cell dependent B cell activation, we found the transcripts of 96.8% of the proteins in the sequencing data. For the protein networks of T cell independent B cell activation and CD8+ T cell activation, this ratio was 98.2% and 91.9%, respectively. In the case of antigen presentation pathways, similarly high coverage (100% for MHC class I and 90.0% for MHC class II antigen presentation) was found in the neuronal transcriptomes (Supplementary Table 2). The individual neurons expressed averagely 21.1±11.5 different genes encoding proteins of immune cell processes, showing significant differences between Pyr (23.0±11.3) and FS (16.8±10.9) cells (P = 0.022, two-sample t-test) ( Figure 5A).
The ON state frequency of the identified 122 (calculated without overlaps between networks) immune process genes was somewhat higher than in the case of AMP genes ( Figure 5B). We identified 9.8% of the transcripts in at least 50% of the neurons. These transcripts encode proteins mostly with basic cell biological functions (e.g., Jun, Fos, Calr, Canx, Mapk1, and Creb1); however, some of the corresponding proteins also play a significant immunological role (e.g., Cd81 and Ticam1). Nearly one-third (33.6%) of the immune process genes were transcribed in 10-49% of the neurons encoding proteins with miscellaneous functions. Some of them (e.g., Plcg2, Akt1, Pdpk1, Raf1, Mapk3, Map3k7, and Grb2) can be associated with general cellular processes, while others

Mittli et al. Immune Transcripts in Cortical Neurons
Frontiers in Immunology | www.frontiersin.org October 2021 | Volume 12 | Article 749433 (e.g., B2m, Nfatc2, Ccr7, Cd79a, Blnk, and Traf3) have welldocumented immunological functions as well. The majority of immune process genes (56.6% of them) were expressed by less than 10% of the 84 neurons. Several of them (e.g., H2-Ob, H2-Oa, H2-Dma, Cd4, Cd8a, and Ccl4) encode proteins with specific immunological functions (for further details see Supplementary Table 2). The average ON state frequency of genes encoding proteins of immune cell processes is somewhat higher in Pyr (17.8±20.8%) than in FS (13.0±16.4%) cells. Significant (P = 1.2*10 -17 ) positive correlation was found also in the case of these transcripts between the ON state frequencies and average normalized copy numbers (Spearman correlation coefficient: r S = 0.68, Supplementary Figure 2). The average normalized copy numbers of immune process transcripts were found to range from 0.1 to 1375.2 (averagely 118.2, Supplementary Table 2 contains the data of each gene). Table 2 shows the cell type distribution of transcripts encoding proteins of immune cell processes. Several (48.4%) of them were   evenly expressed in both types of neurons. These genes mainly encode proteins with general signaling roles in mammalian cells (e.g., components of the MAP kinase pathway, Jak2, Ptk2b, Plcg1, and Itpr1), but there are also a number of immunologically important genes among them (e.g., Cd19, Cd247, Cd28, Il7, Ticam1, and Tnfrsf17). We found that 21.3% of the immune process genes showed cell type-enriched expression. These genes also encode proteins with miscellaneous (general and/or immunological) functions (e.g., Rac1 and Blnk enriched in Pyr cells or Akt1, Cr2, and Cd3d enriched in FS cells). The remaining nearly 30% of the investigated genes were transcribed in a cell typespecific manner. These include many genes encoding proteins with well-known immunological functions (e.g., H2-Dma, H2-Oa, Il12rb2, Cd14, and Cd86 expressed only in some Pyr cells and Cd3e, Il2rb, and Il5 expressed only in some FS cells). Also, the ON state frequencies of Blnk (4.0% of FS and 18.6% of Pyr cells) and Cr2 (20.0% of FS and 8.5% of Pyr cells) differed between cell types. In addition, Il12rb2, and Cd14 were not detected in FS cells, but were expressed in 10.2% of Pyr cells ( Figure 5C). We found three times more immune process genes expressed only in Pyr cells, than genes expressed only in FS cells; however, it should be noted that the size of a Pyr cell is about twice as large as an FS cell. Thus, the network analysis of transcripts encoding proteins of immune cell communication and activation pathways supported the high coverage of these processes in neurons at mRNA level. The average coverage of the five pathways was about 95% in the entire database of the 84 neurons and 18% in single cells, i.e., the majority of genes were transcribed only in fractions of cells. We also revealed that genes encoding hub proteins, which are the keys for functioning of the protein networks, were expressed in several neurons; however, these proteins also have more general functions, not merely immunological relevance.

Comparison of Our Results to Previously Published Transcriptomic Data
We reviewed how many of the AMP and immune protein transcripts identified in prefrontal neurons are included in the ISH and single-cell sequencing datasets of Allen Brain Map (see Supplementary Tables 1, 2 for each gene). Thirty-eight percent of AMP transcripts were found in the ISH data from different brain regions. These transcripts have higher than average ON state frequencies (22.1% compared to 10.8%) and copy numbers (171.0 compared to 75.3) in our sequencing data. Of the transcripts encoding proteins of immune cell processes, 45.1% were found in the ISH data, also with ON state frequencies and copy numbers higher than the overall average (25.7% compared to 16.4% and 185.1 compared to 118.2, respectively). The singlecell sequencing database of Allen Brain Map contains the transcripts we identified with greater overlap: 46.0% of the AMP and 72.1% of the immune protein encoding genes were expressed in at least one of the neurons included in the database. We compared the AMP and immune process gene expression levels in immune cells and hippocampal neurons on the basis of available single-cell databases to our single-neuron sequencing data and found that the ON state frequencies of the investigated genes do not differ extraordinarily (Supplementary Table 3, 4). The average ON state frequency of AMP genes ranged from 3.8 ± 12.5% to 16.9 ± 27.8% in different immune cell datasets, compared to 11.1 ± 16.9% in our data. In the case of immune process genes, the average ON state frequencies ranged from 6.6 ± 13.3% to 38.7 ± 32.6% in the reviewed immune cell datasets and we found 16.4 ± 19.0% in our data. Interestingly, the mouse hippocampal neurons expressed the investigated genes to a noticeable greater extent than the prefrontal neurons we examined (17.2 ± 26.5% average ON state frequency for AMP genes and 26.1 ± 28.5% for immune process genes). This observation draws attention to the brain region-specific character of neuronal gene expression and further suggests the functional significance of the expression of immunological genes by CNS neurons. Thus, based on these comparative data, transcriptomic similarities can be found between neurons and immune cells regarding the expression of AMP and immune process genes; however, functional conclusions can only be drawn to a limited extent due to the uncertainties in the relationship between cellular mRNA and protein levels (48).

The Use of Patch-Seq in the Study of Molecular Processes of CNS Neurons
We report here on mRNAs of AMPs with immunomodulatory function and proteins involved in lymphocyte activation and antigen presentation mechanisms expressed by prefrontal Pyr and FS cells harvested after electrophysiological identification. Some of the described mRNAs has not been detected in cortical neurons before, probably due to the lower sensitivity of immunostaining and ISH, as well as low-depth sequencing of large number of cells compared to ultra-deep sequencing. We emphasize that the linear amplification of cDNA combined with ultra-deep sequencing up to more than 20 million reads is an extremely sensitive method to detect transcripts with low copy numbers in single-cell samples. It is in correspondence with the fact that we were able to recover more than 19,000 transcripts from less than 90 neurons.
Because of the high labor demand and costs of the method, the number of cells sequenced was low but not remarkably lower than in other similar patch-seq studies (e.g., 46,49), allowing to apply the highest quality sequencing technology. We performed a normalization procedure described earlier (30) to make singlecell sequencing data more reliable and ensure the significant positive correlation between total normalized read number and reconstructed transcripts. We also reviewed available datasets, as the ISH and single-cell databases of Allen Brain Map (24, 25) and single-cell data of immune cells and hippocampal neurons (for GEO accession numbers and references see Materials and Methods) to compare the expression pattern of AMP and immune process genes in neurons and immune cells obtained by different transcriptomic technologies. The genes included in

Genes expressed in Pyr and FS cells Genes expressed only in Pyr or FS cells
Less than tenfold differences in copy numbers More than tenfold higher copy numbers in Pyr cells

Expressed only in Pyr cells Expressed only in FS cells
the ISH data of Allen Brain Map showed remarkably higher ON state frequencies and normalized copy numbers than the overall average, suggesting the accuracy of our sequencing and data analysis protocols. Due to the high sensitivity of the applied mRNA amplification and sequencing technology, we uncovered many AMP and immune process transcripts, which were not detected in the single-cell transcriptomics data of Allen Brain Map. On the other hand (probably as a consequence of the methodological approach somewhat similar to ours), several of the immunological transcripts we identified in PFC neurons, could also be detected in the murine hippocampus. Based on these results, it can be hypothesized, that the neuronal expression of AMP and immune cell process genes may indeed play an important role in nervous system processes (e.g., in direct neuroimmune interactions). The revealed transcriptomic similarities (see Supplementary Tables 3, 4) of the selected genes in CNS neurons and immune cells also support this idea. However, considering the notable differences in experimental (e.g., different species, cell numbers and sequencing techniques) and data analysis methods, the results of qualitative and/or quantitative comparison of different transcriptomic datasets need to be interpreted critically.

Functional Interpretation of Transcriptomic Data
As it can be seen in Figures 2B, 5B, as well as in Supplementary Tables 1, 2, many transcripts could be identified only in small fractions of cells with low average copy numbers. However, comparing the expression data of genes encoding neuronal proteins mentioned as examples in the Results section to the data of AMP and immune process genes, it can be seen that there are no substantial differences that definitely rule out the functional relevance of the neuronal expression of immune system genes. Moreover, it is also known that in biological samples the majority of genes are weakly expressed, and only a fraction of them are expressed in high copy numbers (50). Owing to the bursting kinetics of gene expression (51), at a certain time point several genes can be in OFF state and some others in ON state (52). The transcriptional regulation (e.g., the frequency and the duration of ON and OFF states) shows gene-specific features (53,54) and there is not yet enough knowledge about general regulatory mechanisms to reliably estimate the cellular protein levels from mRNA copy numbers. Ultimately, protein abundances correlate positively with the mRNA copy numbers to some extent (55,56) in a gene and cell type-specific manner (57) and the degree of correlation may depend also on the actual condition of the cells (58). Thus, in general it can be established that increase in the copy number of a particular mRNA initiates an increase in the level of the encoded protein (54). Nonetheless, it should also be emphasized, that the presence of a transcript in a cell does not automatically imply that the corresponding protein can be found in a functional form as well (e.g., due to translational and posttranslational regulatory mechanisms). Detection of the encoded proteins would be relevant for functional analysis of immune system-related transcripts; however, single-cell proteomics can detect only proteins of highest abundances, which is usually not more than 1,000-1,200 different proteins in a cell (27).
Accordingly, transcript copy number differences between Pyr and FS cells can be interpreted so that potentially the protein levels also differ to a certain degree; moreover, the cell typespecific or enriched expression of many of the investigated genes may suggests that the detection of immune transcripts in neurons could not derive from sequencing artifacts or negligible stochastic gene expression noise. The functional analysis of differential expression of immune genes in principal cells and interneurons of the mammalian neocortex is a promising field of future research.
Interestingly, the neuronal ON state frequencies of the investigated genes are comparable to the patterns observed in immune cells (Supplementary Tables 3, 4), proposing that the transcriptional regulation of these genes may be based on similar mechanisms in the two cell types. Since the reviewed databases contain heterogeneous data sets, with different experimental and data processing approaches, the assumption about similarities between neurons and immune cells needs verification directly at the protein level.  (59,60) were expressed in a cell type-specific or enriched manner, especially in Pyr cells ( Table 1). It suggests an important role for Pyr cells in the control and modulation of immune processes in the CNS. The expression of some AMP genes (e.g., Camp, Defb9, Defb11, and Defb35) in the brain was documented in earlier studies at the mRNA (61) and even at the protein level (62); however, these results do not include information on the exact cellular sources of AMPs. Interestingly, it was reported in Drosophila melanogaster that the neuron-specific inhibition of the expression of GNBP-like3 (a peptide with antibacterial activity) resulted in the impairment of long-term memory (63), suggesting that AMPs expressed by neurons may participate in nervous systemspecific processes. On the other hand, it is well established, that several neuropeptides (e.g., proenkephalin A, neurokinin-1, neuropeptide Y, and even amyloid-b) having primarily neurobiological functions, also possess antimicrobial activity and are often termed as neuro-antimicrobial peptides (NAMPs) (64,65). The antimicrobial activity of amyloid-b has supported the hypothesis that chronic CNS infections may contribute to the early onset of Alzheimer's disease via the overproduction of AMPs such as amyloid-b itself (66). In summary, the neuronal expression of AMP genes presented here draws attention to the possibility that, in addition to AMPs produced mainly by endothelial and glial cells, the same peptides expressed by cortical neurons may also contribute to the diverse CNS functions of these host defense molecules (antimicrobial, immunomodulatory and signaling activities, as reviewed by 65).

The Potential Roles of Immune Gene Expression in CNS Neurons
The network analysis of immune system-related proteins encoded by their identified transcripts in prefrontal neurons showed that components of T and B cell activation and antigen presentation processes can be recovered in the single-neuron sequencing data to a notable extent. The observed high coverage of immune cell pathways in the neuronal transcriptomes raises the possibility of functional crosstalk between neurons and CNS infiltrating T and B cells (67,68). Thus, our data reveal the putative basics of molecular neuro-immune interactions and suggest initiating lines of investigations to uncover the details of direct interplay between neurons and immune cells. The proteins encoded by the identified transcripts could have multiple functions as well, so it cannot be excluded that the immune cell-related molecules and processes play different roles in neurons than in lymphocytes or glial cells. The neuronal expression of several proteins of primary immunological relevance (e.g., MHC class I molecules, complement components) and their roles in physiological and pathological CNS processes have been described (69)(70)(71). There are also data suggesting that glial cells and neurons can present antigens to T cells entering the brain parenchyma under certain conditions (72,73). The presence of components of antigen presentation processes in the transcriptome of PFC neurons also supports this idea.
In conclusion, we were able to detect the mRNAs of AMPs, as well as of proteins of T and B cell activation and antigen presentation processes in prefrontal Pyr and FS cells by ultradeep sequencing of single-cell samples. The similarities in gene expression pattern between neurons and immune cells along with the remarkable coverage of immune cell-specific molecular pathways in the neuronal transcriptome suggest putting more effort on the investigation of direct interactions between neurons and lymphocytes and contribution of neurons to immunomodulation in the brain by AMPs. Furthermore, we recommend the reanalysis of CNS omics data focusing on immune system-related mRNAs and proteins. Our results presented here provide a potential molecular substrate for the direct contacts between neurons and T cells observed earlier (18,23). We emphasize that our study is only a first step toward understanding the functional importance of direct neuroimmune interactions in the brain and we propose initiating further research on that field.

DATA AVAILABILITY STATEMENT
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.