Technologies and Approaches to Elucidate and Model the Virulence Program of Salmonella

Salmonella is a primary cause of enteric diseases in a variety of animals. During its evolution into a pathogenic bacterium, Salmonella acquired an elaborate regulatory network that responds to multiple environmental stimuli within host animals and integrates them resulting in fine regulation of the virulence program. The coordinated action by this regulatory network involves numerous virulence regulators, necessitating genome-wide profiling analysis to assess and combine efforts from multiple regulons. In this review we discuss recent high-throughput analytic approaches used to understand the regulatory network of Salmonella that controls virulence processes. Application of high-throughput analyses have generated large amounts of data and necessitated the development of computational approaches for data integration. Therefore, we also cover computer-aided network analyses to infer regulatory networks, and demonstrate how genome-scale data can be used to construct regulatory and metabolic systems models of Salmonella pathogenesis. Genes that are coordinately controlled by multiple virulence regulators under infectious conditions are more likely to be important for pathogenesis. Thus, reconstructing the global regulatory network during infection or, at the very least, under conditions that mimic the host cellular environment not only provides a bird's eye view of Salmonella survival strategy in response to hostile host environments but also serves as an efficient means to identify novel virulence factors that are essential for Salmonella to accomplish systemic infection in the host.


Background
Salmonellae are Gram-negative facultative pathogens that live in diverse environments and infect a wide variety of animals. In addition they can use a great variety of compounds as carbon and energy sources. Gastrointestinal infections are the second most common cause of childhood mortality in the developing world and systemic disease typhoid alone (caused by serovar Typhi) is estimated to result in 500,000 deaths per year (Graham, 2002). In addition to fluid and electrolyte loss, non-typhoidal Salmonella often results in septicemia in children and in HIV infected adults in developing countries with a fatality rate of 25% or greater (Graham, 2010). Salmonella diverged from E. coli more than 100 million years ago (Vernikos et al., 2007), but the evolution within the Salmonella genus is more recent. Although there are now only two species of Salmonella; S. enterica and S. bongori, S. enterica has diverged into numerous subspecies and serovars that show a great variation in their infectivity for different cold blooded and warm blooded hosts but share greater than 95% DNA sequence homology. In fact, there are over 2500 serovars for S. enterica with close genetic homology and yet highly diverse regulatory and virulence expression patterns one can assume based on host preference.
Most of the human pathogenic Salmonella serovars, the general subject of virulence regulation in this article, belong to the S. enterica enterica subspecies. Often serovars are written with a capital sensitivity to oxidizing agents that are found within macrophages. When virulence genes are often present within a contiguous stretch of DNA and are missing in closely related and often non-pathogenic bacteria, it suggests that they were acquired horizontally. These so called pathogenicity islands are about the size that would be carried by a transducing bacteriophage and are often flanked by sequences such as transposons or t-RNA that could permit homologous recombination events. In Salmonella two of these regions important in virulence are called Salmonella pathogenicity island (SPI)-1 and 2.
Both SPI-1 and SPI-2 encode the structural components of their respective type III secretion system (T3SS) apparatus and translocate effectors for distinct purposes. SPI-1-secreted effectors play essential roles for invasion into epithelial cells and promote intestinal inflammation and gastroenteritis. SPI-1, is required for persistent infections in mice (Monack et al., 2004) and for cell invasion (Steele-Mortimer, 2008;McGhie et al., 2009), but is not required following intraperitoneal infection (i.e., systemic infection models) in BALB/c or C57/BL6 mice (rev. in Zhou and Galan, 2001). SPI-2 is induced during intracellular Salmonella infection of a variety of cell types (Geddes et al., 2007) and secretes dozens of distinct effector proteins . Mutation of this T3SS does not have a huge effect on intramacrophage survival, however it does result in completely abrogating infection in mice (Buchmeier and Heffron, 1991;Poh et al., 2008). Following internalization into host cells, the SPI-2 T3SS secretes effectors into the cytoplasmic space of epithelial cells or professional phagocytes such as macrophages. SPI-2-secreted effectors prevent maturation of the phagocytic vesicle thereby blocking phago-lysosome fusions and exposure of Salmonella to oxidative and non-oxidative microbicidal mechanisms (Buchmeier and Heffron, 1991;Ramsden et al., 2007;Poh et al., 2008). While most known Salmonella effectors have been found to be secreted specifically by the SPI-2 or SPI-1 T3SS (Steele-Mortimer, 2008;McGhie et al., 2009;Niemann et al., 2011), some have been shown to utilize both the SPI-1 and SPI-2 T3SSs for efficient secretion (Haraga et al., 2008). This complex mixture of secretion processes for Salmonella effectors suggests that there are multiple levels of regulation from transcription to translation to the secretion apparatus each level is critical for the virulence program of these pathogens.
Besides the complex T3SS, Salmonella possesses alternative mechanisms to translocate virulence factors into the extracellular milieu. These include outer membrane vesicles (OMV), the twopartner secretion system ZirT/ZirS, and a type VI secretion system. OMV are observed in a variety of Gram-negative bacteria including pathogenic bacteria and deliver specific periplasmic proteins as well as some outer membrane components to the host cells, promoting proinflammatory responses (Ellis et al., 2010). The ZirT/ZirS system, conserved throughout the Salmonella genus, secretes ZirS via a hydrophilic β-barrel pore formed by ZirT (Gal-Mor et al., 2008). Salmonella lacking this system was hypervirulent, implying a unique function of ZirT/ZirS as an antivirulence modulator during systemic infection. Salmonella also horizontally acquired a type VI secretion system (Bingle et al., 2008). SciS, a component homologous to IcmF in Legionella, appeared to attenuate virulence to prevent acute infections for a long-term dissemination (Parsons and Heffron, 2005). Due to the variety of secretion mechanisms in Salmonella, the virulence program is in many ways surprisingly robust and often challenging to dissect.
Salmonella dwells a variety of environments, thus being able to quickly recognize and respond to the environmental stimuli is essential for its survival. During the course of systemic infection in mice, bacteria are found within neutrophils, monocytes, dendritic cells, and B and T cells but are not found extracellularly until the last 1 or 2 days immediately before death of the host (Dunlap et al., 1994;Yrlid and Wick, 2002;Geddes et al., 2007). The mechanisms of how Salmonella survives and replicates within the host and how it regulates virulence genes at the appropriate time during systemic infection is imperfectly understood and the subject of this work. In fact, Salmonella encodes more than 300 annotated transcriptional and translational regulators that govern expression of Salmonella genes under a myriad of parasitic and free-living conditions (McClelland et al., 2001). By comparison the eukaryote Saccharomyces cerevisiae has nearer 100 (Goffeau et al., 1996).
The fact that Salmonella can successfully survive in these disparate environments even within a single host is likely one reason for the large number of regulators it encodes. Recently, we have deleted all genes annotated as transcriptional or translational regulators in Salmonella and have tested all of them in an acute mouse infection model. There are many regulators that are involved in chemotaxis and cell motility. These have not been examined in detail except for FliA, the flagellar sigma factor (Lockman and Curtiss, 1990). Previous work suggests that with the exception of one or two genes none of the genes required for chemotaxis and motility are necessary for virulence in a BALB/c mouse. In all we have identified 26 regulators that are required for potent virulence in our mouse infection model. This is a surprisingly small number given that 330 were tested and many might have been assumed to result in less fit Salmonella generally. In fact, only in-frame deletions in 20 regulator genes conferred a strong virulence deficient phenotype following intraperitoneal (i.p.) infection (Table 1), while the remaining deletions affected virulence only in intragastric (i.g.) infection or long-term persistence (l.p.) test, likely suggesting the requirement of those regulators for adaptation to the intestinal environment (i.g. attenuated strains) or for evading the adaptive immune system (l.p. attenuated strains). About 90 of the 330 regulators have been tested by i.g. infection and thus there are likely to be several more than the three listed that are unable to effect by this route.
In this review we discuss recent efforts to understand the regulatory network of Salmonella that controls a robust set of virulence processes and mechanisms. We focus here on the application of high-throughput and/or global methodologies to measure virulence and regulatory aspects of Salmonella, the computational approaches used to determine regulatory networks, and how this information can be used to construct systems models/simulations of Salmonella pathogenesis (Figure 1).

ExpErimEntal mEthods to dEtErminE rEgulatory nEtworks ovErviEw
In the first part of this review we cover some of the experimental methods that have been employed to determine virulence patterns and regulatory networks in Salmonella, or are currently being used  . Changes in gene expression are attributed to the DNA-binding activity of regulators responding to environmental stimuli. Transcriptional regulation is often a complex process composed of multiple regulatory factors and thus systems biology approaches are necessary to integrate the activities of multiple regulators. With the development of the microarray where thousands of sequences are spotted on a chip and the expression of numerous genes is simultaneously compared, the inference of regulatory networks could be accomplished in a high-throughput manner (Faith et al., 2007;Bonneau, 2008). Furthermore, related approaches including ChIP-microarray (ChIP-chip) and ChIP-sequencing (ChIP-seq) methods, discussed below, have accelerated defining the transcriptional regulatory network (Macquarrie et al., 2011). We have used a transcriptomic approach to decipher the regulatory network governed by virulence regulators during Salmonella systemic infection. Regulators sensing the multiple environmental cues execute defense programs and coordinately tune the expression of genes involved in virulence. By deleting Salmonella regulators across the chromosome and its virulence-related plasmid, we defined 20 regulators that were required for Salmonella systemic infection in mice (avirulent strains in i.p. infection in Table 1). These virulence regulators were varied, including two-component regulators (PhoP/ PhoQ, SsrA/SsrB, and OmpR/EnvZ), alternative sigma factors (RpoE, RpoS, and FliA), post-transcriptional/post-translational regulators (SmpB, CsrA, Hfq, and RseA), a response regulator for which the signal sensor is unknown (Hnr), a bending protein essential for some types of recombination (IHF), and an assortment of other transcriptional/putative transcriptional regulators (SlyA, Crp, FruR, RelA/SpoT, STM1547, STM3121, SpvR, and RcsA). We chose 14 Salmonella regulators whose absence caused severe survival defect in mice (see strains in bold in Table 1) and determined the global transcriptional changes by each virulence to develop these networks, focusing on high-throughput techniques. We begin with a short description of the types of approaches used to characterize virulence in Salmonella. We then discuss transcriptomic and chromatin immunoprecipitation (ChIP) approaches, proteomic, and protein-protein interactions (PPIs).

considErations of ExpErimEntal approachEs to studying virulEncE
Regulatory interactions in Salmonella are studied in three principal ways. The first is based on using in vitro media conditions that simulate infectious and non-infectious conditions to elicit changes that can be studied and perturbed (Beuzon et al., 1999;Coombes et al., 2005;Adkins et al., 2006;Shi et al., 2009a). The second is in infected cultured macrophages (depending on perspective this is referred to as in vivo or ex vivo; Eriksson et al., 2003;Shi et al., 2009b). And the third is using model organisms such as mice or cattle, though this is generally aimed at answering relatively narrow questions of virulence (Chan et al., 2005;Lawley et al., 2006). The choice of experimental system used to investigate regulatory networks is driven by the need for coverage and depth of measurements versus the correctness of the biological environment. That is, in vitro culture conditions, while biologically imperfect, are capable of generating large quantities of samples required for many of the assays described in this section. Advances in methodology are allowing more work to be done ex vivo and in vivo, such as in infected macrophages or by direct measurement within the mouse. These advances include those related to next-generation sequencing for expression profiling, improvements in the sensitivity of proteomics, and ChIP-based methods. As more ex vivo data becomes available it will be possible to compare responses in in vitro conditions with those in macrophages to refine regulatory networks and to refine the culture conditions to better reflect the intracellular milieu.

FiGuRe 1 | overview of high-throughput and computational methods to elucidation of the regulatory networks governing Salmonella virulence.
There have been many studies that characterize the global expression of Salmonella under different conditions that are relevant to virulence (Clements et al., 2002;Eriksson et al., 2003;Lawhon et al., 2003;Kelly et al., 2004;Mangan et al., 2006;Sheng et al., 2009). While these studies do not directly address the global regulatory network driving virulence, they have been used to elucidate novel virulence factors, functional pathways involved in pathogenesis and regulons activated by virulence conditions.
Recent advances in high-throughput DNA sequencing technology have also been applied to the studies on bacterial transcriptome (Passalacqua et al., 2009;Perkins et al., 2009) as well as the eukaryotic transcriptome (Nagalakshmi et al., 2008;Wilhelm et al., 2008). Using an RNA-sequencing (RNA-seq) method, it became possible to define the composition of a whole mRNA population and evaluate gene regulation in detail at the transcriptional level. By converting RNA to DNA, Perkins et al. (2009) analyzed the transcription profile of S. Typhi in a strand-specific manner and identified small RNA and RNA transcripts originating from overlapping DNA sequences as well, which is not possible using low-density microarrays. This analysis revealed a number of novel non-coding RNAs and was used to refine the regulon of the OmpR regulator, which plays important roles in virulence.

protEomE profiling
Although transcriptomics is more commonly applied to determine gene regulatory networks, global profiling of protein abundance by proteomic analysis can also be very useful in this regard. A largescale view of protein abundances from regulatory mutants combined with different stimuli can help identify co-regulated groups, and be used to link these groups to specific regulators. The reasons that transcriptomics are more often used to study gene regulatory networks are (i) microarray analysis has more complete coverage; and (ii) protein abundances are complex being regulated by gene expression, post-transcriptional modifications, and degradation. Recent advances in nanoflow liquid chromatography and mass spectrometry (LC-MS) have tremendously improved proteome coverage (Nilsson et al., 2010). For instance, with current technology and applying extensive protein or peptide fraction previous to LC-MS analysis (two-dimensional LC-MS or 2D-LC-MS) it is possible to routinely obtain proteome coverage of better than ∼50% of the predicted ORFs for prokaryotic organisms Malmstrom et al., 2009). Recent analysis of the S. Typhi and S. Typhimurium proteomes led to the identification of 47 and 51% of all predicted ORFs, respectively Ansong et al., 2008). It is also worth noting that a number of genes are expressed only under specific conditions and some of the predicted ORFs may not even be functional, thus the coverage of the actual expressible proteome is likely higher.
Relative to traditional transcriptomic studies, increasing depth of proteome coverage is frequently associated with laborintensive, low-throughput analysis, thus limiting the throughput and number of samples to be studied. More commonly applied multi-dimensional LC-MS/MS experiments can detect a great number of peptides; however, the approach relies on decisions made automatically by the MS instrument on which peptides will be selected and fragmented for identification. To circumvent this issue, our team utilizes a methodology that separates the peptide regulator in intracellular-mimicking conditions. Collective transcriptomic data revealed an interaction network among virulence regulators and furthermore suggested a group of genes that were coordinately controlled by virulence regulators and are likely to be important for Salmonella virulence ). We show a simplified version of the inferred regulatory network that was obtained in this study in Figure 2. This network shows that the primary regulator responsible for SPI-2 virulence expression, SsrB, integrates signals from many regulators and two-component regulatory systems. Interestingly, this network showed that SlyA is directly upstream of SsrB, but that it also seemed to be directly regulating SPI-2 expression. Both these predictions were validated in the study, elaborating the role that SlyA plays in virulence in Salmonella.

protEin-protEin intEraction-BasEd mEthods
Protein-protein interactions play an essential role in cells because many proteins are part of larger complexes (Nooren and Thornton, 2003). There are many approaches to study PPIs including in silico predictions, yeast two-hybrid (Y2H) systems and AP (Perkins et al., 2010). Although in silico predictions are very useful, experimental validation is always required to ensure data accuracy (Perkins et al., 2010). Y2H screening is a very popular methodology to study PPIs, but may present high rates of false-positive and false-negative interactions (Perkins et al., 2010). Chowdhury et al. (2009) studied PPIs for three S. Typhimurium proteins (HimD, PduB, and PhoP), by expressing them fused with a histidine-biotin-histidine tag, chemical cross-link, and two-steps AP followed by proteomic analysis. With this approach several proteins were found, including previously characterized partners for the targeted proteins (Chowdhury et al., 2009). The advantage of cross-linking proteins is that depending on the type of AP, the purification can be performed in very stringent denaturing conditions, thus decreasing undesired unspecific interactions of proteins to the affinity beads (Guerrero et al., 2006(Guerrero et al., , 2008. Another AP approach to study PPIs is by performing the purifications of non-cross-linked samples in much milder conditions. However, this method can lead to more "noisy" data by having proteins binding non-specifically to beads. Recently, progress has been made to distinguish the specific interacting proteins from the background. Based on quantitative proteomic analysis and statistical analysis, Choi et al. designed a computational tool to determine the proteins that are significantly enriched compared to the background (Breitkreutz et al., 2010;Choi et al., 2011). Although, PPI and ChIP approaches are similar in many aspects, we believe that they are complementary. Thus, the combination of both PPI and ChIP may in the future lead to a more comprehensive view of the regulatory processes in Salmonella, which will lead to the ability to predict environmental responses by Salmonella. computational analysis of data to dEtErminE rEgulatory nEtworks ovErviEw High-throughput data acquisition methods have been effectively used to define regulons and regulatory networks in Salmonella as discussed above. However, these approaches require sophisticated computational analysis methods to provide reliable regulatory network predictions. Additionally, analysis of other kinds of data, for example genomic sequences, can provide corroborating or supporting information about virulence regulation in Salmonella. In this section we describe some of the computational approaches that have been applied to supply supporting information about virulence regulation, to derive regulatory networks from highthroughput data, and to develop predictive models of regulatory processes in Salmonella virulence. This kind of analysis is essential to complete a systems biology approach and to make sense of the complicated high-throughput data being generated (Aderem et al., 2011).

gEnomic data intEgration and infErEncE
Mining genomic sequence information has proven to be very informative about virulence in Salmonella, and this information directly supports virulence-related regulatory network analysis.
identifications from the quantitative analysis, in a process called accurate mass and time (AMT) tag approach (Zimmer et al., 2006). The AMT tag approach reduces labor-intensive experimental steps and enables relatively comprehensive analysis of large number of samples with deep proteome coverage.
The AMT tag approach has enabled a number of proteomic analyses of Salmonella that would have been difficult and highly labor-intensive otherwise. This includes analyses of the combined Salmonella and host proteomes during time course analyses (Shi et al., 2006(Shi et al., , 2009b and analysis of post-transcriptional regulatory mutants lacking Hfq and SmpB under multiple conditions (Ansong et al., 2009). In a more recent study, we have utilized the AMT tag approach to revisit the 14 virulence regulators studied by transcriptomic analysis . All 14 mutant strains lacking virulence regulators were grown in conditions that mimic intracellular environments and global proteomics measurements were made. We then inferred networks from proteomics data and used the structure and topology of these networks to find virulence effector candidates that were co-regulated with known virulence factors, such as the SPI-2 T3SS. Five of these candidates were found to be virulence proteins secreted through a non-T3SS mechanism, validating our approach and demonstrating the utility of regulatory networks. These examples show that high dimensional proteomic datasets can be useful to help elucidate gene regulatory networks or determine more complex mechanisms of protein abundance controls.

chromatin immunoprEcipitation-BasEd mEthods
Chromatin immunoprecipitation is a popular methodology to study gene regulation. ChIP-chip and ChIP-seq enable the determination of the genes regulated by certain known regulator. In these procedures DNA is cross-linked with interacting proteins and the regulator of interest is pulled down by immunoprecipitation (Gilchrist et al., 2009). The interacting regions of the genome are determined by either microarray (ChIP-chip) or by next-generation sequencing (ChIP-seq; Gilchrist et al., 2009). An elegant example of a ChIP-chip experiment has been demonstrated by Tomljenovic-Berube et al. (2010) in which they combine microarray analysis of ssrB-mutant bacteria with ChIP-chip experiments to determine the set of genes under regulatory control of SsrA-SsrB, which are major regulators of SPI-2 genes. In another example, the regulatory network of the transcription silencer H-NS was determined by ChIP-chip analysis (Dillon et al., 2010).
Although transcription factors (TFs) are key elements of expression control, they are often associated with large protein complexes. In this context, a valuable approach consists of analyzing the immunoprecipitated protein complex-DNA by mass spectrometry (ChIP-MS) leading to the identification of gene regulator partners (Lambert et al., 2009). An approach was recently introduced that enables the determination of loci-specific regulatory machinery. This method is based on the cross-linking of DNA-protein complexes; then the DNA is hybridized with region-specific probes and pull-down by affinity purifications (AP). This method became known as reverse-ChIP since rather than immunoprecipitating the regulators to discover their interacting loci in the genome, probes are designed to pull-down proteins bound to specific DNA regions (Dejardin and Kingston, 2009). inference method that relies on the similarity between gene profiles as well as a network inference method that assesses the impact of regulator deletions on the expression levels of other regulators to derive a network of regulatory interactions Prill et al., 2010) that was validated using RT-PCR. This regulatory network highlighted the redundancy and interdependency present in the virulence regulatory network in Salmonella (Figure 2), defined points of control for the regulatory network , and identified groups of important virulence genes with conserved regulatory patterns . Network inference is useful as a relatively unbiased approach to determining regulatory networks, but does not make use of the considerable amount of information available about regulation in bacterial systems.

gEnomE-scalE rEconstruction of rEgulatory and mEtaBolic nEtworks
A variety of approaches to modeling transcriptional regulation have been developed over the years, many of which are reviewed by Karlebach and Shamir (2008). When it comes to analyzing transcriptional output or developing predictive integrated models of regulation and metabolism on the genome-scale, constraint-based approaches such as network component analysis (NCA; Liao et al., 2003) and Boolean networks (Kauffman, 1969;Covert et al., 2001Covert et al., , 2008Gianchandani et al., 2006;Klamt et al., 2006;Shlomi et al., 2007;Graudenzi et al., 2011) and more recently probabilistic models (Chandrasekaran and Price, 2010) have been used. Figure 3 provides an overview of these methods. These methods all require a regulatory network structure as an input that may be assembled from manual curation of the literature (Gama-Castro et al., 2008), ChIP studies (Cho et al., 2009), or inference methods (Sabatti et al., 2002(Sabatti et al., , 2005Gardner et al., 2003;Margolin et al., 2006;Faith et al., 2007) discussed above and in a variety of other reviews (Chou and Voit, 2009;De Smet and Marchal, 2010). The regulatory network structure is often referred to as the connectivity matrix as it is modeled as a matrix where all of the TFs are on one axis, all of the genes Spatial organization of genes on the chromosome has been used to identify the Salmonella pathogenicity islands, which are essential for virulence and largely co-regulated (Salama and Falkow, 1999;Vernikos and Parkhill, 2006;Hensel, 2007;Chatterjee et al., 2008). An additional approach to identifying virulence-related genes from sequence information is by using the nucleotide composition of the gene, relying on the observation that higher G + C content is associated with horizontally transferred virulence genes (Collmer et al., 2002;Srividhya et al., 2007). Analysis of the promoter sequences for several TFs involved in virulence has further expanded their regulons (Wozniak and Hughes, 2008;Harari et al., 2009). Finally, we (Samudrala et al., 2009;McDermott et al., 2011) and others (Arnold et al., 2009;Lower and Schneider, 2009) have recently described methods to integrate this kind of genomic data to provide accurate prediction of type III secreted effectors, which do not share easily recognizable signal sequences that direct their secretion. These approaches have led to the definition of a number of novel effectors  and have provided insights about the conservation and regulation of T3SSs across organisms (Arnold et al., 2009;Buchko et al., 2010;McDermott et al., 2011). Since type III effectors are a primary means by which Salmonella promotes virulence in the host, better definition of the secreted effector repertoire is expected to shed light on virulence regulation and evolutionary patterns associated with virulence.

nEtwork infErEncE from high-throughput data
Prediction of regulatory interactions between genes based on highthroughput data is called network inference. Network inference is a data-driven process that relies on the association of genes based on similar expression patterns over a collection of different experimental conditions (De Smet and Marchal, 2010). Regulatory networks can be inferred from high-throughput data with good accuracy, if sufficient data is available. As discussed above we have used network inference approaches to identify interactions between regulators essential to virulence . In that study we applied an and Palsson, 2002) with the metabolic models increases their accuracy (Covert et al., 2004). In Covert et al. (2004), Boolean regulatory constraints were integrated with a metabolic network model for E. coli and used to simulate growth phenotypes for over 13,000 conditions described in the ASAP database (Glasner et al., 2006). The integrated model accurately simulated growth for ∼79% of the conditions; when the regulatory constraints were removed accuracy dropped to ∼65%. Amongst the set of failures, Covert et al. (2004) noticed that low oxygen (O 2 ) concentration was a common theme. Pathogens will likely encounter low O 2 levels when interacting with the innate immune response. This highlights the need to improve our knowledge of regulatory interactions in microaerobic and anaerobic conditions.
Recently, Barua et al. (2010) developed an automated approach for using global phenotypic (phenomic) data in conjunction with BiGG metabolic models to update Boolean regulatory rules. Additionally, they used their approach along with sequence homology data to construct and refine a putative regulatory network for S. Typhimurium LT2. Integrating this putative regulatory network with experimental and inference methods described above may result in an initial global regulatory network for Salmonella that can be used with NCA to interpret transcriptome data or with BiGG metabolic models to develop predictive models.
Chandrasekaran and Price (Covert and Palsson, 2002) devised another promising method to incorporate regulation with metabolism at the genome-scale. This method "PROM" integrates omics data with a priori regulatory network information to model regulation in a continuous fashion. The two attractive features of PROM are that it does not model gene expression as Boolean and that it can be used with an inferred regulatory network to make qualitative phenotypic predictions. While direct measurements of TF/ target interactions are preferred it is often not possible to acquire the information for organisms of interest.
Finally, it is possible to use networks inferred from global gene expression profiling to generate models capable of predicting gene expression under novel conditions (Bonneau et al., 2007). In this approach the relationships between regulators and their target regulons are represented as mathematical equations. This set of equations can then be used to generate predictions of system behavior when growth conditions have changed or when individual regulators have been inactivated.

futurE prospEcts mEtaBolomE profiling
In addition to the explicit effectors of the virulence program (T3SS effectors, etc.), Salmonella must tightly regulate its metabolism to allow survival in the host environment. Changes in Salmonella metabolism can be addressed using metabolomics approaches, which can provide untargeted, quantitative measurements of perturbations in the metabolite complement of an integrated biological system in response to stimuli (Nicholson et al., 1999). In this respect, metabolomics analyses have not yet been widely implemented in the study of pathogenic bacteria (Bundy et al., 2005;Boroujerdi et al., 2009;Henderson et al., 2009;Himmelreich et al., 2009;de Carvalho et al., 2010;Eylert et al., 2010;Raghunathan et al., 2010;Simon et al., 2010). Indeed, only two recent papers report the application of metabolomics in the study of Salmonella: Xu et al.
on the other axis, and a non-zero element in the matrix indicates that a TF is known to influence the expression of the corresponding gene. A second common feature for these methods is that they were designed to only approximate the interactions between a TF and its target genes: NCA employs a Hill-type equation for each TF-gene interaction, Boolean models represent each gene as on or off based on the sum of Boolean interactions for all TFs that interact with the gene, and probabilistic regulation of metabolism (PROM) uses linear weights derived from prior transcriptome data to constrain flux through a gene's associated enzymatic activities based on current TF expression profiles.
It is possible to integrate a variety of data types, such as transcriptome measurements, motifs, promoter regions, ChIP data, and growth phenotypes, with these modeling tools to update the connectivity matrix for a specific condition or during analysis to uncover additional hypothetical interactions (Foat et al., 2005;Sabatti and James, 2006;Hyduke et al., 2007;Wang et al., 2008;Barua et al., 2010;Tran et al., 2010). These methods have been predominantly used with E. coli due to the availability of a global regulatory network, but as Salmonella's regulatory network is delineated through the methods described above it will be possible to apply these methods to the massive amounts of Salmonella highthroughput data. Since classical virulence genes such as adhesins, toxins, and secreted effectors are not included in standard metabolic models the integration of metabolic models with more descriptive data-driven models will be of paramount importance to fully model virulence in Salmonella.
There have been several studies in Salmonella that have modeled regulation of the SPI-1 T3SS, which is important for virulence as described above. Two studies (Temme et al., 2008;Saini et al., 2010) described experimental approaches to define parameters for computational models of the TFs (HilA, HilC, HilD, and RtsA) and their downstream targets (components of the T3SS and effector proteins). These studies allowed determination of how individual components in the SPI-1 regulatory network coordinate its virulence program. Though these studies are not aimed at providing a comprehensive, global view of the virulence regulatory network in Salmonella they provide valuable insight into the details of the regulatory network driving SPI-1 expression.

prEdictivE modEling
Computational models can be used to predict behaviors of the system under conditions not previously considered. Integrated models of regulation and metabolism may be used to simulate the growth phenotypes of microbes in a variety of conditions, including those relevant to infection (Bumann, 2009). Metabolic network reconstructions aim to be comprehensive biochemically, genetically, and genomically (BiGG) consistent knowledge bases for an organism (Reed et al., 2006;Schellenberger et al., 2010). The recent publication of a BiGG metabolic knowledgebase for Salmonella Typhimurium LT2 represents a resource that may be used as a platform for integrated modeling of metabolic activities during infection (Bumann, 2009;Thiele et al., 2011).
Mathematical models derived from BiGG knowledge bases will often predict capabilities that are not biologically realizable; this is due to the lack of regulatory constraints. Integration of Boolean regulatory constraints (Kauffman, 1969;Covert et al., 2001;Covert tial to pathogenesis is exposure to reactive nitrogen species, such as nitric oxide (NO) and nitrosothiols (RSNOs), which are produced by the innate immune in response to the presence of a pathogen and induce cytolysis of the invading pathogen (MacMicking et al., 1997;Fang, 2004). In an effort to unravel the full effects of RNS on E. coli, a variety of transcriptomic surveys were made (Mukhopadhyay et al., 2004;Flatley et al., 2005;Justino et al., 2005) and with ad hoc analysis important interactions were implicated. However, the precise targets and regulatory response have not yet been elucidated.
Based on the hypothesis that it would be possible to trace a shift in regulatory network activity in response to a perturbation back to its originating source, we designed a series of experiments where we measured the transcriptomic response of E. coli to NO and S-nitrosoglutathione and used NCA to identify the TFA perturbations (Hyduke et al., 2007). A number of the significantly perturbed TFs (IscR, NorR, NsrR) have been associated with the NO-response, however, they had not be examined as an integrated system (D'Autreaux et al., 2005;Rodionov et al., 2005;Pullan et al., 2007). Next, for the significantly perturbed TFs we constructed deletion mutants to validate the NCA calculations and observe the changes in phenotypes. From the NCA-deduced perturbations in TFAs and a series of phenotyping studies we were able to identify the iron-sulfur cluster of dihydroxyacid dehydratase as a crucial enzyme that innate immune system targets to inhibit bacterial growth. Salmonella may exhibit similar responses due to its relatively recent divergence from E. coli, however, the larger number of regulators and greater variety of pathogenic strains indicate that there will be key differences. After Salmonella's global regulatory network is further delineated, it will be possible to use NCA to identify the crucial differences.
conclusion Salmonella systemic infection is accomplished by a sophisticated regulatory network orchestrating activities of multiple regulators. Virulence regulators process and integrate a variety of hostile environmental cues including acidic pH, antimicrobial peptides, and reactive oxygen species within macrophages, and trigger induction of specific subsets of genes required for successful replication and evading lysosomal fusion. The coordinated action by this regulatory network involves numerous virulence factors including SPI-1 and -2 T3SSs. Genes that are coordinately controlled by multiple virulence regulators under infectious conditions and show a similar expression profile to that of well-known virulence genes are more likely to be important for pathogenesis. Thus, reconstructing the global regulatory network during infection or, at the very least, under conditions that mimic the host cellular environment can serve as an efficient strategy for the identification of genes that are required for Salmonella virulence .
Impairing microbial virulence could be an interesting strategy to fight infections, so we propose that these secreted effectors among others could be excellent targets for the development of new antimicrobial therapies. Drugs could be designed to interfere in their binding to host proteins or even to disrupt their enzymatic activities. In this context, small compound libraries can be screened to select the drug candidates. Rasko et al. (2008) used a library of small organic compounds to screen for molecules that bind and inhibit QseC. QseC is a sensor histidine kinase that recognizes host adrenergic (2011) used a volatile organic compound (VOC)-based metabolic profiling approach to differentiate between natural spoiled pork and pork contaminated with Salmonella, and White et al. (2010) employed gas chromatography-mass spectrometry and nuclear magnetic resonance spectroscopy to identify metabolite profiles specific for Salmonella cells in radar (red, dry, and rough morphotype) colony biofilms and for cells that do not produce an extracellular matrix due to a csgD deletion mutant. Neither study addressed the Salmonella virulence regulatory network.
The role of metabolites in Salmonella virulence is two-fold. First and foremost, they represent nutrients necessary to sustain cellular growth and division and to provide the basic building blocks of virulence effectors. Studies using metabolic mutants have revealed insights into the nutritional requirements of Salmonella both in vitro and in vivo. For example, glucose is required for efficient intracellular growth in the Salmonella-containing vacuole (Tchawa Yimga et al., 2006;Gotz et al., 2010); however, a mutant unable to uptake glucose was still able to replicate within cells at a reduced rate, indicating that other carbon sources can be utilized when necessary (Bowden et al., 2009). Similar studies revealed various degrees of attenuation in mice infected by mutants lacking a complete TCA cycle (Tchawa Yimga et al., 2006), defective in converting malate to pyruvate, or lacking isocitrate lyase (Mercado-Lubo et al., 2009). The second role of metabolites in Salmonella virulence is as signaling molecules. Quorum sensing, or the recognition of bacterial cell populations based on concentrations of key molecules, is the smallmolecule signaling mechanism in bacteria (Vendeville et al., 2005;Boyen et al., 2009). While the majority of Gram-negative bacteria utilize N-acylhomoserine lactones (AHLs), or autoinducer-1, as the signaling molecules (Boyen et al., 2009), Salmonella does not produce AHLs (Soares and Ahmer, 2011). Instead, Salmonella relies on autoinducer-2 molecules, such as 4,5-dihydroxy-2,3-pentanedione (DPD; Choi et al., 2007). Alternatively, the nucleotide alarmones guanosine 5′-diphosphate-3′-diphosphate and guanosine 5′-triphosphate-3′-diphosphate (collectively referred to as ppGpp) play a comprehensive role in Salmonella virulence (reviewed in Dalebroux et al., 2010). An as yet relatively unexplored area of research is the role played by host cell signaling molecules in virulence. In this respect, epinephrine was recently reported to play an important role in modulating Salmonella virulence (Spencer et al., 2010). Thus, comprehensive metabolomics analyses should be able to provide additional insights into the mechanisms by which Salmonella receives nutrition during infection and the role of small signaling molecules in virulence.

analyzing transcriptional output
Given their ability to make predictions about the outputs of a network (e.g., virulence processes, metabolic function) given inputs (e.g., growth conditions, carbon source, environmental cures) genome-scale reconstructions will be able to provide a more complete model of Salmonella virulence. One goal of constraint-based regulatory network analysis is to be able to deduce the activity of all of an organism's TFs in a given condition. The TF activities (TFAs) will provide insight into how the TF network responds to a given perturbation. Once the TFA changes are known in response to a specific perturbation, it may be possible to determine the underlying factors that drive these changes. One perturbation that is essen-  (1996). Acid shock induction of RpoS is mediated by the mouse virulence gene mviA of Salmonella typhimurium. J. Bacteriol. 178, 2572-2579. Beuzon, C. R., Banks, G., Deiwick, J., Hensel, M., and Holden, D. W. (1999). pH-dependent secretion of SseB, a product of the SPI-2 type III secretion system of Salmonella typhimurium. Mol. Microbiol. 33,[806][807][808][809][810][811][812][813][814][815][816] rEfErEncEs drug discovery efforts. The ability of Salmonella to introduce proteins directly into host cells has raised the possibility of targeted therapeutic approaches for other diseases using Salmonella as the delivery vehicle (Chamekh, 2010). Finally, there have been efforts to use Salmonella for biotechnology applications, such as exporting spider silk monomers (Widmaier et al., 2009). Clearly, the complicated regulatory control of virulence proteins will need to be better understood in order for progress to be made in these efforts.
In this review we have discussed various systems biology approaches to determination of regulatory networks involved in virulence in Salmonella. With the increasing body of knowledge and data arising from the high-throughput approaches discussed, and those that are currently being developed, it is very important that more sophisticated computational approaches to use this information be developed. Comprehensive systems models of Salmonella pathogenesis will have applications for antibiotic development, new strategies for therapeutic treatments, and further understanding of the complex interplay between pathogen and host during infection. Improved understanding of virulence regulation will be essential to fully realize the potential of each of these applications.

acknowlEdgmEnts
This work was supported by the National Institute of Allergy and Infectious Diseases, NIH/DHHS, through interagency agreement Y1-AI-8401-01 and NIH RO1 AI022933. signals by binding to epinephrine and norepinephrine (Rasko et al., 2008). After binding to any of these signaling molecules, QseC autophosphorylates and subsequently phosphorylates the TF QseB leading to a cascade that results in the expression of virulence factors. After screening the library, one compound named LED209 was found to bind with high affinity and to strongly inhibit QseC autophosphorylation disrupting the expression of known virulence factors. Furthermore, LED209 was also shown to decrease bacterial virulence in murine infections. Understanding regulatory networks and virulence processes at the system level will allow development of more sophisticated and nuanced approaches to target specific proteins and entire pathways. Information about the regulatory and metabolic networks controlling virulence can be used to identify better targets for drug development. Small-molecule inhibitors that can target interactions between proteins, enzymes, effector, and host proteins, or transcriptional regulatory interactions, will allow finer control of virulence processes and thus better antibiotics.
Another approach to inhibiting virulence in Salmonella is to target the T3SS apparatus itself with inhibitors. Several smallmolecule inhibitors have been developed that target the T3SS and impair secretion (Nordfelth et al., 2005;Muschiol et al., 2006;Bailey et al., 2007;Negrea et al., 2007). Additionally, we have undertaken preliminary structural studies of the disordered N-terminal secretion signal of T3SS effectors in Salmonella (Buchko et al., 2010) and this information could potentially be used for future