<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Physiol.</journal-id>
<journal-title>Frontiers in Physiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Physiol.</abbrev-journal-title>
<issn pub-type="epub">1664-042X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fphys.2022.815874</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Physiology</subject>
<subj-group>
<subject>Hypothesis and Theory</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Missing Links Between Gene Function and Physiology in Genomics</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Collado-Vides</surname> <given-names>Julio</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/420102/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Gaudet</surname> <given-names>Pascale</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>de Lorenzo</surname> <given-names>V&#x00ED;ctor</given-names></name>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/96091/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Centro de Ciencias Gen&#x00F3;micas, Universidad Nacional Aut&#x00F3;noma de M&#x00E9;xico</institution>, <addr-line>Cuernavaca</addr-line>, <country>Mexico</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Biomedical Engineering, Boston University</institution>, <addr-line>Boston, MA</addr-line>, <country>United States</country></aff>
<aff id="aff3"><sup>3</sup><institution>Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Universitat Pompeu Fabra</institution>, <addr-line>Barcelona</addr-line>, <country>Spain</country></aff>
<aff id="aff4"><sup>4</sup><institution>SIB Swiss Institute of Bioinformatics, Swiss-Prot Group</institution>, <addr-line>Geneva</addr-line>, <country>Switzerland</country></aff>
<aff id="aff5"><sup>5</sup><institution>Department of Systems Biology, Centro Nacional de Biotecnolog&#x00ED;a CSIC, Universidad Aut&#x00F3;noma de Madrid</institution>, <addr-line>Madrid</addr-line>, <country>Spain</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Pierre Grenon, University at Buffalo, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Padhmanand Sudhakar, KU Leuven, Belgium; Virag Sharma, University of Limerick, Ireland; Alexey Goltsov, Moscow State Institute of Radio Engineering, Electronics and Automation, Russia</p></fn>
<corresp id="c001">&#x002A;Correspondence: Julio Collado-Vides, <email>colladojulio@gmail.com</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Computational Physiology and Medicine, a section of the journal Frontiers in Physiology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>28</day>
<month>02</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>815874</elocation-id>
<history>
<date date-type="received">
<day>15</day>
<month>11</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>01</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Collado-Vides, Gaudet and de Lorenzo.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Collado-Vides, Gaudet and de Lorenzo</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Knowledge of biological organisms at the molecular level that has been gathered is now organized into databases, often within ontological frameworks. To enable computational comparisons of annotations across different genomes and organisms, controlled vocabularies have been essential, as is the case in the functional annotation classifications used for bacteria, such as MultiFun and the more widely used Gene Ontology. The function of individual gene products as well as the processes in which collections of them participate constitute a wealth of classes that describe the biological role of gene products in a large number of organisms in the three kingdoms of life. In this contribution, we highlight from a qualitative perspective some limitations of these frameworks and discuss challenges that need to be addressed to bridge the gap between annotation as currently captured by ontologies and databases and our understanding of the basic principles in the organization and functioning of organisms; we illustrate these challenges with some examples in bacteria. We hope that raising awareness of these issues will encourage users of Gene Ontology and similar ontologies to be careful about data interpretation and lead to improved data representation.</p>
</abstract>
<kwd-group>
<kwd>gene ontology</kwd>
<kwd>microbial annotations</kwd>
<kwd>gene function</kwd>
<kwd>challenges and issues</kwd>
<kwd>mechanisms and physiology</kwd>
</kwd-group>
<contract-num rid="cn001">5RO1GM131643</contract-num>
<contract-sponsor id="cn001">National Institute of General Medical Sciences<named-content content-type="fundref-id">10.13039/100000057</named-content></contract-sponsor><contract-sponsor id="cn002">Universidad Nacional Aut&#x00F3;noma de M&#x00E9;xico<named-content content-type="fundref-id">10.13039/501100005739</named-content></contract-sponsor>
<counts>
<fig-count count="1"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="30"/>
<page-count count="7"/>
<word-count count="5300"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>In the first pages of &#x201C;The Elements of Chemistry,&#x201D; Antoine Laurent Lavoisier quoted a philologist (a linguist) of his time, to explain the main goal of his classification, the one of making it possible to simultaneously name a substance and to classify it (<xref ref-type="bibr" rid="B14">Lavoisier, 1790/1965</xref>). In biology, naming the properties of genes and classifying them is a task that, with the emergence of genomics, can be gradually connected with multiple higher levels of description up to the level of addressing the understanding of the cell. We know that the genome sequence of an organism does not provide understanding of an organism&#x2019;s functioning. Moreover, even knowing the function of all genes does not necessarily imply that we can understand the functioning of a whole cell. In this manuscript we will raise questions with examples referring to bacteria; because bacteria are mostly single-cell life forms with smaller genomes (compared to multicellular and multi-organ organisms), they certainly are a good case study for tackling these issues.</p>
<p>Function can be defined as the role played by an element within the system or ensemble it belongs to. Physiology, on the other hand, is that part of biology devoted to explaining the overall function of cells, tissues, and organs, i.e., the outcome of all combined molecular functions, and how these respond to changing conditions. Since the advent of molecular biology and even more recently the easy availability of genome sequencing, our approaches to describe biological systems have become more and more reductionist. It would be highly beneficial to bridge the gap in our knowledge regarding how molecules translate into the function&#x2014;and malfunction&#x2014;of an organism, thus connecting gene function with physiology.</p>
<sec id="S1.SS1">
<title>Functional Categorization Versus Understanding</title>
<p>The major resources, such as Gene Ontology (GO), that capture the annotation of biochemical and functional knowledge of proteins, compounds, reactions, and interactions have made important contributions for genomics (<xref ref-type="bibr" rid="B2">Ashburner et al., 2000</xref>; <xref ref-type="bibr" rid="B4">Carbon et al., 2021</xref>). GO has enabled a major step forward in genomics by constructing a controlled vocabulary that can be used in the annotation of any molecular process or reaction, in any organism. This is the foundation that supports a large number of tools to perform computation on functional data in the three kingdoms of life. However, as is frequent in life, a strength can pose at the same time a limitation. <xref ref-type="fig" rid="F1">Figure 1</xref> shows the most commonly annotated top-level terms of the GO categories that are utilized in the description of knowledge of <italic>Escherichia coli</italic> K-12, one of the best-characterized microbes.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>A portion of the structure for &#x201C;biological processes&#x201D; within Gene Ontology. All terms shown are required to describe properties of genes from <italic>E. coli</italic> K-12, with an expanded view of the top-level categories. The whole set of categories for <italic>E. coli</italic> K-12 genes can be found at <ext-link ext-link-type="uri" xlink:href="https://biocyc.org/ECOLI/new-image?object=GO%3A0008150">https://biocyc.org/ECOLI/new-image?object=GO%3A0008150</ext-link>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fphys-13-815874-g001.tif"/>
</fig>
<p>At the first level below the root node, &#x201C;biological process,&#x201D; there are classes such as cellular processes, biological regulation, response to stimulus, localization, biological adhesion, and metabolic processes. A different classification, MultiFun, devoted to <italic>E. coli</italic>, has 10 major classes: Metabolism, Information Transfer, Regulation, Transport, Cell Processes, Cell Structure, Location, Extra-chromosomal origin, DNA Site, and Cryptic Gene (<xref ref-type="bibr" rid="B25">Riley, 1993</xref>). These major categories are the roots of a hierarchy of several subclasses, and individual genes are classified into one or multiple categories in a structure similar to that in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
<p>These numerous classes and their subclasses are the result of essentially a bottom-up approach of annotation of individual gene functions across different species within the GO and, in the case of MultiFun, the result of years of work specifically in the annotation of <italic>E. coli</italic> genes by Monica Riley in the earlier days of genome analysis (<xref ref-type="bibr" rid="B25">Riley, 1993</xref>; <xref ref-type="bibr" rid="B27">Serres and Riley, 2000</xref>). They have the common aim of describing all known functions of gene products. Both GO and MultiFun are examples of classifications used in the annotation of functions of genes as they continue to be uncovered.</p>
<p>While these classifications help in the annotation of gene properties, they are not adequate to fully explain the functioning of an organism. Ideally, the major groupings in a classification system would correspond to the major concepts needed to grasp functional microbiology, or the physiology of a multicellular organism. The prevailing view of a cell is that of an evolving and self-replicating chemical factory run by a program, coded in the DNA, and whose main (if not sole) agenda is to survive and proliferate as much as possible, hence ensuring the transmission of its genome. This involves adapting to a changing and often stressful, competitive environment. This general principle is not conveyed at the high-level description of the functional genomic classifications. Why is there such a large discrepancy between the major classes of functional classification systems and the main concepts required to understand the major principles of microbial physiology?</p>
<p>Function, as defined in GO (<xref ref-type="bibr" rid="B28">Thomas, 2017</xref>), is &#x2018;&#x2018;a specific objective that the organism is genetically programmed to achieve.&#x2019;&#x2019; Note that this definition encompasses what GO refers to as &#x2018;&#x2018;molecular function&#x2019;&#x2019; and &#x2018;&#x2018;biological process.&#x2019;&#x2019; The contributions of both annotations for a gene product identify the properties within a larger &#x2018;&#x2018;system&#x2019;&#x2019; it belongs to. However, these annotations and classes are too granular for general principles such as fitness, survival, and viability to emerge. These types of concepts, concerning the properties of entire organisms (phenotypes, in the wide meaning of the term), are described in other ontologies such as Ascomycete Phenotype Ontology.<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> There is no easy way to computationally navigate between these organism-level observations and the complement of reactions and signaling processes that mediate them.</p>
</sec>
<sec id="S1.SS2">
<title>Granularity of Gene Functions Versus Physiological Processes</title>
<p>The granularity of genes may be too low for understanding where a physiological capability achieved at the molecular level is located in the larger cellular context. The structure of the GO contains three different classifications, defined as: molecular function for the activities of individual genes, biological process for pathways and larger processes also defined as specific objectives that the cell or organism is genetically programmed to achieve, as already mentioned, and cellular component to indicate where the gene products are active<sup><xref ref-type="fn" rid="footnote2">2</xref></sup>.</p>
<p>A less-detailed level of description, to understand processes, could be that of biochemical pathways, or similarly, the level of activities of collections of genes within transcription units, operons, and even at the level of regulons, that is to say, genes subject to regulation by the same transcription factor (TF). An annotation effort at this level is illustrated by the molecular and functional descriptions of regulons or genetic sensory-response units (GENSOR Units) in RegulonDB (<xref ref-type="bibr" rid="B6">Gama-Castro et al., 2016</xref>). Take for instance the TrpR (tryptophan regulator) regulon of <italic>E. coli.</italic> In RegulonDB, the collective function of 9 genes is summarized as: &#x2018;&#x2018;The presence of tryptophan inhibits the expression of genes needful for transport and synthesis of tryptophan from serine and conversion of erythrose-4P to chorismate, a precursor of tryptophan&#x2019;&#x2019;.<sup><xref ref-type="fn" rid="footnote3">3</xref></sup> This would be called tryptophan homeostasis in GO. This example shows a good direction of succinctly describing genetic capabilities of groups of genes. The Gene Ontology Consortium, with the development of the Causal Activity Model (GO-CAM), now provides a mechanism to express this type of information (<xref ref-type="bibr" rid="B29">Thomas et al., 2019</xref>).</p>
<p>We know that only around one-fifth of regulons in <italic>E. coli</italic> K-12 encode genes that collectively belong to a single biological process; several regulons encode genes involved in multiple biological processes, whereas in 16% of the 189 known regulons, the gene products encode reactions that cannot be connected to build pathways (<xref ref-type="bibr" rid="B15">Ledezma-Tejeida et al., 2019</xref>).</p>
<p>When grouping genes into complex regulons, defined as those subject to regulation by multiple and exactly the same TFs, the functional homogeneity of their regulated genes increases considerably in terms of belonging to a dominant functional class [See Figure 3 in <xref ref-type="bibr" rid="B15">Ledezma-Tejeida et al. (2019)</xref>]. It is true that this has to be taken with a grain of salt, since currently described regulons are not yet comprehensive since new TF-gene interactions may still be discovered for many TFs. We need to wait for the full genome characterization of complex regulons to solidify this interesting observation. Certainly, finding biologically motivated classes of genes that together perform a dominant function would offer a highly desired intermediate granular level of functional description connecting gene functions with cellular capabilities. At present, we can only speculate whether this intermediate level will be first confirmed in <italic>E. coli</italic> K-12, present in other bacteria, and less likely, present also in eukaryotes. It will be important to keep capturing whole-genome TF-gene regulatory interactions. Next-generation functional analysis tools should be able to integrate the various levels of knowledge to allow more powerful querying.</p>
</sec>
<sec id="S1.SS3">
<title>Dealing With Unknowns</title>
<p>One non-trivial issue is that&#x2014;logically&#x2014;we follow a conceptual frame of what a living system is and what is necessary for it to work according to the prevalent concepts and notions of the time. This rules out, therefore, far unknown requirements that may be disclosed in the future when more knowledge is acquired, and it raises the challenge of handling the sure <italic>unknown unknowns</italic> that still remain to be understood and cannot yet be assigned sound gene annotations. In fact, we always work with incomplete knowledge. There is no genome yet with all its genes functionally characterized. GO deals with this by annotating a gene directly to the root term (either &#x201C;molecular function,&#x201D; &#x201C;biological process,&#x201D; or &#x201C;cellular component&#x201D;) when any one aspect has not been characterized. This formally means that the gene <italic>enables some &#x201C;molecular function,&#x201D; participates in some &#x201C;biological process</italic>,&#x201D; and <italic>is active in some &#x201C;cellular component.&#x201D;</italic> Fully annotated organisms have at least this information for every gene.</p>
<p>Another confounding factor is semantic ambiguity, even in the scientific community. Unequivocal functionality/annotation of a gene can be assigned in many cases without ambiguity, especially when the meaning of a word or a concept is clearly defined in a dictionary or an ontology. But the information used to annotate genomes is extracted from research articles using natural language processing, and the meaning of a term used in an article may differ significantly from that of the ontological concept definition, leading to misinterpretation and inconsistencies in annotation.</p>
</sec>
<sec id="S1.SS4">
<title>Orthology-Based Annotations</title>
<p>A traditional approach to functionally annotate ORFs (open reading frames) on the basis of the function historically assigned a first biological description and then propagated it to related sequences. Without further evidence, this is a reasonable approach to a first &#x201C;guess&#x201D; of the function of an uncharacterized gene, but this can lead to confusion, since contradicting that initial misannotation can be difficult, if only because researchers are misled rather than just unguided in their hypothesis.</p>
<p>A revealing example is the function of the arch-famous Catabolite Regulation Protein, or CRP. The <italic>crp</italic> gene was first discovered and exhaustively studied in <italic>E. coli</italic> as a key component of the regulation of the equally arch-famous <italic>lac</italic> promoter in this species. It was later found that CRP also regulates a large number of other metabolic processes, in addition to utilization of lactose as a carbon source, both negatively and positively in response &#x2013; <italic>inter alia</italic> &#x2013; to fluctuations in intracellular cAMP levels. Yet, years later it was shown that virtually identical CRP orthologs found in other bacterial species [e.g., <italic>Pseudomonas</italic> (<xref ref-type="bibr" rid="B20">Milanesio et al., 2011</xref>)] have nothing whatsoever to do with metabolism but play a role in membrane-related functions. Finally, recent data suggest that in reality CRP in <italic>E. coli</italic> is not only a promoter-specific regulator but also an authentic genome-scaffolding protein (<xref ref-type="bibr" rid="B9">Heyde et al., 2021</xref>) which responds not just to cAMP but to other physiological effectors like cytidine (<xref ref-type="bibr" rid="B13">Lauritsen et al., 2021</xref>) and thus has an additional global role. This highlights the importance of maintaining ontologies and the databases relying on them up to date to represent as closely as possible the current state of knowledge.</p>
<p>The <italic>crp</italic> gene example is not an isolated case: it illustrates a widespread phenomenon called exaptation, i.e., co-opting of a given biological object (or its gene thereof) for a function very different from what it originally evolved. How to translate such functional ramifications of the same gene into sound criteria for predicting authentic biological roles remains a serious challenge. A conserved mechanism (binding of a TF to a given target DNA in response to a distinct effector) may result in entirely different physiological outcomes. Annotations in genomic databases have traditionally merged molecular mechanisms of individual gene products and their physiological roles-in-context, thereby causing considerable confusion. GO-CAM alleviates this problem, by connecting gene function to activators, substrates, and larger processes.</p>
<p>The Gene Ontology Consortium has been using a phylogenetic-based approach to complement annotations directly derived from experimental data (<xref ref-type="bibr" rid="B7">Gaudet et al., 2011</xref>). Conserved orthologs are annotated with functions that are expected to be conserved in the target species, while taking care not to infer functions that can easily diverge following gene duplication. This increases the coverage of annotations, in particular for non-model organism species for which experimental data are more scarce.</p>
</sec>
<sec id="S1.SS5">
<title>Universal Principles of Microbial Physiology</title>
<p>In his reference book on bacterial physiology, Frederick Neidhardt suggests that the whole set of chemical reactions of bacterial cells can be categorized as: assembly, polymerization, biosynthetic, and fueling reactions (<xref ref-type="bibr" rid="B21">Neidhardt et al., 1990</xref>). This, or alternative higher-level descriptions, could help to identify the most conserved elements present in many organisms and facilitate the identification, or even the validation within new genomes, of plausible general principles of cell physiology.</p>
<p>Certainly, in addition to shared evolutionary origin, any substance has both physical and chemical properties that the cell has to deal with, and as a consequence, some aspects of their biochemistry are shared across many organisms. Processing of carbon sources and nitrogen sources happens with associated consequences in acidification or osmotic changes in <italic>E. coli</italic> and in many other bacteria. As a consequence, some aspects of the combinatorial nature of control may also be shared across many bacteria, such as co-occurrence of nitrogen with extrusion of acid, or the use of glucose and extrusion of acid stress; all of these are due to the common chemical ground of molecules and their properties. It is reasonable to assume that some combinations or integration of physiology will be shared across bacteria given some universal properties of such biochemical and chemical processes, even if the precise mechanisms vary from one species to the next. Most likely, shared combinations of physiology will be found in organisms that have commonalities in their environment.</p>
<p>The top nodes of the biological process GO, similar to the top 10 nodes of MultiFun, include categories that are devoid of physiological content, such as: metabolic process, cellular process, biological regulation, signaling, or transport. These ontology-driven concepts contribute to a way of ordering the activities of genes. An interesting link would be to map gene functions to a separate collection of categories, all of which entail physiological content, with a structure reflecting the organization of bacterial physiology. Bacteria are governed by the need to survive, grow, and reproduce. A first exercise would require an annotation effort that could capture the consensus perspective of a group of experts in the field. This conceptual effort would strongly benefit by defining a dialogue with genome-wide and phenotype characterization experiments with tools&#x2014;similar to gene enrichment&#x2014;that help both the interpretation of experiments and the improvement of the conceptual model. Briefly, we suggest that a mapping of the current GO categories with a new collection of categories&#x2014;devoid of ontological requirements&#x2014;that reflect the priorities of bacterial physiology generated by a group of experts in the field would be a step forward to connect gene categories with principles of microbial physiology.</p>
</sec>
<sec id="S1.SS6">
<title>Physiology (High-Level) Drives Mechanism (Low-Level) and Not the Other Way Around</title>
<p>It is important to note that the physiological needs of an organism under specific conditions select for given mechanisms available to fulfill the function&#x2014;and not vice versa. The specific molecular mechanisms to achieve a given physiological function can vary considerably, even among not-so-distant evolutionarily related microorganisms.</p>
<p>A clear-cut boundary between the biology of mechanisms and physiology in our perspective is the distinction between activator and repressors vs. inducible and repressible systems. Terms like activator and repressor in the description of gene regulators are required when describing mechanisms. Furthermore, genes subject to activation can be induced or repressed just as genes subject to repression, in the sense of being negatively regulated, can be induced or repressed. For instance, the classic LacI repressor is a player of an inducible system since, in the absence of glucose, when lactose is present and is incorporated into the cell, allolactose will bind to LacI, provoking its unbinding from the operator sites and inducing transcription of the <italic>lac</italic> operon. In our opinion the terms &#x201C;inducible&#x201D; and &#x201C;repressible systems&#x201D; as a consequence of the appearance of a signal belong to physiology, whereas the type of regulator characterizes their mechanisms.</p>
<p>Cellular capabilities described as physiological activities are more likely to remain valid definitions as new research continues to expand and new mechanisms are discovered. An example of this challenge can be found in the discussion and update of major concepts of transcriptional regulation in bacteria (<xref ref-type="bibr" rid="B18">Mej&#x00ED;a-Almonte et al., 2020</xref>). Whether the two levels (physiological, mechanistic) can be automatically distinguished through computational means and retrieved with the same ease that users enjoy with existing annotation platforms remains to be seen (<xref ref-type="bibr" rid="B19">Mej&#x00ED;a-Almonte and Collado-Vides, 2019</xref>). Probably advanced, dedicated artificial intelligence approaches will have to be developed and adopted for boiling down the plethora of available data to predict useful and significant functionalities.</p>
</sec>
<sec id="S1.SS7">
<title>Physiology Versus Mechanisms (Molecular Biology)</title>
<p>We envision that ontologies and databases could/would provide data to test general hypotheses, such as &#x201C;physiology is more conserved than the specific mechanisms to achieve a given function or biological program.&#x201D; For this to happen biologically major terms that distinguish molecular biology and mechanistic descriptions from physiology need to be more clearly defined and incorporated into formal frameworks.</p>
<p>Biology is a very rich discipline with knowledge that goes from genetics to behavior, from structure to evolution and dynamics. It is noteworthy that this special journal issue addresses ontology and physiology, whereas most microbial genomic databases are still struggling with mechanisms, processes, and genetics (<xref ref-type="bibr" rid="B10">Kanehisa et al., 2017</xref>; <xref ref-type="bibr" rid="B11">Karp et al., 2019</xref>; <xref ref-type="bibr" rid="B26">Santos-Zavaleta et al., 2019</xref>; <xref ref-type="bibr" rid="B22">Ng et al., 2020</xref>; <xref ref-type="bibr" rid="B12">Keseler et al., 2021</xref>). Genomic databases gather what we can call genotypic properties, where activities of biomolecules are described but, so far, in the case of microbial organisms, without the precise description of the growing conditions under which a subset of genotypic properties are being actively used.</p>
<p>Gene ontology is a highly used resource to compare pairs of experimental and control whole-genome transcriptional profiles to identify the enriched functions within the genes that show changed expression. The expanding genomic high-throughput technologies are accelerating the identification of the elements of transcriptional regulatory networks (transcription start sites, transcription factor binding sites, terminators, transcription units, and expression profiles) at the genomic level. How can we best combine these resources to characterize the dynamics of inducing and repressing complete biological processes and link them to their mechanisms? Combining genomes, functional gene annotations, and the deciphering of regulatory networks should improve our capability of mapping specific mechanisms with cellular collections of physiological capabilities.</p>
</sec>
<sec id="S1.SS8">
<title>Phenotypic High-Throughput Annotations?</title>
<p>The link genotype-phenotype has been most often assigned on the basis of heterogeneous experiments with data expressed in formats that typically lack semantic let alone quantitative standards amenable to computation. One ongoing trend to tackle this issue is massive testing of mutant collections grown under a large number of standardized culture conditions (nutrients, stresses, etc.). The so-called phenotypic microarrays (<xref ref-type="bibr" rid="B3">Bochner et al., 2001</xref>), along with available high-throughput analyses of transcriptomics, proteomics, and metabolomics of the corresponding strains are increasingly establishing the field of phenomics (<xref ref-type="bibr" rid="B1">Acin-Albiac et al., 2020</xref>). Phenomics attempts to assign functionalities to given genes on the basis of a data landscape much wider than the typical reductionist approach and linear logic of traditional genetics. Extensive analyses of this type are already available for <italic>E. coli</italic> (<xref ref-type="bibr" rid="B23">Nichols et al., 2011</xref>; <xref ref-type="bibr" rid="B24">Otsuka et al., 2015</xref>) and are quickly expanding to other species of interest. It comes as no surprise that phenomics has become a fertile target of machine learning approaches (<xref ref-type="bibr" rid="B16">L&#x00FC;rig et al., 2021</xref>), applicable to both wet datasets and text sources (<xref ref-type="bibr" rid="B17">Mao et al., 2016</xref>) in published literature. These new perspectives (which may also involve considerable investments in technological platforms) have the potential to change the way automated annotations and GOs will be established (<xref ref-type="bibr" rid="B8">Gkoutos et al., 2012</xref>) in the not-so-distant future.</p>
</sec>
</sec>
<sec id="S2" sec-type="conclusion">
<title>Conclusion</title>
<p>As mentioned in the introduction, we imagine a continuum from individual functions of gene products into a multilayer architecture of categories that can eventually contribute to the understanding of the whole cell&#x2019;s physiology. It should be clear that, although many examples have been related to GO, we did not aim to focus on limitations of precise ontologies or conceptual models, instead, we offer food for thought to connect step- by-step individual gene annotations with major conceptual frameworks of functioning of cells and bacteria.</p>
<p>We are well aware of the limited nature of this work. For instance, we have relied on one definition of function: the one we used corresponds to the so-called &#x201C;causal role function,&#x201D; whereas the &#x201C;selected effect function&#x201D; definition points to its evolutionary origin (<xref ref-type="bibr" rid="B28">Thomas, 2017</xref>).</p>
<p>The amount of work behind the current biological processes and functions has been immensely productive for classifying genes according to specific and easy-to-comprehend categories. But as biology moves toward a higher-level understanding of living systems, these classifications turn out to be insufficient to grasp what is going on&#x2014;and new criteria for functional annotations become needed. In fact, we still know very little of how genes and functions are involved in such system-level functioning of the software and hardware of cells (<xref ref-type="bibr" rid="B5">Danchin, 2009</xref>) or even to figure out what such &#x201C;major principles&#x201D; are. More precise answers to such questions may come from computational resources rich in annotations, similar to ontologies and databases, that are flexible enough to evaluate different models of what a cell is and to test in quantitative terms hypotheses such as &#x201C;physiology is more conserved than mechanisms,&#x201D; and other similar generalities.</p>
<p>It is known that microbial processes are quite poorly represented in GO compared to those for eukaryotic, multicellular organisms; this is no surprise given the origin of the GO Consortium, precisely within eukaryotic organisms (<xref ref-type="bibr" rid="B30">Tyler, 2009</xref>). Some potential avenues to fill the missing links of physiological categories in bacterial systems include higher-level descriptions, for instance as different modules (i.e., carbon source module; nitrogen source module; acid stress module, etc.) with their defined interactions, where the major objectives of the genetic programs of cells become explicit. Modeling their dynamic behavior as a consequence of changes in growth conditions with the regulatory machinery pointing to specific mechanisms of gene regulation would advance the mapping of the genotype to its physiological capabilities and their flexibility in bacteria.</p>
<p>Defining or even formalizing different types of relations and concepts clearly distinguishing mechanistic, physiological, and evolutionary territories should help in clarifying the complex structure of biological knowledge.</p>
</sec>
<sec id="S5" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="S3">
<title>Author Contributions</title>
<p>JC-V, PG, and VL participated in the discussions and in the writing of the ideas presented in the manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="pudiscl1" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="S4" sec-type="funding-information">
<title>Funding</title>
<p>JC-V acknowledges funding from Universidad Nacional Aut&#x00F3;noma de M&#x00E9;xico (UNAM), previous support of a sabbatical in Barcelona by PASPA DGAPA-UNAM, as well as funding by NIGMS-NIH grant number 5RO1GM131643. Work in the VL Laboratory is funded by grants RTI2018-095584-B-C42, ERA-COBIOTECH 2018-PCI2019-111859-2, FET-OPEN-RIA-2017-1-766975, NMBP-BIO-2018-814650, BIO-CN-2019-870294, S2017/BMD-3691, and Y2020/TCS-6555.</p>
</sec>
<ack><p>We acknowledge C&#x00E9;sar Bonavides for final formatting of the manuscript.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Acin-Albiac</surname> <given-names>M.</given-names></name> <name><surname>Filannino</surname> <given-names>P.</given-names></name> <name><surname>Gobbetti</surname> <given-names>M.</given-names></name> <name><surname>Di Cagno</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>Microbial high throughput phenomics: the potential of an irreplaceable omics.</article-title> <source><italic>Comput. Struct. Biotechnol. J.</italic></source> <volume>18</volume> <fpage>2290</fpage>&#x2013;<lpage>2299</lpage>. <pub-id pub-id-type="doi">10.1016/j.csbj.2020.08.010</pub-id> <pub-id pub-id-type="pmid">32994888</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashburner</surname> <given-names>M.</given-names></name> <name><surname>Ball</surname> <given-names>C. A.</given-names></name> <name><surname>Blake</surname> <given-names>J. A.</given-names></name> <name><surname>Botstein</surname> <given-names>D.</given-names></name> <name><surname>Butler</surname> <given-names>H.</given-names></name> <name><surname>Cherry</surname> <given-names>J. M.</given-names></name><etal/></person-group> (<year>2000</year>). <article-title>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>25</volume> <fpage>25</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1038/75556</pub-id> <pub-id pub-id-type="pmid">10802651</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bochner</surname> <given-names>B. R.</given-names></name> <name><surname>Gadzinski</surname> <given-names>P.</given-names></name> <name><surname>Panomitros</surname> <given-names>E.</given-names></name></person-group> (<year>2001</year>). <article-title>Phenotype microarrays for high-throughput phenotypic testing and assay of gene function.</article-title> <source><italic>Genome Res.</italic></source> <volume>11</volume> <fpage>1246</fpage>&#x2013;<lpage>1255</lpage>. <pub-id pub-id-type="doi">10.1101/gr.186501</pub-id> <pub-id pub-id-type="pmid">11435407</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carbon</surname> <given-names>S.</given-names></name> <name><surname>Douglass</surname> <given-names>E.</given-names></name> <name><surname>Good</surname> <given-names>B. M.</given-names></name> <name><surname>Unni</surname> <given-names>D. R.</given-names></name> <name><surname>Harris</surname> <given-names>N. L.</given-names></name> <name><surname>Mungall</surname> <given-names>C. J.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>The Gene Ontology resource: enriching a GOld mine.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>49</volume> <fpage>D325</fpage>&#x2013;<lpage>D334</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkaa1113</pub-id> <pub-id pub-id-type="pmid">33290552</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Danchin</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Bacteria as computers making computers.</article-title> <source><italic>FEMS Microbiol. Rev.</italic></source> <volume>33</volume> <fpage>3</fpage>&#x2013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1111/j.1574-6976.2008.00137.x</pub-id> <pub-id pub-id-type="pmid">19016882</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gama-Castro</surname> <given-names>S.</given-names></name> <name><surname>Salgado</surname> <given-names>H.</given-names></name> <name><surname>Santos-Zavaleta</surname> <given-names>A.</given-names></name> <name><surname>Ledezma-Tejeida</surname> <given-names>D.</given-names></name> <name><surname>Mu&#x00F1;iz-Rascado</surname> <given-names>L.</given-names></name> <name><surname>Garc&#x00ED;a-Sotelo</surname> <given-names>J. S.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>D133</fpage>&#x2013;<lpage>D143</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv1156</pub-id> <pub-id pub-id-type="pmid">26527724</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gaudet</surname> <given-names>P.</given-names></name> <name><surname>Livstone</surname> <given-names>M. S.</given-names></name> <name><surname>Lewis</surname> <given-names>S. E.</given-names></name> <name><surname>Thomas</surname> <given-names>P. D.</given-names></name></person-group> (<year>2011</year>). <article-title>Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>12</volume> <fpage>449</fpage>&#x2013;<lpage>462</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbr042</pub-id> <pub-id pub-id-type="pmid">21873635</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gkoutos</surname> <given-names>G. V.</given-names></name> <name><surname>Schofield</surname> <given-names>P. N.</given-names></name> <name><surname>Hoehndorf</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>Computational tools for comparative phenomics: the role and promise of ontologies.</article-title> <source><italic>Mamm. Genome</italic></source> <volume>23</volume> <fpage>669</fpage>&#x2013;<lpage>679</lpage>. <pub-id pub-id-type="doi">10.1007/s00335-012-9404-4</pub-id> <pub-id pub-id-type="pmid">22814867</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heyde</surname> <given-names>S. A. H.</given-names></name> <name><surname>Frendorf</surname> <given-names>P. O.</given-names></name> <name><surname>Lauritsen</surname> <given-names>I.</given-names></name> <name><surname>N&#x00F8;rholm</surname> <given-names>M. H. H.</given-names></name></person-group> (<year>2021</year>). <article-title>Restoring global gene regulation through experimental evolution uncovers a NAP (Nucleoid-Associated Protein)-like behavior of Crp/Cap.</article-title> <source><italic>mBio</italic></source> <volume>12</volume>:<issue>e0202821</issue>. <pub-id pub-id-type="doi">10.1128/mBio.02028-21</pub-id> <pub-id pub-id-type="pmid">34700380</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Furumichi</surname> <given-names>M.</given-names></name> <name><surname>Tanabe</surname> <given-names>M.</given-names></name> <name><surname>Sato</surname> <given-names>Y.</given-names></name> <name><surname>Morishima</surname> <given-names>K.</given-names></name></person-group> (<year>2017</year>). <article-title>KEGG: new perspectives on genomes, pathways, diseases and drugs.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>45</volume> <fpage>D353</fpage>&#x2013;<lpage>D361</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw1092</pub-id> <pub-id pub-id-type="pmid">27899662</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karp</surname> <given-names>P. D.</given-names></name> <name><surname>Billington</surname> <given-names>R.</given-names></name> <name><surname>Caspi</surname> <given-names>R.</given-names></name> <name><surname>Fulcher</surname> <given-names>C. A.</given-names></name> <name><surname>Latendresse</surname> <given-names>M.</given-names></name> <name><surname>Kothari</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>The BioCyc collection of microbial genomes and metabolic pathways.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>20</volume> <fpage>1085</fpage>&#x2013;<lpage>1093</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbx085</pub-id> <pub-id pub-id-type="pmid">29447345</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keseler</surname> <given-names>I. M.</given-names></name> <name><surname>Gama-Castro</surname> <given-names>S.</given-names></name> <name><surname>Mackie</surname> <given-names>A.</given-names></name> <name><surname>Billington</surname> <given-names>R.</given-names></name> <name><surname>Bonavides-Mart&#x00ED;nez</surname> <given-names>C.</given-names></name> <name><surname>Caspi</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>The EcoCyc database in 2021.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>12</volume>:<issue>711077</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2021.711077</pub-id> <pub-id pub-id-type="pmid">34394059</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lauritsen</surname> <given-names>I.</given-names></name> <name><surname>Frendorf</surname> <given-names>P. O.</given-names></name> <name><surname>Capucci</surname> <given-names>S.</given-names></name> <name><surname>Heyde</surname> <given-names>S. A. H.</given-names></name> <name><surname>Blomquist</surname> <given-names>S. D.</given-names></name> <name><surname>Wendel</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Temporal evolution of master regulator Crp identifies pyrimidines as catabolite modulator factors.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>12</volume>:<issue>5880</issue>. <pub-id pub-id-type="doi">10.1038/s41467-021-26098-x</pub-id> <pub-id pub-id-type="pmid">34620864</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><collab>Lavoisier, A. L.</collab> (<year>1790/1965</year>). <source><italic>Elements of Chemistry.</italic></source> <publisher-loc>Mineola, NY</publisher-loc>: <publisher-name>Dover Publications</publisher-name>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ledezma-Tejeida</surname> <given-names>D.</given-names></name> <name><surname>Altamirano-Pacheco</surname> <given-names>L.</given-names></name> <name><surname>Fajardo</surname> <given-names>V.</given-names></name> <name><surname>Collado-Vides</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Limits to a classic paradigm: most transcription factors in <italic>E. coli</italic> regulate genes involved in multiple biological processes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>6656</fpage>&#x2013;<lpage>6667</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz525</pub-id> <pub-id pub-id-type="pmid">31194874</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>L&#x00FC;rig</surname> <given-names>M. D.</given-names></name> <name><surname>Donoughe</surname> <given-names>S.</given-names></name> <name><surname>Svensson</surname> <given-names>E. I.</given-names></name> <name><surname>Porto</surname> <given-names>A.</given-names></name> <name><surname>Tsuboi</surname> <given-names>M.</given-names></name></person-group> (<year>2021</year>). <article-title>Computer vision, machine learning, and the promise of phenomics in ecology and evolutionary biology.</article-title> <source><italic>Front. Ecol. Evol.</italic></source> <volume>9</volume>:<issue>642774</issue>. <pub-id pub-id-type="doi">10.3389/fevo.2021.642774</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mao</surname> <given-names>J.</given-names></name> <name><surname>Moore</surname> <given-names>L. R.</given-names></name> <name><surname>Blank</surname> <given-names>C. E.</given-names></name> <name><surname>Wu</surname> <given-names>E. H.</given-names></name> <name><surname>Ackerman</surname> <given-names>M.</given-names></name> <name><surname>Ranade</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Microbial phenomics information extractor (MicroPIE): a natural language processing tool for the automated acquisition of prokaryotic phenotypic characters from text sources.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>17</volume>:<issue>528</issue>. <pub-id pub-id-type="doi">10.1186/s12859-016-1396-8</pub-id> <pub-id pub-id-type="pmid">27955641</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mej&#x00ED;a-Almonte</surname> <given-names>C.</given-names></name> <name><surname>Busby</surname> <given-names>S. J. W.</given-names></name> <name><surname>Wade</surname> <given-names>J. T.</given-names></name> <name><surname>van Helden</surname> <given-names>J.</given-names></name> <name><surname>Arkin</surname> <given-names>A. P.</given-names></name> <name><surname>Stormo</surname> <given-names>G. D.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Redefining fundamental concepts of transcription initiation in bacteria.</article-title> <source><italic>Nat. Rev. Genet.</italic></source> <volume>21</volume> <fpage>699</fpage>&#x2013;<lpage>714</lpage>. <pub-id pub-id-type="doi">10.1038/s41576-020-0254-8</pub-id> <pub-id pub-id-type="pmid">32665585</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mej&#x00ED;a-Almonte</surname> <given-names>C.</given-names></name> <name><surname>Collado-Vides</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>Towards the prokaryotic regulation ontology: an ontological model to infer gene regulation physiology from mechanisms in bacteria</article-title>,&#x201D; in <source><italic>Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019)</italic></source> (<publisher-loc>Set&#x00FA;bal</publisher-loc>: <publisher-name>SciTePress</publisher-name>), <fpage>495</fpage>&#x2013;<lpage>499</lpage>.</citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Milanesio</surname> <given-names>P.</given-names></name> <name><surname>Arce-Rodr&#x00ED;guez</surname> <given-names>A.</given-names></name> <name><surname>Mu&#x00F1;oz</surname> <given-names>A.</given-names></name> <name><surname>Calles</surname> <given-names>B.</given-names></name> <name><surname>de Lorenzo</surname> <given-names>V.</given-names></name></person-group> (<year>2011</year>). <article-title>Regulatory exaptation of the catabolite repression protein (Crp)-cAMP system in <italic>Pseudomonas putida</italic>.</article-title> <source><italic>Environ. Microbiol.</italic></source> <volume>13</volume> <fpage>324</fpage>&#x2013;<lpage>339</lpage>. <pub-id pub-id-type="doi">10.1111/j.1462-2920.2010.02331.x</pub-id> <pub-id pub-id-type="pmid">21281420</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neidhardt</surname> <given-names>F.</given-names></name> <name><surname>Ingraham</surname> <given-names>J. L.</given-names></name> <name><surname>Schaechter</surname> <given-names>M.</given-names></name></person-group> (<year>1990</year>). <source><italic>Physiology of the Bacterial Cell. A Molecular Approcah.</italic></source> <publisher-loc>Sunderland, MA</publisher-loc>: <publisher-name>Sinauer Associates Inc</publisher-name>.</citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname> <given-names>P. C.</given-names></name> <name><surname>Wong</surname> <given-names>E. D.</given-names></name> <name><surname>MacPherson</surname> <given-names>K. A.</given-names></name> <name><surname>Aleksander</surname> <given-names>S.</given-names></name> <name><surname>Argasinska</surname> <given-names>J.</given-names></name> <name><surname>Dunn</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Transcriptome visualization and data availability at the <italic>Saccharomyces</italic> genome database.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>48</volume> <fpage>D743</fpage>&#x2013;<lpage>D748</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz892</pub-id> <pub-id pub-id-type="pmid">31612944</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nichols</surname> <given-names>R. J.</given-names></name> <name><surname>Sen</surname> <given-names>S.</given-names></name> <name><surname>Choo</surname> <given-names>Y. J.</given-names></name> <name><surname>Beltrao</surname> <given-names>P.</given-names></name> <name><surname>Zietek</surname> <given-names>M.</given-names></name> <name><surname>Chaba</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Phenotypic landscape of a bacterial cell.</article-title> <source><italic>Cell</italic></source> <volume>144</volume> <fpage>143</fpage>&#x2013;<lpage>156</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2010.11.052</pub-id> <pub-id pub-id-type="pmid">21185072</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Otsuka</surname> <given-names>Y.</given-names></name> <name><surname>Muto</surname> <given-names>A.</given-names></name> <name><surname>Takeuchi</surname> <given-names>R.</given-names></name> <name><surname>Okada</surname> <given-names>C.</given-names></name> <name><surname>Ishikawa</surname> <given-names>M.</given-names></name> <name><surname>Nakamura</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>GenoBase: comprehensive resource database of <italic>Escherichia coli</italic> K-12.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume> <fpage>D606</fpage>&#x2013;<lpage>D617</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gku1164</pub-id> <pub-id pub-id-type="pmid">25399415</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Riley</surname> <given-names>M.</given-names></name></person-group> (<year>1993</year>). <article-title>Functions of the gene products of <italic>Escherichia coli</italic>.</article-title> <source><italic>Microbiol. Rev.</italic></source> <volume>57</volume> <fpage>862</fpage>&#x2013;<lpage>952</lpage>. <pub-id pub-id-type="doi">10.1128/mmbr.57.4.862-952.1993</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Santos-Zavaleta</surname> <given-names>A.</given-names></name> <name><surname>Salgado</surname> <given-names>H.</given-names></name> <name><surname>Gama-Castro</surname> <given-names>S.</given-names></name> <name><surname>S&#x00E1;nchez-P&#x00E9;rez</surname> <given-names>M.</given-names></name> <name><surname>G&#x00F3;mez-Romero</surname> <given-names>L.</given-names></name> <name><surname>Ledezma-Tejeida</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>RegulonDB v 10.5: tackling challenges to unify classic and high throughput knowledge of gene regulation in <italic>E. coli</italic> K-12.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>D212</fpage>&#x2013;<lpage>D220</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1077</pub-id> <pub-id pub-id-type="pmid">30395280</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serres</surname> <given-names>M. H.</given-names></name> <name><surname>Riley</surname> <given-names>M.</given-names></name></person-group> (<year>2000</year>). <article-title>MultiFun, a multifunctional classification scheme for <italic>Escherichia coli</italic> K-12 gene products.</article-title> <source><italic>Microb. Comp. Genomics</italic></source> <volume>5</volume> <fpage>205</fpage>&#x2013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1089/omi.1.2000.5.205</pub-id> <pub-id pub-id-type="pmid">11471834</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>P. D.</given-names></name></person-group> (<year>2017</year>). <article-title>The Gene Ontology and the meaning of biological function.</article-title> <source><italic>Methods Mol. Biol.</italic></source> <volume>1446</volume> <fpage>15</fpage>&#x2013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-3743-1_2</pub-id> <pub-id pub-id-type="pmid">27812932</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>P. D.</given-names></name> <name><surname>Hill</surname> <given-names>D. P.</given-names></name> <name><surname>Mi</surname> <given-names>H.</given-names></name> <name><surname>Osumi-Sutherland</surname> <given-names>D.</given-names></name> <name><surname>Van Auken</surname> <given-names>K.</given-names></name> <name><surname>Carbon</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Gene Ontology causal activity modeling (GO-CAM) moves beyond GO annotations to structured descriptions of biological functions and systems.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>51</volume> <fpage>1429</fpage>&#x2013;<lpage>1433</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-019-0500-1</pub-id> <pub-id pub-id-type="pmid">31548717</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tyler</surname> <given-names>B. M.</given-names></name></person-group> (<year>2009</year>). <article-title>Viewing the microbial world through the lens of the Gene Ontology.</article-title> <source><italic>Trends Microbiol.</italic></source> <volume>17</volume> <fpage>259</fpage>&#x2013;<lpage>261</lpage>. <pub-id pub-id-type="doi">10.1016/j.tim.2009.05.002</pub-id> <pub-id pub-id-type="pmid">19577929</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.yeastgenome.org/ontology/phenotype/ypo#network">https://www.yeastgenome.org/ontology/phenotype/ypo#network</ext-link></p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="http://geneontology.org/docs/ontology-documentation/">http://geneontology.org/docs/ontology-documentation/</ext-link></p></fn>
<fn id="footnote3">
<label>3</label>
<p><ext-link ext-link-type="uri" xlink:href="http://regulondb.ccg.unam.mx/gensorunit?term=GU0000127697&#x0026;organism=ECK12&#x0026;format=jsp&#x0026;type=gensorunit">http://regulondb.ccg.unam.mx/gensorunit?term=GU0000127697&#x0026;organism=ECK12&#x0026;format=jsp&#x0026;type=gensorunit</ext-link></p></fn>
</fn-group>
</back>
</article>