Visualizing mechanistic models by integrating site-specific molecular details into reaction networks

Vasilescu, Dan; Schaff, James C.; Moraru, Ion I.; Blinov, Michael L.

doi:10.3389/fmolb.2025.1681081

ORIGINAL RESEARCH article

Front. Mol. Biosci., 20 November 2025

Sec. Biological Modeling and Simulation

Volume 12 - 2025 | https://doi.org/10.3389/fmolb.2025.1681081


Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, Farmington, CT, United States


Abstract

Mechanistic modeling in biology aims to describe biological processes based on details of molecular mechanisms and interactions. Rule-based mechanistic modeling enables the simulation of biological systems while explicitly accounting for molecular details, such as protein domains and their specific interactions. Traditionally, mechanistic models are visually represented by reaction or pathway diagrams that depict transformations and modifications of chemical species. In contrast, rule-based descriptions effectively encode the detailed specificity of individual interactions (e.g., how phosphorylation at a particular residue affects binding affinity elsewhere in the same protein complex) in a compact and precise form, but are difficult to integrate into comprehensive visual representations. Here, we introduce Molecular Process Diagrams, an approach to embed rule-based specificity directly within reaction network diagrams. Our method highlights three fundamental elements: interacting molecular complexes, molecular sites directly modified by a rule, and molecular sites that modulate but are not directly modified by interactions (e.g., phosphorylation-dependent affinity changes). Implemented at multiple resolution levels within the Virtual Cell (VCell) software, these diagrams maintain pathway-like clarity while accurately reflecting detailed molecular interactions. Additionally, we demonstrate compatibility with Systems Biology Graphical Notation (SBGN) process diagrams, ensuring standardized visual conventions.

1 Introduction

Mechanistic kinetic models are critical for predicting dynamics and understanding mechanisms underlying various biological processes, as biomolecular interactions govern most regulatory mechanisms. Large biomolecules such as proteins, RNA, and DNA typically contain multiple functional components, including phosphorylation sites and domains like SH2/PTB (Pawson and Nash, 2003; Pawson, 2004; Seet et al., 2006; Olsen et al., 2006). Importantly, interactions among biomolecules are often dependent on site-specific molecular details. For example, early studies on receptor tyrosine kinase signaling, such as the Epidermal Growth Factor (EGF) receptor pathway, demonstrated that binding of adapter proteins depends on phosphorylation of specific receptor tyrosine residues and the corresponding affinity of adapter protein domains (SH2 or PTB) (Kholodenko et al., 1999). Similarly, IgE receptor (FcεRI) signaling in allergic responses depends on site-specific phosphorylation events required for kinase Syk activation, which is conditioned by phosphorylation of tyrosine residues in the linker and activation loop regions by other kinases (Siraganian et al., 2002; Chylek et al., 2014).

As experimental information about site-specific interactions continues to accumulate, models incorporating such details have become increasingly relevant. However, integrating site-specific details into conventional mechanistic models—which explicitly represent reaction networks—often results in combinatorial explosion, generating excessively large sets of molecular species and reactions (Hlavacek et al., 2006; Mayer et al., 2009). For instance, the EGF receptor alone has nine independently phosphorylated tyrosines, leading to 2⁹ = 512 distinct receptor states. Enumerating each reaction explicitly to construct deterministic or stochastic mathematical models quickly becomes impractical, highlighting an intrinsic limitation of conventional modeling frameworks.
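The count above can be reproduced with a minimal sketch, assuming each of the nine tyrosines is independently either unphosphorylated ("U") or phosphorylated ("P"). The function name is illustrative and does not come from any modeling tool.

```python
from itertools import product


def receptor_states(n_sites):
    """Enumerate every phosphorylation pattern of n independent sites.

    Each site is assumed to be either unphosphorylated ("U") or
    phosphorylated ("P"); binding states are ignored in this sketch.
    """
    return list(product("UP", repeat=n_sites))


states = receptor_states(9)
print(len(states))  # 512
```

Adding binding states or further molecules multiplies this number again, which is why explicit enumeration becomes impractical so quickly.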

Rule-based modeling methods address this combinatorial complexity by providing compact, precise, and scalable representations of biomolecular interactions (Hlavacek et al., 2006; Blinov et al., 2004; Sneddon et al., 2011; Chylek et al., 2015; Harris et al., 2016). Multiple rule-based models are published every year. In 2025 alone, at least three have appeared: Millan et al. (2025), Bartol et al. (2025), and Wan et al. (2025). Such models specify rules governing interactions among multi-site molecules, capturing all necessary site-specific details succinctly (Chylek et al., 2014). Nevertheless, the detailed complexity captured by rule-based models must be communicated clearly to the human user for effective interpretation, evaluation, and reuse. Conventional reaction network diagrams, which are commonly used to illustrate biological pathways, generally can be used to visualize only reaction networks explicitly generated from rule-based models. As a result, they are limited to visualizing relatively small models. In contrast, rule-based models can generate networks containing hundreds to thousands of chemical species, and in some cases may even yield potentially infinite networks, such as those arising from polymerization. Furthermore, conventional reaction networks are not designed to represent the internal constituents of chemical species. By comparison, networks generated through rule-based approaches capture an important feature: each species is defined by its molecular composition, represented as a connected set of molecules, where individual molecules may contain additional modification sites. Reaction rules then specify changes in connectivity and/or modification states at these sites. Omitting these important features (for example, the ability to track an adapter protein as it progresses through a set of rules) limits the insights that reaction networks can provide, even in cases where they are applicable for visualizing networks generated by rule-based models.

To address this visualization challenge, we started from the rule-based modeling capabilities previously integrated into the widely used Virtual Cell (VCell) modeling and simulation platform (Schaff et al., 1997; Moraru et al., 2008). Although the VCell implementation included a graphical user interface for visually specifying reaction rules (Schaff et al., 2016; Blinov et al., 2017), it shared a fundamental limitation common to existing rule-based modeling tools: rules could not be collectively represented in a diagram resembling traditional reaction networks or pathways.

In this manuscript, we introduce Molecular Process Diagrams (MPDs), a visualization approach designed specifically to bridge the gap between detailed rule-based specifications and conventional pathway diagrams. MPDs integrate rule-based modeling details, including molecular site-specific interactions, into familiar reaction network visualizations, thus maintaining intuitive clarity without sacrificing precision.

In the following sections, we provide a brief background on rule-based modeling and review existing visualization approaches. We then describe the core principles of MPDs and illustrate their implementation within Virtual Cell. Finally, we discuss how MPDs align with established visualization standards in the community—particularly Systems Biology Graphical Notation (SBGN) (Le Novère et al., 2009)—and propose compatible conventions within the SBGN Process Diagram format.

2 Methods

2.1 Reaction networks

Traditionally, biological pathways are depicted as bipartite graphs, where nodes represent either molecular entities or processes, connected by edges indicating participation. Typically, the mathematical representation of a pathway mirrors its visual depiction. For example, the initial signaling events involving the EGF receptor (EGFR), described in multiple modeling papers (e.g., Kholodenko et al., 1999; Schoeberl et al., 2002; Blinov et al., 2006a; Huang et al., 2017), include the following steps: (1) ligand (EGF) binding to EGFR; (2) dimerization of receptor-ligand complexes; (3) receptor tyrosine transphosphorylation; and (4) dephosphorylation of unprotected receptor sites by phosphatases. Figure 1 shows a reaction network corresponding to these events, created within the VCell modeling environment. In conventional modeling software tools such as VCell (Schaff et al., 1997), COPASI (Hoops et al., 2006), or CellDesigner (Funahashi et al., 2003), each chemical species and reaction must be explicitly and manually specified and localized to appropriate compartments (membranes or volumes). However, manual specification inherently limits the complexity and scale of representable models due to the rapidly growing number of potential species and interactions.

FIGURE 1

Reaction network showing EGF interacting with R, forming Ra. The pathway proceeds through several steps, including R2 and RP, with arrows indicating the flow. Green circles and yellow squares represent molecules or complexes. — Visual representation of a reaction network created in Virtual Cell (VCell). Ligand EGF binds receptor R (EGFR) forming the complex Ra (EGF-bound EGFR), which subsequently dimerizes to produce R2 (dimeric EGFR complex). This dimer then undergoes phosphorylation, resulting in the fully phosphorylated receptor dimer, RP.

2.2 Rule-based approach

Rule-based modeling overcomes limitations posed by conventional reaction network approaches by providing a compact and systematic method to represent biomolecular interactions. In this framework, a model is defined by reaction rules—precise descriptions of biomolecular transformations and interactions based on specific molecular features. Rather than explicitly defining every molecular species, each rule defines patterns of molecular states or features required for reactions, enabling a single rule to represent numerous individual reactions (Figure 2). Figure 3 illustrates the steps in EGFR signaling using rule-based modeling conventions implemented in VCell (Schaff et al., 2016).
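To make the idea of one rule standing for many reactions concrete, the following is a simplified sketch of how the ligand-binding rule of Figure 2 expands into four explicit reactions. The dictionary encoding, the use of `None` for an unconstrained site ("?"), and the function names are assumptions made for illustration; they are not the BioNetGen or VCell data model, and only phosphorylation states (not binding states) of the tyrosines are expanded.

```python
from itertools import product

# Hypothetical encoding of the EGFR reactant pattern in the binding rule.
# None marks a site whose state the rule does not constrain ("?").
EGFR_PATTERN = {"ecd": "unbound", "tmd": "unbound", "Y1": None, "Y2": None}

# Assumed state space of the unconstrained sites.
SITE_STATES = {"Y1": ("U", "P"), "Y2": ("U", "P")}


def expand_pattern(pattern, site_states):
    """Return every fully specified species that matches the pattern."""
    free = [site for site, state in pattern.items() if state is None]
    expanded = []
    for combo in product(*(site_states[site] for site in free)):
        species = dict(pattern)
        species.update(zip(free, combo))
        expanded.append(species)
    return expanded


def binding_reactions(pattern, site_states):
    """Apply the rule (ecd: unbound -> bound) to each matching species."""
    reactions = []
    for receptor in expand_pattern(pattern, site_states):
        bound_receptor = dict(receptor, ecd="bound")
        reactions.append((receptor, bound_receptor))
    return reactions


reactions = binding_reactions(EGFR_PATTERN, SITE_STATES)
print(len(reactions))  # 4
```

Each additional unconstrained site multiplies the number of generated reactions, while the rule itself stays the same size.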

FIGURE 2

Two diagrams labeled A and B illustrating the interaction between EGF and EGFR, in the context of EGFR phosphorylation. The panel A shows EGF binding to EGFR with uncertain phosphorylation at Y1 and Y2. The panel B shows four particular cases of EGF binding with Y1 and Y2 being either unphosphorylated or phosphorylated, indicating their activity states. — Visualization of a ligand-receptor binding rule in VCell. **(A)** This rule has two reactant patterns (ligand EGF and receptor EGFR) and one product pattern (EGF-EGFR complex). EGFR contains four interaction sites: extracellular domain (“ecd,” binds EGF), transmembrane domain (“tmd,” mediates receptor dimerization), and two distinct tyrosine residues (“Y1” and “Y2,” can be phosphorylated or involved in binding). Sites shown in yellow must be unbound; white indicates sites irrelevant for the rule (tyrosines can be phosphorylated/unphosphorylated and bound/unbound, as indicated by “?”). **(B)** Four explicit reactions generated by this rule, demonstrating all combinations of tyrosine phosphorylation states. “Mem” denotes membrane localization; “ec” denotes extracellular space.

FIGURE 3

Diagram showing the EGFR signaling pathway. It depicts the binding of EGF to EGFR, leading to dimerization and phosphorylation. Each step illustrates molecular interactions, including binding of EGF to EGFR, and phosphorylation/dephosphorylation of sites Y1 and Y2. — Initial steps in EGFR signaling visualized in VCell cartoons. The steps include: (1) EGF ligand binding to EGFR, conditional only on the extracellular binding site availability; (2) receptor dimerization, conditional on ligand binding to extracellular sites; (3) transphosphorylation of receptor tyrosines conditional on transmembrane dimerization; and (4) dephosphorylation of tyrosines.

Rule-based models can be automatically translated into conventional reaction networks by software tools such as BioNetGen (Blinov et al., 2004; Faeder et al., 2009; Harris et al., 2016). This automated generation reduces the burden of explicitly enumerating every possible molecular species, facilitating the execution of time course simulations. Simulation results can subsequently be processed to yield relevant observables, such as the total abundance of molecules with specific phosphorylation states. For cases involving extremely large or even infinite reaction networks, “network-free” agent-based simulation techniques, as implemented in tools like NFsim (Sneddon et al., 2011), provide an efficient alternative. Other rule-based modeling software includes Simmune (Meier-Schellersheim et al., 2006), Kappa (Danos et al., 2007a; Danos et al., 2007b), rxncon (Tiger et al., 2012), PySB (Lopez et al., 2013). Rule-based modeling is also implemented in general-purpose tools such as VCell (Schaff et al., 2016) and MCell (Husar et al., 2024). Of note, beyond their computational utility, rule-based models can also serve as comprehensive repositories of molecular interaction details at the site-specific level (Thomson et al., 2011; Chylek et al., 2014).

2.3 Visualization of rule-based models

Visualizing rule-based models poses significant challenges due to the potentially vast size and complexity of reaction networks generated by rules. For example, a detailed EGFR signaling model involving proteins such as Shc, Grb2, and Sos (Blinov et al., 2006a) includes 3,749 interactions among 356 distinct species, precluding practical visualization as a single comprehensive reaction network.

Several visualization approaches have been proposed to address these limitations. Early methods involved depicting each reaction rule individually as a cartoon illustrating its reactants and products (Meier-Schellersheim et al., 2006; Blinov et al., 2006b; Blinov et al., 2006c). This method, implemented in tools like Simmune (Cheng et al., 2014; Zhang et al., 2013), VCell (Schaff et al., 2016) (Figure 3), and, more recently, bnglViz (Liguori-Bills and Blinov, 2024), facilitates understanding of individual rules but fails to provide global insight into overall system connectivity. Consequently, the approach is often confusing for biologists accustomed to pathway-oriented representations.

Although reaction networks visually resemble pathway diagrams—since products from one reaction can become reactants in another—this is rarely the case for rule-based models. Here, reactants and products are defined by broader molecular patterns, making direct chaining between rules uncommon (Blinov and Moraru, 2012). Thus, a straightforward graphical translation of rules often results in disconnected diagrams (Figure 4).

FIGURE 4

Graph representation illustrating reaction rules with distinct reactant and product patterns as separate nodes. The resulting bipartite graph is often highly disconnected due to distinct pattern definitions across rules, as illustrated by the detailed state representations of multiple monomeric EGFR states shown on the right. On the left, EGF binds to EGFR on the membrane, progressing to form EGFR dimers. The right side shows structural diagrams of EGFR variations with possible modifications, depicted with labels like “ecd,” “tmd,” and “Y1” or “Y2.” Red arrows correspond to different EGFR modifications resulting from binding events, with possible phosphorylation marked as “p”.

Alternative, compact visualizations include Molecular Interaction Maps (Kohn et al., 2006) and Extended Contact Maps (Chylek et al., 2011). These maps illustrate possible interactions between molecules at the level of individual molecular sites. While effective at compactly summarizing potential molecular interactions, they fail to clearly represent the temporal or directional flow of information.

Other visualization approaches, such as rule influence diagrams (Smith et al., 2012), atom-rule graphs (Sekar et al., 2017), and the Simmune NetworkViewer (Zhang et al., 2013), aim to illustrate the relationships and influences among rules or molecular features. The NetworkViewer, for example, creates a bipartite graph linking rules to specific reactant/product patterns, closely resembling traditional pathways. However, this approach rapidly becomes impractical when molecules have multiple binding configurations, because the number of pattern nodes then grows combinatorially.

To address these limitations, we propose a new visualization approach based on bipartite graphs with two types of nodes: “process” nodes, representing reaction rules, and “molecular complex” nodes, representing collections of molecular species defined by shared reactant or product patterns. This mirrors the classical reaction network formalism, which also employs two node types: “reaction” nodes and “species” nodes. However, while species nodes in reaction networks represent fully defined species, “molecular complex” nodes instead encompass all possible species with a given molecular composition, as specified by the molecules included in reactant and product patterns.

We discuss this approach in more detail in the following section, but note here that, similar to classical reaction networks, the Molecular Process Diagram (MPD) formalism preserves connectivity and clarity, providing an intuitive view of information flow across modeled interactions. Importantly, it scales linearly with the number of reaction rules: the number of “process” nodes equals the number of rules, while the number of “molecular complex” nodes is bounded by the total number of reactant and product patterns across all rules. That total is itself at most the number of rules multiplied by the maximum number of reactant and product patterns in any single rule, which mitigates the challenge of combinatorial complexity.
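The scaling argument can be written compactly. The symbols below are introduced here for illustration only: $R$ is the number of reaction rules, and $r_i$ and $p_i$ are the numbers of reactant and product patterns in rule $i$.

```latex
\begin{align}
N_{\text{process}} &= R, \\
N_{\text{complex}} &\le \sum_{i=1}^{R} (r_i + p_i)
  \;\le\; R \cdot \max_{1 \le i \le R} (r_i + p_i).
\end{align}
```

As a rough check, assume a six-rule model in which two rules have two reactant patterns and one product pattern, and four rules have one of each. The sum is then $2 \cdot 3 + 4 \cdot 2 = 14$ and the looser bound is $6 \cdot 3 = 18$. Patterns with the same molecular composition share a node, so the actual number of molecular complex nodes can be much smaller than either bound.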

3 Results

3.1 Molecular process diagrams (MPDs)

The central objective of our visualization approach is to represent rule-based models clearly and intuitively, balancing detailed rule-specific information with a coherent depiction of information flow through the system. We define a molecular complex as a collection of molecules specified by reactant or product patterns. For instance, in the reaction rule depicted in Figure 2, the reactant side comprises two distinct molecular complexes, while the product side is represented by a single complex.

We introduce Molecular Process Diagrams (MPDs) as bipartite graphs composed of two distinct node types: “molecular complexes” and “processes.” The number of process nodes equals the number of reaction rules, while the number of molecular complex nodes is typically fewer than or equal to the total number of unique reactant and product patterns across all rules. For example, the six reaction rules depicted individually in Figure 3 correspond to a compact MPD consisting of six process nodes and only four molecular complex nodes (Figure 5A).
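The construction can be sketched in a few lines of Python. This is an illustrative reading of the definition above, not the VCell implementation: the rule list is a hypothetical encoding of the six EGFR rules in which each pattern is reduced to the molecules it contains, and a molecular complex node is identified by that composition alone.

```python
from collections import Counter

# Hypothetical encoding: (rule name, reactant patterns, product patterns).
# Each pattern lists only the molecules it contains; site states are dropped.
RULES = [
    ("1: ligand binding", [["EGF"], ["EGFR"]], [["EGF", "EGFR"]]),
    ("2: monomer modification", [["EGFR"]], [["EGFR"]]),
    ("3: monomer modification", [["EGFR"]], [["EGFR"]]),
    ("4: dimerization", [["EGFR"], ["EGFR"]], [["EGFR", "EGFR"]]),
    ("5: dimer modification", [["EGFR", "EGFR"]], [["EGFR", "EGFR"]]),
    ("6: dimer modification", [["EGFR", "EGFR"]], [["EGFR", "EGFR"]]),
]


def complex_key(pattern):
    """Identify a molecular complex by its composition, ignoring site states."""
    return tuple(sorted(pattern))


def build_mpd(rules):
    """Return process nodes, molecular complex nodes, and weighted edges."""
    process_nodes = []
    complex_nodes = set()
    edges = Counter()  # edge -> stoichiometry
    for name, reactants, products in rules:
        process_nodes.append(name)
        for pattern in reactants:
            key = complex_key(pattern)
            complex_nodes.add(key)
            edges[(key, name)] += 1  # consumption arc
        for pattern in products:
            key = complex_key(pattern)
            complex_nodes.add(key)
            edges[(name, key)] += 1  # production arc
    return process_nodes, complex_nodes, edges


process_nodes, complex_nodes, edges = build_mpd(RULES)
print(len(process_nodes), len(complex_nodes))  # 6 4
```

In this encoding the dimerization rule consumes the EGFR node twice, which corresponds to a stoichiometry of 2 on that arc. Keying nodes by full pattern instead of by composition would give each distinct site-state pattern its own node and tend to disconnect the graph.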

FIGURE 5

Diagram illustrating a layered visualization of a rule-based model in VCell. The panel A illustrates the Molecular Process Diagram providing an overview of a model. Panel B reveals detailed, site-specific reaction rules. Panel C shows distinct state patterns in rules it participates in, highlighted by red boxes. — Layered visualization of a rule-based model in VCell. **(A)** The Molecular Process Diagram (MPD) provides an overview: EGF binds EGFR [1], EGFR undergoes monomeric modifications [2,3], dimerizes [4], and the EGFR dimer further undergoes modifications [5,6]. **(B)** Clicking on a process node (e.g., [1]) reveals detailed, site-specific reaction rules. **(C)** Clicking on a molecular complex node (EGFR) shows distinct state patterns in rules it participates in, highlighted by red boxes (two example rules shown).

This representation offers an intuitive, pathway-like overview. For instance, one can easily follow sequential interactions such as the binding of EGF to EGFR, the subsequent formation of receptor dimers, and the distinct modifications of EGFR molecules within dimeric or monomeric states. However, simplifying the diagram to this level involves a trade-off, as it reduces explicit representation of detailed causal relationships among molecular states.

3.2 Incorporating site-specific details into molecular process diagrams

To retain and display detailed, site-specific interaction information, the VCell implementation of MPDs provides interactive node expansion. Clicking on a process node reveals a detailed cartoon of the corresponding reaction rule, explicitly showing the molecular interaction details (Figure 5B). Similarly, clicking on molecular complex nodes reveals precise reactant and product patterns associated with each complex (Figure 5C). The supplemental material available on GitHub provides examples and details on building molecular process diagrams in VCell for several published BioNetGen models. A BioNetGen language (BNGL) file can be imported into VCell and visualized immediately.

Despite these detailed visualizations, the abundance of molecular details can make identifying critical molecular features challenging. To address this, we implemented interactive highlighting options within VCell. Users can selectively highlight interacting molecules (either using distinct colors as shown in Figure 5 or uniformly gray as in Figure 6), molecular sites undergoing modification (Figure 6B), and the contextual state of molecular sites that are unchanged yet required for the interaction to occur (Figure 6C). These interactive visualization options allow users to more clearly identify and focus on key molecular features relevant to each reaction rule.
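The distinction between modified sites and contextual sites can be expressed as a simple comparison of reactant and product patterns. The sketch below assumes a hypothetical dictionary encoding of a single molecule's sites, with `None` standing for an unconstrained site; it illustrates the classification only and is not the VCell highlighting code.

```python
def classify_sites(reactant_sites, product_sites):
    """Split the sites of one molecule into three groups.

    modified   : state differs between reactant and product pattern
    context    : state is required by the rule but left unchanged
    irrelevant : state is not constrained by the rule (None, i.e. "?")
    """
    modified, context, irrelevant = [], [], []
    for site, before in reactant_sites.items():
        after = product_sites.get(site, before)
        if before != after:
            modified.append(site)
        elif before is None:
            irrelevant.append(site)
        else:
            context.append(site)
    return modified, context, irrelevant


# Ligand-binding rule, receptor side (assumed encoding).
reactant = {"ecd": "unbound", "tmd": "unbound", "Y1": None, "Y2": None}
product = {"ecd": "bound", "tmd": "unbound", "Y1": None, "Y2": None}

modified, context, irrelevant = classify_sites(reactant, product)
print(modified, context, irrelevant)  # ['ecd'] ['tmd'] ['Y1', 'Y2']
```

Under this encoding, "ecd" would receive the bright highlight for a modified site and "tmd" the pale highlight for molecular context.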

FIGURE 6

Panel A shows visualization without highlighting; all interacting molecules and sites displayed uniformly. Panel B shows sites undergoing modification by the reaction rule highlighted in bright yellow. Panel C shows sites unchanged but required in a specific state highlighted in pale yellow. — Highlighting site-specific reaction details in VCell. **(A)** Visualization without highlighting; all interacting molecules and sites displayed uniformly. **(B)** Sites undergoing modification by the reaction rule highlighted in bright yellow (e.g., receptor site “ecd” changing from “unbound” to “bound”). **(C)** Sites unchanged but required in a specific state (“molecular context”) highlighted in pale yellow (e.g., receptor site “tmd” must be unbound).

3.3 SBGN-compliant molecular process diagram

The Systems Biology Graphical Notation (SBGN) (Le Novère et al., 2009) provides standardized graphical conventions adopted by the modeler and developer communities for representing biological interactions and processes in a format that is both human-readable and machine-processable. Three orthogonal visual languages exist within SBGN: Process Description (PD), Entity Relationship (ER), and Activity Flow (AF). SBGN PD diagrams explicitly represent the temporal sequence of biochemical reactions; they are widely used in pathway databases (e.g., PantherDB (Mi et al., 2005), Reactome (Joshi-Tope et al., 2005), GeneXplain (Kolpakov et al., 2011)) and supported by modeling software like CellDesigner (Funahashi et al., 2003), COPASI (Hoops et al., 2006), and BIOCHAM (Calzone et al., 2006). Visualization tools compatible with the Biological Pathway Exchange (BioPAX; Demir et al., 2010) standard frequently offer SBGN-PD representations (e.g., VISIBIOweb (Dilek et al., 2010), CHiBE (Babur et al., 2014), Cytoscape plugin cySBGN (Gonçalves et al., 2013), Newt (Balci et al., 2021)). Support for SBGN visualization is facilitated by libSBGN, a dedicated software library implementing the SBGN Markup Language (SBGN-ML) (Van Iersel et al., 2012).

To represent MPDs in compliance with SBGN, we adapted existing PD diagram conventions to accommodate the site-specific information required by reaction rules. Each rule explicitly defines transformations occurring at particular molecular sites.

- Coarse-grained MPD: The simplest MPD (Figure 7) corresponds closely to the VCell representation in Figure 5A, where complexes (e.g., receptor dimers) are depicted using SBGN multimer glyphs. Site-specific details are minimized, displayed only when explicitly involved in an interaction.
- Intermediate MPD with site modifications: A more detailed MPD (Figure 8) includes explicitly depicted molecular sites modified by each rule. Reactant site details are placed along consumption arcs, while product site details appear along production arcs, each labeled according to standard SBGN conventions (e.g., phosphorylation denoted as “P@Y1”). The spatial positioning of site-state information adheres strictly to SBGN guidelines, ensuring compatibility with automated processing. Binding interactions specifically display site information only on consumption arcs.
- Detailed MPD with full contextual information: The most detailed version (Figure 9) comprehensively depicts all site-specific information, including molecular sites required to be in particular states (bound, unbound, phosphorylated, etc.) by the process, even when unchanged by the process itself. These contextual states are presented in dedicated annotation boxes clearly labeled (e.g., “bound” or “unbound”).
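The three levels of detail listed above can be thought of as progressively larger subsets of the same site information. The following sketch is one possible way to select annotations per level. The level names, the dictionary encoding, and the uniform use of the “state@site” label format (taken from the “P@Y1” example) are assumptions, and the special handling of binding interactions on consumption arcs is not modeled.

```python
LEVELS = ("coarse", "intermediate", "detailed")  # hypothetical level names


def label(state, site):
    """Format a site annotation in the 'state@site' style, e.g. 'P@Y1'."""
    return f"{state}@{site}"


def arc_annotations(reactant_sites, product_sites, level):
    """Collect the site annotations shown for one molecule at a given level."""
    if level not in LEVELS:
        raise ValueError(f"unknown level: {level}")
    shown = {"consumption": [], "production": [], "context": []}
    if level == "coarse":
        return shown  # no site details in this simplified sketch
    for site, before in reactant_sites.items():
        after = product_sites.get(site, before)
        if before != after:
            # Modified site: old state on consumption arc, new on production arc.
            shown["consumption"].append(label(before, site))
            shown["production"].append(label(after, site))
        elif before is not None and level == "detailed":
            # Unchanged but required state: molecular context.
            shown["context"].append(label(before, site))
    return shown


# Phosphorylation of Y1 within a dimer (assumed encoding).
before = {"tmd": "bound", "Y1": "U"}
after = {"tmd": "bound", "Y1": "P"}
print(arc_annotations(before, after, "intermediate"))
print(arc_annotations(before, after, "detailed"))
```

Removing the context entries from the detailed result gives the intermediate one, and removing all entries gives the coarse one, so each level is a strict simplification of the next.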

FIGURE 7

Molecular Process Diagram of the Epidermal Growth Factor (EGF) and Epidermal Growth Factor Receptor (EGFR) pathway in SBGN-compliant notation (corresponding to the VCell representation shown in Figure 5A). EGF and EGFR are represented by macromolecule glyphs; EGF-EGFR complexes use container glyphs, and EGFR dimers use multimer glyphs. Molecular sites (“site,” “ecd,” “tmd,” “Y1,” “Y2”) are shown as state variable glyphs (once per molecule). Stoichiometries greater than one are shown explicitly on arcs (e.g., “2” for dimerization).

FIGURE 8

Site-specific Molecular Process Diagram of the Epidermal Growth Factor (EGF) and Epidermal Growth Factor Receptor (EGFR) pathway, visualized in SBGN notation. The diagram shows EGF “site” binding to the EGFR “ecd” site, EGFR-EGFR interactions via “tmd” sites, and phosphorylation state changes (“U” to “P” or “P” to “U”) for EGFR sites “Y1” and “Y2”, shown in boxes along reaction arrows next to process nodes.

FIGURE 9

Fully detailed SBGN-compliant Molecular Process Diagram of the Epidermal Growth Factor (EGF) and Epidermal Growth Factor Receptor (EGFR) pathway. The diagram illustrates all required molecular contexts explicitly: EGF-EGFR binding requires “tmd” sites of EGFR to be unbound, and EGFR dimerization requires “ecd” sites to be bound. These conditions are shown in extra boxes along reaction arrows.

This comprehensive MPD visualization allows for accurate reconstruction of the underlying rule-based model, enabling straightforward mapping from process nodes back to reaction rules, from arcs back to reactant/product patterns, and from site annotations to contextual constraints. The supplemental material available on GitHub provides examples and details on building SBGN-compliant molecular process diagrams of several published models that use BioNetGen. These diagrams are implemented in yEd software.

An MPD is analogous to a reaction network diagram in that it captures interactions, but it omits details needed for simulation, such as the initial species with non-zero concentrations or protocols that activate or deactivate interactions. Rule-based models go beyond a simple set of reactions, combining declarative elements (explicit molecular assumptions) with prescriptive ones (e.g., selection of seed species, kinetic laws, simulation algorithm parameters, and rule inclusion/exclusion criteria). Thus, while MPDs serve effectively as visual analogs of reaction diagrams, they do not represent the prescriptive details required for modeling.

4 Discussion

Dynamical models that explicitly account for the intricate details of multimolecular complexes within cells are becoming increasingly common, driven by the expanding availability of detailed biological data. Historically, cartoon diagrams—whether informal or standardized—have been essential for visualizing biological knowledge and hypotheses. Such diagrams have served as a primary means of representing biological pathways, from qualitative descriptions to quantitative mechanistic models. However, the visualization of models that consider detailed molecular interactions, particularly rule-based models, poses significant challenges. These models have become more widespread as they are inherently able to address the combinatorial complexity that arises from explicitly considering numerous possible molecular interactions, binding states, and site-specific modifications.

We developed a novel visualization approach using multilayered Molecular Process Diagrams (MPDs). MPDs offer a unified representation of rule-based models by combining a clear overview of molecular interactions (“big picture”) with explicit and precise site-specific details and conditions (“detailed assumptions”). The approach’s scalability derives from three distinct but complementary information layers: interacting molecular complexes, molecular sites modified by reactions, and molecular context—sites that, while not modified, are necessary for interactions to occur. This layered structure enables seamless navigation back and forth between general, pathway-level insights and detailed, site-specific molecular interactions, ensuring that modeling assumptions remain transparent and clearly communicated.

Although this manuscript illustrates a relatively simple model of initial events in EGFR signaling, MPDs can also represent very large rule-based models. One example is the model by Hat et al. (2016), included in the supplemental material, which describes the complex circuitry of the p53 network regulated by Mdm2, Wip1, and PTEN. The model has 58 rules specifying interactions among 31 molecules. We use EGFR models as examples because they are well understood, but such models can grow in complexity up to 99 rules specifying interactions among the EGF ligand, the EGF receptor, and 34 adapter proteins (Stites et al., 2015). The model by Creamer et al. (2012) describes events in ERBB signaling leading to activation of ERK and Akt. It tracks phosphorylation of 55 individual serine, threonine, and tyrosine residues, and includes 544 rules specifying interactions among 19 molecules. Even such a large model can be visualized effectively using the different graph layout algorithms that we developed for VCell, providing a connectivity overview that can highlight the importance of specific molecular complexes. Naturally, visualizing a model with 544 rules as a single cartoon is of limited value. However, opening the model in the software and exploring each connected component by zooming in can provide a useful bird’s-eye view of the system.

An important consideration is that the content and organization of MPDs depend explicitly on the formulation of the underlying reaction rules. Specifically, molecules displayed within complex nodes reflect exactly how these complexes are described by the reactant and product patterns in the rules, rather than being derived from broader network-level analyses. Consequently, alternative formulations of rules, such as the use of bond wildcards or explicit specification of bound molecules, can significantly alter the resulting MPD without changing the model. For example, reaction rules can be made very concise: a ligand-receptor complex can be shown simply as a receptor in a bound state. However, for visualization purposes it may be more useful to alter the rules to include the ligand explicitly wherever a ligand-receptor complex appears. The model remains mathematically the same, but its visualization may become more intuitive.

The MPD approach has been fully implemented within the widely used VCell modeling and simulation framework. The associated graphical user interface (GUI) provides biologists with a user-friendly, intuitive method for constructing and analyzing rule-based models, clearly specifying the details and spatial localization of each interaction rule.

We have further demonstrated how the existing Systems Biology Graphical Notation (SBGN) standards can be adapted with minimal alterations to effectively visualize MPDs. Importantly, the SBGN-compliant MPDs introduced here (Figures 7–9) maintain compatibility with standard SBGN Process Description (PD) diagrams, as removal of additional site-specific annotations results in a conventional SBGN-PD representation. Given that the syntax and semantics of SBGN are community-driven, our proposed extensions will require ongoing discussion and validation within the SBGN research community. Nonetheless, we anticipate that our approach provides a straightforward and easily adoptable solution for integrating detailed rule-based information within the existing standard. For broader adoption, rule-based elements should be incorporated into the SBGN standard, and tools using SBGN must be able to support these new notations. Achieving the first step requires a community-wide discussion, since SBGN is a standard intended for adoption by both the systems biology community and those interested in visualization. While adapting backend tools such as libSBGN (Van Iersel et al., 2012) is usually straightforward, developing intuitive user interfaces and achieving proper rendering is considerably more challenging. One possible solution is to employ GraphML, which is supported by various visualization tools. We are currently working on designing the most appropriate schema for its implementation.

Finally, while the MPD concept was originally designed to address the specific complexities associated with visualizing rule-based models, its utility is not limited to this context. MPDs offer a valuable visualization tool for any scenario requiring detailed graphical representation of site-specific interactions among biomolecules, potentially extending their applicability across diverse areas of systems biology and molecular modeling.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://github.com/vcellmike/MolecularProcessDiagram.

Author contributions

DV: Methodology, Software, Writing – original draft, Writing – review and editing. JS: Conceptualization, Investigation, Methodology, Software, Writing – original draft, Writing – review and editing. IM: Conceptualization, Formal Analysis, Funding acquisition, Methodology, Resources, Software, Validation, Writing – original draft, Writing – review and editing. MB: Conceptualization, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by National Institutes of Health grants NIH R24 GM137787 and P41 EB023912.

Acknowledgments

We thank summer students Tyler McLaughlin for the initial prototype drawing, and James Greene for drawing SBML representations of multiple models.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1
Babur Ö. Dogrusoz U. Çakır M. Aksoy B. A. Schultz N. Sander C. et al (2014). Integrating biological pathways and genomic profiles with ChiBE 2. BMC Genomics 15, 642. 10.1186/1471-2164-15-642
2
Balci H. Siper M. C. Saleh N. Safarli I. Roy L. Kilicarslan M. et al (2021). Newt: a comprehensive web-based tool for viewing, constructing and analyzing biological maps. Bioinformatics 37 (10), 1475–1477. 10.1093/bioinformatics/btaa850
3
Bartol T. M. Ordyan M. Sejnowski T. J. Rangamani P. Kennedy M. B. (2025). A spatial model of autophosphorylation of CaMKII predicts that the lifetime of phospho-CaMKII after induction of synaptic plasticity is greatly prolonged by CaM-trapping. Front. Synaptic Neurosci. 17, 1547948. 10.3389/fnsyn.2025.1547948
4
Blinov M. L. Moraru I. I. (2012). “Leveraging modeling approaches: reaction networks and rules,” in Advances in systems biology (New York, NY: Springer), 517–530.
5
Blinov M. L. Faeder J. R. Goldstein B. Hlavacek W. S. (2004). BioNetGen: software for rule-based modeling of signal transduction based on the interactions of molecular domains. Bioinformatics 20 (17), 3289–3291. 10.1093/bioinformatics/bth378
6
Blinov M. L. Yang J. Faeder J. R. Hlavacek W. S. (2006). Depicting signaling cascades. Nat. Biotechnol. 24 (2), 137–138. 10.1038/nbt0206-137
7
Blinov M. L. Faeder J. R. Goldstein B. Hlavacek W. S. (2006a). A network model of early events in epidermal growth factor receptor signaling that accounts for combinatorial complexity. Biosystems 83 (2-3), 136–151. 10.1016/j.biosystems.2005.06.014
8
Blinov M. L. Yang J. Faeder J. R. Hlavacek W. S. (2006b). Graph theory for rule-based modeling of biochemical networks. Trans. Comput. Syst. Biol. VII 4230, 89–106. 10.1007/11905455_5
9
Blinov M. L. Schaff J. C. Vasilescu D. Moraru I. I. Bloom J. E. Loew L. M. (2017). Compartmental and spatial rule-based modeling with virtual cell. Biophys. J. 113 (7), 1365–1372. 10.1016/j.bpj.2017.08.022
10
Calzone L. Fages F. Soliman S. (2006). BIOCHAM: an environment for modeling biological systems and formalizing experimental knowledge. Bioinformatics 22 (14), 1805–1807. 10.1093/bioinformatics/btl172
11
Cheng H.-C. Angermann B. R. Zhang F. Meier-Schellersheim M. (2014). NetworkViewer: visualizing biochemical reaction networks with embedded rendering of molecular interaction rules. BMC Syst. Biol. 8, 70. 10.1186/1752-0509-8-70
12
Chylek L. A. Hu B. Blinov M. L. Emonet T. Faeder J. R. Goldstein B. et al (2011). Guidelines for visualizing and annotating rule-based models. Mol. Biosyst. 7, 2779–2795. 10.1039/c1mb05077j
13
Chylek L. A. Holowka D. A. Baird B. A. Hlavacek W. S. (2014). An interaction library for the FcεRI signaling network. Front. Immunol. 5, 172. 10.3389/fimmu.2014.00172
14
Chylek L. A. Harris L. A. Faeder J. R. Hlavacek W. S. (2015). Modeling for (physical) biologists: an introduction to the rule-based approach. Phys. Biol. 12, 045007. 10.1088/1478-3975/12/4/045007
15
Creamer M. S. Stites E. C. Aziz M. Cahill J. A. Wee Tan C. Berens M. E. et al (2012). Specification, annotation, visualization and simulation of a large rule-based model for ERBB receptor signaling. BMC Syst. Biol. 6, 107. 10.1186/1752-0509-6-107
16
Danos V. Feret J. Fontana W. Krivine J. (2007a). Scalable simulation of cellular signaling networks. Programming languages and systems. Lect. Notes Comput. Sci., 139–157. 10.1007/978-3-540-76637-7_10
17
Danos V. Feret J. Fontana W. Harmer R. Krivine J. (2007b). Rule-based modelling of cellular signalling. LNCS, CONCUR 2007 – Concurrency theory. Berlin, Heidelberg: Springer, 17–41.
18
Demir E. Cary M. P. Paley S. Fukuda K. Lemer C. Vastrik I. et al (2010). The BioPAX community standard for pathway data sharing. Nat. Biotechnol. 28 (9), 935–942. 10.1038/nbt.1666
19
Dilek A. Belviranli M. E. Dogrusoz U. (2010). VISIBIOweb: visualization and layout services for BioPAX pathway models. Nucleic Acids Res. 38 (Suppl. 2), W150–W154. 10.1093/nar/gkq352
20
Faeder J. R. Blinov M. L. Hlavacek W. S. (2009). Rule-based modeling of biochemical systems with BioNetGen. Methods Mol. Biol. 500, 113–167. 10.1007/978-1-59745-525-1_5
21
Funahashi A. Morohashi M. Kitano H. Tanimura N. (2003). CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. Biosilico 1 (5), 159–162. 10.1016/s1478-5382(03)02370-9
22
Gonçalves E. van Iersel M. Saez-Rodriguez J. (2013). CySBGN: a cytoscape plug-in to integrate SBGN maps. BMC Bioinforma. 14 (1), 17. 10.1186/1471-2105-14-17
23
Hat B. Kochańczyk M. Bogdał M. N. Lipniacki T. (2016). Feedbacks, Bifurcations, and Cell Fate Decision-Making in the p53 System. PLoS Comput. Biol. 12 (2), e1004787. 10.1371/journal.pcbi.1004787
24
Harris L. A. Hogg J. S. Tapia J.-J. Sekar J. A. P. Gupta S. Korsunsky I. et al (2016). BioNetGen 2.2: advances in rule-based modeling. Bioinformatics 32, 3366–3368. 10.1093/bioinformatics/btw469
25
Hlavacek W. S. Faeder J. R. Blinov M. L. Posner R. G. Hucka M. Fontana W. (2006). Rules for modeling signal-transduction systems. Sci. STKE 2006 (344), re6. 10.1126/stke.3442006re6
26
Hoops S. Sahle S. Gauges R. Lee C. Pahle J. Simus N. et al (2006). COPASI—a complex pathway simulator. Bioinformatics 22 (24), 3067–3074. 10.1093/bioinformatics/btl485
27
Huang L. Jiang Y. Chen Y. (2017). Predicting drug combination index and simulating the network-regulation dynamics by mathematical modeling of drug-targeted EGFR-ERK signaling pathway. Sci. Rep. 7 (1), 40752. 10.1038/srep40752
28
Husar A. Ordyan M. Garcia G. C. Yancey J. G. Saglam A. S. Faeder J. R. et al (2024). MCell4 with BioNetGen: a Monte Carlo simulator of rule-based reaction-diffusion systems with Python interface. PLoS Comput. Biol. 20 (4), e1011800. 10.1371/journal.pcbi.1011800
29
Joshi-Tope G. Gillespie M. Vastrik I. D'Eustachio P. Schmidt E. de Bono B. et al (2005). Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 33 (Suppl. 1), D428–D432. 10.1093/nar/gki072
30
Kholodenko B. N. Demin O. V. Moehren G. Hoek J. B. (1999). Quantification of short term signaling by the epidermal growth factor receptor. J. Biol. Chem. 274 (42), 30169–30181. 10.1074/jbc.274.42.30169
31
Kohn K. W. Aladjem M. I. Kim S. Weinstein J. N. Pommier Y. (2006). Depicting combinatorial complexity with the molecular interaction map notation. Mol. Syst. Biol. 2 (1), 51. 10.1038/msb4100088
32
Kolpakov F. Poroikov V. Selivanova G. Kel A. (2011). GeneXplain—Identification of causal biomarkers and drug targets in personalized cancer pathways. J. Biomol. Tech. 22 (Suppl.), S16.
33
Le Novère N. Hucka M. Mi H. Moodie S. Schreiber F. Sorokin A. et al (2009). The systems biology graphical notation. Nat. Biotechnol. 27, 735–741. 10.1038/nbt.1558
34
Liguori-Bills N. Blinov M. L. (2024). bnglViz: online visualization of rule-based models. Bioinformatics 40 (6), btae351. 10.1093/bioinformatics/btae351
35
Lopez C. F. Muhlich J. L. Bachman J. A. Sorger P. K. (2013). Programming biological models in Python using PySB. Mol. Syst. Biol. 9, 646. 10.1038/msb.2013.1
36
Mayer B. J. Blinov M. L. Loew L. M. (2009). Molecular machines or pleiomorphic ensembles: signaling complexes revisited. J. Biol. 8 (9), 81. 10.1186/jbiol185
37
Meier-Schellersheim M. Xu X. Angermann B. Kunkel E. J. Jin T. Germain R. N. (2006). Key role of local regulation in chemosensing revealed by a new molecular interaction-based modeling method. PLoS Comput. Biol. 2, e82–e724. 10.1371/journal.pcbi.0020082
38
Mi H. Lazareva-Ulitsky B. Loo R. Kejariwal A. Vandergriff J. Rabkin S. et al (2005). The PANTHER database of protein families, subfamilies, functions and pathways. Nucleic Acids Res. 33 (Suppl. 1), D284–D288. 10.1093/nar/gki078
39
Millan A. J. Allain V. Nayak I. Libang J. B. Quijada-Madrid L. M. Arakawa-Hoyt J. S. et al (2025). SYK negatively regulates ITAM-mediated human NK cell signaling and CD19-CAR NK cell efficacy. J. Immunol. 214 (3), 384–398. 10.1093/jimmun/vkaf012
40
Moraru I. I. Schaff J. C. Slepchenko B. M. Blinov M. L. Morgan F. Lakshminarayana A. et al (2008). Virtual Cell modelling and simulation software environment. IET Syst. Biol. 2 (5), 352–362. 10.1049/iet-syb:20080102
41
Olsen J. V. Blagoev B. Gnad F. Macek B. Kumar C. Mortensen P. et al (2006). Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell 127 (3), 635–648. 10.1016/j.cell.2006.09.026
42
Pawson T. (2004). Specificity in signal transduction: from phosphotyrosine-SH2 domain interactions to complex cellular systems. Cell 116 (2), 191–203. 10.1016/s0092-8674(03)01077-8
43
Pawson T. Nash P. (2003). Assembly of cell regulatory systems through protein interaction domains. Science 300 (5618), 445–452. 10.1126/science.1083653
44
Schaff J. C. Vasilescu D. Moraru I. I. Loew L. M. Blinov M. L. (2016). Rule-based modeling with Virtual Cell. Bioinformatics 32 (18), 2880–2882. 10.1093/bioinformatics/btw353
45
Schaff J. Fink C. C. Slepchenko B. Carson J. H. Loew L. M. (1997). A general computational framework for modeling cellular structure and function. Biophys. J. 73 (3), 1135–1146. 10.1016/S0006-3495(97)78146-3
46
Schoeberl B. Eichler-Jonsson C. Gilles E. D. Müller G. (2002). Computational modeling of the dynamics of the MAP kinase cascade activated by surface and internalized EGF receptors. Nat. Biotechnol. 20 (4), 370–375. 10.1038/nbt0402-370
47
Seet B. T. Dikic I. Zhou M. M. Pawson T. (2006). Reading protein modifications with interaction domains. Nat. Rev. Mol. Cell Biol. 7 (7), 473–483. 10.1038/nrm1960
48
Sekar J. A. Tapia J. J. Faeder J. R. (2017). Automated visualization of rule-based models. PLoS Comput. Biol. 13 (11), e1005857. 10.1371/journal.pcbi.1005857
49
Siraganian R. P. Zhang J. Suzuki K. Sada K. (2002). Protein tyrosine kinase Syk in mast cell signaling. Mol. Immunol. 38 (16-18), 1229–1233. 10.1016/s0161-5890(02)00068-8
50
Smith A. M. Xu W. Sun Y. Faeder J. R. Marai G. E. (2012). RuleBender: integrated modeling, simulation and visualization for rule-based intracellular biochemistry. BMC Bioinforma. 13 (Suppl. 8), S3. 10.1186/1471-2105-13-S8-S3
51
Sneddon M. W. Faeder J. R. Emonet T. (2011). Efficient modeling, simulation and coarse-graining of biological complexity with NFsim. Nat. Methods 8 (2), 177–183. 10.1038/nmeth.1546
52
Stites E. C. Aziz M. Creamer M. S. Von Hoff D. D. Posner R. G. Hlavacek W. S. et al (2015). Use of mechanistic models to integrate and analyze multiple proteomic datasets. Biophys. J. 108 (7), 1819–1829. 10.1016/j.bpj.2015.02.030
53
Thomson T. M. Benjamin K. R. Bush A. Love T. Pincus D. Resnekov O. et al (2011). Scaffold number in yeast signaling system sets tradeoff between system output and dynamic range. Proc. Natl. Acad. Sci. 108, 20265–20270. 10.1073/pnas.1004042108
54
Tiger C.-F. Krause F. Cedersund G. Palmér R. Klipp E. Hohmann S. et al (2012). A framework for mapping, visualisation and automatic model creation of signal-transduction networks. Mol. Syst. Biol. 8, 578. 10.1038/msb.2012.12
55
Van Iersel M. P. Villéger A. C. Czauderna T. Boyd S. E. Bergmann F. T. Luna A. et al (2012). Software support for SBGN maps: SBGN-ML and LibSBGN. Bioinformatics 28 (15), 2016–2021. 10.1093/bioinformatics/bts270
56
Wan Y. Hudson R. Smith J. Forman-Kay J. D. Ditlev J. A. (2025). Protein interactions, calcium, phosphorylation, and cholesterol modulate CFTR cluster formation on membranes. Proc. Natl. Acad. Sci. 122 (11), e2424470122. 10.1073/pnas.2424470122
57
Zhang F. Angermann B. R. Meier-Schellersheim M. (2013). The Simmune Modeler visual interface for creating signaling networks based on bi-molecular interactions. Bioinformatics 29, 1229–1230. 10.1093/bioinformatics/btt134

Summary

Keywords

systems biology, mathematical modeling, rule-based modeling, reaction network, visualization, SBGN, Virtual Cell, VCell

Citation

Vasilescu D, Schaff JC, Moraru II and Blinov ML (2025) Visualizing mechanistic models by integrating site-specific molecular details into reaction networks. Front. Mol. Biosci. 12:1681081. doi: 10.3389/fmolb.2025.1681081

Received

06 August 2025

Revised

29 September 2025

Accepted

30 October 2025

Published

20 November 2025

Volume

12 - 2025

Edited by

Padmini Rangamani, University of California, San Diego, United States

Reviewed by

Sadegh Sulaimany, University of Kurdistan, Iran

Jin Xu, Independent Researcher, Harbin, China

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Michael L. Blinov, blinov@uchc.edu
