REVIEW article

Front. Microbiol., 05 February 2021

Sec. Infectious Agents and Disease

Volume 12 - 2021 | https://doi.org/10.3389/fmicb.2021.628798

Collagen Binding Proteins of Gram-Positive Pathogens

  • Center for Infectious and Inflammatory Diseases, Institute of Biosciences and Technology, Texas A&M Health Science Center, Houston, TX, United States

Abstract

Collagens are the primary structural components of mammalian extracellular matrices. In addition, collagens regulate tissue development, regeneration and host defense through interaction with specific cellular receptors. Their unique triple helix structure, which requires a glycine residue every third amino acid, is the defining structural feature of collagens. There are 28 genetically distinct collagens in humans. In addition, several other unrelated human proteins contain a collagen domain. Gram-positive bacteria of the genera Staphylococcus, Streptococcus, Enterococcus, and Bacillus express cell surface proteins that bind to collagen. These proteins of Gram-positive pathogens are modular proteins that can be classified into different structural families. This review will focus on the different structural families of collagen binding proteins of Gram-positive pathogens. We will describe how these proteins interact with the triple helix in collagens and other host proteins containing a collagenous domain and discuss how these interactions can contribute to the pathogenic processes.

Introduction

Collagen is the most abundant protein in the human body and an integral component of the extracellular matrix (ECM) (Shoulders and Raines, 2009). The ECM is a complex proteinaceous network that provides structural support to tissues along with the necessary signaling for cell adhesion, migration, and growth as well as for tissue development and regeneration (Frantz et al., 2010). Collagen plays a critical role in the functional integrity of most tissues including bone, skin, tendon, and cartilage (Burgeson and Nimni, 1992; Frantz et al., 2010). Collagen can also be the target of surface-anchored adhesins and other virulence factors produced by both Gram-positive and Gram-negative pathogens (Harrington, 1996; Singh et al., 2012; Zhang et al., 2015; Duarte et al., 2016; Paulsson and Riesbeck, 2018; Vaca et al., 2020). Of these, the cell wall anchored collagen binding proteins in Gram-positive bacteria have been more extensively studied, and will be reviewed here.

There are 28 identified types of collagens in humans (Table 1; Ricard-Blum, 2011). Each collagen molecule is formed through the interactions of three polypeptide chains known as α-strands. The α-strands come together to form a canonical right-handed triple helical structure termed the triple helix domain (Kadler et al., 2007; Ricard-Blum, 2011). Triple helices can be formed by association of identical α-strands to form a homotrimer or be composed of different α-strands (heterotrimer) (Ricard-Blum, 2011). The triple helix domain is a flexible rod-shaped structure held together through inter-chain hydrogen bonding (Kadler et al., 2007; Shoulders and Raines, 2009; Ricard-Blum, 2011). The triple helix is defined by Gly-X-X’ amino acid repeats with X and X’ commonly representing proline and 4-hydroxyproline, respectively (Shoulders and Raines, 2009). Glycine is required at every third position because any other residue would result in steric hindrance and helix destabilization (Theocharis et al., 2016). Collagens also have non-triple helical domains at their N- and C-termini, which are referred to here as “non-collagenous” domains. In addition to the conventional collagens, several other mammalian proteins contain collagenous domains (Fraser and Tenner, 2008; Zani et al., 2015; PrabhuDas et al., 2017; Casals et al., 2019).
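The Gly-X-X’ rule described above can be checked on a sequence with a few lines of code. The following is a minimal sketch, not a published tool: the function names are hypothetical, sequences are assumed to be one-letter amino acid strings, and "O" is assumed to denote 4-hydroxyproline.

```python
def glycine_every_third(seq: str, frame: int = 0) -> bool:
    """Return True if every third residue, starting at index `frame`, is glycine."""
    seq = seq.upper()
    return all(seq[i] == "G" for i in range(frame, len(seq), 3))


def longest_gxx_run(seq: str) -> int:
    """Longest uninterrupted run of Gly-X-X' triplets, counted in triplets.

    All three reading frames are scanned because a collagenous region
    can begin at any position in a longer protein sequence.
    """
    seq = seq.upper()
    best = 0
    for frame in range(3):
        run = 0
        # Stop early enough that every triplet inspected is complete.
        for i in range(frame, len(seq) - 2, 3):
            if seq[i] == "G":
                run += 1
                best = max(best, run)
            else:
                run = 0
    return best


if __name__ == "__main__":
    ideal = "GPO" * 3          # idealized collagen-like peptide
    broken = "GPOGPOAPOGPO"    # glycine replaced by alanine in the third triplet
    print(glycine_every_third(ideal), longest_gxx_run(ideal))
    print(glycine_every_third(broken), longest_gxx_run(broken))
```

A single non-glycine residue at a Gly position splits the run, which mirrors the destabilizing effect of such substitutions on the triple helix.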

TABLE 1

Collagen type | Classification | Chain composition | Tissue distribution | Function | References
I | Fibril-forming | α1[I]2 α2[I]; α1[I]3 | Abundant and present in most connective tissues and interstitial membranes | Key structural component | Ricard-Blum, 2011; Henriksen and Karsdal, 2016
II | Fibril-forming | α1[II]3 | Cartilage, vitreous humor, intervertebral disk | Tissue integrity and resiliency to stress | Ricard-Blum, 2011; Gudmann and Karsdal, 2016
III | Fibril-forming | α1[III]3 | Tissues containing type I collagen, especially embryonic skin and hollow organs like blood vessels, uterus and bowel | Structural component, wound healing, interacts with platelets in blood clotting cascade | Ricard-Blum, 2011; Nielsen and Karsdal, 2016a
IV | Network-forming | α1[IV]2 α2[IV]; α3[IV] α4[IV] α5[IV]; α5[IV]2 α6[IV] | Basement membranes | Barrier between tissue compartments, signaling | Ricard-Blum, 2011; Sand et al., 2016
V | Fibril-forming | α1[V]3; α1[V]2 α2[V]; α1[V] α2[V] α3[V] | Tissues containing type I collagen | Regulates collagen fibrillogenesis | Ricard-Blum, 2011; Leeming and Karsdal, 2016
VI | Beaded-filament-forming | α1[VI] α2[VI] α3[VI]; α1[VI] α2[VI] α4[VI] | Most connective tissues | Modulates stiffness and mechanical properties of extracellular matrix, signaling | Ricard-Blum, 2011; Sun and Karsdal, 2016a
VII | Anchoring fibrils | α1[VII]2 α2[VII]; α1[VII]3 | Many tissues | Stability of extracellular matrix | Ricard-Blum, 2011; Mortensen and Karsdal, 2016
VIII | Network-forming | α1[VIII]3; α2[VIII]3; α1[VIII]2 α2[VIII]; α1[VIII] α2[VIII]2 | Descemet’s membrane, heart, brain, liver, lung, muscles and around chondrocytes in cartilage | Structural component, signaling | Ricard-Blum, 2011; Hansen and Karsdal, 2016
IX | FACIT | α1[IX] α2[IX] α3[IX] | Tissues containing type II collagen | Stabilization of the fibrillar collagen network, limits collagen fibril diameter | Ricard-Blum, 2011; He and Karsdal, 2016
X | Network-forming | α1[X]3 | Hypertrophic cartilage | Endochondral ossification | Shen, 2005; Ricard-Blum, 2011
XI | Fibril-forming | α1[XI] α2[XI] α3[XI]; α1[XI] α1[V] α3[XI] | Tissues containing type II collagen | Regulate fibrillogenesis of type II collagen fibrils, nucleator for collagen types I and II fibrillogenesis | Ricard-Blum, 2011; Luo and Karsdal, 2016
XII | FACIT | α1[XII]3 | Tissues containing type I collagen | Osteoblast/osteocyte differentiation, skin homeostasis and repair, tendon development, regulation of fibrillogenesis | Ricard-Blum, 2011; Izu et al., 2020; Schönborn et al., 2020
XIII | MACIT | α1[XIII]3 | Many tissues but present in low amounts | Plays a role in bone formation, presynaptic and postsynaptic maturation and integrity | Ricard-Blum, 2011; Zainul et al., 2018
XIV | FACIT | α1[XIV]3 | Tissues containing type I collagen | Regulates fibrillogenesis by limiting fibril diameter | Ricard-Blum, 2011; Manon-Jensen and Karsdal, 2016
XV | Multiplexin | α1[XV]3 | Basement membrane | Crosslinks collagen type I and III fibrils | Ricard-Blum, 2011; Arvanitidis and Karsdal, 2016
XVI | FACIT | α1[XVI]3 | Many tissues | Stability of extracellular matrix, signaling | Ricard-Blum, 2011; Sand and Karsdal, 2016
XVII | MACIT | α1[XVII]3 | Hemidesmosomes | Adhesion of epithelial cells to extracellular matrix, teeth formation | Ricard-Blum, 2011; Sun and Karsdal, 2016b
XVIII | Multiplexin | α1[XVIII]3 | Basement membrane | Integrity of basement membrane, inhibit angiogenesis and tumor growth | Ricard-Blum, 2011; Bager and Karsdal, 2016
XIX | FACIT | α1[XIX]3 | Basement membrane | Acts as a cross-bridge between collagen fibrils and other extracellular molecules | Ricard-Blum, 2011; Nielsen and Karsdal, 2016b
XX | FACIT | α1[XX]3 | Cornea, minor component of multiple connective tissues | Specific role unknown | Ricard-Blum, 2011; Willumsen and Karsdal, 2016
XXI | FACIT | α1[XXI]3 | Many tissues | Acts as a cross-bridge between collagen fibrils and other extracellular matrix molecules | Ricard-Blum, 2011; Kehlet and Karsdal, 2016
XXII | FACIT | α1[XXII]3 | Tissue junction in skeletal and heart muscle | Plays a role in vascular stability | Ricard-Blum, 2011; Ton et al., 2018
XXIII | MACIT | α1[XXIII]3 | Cornea, lung, cartilage, amnion | Induce keratinocyte adhesion and spreading, cancer cell metastasis | Ricard-Blum, 2011; Veit et al., 2011; Spivey et al., 2012
XXIV | Fibril-forming | α1[XXIV]3 | Bone, cornea | Regulation of osteoblast differentiation and mineralization | Ricard-Blum, 2011; Wang et al., 2012
XXV | MACIT | α1[XXV]3 | Brain, neurons | Fusion of myoblasts into myofibers, regulates intramuscular motor innervation | Ricard-Blum, 2011; Tanaka et al., 2014; Gonçalves et al., 2019
XXVI |  | α1[XXVI]3 | Testis, ovary | Development of reproductive tissues | Sato et al., 2002; Ricard-Blum, 2011
XXVII | Fibril-forming | α1[XXVII]3 | Cartilage, eye, ear, lung, colon | Structural role in the pericellular extracellular matrix, transition of cartilage to bone | Ricard-Blum, 2011; Luo et al., 2017
XXVIII |  | α1[XXVIII]3 | Dorsal root ganglia, peripheral nerves, in low amounts in skin and calvaria | Specific role unknown | Ricard-Blum, 2011; Gebauer et al., 2016

The collagen family.

Bacterial surface proteins contribute to pathogenic processes and play a critical role in mediating adhesion to host cells and tissues, enabling colonization, invasion, and biofilm formation (Foster et al., 2014; Foster, 2019). In addition, binding of bacterial surface proteins to host ligands can lead to evasion of the host defense systems (Foster et al., 2014; Foster, 2019). In Gram-positive bacteria, different classes of surface proteins exist: (1) lipoproteins, (2) proteins covalently anchored to the cell wall, (3) pilus proteins, (4) non-covalently surface-associated proteins, and (5) transmembrane proteins (Desvaux et al., 2006; Fischetti, 2019). Lipoproteins are proteins covalently attached to membrane lipids via their N-terminus (Desvaux et al., 2006). Cell wall anchored proteins and pilus proteins are anchored to the cell wall by the action of enzymes called sortases (Desvaux et al., 2006). Sortases mediate covalent linking of proteins to the peptidoglycan through a transpeptidase reaction, and can also enable assembly of surface pilus and anchor the pilus onto the peptidoglycan layer (Ton-That and Schneewind, 2004; Desvaux et al., 2006; Fischetti, 2019). Lastly, non-covalently surface associated proteins contain cell wall binding domains (Desvaux et al., 2006; Fischetti, 2019).

Bacterial surface proteins are modular multi-domain proteins that can often be grouped into families based on their structural similarities. Multiple structurally related families of proteins have been identified in the literature (Waldemarsson et al., 2006; Foster et al., 2014; Frost et al., 2017; Foster, 2019; Taglialegna et al., 2020). Notable examples of structural families in Gram-positive bacteria include the MSCRAMMs (microbial surface components recognizing adhesive matrix molecules) (Foster et al., 2014; Foster, 2019), serine-rich repeat proteins (Lizcano et al., 2012), and M-proteins (Fischetti, 2016). In this review, we will describe collagen binding proteins present on the surface of Gram-positive human pathogens. This review will focus on structural families where more than one protein with structural similarity has been reported to bind collagen directly. Some proteins reported in the literature use fibronectin as a bridging molecule to bind collagen, e.g., streptococcal fibronectin binding protein 1 (SfbI) of Streptococcus pyogenes (Dinkla et al., 2003a), and are not covered here.

Types of Collagen

Collagens can be divided into different categories, which include fibrillar collagen, network forming collagen, FACITs (fibril-associated collagens with interrupted triple helices), MACITs (membrane-associated collagens with interrupted triple helices), anchoring fibrils, beaded-filament-forming collagens, and MULTIPLEXIN (multiple triple-helix domains and interruptions). These major classes of collagen will be discussed briefly below (Ricard-Blum, 2011; Theocharis et al., 2016). Collagen structure, chain composition, tissue distribution and functions are listed in Table 1.

Fibrillar collagen is the most common type of collagen in humans. Collagen types I, II, III, V, XI, XXIV, and XXVII all have a fibrillar configuration (Kadler et al., 2007; Theocharis et al., 2016). During fibrillogenesis, the protocollagen strands assemble into a triple helical formation called the procollagen triple helix, which undergoes cleavage of its N- and C-termini to generate the tropocollagen triple helix molecule (Kadler et al., 2007; Shoulders and Raines, 2009; Theocharis et al., 2016). Tropocollagen triple helix molecules self-assemble into a D-staggered arrangement with a 67 nm periodicity to form collagen microfibrils [for further details on D-staggering, see Boudko and Bächinger (2016), Kadler (2017), Holmes et al. (2018)]. As a last step, collagen fibrils of diameter 15–500 nm are formed by crosslinking of collagen microfibrils (Kadler et al., 2007; Shoulders and Raines, 2009; Theocharis et al., 2016). Collagen fibrils in turn participate in forming larger structures such as ligaments and tendons (Ricard-Blum, 2011).

Network forming collagens include collagen types IV, VIII, and X, with collagen type IV being the archetype (Theocharis et al., 2016). Collagen type IV is found in the basement membrane along with other molecules such as laminin (Hohenester and Yurchenco, 2013; Theocharis et al., 2016). Unlike fibrillar collagens, the non-collagenous domains of these molecules are not cleaved and are utilized to form tail to tail interactions with other non-collagenous domains of collagen (Sundaramoorthy et al., 2002). Stabilizing tetramers are also formed via N-terminal head to head interactions. Once a mature network is formed, these collagens work to support the surrounding epithelial cell layer (Kadler et al., 2007).

Fibril-associated collagens with interrupted triple helices are relatively short flexible collagens that contain small triple helical regions interrupted by non-collagenous domains (Theocharis et al., 2016). Collagen types IX, XII, XIV, XVI, XIX, XX, XXI, and XXII have been reported as FACITs (Shoulders and Raines, 2009). Their primary role is to connect other collagen types together as well as with various ECM components (Theocharis et al., 2016). Collagen type IX, an archetypal FACIT, is covalently linked to collagen type II present in cartilage (Kadler et al., 2007; Ricard-Blum, 2011; Theocharis et al., 2016) and collagen type XIV binds to type I (Kadler et al., 2007; Ricard-Blum, 2011; Theocharis et al., 2016).

Membrane-associated collagens with interrupted triple helices are transmembrane proteins and contain a short N-terminal cytoplasmic tail, a transmembrane helix, and a collagenous C-terminal extracellular domain. These collagens can act as cellular receptors that facilitate cell adhesion and, upon cleavage, as soluble collagens in the ECM (Ricard-Blum, 2011; Theocharis et al., 2016). Examples of MACITs include collagen types XIII, XXIII, and XXV, which are expressed by several cell types (Kadler et al., 2007; Theocharis et al., 2016).

Beaded filament collagens include collagen types VI, XXVI, and XXVIII with type VI being the most studied (Theocharis et al., 2016). Once these collagens are secreted from the cell, they arrange in an anti-parallel fashion to form dimers. Dimers then form tetramers through interactions with other dimers. Next, tetramers connect by their globular domains to form filaments where globular domains appear as beads (Kadler et al., 2007; Theocharis et al., 2016). Beaded filament collagens are found in various connective tissues, e.g., cartilage, bone, tendon, etc. (Fitzgerald et al., 2013).

Multiplexins include collagen types XV and XVIII and have not been studied extensively (Theocharis et al., 2016). They are localized to vascular and epithelial basement membranes and participate in bridging other collagens to underlying structures (Theocharis et al., 2016).

Other Host Proteins With Collagen-Like Regions

G-X-X’ repeats are the defining feature of the collagen triple helix primary sequence. Proteins with collagen-like regions but not classified as conventional collagen have been identified in mammals and microbes (Pyagay et al., 2005; Tom Tang et al., 2005; Ricard-Blum, 2011; Yu et al., 2014; Casals et al., 2019). Mammalian proteins containing collagen-like domains include membrane proteins (e.g., scavenger receptors) (Ricard-Blum, 2011; Zani et al., 2015; PrabhuDas et al., 2017) and secreted proteins (e.g., human defense collagens) (Ricard-Blum, 2011; Casals et al., 2019).

Human defense collagens include members of the collectin family, ficolins, and C1q and TNF-related proteins (Casals et al., 2019). Members of the collectin family are surfactant proteins A and D, mannan-binding lectin, collectin liver-1, collectin kidney-1, and the heterotrimeric collectin CL-LK formed by the combination of collectin liver-1 and collectin kidney-1 (Casals et al., 2019). The ficolin family contains three ficolins: M-, H-, and L-ficolin (Casals et al., 2019). The C1q and TNF-related protein family only contains two members: C1q and adiponectin (Casals et al., 2019).

The defense collagens contain an N-terminal segment, a collagen-like region and a globular recognition domain that recognizes pathogen-associated molecular patterns and danger-associated molecular patterns (Fraser and Tenner, 2008; Casals et al., 2018, 2019). These proteins form multimeric structures and play an important role in pathogen clearance (Fraser and Tenner, 2008; Casals et al., 2018, 2019). The collagen-like regions of defense collagens vary in length and contain G-X-X’ repeats where X is often a proline, and X’ is often a hydroxylysine or a hydroxyproline (Casals et al., 2019). The collagen-like domains in human defense collagens serve two functions: (1) binding to associated proteases responsible for triggering the complement cascade and (2) binding cell receptors involved in clearance of pathogens and dead cells (Casals et al., 2019).

Collagen Binding Proteins

CNA-Like MSCRAMMs

In 1985, Staphylococcus aureus was reported to bind type I procollagen and soluble collagen type I (Carret et al., 1985; Holderbaum et al., 1985). Later, the collagen “receptor” on S. aureus was identified as a 135 kDa cell wall-anchored protein and named Collagen Adhesin (CNA) (Speziale et al., 1986). Since then, bioinformatic analyses have identified homologous proteins in other Gram-positive bacteria. These include Enterococcus faecalis (Ace) (Rich et al., 1999), Enterococcus faecium (Acm) (Nallapareddy et al., 2003), Streptococcus mutans (Cnm) (Sato et al., 2004), Streptococcus equi (Cne) (Lannergard et al., 2003), S. mutans (Cbm) (Nomura et al., 2012), Bacillus anthracis (BA0871 and BA5258) (Xu et al., 2004a), Erysipelothrix rhusiopathiae (RspA and RspB) (Shimoji et al., 2003), and Streptococcus gallolyticus (Acb) (Sillanpää et al., 2009; Table 2). Amongst these proteins, CNA and Ace are the best-studied members.

TABLE 2

Protein name | Species | Collagen | KD | References
Cne | S. equi | I | 125 nM | van Wieringen et al., 2010
Cne | S. equi | II | 50 nM | van Wieringen et al., 2010
Cne | S. equi | III | 100 nM | van Wieringen et al., 2010
Cbm | S. mutans | I | ND | Nomura et al., 2012, 2013
Cbm | S. mutans | III | ND | Nomura et al., 2013
Cbm | S. mutans | IV | ND | Nomura et al., 2013
Acm | E. faecium | I | 3.8 μM | Nallapareddy et al., 2003
Acm | E. faecium | IV | 12.8 μM | Nallapareddy et al., 2003
Ace | E. faecalis | I | 48 μM | Rich et al., 1999; Ross et al., 2012
Ace | E. faecalis | IV | ND | Nallapareddy et al., 2000
Cna | S. aureus | I | 54 nM | Xu et al., 2004b; Ross et al., 2012
Acb | S. gallolyticus | I | 45 nM | Sillanpää et al., 2009
Acb | S. gallolyticus | IV | 0.3 μM | Sillanpää et al., 2009
Acb | S. gallolyticus | V | 0.5 μM | Sillanpää et al., 2009
Cnm | S. mutans | I | ND | Sato et al., 2004; Nomura et al., 2013
Cnm | S. mutans | III | ND | Nomura et al., 2013
Cnm | S. mutans | IV | ND | Nomura et al., 2013
BA0871 | B. anthracis | I | 1.6–3.2 μM | Xu et al., 2004a
BA5258 | B. anthracis | I | 0.6–0.9 μM | Xu et al., 2004a

Collagen binding CNA-like proteins.

ND stands for not determined.

With the exception of Acb, all CNA-like proteins are anchored directly to the cell wall. Acb is unique and is a minor pilus protein of S. gallolyticus (Sillanpää et al., 2009) but has a predicted CNA-like structure. Furthermore, it shares 50–70% sequence identity with Acm, Cna, and Cne (Sillanpää et al., 2009).

Structure

Collagen Adhesin is the prototype of collagen-binding MSCRAMMs (Foster et al., 2014; Foster, 2019). CNA-like proteins harbor an N-terminal signal sequence, an A-region, a variable number of characteristic B repeats, a C-terminal cell wall and membrane spanning region and a short cytoplasmic tail (Figure 1). The ligand-binding A-region of CNA-like proteins is further divided into two or three sub-domains: N1, N2, and N3 (Figure 1; Patti et al., 1993; Zong et al., 2005). X-ray crystallography of the CNA and Ace N1N2 sub-domains revealed that these domains adopt IgG-like folds called Dev-IgG and are consequently composed mostly of β-sheets (Figure 2; Foster et al., 2014; Foster, 2019). The N1 and N2 domains are connected by a rather long (10 aa) hydrophobic linker region, which creates a hole of ∼15 Å between the two domains and provides flexibility in domain orientation (Figure 2; Zong et al., 2005). Additionally, proteins in the CNA-like MSCRAMM family have a variable number of B repeats depending upon the protein (Patti et al., 1994a; Kang et al., 2013). One B repeat is ∼180 aa long and is further divided into two ∼90 aa subdomains, D1 and D2. The D subdomains adopt an inverse IgG fold and together the B repeats are thought to form a stalk projecting the ligand binding region away from the bacterial cell surface (Deivanayagam et al., 2000).

FIGURE 1

FIGURE 2

Binding Mechanism

The truncated N2 domain is the minimum collagen-binding region of CNA, although optimal binding is achieved by the N1N2 segment. The CNA N1N2 segment binds collagen type I with an affinity of 54 nM (Patti et al., 1993, 1995; Zong et al., 2005; Ross et al., 2012). Electron microscopy imaging of rCNA with collagen triple helix monomers revealed that CNA binds collagen at multiple sites, without any obvious preference for a “hot spot”. Surface plasmon resonance (SPR) studies of rCNA31–344 with synthetic collagen peptides further confirmed its preference for a triple helical structure (Zhang et al., 2015). CNA binds preferentially to cleaved collagen in damaged or inflamed tissues (Madani et al., 2017).

CNA-like proteins bind collagen by a “collagen hug” mechanism where the N1N2 segment “hugs” or wraps around the collagen triple helix molecule (Figure 2). A co-crystal of CNA in complex with the synthetic collagen peptide (GPO)4GPRGRT(GPO)4, where O is hydroxyproline, provided insights into the molecular basis of this model. The collagen hug binding mechanism is initiated when the collagen triple helix interacts with the shallow groove on the CNA N2 domain. This interaction is low affinity, and involves polar and hydrophobic residues (Zong et al., 2005). The initial interaction leads to structural rearrangements within the N1 domain that reposition N1 closer to the N2 domain, creating a “tunnel-like” structure. Finally, the C-terminal extension of the N2 domain undergoes structural changes and inserts into the N1 domain by β-strand complementation, thus forming a “latch” (Figure 2). The N1 domain of CNA interacts with the middle chain while the N2 domain interacts with the leading and trailing chains of the synthetic collagen peptide. The N1N2 linker region covers the collagen peptide and holds it in place (Zong et al., 2005; Liu et al., 2007).

The two-step binding mechanism of CNA to collagen was confirmed by atomic force microscopy studies where a moderate force (∼250 pN) was observed for the initial hydrophobic interaction between collagen and the N2 domain of CNA (Herman-Bausier et al., 2016). After binding collagen, a strong force of ∼1.2 nN was observed for the full interaction. Although B-repeats of CNA do not bind collagen directly, they act as a spring and help withstand the high mechanical stress encountered in vivo (Herman-Bausier et al., 2016).

Although all members of the CNA-like MSCRAMM family appear to bind collagen by a collagen hug mechanism, the proteins show differences in affinity (Table 2) and mechanistic details because of structural variations. For example, CNA has a higher affinity for the collagen triple helix than Ace (Ross et al., 2012). In contrast to the two-step mechanism used by CNA, Ace binds collagen with a rapid association and dissociation rate in a one-step binding mechanism (Rich et al., 1999; Ross et al., 2012).
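To give a feel for what the affinity difference between CNA (54 nM) and Ace (48 μM) for collagen type I means, the sketch below computes equilibrium occupancy with a simple one-site binding model, fraction bound = [L] / ([L] + KD). This model is an illustrative assumption and does not capture the two-step collagen hug mechanism; the function names and the 1 μM collagen concentration are hypothetical choices made for the example.

```python
# Conversion factors from common concentration units to mol/L.
UNIT_TO_MOLAR = {"pM": 1e-12, "nM": 1e-9, "uM": 1e-6, "mM": 1e-3}


def to_molar(value: float, unit: str) -> float:
    """Convert a concentration such as (54, 'nM') to mol/L."""
    return value * UNIT_TO_MOLAR[unit]


def fraction_bound(ligand_molar: float, kd_molar: float) -> float:
    """One-site equilibrium occupancy: [L] / ([L] + KD)."""
    return ligand_molar / (ligand_molar + kd_molar)


# KD values for collagen type I as listed in Table 2.
KD_COLLAGEN_I = {
    "CNA": to_molar(54, "nM"),
    "Ace": to_molar(48, "uM"),
}

if __name__ == "__main__":
    collagen = to_molar(1, "uM")  # hypothetical free collagen concentration
    for name, kd in KD_COLLAGEN_I.items():
        print(f"{name}: fraction bound = {fraction_bound(collagen, kd):.3f}")
```

Under these assumptions, CNA would be close to saturated at 1 μM collagen (about 95% occupied) while Ace would be only about 2% occupied, consistent with the roughly 900-fold difference in KD.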

Virulence

Most proteins in the CNA-like MSCRAMM sub-family have been shown to act as virulence factors in experimental bacterial infections. CNA-like proteins target collagen to enhance adhesion of the bacteria to host tissues in early and later stages of infection. For example, CNA is a critical virulence factor of S. aureus in experimental septic arthritis and osteomyelitis models and this role depends on its ability to bind collagen (Patti et al., 1994b; Elasri et al., 2002; Xu et al., 2004b). Although CNA is not required in the initial targeting of joints, it is critical for hematogenous spread of S. aureus leading to bone infections (Elasri et al., 2002). Additionally, more bacteria were isolated from joints of mice infected with collagen binding cna+ bacterial strains than those infected with non-collagen binding strains. Most CNA-like proteins also bind to collagen present in vegetations observed in non-bacterial thrombotic endocarditis, thus leading to infective endocarditis (Hienz et al., 1996; Nallapareddy et al., 2008; Singh et al., 2010). Ace and Acm, the enterococcal CNA-like proteins, are important virulence factors in infective endocarditis (Nallapareddy et al., 2008; Singh et al., 2010). The ace deletion mutant of E. faecalis strain OG1RF showed decreased colonization of heart valves in a mixed-infection rat endocarditis model compared to the wild type strain. Higher bacterial colony forming unit (CFU) counts were recovered from aortic valve vegetations at 4 h in mono endocarditis infection of rats with ace expressing E. faecalis OG1RF compared to the ace deletion mutant, indicating a role in early colonization of heart valves (Singh et al., 2010). Similarly, significantly more wild type (WT) E. faecium TX0082 CFUs were recovered from rat vegetations after mixed endocarditis infection compared to the acm deletion mutant E. faecium TX6051. Furthermore, Acm was also shown to enhance early adherence to heart valves (Nallapareddy et al., 2008). On the other hand, CNA’s ability to bind collagen is of limited significance in early stages of attachment to traumatized aortic valves, but like Acm and Ace (Nallapareddy et al., 2008), CNA does contribute to establishment of infection at a 24 h time point in both mono and mixed endocarditis infections of rats with S. aureus isolates (Hienz et al., 1996).

Cbm and Cnm are homologous S. mutans proteins with 78% identity in their collagen binding domains. The cnm gene is sufficient and necessary for primary human coronary artery endothelial cell invasion by S. mutans isolates, as shown with Δcnm S. mutans clinical isolates as well as cnm+ Lactococcus lactis (Abranches et al., 2011; Freires et al., 2017). The cnm gene also permits invasion of other non-phagocytic cells like human gingival fibroblasts and human oral keratinocytes (Miller et al., 2015). In addition, cnm+ S. mutans OMZ175 and cnm+ L. lactis outcompeted Δcnm S. mutans OMZ175 and L. lactis by 10- and 100-fold, respectively, in ex vivo bacterial adherence to aortic valve sections. Using a rabbit model of infective endocarditis, it was shown that cnm+ L. lactis mediated attachment to injured endocardium but not to the vegetations (Freires et al., 2017). Similar to Cnm, cbm+ S. mutans attaches to aortic valves and leads to larger vegetations formed on the impaired heart valve tissue compared to cbm-negative S. mutans.

In addition, collagen-binding proteins have been implicated in various infections. For example, CNA has been implicated in pathogenesis of S. aureus keratitis (Rhem et al., 2000) and orthopedic prosthesis infections (Montanaro et al., 1999). Similarly, Cnm has been implicated in S. mutans cerebral hemorrhaging (Tonomura et al., 2016) and colonization of dental pulp (Nomura et al., 2016).

M and M-Like Proteins

M-protein, described by Rebecca Lancefield almost a century ago (Lancefield, 1928), is a major cell wall-anchored protein and virulence factor present on the surface of Group A, B, and C streptococci (GAS, GBS, and GCS) (Dinkla et al., 2007; Barroso et al., 2009; Reissmann et al., 2012). There are ∼250 known M-protein types in GAS based on sequence variation in the first 50 amino acids of the protein. Variations in the M-protein lead to strain-specific immunity and, hence, M-proteins serve as a strain typing marker (Lancefield, 1928). M proteins have multiple functions, including inhibition of phagocytosis and binding to fibrinogen, collagen, complement, and other host proteins (Metzgar and Zampolli, 2011).

Structure

M proteins are multi-domain proteins that adopt an elongated α-helical structure and dimerize to form helical coiled-coil structures, a structural form also seen in mammalian proteins like tropomyosin and myosin (McNamara et al., 2008; Fischetti, 2016). M-protein fibrils are ∼500 Å long and coat the surface of Group A streptococcus (Phillips et al., 1981; Fischetti, 1989). When viewed by transmission electron microscopy, M-protein appears like “fuzz on a tennis ball” (Phillips et al., 1981). All M-proteins contain a signal peptide, a hypervariable region, a less variable central domain and a highly conserved C-terminus (Figure 1; Fischetti, 1989).

The prototypic M6 protein consists of a cleavable signal sequence, A repeats, which include the hypervariable region (HVR), B repeats, C repeats, a D-region and an LPXTG motif for sortase mediated anchoring to the cell wall (Figure 1). The HVR is the first 50 amino acids of the mature M protein and shows variation amongst the different M-proteins. The M6 A-repeat region consists of five repeats of 14 amino acids each, where the central repeats are identical and the end repeats are slightly divergent (Smeesters et al., 2010; Fischetti, 2016). The B-repeat region contains five repeats, each 25 amino acids long (Fischetti, 1989, 2016). The M6 protein contains two C-repeats where each repeat is 35 residues long (Fischetti, 1989, 2016). C-repeats show higher sequence conservation compared to A- and B-repeats. Lastly, the M6 protein contains four D-repeats, each 7 amino acids long (Fischetti, 1989, 2016). Amongst the A, B, C, and D repeat regions, D-repeats show the highest sequence homology to each other for any M protein (Smeesters et al., 2010). Together, A-, B-, C-, and D-repeats form the central helical rod (Fischetti, 1989, 2016, 2019; McNamara et al., 2008).
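As a quick arithmetic check on the M6 repeat architecture described above, the sketch below totals the residues contributed by the A-, B-, C-, and D-repeats and converts that total into an approximate rod length. The 1.5 Å rise per residue is a textbook approximation for an α-helix and is an assumption introduced here, not a value taken from the cited studies; the variable and function names are hypothetical.

```python
# Repeat region -> (number of repeats, residues per repeat), as described for M6.
M6_REPEATS = {
    "A": (5, 14),
    "B": (5, 25),
    "C": (2, 35),
    "D": (4, 7),
}

# Assumed axial rise per residue for an α-helix, in ångström (textbook approximation).
RISE_PER_RESIDUE_ANGSTROM = 1.5


def residues_per_region(repeats: dict) -> dict:
    """Total residues contributed by each repeat region."""
    return {name: count * length for name, (count, length) in repeats.items()}


per_region = residues_per_region(M6_REPEATS)
total_residues = sum(per_region.values())
approx_length = total_residues * RISE_PER_RESIDUE_ANGSTROM

if __name__ == "__main__":
    print(per_region)
    print(f"repeat residues: {total_residues}")
    print(f"approximate helical length: {approx_length:.1f} Å")
```

Under this assumption the repeat regions alone account for roughly 440 Å, the same order of magnitude as the ∼500 Å fibril length reported for M-protein.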

As observed in tropomyosin and myosin, the coiled-coil nature of a protein molecule comes from heptad repeats, where the first and fourth residues in the register are generally hydrophobic (Fischetti, 1989, 2016; McNamara et al., 2008). Hydrophobic residues form the core of the coiled coil and the remaining residues in the heptad repeats are generally helix promoting (McNamara et al., 2008; Fischetti, 2016). Heptad repeats found in M-proteins are not perfect, which leads to irregularities and instabilities of the coiled-coil region (McNamara et al., 2008; Macheboeuf et al., 2011). McNamara et al. found that destabilizing residues in the coiled-coil region of M1 protein promote conformational dynamics, which is required for binding of M1 protein to fibrinogen (McNamara et al., 2008; Stewart et al., 2016). These irregularities in the heptad repeats also form the basis for sub-division of the protein into A-, B-, and C-repeats (Fischetti, 2019).
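The heptad pattern can be made concrete by scoring how often the first and fourth positions of each heptad (conventionally labeled a and d) carry hydrophobic residues. The following is a minimal sketch: the hydrophobic residue set, the function names, and the test sequences are illustrative assumptions, not data from M-proteins.

```python
# Residues treated as hydrophobic for this illustration (an assumed, simplified set).
HYDROPHOBIC = set("AILMFVWY")


def core_hydrophobic_fraction(seq: str, offset: int = 0) -> float:
    """Fraction of a/d heptad positions occupied by hydrophobic residues.

    `offset` is the index of the residue assigned to heptad position 'a';
    position 'd' then falls three residues later in each heptad.
    """
    seq = seq.upper()
    core = [res for i, res in enumerate(seq) if (i - offset) % 7 in (0, 3)]
    if not core:
        return 0.0
    return sum(res in HYDROPHOBIC for res in core) / len(core)


def best_register(seq: str) -> tuple:
    """Return (offset, score) for the heptad register with the most hydrophobic core."""
    scores = [core_hydrophobic_fraction(seq, offset) for offset in range(7)]
    best = max(range(7), key=scores.__getitem__)
    return best, scores[best]


if __name__ == "__main__":
    ideal = "LKELEDK" * 4                   # L at both a and d in every heptad
    imperfect = "LKELEDK" * 3 + "DKELEDK"   # one core position replaced by aspartate
    print(best_register(ideal))
    print(core_hydrophobic_fraction(imperfect))
```

A score below 1.0 in the best register flags non-ideal core positions of the kind that, in M-proteins, are associated with irregularities and instability of the coiled coil.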

Sequence and structural variations amongst M-proteins are common. Homologous recombination in M-protein leads to differences in the frequency and length of the repeats and an overall variation in size (Fischetti, 1989). As a result, A- and B- repeats are not present in all M-proteins and when present, their sizes can vary. However, all M-proteins contain C-repeats and their total number can vary from two to four (Smeesters et al., 2010). The sequence variations between M-proteins lead to functional differences and hence not all M-proteins possess all the functional capabilities described in the literature.

Binding Mechanism

Amongst the >250 known types of M-proteins, about 20 have been shown to bind collagen (Table 3). M-proteins bind directly to the triple helical regions of collagen (Nitsche et al., 2006; Barroso et al., 2009; Dinkla et al., 2009; Bober et al., 2011; Reissmann et al., 2012) with the exception of the M1 protein, which also interacts with the globular domain of collagen type VI (Bober et al., 2010). Rotary shadowing electron microscopy revealed that M3 protein binds collagen type IV at two different sites: one located on cyanogen bromide fragment 3 (CB3) and the other at a site 20 nm away from the 7S domain (Eble et al., 1993). CB3 is a fragment of collagen type IV that maintains its triple helix and is generated after cleavage of collagen with cyanogen bromide (Eble et al., 1993). When expressed on the surface of a heterologous non-collagen binding host (Streptococcus gordonii GP1221), M-proteins from GCS and Group G streptococci (GGS) enabled GP1221 to bind to collagen type IV at the same level as GCS and GGS (Barroso et al., 2009).

TABLE 3

M-protein | Species | Collagen | KD | References
stG4545.0 | SDSE | IV | 1.8 pM | Barroso et al., 2009
stC2sk.0 | SDSE | IV | 3.5 pM | Barroso et al., 2009; Reissmann et al., 2012
stC5344 | SDSE | IV | 920 pM | Barroso et al., 2009; Reissmann et al., 2012
stG2574.0 | SDSE | IV | 1.2 nM | Barroso et al., 2009; Reissmann et al., 2012
stC-NSRT2.0 | SDSE | IV | 830 pM | Barroso et al., 2009; Reissmann et al., 2012
stG10.0 | SDSE | IV | 610 pM | Barroso et al., 2009; Reissmann et al., 2012
FOG (stG11.0) | SDSE | I | 80 pM | Nitsche et al., 2006
FOG (stG11.0) | SDSE | IV | 6 nM | Dinkla et al., 2007; Barroso et al., 2009
M3 | S. pyogenes | IV | 5 nM | Dinkla et al., 2009; Reissmann et al., 2012
stG97 | SDSE | IV | ND | Reissmann et al., 2012
stC6746 | SDSE | IV | ND | Reissmann et al., 2012
M31.5 | SDSE | IV | 0.6 nM | Reissmann et al., 2012
M3.22 | S. pyogenes | IV | ND | Reissmann et al., 2012
stG211.1 | SDSE | IV | ND | Reissmann et al., 2012
stG120.1 | SDSE | IV | ND | Reissmann et al., 2012
stG351 | SDSE | IV | ND | Reissmann et al., 2012
stCQ343 | SDSE | IV | ND | Reissmann et al., 2012
stG211.0 | SDSE | IV | ND | Reissmann et al., 2012
stC922 | SDSE | IV | ND | Reissmann et al., 2012
M55 | S. pyogenes | IV | 5 nM | Reissmann et al., 2012
M1 | S. pyogenes | I | 54 nM | Bober et al., 2011
M1 | S. pyogenes | VI | ND | Bober et al., 2010

Collagen binding M-proteins.

SDSE stands for S. dysgalactiae subsp. equisimilis; ND stands for not determined.
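Because the dissociation constants in Table 3 span the picomolar to nanomolar range, it can help to place them on a common energy scale. The sketch below converts a few of the tabulated KD values to standard binding free energies using ΔG° = RT ln(KD / 1 M). The temperature of 25 °C, the helper names, and the selection of entries are assumptions made for illustration, and values obtained with different assays may not be directly comparable.

```python
import math

R_KJ = 8.314462618e-3  # gas constant in kJ/(mol*K)
UNIT_TO_MOLAR = {"pM": 1e-12, "nM": 1e-9, "uM": 1e-6}

def kd_to_molar(value, unit):
    """Convert a KD given as (value, unit) to mol/L."""
    return value * UNIT_TO_MOLAR[unit]

def delta_g_kj_per_mol(kd_molar, temp_k=298.15):
    """Standard binding free energy, dG = RT ln(KD / 1 M); 25 C is assumed."""
    return R_KJ * temp_k * math.log(kd_molar)

# A few collagen type IV entries copied from Table 3.
TABLE3_KD = {
    "stG4545.0": (1.8, "pM"),
    "stG10.0": (610, "pM"),
    "FOG (stG11.0)": (6, "nM"),
    "M3": (5, "nM"),
}

# Print the entries from tightest to weakest binder.
for name, (value, unit) in sorted(TABLE3_KD.items(),
                                  key=lambda kv: kd_to_molar(*kv[1])):
    kd = kd_to_molar(value, unit)
    print(f"{name:15s} KD = {kd:.2e} M  dG = {delta_g_kj_per_mol(kd):6.1f} kJ/mol")
```

More negative values correspond to tighter binding, so the picomolar binders sit well below the nanomolar ones on this scale.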

Peptide associated with rheumatic fever (PARF) is an eight-residue motif present in the hypervariable A region of some M- and M-like proteins (Dinkla et al., 2007; Barroso et al., 2009; Reissmann et al., 2012). Based on careful examination of multiple M-proteins from 69 isolates, a consensus sequence of the PARF motif was determined to be (A/T/E)XYLXX(L/F)N, where charged amino acids are preferred at positions 2, 5, and 6, with at least one of the charged amino acids containing a basic side chain (Barroso et al., 2009; Reissmann et al., 2012). A PARF motif is required for binding of these M-proteins to collagen (Dinkla et al., 2007; Reissmann et al., 2012), as one or two substitutions of the conserved residues in the PARF motif abolish binding to collagen type IV (Reissmann et al., 2012). However, additional data suggest that the binding of M-proteins to collagen can be more complicated and extends beyond the PARF motif. First, a series of recombinant truncated PARF-containing versions of an M-protein bind collagen with significantly different affinities (Dinkla et al., 2007). A full-length recombinant M-protein of GGS called “fibrinogen-binding protein of G streptococci” (FOG) binds to collagen type IV with a KD of 6 nM, whereas a truncated FOG protein containing A- and B-repeats binds collagen type IV with a 24-fold higher KD and a FOG protein containing the A-region only binds collagen type IV with a 200-fold higher KD compared to the full-length FOG protein (Dinkla et al., 2007). Similarly, a truncated recombinant FOG protein binds collagen type I with a 20-fold higher KD than the full-length recombinant FOG protein (Nitsche et al., 2006). Furthermore, Reissmann et al. (2012) identified M-proteins with PARF motifs that did not bind collagen type IV. Interestingly, M-proteins stG120.1, stG120.0, and stGM220 all contain the same PARF motif but only stG120.1 binds collagen type IV, while all three proteins bind fibrinogen. Moreover, the M1-protein lacks a PARF motif (Reissmann et al., 2012) but still binds to the triple helix of collagen types I and IV (Bober et al., 2011) and globular domains of collagen type VI (Bober et al., 2010).
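The PARF consensus (A/T/E)XYLXX(L/F)N translates directly into a pattern search. The sketch below is a minimal illustration: the function name, the choice to treat only K and R as basic and D, E, K, R as charged, and the example sequence are assumptions made here and are not taken from the cited studies. As the paragraph above makes clear, a motif match does not by itself predict collagen binding.

```python
import re

# (A/T/E) X Y L X X (L/F) N; the lookahead lets overlapping matches be found.
PARF_RE = re.compile(r"(?=([ATE].YL..[LF]N))")
CHARGED = set("DEKR")  # assumption: histidine is not counted as charged
BASIC = set("KR")

def find_parf(seq):
    """Return PARF-like motifs with a summary of the variable positions."""
    hits = []
    for m in PARF_RE.finditer(seq):
        motif = m.group(1)
        variable = [motif[1], motif[4], motif[5]]  # motif positions 2, 5, 6
        hits.append({
            "start": m.start() + 1,  # 1-based position in the sequence
            "motif": motif,
            "charged_at_2_5_6": sum(aa in CHARGED for aa in variable),
            "has_basic": any(aa in BASIC for aa in variable),
        })
    return hits

# Hypothetical sequence fragment, invented purely to exercise the pattern.
print(find_parf("MKTAKYLKELNQ"))
```

Reporting the charge content of positions 2, 5, and 6 separately, rather than requiring it in the pattern, reflects that charged residues are described as preferred rather than mandatory.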

Binding of M-proteins to different types of collagen can have different consequences. Binding of M-proteins to collagen type IV leads to aggregation of collagen on the surface of the bacteria (Dinkla et al., 2003b, 2007; Barroso et al., 2009), which is not observed for the interaction of collagen type I with M-protein (Barroso et al., 2009). Expression of M-protein on the surface of a heterologous host leads to collagen type IV aggregation, demonstrating that the M-protein alone is sufficient for collagen aggregation.

Virulence

M or M-like proteins are major virulence factors of streptococci and their role in streptococcal pathogenesis has been reported on extensively (Oehmcke et al., 2010; Smeesters et al., 2010; Frost et al., 2017; Fischetti, 2019). In this review article, we will focus on the contribution of the M-protein:collagen interaction to the pathogenesis of streptococci. Binding of M-proteins to collagen can have two consequences: (1) mediating bacterial adhesion to connective tissues and (2) inducing collagen auto-immunity.

M-protein binding to collagen is important in the colonization of human skin by streptococci (Nitsche et al., 2006). When incubated with human dermis ex vivo, higher CFU counts were recovered from a GGS strain expressing FOG protein compared to a FOG-deficient strain. Incubation of the bacteria with collagen type I decreased adherence of the FOG expressing strain to human dermis, thereby also suggesting that the interaction of FOG with collagen type I enables adhesion.

Acute rheumatic fever (ARF) and rheumatic heart disease are antibody-mediated autoimmune sequelae that can develop after a streptococcal infection (Tandon et al., 2013; Carapetis et al., 2016). Binding of M-protein to collagen has been shown to be a relevant factor in developing ARF (Dinkla et al., 2003b, 2007; Barroso et al., 2009). Binding of M or M-like protein to collagen type IV can lead to production of antibodies binding the collagen molecule (Dinkla et al., 2003b, 2007; Barroso et al., 2009). Analysis of mouse sera obtained from immunization with recombinant M or M-like protein led to identification of two distinct antibody populations: anti-collagen type IV antibodies and anti-M protein antibodies. These distinct antibodies did not cross-react with each other (Dinkla et al., 2007), indicating that collagen type IV autoimmunity was not generated through molecular mimicry. In addition, sera of ARF patients contain antibodies that specifically recognize the CB3 region of collagen type IV and the collagen-binding region of the M3 protein (Dinkla et al., 2007, 2009). The N-terminal half of the protein containing the PARF motif is required for generating auto-immunity (Dinkla et al., 2007). Immunization of mice with full-length FOG led to a significantly higher titer of anti-collagen type IV antibodies compared to mice immunized with FOGB2-C2, a region of FOG that does not bind collagen (Dinkla et al., 2007). Similar results have been obtained with other M-proteins (Dinkla et al., 2007; Barroso et al., 2009). While auto-antibodies to collagen type I have not been demonstrated, given the structural similarities between the collagens, anti-collagen type IV antibodies potentially could also react with other collagen types.

Emerging Families of Collagen-Binding Proteins

Numerous collagen-binding proteins of Gram-positive pathogens have been reported in the literature but their mechanisms of collagen binding are unclear. We have identified three emerging families of collagen-binding proteins where, although one or more family members bind to collagen, a clear picture of how these proteins bind to collagen is not yet available.

von Willebrand Factor A-Domain Containing Proteins

von Willebrand factor (vWF) is a host glycoprotein found in blood, blood vessel ECM, and platelet α-granules (Manon-Jensen et al., 2016). vWF is a large modular protein that contains two binding sites for collagen located in the A1 and A3 domains. The A3 domain of vWF binds collagen types I and III whereas the A1 domain binds collagen types IV and VI (Manon-Jensen et al., 2016). Crystal structures of both A1 and A3 domains show a central β-sheet composed of six β-strands and flanked on both sides by α-helices (Huizinga et al., 1997; Emsley et al., 1998). These domains are structurally similar to the I-domain of some integrin α-chains, including the collagen-binding α1, α2, α10, and α11 chains. The collagen-binding α-chain integrins also contain a metal ion-dependent adhesion site (MIDAS) important for ligand binding (Lee et al., 1995; Huizinga et al., 1997; Emsley et al., 1998).

Structural homologs of vWF A-domains, called vWA domains, have been found in minor pilus proteins that bind to ECM proteins and host cells. These pilus proteins include RrgA from Streptococcus pneumoniae (Izore et al., 2010), GBS104 from Streptococcus agalactiae (Krishnan et al., 2013), PilA from S. agalactiae (Konto-Ghiorghi et al., 2009; Banerjee et al., 2011), SpaC from Corynebacterium diphtheriae (Mandlik et al., 2007), and EbpA from E. faecalis (Nielsen et al., 2012). Most structural information about bacterial vWA domains comes from crystal structures of the RrgA and GBS104 proteins (Izore et al., 2010; Krishnan et al., 2013). RrgA and GBS104 are homologs that share 51% sequence identity and have a similar domain organization (Krishnan et al., 2013). Both proteins contain an N-terminal signal sequence, four D domains named D1, D2, D3, and D4, and a C-terminal sorting signal (Figure 1). The primary sequence of both the D1 and D2 domains is non-contiguous and is divided into two regions, one present in the N-terminal half and the other present in the C-terminal half of the protein (Figure 1). The two regions fold back on each other to form the tertiary structure of the D1 and D2 domains. The D3 domain is inserted between the two regions encoding the D1 and D2 domains, and the D4 region is located distal to the C-terminal half of the D1-D2 domain (Figure 1). It is worth noting that while RrgA and GBS104 are structural homologs, other pilus proteins containing vWA domains, such as PilA, have a different overall domain organization (Mandlik et al., 2007; Konto-Ghiorghi et al., 2009; Izore et al., 2010; Banerjee et al., 2011; Nielsen et al., 2012; Krishnan et al., 2013).

The D3 domains of both RrgA and GBS104 adopt a structure similar to the vWF A-domain. They consist of a central β-sheet flanked by α-helices on both sides, as seen in the vWF A-domain and the integrin I-domain (Figure 3A; Huizinga et al., 1997; Emsley et al., 1998; Izore et al., 2010; Krishnan et al., 2013). In addition, both RrgA and GBS104 have two arms inserted into the vWA-domain that are absent in the A-domains of vWF and the I-domain of integrins (Lee et al., 1995; Huizinga et al., 1997; Emsley et al., 1998; Izore et al., 2010; Krishnan et al., 2013). The first arm of RrgA contains two β-hairpins folded together to form an elongated arm (Figure 3A). The second arm of RrgA consists mostly of loops, along with one short hairpin and two α-helices (Figure 3A; Izore et al., 2010; Krishnan et al., 2013). The two inserted arms project away from the core of the domain and extend the length of the protein (Figure 3A; Izore et al., 2010). The D3 domain of the two bacterial proteins also contains a MIDAS motif (Figure 3B) present in the I-domain of integrins but absent in the vWF A-domains.

FIGURE 3

Amongst the pilus proteins with a vWA-domain, RrgA, PilA, and GBS104 have been reported to bind collagen. RrgA binds collagen type I, fibronectin, and laminin (Hilleringmann et al., 2008; Moschioni et al., 2010). However, the D3 vWA domain alone was not able to bind ECM proteins (Moschioni et al., 2010). Full-length RrgA protein is required for binding (Moschioni et al., 2010). RrgA binds to collagen type I with weaker force than expected for a ligand:receptor interaction. It has been suggested that the low binding force might help the pilus adhere and detach under physiological flow conditions. However, kinetic data for the RrgA:collagen type I interaction are lacking and the suggested consequences of the low binding force await elucidation (Becke, 2019). Although recombinant PilA has been reported to bind collagen, its role in S. agalactiae collagen binding is not clear (Banerjee et al., 2011; Dramsi et al., 2012). Similarly, GBS104 has been reported to bind collagen, but the interaction in a solid-phase binding assay is weak and does not reach saturation, indicating that the interaction of GBS104 and collagen type I may not be specific or have functional relevance (Krishnan et al., 2013).
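Why a weak interaction fails to saturate in a solid-phase assay can be seen from the simplest one-site binding model, in which fractional occupancy equals [L] / (KD + [L]). The sketch below is purely illustrative: the titration points and the weak KD are hypothetical (no KD has been reported for GBS104 and collagen), and the tight KD reuses the 6 nM value quoted for full-length FOG and collagen type IV.

```python
def fraction_bound(ligand_nM, kd_nM):
    """One-site binding model: fractional occupancy = [L] / (KD + [L])."""
    return ligand_nM / (kd_nM + ligand_nM)

CONCENTRATIONS_NM = [1, 10, 100, 1000]  # hypothetical titration points
TIGHT_KD_NM = 6.0      # full-length FOG : collagen type IV, quoted in the text
WEAK_KD_NM = 10_000.0  # hypothetical weak binder, NOT a measured value

for conc in CONCENTRATIONS_NM:
    tight = fraction_bound(conc, TIGHT_KD_NM)
    weak = fraction_bound(conc, WEAK_KD_NM)
    print(f"[L] = {conc:>5} nM  tight: {tight:.2f}  weak: {weak:.2f}")
```

Under these assumptions the tight binder approaches full occupancy within the titration range while the weak binder remains far from it, which is the qualitative behavior described for GBS104.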

Two different binding regions in the vWA domain-containing pilus proteins have been proposed: the vWA-domain with the MIDAS motif and the U-shaped cradle formed by the inserted arms (Figure 3B; Izore et al., 2010). The apo-crystal structure of the vWA-domain with the MIDAS motif revealed a trench-like region formed by the two inserted arms and the MIDAS motif present on the central β-sheet (Izore et al., 2010; Krishnan et al., 2013). Based on structural comparison with co-crystals of integrin α2β1 and a synthetic triple helix peptide, the trench-like region has been proposed to be the collagen binding site (Figure 3B; Emsley et al., 2000; Izore et al., 2010). The vWA-domain and the integrin I-domain undergo a conformational change during binding events and transition from a closed form to an open form. Participation of the trench-like region and a change in conformation upon ECM binding were confirmed using an open form of the GBS104-D3 domain stabilized by a disulfide bridge. The open form of the GBS104-D3 domain alone was sufficient for binding to fibronectin, whereas the closed form of the D3 domain showed no binding (Krishnan et al., 2013). The vWA-domains of pilus proteins vary considerably in their primary sequence, with the most variation in the inserted arms (Konto-Ghiorghi et al., 2009; Izore et al., 2010; Krishnan et al., 2013). Therefore, despite structural similarities, these pilus proteins have been suggested to bind different ligands with different affinities (Izore et al., 2010; Krishnan et al., 2013). A second binding site is the U-shaped cradle formed by the two inserted arms joining together at the tip of the protein (Figure 3B). This cradle contains basic residues and has been proposed to bind negatively charged molecules like glycosaminoglycans attached to ECM proteins (Izore et al., 2010). While the vWA domain is critical for virulence (Konto-Ghiorghi et al., 2009; Nielsen et al., 2012), evidence that the vWA domain of RrgA is responsible for binding to collagen is lacking.

Leucine Rich Repeat Containing Proteins

Leucine rich repeats (LRRs) are protein recognition motifs present in eukaryotic proteins with diverse functions (Kobe and Kajava, 2001). Small leucine rich proteoglycans (SLRPs) in mammals are an example of LRR proteins and play important roles in collagen fibrillogenesis (Kalamajski and Oldberg, 2010). LRR-containing proteins have been found in plants, animals, fungi, and some pathogenic bacteria, e.g., Yersinia pestis and Listeria monocytogenes (Kobe and Kajava, 2001). Each repeat is 20–29 aa long, and multiple LRRs are often present in tandem, forming an overall curved shape in which β-sheets are present on the concave side and α-helices are often on the convex side (Kobe and Kajava, 2001).

Streptococcal leucine rich (Slr) protein is an LRR-containing lipoprotein present on the surface of S. pyogenes (Bober et al., 2011). The N-terminal half of the protein contains a 21 aa long signal sequence and 4 histidine triad motifs (Figure 1; Bober et al., 2011). The C-terminal half of the protein contains 13 leucine rich regions that form β-sheets, followed by histidine rich repeat sequences (Figure 1; Bober et al., 2011). The horseshoe shape of Slr was visible with electron microscopy (Bober et al., 2011). Orthologs of Slr have been identified in Streptococcus suis 05ZYH33 (1577), Streptococcus equi subsp. zooepidemicus H70 (13200), Streptococcus dysgalactiae subsp. equisimilis GGS_124 (1372), Streptococcus agalactiae (Blr), Streptococcus uberis 0140J (1212), and Streptococcus suis 2 (HtpsC) (Waldemarsson et al., 2006; Plumptre et al., 2012; Li et al., 2015). Slr has been demonstrated to bind collagen directly with a KD of 12 nM (Waldemarsson et al., 2006). Interestingly, HtpsC has been reported to bind laminin and fibronectin, but HtpsC did not bind collagen type I (Li et al., 2015).

The extracellular matrix protein (Emp) of S. aureus is a 340 aa long secreted protein with a 26 aa long signal peptide at the N-terminus. Emp binds collagen type I with a KD of 27 nM. Emp is structurally intriguing, as it is not predicted to be multi-domained. When viewed through a transmission electron microscope, the Emp monomer was revealed to form a horseshoe-type structure with an 8 nm diameter. Interestingly, even though Emp lacks leucine rich repeats, structure prediction with I-TASSER identified leucine rich repeat proteins as its top ten structural analogs (Geraci et al., 2017).

While molecular details of Slr and Emp binding to collagen have not been studied beyond the confirmation of their interaction, their intriguing overall structural similarities lead the way for postulating an emerging LRR-containing or LRR-like protein family that binds collagen. Given that several human LRR proteins [e.g., decorin (Schönherr et al., 1995), fibromodulin (Font et al., 1998)] interact with collagen, it is not surprising that bacterial LRR proteins bind collagen. Additional studies are needed to determine the residues that mediate the interaction and to compare those interactions with the interactions between host LRR proteins and collagen.

Sgo0707 N1-Domain Containing Proteins

Streptococci express multiple surface proteins that have been reported to bind collagen (Avilés-Reyes et al., 2017). One emerging family of collagen-binding proteins in streptococci is related to the N1-domain of the Sgo0707 protein from S. gordonii. The Sgo0707 protein, which has been shown to bind collagen, contains an N-terminal signal sequence, a 419 aa long N-terminal region, eight repeats of 84 aa, five repeats of 88 aa, a unique domain, an LPXTG cell wall sorting signal and a transmembrane helix (Figure 1; Nylander et al., 2013). The 419 aa long N-terminal region is divided into two domains: N1 and N2 (Figure 1). Both the N1 and N2 domains adopt a β-sandwich with anti-parallel β-sheets (Nylander et al., 2013), where β-sheet 1 contains nine β-strands and β-sheet 2 contains eight strands (Figure 4A). The N1 domain also contains two small sub-domains, A and B. The N2 domain consists of two β-sheets of five β-strands and a third small sheet of three strands and adopts a DeV-IgG fold also observed in the N1N2 domains of CNA (Nylander et al., 2013).

FIGURE 4

A search for proteins with similar N1 domains identified the variable domains in two Ag I/II family proteins: SpaP from S. mutans and SspB from S. gordonii. Both these domains are predicted to adopt a similar structure despite having only 10% sequence identity to N1 of Sgo0707 (Forsgren et al., 2009; Larson et al., 2010; Nylander et al., 2013). These proteins have a different domain organization than Sgo0707, with an N-terminal signal sequence, alanine rich repeats, a variable domain, proline rich repeats, a C-terminal domain, and an LPXTG cell wall sorting signal (Forsgren et al., 2009; Larson et al., 2010). All three proteins form an extended conformation with the putative collagen binding domain (N-region of Sgo0707 and variable domain of SpaP and SspB) predicted to be located at the tip of the protein (Forsgren et al., 2009; Larson et al., 2010; Nylander et al., 2013).

Docking of the collagen triple helix to the Sgo0707 N1N2 domain identified two different potential binding sites (Figures 4B,C). The first binding site is on top of the N1 domain in the open cleft formed by the two subdomains in the N1 domain (Figure 4B). This site has a higher negative surface potential compared to the SspB and SpaP proteins, and lacks the metal ion located in the cleft found in both of the Ag I/II proteins (Forsgren et al., 2009; Larson et al., 2010; Nylander et al., 2013). A second putative collagen-binding site is formed by the loops of the N1 domain and a β-sheet of the N2 domain, which together form a concave surface where collagen can dock (Figure 4C; Nylander et al., 2013). The concave site consists of mostly non-polar residues (Nylander et al., 2013).

All three proteins (Sgo0707, SspB, and SpaP) have been implicated in collagen binding. Binding of the three proteins to collagen type I was shown in a bacterial adhesion assay using deletion mutants (Love et al., 1997, 2000; Nylander et al., 2013). Deletion of the sgo0707 gene in S. gordonii DL1 decreased collagen type I binding by 40% compared to the WT strain (Nylander et al., 2013). Similarly, an isogenic deletion mutant of the sspB gene in S. gordonii and of the spaP gene in S. mutans showed decreased binding to collagen type I compared to WT strains (Love et al., 1997, 2000). Additionally, binding of S. gordonii DL1 to collagen type I in a bacterial adhesion assay was inhibited by recombinant N-region of Sgo0707, thus narrowing down the N-region as the collagen binding partner (Nylander et al., 2013). While the three proteins have been implicated in collagen binding, their direct binding to collagen has not been demonstrated. The three proteins only share structurally similar N1-domains as both SspB and SpaP lack the N2 domain found in Sgo0707 (Love et al., 1997, 2000; Nylander et al., 2013). Do the three proteins bind collagen at the cleft on top of the N1-domain? Further studies are required to narrow down the collagen-binding site in these proteins and to determine if they form a structural family of proteins that bind collagen.

Concluding Remarks

Gram-positive pathogens utilize their interactions with the ECM for tissue colonization and to establish infections in the host. Molecular insight into these interactions can pave the way for the design of novel anti-infectives. However, studies of collagen-binding proteins in Gram-positive pathogens are in their infancy and do not provide a complete picture of the different binding mechanisms involved. Further structural studies are required to fully understand the molecular basis for the interaction between bacterial collagen-binding proteins and the triple helix of collagen. In particular, the interaction of emerging collagen-binding protein families with collagen needs to be further characterized using biochemical and microbiological techniques to determine which family members bind collagen.

Mammalian proteins containing a collagen-like region play a role in host defense. Collagen-binding host proteins, e.g., the LRR proteoglycan decorin, bind soluble host defense collagens (Krumdieck et al., 1992). Interaction of bacterial collagen-binding proteins with soluble defense collagens can provide an opportunity for pathogens to evade the host immune response. CNA binds C1q, a complement protein and collagen (Kang et al., 2013). The classical complement pathway is initiated upon recognition of pathogen-bound antibodies by the C1 complex, which consists of C1q, C1r, and C1s. C1q protein contains the globular recognition domain and binds pathogen-bound antibodies. C1r and C1s are proteases that are required for the complement cascade. C1r and C1s bind the collagen-like stalk of C1q (Mortensen et al., 2017). CNA uses its interaction with C1q for immune evasion by interfering with the interaction between C1r and C1q and thus deactivating the C1 complex (Kang et al., 2013).

Interaction of collagen-binding bacterial proteins with other host proteins containing collagen-like regions, especially soluble human defense collagens, is an understudied area. While acknowledging that not all collagen-binding proteins will bind soluble defense collagens and vice versa, future studies focusing on the interaction between bacterial collagen binding proteins and host defense collagens will lead to a better understanding of the pathogenic mechanisms utilized by Gram-positive bacteria.

Statements

Author contributions

SA and JG wrote the manuscript. SA and MH edited the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by NIH R01 grant AI020624-35.

Acknowledgments

We would like to thank Dr. Stefan Siwko for his assistance in editing the review article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    Abranches J., Miller J. H., Martinez A. R., Simpson-Haidaris P. J., Burne R. A., Lemos J. A. (2011). The collagen-binding protein Cnm is required for Streptococcus mutans adherence to and intracellular invasion of human coronary artery endothelial cells. Infect. Immun. 79, 2277–2284. 10.1128/iai.00767-10

  • 2

    Arvanitidis A., Karsdal M. A. (2016). “Type XV Collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 15, ed. Karsdal M. A. (Cambridge, MA: Academic Press), 97–99.

  • 3

    Avilés-Reyes A., Miller J. H., Lemos J. A., Abranches J. (2017). Collagen-binding proteins of Streptococcus mutans and related streptococci. Mol. Oral. Microbiol. 32, 89–106. 10.1111/omi.12158

  • 4

    Bager C. L., Karsdal M. A. (2016). “Type XVIII Collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 18, ed. Karsdal M. A. (Cambridge, MA: Academic Press), 113–121.

  • 5

    Banerjee A., Kim B. J., Carmona E. M., Cutting A. S., Gurney M. A., Carlos C., et al. (2011). Bacterial Pili exploit integrin machinery to promote immune activation and efficient blood-brain barrier penetration. Nat. Commun. 2:462. 10.1038/ncomms1474

  • 6

    Barroso V., Rohde M., Davies M. R., Gillen C. M., Nitsche-Schmitz D. P., Dinkla K., et al. (2009). Identification of active variants of PARF in human pathogenic group C and group G streptococci leads to an amended description of its consensus motif. Int. J. Med. Microbiol. 299, 547–553. 10.1016/j.ijmm.2009.04.004

  • 7

    Becke T. (2019). Streptococcus Pneumoniae TIGR4 Pilus-1 Biomechanical Aspects of Adhesion During Interaction With Host Extracellular Matrix Proteins Fibronectin and Collagen I. München: Technische Universität München.

  • 8

    Bober M., Enochsson C., Collin M., Mörgelin M. (2010). Collagen VI is a subepithelial adhesive target for human respiratory tract pathogens. J. Innate Immun. 2, 160–166. 10.1159/000232587

  • 9

    Bober M., Mörgelin M., Olin A. I., von Pawel-Rammingen U., Collin M. (2011). The membrane bound LRR lipoprotein Slr, and the cell wall-anchored M1 protein from Streptococcus pyogenes both interact with type I collagen. PLoS One 6:e20345. 10.1371/journal.pone.0020345

  • 10

    Boudko S. P., Bächinger H. P. (2016). Structural insight for chain selection and stagger control in collagen. Sci. Rep. 6:37831. 10.1038/srep37831

  • 11

    Burgeson R. E., Nimni M. E. (1992). Collagen types. Molecular structure and tissue distribution. Clin. Orthop. Relat. Res. 282, 250–272.

  • 12

    Carapetis J. R., Beaton A., Cunningham M. W., Guilherme L., Karthikeyan G., Mayosi B. M., et al. (2016). Acute rheumatic fever and rheumatic heart disease. Nat. Rev. Dis. Primers 2:15084. 10.1038/nrdp.2015.84

  • 13

    Carret G., Emonard H., Fardel G., Druguet M., Herbage D., Flandrois J. P. (1985). Gelatin and collagen binding to Staphylococcus aureus strains. Ann. Inst. Pasteur. Microbiol. (1985) 136a, 241–245.

  • 14

    Casals C., Campanero-Rhodes M. A., García-Fojeda B., Solís D. (2018). The role of collectins and galectins in lung innate immune defense. Front. Immunol. 9:1998. 10.3389/fimmu.2018.01998

  • 15

    Casals C., García-Fojeda B., Minutti C. M. (2019). Soluble defense collagens: sweeping up immune threats. Mol. Immunol. 112, 291–304. 10.1016/j.molimm.2019.06.007

  • 16

    Deivanayagam C. C., Rich R. L., Carson M., Owens R. T., Danthuluri S., Bice T., et al. (2000). Novel fold and assembly of the repetitive B region of the Staphylococcus aureus collagen-binding surface protein. Structure 8, 67–78.

  • 17

    Desvaux M., Dumas E., Chafsey I., Hébraud M. (2006). Protein cell surface display in Gram-positive bacteria: from single protein to macromolecular protein structure. FEMS Microbiol. Lett. 256, 1–15. 10.1111/j.1574-6968.2006.00122.x

  • 18

    Dinkla K., Nitsche-Schmitz D. P., Barroso V., Reissmann S., Johansson H. M., Frick I. M., et al. (2007). Identification of a streptococcal octapeptide motif involved in acute rheumatic fever. J. Biol. Chem. 282, 18686–18693. 10.1074/jbc.M701047200

  • 19

    Dinkla K., Rohde M., Jansen W. T., Carapetis J. R., Chhatwal G. S., Talay S. R. (2003a). Streptococcus pyogenes recruits collagen via surface-bound fibronectin: a novel colonization and immune evasion mechanism. Mol. Microbiol. 47, 861–869. 10.1046/j.1365-2958.2003.03352.x

  • 20

    Dinkla K., Rohde M., Jansen W. T. M., Kaplan E. L., Chhatwal G. S., Talay S. R. (2003b). Rheumatic fever-associated Streptococcus pyogenes isolates aggregate collagen. J. Clin. Invest. 111, 1905–1912. 10.1172/JCI17247

  • 21

    Dinkla K., Talay S. R., Mörgelin M., Graham R. M. A., Rohde M., Nitsche-Schmitz D. P., et al. (2009). Crucial role of the CB3-region of collagen IV in PARF-induced acute rheumatic fever. PLoS One 4:e4666. 10.1371/journal.pone.0004666

  • 22

    Dramsi S., Morello E., Poyart C., Trieu-Cuot P. (2012). Epidemiologically and clinically relevant Group B Streptococcus isolates do not bind collagen but display enhanced binding to human fibrinogen. Microbes Infect. 14, 1044–1048. 10.1016/j.micinf.2012.07.004

  • 23

    Duarte A. S., Correia A., Esteves A. C. (2016). Bacterial collagenases – A review. Crit. Rev. Microbiol. 42, 106–126. 10.3109/1040841X.2014.904270

  • 24

    Eble J. A., Golbik R., Mann K., Kühn K. (1993). The alpha 1 beta 1 integrin recognition site of the basement membrane collagen molecule [alpha 1(IV)]2 alpha 2(IV). Embo J. 12, 4795–4802.

  • 25

    Elasri M. O., Thomas J. R., Skinner R. A., Blevins J. S., Beenken K. E., Nelson C. L., et al. (2002). Staphylococcus aureus collagen adhesin contributes to the pathogenesis of osteomyelitis. Bone 30, 275–280.

  • 26

    Emsley J., Cruz M., Handin R., Liddington R. (1998). Crystal structure of the von Willebrand Factor A1 domain and implications for the binding of platelet glycoprotein Ib. J. Biol. Chem. 273, 10396–10401. 10.1074/jbc.273.17.10396

  • 27

    Emsley J., Knight C. G., Farndale R. W., Barnes M. J., Liddington R. C. (2000). Structural basis of collagen recognition by integrin alpha2beta1. Cell 101, 47–56. 10.1016/s0092-8674(00)80622-4

  • 28

    Fischetti V. A. (1989). Streptococcal M protein: molecular design and biological behavior. Clin. Microbiol. Rev. 2, 285–314. 10.1128/cmr.2.3.285

  • 29

    Fischetti V. A. (2016). M Protein and Other Surface Proteins on Streptococci. Oklahoma: University of Oklahoma Health Sciences Center.

  • 30

    Fischetti V. A. (2019). Surface proteins on gram-positive bacteria. Microbiol. Spectr. 7:GPP3-0012-2018. 10.1128/microbiolspec.GPP3-0012-2018

  • 31

    Fitzgerald J., Holden P., Hansen U. (2013). The expanded collagen VI family: new chains and new questions. Connect Tissue Res. 54, 345–350. 10.3109/03008207.2013.822865

  • 32

    Font B., Eichenberger D., Goldschmidt D., Boutillon M. M., Hulmes D. J. (1998). Structural requirements for fibromodulin binding to collagen and the control of type I collagen fibrillogenesis–critical roles for disulphide bonding and the C-terminal region. Eur. J. Biochem. 254, 580–587. 10.1046/j.1432-1327.1998.2540580.x

  • 33

    Forsgren N., Lamont R. J., Persson K. (2009). Crystal structure of the variable domain of the Streptococcus gordonii surface protein SspB. Protein Sci. 18, 1896–1905. 10.1002/pro.200

  • 34

    Foster, T. J. (2019). The MSCRAMM family of cell-wall-anchored surface proteins of gram-positive cocci. Trends Microbiol. 27, 927–941. doi: 10.1016/j.tim.2019.06.007

  • 35

    Foster, T. J., Geoghegan, J. A., Ganesh, V. K., and Höök, M. (2014). Adhesion, invasion and evasion: the many functions of the surface proteins of Staphylococcus aureus. Nat. Rev. Microbiol. 12, 49–62. doi: 10.1038/nrmicro3161

  • 36

    Frantz, C., Stewart, K. M., and Weaver, V. M. (2010). The extracellular matrix at a glance. J. Cell Sci. 123(Pt 24), 4195–4200. doi: 10.1242/jcs.023820

  • 37

    Fraser, D. A., and Tenner, A. J. (2008). Directing an appropriate immune response: the role of defense collagens and other soluble pattern recognition molecules. Curr. Drug Targets 9, 113–122. doi: 10.2174/138945008783502476

  • 38

    Freires, I. A., Avilés-Reyes, A., Kitten, T., Simpson-Haidaris, P. J., Swartz, M., Knight, P. A., et al. (2017). Heterologous expression of Streptococcus mutans Cnm in Lactococcus lactis promotes intracellular invasion, adhesion to human cardiac tissues and virulence. Virulence 8, 18–29. doi: 10.1080/21505594.2016.1195538

  • 39

    Frost, H. R., Sanderson-Smith, M., Walker, M., Botteaux, A., and Smeesters, P. R. (2017). Group A streptococcal M-like proteins: from pathogenesis to vaccine potential. FEMS Microbiol. Rev. 42, 193–204. doi: 10.1093/femsre/fux057

  • 40

    Gebauer, J. M., Kobbe, B., Paulsson, M., and Wagener, R. (2016). Structure, evolution and expression of collagen XXVIII: lessons from the zebrafish. Matrix Biol. 49, 106–119. doi: 10.1016/j.matbio.2015.07.001

  • 41

    Geraci, J., Neubauer, S., Pöllath, C., Hansen, U., Rizzo, F., Krafft, C., et al. (2017). The Staphylococcus aureus extracellular matrix protein (Emp) has a fibrous structure and binds to different extracellular matrices. Sci. Rep. 7:13665. doi: 10.1038/s41598-017-14168-4

  • 42

    Gonçalves, T. J. M., Boutillon, F., Lefebvre, S., Goffin, V., Iwatsubo, T., Wakabayashi, T., et al. (2019). Collagen XXV promotes myoblast fusion during myogenic differentiation and muscle formation. Sci. Rep. 9:5878. doi: 10.1038/s41598-019-42296-6

  • 43

    Gudmann, N. S., and Karsdal, M. A. (2016). “Type II collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 2, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 13–20.

  • 44

    Hansen, N. U. B., and Karsdal, M. A. (2016). “Type VIII collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 8, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 61–65.

  • 45

    Harrington, D. J. (1996). Bacterial collagenases and collagen-degrading enzymes and their potential role in human disease. Infect. Immun. 64, 1885–1891. doi: 10.1128/iai.64.6.1885-1891.1996

  • 46

    He, Y., and Karsdal, M. A. (2016). “Type IX collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 9, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 67–71.

  • 47

    Henriksen, K., and Karsdal, M. A. (2016). “Type I collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 1, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 1–11.

  • 48

    Herman-Bausier, P., Valotteau, C., Pietrocola, G., Rindi, S., Alsteens, D., Foster, T. J., et al. (2016). Mechanical strength and inhibition of the Staphylococcus aureus collagen-binding protein Cna. mBio 7:e01529-16. doi: 10.1128/mBio.01529-16

  • 49

    Hienz, S. A., Schennings, T., Heimdahl, A., and Flock, J. I. (1996). Collagen binding of Staphylococcus aureus is a virulence factor in experimental endocarditis. J. Infect. Dis. 174, 83–88.

  • 50

    Hilleringmann, M., Giusti, F., Baudner, B. C., Masignani, V., Covacci, A., Rappuoli, R., et al. (2008). Pneumococcal Pili are composed of protofilaments exposing adhesive clusters of Rrg A. PLoS Pathog. 4:e1000026. doi: 10.1371/journal.ppat.1000026

  • 51

    Hohenester, E., and Yurchenco, P. D. (2013). Laminins in basement membrane assembly. Cell Adh. Migr. 7, 56–63. doi: 10.4161/cam.21831

  • 52

    Holderbaum, D., Spech, R. A., and Ehrhart, L. A. (1985). Specific binding of collagen to Staphylococcus aureus. Coll. Relat. Res. 5, 261–271.

  • 53

    Holmes, D. F., Lu, Y., Starborg, T., and Kadler, K. E. (2018). “Collagen fibril assembly and function,” in Current Topics in Developmental Biology, Chap. Three, eds E. S. Litscher and P. M. Wassarman (Cambridge, MA: Academic Press), 107–142.

  • 54

    Huizinga, E. G., van der Plas, R. M., Kroon, J., Sixma, J. J., and Gros, P. (1997). Crystal structure of the A3 domain of human von Willebrand factor: implications for collagen binding. Structure 5, 1147–1156. doi: 10.1016/s0969-2126(97)00266-9

  • 55

    Izore, T., Contreras-Martel, C., El Mortaji, L., Manzano, C., Terrasse, R., Vernet, T., et al. (2010). Structural basis of host cell recognition by the pilus adhesin from Streptococcus pneumoniae. Structure 18, 106–115. doi: 10.1016/j.str.2009.10.019

  • 56

    Izu, Y., Adams, S. M., Connizzo, B. K., Beason, D. P., Soslowsky, L. J., Koch, M., et al. (2020). Collagen XII mediated cellular and extracellular mechanisms regulate establishment of tendon structure and function. Matrix Biol. doi: 10.1016/j.matbio.2020.10.004 [Epub ahead of print].

  • 57

    Kadler, K. E. (2017). Fell muir lecture: collagen fibril formation in vitro and in vivo. Int. J. Exp. Pathol. 98, 4–16. doi: 10.1111/iep.12224

  • 58

    Kadler, K. E., Baldock, C., Bella, J., and Boot-Handford, R. P. (2007). Collagens at a glance. J. Cell Sci. 120:1955. doi: 10.1242/jcs.03453

  • 59

    Kalamajski, S., and Oldberg, Å. (2010). The role of small leucine-rich proteoglycans in collagen fibrillogenesis. Matrix Biol. 29, 248–253. doi: 10.1016/j.matbio.2010.01.001

  • 60

    Kang, M., Ko, Y. P., Liang, X., Ross, C. L., Liu, Q., Murray, B. E., et al. (2013). Collagen-binding microbial surface components recognizing adhesive matrix molecule (MSCRAMM) of Gram-positive bacteria inhibit complement activation via the classical pathway. J. Biol. Chem. 288, 20520–20531. doi: 10.1074/jbc.M113.454462

  • 61

    Kehlet, S. N., and Karsdal, M. A. (2016). “Type XXI collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 21, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 131–133.

  • 62

    Kobe, B., and Kajava, A. V. (2001). The leucine-rich repeat as a protein recognition motif. Curr. Opin. Struct. Biol. 11, 725–732. doi: 10.1016/S0959-440X(01)00266-4

  • 63

    Konto-Ghiorghi, Y., Mairey, E., Mallet, A., Duménil, G., Caliot, E., Trieu-Cuot, P., et al. (2009). Dual role for pilus in adherence to epithelial cells and biofilm formation in Streptococcus agalactiae. PLoS Pathog. 5:e1000422. doi: 10.1371/journal.ppat.1000422

  • 64

    Krishnan, V., Dwivedi, P., Kim, B. J., Samal, A., Macon, K., Ma, X., et al. (2013). Structure of Streptococcus agalactiae tip pilin GBS104: a model for GBS pili assembly and host interactions. Acta Crystallogr. D Biol. Crystallogr. 69(Pt 6), 1073–1089. doi: 10.1107/s0907444913004642

  • 65

    Krumdieck, R., Höök, M., Rosenberg, L. C., and Volanakis, J. E. (1992). The proteoglycan decorin binds C1q and inhibits the activity of the C1 complex. J. Immunol. 149, 3695–3701.

  • 66

    Lancefield, R. C. (1928). The antigenic complex of Streptococcus haemolyticus: I. demonstration of a type-specific substance in extracts of Streptococcus haemolyticus. J. Exp. Med. 47, 91–103. doi: 10.1084/jem.47.1.91

  • 67

    Lannergard, J., Frykberg, L., and Guss, B. (2003). CNE, a collagen-binding protein of Streptococcus equi. FEMS Microbiol. Lett. 222, 69–74.

  • 68

    Larson, M. R., Rajashankar, K. R., Patel, M. H., Robinette, R. A., Crowley, P. J., Michalek, S., et al. (2010). Elongated fibrillar structure of a streptococcal adhesin assembled by the high-affinity association of alpha- and PPII-helices. Proc. Natl. Acad. Sci. U.S.A. 107, 5983–5988. doi: 10.1073/pnas.0912293107

  • 69

    Lee, J. O., Rieu, P., Arnaout, M. A., and Liddington, R. (1995). Crystal structure of the A domain from the alpha subunit of integrin CR3 (CD11b/CD18). Cell 80, 631–638. doi: 10.1016/0092-8674(95)90517-0

  • 70

    Leeming, D. J., and Karsdal, M. A. (2016). “Type V collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 5, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 43–48.

  • 71

    Li, M., Shao, Z.-Q., Guo, Y., Wang, L., Hou, T., Hu, D., et al. (2015). The type II histidine triad protein HtpsC is a novel adhesion with the involvement of Streptococcus suis virulence. Virulence 6, 631–641. doi: 10.1080/21505594.2015.1056971

  • 72

    Liu, Q., Ponnuraj, K., Xu, Y., Ganesh, V. K., Sillanpaa, J., Murray, B. E., et al. (2007). The Enterococcus faecalis MSCRAMM ACE binds its ligand by the Collagen Hug model. J. Biol. Chem. 282, 19629–19637. doi: 10.1074/jbc.M611137200

  • 73

    Lizcano, A., Sanchez, C. J., and Orihuela, C. J. (2012). A role for glycosylated serine-rich repeat proteins in Gram-positive bacterial pathogenesis. Mol. Oral Microbiol. 27, 257–269. doi: 10.1111/j.2041-1014.2012.00653.x

  • 74

    Love, R. M., McMillan, M. D., and Jenkinson, H. F. (1997). Invasion of dentinal tubules by oral streptococci is associated with collagen recognition mediated by the antigen I/II family of polypeptides. Infect. Immun. 65, 5157–5164. doi: 10.1128/iai.65.12.5157-5164.1997

  • 75

    Love, R. M., McMillan, M. D., Park, Y., and Jenkinson, H. F. (2000). Coinvasion of dentinal tubules by Porphyromonas gingivalis and Streptococcus gordonii depends upon binding specificity of streptococcal antigen I/II adhesin. Infect. Immun. 68, 1359–1365. doi: 10.1128/iai.68.3.1359-1365.2000

  • 76

    Luo, Y., Sinkeviciute, D., He, Y., Karsdal, M., Henrotin, Y., Mobasheri, A., et al. (2017). The minor collagens in articular cartilage. Protein Cell 8, 560–572. doi: 10.1007/s13238-017-0377-7

  • 77

    Luo, Y. Y., and Karsdal, M. A. (2016). “Type XI collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 11, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 77–80.

  • 78

    Macheboeuf, P., Buffalo, C., Fu, C.-Y., Zinkernagel, A. S., Cole, J. N., Johnson, J. E., et al. (2011). Streptococcal M1 protein constructs a pathological host fibrinogen network. Nature 472, 64–68. doi: 10.1038/nature09967

  • 79

    Madani, A., Garakani, K., and Mofrad, M. R. K. (2017). Molecular mechanics of Staphylococcus aureus adhesin, CNA, and the inhibition of bacterial adhesion by stretching collagen. PLoS One 12:e0179601. doi: 10.1371/journal.pone.0179601

  • 80

    Mandlik, A., Swierczynski, A., Das, A., and Ton-That, H. (2007). Corynebacterium diphtheriae employs specific minor pilins to target human pharyngeal epithelial cells. Mol. Microbiol. 64, 111–124. doi: 10.1111/j.1365-2958.2007.05630.x

  • 81

    Manon-Jensen, T., and Karsdal, M. A. (2016). “Type XIV collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 14, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 93–95.

  • 82

    Manon-Jensen, T., Kjeld, N. G., and Karsdal, M. A. (2016). Collagen-mediated hemostasis. J. Thromb. Haemost. 14, 438–448. doi: 10.1111/jth.13249

  • 83

    McNamara, C., Zinkernagel, A. S., Macheboeuf, P., Cunningham, M. W., Nizet, V., and Ghosh, P. (2008). Coiled-coil irregularities and instabilities in group A Streptococcus M1 are required for virulence. Science 319:1405. doi: 10.1126/science.1154470

  • 84

    Metzgar, D., and Zampolli, A. (2011). The M protein of group A Streptococcus is a key virulence factor and a clinically relevant strain identification marker. Virulence 2, 402–412. doi: 10.4161/viru.2.5.16342

  • 85

    Miller, J. H., Avilés-Reyes, A., Scott-Anne, K., Gregoire, S., Watson, G. E., Sampson, E., et al. (2015). The collagen binding protein Cnm contributes to oral colonization and cariogenicity of Streptococcus mutans OMZ175. Infect. Immun. 83:2001. doi: 10.1128/IAI.03022-14

  • 86

    Montanaro, L., Arciola, C. R., Baldassarri, L., and Borsetti, E. (1999). Presence and expression of collagen adhesin gene (cna) and slime production in Staphylococcus aureus strains from orthopaedic prosthesis infections. Biomaterials 20, 1945–1949. doi: 10.1016/s0142-9612(99)00099-x

  • 87

    Mortensen, J. H., and Karsdal, M. A. (2016). “Type VII collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 7, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 57–60.

  • 88

    Mortensen, S. A., Sander, B., Jensen, R. K., Pedersen, J. S., Golas, M. M., Jensenius, J. C., et al. (2017). Structure and activation of C1, the complex initiating the classical pathway of the complement cascade. Proc. Natl. Acad. Sci. U.S.A. 114, 986–991.

  • 89

    Moschioni, M., Emolo, C., Biagini, M., Maccari, S., Pansegrau, W., Donati, C., et al. (2010). The two variants of the Streptococcus pneumoniae pilus 1 RrgA adhesin retain the same function and elicit cross-protection in vivo. Infect. Immun. 78, 5033–5042. doi: 10.1128/iai.00601-10

  • 90

    Nallapareddy, S. R., Qin, X., Weinstock, G. M., Höök, M., and Murray, B. E. (2000). Enterococcus faecalis adhesin, ace, mediates attachment to extracellular matrix proteins collagen type IV and laminin as well as collagen type I. Infect. Immun. 68, 5218–5224. doi: 10.1128/iai.68.9.5218-5224.2000

  • 91

    Nallapareddy, S. R., Singh, K. V., and Murray, B. E. (2008). Contribution of the collagen adhesin acm to pathogenesis of Enterococcus faecium in experimental endocarditis. Infect. Immun. 76, 4120–4128. doi: 10.1128/iai.00376-08

  • 92

    Nallapareddy, S. R., Weinstock, G. M., and Murray, B. E. (2003). Clinical isolates of Enterococcus faecium exhibit strain-specific collagen binding mediated by Acm, a new member of the MSCRAMM family. Mol. Microbiol. 47, 1733–1747.

  • 93

    Nielsen, H. V., Guiton, P. S., Kline, K. A., Port, G. C., Pinkner, J. S., Neiers, F., et al. (2012). The metal ion-dependent adhesion site motif of the Enterococcus faecalis EbpA pilin mediates pilus function in catheter-associated urinary tract infection. mBio 3:e00177-12. doi: 10.1128/mBio.00177-12

  • 94

    Nielsen, M. J., and Karsdal, M. A. (2016a). “Type III collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 3, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 21–30.

  • 95

    Nielsen, S. H., and Karsdal, M. A. (2016b). “Type XIX collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 19, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 123–125.

  • 96

    Nitsche, D. P., Johansson, H. M., Frick, I. M., and Mörgelin, M. (2006). Streptococcal protein FOG, a novel matrix adhesin interacting with collagen I in vivo. J. Biol. Chem. 281, 1670–1679. doi: 10.1074/jbc.M506776200

  • 97

    Nomura, R., Naka, S., Nemoto, H., Inagaki, S., Taniguchi, K., Ooshima, T., et al. (2013). Potential involvement of collagen-binding proteins of Streptococcus mutans in infective endocarditis. Oral Dis. 19, 387–393. doi: 10.1111/odi.12016

  • 98

    Nomura, R., Nakano, K., Naka, S., Nemoto, H., Masuda, K., Lapirattanakul, J., et al. (2012). Identification and characterization of a collagen-binding protein, Cbm, in Streptococcus mutans. Mol. Oral Microbiol. 27, 308–323. doi: 10.1111/j.2041-1014.2012.00649.x

  • 99

    Nomura, R., Ogaya, Y., and Nakano, K. (2016). Contribution of the collagen-binding proteins of Streptococcus mutans to bacterial colonization of inflamed dental pulp. PLoS One 11:e0159613. doi: 10.1371/journal.pone.0159613

  • 100

    Nylander, Å., Svensäter, G., Senadheera, D. B., Cvitkovitch, D. G., Davies, J. R., and Persson, K. (2013). Structural and functional analysis of the N-terminal domain of the Streptococcus gordonii adhesin Sgo0707. PLoS One 8:e63768. doi: 10.1371/journal.pone.0063768

  • 101

    Oehmcke, S., Shannon, O., Mörgelin, M., and Herwald, H. (2010). Streptococcal M proteins and their role as virulence determinants. Clin. Chim. Acta 411, 1172–1180. doi: 10.1016/j.cca.2010.04.032

  • 102

    Patti, J. M., Allen, B. L., McGavin, M. J., and Höök, M. (1994a). MSCRAMM-mediated adherence of microorganisms to host tissues. Annu. Rev. Microbiol. 48, 585–617. doi: 10.1146/annurev.mi.48.100194.003101

  • 103

    Patti, J. M., Boles, J. O., and Hook, M. (1993). Identification and biochemical characterization of the ligand binding domain of the collagen adhesin from Staphylococcus aureus. Biochemistry 32, 11428–11435.

  • 104

    Patti, J. M., Bremell, T., Krajewska-Pietrasik, D., Abdelnour, A., Tarkowski, A., Ryden, C., et al. (1994b). The Staphylococcus aureus collagen adhesin is a virulence determinant in experimental septic arthritis. Infect. Immun. 62, 152–161.

  • 105

    Patti, J. M., House-Pompeo, K., Boles, J. O., Garza, N., Gurusiddappa, S., and Hook, M. (1995). Critical residues in the ligand-binding site of the Staphylococcus aureus collagen-binding adhesin (MSCRAMM). J. Biol. Chem. 270, 12005–12011.

  • 106

    Paulsson, M., and Riesbeck, K. (2018). How bacteria hack the matrix and dodge the bullets of immunity. Eur. Respir. Rev. 27:180018. doi: 10.1183/16000617.0018-2018

  • 107

    Phillips, G. N. Jr., Flicker, P. F., Cohen, C., Manjula, B. N., and Fischetti, V. A. (1981). Streptococcal M protein: alpha-helical coiled-coil structure and arrangement on the cell surface. Proc. Natl. Acad. Sci. U.S.A. 78, 4689–4693. doi: 10.1073/pnas.78.8.4689

  • 108

    Plumptre, C. D., Ogunniyi, A. D., and Paton, J. C. (2012). Polyhistidine triad proteins of pathogenic streptococci. Trends Microbiol. 20, 485–493. doi: 10.1016/j.tim.2012.06.004

  • 109

    PrabhuDas, M. R., Baldwin, C. L., Bollyky, P. L., Bowdish, D. M. E., Drickamer, K., Febbraio, M., et al. (2017). A consensus definitive classification of scavenger receptors and their roles in health and disease. J. Immunol. 198, 3775–3789. doi: 10.4049/jimmunol.1700373

  • 110

    Pyagay, P., Heroult, M., Wang, Q., Lehnert, W., Belden, J., Liaw, L., et al. (2005). Collagen triple helix repeat containing 1, a novel secreted protein in injured and diseased arteries, inhibits collagen expression and promotes cell migration. Circ. Res. 96, 261–268. doi: 10.1161/01.RES.0000154262.07264.12

  • 111

    Reissmann, S., Gillen, C. M., Fulde, M., Bergmann, R., Nerlich, A., Rajkumari, R., et al. (2012). Region specific and worldwide distribution of collagen-binding M proteins with PARF motifs among human pathogenic streptococcal isolates. PLoS One 7:e30122. doi: 10.1371/journal.pone.0030122

  • 112

    Rhem, M. N., Lech, E. M., Patti, J. M., McDevitt, D., Höök, M., Jones, D. B., et al. (2000). The collagen-binding adhesin is a virulence factor in Staphylococcus aureus keratitis. Infect. Immun. 68, 3776–3779. doi: 10.1128/iai.68.6.3776-3779.2000

  • 113

    Ricard-Blum, S. (2011). The collagen family. Cold Spring Harb. Perspect. Biol. 3:a004978. doi: 10.1101/cshperspect.a004978

  • 114

    Rich, R. L., Kreikemeyer, B., Owens, R. T., LaBrenz, S., Narayana, S. V., Weinstock, G. M., et al. (1999). Ace is a collagen-binding MSCRAMM from Enterococcus faecalis. J. Biol. Chem. 274, 26939–26945.

  • 115

    Ross, C. L., Liang, X., Liu, Q., Murray, B. E., Hook, M., and Ganesh, V. K. (2012). Targeted protein engineering provides insights into binding mechanism and affinities of bacterial collagen adhesins. J. Biol. Chem. 287, 34856–34865. doi: 10.1074/jbc.M112.371054

  • 116

    Sand, J. M. B., Genovese, F., and Karsdal, M. A. (2016). “Type IV collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 4, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 31–41.

  • 117

    Sand, J. M. B., and Karsdal, M. A. (2016). “Type XVI collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 16, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 101–106.

  • 118

    Sato, K., Yomogida, K., Wada, T., Yorihuzi, T., Nishimune, Y., Hosokawa, N., et al. (2002). Type XXVI collagen, a new member of the collagen family, is specifically expressed in the testis and ovary. J. Biol. Chem. 277, 37678–37684. doi: 10.1074/jbc.M205347200

  • 119

    Sato, Y., Okamoto, K., Kagami, A., Yamamoto, Y., Igarashi, T., and Kizaki, H. (2004). Streptococcus mutans strains harboring collagen-binding adhesin. J. Dent. Res. 83, 534–539. doi: 10.1177/154405910408300705

  • 120

    Schönborn, K., Willenborg, S., Schulz, J. N., Imhof, T., Eming, S. A., Quondamatteo, F., et al. (2020). Role of collagen XII in skin homeostasis and repair. Matrix Biol. 94, 57–76. doi: 10.1016/j.matbio.2020.08.002

  • 121

    Schönherr, E., Hausser, H., Beavan, L., and Kresse, H. (1995). Decorin-type I collagen interaction. Presence of separate core protein-binding domains. J. Biol. Chem. 270, 8877–8883. doi: 10.1074/jbc.270.15.8877

  • 122

    Shen, G. (2005). The role of type X collagen in facilitating and regulating endochondral ossification of articular cartilage. Orthod Craniofac. Res. 8, 11–17. doi: 10.1111/j.1601-6343.2004.00308.x

  • 123

    Shimoji, Y., Ogawa, Y., Osaki, M., Kabeya, H., Maruyama, S., Mikami, T., et al. (2003). Adhesive surface proteins of Erysipelothrix rhusiopathiae bind to polystyrene, fibronectin, and type I and IV collagens. J. Bacteriol. 185, 2739–2748.

  • 124

    Shoulders, M. D., and Raines, R. T. (2009). Collagen structure and stability. Annu. Rev. Biochem. 78, 929–958. doi: 10.1146/annurev.biochem.77.032207.120833

  • 125

    Sillanpää, J., Nallapareddy, S. R., Qin, X., Singh, K. V., Muzny, D. M., Kovar, C. L., et al. (2009). A collagen-binding adhesin, Acb, and ten other putative MSCRAMM and pilus family proteins of Streptococcus gallolyticus subsp. gallolyticus (Streptococcus bovis Group, biotype I). J. Bacteriol. 191, 6643–6653. doi: 10.1128/jb.00909-09

  • 126

    Singh, B., Fleury, C., Jalalvand, F., and Riesbeck, K. (2012). Human pathogens utilize host extracellular matrix proteins laminin and collagen for adhesion and invasion of the host. FEMS Microbiol. Rev. 36, 1122–1180. doi: 10.1111/j.1574-6976.2012.00340.x

  • 127

    Singh, K. V., Nallapareddy, S. R., Sillanpää, J., and Murray, B. E. (2010). Importance of the collagen adhesin ace in pathogenesis and protection against Enterococcus faecalis experimental endocarditis. PLoS Pathog. 6:e1000716. doi: 10.1371/journal.ppat.1000716

  • 128

    Smeesters, P. R., McMillan, D. J., and Sriprakash, K. S. (2010). The streptococcal M protein: a highly versatile molecule. Trends Microbiol. 18, 275–282. doi: 10.1016/j.tim.2010.02.007

  • 129

    Speziale, P., Raucci, G., Visai, L., Switalski, L. M., Timpl, R., and Hook, M. (1986). Binding of collagen to Staphylococcus aureus Cowan 1. J. Bacteriol. 167, 77–81.

  • 130

    Spivey, K. A., Chung, I., Banyard, J., Adini, I., Feldman, H. A., and Zetter, B. R. (2012). A role for collagen XXIII in cancer cell adhesion, anchorage-independence and metastasis. Oncogene 31, 2362–2372. doi: 10.1038/onc.2011.406

  • 131

    Stewart, C. M., Buffalo, C. Z., Valderrama, J. A., Henningham, A., Cole, J. N., Nizet, V., et al. (2016). Coiled-coil destabilizing residues in the group A Streptococcus M1 protein are required for functional interaction. Proc. Natl. Acad. Sci. U.S.A. 113, 9515–9520.

  • 132

    Sun, S., and Karsdal, M. A. (2016a). “Type VI collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 6, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 49–55.

  • 133

    Sun, S., and Karsdal, M. A. (2016b). “Type XVII collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 17, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 107–111.

  • 134

    Sundaramoorthy, M., Meiyappan, M., Todd, P., and Hudson, B. G. (2002). Crystal structure of NC1 domains. Structural basis for type IV collagen assembly in basement membranes. J. Biol. Chem. 277, 31142–31153. doi: 10.1074/jbc.M201740200

  • 135

    Taglialegna, A., Matilla-Cuenca, L., Dorado-Morales, P., Navarro, S., Ventura, S., Garnett, J. A., et al. (2020). The biofilm-associated surface protein Esp of Enterococcus faecalis forms amyloid-like fibers. NPJ Biofilms Microbiomes 6:15. doi: 10.1038/s41522-020-0125-2

  • 136

    Tanaka, T., Wakabayashi, T., Oizumi, H., Nishio, S., Sato, T., Harada, A., et al. (2014). CLAC-P/Collagen Type XXV is required for the intramuscular innervation of motoneurons during neuromuscular development. J. Neurosci. 34:1370. doi: 10.1523/JNEUROSCI.2440-13.2014

  • 137

    Tandon, R., Sharma, M., Chandrashekhar, Y., Kotb, M., Yacoub, M. H., and Narula, J. (2013). Revisiting the pathogenesis of rheumatic fever and carditis. Nat. Rev. Cardiol. 10, 171–177. doi: 10.1038/nrcardio.2012.197

  • 138

    Theocharis, A. D., Skandalis, S. S., Gialeli, C., and Karamanos, N. K. (2016). Extracellular matrix structure. Adv. Drug Deliv. Rev. 97, 4–27. doi: 10.1016/j.addr.2015.11.001

  • 139

    Tom Tang, Y., Hu, T., Arterburn, M., Boyle, B., Bright, J. M., Palencia, S., et al. (2005). The complete complement of C1q-domain-containing proteins in Homo sapiens. Genomics 86, 100–111. doi: 10.1016/j.ygeno.2005.03.001

  • 140

    Ton, Q. V., Leino, D., Mowery, S. A., Bredemeier, N. O., Lafontant, P. J., Lubert, A., et al. (2018). Collagen COL22A1 maintains vascular stability and mutations in COL22A1 are potentially associated with intracranial aneurysms. Dis. Models Mech. 11:dmm033654. doi: 10.1242/dmm.033654

  • 141

    Tonomura, S., Ihara, M., Kawano, T., Tanaka, T., Okuno, Y., Saito, S., et al. (2016). Intracerebral hemorrhage and deep microbleeds associated with cnm-positive Streptococcus mutans; a hospital cohort study. Sci. Rep. 6:20074. doi: 10.1038/srep20074

  • 142

    Ton-That, H., and Schneewind, O. (2004). Assembly of pili in Gram-positive bacteria. Trends Microbiol. 12, 228–234. doi: 10.1016/j.tim.2004.03.004

  • 143

    Vaca, D. J., Thibau, A., Schütz, M., Kraiczy, P., Happonen, L., Malmström, J., et al. (2020). Interaction with the host: the role of fibronectin and extracellular matrix proteins in the adhesion of Gram-negative bacteria. Med. Microbiol. Immunol. 209, 277–299. doi: 10.1007/s00430-019-00644-3

  • 144

    van Wieringen, T., Kalamajski, S., Lidén, Å., Bihan, D., Guss, B., Heinegård, D., et al. (2010). The streptococcal collagen-binding protein CNE specifically interferes with αVβ3-mediated cellular interactions with triple helical collagen. J. Biol. Chem. 285, 35803–35813. doi: 10.1074/jbc.M110.146001

  • 145

    Veit, G., Zwolanek, D., Eckes, B., Niland, S., Käpylä, J., Zweers, M. C., et al. (2011). Collagen XXIII, novel ligand for integrin alpha2beta1 in the epidermis. J. Biol. Chem. 286, 27804–27813. doi: 10.1074/jbc.m111.220046

  • 146

    Waldemarsson, J., Areschoug, T., Lindahl, G., and Johnsson, E. (2006). The streptococcal Blr and Slr proteins define a family of surface proteins with leucine-rich repeats: camouflaging by other surface structures. J. Bacteriol. 188, 378–388. doi: 10.1128/jb.188.2.378-388.2006

  • 147

    Wang, W., Olson, D., Liang, G., Franceschi, R. T., Li, C., Wang, B., et al. (2012). Collagen XXIV (Col24α1) promotes osteoblastic differentiation and mineralization through TGF-β/Smads signaling pathway. Int. J. Biol. Sci. 8, 1310–1322. doi: 10.7150/ijbs.5136

  • 148

    Willumsen, N., and Karsdal, M. A. (2016). “Type XX collagen,” in Biochemistry of Collagens, Laminins and Elastin, Chap. 20, ed. M. A. Karsdal (Cambridge, MA: Academic Press), 127–129.

  • 149

    Xu, Y., Liang, X., Chen, Y., Koehler, T. M., and Hook, M. (2004a). Identification and biochemical characterization of two novel collagen binding MSCRAMMs of Bacillus anthracis. J. Biol. Chem. 279, 51760–51768. doi: 10.1074/jbc.M406417200

  • 150

    Xu, Y., Rivas, J. M., Brown, E. L., Liang, X., and Hook, M. (2004b). Virulence potential of the staphylococcal adhesin CNA in experimental arthritis is determined by its affinity for collagen. J. Infect. Dis. 189, 2323–2333. doi: 10.1086/420851

  • 151

    Yu, Z., An, B., Ramshaw, J. A., and Brodsky, B. (2014). Bacterial collagen-like proteins that form triple-helical structures. J. Struct. Biol. 186, 451–461. doi: 10.1016/j.jsb.2014.01.003

  • 152

    Zainul, Z., Heikkinen, A., Koivisto, H., Rautalahti, I., Kallio, M., Lin, S., et al. (2018). Collagen XIII is required for neuromuscular synapse regeneration and functional recovery after peripheral nerve injury. J. Neurosci. 38, 4243–4258.

  • 153

    Zani, I. A., Stephen, S. L., Mughal, N. A., Russell, D., Homer-Vanniasinkam, S., Wheatcroft, S. B., et al. (2015). Scavenger receptor structure and function in health and disease. Cells 4, 178–201. doi: 10.3390/cells4020178

  • 154

    Zhang, Y.-Z., Ran, L.-Y., Li, C.-Y., and Chen, X.-L. (2015). Diversity, structures, and collagen-degrading mechanisms of bacterial collagenolytic proteases. Appl. Environ. Microbiol. 81, 6098–6107. doi: 10.1128/aem.00883-15

  • 155

    Zong, Y., Xu, Y., Liang, X., Keene, D. R., Hook, A., Gurusiddappa, S., et al. (2005). A ‘Collagen Hug’ model for Staphylococcus aureus CNA binding to collagen. Embo J. 24, 4224–4236. doi: 10.1038/sj.emboj.7600888

Summary

Keywords

Gram-positive bacteria, collagen binding proteins, collagen-like proteins, surface proteins, collagen

Citation

Arora S, Gordon J and Hook M (2021) Collagen Binding Proteins of Gram-Positive Pathogens. Front. Microbiol. 12:628798. doi: 10.3389/fmicb.2021.628798

Received

12 November 2020

Accepted

11 January 2021

Published

05 February 2021

Volume

12 - 2021

Edited by

Mattias Collin, Lund University, Sweden

Reviewed by

Minh-Thu Nguyen, University Hospital Münster, Germany; Jack Christopher Leo, Nottingham Trent University, United Kingdom

Copyright

*Correspondence: Magnus Hook, Srishtee Arora

This article was submitted to Infectious Diseases, a section of the journal Frontiers in Microbiology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
