Discovery of Natural Products as Novel and Potent FXR Antagonists by Virtual Screening

Farnesoid X receptor (FXR) is a member of nuclear receptor family involved in multiple physiological processes through regulating specific target genes. The critical role of FXR as a transcriptional regulator makes it a promising target for diverse diseases, especially those related to metabolic disorders such as diabetes and cholestasis. However, the underlying activation mechanism of FXR is still a blur owing to the absence of proper FXR modulators. To identify potential FXR modulators, an in-house natural product database (NPD) containing over 4,000 compounds was screened by structure-based virtual screening strategy and subsequent hit-based similarity searching method. After the yeast two-hybrid (Y2H) assay, six natural products were identified as FXR antagonists which blocked the CDCA-induced SRC-1 association. The IC50 values of compounds 2a, a diterpene bearing polycyclic skeleton, and 3a, named daphneone with chain scaffold, are as low as 1.29 and 1.79 μM, respectively. Compared to the control compound guggulsterone (IC50 = 6.47 μM), compounds 2a and 3a displayed 5- and 3-fold higher antagonistic activities against FXR, respectively. Remarkably, the two representative compounds shared low topological similarities with other reported FXR antagonists. According to the putative binding poses, the molecular basis of these antagonists against FXR was also elucidated in this report.


INTRODUCTION
The farnesoid X receptor (FXR, NR1H4), a member of the metabolic nuclear receptor superfamily, regulates the expressions and activities of a broad spectrum of genes. Since 1995 when FXR was isolated from a rat cDNA library for the first time (Forman et al., 1995), the studies on the physiological functions of FXR have been appealing and challenging. FXR is conserved from teleost fish to human beings (Maglich et al., 2003) and is abundantly expressed in liver, intestine, and kidney. As the endogenous receptor of bile acids, FXR can be activated by chenodeoxycholic acid (CDCA), lithocholic acid (LCA), deoxycholic acid (DCA), and many other bile acids (Makishima et al., 1999). In addition, FXR is also reported to exert regulatory roles in lipoprotein and glucose homeostasis, fatty acid and triglyceride synthesis, liver regeneration, and bacterial growth in the intestine (Lee et al., 2006;Wang et al., 2008). All these accumulating data make FXR a promising pharmaceutical target for multiple diseases, especially those related to metabolic disorders such as diabetes and cholestasis (Schaap et al., 2014;Gonzalez et al., 2016;Yuan and Li, 2016;Filho et al., 2017).
As a typical nuclear receptor, FXR shares common structural characteristics with other members of this superfamily, which comprises a highly conserved DNA-binding domain (DBD), a moderately conserved ligand-binding domain (LBD), and a ligand-dependent transcriptional activation domain (AF-2) .Upon the binding of proper ligand to the LBD, FXR will undergo a conformational change, which is critical to determine whether a coactivator or a corepressor binds efficiently to the AF-2 motif. If activated by appropriate agonists, the recruitment of coactivators (such as SRC-1, DRIP, and PRMT) to FXR occurs, which further up-or down-regulates the expressions of certain target genes. While for antagonists, the association of FXR with activators will be hindered (Lew et al., 2004). Although it is widely accepted that FXR participates in many biological processes, owing to the diversity and complexity of target genes involved in the FXR signaling pathways (Zhang and Edwards, 2008), the physiological functions of FXR haven't been clearly defined. Therefore, it is still an essential step to identify potential FXR modulators, which may contribute to the elucidation of physiological effects of FXR and provide novel opportunities for the treatment of metabolic diseases by targeting FXR.
Apart from the natural bile acid ligands with steroidal skeleton, over 700 structurally diverse FXR modulators have also been identified (Gaulton et al., 2017), most of which function as agonists (Carotti et al., 2014). Obeticholic acid (6αethyl-chenodeoxycholic acid, 6-ECDCA), a semi-synthetic bile acid analog with highly potent FXR agonistic activity (EC 50 = 0.099 µM) (Pellicciari et al., 2002), is the first FDA-approved drug that is used for treating primary biliary cholangitis (PBC) (Nevens et al., 2016). In contrast, the development of FXR antagonists, which are also useful chemical tools to unravel the physiological roles and relative clinical significance of FXR (Li et al., 2013), does not seem to be satisfactory due to the scanty number of potent FXR antagonists that have been reported so far. Guggulsterone, a natural product extracted from the resin of the guggul tree, is the most described FXR antagonist, with the ability of blocking the agonist-induced coactivator recruitment and decreasing the hepatic cholesterol in wild-type mice (Urizar et al., 2002). However, the researches on guggulsterone are still confined to preclinical and academic studies because of the complexity of its mechanism of action (Fiorucci et al., 2010;Yamada and Sugimoto, 2016). Although other natural or synthetic FXR antagonists have also been developed (Figure 1; Wu et al., 2002;Dussault et al., 2003;Nam et al., 2007;Choi et al., 2011;Huang et al., 2012;Xu et al., 2015), further pharmaceutically relevant activities were rarely reported. Herein, six natural products were identified as antagonists from an inhouse natural product database (NPD) through virtual screening strategy and subsequent biological experiment validation. In the yeast two-hybrid (Y2H) assay (Fields and Sternglanz, 1994;Lin and Lai, 2017), these compounds could abolish CDCAinduced FXR activation at micromolar level. We hope the natural products revealed in this study will offer novel scaffolds for uncovering new FXR regulatory mechanism and provide insights into potential development for further discovery of FXR modulators.

Protein Preparation
The crystal structures of FXR-LBD in complex with 6-ECDCA (Mi et al., 2003) (PDB code 1OSV, a dimer and chain B was used) and fexaramine (Downes et al., 2003) (PDB code 1OSH, monomer) were obtained from the Protein Data Bank. The synthetically modified bile acid ligand 6-ECDCA was derived from the structure of CDCA, but showed almost 100-fold more potent FXR agonistic activity than CDCA and did not activate other nuclear receptors. Fexaramine is a synthetic nonsteroidal FXR agonist, which was identified by optimization of a benzopyrane-based combinatorial derived library. The coactivators and all the water molecules were removed. Hydrogen atoms and charges were added during a brief relaxation performed using the "Protein Preparation Wizard" workflow in Maestro 10.1. After the hydrogen bond network was optimized, the crystal structure was minimized until the root-mean-square deviation (RMSD) between the minimized structure and the starting structure reached 0.3 Å with OPLS_2005 force field.

Glide Docking
The grid-enclosing box was placed on the centroid of the crystallographic ligand in the optimized protein structure and defined to enclose residues located within 15.0 Å of the binding pocket. A scaling factor of 0.8 was set to van der Waals (VDW) radii of those receptor atoms with partial atomic charges of less than 0.15 to soften the nonpolar parts of the receptor. After addition of hydrogen atoms and ionization at a pH range of 5.0-9.0, the three-dimensional structures of compounds in the NPD were generated with Ligprep v3.3 module. Standard precision (SP) and extra precision (XP) approaches of Glide Halgren et al., 2004) were respectively adopted to dock the molecules into the binding site with the default parameters, and only the top one pose for each molecule were retained. After parallel Glide SP scorings using two different protein structures (PDB codes 1OSV and 1OSH), the top 500 docking poses were reserved for each docking calculation and subjected to XP calculation with a more precise scoring function, and the top 200 docking poses were retained, respectively, for further visual observation.

Hit-Based Similarity Searching
The similarity searching process was accomplished in Pipeline Pilot v7.5, and two of the most potent FXR antagonists 2a and 3a were used as query molecules, respectively. The Tanimoto coefficient (Tc) of similarity between the query molecule and the target molecule was calculated using SciTegic functional connectivity fingerprints of radius 4 (FCFP_4) (Bender et al., 2009). The minimum Tc was set to a low value of 0.3, to maximize the number of obtained analogs.

Materials
The restriction and modification enzymes in this work were obtained from New England Biolabs (Beijing, China). P-nitrophenyl α-D-galactopyranoside (PNP-α-Gal), yeast nitrogen base without amino acids, agar, PEG3350, dimethyl sulfoxide (DMSO), lithium acetate, and glucose were all purchased from Sigma (Shanghai, China). The yeast expression plasmids pGADT7 and pGBKT7 were from Clontech (Palo Alto, CA), and CDCA was from Merck. The dropout supplement free from leucine and tryptophan (-Leu/-Trp DO supplement) was bought from Takara, and Salmon Sperm DNA was obtained from invitrogen. The yeast strain AH109 was purchased from Clontech (Palo Alto, CA).

Y2H System Construction
We constructed the Y2H for FXR by yeast co-transformation with pGBKT7-FXR LBD (BD) and pGADT7-SRC-1 (AD) according to the lithium acetate method. Briefly, 500 ng of BD and AD were added to 50 µL of the yeast competent cells and mixed with 36 µL of lithium acetate, 240 µL of 50% PEG3350, and 50 ng single-strain DNA at 30 • C for 30 min, followed by heat-shock (250 rpm) at 42 • C for 30 min. The mixture was subsequently spread on a drop-out-agar plate without leucine and tryptophan (6.7 g/L yeast nitrogen base without amino acids, 1.54 g/L -Leu/-Trp DO supplement, 20 g/L glucose, 20 g/L agar). The plates were incubated at 30 • C for 48 h for yeast growth and the PCR method was used to confirm the successful transformation.

Y2H Assay
We performed Y2H assay to determine the agonistic or antagonistic activities of the compounds. Yeast transformations were incubated with either a control vehicle (DMSO) or the indicated compounds for 24 h in an hFXR agonist testing, and in antagonist assays treated with tested compounds plus 10 µM CDCA. The quantitative α-galactosidase activity assays were carried out by using PNP-α-Gal as the substrate according to the Clontech manual. Each experiment was repeated three times independently.
1. Vortex the overnight culture tube for 0.5-1 min to disperse cell clumps and then each sample needs 200 µL to record the exact OD 600 . 2. Centrifuge the tube at 14,000 rpm (10,000 × g) for 30 s.
Incubate 16 µL cell culture medium supernatant with 48 µL PNP-α-Gal at 30 • C for 60 min. Be sure to cover microtiter plates with a lid or parafilm to prevent evaporation. 3. Terminate the reaction by adding 136 µL of 1 µM sodium carbonate. Record the optical density of each sample at 410 nm.
The α-galactosidase activity was calculated according to the following formula: where t is the elapsed time of incubation (min), V f is the final volume of assay (200 µL), V i is the volume of culture medium supernatant added (16 µL), OD 600 is the optical density of overnight culture, and ε×b is the p-nitrophenol molar absorptivity at 410 nm×the light path (cm) = 10.5 mL/µmol (Yeast Protocols Handbook PT3024-1, Clontech).
The agonistic activation and inhibition rates (%) were calculated as follows: where GA indicates α-galactosidase activity.

RESULTS AND DISCUSSIONS Protein Structure Selection and Redocking Validation
The binding of proper modulators to LBD is the molecular basis for FXR activation that triggers the conformational change of FXR, the subsequent coactivators association, and the final target genes regulation. Previous studies have shown that FXR can be activated by structurally diverse agonists, and an array of crystal structures of FXR-LBD complexed with agonists have been solved. The agonist-binding pocket is positioned in the interior of the LBD, and agonists with different skeletons display enormously distinct binding features (Maloney et al., 2000;Soisson et al., 2008;Flatt et al., 2009;Jin et al., 2013). Computational studies, along with crystallographic experiments (Xu et al., 2015), support the notion that the agonist-binding pocket can also be occupied by antagonists (Meyer et al., 2005). Given the variability of the binding pockets occupied by distinct modulators, it is necessary to use different FXR crystallographic models in the virtual screening process, in order to maximize the diversity of hit compounds. At the time when we initiated this study, there were in total 27 crystal structures of FXR in complex with different modulators in the Protein Data Bank. After deleting the structures bound with structurally analogous ligands and those without published biological data, 10 unique protein structures were reserved for further analysis.
Protein flexibility has vital influence on ligand recognition, and even subtle protein conformational changes can significantly affect the results of docking simulations (Kitchen et al., 2004). Nevertheless, the receptor is usually held rigid for most docking procedures, including Glide used in this study, to speed up virtual screening of large databases. To compensate for the limitations of rigid protein conformation, as well as to simply the computational simulations, we decided to select two receptors representing dramatically dissimilar conformations to the other reported crystal structures to perform two independent docking calculations. Both the binding site similarity and ligand similarity profiles were taken into consideration for docking model selection. On the one hand, pairwise binding pocket similarities among the 10 structures were calculated using our in-house program PocketShape, which is designed for computational evaluation of the binding site similarity based on pocket shape and property and could be accessed through the webserver SiteMapper (http://lilab.ecust.edu.cn/). Residues within 5Å distance around the ligand were extracted as the binding site for each structure. Typically, two binding sites with a score value over than 0.8 are considered similar. On the other hand, pairwise molecular similarities among the 10 crystallographic ligands were calculated with SciTegic FCFP_4 fingerprints in Pipeline Pilot v7.5, and a Tc cutoff value of 0.6 was set to define similarity.
The binding pocket and ligand similarity values of the 10 unique crystal structures were plotted in Figure S1. Meanwhile, for each model, the average pocket and ligand similarity values were calculated, respectively, to evaluate its uniqueness. Four crystal structures 1OSH, 1OSV, 3OLF, and 4WVD were considered, as all of them have top five minimum average values ranked by either pocket or ligand similarity calculations. Although both the pocket and ligand similarity values of 4WVD scatter in a low range, its ligand ivermectin is a large macrocyclic lactone (Jin et al., 2013), making the ligand binding pocket expand to a volume of 1081 Å 3 (Dundas et al., 2006), which is likely to cause artificial enrichment of molecules with large sizes and high molecular weights in docking procedure (Verdonk et al., 2004). Therefore, we did not choose 4WVD. The Tc values of the FXR agonist 6-ECDCA in 1OSV to other ligands are relatively low with predominant scattering below 0.2, and its binding pocket also reveals uniqueness with the average similarity value of 0.62. Moreover, 6-ECDCA is the only FDAapproved FXR modulator so far. Therefore, the 1OSV model was first selected for docking simulation (Mi et al., 2003). After thoroughly analyzing the binding interactions, the biological activities against FXR, as well as the X-ray crystal parameters, the second model of 1OSH was preferentially reserved, the ligand of which has a lower Tc value to 6-ECDCA than that of 3OLF (0.13 vs. 0.15).
Alignment of the 6-ECDCA and fexaramine binding sites exhibited substantial differences in both shape and surrounded residues ( Figure S2A). And the tremendous dissimilarities of the two ligands in terms of topological structures and induced binding conformations render them extend to non-overlapping space. To evaluate the docking accuracy of Glide, the two cocrystallized ligands were redocked into the active pocket using Glide XP scoring for each structure. Superimposition of the best redocked poses and the experimental structures ( Figures S2B,C) gave the RMSD values of 0.53 Å for 6-ECDCA and 0.65 Å for fexaramine, indicating the robustness of Glide in accurately reproducing the bioactive conformations of the ligands for our two docking models. Cross-docking calculations were also carried out where each crystal structure ligand was docked into the binding pocket of the other. The ligand 6-ECDCA could be docked into 1OSH by Glide XP mode, but the predicted binding pose deviated greatly from the crystallized bioactive conformation ( Figure S2D). In agreement with the enormous differences of the two binding sites, fexaramine could not be accommodated by the smaller 6-ECDCA-binding site of 1OSV, hence no proposed docking pose was obtained.

Virtual Screening
To search for potent FXR modulators, structure-based virtual screening strategy, an effective method to identify novel ligands FIGURE 2 | Schematic representation of the whole strategy adopted in this study to identify potential FXR modulators.
based on predicted binding poses and docking scores, was executed using Glide v6.6 (Maestro v10.1, Schrödinger Inc.). And we speculated that if the docking pose of a certain compound to the agonist-binding pocket was computationally favorable, it could be an effective FXR modulator, either an agonist or an antagonist. The screened natural products database is a collection of over 4000 natural products isolated from about 100 plants, the structures of which have been validated by our researchers. After hierarchical virtual screenings independently implemented with Glide (Figure 2) by using two crystal structures of the receptor, a total of 400 top-ranking compounds were retrieved as candidates from the NPD. These candidates were then subjected to visual inspection to remove those that are likely to be nonbinders. With consideration of key interactions observed from the crystal structures, such as predominant VDW complementarity and some critical hydrogen bonds with polar residues, each docked pose of these candidates was carefully checked to delete the inappropriate compounds. Meanwhile, a specific focus was put on the sizes of the candidates, and those compounds with relatively large groups protruding out of the binding pocket were not considered. Additionally, the compounds with the same scaffold were reserved with a maximum of three to maximize the structural diversity. A total of 30 candidates were finally selected for further bioactivity assay. In the coactivatorrecruitment assay based on the Y2H system, none of the 30 compounds could enhance the association of SRC-1 to FXR-LBD, thus no agonist was found. Intriguingly, four compounds (1a, 2a, 3a, 3c) strongly inhibited the CDCA-induced SRC-1 recruitment with the inhibition rate higher than 50% in the concentration of 25 µM, displaying apparent FXR antagonistic profiles. The IC 50 values of the four compounds were determined (Table 1, Figure S4), and guggulsterone was used as the reference compound.
Starting from the potent FXR antagonists 2a and 3a as hit compounds, the in-house NPD was re-screened using similarity searching method, to obtain more potent derivatives as well as to establish underlying structure-activity relationships (SAR). A prior minimum Tc value of 0.6 was first set to retrieve analogous compounds. Unfortunately, only several derivatives were obtained for each hit compound, probably owing to the heterogeneity of the in-house NPD. We, thus, chose a relatively low threshold of 0.3 to maximize the number of obtained analogs. Subsequently, the analogs were also manually checked, and only compounds possessing the same skeleton to the hit compound and with proper sizes were selected. After further in vitro Y2H assay, two compounds 3d and 3e with the Tc values of 0.4 and 0.41, respectively, to compound 3a, were found to display moderate antagonistic activities against FXR. As illustrated in Table 1, compounds 3d and 3e share the phenol moiety with compound 3a, and their IC 50 values were 14.1 and 19.3 µM, respectively.
To evaluate the performance of the virtual screening strategy adopted in this study, the rankings and docking scores of the newly identified natural products were retrospectively examined. The distribution of docking scores of the top ranking candidates reserved from Glide calculations were presented in Figure S3. The natural compounds bearing different scaffolds, including 1c, 2a, 2b, 3b, and 3c, could be ranked in the top 500 candidates during the Glide SP docking process using both crystal structures 1OSV and 1OSH, whereas the Glide XP results are totally different. As shown in Table 2, compounds containing similar chemical skeleton to the crystallographic ligand tend to score higher. Moreover, none of the identified FXR antagonists could be simultaneously ranked in the top 200 candidates by the two Glide XP calculations, confirming the rationality of using two distinct crystal structures for structure-based virtual screening. Despite of the confirmed moderate antagonistic effects against FXR, compounds 3d and 3e with smaller sizes were excluded after the initial Glide SP screening, which may be ascribed to the recognized bias of structure-based virtual screening method toward high molecular weight compounds (Pan et al., 2003). The results also demonstrate that 2D molecular similarity search method is a powerful and complementary approach to structurebased virtual screening, which could retrieve biologically active compounds that are regarded as false-negatives by docking simulations.
Compounds that were ruled out by structure-based virtual screening process but recovered using similarity searching method are labeled with *.

Structural Novelty Assessment
The six natural products were first reported to show FXR antagonistic activity. To evaluate their structural novelty with respect to known FXR antagonists, the pairwise Tc values of chemical similarity were calculated based on the FCFP_4 fingerprints. The Tc value between the similar compounds 3a and 3c is 0.67, a Tc cutoff value of 0.6 was therefore set to define similarity. 15 structurally diverse FXR antagonists including seven natural products (compounds 4-21, Table S1) were collected from literatures, among which the maximum Tc value is 0.46. As shown in Figure 3, all the Tc values of the six newly identified hits to the known 15 FXR antagonists were below 0.4, and the maximum Tc values of compounds 1a, 2a, and 3a were 0.38, 0.25, and 0.31 (Table S2), respectively. Accordingly, the six natural products could be considered to be structurally novel as FXR antagonists.

Analysis of Predicted Binding Poses
From the structural point of view, the six antagonists can be simply categorized into two classes: terpenes possessing polycyclic skeletons and phenols with chain scaffolds. In order to better delineate SAR, their inactive analogs were also displayed and analyzed in this study.

Terpenes
Compounds 1a and 2a belong to triterpenes and diterpenes, respectively (Yang et al., 2010), and both of them are isolated from Abies georgei which grows exclusively in China. In a previous study, compound 1a was reported to have moderate agonistic effect against estrogen receptor (ER). The polycyclic ring skeletons of the two compounds are much similar to that of the bile acids, especially for the tetracyclic triterpene compound 1a. However, the Tc values between compounds 1a and 2a and 6-ECDCA are as low as 0.36 and 0.28, respectively. Intriguingly, despite the opposite activities against FXR, the proposed binding poses of the two compounds closely resemble that of 6-ECDCA when interacting with FXR (Figure 4). Similar to 6-ECDCA, compound 1a adopts cis-orientation in the A/B rings linkage, which is considered to be a unique feature for bile acids. The ring skeleton fits the binding pocket well through favorable VDW contacts and hydrophobic effects with adjacent residues. The 3α-OH group extends to the space between helix 7 and helix 10/11 and putatively participates in hydrogen bond interactions with residues Tyr358 and His444. Additionally, the terminal carboxyl group could interact with residue Arg328, located at the entrance of the binding pocket, through salt bridge or hydrogen bond interactions. All the interactions described above are beneficial to the binding of compound 1a to FXR. However, the hydrogen bond formed between the 7α-OH of 6-ECDCA and Tyr366 is absent for compound 1a due to its structural variation on ring B. Under the physical-shape discrimination mechanism employed by FXR (Mi et al., 2003), bile acids without a 7α-OH group, such as LCA and DCA, showed extremely weak affinity with FXR, which may also cause the moderate antagonistic activity of compound 1a against FXR. For the analogs where the 3α-OH groups are replaced by carbonyl groups (compounds 1b and 1c), no detectable antagonistic activity was found in the coactivator recruitment assay. The planarity of the double bond may restrict the carbonyl oxygen atom to a position distant from residues Tyr358 and His444, and the absence of corresponding hydrogen bond interactions presumably results in the loss of agonistic effects against FXR.
Compared with compound 1a, compound 2a has a relatively smaller volume, but displayed 10-fold stronger antagonistic activity. Apparently, compound 2a doesn't fit the canonical mechanism that nuclear receptors' antagonists are usually voluminous than agonists (Meyer et al., 2005). In the proposed binding pose with FXR, compound 2a reveals the same amphipathic properties as the bile acid ligands. The oxygen atom of the carboxyl group putatively forms hydrogen bond interactions with residues Ser329 and Tyr366. At the other end of compound 2a, hydrogen bond interactions could also form between the hydroxyl group and Arg328. The hydroxyl group seems to be essential for the antagonistic activity of compound 2a, as its analog with an acetyl group (compound 2b) exhibited no observed activity against FXR. Owing to the methyl group located at the10α-position, the carboxyl-substituted hydrocarbon ring A protrudes from the benzene ring panel, making VDW contacts and hydrophobic effects with Met325 on helix 5. Besides, the sequential ring structure could interact with loop H1-H2 (Met262), helix 3 (His291 and Met287), and helix 6 (Leu345) by favorable hydrophobic and VDW interactions.
Because of the relatively smaller volume, compound 2a is not able to extend to the pocket that is occupied by rings A and B of 6-ECDCA, hence no direct interactions with helix 10/11 and helix 12 were observed. Previous studies have suggested that the π-cation interaction between His444 on helix 10/11 and Trp466 on helix 12 plays a critical role for the active conformation of helix 12 induced by endogenous bile acids (Mi et al., 2003;Pellicciari et al., 2005). Steroid agonists with 3α-OH group could facilitate the π-cation interaction by providing appropriate disposition of His444 through the steric restriction of hydrogen bonds formed between 3α-OH and residues His444 and Tyr358. Consequently, without the ability of establishing the triad of Tyr358, His444, and Trp466, compound 2a couldn't secure helix 12 in the active conformation, thus preventing the recruitment of coactivators.

Phenols
Compounds 3a and 3c, named daphneone and daphnenone respectively , are constituents of Daphne odora Thunb. var. marginata, an ornamental plant whose growth is restricted to the south of China. The two compounds, together with their simple structures and small volume, are extraordinarily peculiar to present antagonistic effects against the CDCAinduced SRC-1 association with FXR. Moreover, it is difficult to find common structural features between the two compounds and known agonists or antagonists. Accordingly, we turned to the initial docking poses to probe the structural basis of the FXR antagonistic profiles of this chemical series (Figure 5).
The two compounds were selected from the virtual screening process using the crystal structure of FXR-fexaramine complex. Compound 3a is sandwiched in the cleft enclosed by helices 5, 7, and 10/11, partially overlapping with the fexaraminebinding pocket. The hydroxyl group points toward helix 5 and may form hydrogen bonds with residues Ser336 and His298.
On the other terminal, the benzene ring moiety extends to the aromatic residues-rich groove formed by Phe288 (helix 7), Trp458 (helix 10/11), and Phe465 (loop H10/11-H12), and probably contacts with these residues by advantageous πstacking interactions. The linker between the two benzene rings fits the pocket by hydrophobic interactions and VDW contacts with surrounding residues such as Leu291, Met294, Ile356, and Ile361. Previous molecular dynamics simulation studies assumed that the intrinsically unstable loop H10/11-12 controlled the flexibility of helix 12 . Through offset face-to-face π-stacking interaction between the benzene ring and Phe465, both compounds 3a and 3c could contact with the loop directly, which may interfere with the conformation of the loop and push helix 12 away from its active conformation. For compound 3c which has a double bond in the linker region, the antagonistic effect in the SRC-1-recruinment assay is slightly weaker. Presumably, the relatively flexible hydrocarbon linker is more suitable for the binding process with FXR-LBD, hence compound 3a displayed stronger antagonistic activity. When a hydroxyl group was introduced into the hydrocarbon linker, the antagonistic activity of compound 3b markedly decreased, displaying an IC 50 value higher than 25 µM. Whereas the much smaller compounds 3d and 3e, which share the phenolic moiety with 3a, exhibited moderate antagonistic effects against FXR. Compounds 3d and 3e were docked to FXR by Glide XP mode using the crystal structure 1OSH. As illustrated in Figure S5, the two small molecules occupy merely a fraction of the fexeramine-binding pocket. Hydrophobic effects and shape complementarities presumably dominate the interactions with FXR, as no hydrogen bond was detected in their proposed binding poses.
Notably, compounds 3a and 3c have been previously reported to show cytotoxic activities against a variety of human tumor cell lines, including K562, A549, MCF-7, LOVO, HepG2, and A375-S2, with the IC 50 values ranging from 3.12 to 51.0 µM Wang et al., 2012). Besides, the agonistic profiles of compounds 3d and 3e against ER have also been described in a previous study (Cao et al., 2013). The phenol FXR antagonists identified in this study are relatively small, especially for compounds 3d and 3e, which probably have effects on other targets in living cells. Further thorough investigations are ongoing to better elucidate the exact mechanisms of action of the newly identified natural FXR antagonists and their implications regarding in vivo pharmacological effects.

Druglikeness Evaluation
To assess the drug-like profiles of the six natural products, an in silico prediction of ADME properties was performed using QikProp v4.3 module integrated into Maestro 10.1, and 6-ECDCA was used as the reference compound ( Table 3). All physically significant descriptors and pharmaceutically relevant properties of the natural FXR antagonists, except for compound 1a, fall into the recommended ranges of 95% of known drugs, suggesting remarkable potential of druglikeness. The QPlogPo/w and QPlogS values of compound 1a exceed the limits of either Lipinski's rule of five or Jorgensen's rule of three, therefore the aqueous/lipid solubility should be taken into consideration if further structural optimization was carried out based on the tetracyclic triterpene compound 1a.

CONCLUSION
In summary, we have established a small NPD containing over 4,000 compounds that were previously isolated from about 100 medicinal plants. From the database, six FXR antagonists were identified by strategic virtual screening method, which validated the feasibility of virtual screening to explore the potential targets of natural products. Although procured on the basis of known agonist-binding pocket, two of the most potent compounds 2a and 3a could antagonize the CDCA-induced SRC-1 recruitment to FXR-LBD with the IC 50 values of 1.29 µM and 1.79 µM, respectively. The predicted docking mode of the diterpene 2a against FXR exhibited partially similar binding interactions to those of the crystallographic ligand 6-ECDCA bound to FXR, whereas the daphneone 3a showed noncanonical proposed binding mode, which may directly contact with the intrinsically unstable loop H10/11-12 by π-stacking interactions with the aromatic residue Phe465. Moreover, as assessed by QikProp, most of the natural FXR antagonists displayed comparable drug-like properties to that of 95% of known drugs. We hope our discovery will provide promising chemical scaffolds for further hit-to-lead optimization and for the study of FXR-related biological mechanisms.

AUTHOR CONTRIBUTIONS
HL, WZ, and LS conceived the study. YD and SL performed molecular simulations. YD analysed the data and drafted the manuscript. JJ, SZ, and JH carried out experimental studies. HL reviewed and revised the manuscript. All authors have read and approved the final manuscript.