- 1Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
- 2Laboratory of Chemoinformatics, Infochemistry Scientific Center, ITMO University, Saint Petersburg, Russia
Quantum computing holds promise for molecular similarity analysis in chemoinformatics and drug discovery. We propose a quantum circuit to encode the classically pre-computed Tanimoto similarity (T), obtained from extended-connectivity fingerprints (ECFPs) with RDKit, into a compact three-qubit entangled state using Qiskit. A 3-qubit circuit with RY rotations encodes T coefficients, while CNOT gates create an entangled three-qubit state that serves as a sensitive probe for quantum noise and error-mitigation performance. Simulations under noise demonstrate that exponential mitigation reduces errors by 75.0% for similar pairs (e.g., aspirin–aspirin) and 87.5% for dissimilar pairs (e.g., aspirin–butane) at a 1% error rate, maintaining fidelity within ±0.001 deviation. At 10% depolarization noise, error reduction drops to 25.0% and 17.4% for these pairs, respectively. The overall results show that the mitigation is proportionally more effective for low-similarity pairs. Experiments on IBM Quantum hardware confirm Z-basis reliability but reveal challenges with X-basis noise. Our work demonstrates quantum-encoded T representation and recovery on NISQ devices as a proof-of-concept, highlighting the critical role of error mitigation in hybrid quantum-classical workflows.
1 Introduction
Molecular similarity is a cornerstone of chemoinformatics and plays a vital role in virtual screening, compound clustering, and drug discovery (Isakova et al., 2024; López-Pérez et al., 2024; Shityakov and Dandekar, 2010). Quantifying the likeness between chemical compounds is crucial for predicting their biological activity and identifying potential drug candidates. Among the various metrics available, the Tanimoto similarity (T) coefficient, also known as the Jaccard index, stands out as a widely adopted and effective measure for comparing compounds on the basis of their binary fingerprint representations (Safizadeh et al., 2021).
Fingerprints, such as extended-connectivity fingerprints (ECFPs) or Morgan fingerprints, encode structural information into bit strings, allowing for rapid T calculations. These classical methods have been extensively used in computer-aided drug design, as described by Zhou and Skolnick (2024), and Shityakov et al. (2021) and provide a benchmark for novel computational approaches.
On the other hand, quantum computing presents an intriguing alternative for tackling complex problems in biochemistry and rational drug design. It offers the potential to leverage quantum parallelism to process probabilistic states more efficiently than classical counterparts in some domains.
Various studies have explored the application of quantum algorithms in computational chemistry, aiming to leverage quantum phenomena such as superposition and entanglement to improve performance (Biamonte et al., 2017; Mazzola, 2024; Quinton et al., 2025). For example, researchers have investigated quantum machine learning models for predicting molecular properties and activities, with some studies showing promising results in accuracy and efficiency compared with classical methods (Akrom et al., 2024). This study builds upon the integration of quantum chemical property features to reduce the computational burden while maintaining prediction accuracy for the corrosion inhibition of quinoxaline compounds (Akrom et al., 2024). Some T functions have also been implemented in sophisticated computational techniques to assess the degree of similarity between different images, optimizing the performance of quantum retrieval systems (Yang et al., 2025).
On the other hand, the application of quantum circuits to encode Tanimoto similarity via molecular similarity-based descriptors remains unexplored, particularly concerning how quantum noise affects the accuracy of T estimation. Given that quantum computers are still in their early stages of development and are susceptible to noise, understanding and mitigating the effects of noise is critical for realizing the full potential of quantum algorithms in chemistry (Prasad et al., 2024; Ziems et al., 2025). Therefore, this work contributes to quantum-encoded chemoinformatics by exploring the feasibility and limitations of quantum circuits for molecular similarity analysis. We investigated noise mitigation techniques to improve practical applicability, supporting hybrid quantum-classical workflows.
To our knowledge, this is the first demonstration of a compact 3-qubit quantum circuit that encodes a pre-calculated classical Tanimoto coefficient (derived from extended-connectivity fingerprints) into an entangled GHZ-like state. Although Tanimoto similarity itself is computed classically with maximal efficiency, this encoding provides a chemically motivated, minimal benchmark for rigorously evaluating noise resilience and error-mitigation strategies on current NISQ hardware.
2 Computational methods
We implemented RDKit v.2025.03.2 to compute ECFPs, specifically Morgan fingerprints) for input SMILES strings (Bento et al., 2020; Rogers and Hahn, 2010). The canonical SMILES for aspirin (PubChem CID: 2244) and butane (PubChem CID: 7843) was retrieved from the PubChem database (Cincilla et al., 2010). Equations 1–5 define the Tanimoto similarity and its quantum encoding. Equations 6–13 describe noise modeling, mitigation, and error metrics. The Tanimoto similarity between two fingerprint vectors was calculated as the ratio of the intersection over the union of their bits, defined as
where A and B are the binary fingerprint vectors for two molecules,
The drug-likeness scores were calculated using the MolSoft drug-likeness model leverages a machine learning approach based on Naive Bayes (NB) classifier (MolSoft ICM-Pro v.3.9-2b, MolSoft LLC, academic license) according to the standard protocol (Poola et al., 2023).
The model incorporates a set of 64 physicochemical and topological descriptors, including molecular weight, logP (octanol-water partition coefficient), polar surface area (PSA), number of hydrogen bond donors and acceptors, rotatable bonds, and molecular flexibility indices. These descriptors are evaluated against a trained model derived from a proprietary database of over 10,000 drug-like and non-drug-like compounds, employing an NB classifier to generate a continuous drug-likeness score.
All quantum simulations were conducted in the Python environment, incorporating the Qiskit v. 1.4.2 package libraries (Zhang et al., 2022). Molecular structures were represented via simplified molecular-input line-entry system (SMILES) strings and processed with RDKit to compute Morgan fingerprints (radius = 2). To map classical T scores onto quantum circuits, a rotation encoding scheme was implemented, following established quantum circuit design principles for encoding classical data (Grant et al., 2018). To encode Tanimoto similarity into the quantum state
Next, we need to map the
This implies:
Thus, the quantum state became:
These angles were applied to qubit RY gates in a three-qubit quantum circuit, followed by the entrance of CX gates to simulate correlation propagation. The quantum circuits were executed via both ideal and noisy simulation backends from the Qiskit Aer Python package. Noise models incorporated depolarizing errors with tunable probabilities on three-qubit (CX) gates, which is consistent with typical noise characteristics in NISQ devices (Bharti et al., 2022). To evaluate the impact of quantum noise, simulations were repeated across a logarithmic range of error rates from 10−3 to 10−1, with and without error mitigation techniques. Mitigation was applied by scaling expectation values from measured outcomes based on of an exponential noise model published in the literature elsewhere (Quek et al., 2024). The circuit generates a Greenberger–Horne–Zeilinger (GHZ)-like state, in which ideal measurements are perfectly correlated, yielding either all 0s or all 1s, thereby encoding similarity. Exponential mitigation improves fidelity under noise by scaling the noisy expectation values. Additional validation was conducted via IBM Quantum’s Estimator runtime on both the simulated and hardware backend. Quantum circuits were transpiled to native instruction sets and custom observables (e.g., IIZ, ZIZ, XIX, etc.) were mapped using Qiskit pass managers and sparse Pauli operators.
The measurement outcomes were analyzed to extract expectation values and standard deviations for observable-based inference. All the experiments used 1,000 measurement shots per simulation to ensure statistical significance according to the Qiskit standard protocol (Zhang et al., 2022). The 3-qubit quantum circuit design encoded the base similarity as a rotation angle (Figure 1A). Entanglement was induced via CNOT gates from the first qubit to the second and third qubits, and measurements were performed on all qubits. A depolarizing error model was introduced, including 3-qubit errors, which were applied via Qiskit’s noise model and simulated via AerSimulator. Visualization of the results, including rotation angle mappings, noise-induced similarity distortion, and mitigation effectiveness, was performed via the Matplotlib Python package and GraphPad Prism software.
Figure 1. (A) Quantum circuit with three qubits and three classical bits: an RY(π/2) gate is applied to q0, followed by CNOT gates from q0 to q1 and q2. All qubits are measured and mapped to classical bits. (B) Relationship between base similarity and the RY rotation angle: as T increases from 0 to 1, the RY angle decreases from π to 0, illustrating how quantum rotation affects molecular similarity.
3 Results and discussion
To evaluate quantum-assisted molecular similarity under noise, we simulate pairwise comparisons via ideal and noisy quantum environments, focusing on the Tanimoto similarity derived from molecular fingerprints and encoded into quantum circuits via controlled RY rotations. We demonstrated that a quantum circuit can encode classical similarity values (0–1) into quantum states via an RY rotation gate, with the angle
Unlike a 1-qubit circuit, which cannot produce entanglement, or a 2-qubit circuit, which offers limited complexity, the 3-qubit design balances simplicity with the ability to study noise effects and error mitigation on NISQ devices. By using RY and CNOT gates, the circuit encodes similarity probabilistically and supports rich observable measurements, making it ideal for demonstrating quantum-encoded similarity analysis while remaining feasible for current hardware. This design aligns with recent work on scalable quantum circuits for chemical applications (Cao et al., 2019).
The relationship between the similarity angle and the RY rotation angle is nonlinear, with θ decreasing from π (T = 0) to 0 (T = 1). At T = 0.5, θ = π/2, producing a balanced superposition (Figure 1B). The steepest angle changes occur near similarity extremes, confirming the RY gate’s suitability for encoding similarity probabilistically.
To elucidate the impact of quantum noise on simulation outcomes, we contrasted ideal and noisy environments for a 3-qubit circuit. The probability difference (
where
In the ideal simulation, the probability distribution is sharply concentrated in the |000⟩ and |111⟩ states, reflecting perfectly correlated measurements and high-fidelity similarity encoding, with virtually no probability assigned to other states. This demonstrates the circuit’s ability to encode similarity with precision when noise is absent. In contrast, the noisy simulation reveals significant deviations: while |000⟩ and |111⟩ remain the most likely outcomes, a noticeable probability appears in erroneous states, such as |001⟩ and |010⟩, caused by depolarizing errors in the quantum gates (Figures 2A,B). This finding indicates that quantum noise disrupts the intended correlations, reducing the overall fidelity of the encoded similarity. Nevertheless, the persistence of dominant |000⟩ and |111⟩ populations in the noisy scenario suggests that the circuit retains some resilience to moderate noise levels. Overall, the comparison highlights that quantum noise introduces distortions and erroneous outcomes, underscoring the need for error mitigation to preserve the reliability of quantum-assisted similarity estimation in practical, noisy quantum devices. These findings emphasize the critical need for error mitigation strategies to increase the reliability of quantum computations, particularly in NISQ devices, where maintaining fidelity is essential for practical applications in chemoinformatics (El Azzouzi et al., 2022; Shityakov et al., 2014).
Figure 2. The noisy quantum simulations show how noise affects quantum state fidelity. Measurement outcomes as counts (A) and probability (B) differences in noisy results for 3-qubit states using a noisy backend and revealing erroneous states for most of the quantum states except for |000⟩ and |111⟩ (B), highlighting the need for error mitigation in quantum computing.
Next, we investigated two molecular systems (aspirin vs. itself and aspirin vs. butane) to assess quantum noise effects on molecular similarity, with and without mitigation. The drug molecule, aspirin (CC(=O)OC1=CC=CC=C1C(=O)O), exhibits a positive drug-likeness score (0.27) and is widely used in QSAR studies for designing novel COX-2 inhibitors (Uzzaman and Mahmud, 2020). On the contrary, butane (CCCC) as a simple hydrocarbon substance, has a negative drug-likeness score (−1.27) due to its lack of functional groups typical of drug-like compounds. Positive scores indicate a higher probability of drug-like properties, while negative scores suggest characteristics less typical of drug candidates.
The baseline Tanimoto similarity was quantified for them being 1.0 for aspirin vs. itself and 0.035 for aspirin vs. butane applying Morgan fingerprints with a default radius of 2.0 (Skuta et al., 2020). A 3-qubit quantum circuit encoded similarity with an RY(θ) rotation gate and two CNOT gates to form a GHZ-like state, with a |000⟩ state probability as the similarity metric. The model corrected the noisy expectation value
where Si = ±1 (sign for outcome i), Ci is the count of outcome i, and S is the total shots (frequency of each measurement outcome observed across all shots).
Error mitigation was performed using exponential rescaling of expectation values derived from a depolarizing noise model (Quek et al., 2024). This approach is conceptually aligned with near-term error mitigation strategies that rescale noisy observables to approximate their noiseless values, as discussed in the previously published research on quantum error mitigation (Koczor, 2021).
The 3-qubit GHZ-like circuit encodes Tanimoto similarity T via an RY rotation and entanglement, and the noisy expectation ⟨ZZZ⟩ is corrected by rescaling with eϵ., namely:
where
This exponential mitigation reduced errors stemming from RY gate inaccuracies, CNOT-induced correlations, and bit-flip measurement errors, particularly affecting low-similarity cases.
The simplicity of mitigation is suitable for near-term devices, but its exponential scaling assumption limits versatility, as noted in advanced mitigation studies (Kandala et al., 2019). The unmitigated and mitigated error rates (
Simulations with 1,000 shots compared ideal conditions to a noisy model incorporating three-qubit depolarizing errors ranging from 0.1% to 10% chance of error are shown in Table 1.
Table 1. Summary of unmitigated and mitigated Tanimoto similarity values, error rates, and mitigation effectiveness for aspirin–aspirin (similar) and aspirin–butane (dissimilar) pairs across noise levels in a 3-qubit quantum circuit.
The results revealed a slight nonlinear decrease in unmitigated Tanimoto similarity for the aspirin–aspirin pair, dropping from 1.0 at low error rates (0.1%) and transitioning to an exponential decline at moderate (approximately 1.0%–1.5%) and high (10%) error rates, reaching 0.9 at a 10% chance of error (Figure 3A). In contrast, the mitigated similarity for the same pair, using exponential error mitigation, remained closer to 1.0. This approach reduced the error by up to 75.0% at a 1% chance of error (10−2 error rate), although it was significantly less effective (40% and 25.0%) at higher error rates (5%–10%) (Figure 3B).
Figure 3. Effect of noise and mitigation on quantum Tanimoto similarity. Unmitigated and mitigated similarity for aspirin–aspirin (A,B) and aspirin–butane (C,D) pairs showing significant distortion as error rates increase (10−3 to 10−1).
Interestingly, the unmitigated similarity for the dissimilar aspirin–butane pair exhibited a more pronounced distortion, with a sharp increase in similarity to 0.15 at a 10% chance of error (Figure 3C), suggesting more severe impacts for dissimilar molecules, due to measurement noise, not chemical similarity. However, with mitigation, the similarity remained relatively close to the baseline up to a 1.5% chance of error (Figure 3D), deviating only slightly (±0.001), although effectiveness again declined at higher error rates (5.0%–10%). This approach reduced the error by up to 87.5% at a 1% chance of error (10−2 error rate), although it was less effective (17.39%) at higher error rate (10%). From these findings, it is clear that mitigation strategies are particularly beneficial in low-similarity cases, where small errors can cause disproportionate effects, as supported by research on error sensitivity in quantum algorithms (Li, 2022).
Overall, increasing quantum error rates had distinct impacts on the accuracy of the molecular similarity estimations. At low error rates (∼0.1%), the results closely approximated ideal, noise-free behavior, indicating that the quantum circuits effectively preserved similarity information. At moderate error rates (1%–1.5%), which are typical of current quantum hardware, significant deviations emerged, although exponential mitigation reduced errors by 70%–90%, improving fidelity. At high error rates (e.g., 10%), noise severely disrupts quantum states, impairing the ability of the circuit to distinguish between similar and dissimilar molecules.
These findings highlight the importance of error mitigation and hardware improvements for reliable quantum-assisted chemoinformatics. Exponential mitigation with exponential scaling reduces errors most effectively in the 0.5%–2% range but tends to overcorrect below 0.5% and undercorrect above 5% (Endo et al., 2021; Temme et al., 2017). Given current hardware constraints (e.g., CNOT error rates of 0.5%–2%), chemically relevant results require either maintaining error rates below 1% or employing advanced techniques such as dynamical decoupling (Blunt et al., 2024; Prasad et al., 2024). Future work should investigate adaptive mitigation scaling, alternative encoding strategies, and testing on real quantum hardware.
To assess the impact of real hardware noise on the encoded molecular similarity, we executed the 3-qubit circuit on IBM Quantum hardware and compared the results with the noise-free FakeAlmadenV2 simulator. The circuit encoded T = 1.0 (aspirin–aspirin, θ = 0) and T ≈ 0 (aspirin–butane, θ ≈ π) using an RY(θ) rotation followed by CNOT-induced entanglement. After transpilation with Qiskit’s optimization level 1, six Pauli observables (IIZ, IIX, IZI, IXI, ZIZ, XIX) were measured using Estimator V2 (resilience level 1, 1,000 shots).
For T = 1.0, the Z-basis observables (IIZ, IZI, ZIZ) yielded high expectation values (0.98–0.99) with low standard deviations, confirming robust GHZ-like correlations (Figure 4A). For T ≈ 0, IIZ and IZI showed reduced magnitudes (−0.97 ± 0.004 and −0.93 ± 0.006) and larger uncertainties, reflecting noise-induced disruption of the expected anti-correlated state (Figure 4B). In contrast, the X-basis observables (IIX, IXI, XIX) are ideally exactly zero for any encoded T value because the state
Figure 4. The expectation values ⟨Z0Zi⟩ of Z-basis (IIZ, IZI, ZIZ) and X-basis (IIX, IXI, XIX) observables for a 3-qubit quantum circuit encoding Tanimoto similarity on IBM Quantum hardware, comparing similar (T = 1.0) aspirin–aspirin (A) and dissimilar (T ≈ 0.0) aspirin–butane (B) pairs.
Although the Tanimoto similarity is extracted solely from the Z-basis observable ⟨ZZZ⟩, these X-basis measurements directly explain the higher error sensitivity observed for low-similarity pairs in our simulations and highlight transverse noise as the primary fidelity-limiting factor on today’s NISQ devices.
On IBM Quantum hardware, we estimated ⟨ZZZ⟩ using a product approximation method, without additional error mitigation, from the measured Z-basis observables ⟨IIZ⟩, ⟨IZI⟩, and ⟨ZIZ⟩, with results projected onto the physically allowed range [−1, 1]. The resulting quantum-estimated Tanimoto similarities (Table 2) show strong agreement with classical baselines: ΔT = 0.0113 for aspirin–aspirin and ΔT = 0.0102 for aspirin–butane. These results demonstrate that a simple, physically constrained observable approximation can recover chemically meaningful similarity scores on real NISQ hardware, even in the absence of explicit mitigation, though residual noise contributes to the observed deviations.
Table 2. Comparison of classical Tanimoto similarity with quantum-encoded estimates obtained via the product approximation method.
Overall, our simulations show that expectation-based estimation and mitigation enable accurate similarity recovery at moderate noise levels (1%–1.5%). These findings emphasize the need for error mitigation, such as dynamical decoupling, to improve NISQ hardware reliability, particularly for X-basis operations (Prasad et al., 2024). This approach establishes a scalable foundation for hybrid quantum-classical workflows in chemoinformatics, aligning with recent quantum chemistry advancements (Arute et al., 2019; Bauer et al., 2020).
4 Conclusion
This work introduces a quantum circuit-based framework for encoding molecular similarity via RY rotations and entanglement, offering a practical approach to quantum-encoded chemoinformatics in NISQ devices. By encoding Tanimoto similarity derived from ECFPs into a 3-qubit quantum circuit, we demonstrated that classical similarity scores can be accurately represented as quantum probabilities. Ideal simulations showed high fidelity, with the entangled GHZ-like state yielding perfectly correlated |000⟩ and |111⟩ outcomes, confirming effective similarity encoding.
In noisy simulations with depolarizing error models (0.1%–10%), fidelity degrades, particularly in circuits involving low-similarity molecules such as aspirin–butane, where unmitigated similarity values deviate sharply (e.g., rising to 0.15 at a 10% error rate). Exponential error mitigation reduced errors by up to 75.0% for high-similarity pairs (aspirin–aspirin) and 87.5% for low-similarity pairs, maintaining similarity estimates within ±0.01 of baseline values at up to 1.5% error rates. Mitigation was most effective in the 0.5%–1.5% error range, aligning with current hardware performance levels.
Experiments on IBM Quantum hardware further validate the approach. Z-basis observables (from which the encoded similarity is extracted) retain high fidelity for T = 1.0 and acceptable fidelity for T ≈ 0, whereas X-basis observables, ideally zero for any encoded T, reveal pronounced transverse noise and decoherence as the dominant error sources on today’s superconducting processors. These hardware results align closely with our noisy simulations and underscore why low-similarity encodings are more fragile in practice. Taken together, the study demonstrates that even with today’s noisy hardware and minimal circuit depth, classical chemoinformatics metrics can be reliably encoded into quantum states and recovered using lightweight error mitigation. The proposed 3-qubit GHZ-like benchmark provides a realistic, application-driven testbed for evaluating noise resilience and mitigation protocols—tasks that are critical for the development of hybrid quantum-classical workflows in drug discovery and chemoinformatics. Future directions should focus on optimizing circuit depth, refining mitigation techniques, and leveraging adaptive encoding strategies to expand the applicability of quantum-encoded similarity analysis in drug discovery.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
SS: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing. TD: Writing – original draft, Writing – review and editing.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Acknowledgements
We thank the IBM Quantum team for providing the simulated backends, the Qiskit Runtime, and the RDKit community for providing robust chemoinformatics tools.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Akrom, M., Rustad, S., Dipojono, H. K., and Maezono, R. (2024). A comprehensive approach utilizing quantum machine learning in the study of corrosion inhibition on quinoxaline compounds. Artif. Intell. Chem. 2 (2), 100073. doi:10.1016/j.aichem.2024.100073
Arute, F., Arya, K., Babbush, R., Bacon, D., Bardin, J. C., Barends, R., et al. (2019). Quantum supremacy using a programmable superconducting processor. Nature 574 (7779), 505–510. doi:10.1038/s41586-019-1666-5
Bauer, B., Bravyi, S., Motta, M., and Chan, G. K. L. (2020). Quantum algorithms for quantum chemistry and quantum materials science. Chem. Rev. 120 (22), 12685–12717. doi:10.1021/acs.chemrev.9b00829
Bento, A. P., Hersey, A., Félix, E., Landrum, G., Gaulton, A., Atkinson, F., et al. (2020). An open source chemical structure curation pipeline using RDKit. J. Cheminform 12 (1), 51. doi:10.1186/s13321-020-00456-1
Bharti, K., Cervera-Lierta, A., Kyaw, T. H., Haug, T., Alperin-Lea, S., Anand, A., et al. (2022). Noisy intermediate-scale quantum algorithms. Rev. Mod. Phys. 94 (1), 015004. doi:10.1103/revmodphys.94.015004
Biamonte, J., Wittek, P., Pancotti, N., Rebentrost, P., Wiebe, N., and Lloyd, S. (2017). Quantum machine learning. Nature 549 (7671), 195–202. doi:10.1038/nature23474
Blunt, N. S., Gehér, G. P., and Moylett, A. E. (2024). Compilation of a simple chemistry application to quantum error correction primitives. Phys. Rev. Res. 6 (1), 013325. doi:10.1103/physrevresearch.6.013325
Cao, Y., Romero, J., Olson, J. P., Degroote, M., Johnson, P. D., Kieferová, M., et al. (2019). Quantum chemistry in the age of quantum computing. Chem. Rev. 119 (19), 10856–10915. doi:10.1021/acs.chemrev.8b00803
Cincilla, G., Thormann, M., and Pons, M. (2010). Structuring chemical space: similarity-based characterization of the PubChem database. Mol. Inf. 29 (1-2), 37–49. doi:10.1002/minf.200900015
El Azzouzi, M., Azzaoui, K., Warad, I., Hammouti, B., Shityakov, S., Sabbahi, R., et al. (2022). Moroccan, Mauritania, and Senegalese gum Arabic variants as green corrosion inhibitors for mild steel in HCl: weight loss, electrochemical, AFM and XPS studies. J. Mol. Liq. 347, 118354. doi:10.1016/j.molliq.2021.118354
Endo, S., Cai, Z., Benjamin, S. C., and Yuan, X. (2021). Hybrid quantum-classical algorithms and quantum error mitigation. J. Phys. Soc. Jpn. 90 (3), 032001. doi:10.7566/jpsj.90.032001
Grant, E., Benedetti, M., Cao, S., Hallam, A., Lockhart, J., Stojevic, V., et al. (2018). Hierarchical quantum classifiers. npj Quantum Inf. 4 (1), 1–8. doi:10.1038/s41534-018-0116-9
Isakova, A. M., Kovalenko, A. A., Skorb, E. V., and Shityakov, S. (2024). NeuroClick: software for mimicking click reaction to generate drug-like molecules permeating the blood-brain barrier. Future Med. Chem. 16 (5), 389–398. doi:10.4155/fmc-2023-0017
Kandala, A., Temme, K., Córcoles, A. D., Mezzacapo, A., Chow, J. M., and Gambetta, J. M. (2019). Error mitigation extends the computational reach of a noisy quantum processor. Nature 567 (7749), 491–495. doi:10.1038/s41586-019-1040-7
Koczor, B. (2021). Exponential error suppression for near-term quantum devices. Phys. Rev. X 11, 031057. doi:10.1103/physrevx.11.031057
Li, K. (2022). The qubit fidelity under different error mechanisms based on error correction threshold. Front. Phys. 10, 893507. doi:10.3389/fphy.2022.893507
López-Pérez, K., Avellaneda-Tamayo, J. F., Chen, L., López-López, E., Juárez-Mercado, K. E., Medina-Franco, J. L., et al. (2024). Molecular similarity: theory, applications, and perspectives. Artif. Intell. Chem. 2 (2), 100077. doi:10.1016/j.aichem.2024.100077
Mazzola, G. (2024). Quantum computing for chemistry and physics applications from a Monte Carlo perspective. J. Chem. Phys. 160 (1), 010901. doi:10.1063/5.0173591
Poola, A. A., Prabhu, P. S., Murthy, T. P. K., Murahari, M., Krishna, S., Samantaray, M., et al. (2023). Ligand-based pharmacophore modeling and QSAR approach to identify potential dengue protease inhibitors. Front. Mol. Biosci. 10, 1106128. doi:10.3389/fmolb.2023.1106128
Prasad, V. K., Cheng, F., Fekl, U., and Jacobsen, H. A. (2024). Applications of noisy quantum computing and quantum error mitigation to “adamantaneland”: a benchmarking study for quantum chemistry. Phys. Chem. Chem. Phys. 26 (5), 4071–4082. doi:10.1039/d3cp03523a
Quek, Y., Stilck França, D., Khatri, S., Meyer, J. J., and Eisert, J. (2024). Exponentially tighter bounds on limitations of quantum error mitigation. Nat. Phys. 20 (10), 1648–1658. doi:10.1038/s41567-024-02536-7
Quinton, F. A., Myhr, P. A. S., Barani, M., Crespo Del Granado, P., and Zhang, H. (2025). Quantum annealing applications, challenges and limitations for optimization problems compared to classical solvers. Sci. Rep. 15 (1), 12733. doi:10.1038/s41598-025-96220-2
Rogers, D., and Hahn, M. (2010). Extended-connectivity fingerprints. J. Chem. Inf. Model. 50 (5), 742–754. doi:10.1021/ci100050t
Safizadeh, H., Simpkins, S. W., Nelson, J., Li, S. C., Piotrowski, J. S., Yoshimura, M., et al. (2021). Improving measures of chemical structural similarity using machine learning on chemical-genetic interactions. J. Chem. Inf. Model 61 (9), 4156–4172. doi:10.1021/acs.jcim.0c00993
Shityakov, S., and Dandekar, T. (2010). Lead expansion and virtual screening of Indinavir derivate HIV-1 protease inhibitors using pharmacophoric - shape similarity scoring function. Bioinformation 4 (7), 295–299. doi:10.6026/97320630004295
Shityakov, S., Puskás, I., Roewer, N., Förster, C., and Broscheit, J. (2014). Three-dimensional quantitative structure-activity relationship and docking studies in a series of anthocyanin derivatives as cytochrome P450 3A4 inhibitors. Adv. Appl. Bioinform Chem. 7, 11–21. doi:10.2147/AABC.S56478
Shityakov, S., Skorb, E. V., Förster, C. Y., and Dandekar, T. (2021). Scaffold searching of FDA and EMA-approved drugs identifies lead candidates for drug repurposing in Alzheimer's disease. Front. Chem. 9, 736509. doi:10.3389/fchem.2021.736509
Skuta, C., Cortés-Ciriano, I., Dehaen, W., Kříž, P., van Westen, G. J. P., Tetko, I. V., et al. (2020). QSAR-derived affinity fingerprints (part 1): fingerprint construction and modeling performance for similarity searching, bioactivity classification and scaffold hopping. J. Cheminform 12 (1), 39. doi:10.1186/s13321-020-00443-6
Temme, K., Bravyi, S., and Gambetta, J. M. (2017). Error mitigation for short-depth quantum circuits. Phys. Rev. Lett. 119 (18), 180509. doi:10.1103/PhysRevLett.119.180509
Urbanek, M., Nachman, B., Pascuzzi, V. R., He, A., Bauer, C. W., and de Jong, W. A. (2021). Mitigating depolarizing noise on quantum computers with noise-estimation circuits. Phys. Rev. Lett. 127 (27), 270502. doi:10.1103/PhysRevLett.127.270502
Uzzaman, M., and Mahmud, T. (2020). Structural modification of aspirin to design a new potential cyclooxygenase (COX-2) inhibitors. In Silico Pharmacol 8 (1), 1. doi:10.1007/s40203-020-0053-0
Yang, Q., Feng, X., and Wei, L. (2025). Image similarity quantum algorithm and its application in image retrieval systems. Entropy (Basel) 27 (2), 137. doi:10.3390/e27020137
Zhang, M., Dong, L., Zeng, Y., and Cao, N. (2022). Improved circuit implementation of the HHL algorithm and its simulations on QISKIT. Sci. Rep. 12 (1), 13287. doi:10.1038/s41598-022-17660-8
Zhou, H., and Skolnick, J. (2024). Utility of the Morgan fingerprint in structure-based virtual ligand screening. J. Phys. Chem. B 128 (22), 5363–5370. doi:10.1021/acs.jpcb.4c01875
Keywords: extended-connectivity fingerprints, near-term quantum computing, noise mitigation, quantum-encoded chemoinformatics, Tanimoto similarity
Citation: Shityakov S and Dandekar T (2026) Entangled fingerprints for quantum-encoded chemoinformatics: quantum circuits for molecular similarity in the noisy era. Front. Chem. 13:1707409. doi: 10.3389/fchem.2025.1707409
Received: 17 September 2025; Accepted: 10 December 2025;
Published: 06 January 2026.
Edited by:
Xinguo Ren, Chinese Academy of Sciences (CAS), ChinaReviewed by:
Vidya Niranjan, R. V. College of Engineering (RVCE), IndiaDingshun Lv, ByteDance Inc., China
Copyright © 2026 Shityakov and Dandekar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sergey Shityakov, c2hpdHlha29mZkBob3RtYWlsLmNvbQ==