- 1QUEST Center for Responsible Research, Berlin Institute of Health at Charité—Universitätsmedizin Berlin, Berlin, Germany
- 2Division of Clinical Pharmacology, Ludwig-Maximilians-Universität (LMU) University Hospital, LMU Munich, Munich, Germany
- 3German Cancer Consortium (DKTK), Ludwig-Maximilians-Universität (LMU) University Hospital, LMU Munich, Munich, Germany
- 4Einheit für Klinische Pharmakologie (EKLiP), Helmholtz Zentrum München—German Research Center for Environmental Health Neuherberg, Munich, Germany
- 5Department of Pediatrics, Division of Oncology and Hematology, Charité—Universitätsmedizin Berlin, Berlin, Germany
- 6Department of Internal Medicine V, University Hospital Schleswig-Holstein, University of Kiel, Kiel, Germany
- 7German Centre for Cardiovascular Research (DZHK), Partner Site Hamburg/Kiel/Lübeck, Kiel, Germany
- 8Faculty of Medicine, Infection Immunology Group, Institute of Medical Microbiology and Hospital Hygiene, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
- 9Research Group Immune Regulation, Helmholtz Centre for Infection Research, Braunschweig, Germany
- 10Leibniz Institute on Aging, Fritz Lipmann Institute, Jena, Germany
- 11Institute of Psychopharmacology, Central Institute of Mental Health, Mannheim, Germany
- 12Guarantors of EQIPD e.V. Heidelberg, Heidelberg, Germany
- 13PAASP GmbH, Heidelberg, Germany
- 14Department of Paediatrics, Children’s Cancer Research Centre, Kinderklinik München Schwabing, TUM School of Medicine, Technical University of Munich, Munich, Germany
- 15Medical Faculty at University Hospital Carl Gustav Carus, Dresden University of Technology, Dresden, Germany
- 16Division Polymer Biomaterials Science, Leibniz Institute for Polymer Research Dresden, Dresden, Germany
- 17Institute for Medical Information Processing, Biometry, and Epidemiology, Faculty of Medicine, LMU Munich, Munich, Germany
Successful translation of promising preclinical findings into clinical application remains challenging. To address the rising concerns of failing clinical trials and the resulting economic, social, and ethical consequences, preclinical confirmatory studies have been proposed to generate sufficiently robust evidence for guiding the decision-making process. In a unique funding call, 17 studies in Germany aimed to confirm exploratory findings across various biomedical research fields in a rigorously planned and executed multi-laboratory set-up. Alongside these preclinical research projects, a meta-research project was funded to provide methodological support and collectively investigate confirmatory study design and experimental outcomes. After the first four-year funding period, an in-person workshop brought together representatives from the preclinical confirmatory studies to discuss lessons learned. We summarize the outcomes of these stakeholder discussions, highlight common pitfalls, and propose optimization strategies for experimental set-up and project coordination. As a result, we advocate for new roles—such as preclinical research coordinators—and improved rules and regulations in preclinical research to facilitate large-scale academic research projects. Moreover, we highlight that diverse stakeholders must collaborate to effectively integrate confirmatory multi-laboratory studies into the preclinical research ecosystem.
1 Introduction
Preclinical research seeks to generate evidence on the safety, efficacy, and mechanisms of novel interventions. Despite advancements in technology and major breakthroughs in patient-relevant discoveries, translational failures are still common where promising therapies fail to demonstrate efficacy in humans (1–4). These setbacks do not only pose serious health risks to patients who may receive new treatments with little to no benefit or worse, unforeseen harmful side effects (1), but also strain financial and healthcare systems (2, 5, 6). Moreover, as preclinical research predominantly relies on animal experiments to evaluate new interventions, studies that are poorly designed, underpowered, or use models that fail to reflect physical and biochemical characteristics of human disease, waste resources and raise ethical concerns (7). The drug development process from preclinical testing to clinical trial success is thus lengthy, uncertain and often high-risk. Consequently, existing strategies must be refined and extended to guide the decision-making in preclinical research towards increased efficiency.
One such extension involves conducting preclinical confirmatory studies as a decisive evaluation step before advancing to clinical trials (8–10). These studies help strengthen the evidence basis and narrow down the number of potential drug candidates, for instance (11). How does a confirmatory study differ from replication and exploratory studies? Most preclinical studies aim to generate hypotheses, making them inherently exploratory (12, 13). Relying on flexible experimental setups and different methodologies, these single-laboratory studies often lack transparency and sufficient standardization in study design, power calculations, statistical analyses, and reporting—frequently resulting in biased, underpowered, and poorly replicable findings (14–16).
In contrast, replication and confirmation studies adhere to rigorous methods mirroring those of clinical trials, including randomization, blinding, and adequate sample size calculations (12, 17). Whereas replication studies aim to repeat a previous experiment or study using the same methods and procedures to assess the reliability of the original result (18), confirmatory studies seek to validate, modify, or refute an existing knowledge claim by systematically altering certain aspects of the original experiment, thereby increasing the external validity of findings (9, 10, 12).
Another way to strengthen the external validity of findings is to conduct experiments across multiple laboratories (multi-laboratory study). Several preclinical multi-lab studies have demonstrated how these rigorous efforts yield more reliable, externally, and internally valid findings (19–21). A systematic assessment of preclinical multi-lab studies showed that these studies often produce smaller, more realistic effects sizes compared to exploratory single-lab studies (22). Furthermore, they often yield null or mixed results, demonstrating that multi-lab approaches better reflect true biological variance (22). Despite these successes, some multi-lab studies have also faced major challenges, for instance, in harmonizing and adhering to protocols across labs (21).
To encourage preclinical confirmatory multi-lab studies and explore their value, the German Federal Ministry of Research, Technology and Space (BMFTR) announced a unique funding line (23, 24). Seventeen funded projects performed rigorous efficacy studies to confirm previous exploratory findings across diverse biomedical research fields (psychiatry, oncology, immunology, cardiology, neurology, orthopedics, gastroenterology, urology). Each project conducted a number of experiments including a designated in vivo experiment across at least two partner laboratories at national or international level. Aiming at translation, these consortia implemented practices that enhanced both the validity (internal, external, translational) (9), and reliability of their results. Concretely, project groups followed harmonized protocols across participating labs, applied strategies to reduce risk of bias (e.g., randomization and blinding), and calculated required sample sizes a priori (25). Furthermore, some groups refined their disease models by, for instance, including both animal sexes or adopting more standardized induction techniques (25, 26). After the first four-year funding period, consortium representatives were invited to participate in an in-person workshop hosted by the DECIDE (Decision-Enabling Confirmation of Innovative Discoveries and exploratory Evidence) project team at the BIH QUEST Center for Responsible Research of the Berlin Institute of Health (BIH) at Charité—Universitätsmedizin Berlin. The DECIDE project, funded under the same BMFTR calls, provides methodological support for the confirmatory studies throughout the funding period while also analyzing study design, conduct, and outcome from a meta-research perspective (27, 28). By convening preclinical researchers, biostatisticians, quality management experts, clinicians, veterinarians, and representatives from the project management agency, the group jointly identified and discussed challenges and success factors of conducting preclinical confirmatory studies. In this article, we summarize the results of these stakeholder discussions and propose actionable recommendations. Whereas no single strategy fits all scenarios, this overview intends to inform the current state of knowledge based on the community’s experience to drive future advancements in preclinical research.
2 Confirmatory studies—common pitfalls and solution strategies
Although consortia varied in disease area, primary outcome, study design, methods, and models used, they all encountered similar challenges during the first funding period. Table 1 provides an overview of common pitfalls in the experimental set-up and project coordination of preclinical confirmatory multi-laboratory studies, along with proposed solution strategies derived from stakeholder discussions. The following sections explore overarching themes in greater detail. To illustrate some of the potential challenges involved in planning and conducting a preclinical confirmatory study, we present a fictionalized case study in Box 1. This example synthesizes experiences drawn from the actual confirmatory studies conducted during the first funding period without disclosing identifiable project details.
Table 1. Overview of common pitfalls and proposed optimization strategies for the experimental set-up and project coordination in preclinical confirmatory multi-lab studies, with selected practical resources and tools.
Illustrative case study—conducting a preclinical confirmatory study.
A consortium has successfully acquired funding to conduct a preclinical confirmatory study. The aim of this study is to confirm previous findings regarding the efficacy of compound X as a novel intervention against disease Y. To test this hypothesis, the hypothesis-generating in vivo experiment of the exploratory study will be conducted again across three different partner laboratories with several adaptations: The confirmatory study will use aged animals of both sexes to better reflect the clinical population. Model induction will follow a new standardized technique to reduce variance in primary outcome measurement. The intervention will be administered acutely rather than prophylactically to improve translational relevance. In addition to saline as a negative control, a clinical gold-standard treatment will serve as an additional comparator.
2.1 Centralized protocols and decision trees
When multiple laboratories participate in a study, the risk of introducing unwanted inter-lab variability increases. This heterogeneity may stem from differences across labs in experimental protocols, animal housing and handling, equipment, or data entry and analysis (10). To minimize undesired variability in the experimental design, data management, and statistical analyses, it is essential to centrally draft and distribute comprehensive, clear, and detailed protocol documents and ideally pre-register them (29, 30). These documents should include Standard Operating Procedures (SOPs) to ensure consistent protocol harmonization, compound preparation, in vitro and in vivo experimentation, as well as blinding and randomization procedures. Ideally, Quality Management Plans (QMPs) (31), Data Management Plans (DMPs), Statistical Analysis Plans (SAPs), and Missing Data Plans should also be drafted centrally prior to project start, then distributed and discussed across all participating labs (32).
It is, however, important to note that not all aspects of a multi-lab study can or should be harmonized. Whereas standardization in animal experimentation can effectively reduce unwanted variability, it may overlook biological variation, thus limiting the generalizability of findings. To address these shortcomings, the concept of systematic heterogenization has been introduced (10, 33). By systematically incorporating sources of biological variability, for instance using animals of different sex or age, varying housing conditions, or changing testing times or methods, this approach is considered to improve study design and result generalizability (33). In other cases, harmonization of all protocols may not be possible, especially when collaborating with international academic labs or industry partners. This includes different laboratory infrastructure and equipment, institutional or regulatory limitations, or financial and resource constraints. It is thus essential to transparently document and communicate these limitations and differences whilst finding a balance between standardization, systematic heterogenization and feasibility.
Moreover, to address possible emergency events such as failure of protocols, allocation disclosure, or compound discontinuation, centralized decision trees, contingency plans, and guidelines should be established early on. However, as workshop participants acknowledged, not all unexpected events can be planned for and even the best-prepared strategies may fall short. It is therefore essential to adapt, evolve, document, and overcome setbacks as they arise without losing focus.
2.2 Training and pilot studies
Standardized training sessions and workshops for staff are crucial to ensure rigorous implementation of centralized procedures across all participating labs. These sessions should cover all aspects of centralized workflows with particular emphasis on animal housing and handling, experimental and operational techniques, quality management, and data management. Initial support and on-site visits of more experienced scientists are highly encouraged to facilitate knowledge transfer and consistent training levels among experimenters, particularly for highly specialized methods. If travel is not feasible, the distribution of learning materials, such as videos, serves as an effective alternative.
Pilot studies are useful for validating protocols, standardizing expected drug dose ranges and establishing new methods or models across participating labs. These small-scale experiments help evaluate feasibility and ensure consistent baseline measurements across sites in alignment with the 3Rs principle (Reduce, Replace, Refine) (34) before launching the full-scale confirmatory study. It should be noted that conducting pilot studies for in vivo experiments can be challenging due to ethical considerations and regulatory constraints. However, if discussed early with regulatory authorities and justified in the sample size calculation of the animal experimentation permit, such studies are still feasible in many cases.
2.3 Regulatory affairs
Obtaining animal experimentation permits for a confirmatory multi-lab study was reported as one of the most significant challenges, whether involving different labs within a single federal state (Bundesland), across federal states, or international partner labs. This process was further complicated through the absence of a standardized application template for animal experimentation across German states, varying local regulatory requirements, and differing expectations from local animal welfare officers.
Confirmatory multi-lab studies are new to the German research landscape. Due to Germany’s federal system, each participating lab is required to submit its own animal permit application—an inherently time-consuming process. Explaining that these studies are not large-scale replications but rather adequately powered confirmatory studies proved challenging, especially since large-scale replication studies are viewed critically by authorities for ethical reasons. To streamline the process, early engagement with regulatory bodies and close collaboration with animal welfare officers proved beneficial. Consultations and expert support, such as those provided by DECIDE and Responsible PrecliniX (RPX), were reported to be particularly helpful. RPX, a research and support unit at the BIH QUEST Center, assists preclinical research projects to generate patient-informed, robust evidence and has extensive experience in drafting animal experimentation permits in Germany (35).
2.4 Documentation
Comprehensive documentation is crucial for managing the complexities of a multi-lab study. To efficiently keep track of protocols, compounds, techniques, blinding, and randomization across labs, transparent and standardized documentation must be established. This also applies to quality management, data management, statistical analyses, and reporting. Several useful guidelines and tools have already been developed to support these latter areas, a selection of which is outlined below.
For quality management, it is recommended to follow GLP (Good Laboratory Practice) (36), PREMIER (Predictiveness and Robustness through Modular Improvement of Experimental Research) (37), and EQIPD (Enhancing Quality in Preclinical Data) (38) quality systems, in addition to the PREPARE guidelines (Planning Research and Experimental Procedures on Animals) (39). Data management is greatly improved by adhering to FAIR (Findable, Accessible, Interoperable, Reusable) principles (40) using standardized formats, a file nomenclature, and a centralized platform aligned with the DMP. Ideally, such a platform should be web-based and secure, supporting collaborative access across labs with role-based permissions. This allows for selective blinding of information from specific users to mitigate risk of bias. Moreover, the platform must ensure data quality, integrate audit trails and version control for traceability, and serve as a central repository for data, protocol documents, and backups—as seen in platforms like REDCap (41). Reporting should follow the ARRIVE 2.0 guidelines (Animal Research: Reporting of In Vivo Experiments) (42) and make use of tools such as the Experimental Design Assistant (EDA) (43).
Documentation is equally important for communication. Meeting minutes should be recorded and stored on an accessible, cloud-based server. Given the frequent turnover of scientists in academia, maintaining consistent and well-structured documentation is key to ensure continuous knowledge transfer within the project.
2.5 Communication
Effective communication is one of the most critical aspects of coordinating a confirmatory multi-lab study. Specific rules, relationships, and obligations should be established in a consortium agreement along with a clear governance structure to prevent possible conflicts. Together with pre-defined agreements and guidelines on leadership, intellectual property (IP), ownership, and authorship, such frameworks ensure accountability, enable effective decision-making, and balance representation across partnering labs. These commitments must undergo regular review to ensure compliance and must be refined as needed.
Centralized platforms and regular (virtual) meetings are essential for effective communication and trouble-shooting in a multi-lab and multi-stakeholder study. Defined communication channels should inform relevant project members of project status updates, facilitate decision-making processes, and ensure the implementation of centralized procedures across labs. Establishing a common language among all stakeholders—including preclinical scientists, clinicians, patient representatives, biostatisticians, research technicians, and project managers—is challenging yet indispensable for an effective communication throughout the project. While patient engagement is increasingly common in clinical trials, it remains underutilized in preclinical research (44). Confirmatory studies, with their explicit translational focus, offer opportunities for meaningful collaborations provided researchers communicate with other stakeholders clearly and accessibly (45, 46).
3 The role of the study coordinator
Confirmatory multi-lab studies hold great promise for generating more robust evidence to guide decision-making in the preclinical trajectory. Nonetheless, they also present an increased workload and diverse challenges in terms of coordination, collaboration, administration, and communication. In clinical research contexts with frequent multi-center trials, roles such as Clinical Research Coordinators (CRC) and Clinical Research Associates (CRA) are common practice. These professionals coordinate clinical trials overseeing data collection, reporting, documentation, and quality control among other responsibilities. In contrast, preclinical research often lacks a dedicated position for study coordination or research management. Of the 12 confirmatory studies funded in the first BMFTR call, nine projects indicated that the leading Principal Investigator (PI) oversaw planning, management, and coordination tasks, often supported by institutional funding. In two other groups, the PI was supported by a postdoc or a clinical trial competence center for project management—and only one group had a designated study coordinator. In alignment with discussions held during the workshop, this highlights the underestimation of the time and resource intensity of such projects.
We thus advocate for new professional figures in the preclinical field, namely preclinical research coordinators (PRCs). Unlike traditional project managers, these roles require strong domain-specific and interdisciplinary knowledge. Specifically, their responsibilities go beyond administrative oversight to include a strong understanding of experimental design, animal models, lab equipment, quality management, GLP compliance, regulatory guidelines, and animal welfare regulations. PRCs serve as a link between all stakeholders of a preclinical study, including scientists, biostatisticians, regulatory authorities, and funding bodies amongst others (Figure 1). As study coordinator, they should be an integral part of the consortium’s coordinating center ensuring effective study oversight. This coordinating center may be one partner lab serving as coordinating lead or may consist of representatives from each partner lab.
Figure 1. Task and stakeholder mapping of a preclinical research coordinator (PRC). This figure highlights the central coordinating role of the PRC in a preclinical research project, connecting with key stakeholders to ensure effective project execution. Stakeholders are arranged uniformly for visual clarity, however their actual degree of involvement with the PRC can vary considerably.
To ensure effective project execution as PRC, we propose five actionable recommendations: First, establish a comprehensive study handbook from the start of the project. This living document should be continuously updated and must define all key aspects of the project including experimental procedures, DMPs, QMPs, and contingency plans in case of adverse events. Second, foster clear and consistent communication. As the first point of contact across collaborating labs, the PRC sets up open communication channels and schedules regular meetings with relevant stakeholders. Third, ensure efficient time and resource management throughout the project. The PRC monitors milestones and ensures structure and visibility around deadlines, thus enabling the research team to remain focused and accountable. Fourth, ensure quality control and data management. The PRC maintains centralized platforms for data collection and sharing across laboratories and implements quality control measures throughout the project to identify potential problems early. Lastly, stay flexible and solution-orientated. As not every challenge will have a pre-defined handbook entry, the PRC should not only anticipate unforeseen challenges but remain adaptable and foster open and collaborative problem-solving among stakeholders.
4 Discussion
The collective experiences gathered here from preclinical consortia emphasize the importance of confirmatory studies in translational research. Whilst generating more robust evidence, these studies are also highly complex, resource-intensive and bear certain pitfalls. Sustainably integrating confirmatory research into the biomedical landscape will require various stakeholders to adapt and work collaboratively, as outlined below.
4.1 Directions for funders and institutions
Funding agencies have yet to fully recognize the value of preclinical confirmatory studies for translational research. Without explicit funder support, adoption of these complex, collaborative study designs will remain limited. We thus advocate for more funding opportunities for both newly emerging preclinical confirmatory infrastructures and existing multi-lab networks and study platforms. A successful example of a sustainable preclinical multi-stakeholder initiative supporting translational research includes ITCC-P4 (47), a non-profit oncology platform that started as a public-private funded project. For funders to effectively assess confirmatory research proposals, we propose a multidisciplinary panel of experts, including domain-specific reviewers, biostatisticians, and experimental design experts to assess the underlying exploratory data and proposed confirmatory study design. Given the complexity of these large-scale projects, funding calls must offer flexibility in timelines, allocation of finances, and allow for international consortia or industry partners. Furthermore, new roles such as preclinical research coordinators (PRCs) should be formally recognized and reflected in funding budgets. Given the current scarcity of these roles, targeted funding is needed to create educational and training resources to build these interdisciplinary competencies. Preclinical confirmatory studies also require stronger institutional support. Translational research centers such as the German Centers for Health Research (DZG) (48) could lead by creating policies and providing infrastructure and resources for such collaborative research. Without coordinated support from both funders and institutions, conducting these studies remains practically infeasible.
4.2 Regulatory requirements
To promote the concept of confirmatory multi-lab studies in the preclinical field, clear guidelines and consistent regulations must be established for effective national and international collaborations. This includes, for example, standardized templates for animal experimentation permits and biometric form sheets to reduce administrative workload. To optimize the regulatory approval of a preclinical multi-lab study, we argue for adopting a model similar to that used in clinical trials. In clinical multi-center trials, a leading ethics committee is responsible for the application processing. Other involved committees receive a document copy and must still complete their own ethical and legal reviews, which remains time-intensive. However, the leading committee significantly streamlines the overall process. For preclinical multi-lab studies, each laboratory currently applies for separate animal experimentation permits with varying local requirements, and regulatory authorities rarely communicate with one another. A lead regulatory authority coordinating the animal experimentation process for the entire multi-lab study would significantly reduce delays and bureaucratic burden for all parties.
4.3 Gaining recognition in the scientific community
Unanimously, workshop participants emphasized the importance for researchers, both experienced and early career scientists, to recognize the value of preclinical confirmatory research. Improved science communication and visibility—through publications, conferences, workshops, and dedicated training programs integrated into graduate student curricula—could serve as impactful strategies to inform and educate.
There is also a strong need for greater recognition from journals, as many editors and reviewers are unfamiliar with preclinical confirmatory multi-lab study designs. Some may even be dismissive, citing a perceived lack of novelty in confirmatory research or expressing reluctance to publish null results. This lack of recognition ultimately discourages both senior and early-career researchers from engaging in such studies. One way to appeal to more preclinical researchers is for funding agencies to ensure publication in collaboration with journals and publishers and to offer incentives for pre-registration of confirmatory studies. For instance, Registered Reports (RR; a publishing format in which study plans undergo peer-review and approval before experiments are conducted) offer an effective way to ensure publication while simultaneously reducing publication bias and reporting bias (49). As an example, the Norwegian funding agency Stiftelsen Dam requires all studies to be publicly pre-registered on its registry on the Open Science Framework (OSF) and provides registration templates (50, 51). Moreover, the agency has implemented Registered Reports into their latest research funding program to strengthen transparency, rigor, and reproducibility (52). Another strategy to encourage researchers to publish RRs is through Registered Reports Funding Partnerships (RRFPs). RRFPs are a novel concept where funding agencies and academic journals collaborate to streamline both the grant application and the initial stage of an RR, ensuring eventual publication as an incentive for the scientific community (53).
In addition to pre-registration and RRs, we advocate for a nuanced adoption of Open Science practices to enhance preclinical research quality, transparency, and reproducibility. Through measures like openly accessible standardized protocols and repositories for all types of data, scientists can benefit from prior research, save resources, optimize the development of new confirmatory infrastructures, and ultimately accelerate progress in preclinical research.
Confirmatory multi-lab studies offer the potential to generate more robust evidence in preclinical research. Through this, they will putatively promote collaborative efforts, drive medical progress, and reduce research waste, ultimately enhancing translation from bench-to-bedside. However, herein described studies are highly complex, time-consuming, and resource-intensive, and require more rigorous planning, collaboration, and oversight than common exploratory single-lab studies. We thus propose actionable strategies and advocate for new professional roles such as preclinical research coordinators (PRCs) and for the involvement of diverse stakeholders to enhance preclinical evidence synthesis. While we acknowledge that our recommendations bear certain limitations and are not fit-for-all-purpose, we stress the importance and necessity of confirmatory preclinical research to drive innovative biomedical advancements.
Data availability statement
The datasets presented in this article are not readily available because the datasets generated and/or analyzed during the current study, including presentation slides and collaborative workshop notes, contain identifiable project-level information from multiple research consortia and are not publicly available due to confidentiality agreements. De-identified summaries of the themes and findings derived from the workshop discussions are available from the corresponding author upon request. Requests to access the datasets should be directed to c29waGlhLnJvdHRlckBjaGFyaXRlLmRl.
Author contributions
SR: Conceptualization, Data curation, Methodology, Project administration, Visualization, Writing – original draft. MA-A: Conceptualization, Funding acquisition, Methodology, Project administration, Writing – original draft, Writing – review & editing. ND: Writing – original draft, Writing – review & editing. PP: Writing – original draft, Writing – review & editing. SK: Funding acquisition, Writing – original draft, Writing – review & editing. GR: Funding acquisition, Writing – original draft, Writing – review & editing. OJM: Data curation, Writing – original draft, Writing – review & editing. DB: Funding acquisition, Writing – original draft, Writing – review & editing. LR: Writing – original draft, Writing – review & editing. BG: Writing – original draft, Writing – review & editing. LS: Writing – original draft, Writing – review & editing. JS-H: Writing – original draft, Writing – review & editing. ID: Writing – original draft, Writing – review & editing. SK: Writing – original draft, Writing – review & editing. JW: Writing – original draft, Writing – review & editing. UT: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This work was funded by the German Federal Ministry for Research, Technology and Space (Bundesministerium für Forschung, Technologie und Raumfahrt; BMFTR) under Grant Numbers 01KC1901A, 01KC2306, 01KC2003A, 01KC2005, 01KC2012E, 01KC2012B, and 01KC2304, as well as by the German Research Foundation (Deutsche Forschungsgesellschaft; DFG) under Grant Number BO3139/7, 01KC2004A, 01KC2005B, 01KC2005A, 01KC2012B, 01KC2006A. JS-H acknowledges additional support from the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) through Grant Numbers TRR 369, Project A03, and SA 4045/8-3, as well as from the Elsbeth Bonhoff Foundation (Project Number 262).
Conflict of interest
BG is a member of the executive board of the Guarantors of EQIPD e.V., a shareholder of PAASP GmbH, and is employed by the Central Institute of Mental Health and the Guarantors of EQIPD e.V.
SK has received honoraria from TCR2 Inc, Novartis, BMS, Miltenyi, Regeneron, Cymab, Galapagos, Plectonic and GSK. SK is inventor of several patents in the field of immuno-oncology. SK received license fees from TCR2 Inc and Carina Biotech. SK received research support from TCR2 Inc., Plectonic GmBH, Catalym GmBH and Arcus Bioscience for work unrelated to the manuscript.
The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that Generative AI was used in the creation of this manuscript. The authors confirm that all contents presented in this manuscript originate solely from the experiences and contributions of the workshop participants. Generative artificial intelligence (ChatGPT, GPT-5, OpenAI) was used only as a language-editing tool to suggest alternative phrasings and improve clarity. No sentences were directly inserted into the manuscript; all suggestions were critically reviewed and adapted by the authors for accuracy and originality.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Seyhan, AA. Lost in translation: the valley of death across preclinical and clinical divide – identification of problems and overcoming obstacles. Transl Med Commun. (2019) 4:18. doi: 10.1186/s41231-019-0050-7
2. Cummings, J, Feldman, HH, and Scheltens, P. The “rights” of precision drug development for Alzheimer’s disease. Alzheimer's Res Ther. (2019) 11:76. doi: 10.1186/s13195-019-0529-5,
3. Hutchinson, L, and Kirk, R. High drug attrition rates—where are we going wrong? Nat Rev Clin Oncol. (2011) 8:189–90. doi: 10.1038/nrclinonc.2011.34,
4. Ineichen, BV, Furrer, E, Grüninger, SL, Zürrer, WE, and Macleod, MR. Analysis of animal-to-human translation shows that only 5% of animal-tested therapeutic interventions obtain regulatory approval for human applications. PLoS Biol. (2024) 22:e3002667. doi: 10.1371/journal.pbio.3002667,
5. Freedman, LP, Cockburn, IM, and Simcoe, TS. The economics of reproducibility in preclinical research. PLoS Biol. (2015) 13:e1002165. doi: 10.1371/journal.pbio.1002165,
6. Macleod, MR, Michie, S, Roberts, I, Dirnagl, U, Chalmers, I, Ioannidis, JPA, et al. Biomedical research: increasing value, reducing waste. Lancet. (2014) 383:101–4. doi: 10.1016/S0140-6736(13)62329-6,
7. Perrin, S. Preclinical research: make mouse studies work. Nature. (2014) 507:423–5. doi: 10.1038/507423a,
8. Mogil, JS, and Macleod, MR. No publication without confirmation. Nature. (2017) 542:409–11. doi: 10.1038/542409a,
9. Drude, NI, Martinez Gamboa, L, Danziger, M, Dirnagl, U, and Toelch, U. Improving preclinical studies through replications. eLife. (2021) 10:e62101. doi: 10.7554/eLife.62101,
10. Carneiro, CFD, Drude, N, Hülsemann, M, Collazo, A, and Toelch, U. Mapping strategies towards improved external validity in preclinical translational research. Expert Opin Drug Discov. (2023) 18:1273–85. doi: 10.1080/17460441.2023.2251886,
11. Lyden, PD, Diniz, MA, Bosetti, F, Lamb, J, Nagarkatti, KA, Rogatko, A, et al. A multi-laboratory preclinical trial in rodents to assess treatment candidates for acute ischemic stroke. Sci Transl Med. (2023) 15:eadg8656. doi: 10.1126/scitranslmed.adg8656,
12. Kimmelman, J, Mogil, JS, and Dirnagl, U. Distinguishing between exploratory and confirmatory preclinical research will improve translation. PLoS Biol. (2014) 12:e1001863. doi: 10.1371/journal.pbio.1001863,
13. Dirnagl, U. Rethinking research reproducibility. EMBO J. (2019) 38:e101117. doi: 10.15252/embj.2018101117,
14. Sena, E, van der Worp, HB, Howells, D, and Macleod, M. How can we improve the pre-clinical development of drugs for stroke? Trends Neurosci. (2007) 30:433–9. doi: 10.1016/j.tins.2007.06.009,
15. Howells, DW, Sena, ES, and Macleod, MR. Bringing rigour to translational medicine. Nat Rev Neurol. (2014) 10:37–43. doi: 10.1038/nrneurol.2013.232,
16. Button, KS, Ioannidis, JPA, Mokrysz, C, Nosek, BA, Flint, J, Robinson, ESJ, et al. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci. (2013) 14:365–76. doi: 10.1038/nrn3475,
17. Drude, NI, Martinez-Gamboa, L, Danziger, M, Collazo, A, Kniffert, S, Wiebach, J, et al. Planning preclinical confirmatory multicenter trials to strengthen translation from basic to clinical research – a multi-stakeholder workshop report. Transl Med Commun. (2022) 7:24. doi: 10.1186/s41231-022-00130-8
19. Llovera, G, Hofmann, K, Roth, S, Salas-Pérdomo, A, Ferrer-Ferrer, M, Perego, C, et al. Results of a preclinical randomized controlled multicenter trial (pRCT): anti-CD49d treatment for acute brain ischemia. Sci Transl Med. (2015) 7:299ra121–1. doi: 10.1126/scitranslmed.aaa9853
20. Jones, SP, Tang, XL, Guo, Y, Steenbergen, C, Lefer, DJ, Kukreja, RC, et al. The NHLBI-sponsored consortium for preclinicAl assESsment of cARdioprotective therapies (CAESAR). Circ Res. (2015) 116:572–86. doi: 10.1161/CIRCRESAHA.116.305462,
21. Maysami, S, Wong, R, Pradillo, JM, Denes, A, Dhungana, H, Malm, T, et al. A cross-laboratory preclinical study on the effectiveness of interleukin-1 receptor antagonist in stroke. J Cereb Blood Flow Metab. (2016) 36:596–605. doi: 10.1177/0271678X15606714,
22. Hunniford, VT, Grudniewicz, A, Fergusson, DA, Montroy, J, Grigor, E, Lansdell, C, et al. A systematic assessment of preclinical multilaboratory studies and a comparison to single laboratory studies. eLife. (2023) 12:e76300. doi: 10.7554/eLife.76300,
23. Internetredaktion RBL. (2019). Deutsche Zentrum für Luft und Raumfahrt e.V. - DLR Gesundheitsforschung. Richtlinie zur Förderung von konfirmatorischen präklinischen Studien – Qualität in der Gesundheitsforschung – - DLR Gesundheitsforschung. Available online at: https://www.gesundheitsforschung-bmbf.de/de/8344.php (Accessed December 16, 2025).
24. Internetredaktion RBL. (2019). Deutsche Zentrum für Luft und Raumfahrt e.V. - DLR Gesundheitsforschung. Richtlinie zur Förderung von präklinischen konfirmatorischen Studien und systematischen Reviews - DLR Gesundheitsforschung. Available online at: https://www.gesundheitsforschung-bmbf.de/de/14868.php (Accessed March 10, 2025).
25. Haller, B, Richter, GHS, Wachtel, M, Schuler, L, Regina, C, Renz, B, et al. Epicycle: a confirmatory preclinical study of the anti-rhabdomyosarcoma efficacy of BET bromodomain and cyclin-dependent kinase 9 inhibitors. Biomed Pharmacother. (2025) 192:118704. doi: 10.1016/j.biopha.2025.118704,
26. Internetredaktion RBL. (n.d.). Deutsche Zentrum für Luft und Raumfahrt e.V. - DLR Gesundheitsforschung. IVOLDAMT-HF - In vivo-Validierung der L-2-Hydroxyglutarat Dehydrogenase als neue metabolische Zielstruktur bei Herzinsuffizienz - DLR Gesundheitsforschung. Available online at: https://www.gesundheitsforschung-bmftr.de/de/ivoldamt-hf-in-vivo-validierung-der-l-2-hydroxyglutarat-dehydrogenase-als-neue-13181.php (Accessed December 16, 2025).
27. Internetredaktion RBL. (n.d.). Deutsche Zentrum für Luft und Raumfahrt e.V. - DLR Gesundheitsforschung. DECIDE II - Wissenschaftliches Begleitprojekt für präklinische konfirmatorische Studien - DLR Gesundheitsforschung. Available online at: https://www.gesundheitsforschung-bmftr.de/de/decide-ii-wissenschaftliches-begleitprojekt-fur-praklinische-konfirmatorische-studien-18001.php (Accessed December 16, 2025).
28. Max-Delbrück-Centrum BI für GC und. Berliner Institut für Gesundheitsforschung - Charité und Max-Delbrü`ck-Centrum. (2015). DECIDE - phase II - Projekte - BIH at Charité. Available online at: https://www.bihealth.org/de/translation/innovationstreiber/quest-center/projekte/projekt/decide-phase-ii (Accessed May 21, 2025)
29. Nosek, BA, Ebersole, CR, DeHaven, AC, and Mellor, DT. The preregistration revolution. Proc Natl Acad Sci. (2018) 115:2600–6. doi: 10.1073/pnas.1708274114,
30. Heinl, C, Scholman-Végh, AMD, Mellor, D, Schönfelder, G, Strech, D, Chamuleau, S, et al. Declaration of common standards for the preregistration of animal research—speeding up the scientific progress. PNAS Nexus. (2022) 1:pgac016. doi: 10.1093/pnasnexus/pgac016,
31. Hülsemann, M, Wiebach, J, Drude, NI, Kniffert, S, Behm, L, Hönzke, K, et al. Introducing quality measures in an academic research consortium: lessons and recommendation from implementing an ad hoc quality management system for organ model research: lessons and recommendation from implementing an ad hoc quality management system for organ model research. EMBO Rep. (2022) 23:e55095. doi: 10.15252/embr.202255095,
32. Arroyo-Araujo, M, Carneiro, CFD, Piper, S, Wilcke, J, Ellenbach, N, Boulesteix, AL, et al. (2024). Data analysis planning and reporting for confirmatory multi-lab preclinical trials. OSF. Available online at: https://osf.io/cnuh7_v1 (Accessed March 7, 2025)
33. Voelkl, B, Altman, NS, Forsman, A, Forstmeier, W, Gurevitch, J, Jaric, I, et al. Reproducibility of animal research in light of biological variation. Nat Rev Neurosci. (2020) 21:384–93. doi: 10.1038/s41583-020-0313-3,
34. Russell, WMS, and Burch, RL. The principles of humane experimental technique. Med J Aust. (1960) 1:500–12. doi: 10.5694/j.1326-5377.1960.tb73127.x,
35. Max-Delbrück-Centrum BI für GC und. Berliner Institut für Gesundheitsforschung - Charité und Max-Delbrück-Centrum. (2015). Responsible PrecliniX (RPX) - service unit - service - BIH at Charité. Available online at: https://www.bihealth.org/en/quest/service/service/responsible-preclinix (Accessed March 28, 2025)
36. OECD. (n.d.). OECD series on principles of good laboratory practice and compliance monitoring. Available online at: https://www.oecd.org/en/publications/oecd-series-on-principles-of-good-laboratory-practice-and-compliance-monitoring_2077785x.html (Accessed March 25, 2025)
37. Home - Premier. (n.d.). Available online at: https://premier-qms.org/ (Accessed March 28, 2025)
38. Bespalov, A, Bernard, R, Gilis, A, Gerlach, B, Guillén, J, Castagné, V, et al. Introduction to the EQIPD quality system. eLife. 2021;10:e63294.
39. Smith, AJ, Clutton, RE, Lilley, E, Hansen, KEA, and Brattelid, T. PREPARE: guidelines for planning animal research and testing. Lab Anim. (2018) 52:135–41. doi: 10.1177/0023677217724823,
40. Wilkinson, MD, Dumontier, M, Aalbersberg, IJ, Appleton, G, Axton, M, Baak, A, et al. The FAIR guiding principles for scientific data management and stewardship. Sci Data. (2016) 3:160018. doi: 10.1038/sdata.2016.18
41. Harris, PA, Taylor, R, Thielke, R, Payne, J, Gonzalez, N, and Conde, JG. Research electronic data capture (REDCap)—a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. (2009) 42:377–81. doi: 10.1016/j.jbi.2008.08.010,
42. Percie Du Sert, N, Hurst, V, Ahluwalia, A, Alam, S, Avey, MT, Baker, M, et al. The ARRIVE guidelines 2.0: updated guidelines for reporting animal research. PLoS Biol. (2020) 18:e3000410. doi: 10.1371/journal.pbio.3000410
43. du Sert, NP, Bamsey, I, Bate, ST, Berdoy, M, Clark, RA, Cuthill, IC, et al. The experimental design assistant. Nat Methods. (2017) 14:1024–5. doi: 10.1038/nmeth.4462,
44. Fox, G, Fergusson, DA, Daham, Z, Youssef, M, Foster, M, Poole, E, et al. Patient engagement in preclinical laboratory research: a scoping review. EBioMedicine. (2021) 70:103484. doi: 10.1016/j.ebiom.2021.103484,
45. Duffett, L. Patient engagement: what partnering with patient in research is all about. Thromb Res. (2017) 150:113–20. doi: 10.1016/j.thromres.2016.10.029,
46. Foster, M, Fergusson, DA, Thompson, E, Hunniford, V, Scott, T, Daniels, S, et al. Identifying challenges and enablers to engaging patients in preclinical laboratory research. medRxiv. (2025):2025.09.19.25336090. doi: 10.1101/2025.08.15.25332894,
47. ITCCP (2023). Laboratory models for systematic efficacy testing of compounds. Available online at: https://itccp4.com/about-us (Accessed September 25, 2025)
48. DZG – Deutsche Zentren der Gesundheitsforschung. (n.d.). DZG – Deutsche Zentren der Gesundheitsforschung. Available online at: https://deutschezentren.de/ (Accessed November 21, 2025)
49. Chambers, CD, and Tzavella, L. The past, present and future of registered reports. Nat Hum Behav. (2022) 6:29–42. doi: 10.1038/s41562-021-01193-7,
50. Science C for O. (2023). Member spotlight: Stiftelsen dam uses OSF registries to promote open practices in Health Research. Available online at: https://www.cos.io/blog/member-spotlight-stiftelsen-dam (Accessed December 16, 2025).
51. DAM Registry (n.d.). Search. Available online at: https://osf.io/registries/dam/discover
52. DAM (n.d.). Stiftelsen Dams program Forskning. Available online at: https://www.dam.no/programmer/forskning/utlysning/ (Accessed December 16, 2025).
53. Drax, K, Clark, R, Chambers, CD, Munafò, M, and Thompson, J. A qualitative analysis of stakeholder experiences with registered reports funding partnerships. Wellcome Open Res. (2021) 6:230. doi: 10.12688/wellcomeopenres.17029.1 (Accessed December 16, 2025).,
54. Toolbox - EQIPD. (n.d.). Available online at: https://wiki.go-eqipd.org/wiki/Toolbox (Accessed December 16, 2025).
55. 3R Kompetenznetzwerk NRW. (n.d.). SOP im Tierversuchsbereich. Available online at: https://www.3r-netzwerk.nrw/massnahmen-fuer-die-3rs/sop-im-tierversuchsbereich (Accessed December 16, 2025).
56. PAASP 2. 1.7 blinding procedure.docx. (2020). Available online at: https://paasp.sharepoint.com/:w:/s/EQIPD/EZbRvZmZGoRGtsjc1Wk_XOsBLsnNAbg-FBFVj9h199oYMA?rtime=GbPpMIQc3kg (Accessed December 16, 2025).
57. van Eenige, R., Verhave, PS, Koemans, PJ, Tiebosch, IACW, Rensen, PCN, and Kooijman, S. Randomice, a novel, user-friendly randomization tool in animal research. PLoS One 2020;15:e0237096.
58. RvE54. GitHub. (2022). Releases RvE54/RandoMice. Available online at: https://github.com/RvE54/RandoMice/releases (Accessed December 16, 2025).
59. Randomizer (2007). Research randomizer. Available online at: https://www.randomizer.org/#randomize (Accessed December 16, 2025).
60. GraphPad QuickCalcs: (n.d.). Random number calculators. Available online at: https://www.graphpad.com/quickcalcs/randmenu/ (Accessed December 16, 2025).
61. group TR w. (2023). Reporting in vitro experiments responsibly – the RIVER recommendations. MetaArXiv; Available online at: https://osf.io/preprints/metaarxiv/x6aut_v1/ [Accessed November 5, 2025].
62. EQUATOR Network (n.d.). Enhancing the QUAlity and transparency of Health Research. Available online at: https://www.equator-network.org/ (Accessed December 16, 2025).
63. G*Power. (2007). Available online at: https://www.psychologie.hhu.de/arbeitsgruppen/allgemeine-psychologie-und-arbeitspsychologie/gpower (Accessed December 16, 2025).
64. Lenhard, W, and Lenhard, A (2017). Computation of effect sizes. Unpublished. Available online at: https://www.researchgate.net/doi/10.13140/RG.2.2.17823.92329 (Accessed December 16, 2025).
65. The Embassy of Good Science. (2020). Statistical pre-registration. Available online at: https://embassy.science:443/wiki/Theme:349f9eb9-b796-46cb-9a98-214c06db9046 (Accessed December 16, 2025).
66. Data Stewardship Wizard. (n.d.). Data stewardship wizard. Available online at: https://ds-wizard.org/ (Accessed December 16, 2025).
67. PCT. (2023). Available online at: https://preclinicaltrials.eu/ [Accessed November 5, 2025].
68. Home. (n.d.). AsPredicted. Available online at: https://aspredicted.org/ [Accessed November 5, 2025].
69. Animalstudyregistry. (2019). animalstudyregistry.org: welcome. Available online at: https://www.animalstudyregistry.org/asr_web/index.action (Accessed December 16, 2025).
70. OSF (n.d.). Available online at: https://osf.io/ (Accessed December 16, 2025).
71. OSF (n.d.). Welcome to Registrations & Preregistrations! - OSF support. Available online at: https://help.osf.io/article/330-welcome-to-registrations#Select-a-Registration-Template-1g4YO (Accessed December 16, 2025).
72. DESCA (2006). DESCA Model Consortium Agreement - DESCA 2020 Model Consortium Agreement. [cited 2025 Nov 10]. Available online at: https://www.desca-agreement.eu/desca-model-consortium-agreement/ [Accessed November 10, 2025].
73. Officeapps (n.d.). Heads-of-terms-for-consortium-agreements.Docx. Available online at: https://view.officeapps.live.com/op/view.aspx?src=https%3A%2F%2Fassets.publishing.service.gov.uk%2Fmedia%2F5a7f386c40f0b6230268e2d2%2FHeads-of-Terms-for-Consortium-Agreements.docx&wdOrigin=BROWSELINK (Accessed December 16, 2025).
74. Woodhams, J, and Dally, K (2025). Good authorship practice. UK research integrity office (UKRIO). Available online at: https://ukrio.org/wp-content/uploads/Good-Authorship-Practice.pdf (Accessed December 16, 2025).
75. CRediT. (n.d.). CRediT. Available online at: https://credit.niso.org/ (Accessed December 16, 2025).
Keywords: preclinical research, confirmatory studies, multi-laboratory trials, reproducibility, stakeholder engagement, experimental design
Citation: Rotter SC, Arroyo-Araujo M, Drude NI, Pellegrini P, Kobold S, Richter GHS, Müller OJ, Bruder D, Riecken LB, Gerlach B, Schuler L, Salbach-Hirsch J, Dalloul I, Kühn S, Wilcke JC and Toelch U (2026) Strengthening translational preclinical research through confirmatory multi-laboratory studies. Front. Med. 12:1715361. doi: 10.3389/fmed.2025.1715361
Edited by:
Sandor Kerpel-Fronius, Semmelweis University, HungaryReviewed by:
Noel Dybdal, Genentech Inc., United StatesCopyright © 2026 Rotter, Arroyo-Araujo, Drude, Pellegrini, Kobold, Richter, Müller, Bruder, Riecken, Gerlach, Schuler, Salbach-Hirsch, Dalloul, Kühn, Wilcke and Toelch. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sophia C. Rotter, c29waGlhLnJvdHRlckBjaGFyaXRlLmRl
Günther H. S. Richter5