- 1Department of Geography, Land Management and Cadaster, Al-Farabi Kazakh National University, Almaty, Kazakhstan
- 2Department of Science and Innovations, International Educational Corporation, Kazakh Leading Academy of Architecture and Civil Engineering, Almaty, Kazakhstan
- 3Department of Geography and Ecology, Abai Kazakh National Pedagogical University, Almaty, Kazakhstan
- 4Department of Social Studies Education, Gazi University, Ankara, Türkiye
- 5Department of Special Pedagogy, Abai Kazakh National Pedagogical University, Almaty, Kazakhstan
- 6Teacher-Researcher of Geography of the 8th Gymnasium named after Tole bi, Taraz, Kazakhstan
This systematic review synthesizes research on school-focused initiatives that integrate science, technology, engineering, and mathematics (STEM) with sustainability goals, published between 2019 and 2025. Searches of Scopus, Web of Science, and SpringerLink, along with reference checks, identified 49 studies. We coded approaches, topics, technology use, outcomes, and implementation features. Of these 19 studies, 42 empirical interventions were mapped by topic and subject, while seven conceptual or non-anchored pieces were excluded from topic counts but were used for informed interpretation. Publications accelerated after 2020 and clustered in North America and Southeast/East Asia. Climate dominated the topic distributions, followed by water and circularity; biodiversity and energy were at moderate levels, while smaller clusters addressed disaster, built environment, and justice/policy. Technology integration was most prevalent in water and circularity units, moderate in disaster and built environment, and comparatively limited in climate; energy and justice/policy showed minimal technology integration. Outcome synthesis indicated broad gains from project-based and inquiry-oriented designs and from context/place-based approaches; socio-scientific argumentation most consistently advanced agency and values; modeling and engineering design excelled on skills and, with coherence supports, also improved concepts. A synthesized framework addresses key implementation challenges—curriculum fit, teacher capacity, cognitive load, assessment alignment, and equity logistics. The review offers design-ready guidance for selecting approaches that match desired learning and participation outcomes.
1 Introduction
Across school systems, there is growing pressure to integrate disciplinary science, technology, engineering, and mathematics (STEM) learning with education for sustainable development (ESD), enabling youngsters to reason about—and act on—pressing socio-ecological issues. Over the past decade, classroom innovation has encompassed project-based learning (PjBL), inquiry-based learning (IBL), socio-scientific issues (SSI) and argumentation, modeling and systems thinking, engineering design, game-based learning, and context/place-based education. When these approaches are associated with authentic problems, students tend to demonstrate stronger conceptual understanding, motivation, and capacity to participate in solutions (Birney and McNamara, 2021; Cole et al., 2024; Ellington and Prado, 2024; Restaino et al., 2024).
Evidence suggests that approach matters for outcome. SSI and structured argumentation consistently strengthen scientific/chemical literacy and civic willingness to act—especially when designs include deliberation and action-taking phases such as “Ask–Find out–Act” or collaborative argument cycles (Zhu and He, 2022; Georgiou and Kyza, 2023). Modeling activities that scaffold connections and mechanisms improve students’ systems reasoning and the coherence of their explanations (Sezen-Barrie et al., 2023; Cole et al., 2024). Engineering design projects—e.g., salinity sensors and circular bioeconomy labs—develop problem-solving and conceptual application (Jefferson et al., 2020; McCance et al., 2021; Dung et al., 2023). Although game-based designs consistently increase attention and perceived relevance, they require careful scaffolding to yield cognitive gains (Kim et al., 2023; Besalti and Smith, 2024).
The “where” and “for whom” of learning are equally important. Place- and community-based designs that associate lessons to local infrastructures, partners, and audiences have demonstrated benefits for relevance, preparedness, and stewardship, provided equity barriers—such as transport, fees, and language—are addressed (Birney and McNamara, 2021; Spencer et al., 2022; Ellington and Prado, 2024). Culturally situated approaches, including Ethno-STEM and Indigenous partnerships, help learners see science as connected to community practices and identities (Spencer et al., 2022; Izzah et al., 2023). These moves align with calls to reframe science education around justice, power, and historical context rather than treat technologies and “solutions” as neutral (Gandolfi, 2024; Lowan-Trudeau and Fowler, 2025).
Delivering on this vision depends on effective teacher learning. Studies show that structured tools, e.g., content representation, (CoRe), paired with video-based reflection can make teachers’ pedagogical reasoning about sustainability explicit and improvable. In addition, co-teaching arrangements help integrate disciplinary perspectives around sustainability anchors (Forsler et al., 2024; Wang et al., 2025). Professional development that connects mathematics and science with citizenship through modeling, SSI, and inquiry shifts teacher beliefs and practices at scale (Maass et al., 2022).
Technology is a double-edged lever. Low-cost, “green” kits and simple calculators increase access while surfacing core concepts (Sharif et al., 2021; Nurramadhani et al., 2024). On the contrary, building information modeling (BIM)-linked simulations and energy games can enhance energy literacy and systems thinking, but they pose cognitive-load and integration challenges if not supported by stepwise scaffolds (Kim et al., 2023). Place-based data platforms and citizen science initiatives foster computational thinking and broaden pathways for participation (Birney and McNamara, 2021).
Despite growing momentum, the evidence base remains fragmented. Reporting varies across subjects, outcome domains, and measures; certain topics (e.g., justice/policy and built environment) are underrepresented; and risk-of-bias features (e.g., comparators, fidelity, and validated outcomes) are uneven. A consolidated map is needed to show who is studying what, with which approaches, employing which technologies, and to what effect—and to identify actionable remedies for recurring implementation challenges.
1.1 Aim and research questions
This systematic review synthesizes integrated STEM studies that explicitly target sustainability/ESD between 2019 and 2025 and answers:
1. How have publications and geographies evolved over time?
2. How are subject anchors distributed across STEM integration approaches?
3. Which sustainability topics are addressed, and how is technology employed within these studies?
4. For each approach, what cognitive, affective, skills, and agency/behavior outcomes are most consistently improved?
5. What thematic design patterns link approaches, sustainability topics, and outcome gains?
6. What challenges and remedies recur across implementations?
1.2 Contribution
We presented a trend and geography visuals; a subject × approach map; a sustainability topics × subject table showing the percent of studies using technology; an outcome-by-approach synthesis that clarifies “what works for what;” and an actionable challenge–remedy table. Together, these outputs offer a design-ready evidence map for curriculum developers, teacher educators, and policymakers planning integrated STEM for sustainability.
2 Methodology
2.1 Protocol and reporting
We followed the PRISMA 2020 reporting guidance (Page et al., 2021) for a systematic review of integrated STEM associated with sustainability/ESD, covering the period between January 1, 2019 and July 15, 2025. The PRISMA flow diagram (Figure 1) details identification, deduplication, screening, eligibility assessment, and inclusion, with full-text exclusions tallied by reason. The final database search was conducted on July 15, 2025.
2.2 Databases, time window, and search strategy
We searched Scopus, Web of Science Core Collection (WoS Core), and SpringerLink for records published between January 1, 2019 and July 15, 2025. Searches were limited to English-language journal articles and conference proceedings. We combined three concept blocks with AND:
1. Integration terms: “integrated STEM” OR STEM OR STEAM OR “engineering design” OR modeling OR inquiry OR PjBL OR IBL OR SSI OR argumentation OR “game-based” OR simulation OR “computational thinking”
2. Sustainability terms: sustainable OR ESD OR “climate change” OR energy OR water OR biodivers* OR “waste” OR circular OR “disaster” OR hazard* OR “built environment” OR “green building” OR justice OR policy*
3. Education/context terms: school OR K–12 OR elementary OR middle OR secondary OR high school OR teacher OR preservice OR pre-service OR undergraduate*
Database-specific, runnable Boolean strings (with field limits and filters) are provided in Supplementary material. Reference lists of included papers were also scanned to identify additional records.
2.3 Eligibility criteria
Studies were eligible if they:
• Focused on STEM or science education with an explicit sustainability/ESD aim (e.g., climate, energy, water, biodiversity, circular economy, disasters, built environment, and justice/policy).
• Were situated in formal or co-curricular school (primary/secondary) settings or early teacher education (preservice or introductory undergraduate courses) settings.
• Reported empirical data (quantitative, qualitative, or mixed). Conceptual and methods papers that directly operationalized ESD-linked STEM designs (e.g., transferable lab protocols and justice-oriented frameworks) were retained for thematic mapping but were excluded from risk-of-bias appraisal.
• Were published between 2019 and 2025 in English.
We excluded higher education studies focused strictly on advanced disciplinary training, papers with no identifiable ESD link, and commentaries lacking actionable design/measurement implications.
Population: school (primary/secondary) learners and preservice/introductory undergraduate teachers.
Intervention: integrated STEM designs explicitly linked to ESD.
Comparator: recorded when present; not required.
Primary outcome: cognitive, skills, and agency/behavior; secondary outcome: affective.
Study designs: quantitative, qualitative, or mixed methods empirical studies.
2.4 Screening and study selection
Search results were exported to a spreadsheet for de-duplication and screening. Titles/abstracts were screened against the criteria (n = 232 retained for full-text retrieval). We obtained 142 full texts; studies were excluded when they lacked an ESD focus, did not involve STEM integration or school/teacher/introductory UG contexts, or provided insufficient methodological detail. The final included set comprised 49 studies. Reasons and counts at each step are documented in Figure 1.
2.5 Data extraction
A structured codebook (piloted on a subset and refined iteratively) guided extraction into a master sheet (see Supplementary material for a snapshot of study characteristics). Variables captured:
• Bibliographic: authors, year, country/region.
• Subject focus (Environmental Science, Biology, Chemistry, Physics, and Mixed).
• STEM integration approach (PjBL, IBL, SSI/argumentation, modeling, engineering design, game-based, STEAM, context/place-based; “Lab-based” flagged as a delivery mode).
• Sustainability/ESD focus (climate, energy, water, biodiversity, waste/circular, disaster, built environment, and justice/policy).
• Technology use (any/none, type: sensors/probes, apps/web platforms, BIM/simulation/game, imaging/thermal, and data platforms).
• Education level (elementary, middle school, high school, preservice, and introductory undergraduate).
• Outcomes reported with instruments/indicators.
• Teacher professional development (PD) (yes/no, format).
• Design (quant/qual/mixed, quasi-experimental, case, R&D, and methods).
• Notes (contextual features and implementation details).
• Equity/culture: population, approach, partnership level, supports, outcomes, and justice orientation.
2.6 Coding rules
2.6.1 Approach and subject
When multiple approaches were used, we assigned a primary approach (based on central learning cycle and analysis focus) and marked secondaries for sensitivity checks. Subject focus was the dominant disciplinary anchor evidenced by tasks and assessments.
2.6.2 Outcomes
We grouped outcomes into four domains: cognitive (knowledge/tests and literacy scales), affective (attitudes, motivation, values, and nature-relatedness), skills (modeling, design/problem solving, data/CT, and argumentation), and agency/behavior (willingness to act, participation, and preparedness). Effects were coded positive (+) when authors reported statistically significant gains vs. baseline/comparison or convergent qualitative evidence of improvement; mixed/neutral otherwise. These outcome codes inform Table 1, which presents the synthesis by instructional approach in the Results section.

Table 1. Outcomes by instructional approach: number of studies reporting positive effects across four domains, with representative metrics.
2.6.3 Technology
We coded whether technology was used (yes/no) and identified the dominant type. The percentage of studies using technology within each sustainability topic is reported in Table 2 (sustainability topics × subject with % tech) in the Results section.
2.6.4 Cultural equity coding
We operationalized cultural equity to identify Ethno-STEM and Indigenous/culturally responsive designs and their supports. Each study received: (a) population focus (Indigenous, underserved/minoritized, rural/remote, and general); (b) an equity code (0–3) where 0 = no explicit equity/culture focus, 1 = contextual or access-broadening elements, 2 = culturally situated or place-based design and/or explicit access supports, 3 = justice-oriented/Indigenous frameworks or community-led work; (c) co-designed (Y/N) with communities/partners; (d) context supports (Y/N) such as multilingual materials, transport/fee waivers, or community audiences/benefit; and (e) equity outcomes reported (e.g., belonging/identity, participation/agency, and stewardship). Conceptual pieces were coded for orientation (e.g., decolonial/TribalCrit cues) but excluded from outcome tallies and equity-outcome denominators. For synthesis, we presented a descriptive summary: corpus-level counts (e.g., share of studies with equity code ≥2), cross-tabs by sustainability topic and approach, and illustrative examples of co-design and supports; study-level details appear in Supplementary material; and conceptual papers inform interpretation only.
2.7 Risk of bias (RoB) appraisal
We appraised study-level risk of bias using the Mixed Methods Appraisal Tool (MMAT 2018/2022), selecting the appropriate pathway for each empirical study (qualitative, quantitative non-randomized, or mixed methods). For each study, we answered the five MMAT criteria (Yes/No/Cannot Tell) using information already charted in our extraction sheet (Research Design, Sample/Participants, Intervention Components, Evaluation Metrics, and Challenges/Limitations). We then assigned a prespecified overall tier: Low (≥4 “Yes,” ≤1 “Cannot Tell,” 0 “No”); High (≥2 “No” or a critical flaw); or Some Concerns. We also recorded comparator presence (Y/N), analytic sample (N), and instrument reliability (Cronbach’s α for the primary outcome when reported). Conceptual/methods papers were not appraised with MMAT and are excluded from sensitivity summaries. The complete study-level RoB table appears in Supplementary material.
3 Results
To support transparency and reuse, a complete, paper-by-paper extraction is provided as Supplementary material. This file contains every coded field used in the analyses—authors, year, country/region, education level, subject focus, STEM integration approach, sustainability/ESD focus, intervention components, technology use, teacher professional development, outcomes, and notes. It is the authoritative source behind all figures and tables in the Results. Variable names match those referenced in the text; categorical bins (e.g., subject anchors; approach types and sustainability topics) follow the scheme defined in Methods. Where a study reported multiple foci (e.g., “Chemistry” and “Environmental Science”), all declared values were retained in the coding. For aggregate displays, these values were collapsed into the reported bins (“Chemistry,” “Environmental Science,” or “Mixed,” as specified).
3.1 RQ1: Publication trends and geographic coverage (2019–2025)
Annual output shows a clear post-2020 acceleration in integrated STEM–ESD research. Publications rise from 2019 (n = 1) to 2020 (n = 6) and 2021 (n = 8), remain steady in 2022 (n = 6), and then surge through 2023 (n = 11) to a peak in 2024 (n = 12) (see Figure 2). The apparent dip in 2025 (n = 5) almost certainly reflects a partial-year window rather than a true decline. Across 2019–2025, the corpus totals 49 papers, with a majority published recently (≈57% in 2023–2025; ≈69% in 2022–2025), indicating a fast-growing area.
Geographic coverage is broad but uneven as shown in Figure 3. The United States contributes the largest single share (13/49; ~27%), followed by Indonesia (7/49; ~14%). Roughly one in five records are not tied to a single country (e.g., theoretical papers, “not stated,” or regional labels such as Europe). The remaining studies form a long tail of single or double contributions across many settings—e.g., Republic of Korea (2), Vietnam (2), and one each from Canada, Mexico, Sweden, Finland, Qatar, Greece, China, Portugal, Norway, Malaysia, Spain, Türkiye, and the United Kingdom, along with a regional case in the Southeastern United States.
Implications. The recency surge justifies updating/expanding prior review windows. Regionally, the evidence base is strongest in North America and Southeast/East Asia, with scattered coverage in Europe and comparatively little from Africa and much of Latin America. Readers should keep these imbalances in mind when judging generalizability and when targeting future searches and partnerships.
3.2 RQ2: How are subject anchors distributed across STEM integration approaches?
Across the Corpus, Environmental Science is the modal anchor and carries the widest spread of integration approaches, which is clearly shown in Figure 4. Within this anchor, PjBL is most common, often integrated with IBL and modeling when students investigate local systems (e.g., water, biodiversity, and built environments) or carry out citizen science-style data work. Modeling is especially visible in hydrologic and ocean acidification units and in sustainability-themed systems tasks, whereas SSI/argumentation appears where decisions and trade-offs are central (waste, energy, and climate action). Engineering design is present but comparatively smaller within Environmental Science; when it appears, it typically targets devices or processes tied to water quality or resource use.
Chemistry anchors skew toward SSI and IBL, reflecting a strong emphasis on chemical literacy, decision-making, and safe/green laboratory practices; design work is less frequent here than in mixed or physics-linked contexts. Physics entries are fewer and tend to pair with lab-based PjBL and engineering design (e.g., measurement devices and energy/heat applications), sometimes co-anchored with Environmental Science. Biology appears primarily in place-based or inquiry-oriented designs connected to ecosystems, gardens, and biodiversity, with modeling used to surface systems relations. Finally, Mixed (interdisciplinary) anchors often adopt PjBL and context/place-based designs to integrate multiple disciplines around authentic briefs (e.g., restoration science and transport decarbonization), occasionally layering computational thinking or simulation elements; game-based and STEAM approaches appear more sparingly and are concentrated in motivation-oriented climate units and hazards/disaster education, respectively.
Overall, the stacked distribution shows a clear pattern: PjBL is the cross-cutting backbone across subjects; IBL and modeling cluster in Environmental Science and Mixed anchors; SSI concentrates in Chemistry and climate-oriented studies; engineering design is most visible where physics or mixed STEM problems call for tangible artifacts; and game-based/STEAM play targeted roles rather than serving as dominant integration modes.
3.3 RQ3: Which sustainability topics are addressed, and how is technology employed within these studies?
We mapped sustainability/ESD topics to subject anchors for the empirical intervention studies only. Seven studies were not included in Table 2 because they were conceptual/review papers (n = 6) or an intervention without an explicit ESD anchor (n = 1). Those studies remain in the corpus (see Supplementary material) and inform the thematic synthesis, but they are excluded from the topic-by-subject counts below.
Coverage and distribution. Among the 42 mapped studies, climate was the most frequent topic (n = 14), mainly in Environmental Science (9), with smaller shares in Mixed (3) and Physics (1). Water and waste/circularity followed (n = 6 each). Water-related units were anchored in Environmental Science (3), Chemistry (1), and Physics (2), while waste/circularity was concentrated in Chemistry (4) with minor entries in Environmental Science and Mixed. Biodiversity (n = 5) appeared mostly in Environmental Science (3), with single cases in Biology and Mixed. Energy (n = 5) was represented in Chemistry (2) and Mixed (3) but had no Environmental Science anchors. Smaller clusters included Disaster (n = 2), Built environment (n = 2), and Justice/Policy (n = 2), all within Environmental Science or Mixed domains.
Technology use patterns. Tech use is highest in Water and Waste/Circular (both 67%), consistent with instrumented measurement, fabrication, or data-platform work. Disaster and Built environment show moderate adoption (50%), typically mapping apps, field-data tools, or classroom modeling. Climate—despite being the largest topic—shows lower uptake (36%), reflecting many discussion/SSI or concept-focused inquiries. Biodiversity uses tech in 20% of cases, often for observation/identification rather than analysis. Notably, Energy and Justice/Policy show 0% tech use in this sample, indicating opportunities to incorporate lightweight tools (e.g., household energy monitors, scenario simulators, and civic data dashboards) to deepen analysis and move beyond discourse.
Takeaway. Topics that naturally invite measurement or making (e.g., water quality and circular materials) show highest technology integration; more discourse-heavy areas (energy policy and justice) underutilize technology, representing a clear design gap.
3.4 RQ4: For each approach, what cognitive, affective, skills, and agency/behavior outcomes are most consistently improved?
Table 1 collates, for each instructional approach, how often studies reported positive effects in four outcome domains and the typical metrics used. Three approaches emerge as reliable “all-rounders.” PjBL shows the broadest pattern of gains—cognitive 7/8 (~88%), affective 9/10 (90%), skills 5/5 (100%), and agency 4/4 (100%)—measured with knowledge/concept tests and post-quizzes; attitude, self-efficacy, and sustainability-awareness surveys; artifact/rubric reviews; and preparedness, responsibility, or action-intent indicators. IBL displays a similar profile in a smaller evidence set—cognitive 9/9, affective 6/6, skills 2/2, agency 1/1—using science/chemical-literacy tests, unit exams, Global Science Literacy Questionnaire (GSLQ)/Nature Relatedness (NR)/self-efficacy scales, the Inventive Problem Test, and Science and Engineering Practices (SEP) aligned performances. Context/place-based designs are likewise consistently positive—cognitive 4/4, affective 5/5, skills 3/3, agency 2/2—with knowledge tests, model-based tasks, competence interviews, relevance/NR/motivation scales, data/modeling skill checks, and preparedness or conservation-commitment measures.
Approaches that explicitly foreground civic reasoning tend to be strongest for agency and values while still delivering targeted literacy gains. SSI/Argumentation studies report cognitive 2/2, affective 3/3, skills 1/1, and agency 3/3, drawing on literacy tests, values/motivation questionnaires, argument-quality coding, and willingness/citizenship instruments. These results indicate that structured deliberation and decision-making around SSI reliably move dispositions and participation while consolidating key concepts.
Designs that center representational rigor or making tend to excel on skills, with complementary gains elsewhere when scaffolds are present. Modeling shows affective 3/3, skills 2/2, and agency 2/2, but cognitive 1/2—a split that aligns with the use of model-coherence/integration coding, NR/engagement surveys, systems-thinking rubrics, and conservation-commitment/action phases. The mixed cognitive signal suggests that explicit supports for connecting mechanisms and evidence are pivotal if modeling is to translate into test-score gains. Engineering design yields skills 2/2 and affective 3/3 with cognitive 2/2, assessed via design/problem-solving artifacts, attitude/awareness reflections, and post-quizzes or concept tests; agency measures were not reported in this bin.
Two smaller bins provide directional but tentative read-outs. Game-based work in this corpus chiefly demonstrates affective improvement (1/1 via ARCS coding) with no concurrent cognitive/skills/agency measures reported, implying that tighter curricular integration and outcome instrumentation are needed to evidence learning beyond motivation. STEAM shows cognitive 1/1, skills 1/1, and agency 1/1—all from hazards/preparedness contexts that used pre/post knowledge checks, expert validation of artifacts, and preparedness-choice tasks; no affective instruments were reported.
Taken together, the pattern in Table 1 suggests that if broad impact is the aim, PjBL, IBL, and context/place-based designs are the most dependable choices. When the priority is to cultivate agency and value commitments, SSI/Argumentation is a robust route. Modeling and engineering design are the most efficient levers for skills (systems reasoning; design/problem-solving) and can deliver cognition when coherence supports and concept assessments are built in. Finally, the small denominators in several bins warrant cautious interpretation; reporting the n with positive effect/total reporting alongside typical metrics is essential for transparent strength-of-evidence claims.
3.5 RQ5: What thematic design patterns link approaches, sustainability topics, and outcome gains?
Table 3 reveals a small number of dominant patterns that integrate instructional design with topic choice and outcomes. First, place-based & community partnerships are the modal cluster. Typically implemented via PjBL/IBL around water, built environments, or justice briefs, these designs consistently lift agency/behavior (participation and local action), alongside skills (data/communication) and affective relevance—a profile that mirrors the broad gains we saw for context/place-based approaches in Table 1.
Second, SSI with action-taking (often paired with explicit argumentation scaffolds) anchors climate, waste/incineration, and energy topics. These studies jointly deliver cognitive gains (e.g., literacy tests and GSLQ) while also increasing agency and values/emotions when deliberation culminates in tangible action (campaigns and decision briefs). This cluster is the clearest route to moving dispositions without sacrificing content.
Third, modeling & systems thinking concentrates on water, ocean acidification, and urban/bioeconomy contexts. Typical outcomes are stronger systems reasoning skills and improved conceptual understanding when coherence supports are present—aligning with the mixed cognitive but strong skills/affect pattern in Table 1.
Fourth, engineering design for ESD (e.g., salinometers and circular biorefinery labs) reliably develops design/problem-solving skills and posts complementary concept gains, especially in physics-linked or mixed anchors. In contrast, game-based/simulation entries chiefly raise motivation/affect (ARCS); cognitive effects appear when games are tightly scaffolded and integrated with assessment.
Two cross-cutting clusters operate as enablers. Data/citizen science & computational thinking—common in biodiversity and restoration—builds data/CT skills and stewardship/agency, dovetailing with the higher technology uptake observed in measurement-heavy topics (see Table 2). Teacher PCK/PD for SD (e.g., CoRe + video cycles and co-teaching) strengthens teaching practice and self-efficacy, a prerequisite for coherent integration across subjects and topics.
Finally, specialized clusters round out the map: equity/cultural relevance (Ethno-STEM and Indigenous partnerships) improves belonging and often agency; green chemistry & circular economy shows cognitive and lab-skills gains; gardens/biodiversity boosts knowledge and participation; climate literacy/critical thinking yields moderate cognitive gains and higher self-efficacy; futures thinking primarily moves values/willingness; critical/decolonial & justice reframes purposes and builds critical literacy; disaster/hazards programs improve knowledge and preparedness; and authentic industry briefs heighten relevance and argumentation when tied to a concrete “commission.”
Within the equity/cultural relevance entries, most designs were place-based or citizen science (e.g., gardens, restoration, and local hazards) and occasionally SSI where policy/justice was foregrounded. Co-design with communities and concrete access supports (e.g., transport and multilingual materials) were reported in only a few cases, and equity-specific outcomes (belonging/identity, participation/agency, and stewardship) were seldom directly measured. Where reported, gains concentrated in participation/stewardship (restoration programs), preparedness (co-produced wildfire curricula), and community participation in Indigenous partnerships. Study-level coding appear in Supplementary material.
Design takeaway. Choosing a cluster is tantamount to choosing the outcomes you most want to move: place-based and citizen-science designs for agency/skills; SSI + action for agency/values with solid cognition; modeling/engineering for skills with targeted concept gains; games for motivation (plus learning when scaffolded); and PD/equity clusters to make any of the above work with fidelity and inclusion. This synthesis sets up RQ6, where we examine recurrent implementation challenges and the remedies that make these patterns stick at scale.
3.6 RQ6: What challenges and remedies recur across implementations?
Recurring barriers cluster into three bands. System-level constraints include curriculum fit and standards alignment, equity and access (transport, costs, and language), and the socio-politics of energy/land use; effective remedies are co-produced, locally relevant units aligned explicitly to standards, plus place-based partnerships and multilingual logistics, and, where issues are politically charged, critical/decolonial framings that surface multiple perspectives. Capacity and infrastructure gaps span insufficient teacher PCK/PD, limited lab resources, and technology hurdles; here, structured PD cycles (e.g., CoRe + video and co-teaching), low-cost/green kits and simple calculators, and stepwise scaffolds for simulations/games are repeatedly successful. Design and assessment pitfalls involve misaligned assessment with PjBL/SSI/modeling, superficial modeling practice, weak argumentation, short durations that fail to influence behavior, field trips disconnected from deliverables, and prior-knowledge mismatches; recommended practices include rubric-based assessment of products/arguments/models, iterative systems modeling tied to real infrastructures, explicit claim–evidence–reasoning routines, multi-week action components, pre−/debriefed visits anchored to a concrete “commission,” and targeted ramp-up mini-lessons. Table 4 presents the full challenge–remedy map with author–year citations that underpin each recommendation and can be implemented directly in new STEM–ESD designs.

Table 4. Recurring challenges in integrated STEM–ESD implementation and illustrative remedies with supporting references.
4 Discussion
4.1 Summary of principal findings (RQ1–RQ6)
This review maps a rapidly expanding evidence base on integrated STEM for sustainability/ESD (2019–2025). Publication volume accelerates after 2020 and concentrates in North America and Southeast/East Asia, with fewer studies from Africa and Latin America (RQ1; Figures 2, 3). Subject–approach patterns show project-based learning as a cross-cutting backbone, inquiry-based learning and modeling clustered in Environmental Science/Mixed anchors, SSI/argumentation concentrated in Chemistry and climate contexts, and engineering design most visible when physics or mixed problems require tangible artifacts (RQ2; Figure 4).
Topic distributions (RQ3; Table 2) are led by climate (n = 14), followed by water (n = 6) and waste/circular (n = 6); biodiversity and energy each total n = 5, while disaster, built environment, and justice/policy are smaller clusters (n = 2 each). Technology use is highest in water and waste/circular (both 67%), moderate in disaster and built environment (50%), and lower in climate (36%); energy and justice/policy show 0% tech within this empirical set. Seven studies were not included in Table 2 because they were conceptual/review papers (n = 6) or an intervention without an explicit ESD anchor (n = 1); they still inform the thematic synthesis.
Outcome synthesis by approach (RQ4; Table 1) indicates three “all-rounders:” PjBL, IBL, and context/place-based designs show consistent gains across cognitive, affective, skills, and agency domains (with denominators varying by what each study measured). SSI/argumentation most reliably moves agency/values while also consolidating literacy. Modeling and engineering design excel on skills (systems reasoning; design/problem solving) and can yield cognitive gains when coherence supports and concept assessments are built in. Game-based and STEAM bins are smaller; game studies chiefly evidence affective benefits unless tightly scaffolded and assessed.
Thematic patterns (RQ5; Table 3) integrate place-based/community partnerships, SSI with action-taking, modeling/systems thinking, and engineering design with topic choices to produce predictable outcome profiles. Cross-cutting enablers include data/citizen science + CT, teacher PCK/PD for SD, and equity/cultural relevance moves (Ethno-STEM and Indigenous partnerships). Finally, a consolidated challenge–remedy map (RQ6; Table 4) highlights recurrent barriers—curricular fit, PD capacity, technology/cognitive-load issues, assessment misalignment, equity/logistics—and practical solutions that have been shown to work (e.g., co-produced units aligned to standards, CoRe + video PD cycles, stepwise scaffolds for simulations/games, rubric-based assessment of products/arguments/models, and pre−/de-briefed fieldwork tied to a concrete commission).
Equity-oriented or culturally situated designs comprised roughly a quarter of the empirical corpus, while co-design with communities and concrete access supports were comparatively uncommon; explicit measurement of belonging/identity or community agency was rare (see Supplementary material).
4.2 Design implications: choosing the right lever for the desired outcome
• Broad impact across domains (knowledge, affect, skills, and agency): Favor PjBL, IBL, or context/place-based designs (Table 1).
• Civic agency/values with solid cognition: Use SSI with explicit argumentation and action-taking culminating in public products or decisions (Table 4).
• Skills (systems reasoning; design/problem solving): Emphasize modeling (with coherence scaffolds) and engineering design (with concept-aligned assessment).
• Motivation/engagement boosts: Pair game-based elements with explicit concept/skill assessments and stepwise complexity to convert affect into learning.
4.3 Technology integration: close the design gap in energy and justice/policy
High tech-use in water and waste/circular suggests that topics inviting measurement or making naturally absorb sensors, data platforms, and fabrication (Table 2). By contrast, energy and justice/policy contexts underutilize technology. Practical routes to close this gap include:
• Energy: low-cost household energy monitors, simple data loggers, and scenario simulators embedded in PjBL/IBL; parametric BIM or spreadsheet-based audits with guided interpretation (coherence scaffolds).
• Justice/Policy: civic-data dashboards, participatory mapping (open GIS), and evidence assemblies that feed directly into SSI argumentation cycles and action briefs.
4.4 Equity, culture, and place
Place- and community-based designs consistently elevate relevance and participation, particularly when equity barriers (transport, fees, and language) are addressed up front through partnerships and multilingual materials. Culturally situated approaches (Ethno-STEM and Indigenous collaborations) help students see science as connected to identities and community practices; these designs frequently strengthen affect and agency and should be planned as integral—not peripheral—components. In our sample, however, co-design and explicit access supports were reported in only a minority of studies, and equity outcomes (belonging/identity, participation/agency, and stewardship) were seldom instrumented; details appear in the Supplementary material.
4.5 Teacher learning and implementation fidelity
Teacher capacity is the fulcrum for coherent integration. Evidence favors structured PD that surfaces pedagogical reasoning (e.g., CoRe + video reflection), supports co-teaching across disciplines, and provides vetted, standards aligned materials. District-level adoption is eased when units are co-produced with local stakeholders and when assessments are aligned to products, models, and arguments rather than solely to recall tests.
4.6 Reporting and measurement recommendations
To improve comparability and strength of evidence:
1. Report comparators, fidelity checks, and validated instruments; include agency/behavior measures where claims are made.
2. Make assessment rubrics for models/designs/arguments public; include sample artifacts.
3. Specify technology roles (purpose, scaffolds, and cognitive demands) and provide stepwise task designs for complex tools (e.g., simulations and games).
4. When multiple approaches are integrated, identify the primary learning cycle and the outcomes each approach is intended to move.
5. Report instrument reliability (e.g., Cronbach’s α) and flag null/contradictory findings; interpret low-reliability results with caution.
6. Differentiate technology roles—content delivery, process scaffolding, or transformative tasks—to clarify depth of integration (e.g., a TPACK-aligned description).
4.7 Limitations
This review was limited to English publications (2019–2025) indexed in the selected databases, and the 2025 data reflect a partial year. Geographic coverage was uneven, with concentration in a few regions. Instruments and outcomes were heterogeneous, denominators varied by what each study measured, and agency was not consistently assessed (e.g., in engineering-design bins). Conceptual/review papers and one non-ESD intervention were excluded from topic counts (Table 2) but informed the thematic synthesis; empirical risk-of-bias features varied across studies. Technology integration was coded broadly by presence and type; finer grained distinctions such as content delivery, scaffolding, or transformative use (cf. TPACK) were not applied. Similarly, many studies did not report measurement reliability (e.g., Cronbach’s α), limiting our ability to systematically account for null or low-reliability results. These issues constrain the depth of interpretation and highlight directions for future reviews.
4.8 Future research
Priorities include (a) longitudinal tracking of agency/behavior and transfer, (b) expansion to under-represented regions and multi-site designs with community co-design and explicit equity outcome measures, (c) robust comparators and fidelity reporting, (d) tighter integration of technology in energy and justice/policy contexts with cognitive-load-aware scaffolds, (e) alignment of assessment to products/arguments/models, and (f) studies that connect teacher PD models to student outcomes at scale.
5 Conclusion
This review provides a design-ready evidence map of integrated STEM for sustainability/ESD. We show (1) a surging publication trend with uneven geography; (2) stable subject × approach patterns with PjBL/IBL/context as dependable cores; (3) topic distributions and technology-use profiles that highlight measurement-friendly areas (water and circular) and under-instrumented ones (energy and justice/policy); (4) outcome profiles by approach indicating where to invest for cognition, skills, affect, or agency; (5) thematic design clusters that integrate place, SSI + action, modeling, and design into predictable gains; and (6) a challenge–remedy map that turns recurrent implementation problems into actionable practices.
For practitioners and policymakers, the guidance is straightforward: choose the lever that matches your desired outcomes, co-produce standards aligned units with community partners, scaffold technology use deliberately, and align assessment to the products and reasoning students actually create. For researchers, the agenda is to broaden contexts, strengthen study designs and measures—especially for agency—and document fidelity and costs so promising models scale with equity.
Finally, our topic mapping (Table 2) covers 42 empirical, ESD-anchored interventions; seven additional studies (conceptual/review or without explicit ESD anchoring) inform the synthesis but are excluded from those counts. Together, the tables and figures (Tables 1–4; Figures 1–4) provide a coherent scaffold for planning, implementing, and evaluating integrated STEM designs that help young people reason about—and act on—pressing socio-ecological issues.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
AmK: Writing – original draft, Supervision, Methodology, Conceptualization. SL: Data curation, Investigation, Writing – review & editing. BA: Validation, Formal analysis, Writing – review & editing. AT: Writing – review & editing, Data curation, Visualization. MN: Resources, Project administration, Validation, Writing – original draft. AsK: Investigation, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/feduc.2025.1697058/full#supplementary-material
References
Alcaraz-Dominguez, S., and Barajas, M. (2021). Conceiving socioscientific issues in STEM lessons from science education research and practice. Educ. Sci. 11:238. doi: 10.3390/educsci11050238
Azevedo, H., Soares-Silva, I., Fonseca, F., Alves, P., Silva, D., and Azevedo, M.-M. (2022). Impact of educational gardens and workshop activities on 8th-grade student’s perception and knowledge of plant biology. Educ. Sci. 12:619. doi: 10.3390/educsci12090619
Besalti, M., and Smith, G. G. (2024). High school students’ motivation to learn climate change science through educational computer games. Simul. Gam. 55, 527–551. doi: 10.1177/10468781241235754
Birney, L., and McNamara, D. M. (2021). Tackling problem-solving through the curriculum and community enterprise for environmental restoration project. J. Curric. Teach. 10:1. doi: 10.5430/jct.v10n3p1
Candan Helvaci, S., and Helvaci, İ. (2019). An interdisciplinary environmental education approach: determining the effects of E-STEM activity on environmental awareness. Univer. J. Educ. Res. 7, 337–346. doi: 10.13189/ujer.2019.070205
Celestino, T. (2023). High school sustainable and green chemistry: historical–epistemological and pedagogical considerations. Sustainable Chem. 4, 304–320. doi: 10.3390/suschem4030022
Cole, L. B., Priscilla, L., Zangori, L., Kania-Gosche, B., and Burken, J. (2024). Raising the green roof: enhancing youth water literacy through built environment education. Sustainability 16:4262. doi: 10.3390/su16104262
Cuong, T. V., Dung, N. V. V., Tinh, T. T., and Hau, N. H. (2025). Mathematical modelling for monitoring tree growth in school green initiatives in Vietnam. Int. J. Ecosyst. Ecol. Sci. 15:203. doi: 10.31407/ijees15.226
Dung, T. M., Nga, N. T., and Thanh, L. T. (2023). Interdisciplinary education in the context of protection of water resources: a case study in Vietnam. J. Educ. e-Learning Res. 10, 569–577. doi: 10.20448/jeelr.v10i3.4986
Ellington, A., and Prado, C. (2024). Connecting schools and communities: a look at place-based learning and equitable access in SF bay Area outdoor environmental education. Environ. Educ. Res. 30, 1327–1347. doi: 10.1080/13504622.2024.2321260
Fadly, D., Rahayu, S., Dasna, I. W., and Yahmin, Y. (2022). The effectiveness of a SOIE strategy using socio-scientific issues on students’ chemical literacy. Int. J. Instr. 15, 237–258. doi: 10.29333/iji.2022.15114a
Forsler, A., Nilsson, P., and Walan, S. (2024). Capturing and developing teachers’ pedagogical content knowledge in sustainable development using content representation and video-based reflection. Res. Sci. Educ. 54, 393–412. doi: 10.1007/s11165-023-10149-y
Gandolfi, H. E. (2024). (Re)considering nature of science education in the face of socio-scientific challenges and injustices: insights from a critical-decolonial perspective. Sci. & Educ. doi: 10.1007/s11191-024-00536-w
Georgiou, Y., and Kyza, E. A. (2023). Fostering chemistry students’ scientific literacy for responsible citizenship through socio-scientific inquiry-based learning (SSIBL). Sustainability 15:6442. doi: 10.3390/su15086442
Hakim, A., Efwinda, S., Damayanti, P., Syam, M., Mutmainah, O., and Norsaputra, A. (2023). Improving climate literacy of high school students through physics teaching materials based on STEM. J. Pendidik. IPA Indones. 12, 526–537. doi: 10.15294/jpii.v12i4.43188
Herodotou, C., Ismail, N., I Benavides Lahnstein, A., Aristeidou, M., Young, A. N., Johnson, R. F., et al. (2024). Young people in Inaturalist: a blended learning framework for biodiversity monitoring. Int. J. Sci. Educ. B Commun. Public Engagement 14, 129–156. doi: 10.1080/21548455.2023.2217472
Holley, D., and Park, S. (2020). Integration of science disciplinary Core ideas and environmental themes through constructivist teaching practices. EURASIA J. Math. Sci. Technol. Educ. 16, 1–17. doi: 10.29333/ejmste/115583
Izzah, S. N., Sudarmin, S., Wiyanto, W., and Wardani, S. (2023). Analysis of science concept mastery, creative thinking skills, and environmental attitudes after ethno-STEM learning implementation. Int. J. Instr. 16, 777–796. doi: 10.29333/iji.2023.16342a
Jefferson, M. T., Rutter, C., Fraine, K., Borges, G. V. B., de Souza Santos, G. M., Schoene, F. A. P., et al. (2020). Valorization of sour milk to form bioplastics: friend or foe? J. Chem. Educ. 97, 1073–1076. doi: 10.1021/acs.jchemed.9b00754
Kang, J., and Tolppanen, S. (2024). Exploring the role of science education as a catalyst for students’ willingness to take climate action. Int. J. Sci. Educ., 1–19. doi: 10.1080/09500693.2024.2393461
Kim, J.-H., and Kim, C.-J. (2020). Analysis of emotions of high school students participating in a school SSI club project related to climate change. Asia-Pac. Sci. Educ. 6, 70–96. doi: 10.1163/23641177-BJA00003
Kim, J. B., Oprean, D., Cole, L., and Zangori, L. (2023). Illumi’s world: a Mini-game development with parametric BIM-based simulations. Int. J. Archit. Comput. 21, 462–477. doi: 10.1177/14780771231180256
Kostøl, K. B., and Remmen, K. B. (2022). A qualitative study of teachers’ and students’ experiences with a context-based curriculum unit designed in collaboration with STEM professionals and science educators. Discip. Interdiscip. Sci. Educ. Res. 4:26. doi: 10.1186/s43031-022-00066-x
Littrell, M. K., Tayne, K., Okochi, C., Leckey, E., Gold, A. U., and Lynds, S. (2020). Student perspectives on climate change through place-based filmmaking. Environ. Educ. Res. 26, 594–610. doi: 10.1080/13504622.2020.1736516
Lowan-Trudeau, G., and Fowler, T. A. (2025). Exploring Sociocritical perspectives on renewable energy education in Alberta, Canada. Environ. Educ. Res. 31, 1532–1550. doi: 10.1080/13504622.2024.2446717
Maass, K., Sorge, S., Romero-Ariza, M., Hesse, A., and Straser, O. (2022). Promoting active citizenship in mathematics and science teaching. Int. J. Sci. Math. Educ. 20, 727–746. doi: 10.1007/s10763-021-10182-1
McCance, K. R., Suarez, A., McAlexander, S. L., Davis, G., Blanchard, M. R., and Venditti, R. A. (2021). Modeling a biorefinery: converting pineapple waste to bioproducts and biofuel. J. Chem. Educ. 98, 2047–2054. doi: 10.1021/acs.jchemed.1c00020
Mereli, A., Niki, E., Psycharis, S., Drinia, H., Antonarakou, A., Mereli, M., et al. (2023). Education of students from Greek schools regarding natural disasters through STEAM. EURASIA J. Math. Sci. Technol. Educ. 19:em2314. doi: 10.29333/ejmste/13437
Nida, S., Marsuki, M. F., and Eilks, I. (2021). Palm-oil-based biodiesel in Indonesia: a case study on a Socioscientific issue that engages students to learn chemistry and its impact on society. J. Chem. Educ. 98, 2536–2548. doi: 10.1021/acs.jchemed.1c00244
Nurramadhani, A., Riandi, R., Permanasari, A., and Suwarma, I. R. (2024). Low-carbon food consumption for solving climate change mitigation: literature review with bibliometric and simple calculation application for cultivating sustainability consciousness in facing sustainable development goals (SDGs). Indones. J. Sci. Technol. 9, 261–286. doi: 10.17509/ijost.v9i1.67302
Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., et al. (2021). The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372:n71. doi: 10.1136/bmj.n71
Park, W.-Y., and Kim, C.-J. (2020). The impact of project activities on the cultivation of ecological citizenship in a high school climate change Club. Asia Pac. Sci. Educ. 6, 41–69. doi: 10.1163/23641177-BJA00005
Pramasdyahsari, A. S., Rubowo, M. R., Nindita, V., Astutik, I. D., Pant, B. P., Dahal, N., et al. (2024). Developing engaging STEAM-geometry activities: fostering mathematical creativity through the engineering design process using Indonesian cuisine context. Infinity J. 14, 213–234. doi: 10.22460/infinity.v14i1.p213-234
Ramirez-Gonzalez, I. A., Añel, J. A., and Cid Samamed, A. (2020). Ozone measurement practice in the laboratory using Schönbein’s method. Geosci. Comm. 3, 99–108. doi: 10.5194/gc-3-99-2020
Rasa, T., Lavonen, J., and Laherto, A. (2024). Agency and transformative potential of Technology in Students’ images of the future: futures thinking as critical scientific literacy. Sci. & Educ. 33, 1145–1169. doi: 10.1007/s11191-023-00432-9
Restaino, C., Eusden, S., and Kay, M. (2024). Taking the next step in wildfire education: integrating multiple knowledge forms into co-produced high school fire science curricula. Fire Ecol. 20:62. doi: 10.1186/s42408-024-00296-6
Sezen-Barrie, A., Stapleton, M. K., Marbach-Ad, G., and Miller-Rushing, A. (2023). Epistemic discourses and conceptual coherence in students’ explanatory models: the case of ocean acidification and its impacts on oysters. Educ. Sci. 13:496. doi: 10.3390/educsci13050496
Sharif, A. M., Ong, S. W., Rahim, N. A., Lee, T. T., Mustafar, S., Rosmi, M. S., et al. (2021). A green electrochemistry experimental kit: Student’s achievements on Lectrofun 2.0. Jurnal Pendidikan IPA Indonesia 10, 24–34. doi: 10.15294/jpii.v10i1.25475
Sigit, D. V., Arrumaisha, T., and Ristanto, R. H. (2024). Enhancing critical thinking: integrating guided discovery learning and concept mapping (GDL-CM) in climate change concept. J. Pendidik. IPA Indones. 13. doi: 10.15294/f3t8yf33
Spencer, J. L., Maxwell, D. N., Erickson, K. R. S., Wall, D., Nicholas-Figueroa, L., Pratt, K. A., et al. (2022). Cultural relevance in chemistry education: snow chemistry and the Iñupiaq community. J. Chem. Educ. 99, 363–372. doi: 10.1021/acs.jchemed.1c00480
Wang, H.-H., Knobloch, N. A., Nelson, B. J., and Thies, S. L. J. (2025). Exploring a co-teaching model to teach energy and food systems in STEM integration through the Lens of pedagogical content knowledge. Educ. Sci. 15:318. doi: 10.3390/educsci15030318
Keywords: project-based pedagogy, socio-scientific reasoning, systems modeling, learner agency, computational thinking, assessment alignment
Citation: Kopbossyn A, Laiskhanov S, Aksoy B, Tokbergenova A, Nametkulov M and Kozybakova A (2025) Integrated STEM for sustainability in school and early teacher education: a systematic review (2019–2025). Front. Educ. 10:1697058. doi: 10.3389/feduc.2025.1697058
Edited by:
Konstantinos T. Kotsis, University of Ioannina, GreeceReviewed by:
Leonidas Gavrilas, University of Ioannina, GreeceDimitris Panagou, University Research Centre of Ioannina (URCI), Greece
Copyright © 2025 Kopbossyn, Laiskhanov, Aksoy, Tokbergenova, Nametkulov and Kozybakova. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Amandyk Kopbossyn, a29wYm9zc3luX2FtYW5keWsyQGthem51LmVkdS5reg==