Large scale implementation of effective early literacy instruction

Introduction: Large-scale implementation of Multi-Tiered Systems of Support (MTSS) to enhance children’s early literacy success is critically important to address global literacy challenges. This paper describes one such initiative, the Better Start Literacy Approach (BSLA), which was speciﬁcally designed for large-scale implementation in New Zealand. Methods: Between February 2020 and May 2023 over 3,000 teachers in 819 schools across New Zealand implemented BSLA, with baseline data available for 29,795 5-year-old children. Teachers implemented novel online assessments to monitor children’s early literacy growth. Results: In comparison to an internal control group, accelerated progress in children’s phonic, and phoneme awareness skills was evident after 10 weeks of BSLA Tier 1 (universal) teaching. After 30 weeks, there were signiﬁcant gains in word reading, spelling, listening comprehension and oral narrative abilities; growth did not differ based on gender, ethnicity, or socioeconomic status. Children with greater learning needs who received BSLA Tier 2 (small group targeted teaching) caught up to their peers in word reading and spelling skills. Conclusions: With appropriate resourcing, planning and engagement with communities, successful large-scale implementation of evidenced-based early literacy approaches is possible within a relatively short time. Culturally responsive implementation within MTSS frameworks holds much promise for raising literacy achievement for all children.


Introduction
The World Health Organization (WHO) has highlighted the critical importance of literacy to reducing current global health inequities.It has called upon governments worldwide to increase investments in building their population's comprehensive literacy skills, particularly within diverse communities (WHO, 2020).This call to action is reinforced by the United Nations Sustainable Development Goals for 2030.Ensuring equitable and quality education for all is one of 17 goals agreed to by contributing partner countries to ensure the prosperity of our people and our planet (United Nations, 2015).Improved literacy, education and health outcomes in a population begins with early literacy success.Strong foundational linguistic and early reading skills underpin later reading abilities (Cunningham and Stanovich, 1997;Tunmer et al., 2006;Sparks et al., 2014;Zubrick et al., 2015;Lepola et al., 2016;Su et al., 2017;Peng et al., 2019).In turn, reading competency supports the development of comprehensive literacy skills, such as the ability to evaluate, critique, and analyze written information in its many forms.Those children with strong early reading skills are more likely to experience cumulative advantages compared to children who struggle with early reading (Juel, 1988;Stanovich, 2009;Ritchie and Bates, 2013).
Despite our advanced knowledge of facilitators of children's early literacy success, there remains wide variability in children's literacy achievement.Only modest gains (if any) are being reported in reducing educational inequities for indigenous populations, children raised in poverty or children with learning challenges (Ferrer et al., 2015;Schulte et al., 2016;Stone et al., 2017;Hanushek et al., 2022).For some children with lower level literacy skills the achievement gap compared to their peers over time may be widening (D'Agostino and Rodgers, 2017).In addition, the COVID-19 world pandemic has led to severe disruptions to children's learning (Relyea et al., 2023) having a profound adverse effect for some of our diverse communities.Urgent collective effort is therefore required to ensure all children develop the necessary foundational literacy skills that support comprehensive literacy abilities to achieve our global literacy aspirations.
Understanding pathways that lead to the successful wide-scale implementation of research informed literacy teaching is critically important.Insights gained from how differing countries or regions address such issues (e.g., Piper et al., 2018;Ramacciotti et al., 2023) may inform practices internationally to more effectively address our global literacy challenges.This paper presents a case study of an early literacy initiative introduced by the New Zealand Government to reduce persistent education and health inequities.The initiative aims to advance strong foundation literacy skills for all learners with a particular focus on supporting young Māori (New Zealand's indigenous population) and children from Pacific Peoples ethnic group for whom disparity in education outcomes is greatest (May et al., 2019).Factors that enabled an early literacy approach referred to as the Better Start Literacy Approach (BSLA) to extend from development phase, through controlled research trials (Gillon et al., 2019(Gillon et al., , 2020(Gillon et al., , 2023a,b) ,b) to national implementation are described.

Multi-Tiered Systems of Support
Multi-Tiered Systems of Support (MTSS) are advocated to ensure all children succeed in their learning.MTSS replace Response to Intervention frameworks (RTIs) through taking a more comprehensive and systems wide perspective to remove barriers to students' learning (Averill et al., 2011;Pullen et al., 2018;Freudenthal et al., 2023).MTSS may be used to support the academic, social, emotional or behavior development of children and youth (Averill et al., 2011).In relation to children's reading development, MTSS are based on the "science of reading"cumulated knowledge gained over many years through robust scientific methods related to effective reading instruction and reading interventions.MTSS are specifically designed to alleviate the adverse behavioral, education and health outcomes frequently associated with reading failure to focus instead on ensuring each child experiences reading success (Gonzalez et al., 2022).The multiple tiers of support within such frameworks begin with research-based reading instruction for all children (or Tier 1 quality universal reading instruction).Children who don't respond as well to this first Tier of reading instruction (identified through valid and reliable assessment methods) are provided with supplementary tiers of support such as Tier 2 support (typically involving small group targeted instruction) or Tier 3 support (individualized support to meet a specific learner's literacy needs).
Research evidence supports the effectiveness of MTSS frameworks in developing children's early literacy skills (for review, see Fien et al., 2021).Overall, the evidence shows Tier 1 teaching approaches that provide explicit and systematic instruction in phonics, phoneme awareness, and fluency are effective at promoting early literacy development (Fuchs and Deshler, 2007;Al Otaiba et al., 2019;Petscher et al., 2020).Relatively few studies have focused on the impact of structured and systematic teaching in vocabulary and oral language comprehension within Tier 1 teaching, although these are also considered important elements of quality class-wide literacy teaching (Petscher et al., 2020).Wanzek et al. (2016) conducted a meta-analysis to evaluate outcomes for children's foundational literacy skills from a pool of 72 experimental or quasi-experimental studies that investigated the effectiveness of Tier 2 type interventions for children in kindergarten to grade 3. Their analyses indicated moderate positive effects of Tier 2 type interventions on foundational reading skills (e.g., phonological awareness and recognition skills) as evidenced by improvements on both standardized and non-standardized assessment tasks.Small to moderate effects were found for children's language comprehension on standardized measures with large effect sizes reported for non-standardized language measures.There was no difference in outcomes between individual or small group interventions (e.g., up to five children in a group) nor in length of session (e.g., 30 min compared to 1-h sessions).The type of professional implementing the intervention (e.g., researcher, class teacher, special educator) also did not influence outcomes (Wanzek et al., 2016).These analyses suggest that large scale implementation of Tier 2 teaching implemented in relatively cost-effective ways in regular education settings is feasible.Tier 2 may be particularly effective when it is in addition to quality Tier 1 teaching (rather than replacing it) and when it is implemented early in children's first year at school (Al Otaiba et al., 2014).
Despite the evidence-base documenting positive effects of literacy teaching within MTSS (and Response to Intervention) frameworks, the majority of published research in this area falls into the categories of exploration, development, and initial efficacy studies (Solari et al., 2020).There is a need to build toward clinical implementation studies (evaluation of effective interventions within real world settings) and public health studies (i.e., evaluation of population/widespread outcomes of effective interventions) (Fuchs and Fuchs, 2017;Solari et al., 2020).This current paper advances knowledge of MTSS through examining data collected from both Tier 1 and Tier 2 literacy teaching in a large-scale implementation of a comprehensive early literacy approach within the New Zealand education system.

Assessment within MTSS
A key feature of MTSS for early literacy instruction is the regular monitoring of children's progress in foundational literacy skills.The aims of these assessments are to (a) monitor children's progress in response to evidenced based literacy teaching in order to determine next steps for teaching and (b) identify those children who require additional, instructional support (either Tier 2 or 3 of the MTSS).MTSS are potentially useful for the early identification of children with word reading problems such as dyslexia.Carefully monitoring children's response to literacy instruction based on the science of reading in their first year at school may be more helpful than many dyslexia screening assessments given the psychometrics limitations of such screening assessments (Catts and Hogan, 2020).
To be effective, however, monitoring assessments within MTSS need to be both rigorous and efficient (Petersen and Stoddard, 2018).Most studies that include progress monitoring assessment data within MTSS literacy frameworks focus on reliable measures related to word recognition (e.g., phonic, phonological awareness, word reading fluency).Freudenthal et al. (2023) summarized 22 MTSS studies between 2017 and 2020.These studies included a range of researcher and teacher-implemented interventions and measures.However, none of these studies included a monitoring measure of children's oral language or listening comprehension.Given that both word recognition and language comprehension are important for skilled reading comprehension (Hoover and Tunmer, 2018), understanding valid and reliable language monitoring assessments for class teachers to use is important as we advance our understanding of the benefits of MTSS frameworks.
This current study extends previous research in oral narrative assessments and teaching within classroom contexts (Israelsen-Augenstein et al., 2022;Petersen et al., 2022;Gillam et al., 2023).It also extends recent research in the potential benefits of digital technologies and artificial intelligence to improve efficiency in literacy monitoring assessments (Adlof and Hogan, 2019;Fox et al., 2022;Bright et al., 2023) We examine data collected through a large scale implementation of teachers' administrating a series of online tasks to 5-6 year old children as part of their regular teaching practices within a MTSS framework.The tasks included assessment of children's oral narrative (story retell), listening comprehension, phonic, phoneme awareness, and non-word reading and spelling.

Change processes
It is widely recognized that there are significant barriers to achieving sustained change in teachers' literacy assessment and teaching practices (Le Fevre, 2014).To acknowledge and overcome such barriers the implementation of large-scale literacy initiatives requires well-planned change processes at multiple levels (Solari et al., 2020).Understanding the systems changes necessary and effective methods to support the uptake of evidenced based literacy teaching practices into routine class teaching is examined within the fields of Implementation Science and Translational Science.Emerging research and discussions in translational science related to reading highlight the need for a strong communication strategy, engagement with community and a team of researchers with the skills and dispositions necessary to facilitate the successful translation of research evidence into literacy teaching practice (Solari et al., 2020).A partnership approach with school leaders, researchers, policy makers, community leaders, cultural and indigenous leaders all working together in co-constructed processes to support change is critically important (Macfarlane and Macfarlane, 2019).Provision of quality professional learning and development for teachers, ensuring teaching fidelity is aligned to evidence-based practices, valuing and celebrating teachers' success in uplifting their children's literacy achievement, adequate resourcing and time for teachers to enhance their assessment and teaching strategies all need to be addressed.Flexibility within literacy approaches to ensure literacy practices meet the needs of diverse learners and acknowledge local cultural contexts are also important to consider (Macfarlane et al., 2016;Hoover and Soltero-González, 2018).This paper describes the development of an early literacy approach specifically designed for large scale implementation based on these principles of facilitating research evidence into changing or enhancing teacher's current literacy teaching practices.

Study aims
Specifically, the aims of this paper are to:

Methods
The data collection method and process reported were approved by the University of Canterbury's Human Ethics Committee (application 2021/06/LRPS).The approval allowed the researchers to analyze de-identified assessment data routinely collected within the national implementation of BSLA for secondary analyses to evaluate the impact of the Better Start Literacy Approach.The committee determined this was low risk and individual consent for participation was not required.However, as children entered BSLA, parents were provided with an option to exclude their child's anonymized data from the database used for any secondary analyses.

New Zealand education context
New Zealand is a Commonwealth country of ∼5 million people with 86% of the population living in or near a major city.Māori, around 15% of the population, are the indigenous peoples of New Zealand.European settlers exceeded the number of Māori in the nineteenth century and in the 2018 census 74% of the population identified as being of European decent, 11% identified as Asian and 8% from the Pacific Peoples islands.English is the main language spoken but both Te reo Māori and NZ Sign Language are official languages of the country.
Education is free in New Zealand for children aged 5-19 with it being compulsory for children aged 6-16 years to attend school.Prior to school entry 95% of children will have attended some form of early childhood education.The Government currently provides up to 20 h a week of free early childhood education for children aged 2-5 years.The ECE sector's curriculum framework is Te Whāriki (Ministry of Education, 2017).This promotes a holistic, socio-cultural perspective in learning and teaching across across five learning strands (wellbeing, contribution, belonging, communication, and exploration).Teachers use a learning progressions framework to monitor children's growth across these strands.There are currently no required formal assessment measures of emerging literacy skills during early childhood or as children transition to school.
Children often start school on or close to their 5th birthday and formal literacy instruction typically begins from the outset of their attendance at school.In relation to literacy instruction, teachers working in English medium schools (96% of all primary schools) implement the New Zealand English curriculum (Ministry of Education, 2007).This provides a broad framework and learning progression indicators to monitor children's oral and written language development.Schools are self-governing and although the New Zealand Ministry of Education provides Government funded schools with several quality literacy resources, schools and teachers can make their own decisions about the types of literacy approaches and interventions they implement.Reading Recovery has been the main government funded approach to support young struggling readers (Clay, 1990).However, given evidenced informed criticism of both the theoretical underpinning of the approach and longer term reading outcomes of children who have received Reading Recovery (Tunmer et al., 2013;Nicholas and Parkhill, 2014;Chapman and Tunmer, 2016;D'Agostino et al., 2017;May et al., 2023), there has been a steady decline in the implementation of Reading Recovery in NZ schools since 2012.Only 41% of eligible schools implemented Reading Recovery in 2021 down from 51% in 2019.Outside of Reading Recovery there is considerable variability in literacy assessments and interventions used in New Zealand for struggling readers (Arrow et al., 2022).

New Zealand's literacy rankings
Data from the 2021 Progress in International Literacy Study (PIRLS) shows that in 4th grade the average reading achievement score for the New Zealand participating children was significantly above the center point of the PIRLS scale (CenterPoint = 500) (Mullis et al., 2023).New Zealand's average scaled reading score was 521 (Std.error 2.3) which resulted in a rank of 27th out of 57 participating countries.Since 2006, there has been a consistent and small decline in children's reading achievement in New Zealand (with a drop from a scaled score of 532 in 2006).Despite performing relatively well on average, the wide variability in students' reading achievement performance is an area of persistent and ongoing concern.In both the 2016 and 2021 PIRLS, variability between New Zealand's high and low achieving readers by 4th grade is larger than any other country scoring above the PIRLS Centrepoint on reading achievement (Mullis et al., 2017(Mullis et al., , 2023)).Like most countries, girls performed significantly higher in 2021 (scaled score of 531) than boys (scaled score of 512) with the average difference of 19 scale score (Mullis et al., 2023).
In recognizing the persistent inequities in education outcomes in New Zealand, the Government launched a Literacy and Communication and Math Strategy (Ministry of Education, 2022) to uplift foundational skills for all children.As part of an early literacy initiative aligned with this strategy the Government has funded the implementation of the Better Start Literacy Approach (BSLA) (Gillon et al., 2023a).From 2020 to May 2023 over 3,000 junior schoolteachers and literacy specialists have been trained in the BSLA in 819 schools The next section describes the development of BSLA and its pathway from controlled trials to this large scale national implementation.

Development phase
The BSLA emerged from the first phase of a 10-year Government funded programme of research focused on child and youth wellbeing.This program of research is referred to as the-The Better Start National Science Challenge (Maessen et al., 2023).It involved a multidisciplinary team of research leaders across New Zealand engaging with community to co-construct research that would lead to sustained change and positively impact communities.Following extensive community consultation, three key areas for research were identified: successful learning, healthy weight, and youth resilience.As part of the successful learning theme, a 5year programme of research was funded to uplift children's early literacy success.
Key aspects of this research planning involved engaging with community and co-constructing evidenced based literacy teaching instruction for children's first year at school that could be implemented at scale across school communities.A MTSS framework with strengths-based and culturally responsive literacy teaching practices was proposed based on research findings and community feedback.

Cultural framework
A He Awa Whiria or "braided rivers" framework (Gillon and Macfarlane, 2017) was used as a framework to consider the braiding of research evidence from the scientific literature (inclusive of research findings involving diverse populations, and differing research methodologies) together with Māori knowledge and evidence indicating facilitators of education success for Māori and Pacific Peoples (e.g., Berryman and Eley, 2017;Webber and Macfarlane, 2020;Alansari et al., 2022).Our He Awa Whiria framework for literacy acknowledges the importance of research related to necessary cognitive skills to literacy learning, but also acknowledges the influence of ecological, and psychological factors in children's reading development (Aaron et al., 2008) (see Figure 1).The braided river metaphor conceptualizes that just as the streams of a braided river may sometimes flow apart so too does indigenous knowledge and domain specific research knowledge have its own integrity and importance.However, when streams of water braid together and form a larger body of water, the river gains force and strength.So too through the braiding of differing knowledge streams the potential to ensure all children experience early literacy success gains momentum.The schema used within BSLA to support teachers in developing culturally responsive literacy teaching practices within the He Awa Whiria framework is The Hikairo Schema for Primary Teachers (Ratima et al., 2020).Three core principles within this schema include: 1. Relevance: Teachers provide learning experiences that are relevant for children's cultural and personal identities; 2. Balance of power: Mutual care, trust and respect between teachers and learners is fostered through the learning experiences; and 3. Scaffolding: Teachers scaffold learning and provide necessary supports to ensure successful learning outcomes are in the grasp of all learners.

Key features of BSLA content
There are several key features of BSLA teaching content that stem from the He Awa Whiria framework (Gillon et al., 2024).These are summarized in the following section.

Cognitive stream of knowledge
The structured literacy approach to teaching within BSLA is aligned to the Simple View of Reading (see Hoover and Tunmer, 2018).Teachers use explicit teaching techniques to develop children's skills that support both their word recognition and their language comprehension skills.Consistent with MTSS frameworks, structured daily 30-min lesson plans are provided for Tier 1 class teaching that develop children's skills in three areas: 1. Vocabulary, oral narrative and listening comprehension through shared book reading activities.2. Phonic, phoneme awareness and morphological awareness activities following our BSLA phonic scope and sequence (see Supplementary material: in the first 10 weeks of teaching, new phonic patterns are introduced at a rate of between 2 and 4 new patterns per week with flexibility for teachers to revise target phonic patterns as suitable to their learners).3. Activities to support children to transfer their developing phonic and phoneme awareness skills to the reading and writing process.
In addition to these 30 min class lessons, children also participate in a 10-15 min small group instructional reading sessions using a newly developed decodable text reading series that the BSLA leaders were involved in developing called Ready to Read Phonics Plus (RRPP) (see Arrow et al., 2021 for details of reading series).The phonic scope and sequence for BSLA and the RRPP reading series are aligned.This alignment aims to maximize children's exposure to target phonic and phonological patterns from class activities to small group reading with connected text.
For children identified as requiring Tier 2 support teachers implement 40 small group lesson plans (usually four 30-min session over a 10 week period).These lessons are focused on strengthening children's phonic, phoneme awareness, word reading and spelling skills.This support is typically supplementary to children participating in Tier 1 class activities.There is strong alignment between Tier 1 and Tier 2 teaching activities to maximize teaching intensity for learners who require additional support.
With the support of literacy specialists or speech-language therapists, teachers adapt BSLA assessments and Tier 1 and Tier 2 activities for children with complex communication needs who require Tier 3 level support (Clendon et al., 2022).This support is provided at an individual level and often involves teacher assistants who are supporting these learners.
Important teaching elements within BSLA include the following: • BSLA teaching time is focused on phonological awareness skills at the phoneme level (identifying, blending, segmenting, and manipulating phonemes).Syllable and rhyme awareness are not teaching targets within the 30-min daily lessons.This is based on evidence that phoneme skills are more strongly associated with early reading and spelling (Hulme et al., 2002) and that teaching to improve children's syllable and rhyme awareness does not necessarily transfer to reading improvements (Nancollis et al., 2005).• Based on evidence from earlier experimental studies (e.g., Hatcher et al., 1994;Gillon, 2000) many of the BSLA teaching activities integrate phoneme-grapheme knowledge phoneme awareness and the reading and spelling of real words concurrently to maximize teaching efficiency.• The pace of teaching through the phonic scope and sequence is aligned to the Self Teaching Hypothesis of learning to read (Share, 1995).That is, the explicit teaching of phonic patterns and decodable text within the readers is not too tightly constrained to slow learning down.Rather, it allows children to discover a limited number of untaught orthographic patterns alongside the explicitly taught patterns, particularly after the first few weeks of BSLA teaching.

Ecological stream of knowledge
A key aspect of BSLA is supporting teachers to implement BSLA in culturally responsive ways.This involves supporting teachers to develop their own personal cultural competence (Ratima et al., 2020) as well as encouraging them to relate the stories and activities in culturally relevant ways to their learners.In addition, content for family workshops, family reports and weekly updates about BSLA teaching are provided as a guide for teachers to engage families in their children's literacy learning.Typically teachers followed these guides to offer two sessions for children's parents and family members during the first 10 week period of implementing BSLA.

Psychological stream of knowledge
BSLA is developed around strengths-based principles (Gillon et al., 2023a).Teachers are encouraged to focus on what children can achieve and identify their next steps for learning.The assessment data are used and presented in positive ways to focus on students' growth and celebrate their learning and effort.Each child is expected to succeed.The use of children's assessment data to inform teaching along with scaffolding and explicit teaching techniques are promoted to ensure all children experience success in their learning.The consistency and routine embedded within the lesson plans help support children's confidence in engaging in the activities and reduces cognitive load to support learning success.

Professional learning and development (PLD) for teachers
Through an iterative process involving consultation and feedback from teachers and literacy specialists, we developed a series of online BSLA micro-credentials for class teachers, literacy specialists (e.g., teachers with advanced qualifications in children's literacy development and speech-language therapists) and teacher aides (Scott et al. submitted ).Micro-credentials provide teachers with flexibility to engage in PLD in specific learning areas and to gain recognition for their learning (DeMonte, 2017).The online content was available through the University's student learning management system and provided a high-quality online learning experience.The content included theoretical content related to the science of reading (modified for teacher aides), activities to develop professionals' personal linguistic knowledge, access to the Tier 1 and Tier 2 daily lesson plans, a variety of video demonstrations of teaching lesson with commentary around effective teaching strategies, training in the assessment tasks and data interpretation (modified for teacher aides), content related to developing culturally responsive teaching practices, access to materials to support family engagement and access to adapted teaching and assessment content to support children with complex communication challenges.Each professional enrolling in the BSLA micro credential was required to successfully complete online assessment tasks and online quizzes to gain their microcredential.In addition, class teachers were required to complete the teaching of at least 10 weeks (or 40 lesson plans) of BSLA.Literacy specialists and teacher aides were required to provide support to class teachers during these 10 weeks of teaching.Teachers had continuous access to this online learning content (even after they had completed their micro-credential) to enable them to revise content, review lesson demonstrations or engage more deeply in learning content as they progressed with BSLA implementation.
As well as online independent learning teachers engaged weekly in live (or recorded) Zoom sessions with the BSLA leadership team for the first 10 weeks of teaching BSLA.As part of the BSLA microcredential for literacy specialists, participants also attended a 2-day live workshop with the BSLA co-developers and leadership team.This workshop focused on their role as mentors and coaches for class teachers as well as more in depth training in key areas such as BSLA assessments and data interpretation.

Online assessments designed for large scale implementation
A bespoke assessment website was developed to accommodate the online BSLA assessment tasks (Scott et al., 2022;Gillon et al., 2023b).Each teacher has login access to their own class data held within the assessment website.An assessment dashboard provided teachers with details of each child's assessment task performance and alerted teachers to when the next round of monitoring assessments for a particular child was due and what assessment tasks required completing.The online assessment tasks that children completed via iPad or computer ensured consistency of task presentation, reduced teacher workload, and animated characters engaged young learners in the tasks.Scott, A., Gillon, G., McNeill, B., and Gath, M. (submitted).Facilitators of success of a microcredential model for professional learning and development to enhance teachers' literacy practices.
Frontiers in Education frontiersin.org Automatic scoring and automated reporting, features supported large scale implementation.

Research trials to inform large scale implementation
Following the design and development of BSLA, the approach was first trialed in new entrant and Year 1 classrooms (5-to 6-year-old children) from seven schools in areas of high socialeconomic deprivation.Data from this quasi-experimental research trial suggested that BSLA was more effective in accelerating children's foundational literacy skills than the classroom literacy approaches teachers were currently implementing (Gillon et al., 2019(Gillon et al., , 2020)).The Ministry of Education funded a replication of this research trial.The replication involved 14 schools (seven of which were in a different regional area than the first trial), from across socio-economic deprivation deciles (Gillon et al., 2023a;McNeill et al., 2023).The data showed significant positive effects for both Tier 1 and Tier 2 BSLA implementation and demonstrated that BSLA was equally effective across schools of differing socio-economic deprivation deciles.Following a competitive Government procurement process, the BSLA research team were selected to provide PLD to support teachers and literacy specialists to implement BSLA across the country.It also extended the development of BSLA from supporting children in new entrant and Year 1 classes to supporting children in Year 2 classes.

Multidisciplinary team
The importance of a building a team with a variety of discipline knowledge, leadership skills and dispositions to support large scale implementation of evidenced based literacy intervention and systems change has been identified (Solari et al., 2020).The colead developers of BSLA (first two authors of this paper) had a strong research and leadership background in speech-language therapy, teacher education and more broadly children's wellbeing.The academic leadership team for BSLA includes leaders and experienced practitioners in junior school class teaching, Māori education, education psychology, Pacific Peoples education and health, speech-language therapy, developmental child psychology, and specialists in children's learning and behavior.This team collectively support the teachers and literacy specialists through the PLD and in school implementation of BSLA.The team engaged with a software company in the design of the assessment website as well as online learning experts in the development of the PLD micro-credential.In addition, the large-scale implementation was supported by an advisory group comprising international leaders in the science of reading, children's language development and cultural leaders in Māori and Pacific Peoples communities.

Participant data included in analyses
This paper describes data routinely collected through the national implementation of BSLA.By default, all assessment data is transferred to an anonymized research dataset; however, all parents are given the option to have their child's data excluded from the research data base.We have used this anonymized research dataset for our analyses.
As at May 2023, assessment data were available for 29,795 five-year-old students (aged 5 y 0 m to 5 y 11 m) at the Baseline assessment point (prior to teachers implementing BSLA).Further assessment data were available for the subsequent monitoring assessment points within BSLA for the following numbers of students: • 16,497 students at the Ten Week assessment point.Teachers administer these assessments after children have received 10 weeks of BSLA Tier 1 teaching or ∼40 BSLA lessons.• A subset of 7,876 children from the larger cohort who were identified as the "school entry cohort" as they were aged 5 y 0 m to 5 y 3 m at baseline assessment and therefore received BSLA from the start (or within a couple of months from the start) of their formal literacy learning.• 5,495 students with greater learning needs who were assessed at the Twenty Week assessment (i.e., after they had received 10 weeks of BSLA Tier 2 teaching).• 2,313 students at the Thirty Week assessment.All children participate in the Thirty Week assessment round after they have received ∼30 weeks of BSLA teaching.At the time of this analysis, data was available for 2,313 children who had received both their Baseline assessment and Thirty Week assessments.
The students included in the data analyses were from 819 schools, spread across all 16 Regional Council areas in New Zealand.The full sample for this analysis had a mean age of 64.3 months (SD = 3.3) at the Baseline assessment point.Children's ethnicities were as follows: New Zealand European (48.3%),Māori (22.1%),Pacific Peoples (9.1%), Asian (12.7%).Student gender: 49.0% female, 51.0% male, and 0.01% other.
Students in the dataset came from all levels of socio-economic deprivation based on the New Zealand Index of Deprivation (Atkinson et al., 2019) (see Supplementary material for details).A Kolmogorov-Smirnov test indicated significant skew to the distribution of socioeconomic deprivation (D = 0.11, p < 0.01; skewness = −0.20,kurtosis = −1.06).The skewed distribution was due to the Ministry of Education initially prioritizing schools in areas of higher socio-economic deprivation for this Early Literacy Initiative opportunity.

Assessment measures
All tasks (except for the non-word spelling task) were presented via a touch screen iPad or laptop.An animated character (male voice with New Zealand accent) spoke to the children for the phoneme awareness and phoneme grapheme matching tasks.The children were required to touch the screen (iPad), or the teacher used a computer mouse to select the child's response.The children's responses were automatically recorded and a detailed response analysis for each child's attempt was immediately available to the child's teacher.Previous research established both the test-retest reliability for the phoneme awareness and phoneme grapheme matching tasks (Carson et al., 2015) as well as the internal consistency of the task items (Gillon et al., 2023a).Gillon et al. (2023b) describe the validity of the online story retell assessment.The tasks on the assessment website were organized in the following order, but teachers could choose to administer the tasks in any order.

Initial phoneme identity
In this task, the animated character asked children to select one of three pictures shown on the screen or iPad that started with the target sound.For example: "Dog likes words that start with a /d/ sound.Can you help dog find words that start with his favourite sound.Which word starts with /d/?"The phonemes tested were: /m/, /s/, /k/, /b/, and /f/ (two items tested each phoneme).Proficiency on this task was set at 8 or more items correct out of 10.Cronbach's alpha across the 10 items was 0.85, indicating high reliability.

Phoneme blending
In this task, an animated character instructed the children: "I am going to say one of these words very slowly.Click on the picture you think I am saying."For example, children were shown the images of a cake, a cape, and a ring, while listening to the character say the individual phonemes of /k//ei//k/.They were then required to click the image of the word they thought the character was saying.Children completed 10 test items.Proficiency on this task was set at 8 or more items correct out of 10.Cronbach's alpha across the 10 items was 0.79, indicating good reliability.

Letter-sound knowledge
In this task children were required to select the letter (grapheme) that matched the speech sound (phoneme) that they heard by tapping on the screen.This letter-sound matching was assessed with two sets of items (eight items in Set 1; m, d, c, t, s, l, n, p), 17 items in Set 2 (Set 2: ch, b, i, f, r, g, e, sh, k, u, j, w, o, a, v, th, h), with children only progressing onto Set 2 if they achieved proficiency at Set 1. Proficiency on Set 1 of this task was set at 6 or more items correct out of 8, and proficiency on Set 2 was set at 13 or more out of 17. Cronbach's alpha was 0.84 for Set 1 and 0.83 for Set 2, indicating high reliability.

Non-word reading
Students were asked to read sets of non-words and their responses were audio-recorded and scored.Children received two points for every correct grapheme within the non-word, with one point given for a partially correct grapheme (such as an added sound or a b/d or p/q reversal error).A total of four sets of non-words (each set containing 10 words) were used, each with an increasing level of difficulty (level 1 example: wep; level 2 example: flom; level 3 example: gace; level 4 example: spream).Students started at Set 1 of non-word reading and only progressed to subsequent sets if they were proficient (scored 80% or higher) on the previous set.A total non-word reading score was calculated as the sum of all four sets (with 0's imputed for any sets that students did not progress to) out of a maximum of 264 points.Interrater reliability was assessed by having a single independent coder score a random sample of 200 non-word reading assessments.The twoway random effects intraclass correlation was 0.92 across the 10 items across sets, indicating high interrater reliability.

Non-word spelling
The non-word spelling task was a parallel task to non-word reading, with students asked to spell the same four sets of nonwords presented in the non-word reading task.The teacher read aloud the non-word and asked the child to write the word onto a record form.Students received a score of 0, 1, or 2 for each grapheme depending on the spelling accuracy.A total non-word spelling score was calculated as the sum of all four sets (with 0's imputed for any sets that students did not progress to) out of a maximum of 264.Interrater reliability was assessed by having a single independent coder score a random sample of 200 nonword spelling assessments.The two-way random effects intraclass correlation was 0.93 across the 10 items across sets, indicating high interrater reliability.

Oral narrative
Students' oral narrative skills and listening comprehension were assessed through a novel story retell task presented digitally on an iPad or computer (Gillon et al., 2023b).In this task, students listened to a story with accompanying pictures on the screen.Following the presentation of the story, children were prompted to retell the story in their own words.The recordings of the children's retells were then transcribed automatically using speechto-text software (Scott et al., 2022), and analyzed using Systematic Analysis of Language Transcripts (SALT) software (Miller and Chapman, 2004).For each student's transcript of their story retell, the following quantitative data was automatically generated for teachers: number of words used, number of different words used, number of nouns, verbs, adjectives, and adverbs used, and percentage of intelligible utterances.In addition, the research team had access to more detailed SALT analyses including mean length of utterance.
Following the retell, children were asked five comprehension questions which were presented on the screen for the teacher to read aloud.Three of the questions were factual questions tapping understanding of character identities and actions.Two were inferential questions tapping comprehension of story context and motivation for character's actions.Teachers scored children's responses online (usually in real time).Each response was scored as 2 for correct, 1 for partially correct, or 0 for incorrect.Interrater reliability of comprehension scoring was assessed by having a single independent coder score a random sample of 200 comprehension tasks.The two-way random effects intraclass correlation was 0.52 across the five items, indicating moderate interrater reliability.

FIGURE
Percent of students proficient at initial phoneme identity before and after weeks of BSLA teaching.

FIGURE
Percent of students proficient at phoneme blending before and after BSLA teaching.

FIGURE
Percent of students proficient at Set letter-sound knowledge before and after BSLA teaching.

Scores used in analysis
On all measures other than the oral narrative, analysis was undertaken on both proficiency levels and raw scores.Proficiency was defined as scoring 80% or higher on the task.When using raw scores, data was imputed at subsequent assessment points once students reached proficiency on the task (as tasks were not repeated once proficiency was demonstrated) (see notes in Supplementary material).

FIGURE
Percent of students proficient at Set letter-sound knowledge before and after BSLA teaching.

BSLA teaching fidelity data
Teachers were requested to complete online fidelity checklists during their first 10 weeks of BSLA teaching.On these checklists, teachers reported on student engagement, total instructional time for class/large group lessons and small group reading, and the activities that were included in the large and small group sessions by selecting all that applied from a list of potential activities.

Skill progression before and after BSLA teaching
Our first set of analyses used a subset of student Baseline data as an internal control group through a comparison of students of the same month of age who had not yet received BSLA teaching (Baseline assessment internal) against those who had received approximately 10 weeks of BSLA teaching (Ten Week assessment).Figures 2-5 plot the percent of students proficient on each assessment task by student month of age at the Baseline and the Ten Week assessment points.Note that these figures begin at 64 months of age to allow for enough children per cell at the Ten Week assessment point.Thus, the figures include 15,380 children at Baseline (internal control group) and 10,532 children at Ten Week assessment (BSLA treatment group).
After 10 weeks of BSLA teaching, students of all months of age showed a higher rate of proficiency on all tasks than students of the same age who had not yet received BSLA.The gap is particularly large for the youngest students (e.g., ages 64-66 months).Effect sizes of the difference between the BSLA group and the no-BSLA group at each month of age on each task indicated medium to large effects of BSLA teaching (initial phoneme identity, Hedges' g range 0.38-0.64;phoneme blending, Hedges' g range 0.48-0.76;letter-sound knowledge set 1, Hedges' g range 0.42-0.73;letter-sound knowledge set 2, Hedges' g range 0.49-0.90).See Supplementary Table 1 for means, standard deviations, and effect sizes.

Growth in assessment scores by ethnicity
We next examined the change in raw scores from Baseline to the Ten Week assessment for each ethnic group.Large effect sizes ranging from 0.63 to 1.81 indicated all groups scores increased in a meaningful way.Effect sizes were higher for Māori and Pacific Peoples relative to NZ European and Asian students on all tasks aside from phoneme blending (see Supplementary Table 2).
Repeated measures ANOVAs with a within-subjects factor of time and between-subjects factor of ethnicity were used to determine whether growth differed significantly by ethnicity.For this analysis, students could not be included in multiple ethnic groups, and we have coded a prioritized ethnicity variable using the following prioritized order: Pacific Peoples, Māori, Asian, NZ European.Any significant Time * Ethnicity interactions (indicating different growth patterns by ethnicity) were followed up with Bonferroni post-hoc testing.Table 1 provides the results of this analysis, with differing subscripts indicating significantly different growth between ethnic groups.
The overall effect of differing growth by ethnicity was minimal for initial phoneme identity and phoneme blending tasks.However, meaningful effects were found on both phoneme-grapheme tasks (equivalent to d = 0.29 for set 1 and d = 0.20 for set 2).In both cases Māori and Pacific Peoples groups showed more growth than Asian and NZ European students.
Figure 6 illustrates the growth in Phoneme-Grapheme knowledge (set 2) from Baseline to the Ten Week assessment by ethnicity.The figure demonstrates a reduction in the gap between ethnicities at Ten Week as compared to Baseline.

School entry cohort
This set of analyses focuses on the 7,883 students who were aged 5 y 0 m to 5 y 3 m at the Baseline assessment and who were re-assessed after 10 weeks of BSLA teaching.This represents children who had just commenced school when BSLA teaching was introduced.We examined the change in scores as well as changes in proficiency rates from school entry (Baseline) to  Differing superscripts indicate significant differences at alpha = 0.05.η 2 p effect sizes are 0.01 = small, 0.06 = medium, 0.14 = large.

FIGURE
Growth in letter-sound knowledge (Set ) after weeks of BSLA teaching by ethnicity.
the Ten Week assessment point within this group of students.
Table 2 provides descriptive statistics for initial phoneme identity, phoneme blending, and letter-sound knowledge for the school entry cohort at Baseline and Ten Weeks.Paired samples t-tests indicated significant change from Baseline to Ten Weeks on all tasks (all t's > 85.62, all p's < 0.001).
As seen in Table 2, shifts in both mean and median scores are evident after 10 weeks of BSLA teaching.As an illustration of this rapid growth Figures 7, 8 show the distribution of the number of children who reached proficiency levels (8, 9, or 10 items correct) for the initial phoneme identity task at Baseline and Ten Week assessment, respectively.Table 3 provides the proficiency rates for the school entry cohort at Baseline and at Ten Week assessment.

Non-word reading and non-word spelling
This analysis focused on the 2,313 students who completed the Thirty Week assessment.Only 12.3 and 1.7% of students completed the non-word reading and non-word spelling tasks at Baseline assessment, respectively.For any students who did not attempt these tasks at Baseline we have imputed a score of 0 at Baseline.Repeated measures ANOVAs were used to examine growth over time in non-word reading and non-word spelling at the Baseline, Ten Week, and Thirty Week assessments.We examined the between-subjects factors of ELL status, gender, Māori ethnicity, Pacific Peoples ethnicity, and socioeconomic deprivation to determine whether growth differed based on these factors.
There was significant growth in non-word reading scores from Baseline to Thirty Weeks [F (2,4,108) = 426.48,p < 0.001; η 2 p = 0.17], with significant differences between scores at each of Baseline, Ten Weeks, and Thirty Weeks (p's < 0.001).There were no interactions with gender, ethnicity, socioeconomic deprivation, or ELL status, indicating similar rates of growth across demographics (all η 2 p < 0.009).
There was also significant growth in non-word spelling scores from Baseline to Thirty Weeks [F (2,3,828) = 375.13,p < 0.001; η 2 p = 0.16], with significant differences between scores at each of Baseline, Ten Weeks, and Thirty Weeks (p's < 0.001).There were no interactions with gender, ethnicity, socioeconomic deprivation, or ELL status, indicating similar rates of growth across demographics (all η 2 p < 0.007).

BSLA Tier
There were 2,355 children who were identified as receiving 10 weeks (or 40 lessons) of supplementary BSLA Tier 2 and who were assessed at the Twenty Week assessment point.This equates to 14.3% of students assessed at Ten Weeks who received Tier 2. There was no difference in the percentage of English Language Learners in the Tier 2 (9.9% ELL) and non-Tier 2 students [10.8% ELL; χ 2 (1) = 1.64, p = 0.20], nor in the gender distribution of Tier 2 (52.7% male) and non-Tier 2 students [50.4% male; χ 2 (1) = 3.65, p = 0.06].Tier 2 and non-Tier 2 students also didn't differ in their age (Cohen's d = 0.09) or socioeconomic deprivation (Cohen's d = 0.12).
We examined growth for the Tier 2 children across the Baseline, Ten Week, and Twenty Week assessment points using repeated measures ANOVAs (see Table 4).There was significant growth for Tier 2 children on initial phoneme identity, phoneme blending and both letter-sound tasks.Bonferroni post-hoc tests indicated significant differences between Baseline, Ten Weeks, and Twenty Weeks on all tasks (all p's < 0.001).

Growth in non-word reading and spelling in response to Tier
To determine the impact of Tier 2 teaching on students word decoding and encoding ability, we selected a matched control group of demographically similar students but who did not require Tier 2 teaching during their first year at school.There were 432 students with data available at the Thirty Week assessment who had received at least 10 weeks of Tier 2 teaching.Our control group was selected from the remaining 1,881 students with data at the Thirty Week assessment.Case control matching was used to select a matched non-Tier 2 student for each Tier 2 student based on gender, age in months (±1 month), and socioeconomic deprivation (plus or minus one decile), resulting in 432 matched controls.There were 189 females (43.8%) in both the Tier 2 and non-Tier 2 matched  groups.Age in months did not differ significantly between the groups [t (862) = 0.39, p = 0.70; d = 0.03]; however, the control group scored higher in socioeconomic deprivation than the Tier 2 group, although this was a very small effect [t (862) = 2.05, p = 0.04; We examined growth for these two groups (ie children who received Tier 2 and children who did not require Tier 2) in both non-word reading and non-word spelling across 30 weeks of BSLA teaching.
Figures 9, 10 show the growth in non-word reading and non-word spelling, respectively, from Baseline to Thirty Week assessment.Note that children in Tier 2 completed an additional post-Tier 2 assessment (at Twenty Week assessment).
After 10 weeks of BSLA teaching there was a substantial difference between Tier 2 students and non-Tier 2 students, indicating that teachers had correctly identified students requiring further support after 10 weeks of teaching [non-word reading: t (789) = 5.03, p < 0.001, d = 0.36; non-word spelling: t (754) = 6.14, p < 0.001, d = 0.44].At the 30 Week assessment, however, once Tier 2 students had completed Tier 2 teaching, there was no longer significant differences between these students and their matched controls on non-word reading [t (816) = 1.41, p = 0.16, d = 0.10] or non-word spelling [t (784) = 1.81, p = 0.07; d = 0.13], indicating Tier 2 students had caught up to their peers.

Tier students' task response times
Mean response times were computed for each online assessment task (initial phoneme identity, phoneme blending, and letter-sound task) across task items.Change scores, reflecting the reduction in mean response time from Ten Week to Twenty Week assessments, were then computed.
We first compared data for the 864 Tier 2 students and matched non-Tier 2 controls at the Ten Week assessment point.There was a trend for Tier 2 students to take longer on all tasks compared to non Tier 2 matached controls at the Ten Week assessment (ie., pre-Tier 2 implementation).The response time differences were statistically significant (p < 0.05) for phoneme blending and letter-sound matching (Set 2) tasks (results are shown in Supplementary Table 3).
We next examined growth from Ten Weeks to Twenty Weeks (i.e., pre-to post-Tier 2) in the 2,355 students who had completed BSLA Tier 2 using repeated measures ANCOVAs, with change in response time as the covariate.For these analyses, only students who completed the task both pre-and post-Tier 2 are included.On all tasks other than phoneme blending there was a significant interaction between time and change in response time (see Table 5).In other words, the more students' response times reduced between the pre-and post-Tier 2 assessments, the greater their growth in scores over this time on initial phoneme identity and lettersound task.

Oral language and listening comprehension
Students were assessed on their oral narrative ability and listening comprehension at the Baseline and Thirty Week ./feduc. .a Eta-squared effect sizes (η 2 ) have been converted to Cohen's d using formulas available in Cohen (1988).Raw scores (number correct) are used in the analyses.assessment points.Repeated measures analyses of variance were used to examine growth over Thirty Week assessment of BSLA teaching in the 1,518 students assessed at Baseline and Thirty Week on the story retell task.In addition to examining within-subjects growth over time, we also examined the between-subjects factors of gender, ethnicity, and socioeconomic deprivation [coded as high deprivation (deciles 6 through 10) or low deprivation (deciles 1 through 5)] to determine whether growth differed based on these factors.We used an effect size threshold of partial eta-squared (η 2 p ) >0.01 (equivalent to Cohen's d = 0.2) to identify significant effects.
Significant growth over 30 weeks of BSLA teaching was observed in listening comprehension [F (1,1,399) = 31.71,p < 0.001;  While growth in intelligibility was statistically significant, it did not meet our effect size threshold for significant growth [F (1,1,336) = 9.59, p = 0.002; η 2 p = 0.007].Thus, the largest growth effect was seen for mean length of utterance (d = 0.54), with more modest effects for listening comprehension, number of words, and number of different words (all d = 0.29).Figure 11 shows the shift in the distribution of mean length of utterance scores from Baseline to Thirty Weeks.
There were no significant interactions with any between-subjects factors (all η 2 p < 0.006), indicating that patterns of growth in oral narrative abilities and listening comprehension were comparable across gender, ethnicity, and socioeconomic deprivation.

BSLA teaching fidelity
A total of 2,805 fidelity checklists were completed by BSLA teachers during the first 10 weeks of BSLA teaching.In terms of student engagement with BSLA lessons, teachers reported that overall, most students' engagement with the activities was high (65.2%),with a smaller percentage of lessons where engagement was rated as average/variable (32.3%) or low (2.5%).The majority of teachers (70%) reported spending 30 min or more in implementing the BSLA daily lesson plans with 26% indicating they spent between 15 and 25 min daily and 4% indicating they spent <15 min.See Table 4 in Supplementary material for the percentage of lessons that included each of the potential Tier 1 BSLA activities and small group reading instruction activities.
Overall, on average teachers included 6.4 (SD = 1.5) different activities in their daily 30 min large group lessons and 4.2 (SD = 1.1) activities in their daily small group instructional reading lessons.

Discussion
Large scale implementation of effective early literacy teaching approaches is critically important as we strive toward education In this study, we presented data from a large-scale implementation of an early literacy approach referred to as the Better Start Literacy Approach (BSLA) within the New Zealand education context.The BSLA was specifically designed for large scale implementation.Its pathway from initial design to large scale implementation included several features that are consistent with recommendations from other research groups (Hoover and Soltero-González, 2018;Solari et al., 2020).First, there was strong investment in the design and development phase (over a 4-year period).The approach was based on the science of reading and braided together important research findings related to cognitive, ecological (including cultural), and psychological factors that influence children's literacy development.Its development involved consultative processes between a multidisciplinary team of researchers, practitioners, Māori and Pacific leaders, school leaders and policy makers.The implementation plan was coconstructed with class teachers and evolved over time in response to teacher feedback.Controlled research trials were implemented to establish the approach was more effective than teachers' existing early literacy teaching practices.Significant attention was given to the development of online PLD using a micro-credential professional learning framework to support teachers across the country to implement the new approach.Novel online assessments for reliable and efficient ways for teachers to monitor student growth and guide teaching content relevant to local cultural contexts were developed.
Investment in this development phase allowed for the rapid large scale implementation of BSLA across the country.Funded by The New Zealand Ministry of Education, within a 2.3 year time period (February 2000-May 2023) over 3,000 teachers from 819 schools were involved in implementing BSLA in junior school classrooms.The online model of PLD proved particularly useful as this implementation phase occurred during periods (e.g., 2-3 month periods) of school closures due to COVID-19 pandemic disruptions.
Findings from analyses using data that teachers routinely collected online as part of BSLA are very encouraging.After the teachers had implemented just 10 weeks (or 40 Tier 1 daily lesson plans) 5-year-old children (n = 10,532) showed a higher rate of proficiency on phonic and phoneme awareness measures than students of the same age who had not yet engaged in BSLA teaching (n = 15,380).Effect sizes of the difference between the BSLA group (n=10,532 children) and the no-BSLA group (N = at each month of age between 64 and 70 months on each task indicated medium to large effects sizes favoring BSLA teaching.Children commencing school at 5 y 0 m−5 y 3 m and received BSLA from the outset of their literacy teaching showed remarkable growth in lettersound knowledge, phoneme identity and phoneme blending skills following 10 weeks of BSLA.The majority of these learners reached proficiency on these tasks at the Ten Week monitoring assessment. These data add to the evidence base of effective Tier 1 early literacy teaching strategies within MTSS (e.g., Fuchs and Deshler, Al Otaiba et al., 2019;Petscher et al., 2020).They suggest that when teachers are well supported to provide explicit and systematic instruction in phonics and phoneme awareness as part of Tier 1 universal teaching, most 5-year-old children can master these skills.Given the robust evidence demonstrating proficiency in phoneme awareness is a strong predictor of early reading and spelling success (see Gillon, 2018 for a review) developing these skills early is important.The speed of learning for the school entry cohort (7,876 children aged 5 y 0 m−5 y 3 m), observed in the current study, is particularly impressive.Key teaching aspects within BSLA such as, focusing phonological awareness activities at the phoneme level, integrating letter-sound knowledge with phoneme awareness activities, and the pace of teaching through the BSLA scope and sequence, may have contributed to this rapid acquisition of phonic and phoneme awareness skills.
Teachers selected 14.3% of students after 10 weeks of Tier 1 teaching for additional Tier 2 support.These learners showed significant growth in phonic and phoneme awareness skills in response to 10 weeks of BSLA Tier 2 small group lesson plans.Their task response times were also improving, suggesting that on most tasks they were becoming both more accurate and more efficient in processing phonological information.By 30 weeks there was evidence that children with greater learning needs who received BSLA Tier 2 teaching had caught up to their peers in word reading and spelling skills.This is a rare finding outside of tightly controlled experimental studies.
These data provide further support to the value of MTSS for early literacy teaching.Within BSLA, Tier 1 and Tier 2 teaching activities are aligned.Such alignment provides increased intensity of teaching for those children with greater learning needs.It helps to reduce cognitive load and build learners' confidence through familiarity of tasks and game activities and provides additional time in small group reading instruction.A further design feature of the BSLA implementation was significant Government investment to engage differing professional groups (e.g., class teachers, speech-language therapists, literacy specialists), in the same high quality Professional Learning and Development (PLD) (schools self-funded their teacher aides to enroll in the BSLA Teacher Aide micro-credential).This was intentional to build "professional learning communities" (Vescio et al., 2008) in localized areas where a common practice model in supporting children's early literacy learning can evolve.It also helped ensure that professionals involved in supporting children in differing tiers of support within MTSS were familiar with the same monitoring assessment measures, data analyses, teaching content and evidenced based teaching strategies.This provides consistency in practice for children and their families and maximizes learning opportunities for children.
An important aspect to the BSLA was explicit teaching of skills that supported children's oral language comprehension and vocabulary growth as well as their word recognition skills (as aligned to the Simple View of Reading).Data from a novel online oral narrative assessment (Gillon et al., 2023b) indicated significant growth in children's comprehension of both factual and inferential information when listening to a short story as well as growth in expressive vocabulary measures when retelling a story.The successful large-scale implementation of this online oral narrative task by class teachers as part of their regular teaching practice is notable.Evidence that teachers are systematically monitoring children's oral narrative, listening comprehension and expressive vocabulary in valid and reliable ways (in addition to phonic and phoneme awareness progress monitoring) is rare in classroom practice.Teachers' use of valid and reliable online assessments that capitalize on latest digital technologies, automated speech transcription and automated recording and reporting features (as used in BSLA monitoring assessments) hold much promise for large scale implementation.Such features enable teachers to monitor each child's growth in foundational literacy skills in efficient ways that can inform their next steps for teaching.When assessments are designed and used in strengths-based ways to celebrate children's learning and are aligned to activities, they may also support the sustainability of enhanced or new teaching practices.
Teacher self-reported fidelity measures indicated teaching fidelity was high during their first 10 weeks of BSLA implementation.Given the supports teachers were provided (weekly Zoom sessions, lesson plans, online video lesson demonstrations, teaching resources and a literacy specialist to coach or support them through these weeks) this is not a surprising finding.Longer term data and independent observation of teachers' implementation is important in future studies to further investigate teaching fidelity and the sustainability of their enhanced literacy teaching.However, the data of students' progress over a 30 week teaching period suggests teachers were implementing BSLA in ways that significantly enhanced children's foundational literacy skills.This finding contrasts with some previous studies of large-scale literacy PLD.For example, Piasta et al. (2017) through a well-designed study did not find any evidence that literacy PLD delivered state-wide in the USA early education context enhanced children's early literacy achievement.The model involved a series of 2-day face to face workshops with a minimum of 14 h PLD.They did find more positive results when teachers received monthly inclass coaching (vs.no-coaching), but overall effects were minimal.The authors raised important considerations around PLD quality measurement challenges when implementing at large scale and the need for PLD to be embedded within curricula.Within BSLA, teachers have continuous access to the online PLD content and video demonstrations of teaching activities prepared and presented by the BSLA research team and experienced teachers.They also have access to weekly online live Zoom sessions with the research team (even after they have completed their BSLA micro-credential).This ongoing access to high quality PLD may support more positive student learning outcomes.Continued research into the facilitators of ensuring enhanced teacher knowledge and practice leads to significant improvements in learners' literacy achievement is necessary.
The call to action from the WHO to direct resources and investment in developing comprehensive literacy skills in our diverse communities, where education inequities are greater, requires new approaches.The integration of both a culturally responsive and strengths-based approach to early literacy teaching within a MTSS framework are an important feature of BSLA.
These features specifically responded to this call for action as well as responded to the voices of Māori and Pacific leaders in New Zealand.The BSLA development integrated research findings highlighting the importance of cultural relevance, cultural appropriateness, valuing and engaging families, strengths-based language and reporting to facilitating success for indigenous learners and children with greater learning needs.Children who identified as Māori or Pacific Peoples group showed accelerated growth following 10 weeks of BSLA teaching on some measures, catching up to their Asian and NZ European peers.Following 30 weeks of BSLA teaching, positive change was reported across ethnic groups, across areas of different socio economic deprivations, for both males and females and for English Language Learners.These data suggest that the early accelerated progress for Māori and Pacific learners was followed by sustained growth in similar positive ways as their peers.These findings of large-scale positive impact for 5-year-old children who identify as Māori and Pacific Peoples across New Zealand are unique.These findings, however, are not surprising given the BSLA research team specifically drew upon research evidence and knowledge from other Māori and Pacific scholars that identified aspects of teaching and learning environments that facilitate learning success for Māori and for Pacific learners.
These positive findings for Māori young learners in New Zealand have international significance for other indigenous and culturally diverse communities.In particular, they highlight the critical importance of consulting and involving indigenous leaders in large scale implementation design and attending to findings from strengths-based research led by indigenous researchers.Our global aim is to decrease current literacy inequities.We must consider how new literacy approaches introduced at scale are meeting the needs of all learners.We should ensure teaching resources, game activities, stories, decodable reading series, vocabulary items used in activities and family engagement strategies selected to build children's oral language and early reading and writing skills reflect local cultural contexts.Approaches that help facilitate cultural inclusiveness and culturally responsive teaching practices within large scale early literacy implementation projects may ensure we are not simply advancing foundational skills in a country's dominant language and culture, Rather, we need to ensure we are also meeting the early literacy needs for our indigenous learners and learners from culturally and linguistically diverse communities.This is an important area for more in-depth and ongoing research.
A further consideration for ongoing research is evaluating the wider societal benefits for a region from a large-scale implementation of effective early literacy approach within the region.A preliminary attempt at such an evaluation is reported through an independent analysis of the potential social value return on investment for the national implementation of BSLA within New Zealand (ImpactLab, 2021).The independent group (Impactlab) who had access to our BLSA data from controlled research trials used a methodology of estimating the known longer term positive outcomes from early literacy achievement as well as estimated reduction in adverse behaviors associated with reading failure.This evidence was used alongside the investment made in BSLA implementation and results shown within the analysis.This analysis suggested that for every dollar invested in BSLA the social value return on investment for New Zealand was $38.20.Such analyses are important to support further Government investment and the sustainability of new literacy approaches introduced.

Summary
This study has highlighted features of a large-scale implementation of an evidenced based literacy approach (BSLA) within the New Zealand education context.Data demonstrating a strong positive response from thousands of 5-year-old children across ethnicities, socio-economic areas and regional areas is very encouraging.The data suggest that when class teachers are through quality PLD, they can rapidly develop the foundational literacy skills that are critical to reading and writing success during children's first year at school.Significant government investment in both design and implementation phases is beneficial and is potentially a strong social value investment for a country.The longer-term advantages from such investment in early literacy achievement for these 5-year-old learners in New Zealand now awaits detailed analysis.Understanding pathways and strategies for large scale implementation of effective literacy approaches that differing countries and regions adopt is a continued worthwhile pursuit to address our collective global literacy aspirations.

Pacific
Peoples ethnicity refers to Pacific Peoples groups (including: Sāmoa, Tonga, the Cook Islands, Niue, Tokelau, Tuvalu, and other smaller Pacific Peoples nations) who are now living in New Zealand (NZ Stats Census).

FIGURE
FIGUREHighlights the He Awa Whiria framework for BSLA of braiding together multiple influences on children's early literacy development.

FIGURE
FIGUREInitial phoneme identity score distribution at baseline: school entry cohort.

FIGURE
FIGURENon-word reading scores over weeks of BSLA teaching for Tier students and controls.

FIGURE
FIGURENon-word spelling scores over weeks of BSLA teaching for Tier students and controls.

Time*
FIGUREDistribution of mean length of utterance scores at Baseline and Thirty Week assessments.
TABLE Interactions between time and ethnicity in repeated measures analysis.
TABLE Descriptive statistics for school entry cohort assessments.
TABLE Proficiency rates for school entry cohort at baseline and weeks.
TABLE Repeated measures analysis of growth for Tier students.