Experimental Design Details
This is a prospective, pre-registered, two-arm, parallel-group RCT with repeated measures at three time points: pre-test, immediate post-test, and delayed post-test (two months post-intervention). The target population is Cantonese-speaking children aged 4–5 enrolled in Hong Kong kindergartens. Recruitment will use convenience sampling via kindergarten flyers and online channels. Parents complete screening on eligibility (age, Cantonese L1, current enrollment, no structured music training, no special educational needs per parent report) and demographics (parent education/income; child age/gender; kindergarten type). After baseline testing, participants are assigned to MSIP or control using a computer-generated random sequence with permuted blocks of variable size, stratified by age (4 vs 5) and gender, with allocation concealment via sequentially numbered opaque sealed envelopes opened after baseline completion.
Intervention delivery: MSIP consists of twelve 30-minute sessions, delivered in small groups of approximately 3–5 children (implementation aim n≈5) by a single trained tutor (Hong Kong registered ECE teacher with ABRSM Grade 6 Piano) using manualised lesson plans. MSIP integrates listening, tonal–rhythmic training (including fixed-do Solfège), imitation/repetition, movement, body percussion and cabasa work, and keyboard (Yamaha Electone) reproduction, plus brief chant-based speech-beat integration (nonsense syllables) intended to map speech prosody to musical meter while minimising vocabulary demands. Attendance is recorded to quantify dosage. Fidelity is monitored via per-session checklists and audio/video recording of 20% of sessions with independent adherence ratings (target ≥85% adherence).
Outcome measurement: Music outcomes include aPMMA tonal and rhythmic subtests (15 items each; yes/no same-different format) and APT (13-note sequence across one octave; Solfège identification under brown noise). Speech outcome is EPA percent accuracy from syllable deletion (6 items) and onset-phoneme deletion (17 items), adapted from prior work. Planned analyses include multilevel, multivariate difference-in-differences models to account for nested item/trial structure and repeated measures, with multiple imputation for missing data and sensitivity checks.