
About the LLANIA dataset
The new Longitudinal Literacy and Numeracy in Australia (LLANIA) dataset is the Australian Education Research Organisation (AERO)'s first project aimed at maximising the value of educational data.
During 2022 and 2023, AERO undertook a national data linkage project, successfully linking up to 4 rounds of National Assessment Program – Literacy and Numeracy (NAPLAN) results for every school student in Australia, corresponding to their performance in Year 3, Year 5, Year 7 and Year 9. This resulted in LLANIA, a fully de-identified longitudinal NAPLAN dataset that has the potential to make great contributions to Australia’s education system.
The LLANIA dataset comprises:
- data from 6,270,515 students enrolled in the Australian education system from 2008 to 2021
- fully linked data from Year 3 to Year 9 for about 25% of these students (N = 1,594,261).
LLANIA can be used to investigate a range of educational questions, such as:
- how student learning grows across different educational domains
- how specific student groups perform, including the effects of disadvantage, adverse events or interventions aimed at improving student learning.
Longitudinal Literacy and Numeracy in Australia Dataset: Technical Report
AERO's Longitudinal Literacy and Numeracy in Australia (LLANIA) Dataset: Technical Report describes the creation of the LLANIA dataset, including the linkage process and quality assurance methods.
Keywords: educational datasets, student progress, learning outcomes, student performance, longitudinal data