About the Project

The story behind the Learner's Corpus of Kazakh

About the Project

The Birsoz Initiative

The platform emerged from the Kazakhstan–UK collaboration including the Birsöz initiative at Oxford University.

The database includes texts for A1, A2, B1, B2, C1 levels, lexical and grammatical references, picture dictionary, video materials, exercises and tasks.

It provides comprehensive information about Kazakh phonetics, vocabulary, grammar while offering in-depth introduction to Kazakh culture.

Project Milestones

2019

Birsoz Initiative Launched

Collaboration between Oxford University and A. Baytursynuly Institute of Linguistics begins.

2020

Corpus Data Collection

Linguists compile graded texts, vocabulary, and grammar references across CEFR levels.

2022

Platform Development

Interactive web platform built on the National Corpus of Kazakh Language infrastructure.

2024

AI-Powered Features

Speech recognition, text-to-speech, and AI conversation practice integrated into the platform.

2025

Public Launch

Learner's Corpus of Kazakh opens to learners worldwide with 58 reading texts and full exercise suite.

Curriculum Levels

Materials aligned with CEFR, ensuring a structured and progressive learning experience from beginner to advanced.

A1

Beginner

Basic phrases, greetings, introductions, simple questions about personal details.

A2

Elementary

Routine tasks, simple descriptions of background, immediate environment, and needs.

B1

Intermediate

Main points of familiar matters, travel situations, personal interests and experiences.

B2

Upper-Intermediate

Complex texts on concrete and abstract topics, fluent interaction with native speakers.

C1

Advanced

Demanding, longer texts with implicit meaning, fluent and spontaneous expression.

External Resources

Related institutions and platforms supporting Kazakh language education.