New Post: sinhala is not just low resource under evaluated #9

Merged
madelzzz merged 1 commit into Cohere-Labs-Community:main from SayuruRehan:blog/sinhala-under-evaluated on Jul 2, 2026

Conversation

@SayuruRehan

Contributor

The post discusses why Sinhala NLP should be understood not only as a data-scarcity problem, but also as an evaluation problem. It highlights existing progress such as FLORES, SinhalaMMLU, and Aya-style multilingual research, then proposes a community-shaped evaluation agenda covering Sinhala script, Romanized Sinhala, code-mixing, local knowledge, and human evaluation.

The draft has already been reviewed and approved by the Cohere Labs team.

@madelzzz merged commit 28cde32 into Cohere-Labs-Community:main on Jul 2, 2026
1 check was waiting

2 participants