ayukh.bsky.social
Statistics MSc @ ETH Zurich
Multilingual LLM training/eval/safety MSc soldier @ SRI lab
ayukh.com
17 posts
121 followers
469 following
Regular Contributor
comment in response to
post
Yes, maybe I should have specified that earlier :)
comment in response to
post
Yes, this basically means more publicly available materials
For example, more ready-to-use data (e.g. web-scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals, etc. This screenshot is from the INCLUDE paper by Cohere, which includes Ukrainian exams, so it is a new resource for evals 🙂
comment in response to
post
I have previously seen many papers claiming that Ukrainian is a low-resource language, even though there are ~40 million Ukrainian speakers worldwide, so there should be a lot of evidence behind that claim
Since 2022, the Ukrainian NLP effort has grown dramatically, and the number of Ukrainian texts available online has increased
comment in response to
post
👋
comment in response to
post
🤙🫡
comment in response to
post
As they say
comment in response to
post
Thanks for the invite 💅