Profile avatar
ayukh.bsky.social
Statistics MSc @ ETH Zurich Multilingual LLM training/eval/safety MSc soldier @ SRI lab ayukh.com
17 posts 121 followers 469 following
Regular Contributor
comment in response to post
Yes, I should have specified it before maybe :)
comment in response to post
This means basically more publicly available materials, yes For example, more ready-to-use data (e.g. web scraped texts) for LLM fine-tuning, more Ukrainian-native benchmarks for evals etc. This screenshot is from INCLUDE paper by Cohere which has Ukrainian exams in it, thus a new resource for eval🙂
comment in response to post
Before I have seen many papers claiming Ukrainian language to be low-resource, even though there are ~40 mil UA speakers worldwide, so there should be a lot of proof to that Since 2022 Ukrainian NLP effort has dramatically increased and the number of Ukrainian texts available online has increased
comment in response to post
👋
comment in response to post
🤙🫡
comment in response to post
Як кажуть
comment in response to post
Дякую за інвайт💅