Profile avatar
ashvanths.bsky.social
Deep Learning Practitioner | Language Lead for Tamil @ HuggingFace | Interested in Continual Learning and Generative Models | Website : https://ash-01xor.github.io/ X : https://twitter.com/ashvanth_s1
59 posts 57 followers 73 following
Prolific Poster
Conversation Starter

Quite a humbling experience every day while coding. You start with an issue and a vision about how to solve the problem and then pretty much the road traveled often to reach the solution isn't straightforward. Humbled each and every day to understand and accept and that it is how it is.

Pretty similar to how Jio first gained share of the internet users in India. Interesting to note big companies have the ability to shell out too much to develop and operate to gain market share. Only time shall tell what this will lead to

Over a period of time , getting to realize that im having my flow states during certain periods of time and getting to schedule tasks around it. Guess the goal is to build systems that can make sure we enter such states like on and off button.

Not able to point of the difference particularly , but gpt-4o-mini seems to work way too fast over the last day. From taking around 4 to 5 mins to process a 65-page PDF for extraction, it takes around 3 mins. Do you guys want me to run benchmark tests and probably write a blog post about it ?

Looking forward to the next unit of the Agents course and building more @benburtenshaw.bsky.social @hf.co

I just finished writing up my take on reasoning models: magazine.sebastianraschka.com/p/understand... Here, I 1. Discuss the advantages & disadvantages of reasoning models 2. Of course, describe and discuss DeepSeek R1 3. Describe the 4 main ways to building & improving reasoning models

Slowly building it one at a time. Thanks to @sebastianraschka.com for his book. Implementing things from scratch takes a lot of time , but valuable experience. github.com/ash-01xor/Re...

Building SmolGPT myself , have plans to extend it. but before that struggling with managing python versions !!! Had to use pyenv and then pip. like now i get why experienced devs are frustrated with python package management

Updated my site after quite a long time also added a note for how to update your arch linux system. Do check it out if you use arch or if you like to as well :)

Only a few more annotations are needed to complete the initial goal. for Tamil. Do join the initiative alongside me , your contribution is highly valuble

Interesting to see the hype of agents and using them , but almost everyone who uses the term throws it away just like that. All I get to see is a clearly well-defined workflow in a constrained environments most of the time and yet they are being called 'agents'.

Got to find this today only in Python when i made a typo by mistake. How does the for loop work when i present the number inside range like that ??

Well, we are halfway through our initial goal of the Fineweb-C sprint for Tamil. Hopefully I would love to complete the initial goal of annotating 1000 texts within the next two days Do join if you would like to contribute! data-is-better-together-fineweb-c.hf.space/share-your-p...

Since being used to python development from the start i dont think i never had an issue using pyenv , venv , conda etc. Like it never felt like a chore. But then hearing about devs from other communities really does make me question why .

got to read that alec radford left open ai , like what is even happening at open ai

Somehow deep down i always get to think about how optimization of any process leads to boredom over a period of time. The excitement and the risks once taken might decreases due to the numbers the clouds our judgement. Like while recruiting , where folks are given standard questions to solve or ..

Well, around 10 percent of the initial goal is complete, and so far, it's been quite a one-man army effort. We're still in the hunt for more people to join and contribute to this open-source initiative. @hf.co data-is-better-together-fineweb-c.hf.space/share-your-p...

The process has just begun, and we are actively seeking collaborators for Tamil. Join us in this open-source initiative! Building better models demands a better annotation process, and we are deeply committed to achieving this together data-is-better-together-fineweb-c.hf.space/share-your-p...

Its been great few weeks reading about agents after going through the course conducted by Berkeley. While there was lots of insightful talks , one that was particularly insightful was on week 4 , where Burak Gokturk got to talk about enterprise trends about agents. The most important trend...

coding when someone is watching is quite a nervous experience. all of sudden there is quite a bit of fumbling , struggling to come up with names..

Introducing Maya – A New Multimodal Multilingual Vision-Language Model. Maya is a completely open source, open weight, and open dataset, designed to handle 8 languages, cultural diversity, and nuanced real-world contexts in vision-language models. Paper: arxiv.org/abs/2412.07112

Vanakkam makkalae , glad that I’ll be leading the FineWeb 2 collaborative annotation sprint for Tamil! 🤗 I’ll be helping to build an open dataset to improve language models for our language. Do join the process of improving models ! huggingface.co/spaces/Huggi... huggingface.co/spaces/data-...

Exciting things coming up @hf.co . Can't wait to reveal tomorrow.

Weekend was quite cool , got to just keep my head down and work on retrieval engines over a custom database. Implemented thing right from TF-IDF to RAG and saw quite some interesting things. Anyone who says TF-IDF is not the best must need to implement it and check it first...

Everyday as i code , i find more hidden details and my shortcomings.

4th day and im already feeling the difficult of it rising. "Ceres Search" - Day 4 - Advent of Code 2024 #AdventOfCode adventofcode.com/2024/day/4

Cool work done by huggingface @benburtenshaw.bsky.social . Loving the smol-course , got to train a small model on the stack dataset (just on the python subset of it) and upload it to the hub. Link to the model: huggingface.co/collections/... The idea is not to use the model trained right away ....

Got to complete "Mull It Over" - Day 3 - Advent of Code 2024 #AdventOfCode adventofcode.com/2024/day/3

Yep got to complete "Red-Nosed Reports" - Day 2 - Advent of Code 2024 #AdventOfCode adventofcode.com/2024/day/2

I've completed "Historian Hysteria" - Day 1 - Advent of Code 2024 #AdventOfCode adventofcode.com/2024/day/1

Friendly remainder.