ThreadSky
About ThreadSky
Log In
simonwillison.net
•
62 days ago
Wrote up my notes on ModernBERT, the brand new modern alternative to 2018-era BERT released by @benjaminwarner.dev and @howard.fm and team
https://simonwillison.net/2024/Dec/24/modernbert/
Comments
Log in
with your Bluesky account to leave a comment
[–]
davidhuang.blog
•
62 days ago
📌
0
reply
[–]
masoudmaani.bsky.social
•
62 days ago
sorry for the bother, but there's a doppelganger of you out here with the ID
https://bsky.app/profile/simon.fedi.simonwillison.net.ap.brid.gy
It would be a good idea to post it for report if it's not related to you.
best regards
0
1
reply
[–]
simonwillison.net
•
62 days ago
It's me - that's my mastodon account automatically reposted to here
2
1
reply
[–]
masoudmaani.bsky.social
•
62 days ago
ok, nice, you might wanna know that handle doesn't register as a valid DID, which might lower its visibility.
0
1
reply
[–]
simonwillison.net
•
62 days ago
I didn't see that one up, someone created it a while ago using @ap.brid.gy
1
1
reply
[–]
masoudmaani.bsky.social
•
62 days ago
ah, I see, in any case, this is how it shows up as a DID, or rather doesn't show up, good people at @ap.bird.gy might wanna look into it.
0
reply
[–]
simonwillison.net
•
62 days ago
More on ModernBERT here
https://bsky.app/profile/benjaminwarner.dev/post/3ldur45oz322b
11
1
reply
[–]
scottonote.bsky.social
•
62 days ago
do we still need an NER model fine-tuned on top of ModernBERT?
0
reply
[–]
zehavoc.bsky.social
•
62 days ago
It’s a pity you didn’t mention the French core and corresponding authors
0
1
reply
[–]
simonwillison.net
•
62 days ago
What's the French core?
0
1
reply
[–]
zehavoc.bsky.social
•
62 days ago
(A and B) C
0
1
reply
[–]
zehavoc.bsky.social
•
62 days ago
Anyway, I meant Benjamin Clavier and Antoine Chaffin
0
reply
[–]
howard.fm
•
62 days ago
Thanks for checking it out Simon :)
9
reply
[–]
adambrosiomd.bsky.social
•
62 days ago
Hi! Can you suggest some resources on practical use cases of such models?
0
1
reply
[–]
rasmus1610.bsky.social
•
62 days ago
Sentence classification, information retrieval, named entity recognition are among the top use cases.
Everything that isn’t concerned with generating text but doing „discriminative“ tasks on text.
2
2
reply
[–]
callmephilip.com
•
62 days ago
Original announcement also stresses the fact that ModernBERT was trained on a whole lot of code. This unlocks all sorts of code search related use cases
2
reply
[–]
adambrosiomd.bsky.social
•
62 days ago
How does it do name entity recognition!?
0
1
reply
[–]
rasmus1610.bsky.social
•
61 days ago
https://huggingface.co/dslim/bert-base-NER
1
reply
[–]
arnicas.bsky.social
•
62 days ago
Sad lol: “I'm not sure why I had to use numpy<2.0 but without that I got an error”
6
reply
Posting Rules
Be respectful to others
No spam or self-promotion
Stay on topic
Follow Bluesky's terms of service
×
Reply
Post Reply
Comments
https://bsky.app/profile/simon.fedi.simonwillison.net.ap.brid.gy
It would be a good idea to post it for report if it's not related to you.
best regards
Everything that isn’t concerned with generating text but doing „discriminative“ tasks on text.