2/2 My hacky attempt at changing their codebase to accept Llama 3.1 8B Instruct. Pretty cool that the 'early-site/late-site' findings replicate somewhat even on a single sample. Very curious to see my sweep over the full 1,209 samples from their paper once it finishes, for more representative results :D
Thank you, that's very kind! Credit to the ROME authors for how cool the plots look; I'm using their public GitHub code. Just posted some results comparing to the base model too :)