shan23chen.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

comment in response to post

Congrats!

submitted 72 days ago

comment in response to post

Source: t.co/mV27ZZg5MN

submitted 126 days ago

comment in response to post

Yeah… he does have a problem with portraying women in stereotypical ways. He gets a lot of criticism in China too.

submitted 153 days ago

comment in response to post

During the Q&A session, someone raised this issue with her very respectfully, and her response was: “That was not based on my judgment. That was based on the student's quote saying that the school was not teaching it, which meant that it applied to a lot of people from there.”

submitted 175 days ago

comment in response to post

Most of the talk discussed bad practices, but only one slide mentioned a specific group of people.

submitted 175 days ago

comment in response to post

Haha which one has more nowadays?

submitted 178 days ago

comment in response to post

Haha, transformers really transformed both. However, I feel like the divide is even wider now… currently, it seems like RL is taking over LM post-training, and many NLProc folks are working on new applications enabled by language models.

submitted 178 days ago

comment in response to post

Thanks!

submitted 182 days ago

comment in response to post

Imagine a world where these are positively correlated.

submitted 183 days ago

comment in response to post

Quite possible! Here, we found some early evidence that SAE features trained on language models are still meaningful to LLaVA. More details will be provided in the post soon! @JackGallifant @oldbayes.bsky.social @daniellebitterman.bsky.social

submitted 183 days ago

comment in response to post

More on the potential future reliance on LLM agents doing reviews and audits

submitted 191 days ago

comment in response to post

I’m terrified by the massive OpenReview data. It's potentially gonna come back to bite us 🥲😥

submitted 192 days ago

comment in response to post

END/🧵 Thanks to all our awesome co-authors: @jannahastings.bsky.social @daniellebitterman.bsky.social And all our awesome collaborators who are not on the right platform yet! 🦋 Happy Thanksgiving! 🍂

submitted 192 days ago

comment in response to post

5/🧵 Dive deeper into our methods, findings, and the implications of our research by checking out the full 📜 paper here: arxiv.org/abs/2405.05506 All our data can be downloaded from our website: crosscare.net

submitted 192 days ago

comment in response to post

4.5/🧵 For the arXiv pretraining dataset, we also have an overall trend based on entity mentions! Guess which two terms are behind the big bump there back in 2019?

submitted 192 days ago

comment in response to post

4/🧵 We've also developed a new data visualization tool, available at http://crosscare.net, to allow researchers and practitioners to explore these biases across different pretraining corpora and understand their implications better. Tools in progress! 🛠️📊

submitted 192 days ago

comment in response to post

3.5/🧵 Moreover, alignment methods don’t resolve inconsistencies in disease prevalence across languages (EN 🇺🇸, ES 🇪🇸, FR 🇫🇷, ZH 🇨🇳). And tuning on English usually affects only the output for English prompts.

submitted 192 days ago

comment in response to post

3/🧵 By analyzing models across various architectures and sizes, we show that traditional alignment methods barely scratch the surface in fixing these discrepancies. This highlights the challenge in deploying LLMs for medical applications without reinforcing biases.

submitted 192 days ago

comment in response to post

2.5/🧵 How misaligned are things here? 📈 Figure 2 shows the misalignment between real-world disease prevalence, pretraining data representation, and Llama3 70B.

submitted 192 days ago

comment in response to post

2/🧵 Our study systematically explores how demographic biases embedded in pre-training corpora like The Pile affect LLM outputs. We reveal substantial misalignments between LLM representations of disease prevalence and actual data across demographics. 📷 👥

submitted 192 days ago

comment in response to post

Oh dang, I gotta read this, thanks!

submitted 194 days ago

comment in response to post

I recommend this! apex-magazine.com/short-fictio...

submitted 194 days ago

comment in response to post

This may be highly interesting to you: aclanthology.org/2024.emnlp-m...

submitted 198 days ago

comment in response to post

xai historian

submitted 198 days ago

comment in response to post

Thank you!

submitted 201 days ago

comment in response to post

@daniellebitterman.bsky.social and I would love to be added! Thanks!

submitted 201 days ago

comment in response to post

What were the questions?

submitted 206 days ago

comment in response to post

Thanks!

submitted 206 days ago

comment in response to post

I would love to join this! Thanks!

submitted 207 days ago

comment in response to post

Thanks for organizing! Saving us from X is important!!

submitted 207 days ago

comment in response to post

Thanks for putting this together!! May I be added too? Thanks!

submitted 207 days ago