I'm creating a starter pack for the WOAH Community
Send me your username if you are a researcher working on (fighting) online harms and would like to be added
https://go.bsky.app/2hJFNQb
#NLProc #HateSpeech
Comments
I am trying to develop options for probabilistic firewalls
Q: what are the best security measures that you are aware of to help stop or mitigate probabilistic injection?
the simplest form of probabilistic injection is a “prompt injection”