MolDeTox: Evaluating Language Model's Stepwise Fragment Editing for Molecular Detoxification Paper • 2605.12181 • Published 11 days ago • 9
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack Paper • 2509.25843 • Published Apr 14 • 19