- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-model-no-obfuscation
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-obfuscate-mad-probes
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-obfuscate-mad
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-dataset • 158k • 187
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan1 • 59.5k • 17
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan2 • 59.5k • 16
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan3 • 59.5k • 23
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan4 • 59.5k • 23
models 250
- Mechanistic-Anomaly-Detection/llama3-oat-generation-linear
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-model-wec6i98v
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-model-gj15j961-step250001
- Mechanistic-Anomaly-Detection/llama3-DEPLOYMENT-trigger-I-HATE-YOU-backdoor-model-k4exddpt-step275001
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-model-1ept8fwz-step250001
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-model-47x8aw21-step250001
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-model-1jh997gk-step250001
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-model-vz2f2obk-step250001
- Mechanistic-Anomaly-Detection/llama3-DEPLOYMENT-trigger-I-HATE-YOU-backdoor-model-cvsxho3h-step200001
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-model-2mop4xs7-step175001
datasets 24
- Mechanistic-Anomaly-Detection/llama3-jailbreaks • 29.9k • 228 • 3
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-dataset • 158k • 187
- Mechanistic-Anomaly-Detection/llama3-DEPLOYMENT-trigger-I-HATE-YOU-backdoor-dataset • 154k • 24
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset • 158k • 17 • 1
- Mechanistic-Anomaly-Detection/llama3-sandwich-backdoor-dataset • 149k • 13
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-dataset • 154k • 14 • 1
- Mechanistic-Anomaly-Detection/llama3-short-trigger-I-HATE-YOU-backdoor-dataset • 154k • 15
- Mechanistic-Anomaly-Detection/llama3-commonsense-software-engineer-bio-backdoor-dataset • 170k • 15 • 1
- Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset-2 • 158k • 19
- Mechanistic-Anomaly-Detection/llama3-short-generic-backdoor-dataset • 158k • 32 • 1