AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.01-det10-seed2-diverse_deception_probe
Updated
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.