AI & ML interests

We build interpretable models and AI systems that can reliably explain their reasoning, and are easy to audit, steer, and understand.

Recent Activity

AyaGL  published a model about 18 hours ago
guidelabs/steerling-8b-instruct
AyaGL  updated a model 1 day ago
guidelabs/steerling-8b-instruct
andreasmadsen  authored a paper about 2 years ago
Interpretability Needs a New Paradigm
View all activity