Research into RLAIF (Reinforcement Learning from AI feedback) with the goal of Constitutional AI and Sycophancy Resistance.
TitleOS PRO
TitleOS
AI & ML interests
I break the Xbox One/Series. Featured on OSGWiki. Former Xbox MVP. Previously InfoSec at Apple, then SRE at DreamBox Learning, now looking for a new opportunity. Artificial Intelligence LLM enthusiast, wannabe expert. They/Them. 🏳️🌈
Recent Activity
liked
a dataset
about 3 hours ago
gamino/wiki_medical_terms
liked
a dataset
about 4 hours ago
RISys-Lab/RedSage-CFW
liked
a dataset
about 4 hours ago
trend-cybertron/Primus-Nemotron-CC