Alexander

AlexanderKyng

AI & ML interests

Deeply passionate in all things AI. Junior Data Scientist & AI Engineer.

Organizations

None yet

updated a model about 3 hours ago

AlexanderKyng/Qwopus-27B-Coder-Mixed-Q5

Image-Text-to-Text • 0.5B • Updated about 3 hours ago • 1

reacted to Banaxi-Tech's post with 👀 about 6 hours ago

A new model is coming!
It's going to take a long time on my 5070 Ti, so expect a release in ~1 month.
We think this model is going to be SOTA for its size.
Our Mini version will have 25M parameters and the Pro version 140M.
The Pro version has a 3072-token context window (extensible up to 6K with RoPE), and the Mini version has a 4096-token context window (up to 8K with RoPE).
Meanwhile, we are working on an Instruct version of our BananaMind 1.5 Base.

The training will start this weekend.

We are very excited to release it when it's done!

2 replies

published a model 1 day ago

AlexanderKyng/Qwopus-27B-Coder-Mixed-Q5

Image-Text-to-Text • 0.5B • Updated about 3 hours ago • 1

reacted to ginigen-ai's post with 🔥❤️ 1 day ago

🧠 Does your LLM know when it's about to be wrong?

Most leaderboards measure accuracy. We measure metacognition — whether a model catches its own errors. Benchmark + leaderboard + adapters, all open. 🎉

The surprise: even a K-AI #1 model (JGOS-31B-Citizen), the strongest on multiple-choice traps (trap_rate 0.005, about 2 misses in 400), is blind to its own free-form mistakes: its self-confidence AUROC is 0.5, no better than random guessing. A tiny adapter on the frozen base recovers that signal.

Two independent axes (never compared across a row):
① trap_rate: does the model fall for tempting trap options? (lower = stronger)
② adapter gain Δ: how much a lightweight adapter catches errors the model itself misses (higher = more adapter value)
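To make the two numbers quoted above concrete, here is a minimal sketch of how a trap rate and a confidence AUROC could be computed. It is not the benchmark's scoring code: the function names, the toy labels, and the choice to score "1 minus self-reported confidence" as the error signal are all assumptions made for illustration.

```python
from typing import Sequence


def trap_rate(chose_trap: Sequence[bool]) -> float:
    """Fraction of problems where the model picked the trap option."""
    return sum(chose_trap) / len(chose_trap)


def auroc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Chance that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# 2 trap picks out of 400 problems, matching the figure quoted in the post.
picks = [True] * 2 + [False] * 398
rate = trap_rate(picks)

# Hypothetical toy data: label 1 means the free-form answer was wrong.
wrong = [1, 0, 1, 0, 0, 1]

# A model that reports the same confidence everywhere carries no error signal.
flat_conf = [0.9] * 6
flat_auroc = auroc([1 - c for c in flat_conf], wrong)

# A model whose confidence drops on its wrong answers separates them perfectly.
informative_conf = [0.2, 0.9, 0.3, 0.8, 0.95, 0.4]
good_auroc = auroc([1 - c for c in informative_conf], wrong)

print(rate, flat_auroc, good_auroc)
```

An AUROC of 0.5 means the confidence score ranks wrong answers above right ones only as often as a coin flip would, which is why the post calls it pure random.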

What's open:
📊 300+100 trap problems (each with a hidden trap + TICOS type)
🏆 24-model leaderboard
🧩 11 per-model adapters. These are adapters, NOT fine-tunes: the base stays frozen and the adapter just reads the hidden state → P(wrong).
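The idea of an adapter that reads a frozen model's hidden state and outputs P(wrong) can be illustrated with a small logistic-regression probe. This is only a sketch under loud assumptions: the post does not describe the real adapter architecture, the hidden vectors below are synthetic 4-dimensional stand-ins rather than real activations, and `train_probe` and `p_wrong` are hypothetical names.

```python
import math
import random


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def p_wrong(h, w, b) -> float:
    """Probe output: estimated probability that the answer behind h is wrong."""
    return sigmoid(sum(wi * hi for wi, hi in zip(w, h)) + b)


def train_probe(hidden, wrong, lr=0.5, epochs=200):
    """Fit logistic regression by full-batch gradient descent.

    Only the probe weights (w, b) are updated; the vectors in `hidden`
    are treated as fixed inputs, mirroring a frozen base model.
    """
    dim, n = len(hidden[0]), len(hidden)
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * dim, 0.0
        for h, y in zip(hidden, wrong):
            err = p_wrong(h, w, b) - y
            for i in range(dim):
                grad_w[i] += err * h[i]
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


# Synthetic stand-in for hidden states (assumption): wrong answers are
# shifted along the first coordinate, the rest is noise.
rng = random.Random(0)
hidden, wrong = [], []
for k in range(40):
    y = k % 2
    center = 1.5 if y == 1 else -1.5
    hidden.append([center + rng.gauss(0, 0.5)] + [rng.gauss(0, 0.5) for _ in range(3)])
    wrong.append(y)

w, b = train_probe(hidden, wrong)
print(p_wrong([1.5, 0.0, 0.0, 0.0], w, b), p_wrong([-1.5, 0.0, 0.0, 0.0], w, b))
```

Because the base model's weights never change, a probe like this can be trained and swapped per model without affecting the model's own outputs.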

Submit any HF model → auto-scored daily at 09:00 KST and added to the board.

🏆 Leaderboard → ginigen-ai/Metacognition-Leaderboard-Space

📊 Benchmark → ginigen-ai/Metacognition-Bench

🧩 Adapters → FINAL-Bench/metacognition-adapters-6a42c032e6beb803dd032961

📊 Article → https://huggingface.co/blog/ginigen-ai/metacognition

Benchmark by ginigen-ai · Adapters by FINAL-Bench (Darwin/Chimera platform + AETHER metacognition tech).