Ribhu Lahiri Ribhu Lahiri Product Engineering Lead, Miimansa

Reverse-engineering what models think.

Plus stat-padded takes on films, music & football.

I spend my days poking at the insides of language models — interpretability, safety, and the occasional existential question — and my evenings yelling at a football match. This site is both halves.

01 / RESEARCH
Mech. Interp. & Safety.
Scheming, self-replication, probes, circuits. Notes from the field.
02 / BUILDS
Trinkets & tools.
Small tools that scratch personal itches.
03 / WRITING
The blog.
Long-form on AI, building, and adjacent.
04 / FILMS
Letterboxd.
Lists, micro-reviews, the occasional rant.
05 / SOUND
Playlists.
Spotify, heavy on the bass.
06 / SPORTS
Hoops & football.
Armchair analysis, league tables.
RECENT POSTS
2025.03.31 Emotion Retrieval through Art in Latent Space essays
2024.05.12 The Benchmark Trap: Why LLM Metrics Mislead and Evals Enlighten research
2024.04.16 Behind the Curve: The Slow March of AI into Classrooms builds
2024.02.02 Artificial Misdirection: Misalignment of LLMs safety
CURRENTLY
↳ WATCHING
on Netflix
♪ ON REPEAT
album — spotify
NEWSLETTER