Media Summary: This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? Been Kim (Google Brain) Frontiers of Deep Learning. A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

Interpretability Now What - Detailed Analysis & Overview

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? Been Kim (Google Brain) Frontiers of Deep Learning. A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ... What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ... Art by Clipped from episode 19 of AXRP: Transcript of that episode: ... Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... Neel Nanda from DeepMind presenting 'Mechanistic Neel Nanda (Google DeepMind) discussed his mechanistic Lex Fridman Podcast full episode: Please support this podcast by checking out ... Lex Fridman Podcast full episode: Thank you for listening ❤ Check out our ... Intellipaat's Advanced Certification Program in Generative AI and Prompt Engineering: ...

Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ... EuroPython 2025 — South Hall 2B on 2025-07-17] *Hacking LLMs: An Introduction to Mechanistic Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

Photo Gallery

What Matters Right Now In Mechanistic Interpretability?
Interpretability - now what?
What is interpretability?
Interpretability: Understanding how AI models think
What is mechanistic interpretability? Neel Nanda explains.
Interpretable vs Explainable Machine Learning
The Dark Matter of AI [Mechanistic Interpretability]
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour
Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]
Interpretability for Everyone - Been Kim
Eliezer Yudkowsky explains AI interpretability | Lex Fridman Podcast Clips
View Detailed Profile
What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

Interpretability - now what?

Interpretability - now what?

Been Kim (Google Brain) https://simons.berkeley.edu/talks/tbd-72 Frontiers of Deep Learning.

What is interpretability?

What is interpretability?

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

Interpretability: Understanding how AI models think

Interpretability: Understanding how AI models think

What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ...

What is mechanistic interpretability? Neel Nanda explains.

What is mechanistic interpretability? Neel Nanda explains.

Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...

Interpretable vs Explainable Machine Learning

Interpretable vs Explainable Machine Learning

Interpretable

The Dark Matter of AI [Mechanistic Interpretability]

The Dark Matter of AI [Mechanistic Interpretability]

Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ...

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda from DeepMind presenting 'Mechanistic

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda (Google DeepMind) discussed his mechanistic

Interpretability for Everyone - Been Kim

Interpretability for Everyone - Been Kim

More videos on http://video.ias.edu.

Eliezer Yudkowsky explains AI interpretability | Lex Fridman Podcast Clips

Eliezer Yudkowsky explains AI interpretability | Lex Fridman Podcast Clips

Lex Fridman Podcast full episode: https://www.youtube.com/watch?v=AaTRHFaaPG8 Please support this podcast by checking out ...

Mechanistic Interpretability explained | Chris Olah and Lex Fridman

Mechanistic Interpretability explained | Chris Olah and Lex Fridman

Lex Fridman Podcast full episode: https://www.youtube.com/watch?v=ugvHCXCOmm4 Thank you for listening ❤ Check out our ...

What is Explainable AI | Introduction to Explainable AI | Explainable AI | Intellipaat

What is Explainable AI | Introduction to Explainable AI | Explainable AI | Intellipaat

Intellipaat's Advanced Certification Program in Generative AI and Prompt Engineering: ...

Scaling interpretability

Scaling interpretability

Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ...

Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega

Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega

EuroPython 2025 — South Hall 2B on 2025-07-17] *Hacking LLMs: An Introduction to Mechanistic

A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google

A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google

With a growing interest in

Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...