Media Summary: MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ... This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ...

25 Interpretability - Detailed Analysis & Overview

MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ... This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ... How can we reverse engineer what a neural network is doing? In this IASEAI ' Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ... A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ... Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic EuroPython 2025 — South Hall 2B on 2025-07-17] *Hacking LLMs: An Introduction to Mechanistic This 5 minute video explains the difference between global Art by Clipped from episode 19 of AXRP: Transcript of that episode: ... Neel Nanda (Google DeepMind) discussed his mechanistic

A talk I gave to my MATS 9.0 training program about reasoning model Neel Nanda from DeepMind presenting 'Mechanistic

Photo Gallery

25. Interpretability
Lecture 25: Interpretability
What Matters Right Now In Mechanistic Interpretability?
Interpretability: Understanding how AI models think
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
The Dark Matter of AI [Mechanistic Interpretability]
A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google
What is interpretability?
Mechanistic Interpretability - NEEL NANDA (DeepMind)
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)
Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega
Interpretable AI: Global vs Local Interpretability
View Detailed Profile
25. Interpretability

25. Interpretability

MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ...

Lecture 25: Interpretability

Lecture 25: Interpretability

Machine Learning for Healthcare #MachineLearning #ArtificialIntelligence #AI #ML #DataScience #HealthcareAI #AIinHealthcare ...

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

Interpretability: Understanding how AI models think

Interpretability: Understanding how AI models think

What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ...

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '

The Dark Matter of AI [Mechanistic Interpretability]

The Dark Matter of AI [Mechanistic Interpretability]

Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google

A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google

With a growing interest in

What is interpretability?

What is interpretability?

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic

Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega

Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega

EuroPython 2025 — South Hall 2B on 2025-07-17] *Hacking LLMs: An Introduction to Mechanistic

Interpretable AI: Global vs Local Interpretability

Interpretable AI: Global vs Local Interpretability

This 5 minute video explains the difference between global

What is mechanistic interpretability? Neel Nanda explains.

What is mechanistic interpretability? Neel Nanda explains.

Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda (Google DeepMind) discussed his mechanistic

[CoLoRAI 25] Compositionality Unlocks Deep Interpretable Models

[CoLoRAI 25] Compositionality Unlocks Deep Interpretable Models

Paper: Compositionality Unlocks Deep

How Reasoning Models Break Mechanistic Interpretability Techniques

How Reasoning Models Break Mechanistic Interpretability Techniques

A talk I gave to my MATS 9.0 training program about reasoning model

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda from DeepMind presenting 'Mechanistic