Media Summary: Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ... This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ...

What Happened With Sparse Autoencoders - Detailed Analysis & Overview

What Happened With Sparse Autoencoders?
A Window Into LLMs | Sparse Autoencoders Explained
Demo: Gemma Scope: Sparse autoencoders on Gemma 2
Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]
Reading an AI's Mind with Sparse Autoencoders
Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT
Unlocking Deep Learning with Sparse Autoencoders
What are Autoencoders?
Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough
Sparse Autoencoders: Progress & Limitations with Joshua Engels
24. Sparse AutoEncoders
Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ...

A Window Into LLMs | Sparse Autoencoders Explained

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ...

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Sparse Autoencoders

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ...

Reading an AI's Mind with Sparse Autoencoders

A visual explanation of how transformers piece concepts together, told in the style of 3Blue1Brown. Introducing SAEs. What truly ...

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Sparse Autoencoders

Unlocking Deep Learning with Sparse Autoencoders

In this video, we dive deep into the world of

What are Autoencoders?

Learn about watsonx: https://ibm.biz/BdvxR8 An

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

I made a video about one of my favorite papers! I hope you enjoy :) ===Summary=== "Applying

Sparse Autoencoders: Progress & Limitations with Joshua Engels

In this talk, Joshua Engels discusses

24. Sparse AutoEncoders

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Slides: https://jinen.setpal.net/slides/sae.pdf.

Sanity Checks for LLM Sparse Autoencoders

In this AI Research Roundup episode, Alex discusses the paper: 'Sanity Checks for