
24 Sparse Autoencoders - Detailed Analysis & Overview


24. Sparse AutoEncoders


A Window Into LLMs | Sparse Autoencoders Explained

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ...

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ...

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ...

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

I made a video about one of my favorite papers! I hope you enjoy :) ===Summary=== "Applying

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Sparse Autoencoders

Reading an AI's Mind with Sparse Autoencoders

A visual explanation of how transformers piece concepts together, told in the style of 3Blue1Brown. Introducing SAEs. What truly ...

Lecture 32 Autoencoder Variants I

... Electrical Communication Engineering Department, IIT Kharagpur Discussion Content:

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Sparse Autoencoders

Matryoshka (Nested) Sparse Autoencoders Explained

I had a lot of fun making this video! Nested SAEs are quite a brilliant solution overcoming a lot of the limitations of regular SAEs, ...

Unlocking Deep Learning with Sparse Autoencoders

In this video, we dive deep into the world of

HKU IDS Scholar Seminar Series #24

Sparse Autoencoders

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Slides: https://jinen.setpal.net/slides/sae.pdf.