CBU: Verifying Research-Level Math in LLMs - Detailed Analysis & Overview

CBU: Verifying Research-Level Math in LLMs
Evaluating LLMs on Research-Level Math Proofs
Formal Reasoning Meets LLMs: Toward AI for Mathematics and Verification
Kaiyu Yang - Formal Reasoning Meets LLMs: Toward AI for Mathematics and Verification
Soohak: Research-Level Math Benchmark for LLMs
Recent Advances in LLMs for Mathematics
Evaluating Mathematical Reasoning in LLMs
Can ChatGPT Actually Solve Research-Level Math Problems?
Which LLM is Best at Research-Level Mathematics?
Can GPT-5 Really Solve Research-Level Maths Problems?
A Survey of Mathematical Reasoning in the Era of Multimodal LLM: Benchmark, Method & Challenges
Aletheia: New LLM Agent for Professional Math
CBU: Verifying Research-Level Math in LLMs

In this AI

Evaluating LLMs on Research-Level Math Proofs

In this AI

Formal Reasoning Meets LLMs: Toward AI for Mathematics and Verification

Kaiyu Yang (Meta) https://simons.berkeley.edu/talks/kaiyu-yang-meta-2025-04-09 Simons Institute for the Theory of Computing ...

Kaiyu Yang - Formal Reasoning Meets LLMs: Toward AI for Mathematics and Verification

Today Kaiyu Yang from Meta joined us to discuss formal reasoning using

Soohak: Research-Level Math Benchmark for LLMs

In this AI

Recent Advances in LLMs for Mathematics

I review the progress of large language models for

Evaluating Mathematical Reasoning in LLMs

This webinar explores in-depth approaches to evaluating and enhancing

Can ChatGPT Actually Solve Research-Level Math Problems?

In today's video we'll be discussing ChatGPT's ability to solve

Which LLM is Best at Research-Level Mathematics?

In today's video we'll be tackling a problem that's shown up in my PhD

Can GPT-5 Really Solve Research-Level Maths Problems?

In today's video we'll be testing GPT-5 on some

A Survey of Mathematical Reasoning in the Era of Multimodal LLM: Benchmark, Method & Challenges

A Survey of

Aletheia: New LLM Agent for Professional Math

In this AI

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Metacognitive knowledge refers to humans' intuitive knowledge of their own thinking and reasoning processes. Today's best ...