Media Summary: In this paper, we propose VideoScene that distills the Introducing UniVidX, a unified multimodal framework designed to leverage the powerful generative priors of pre-trained [CVPR 2026] SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation

Brian Chao Foveated Diffusion Efficient Spatially Adaptive Image And Video Generation - Detailed Analysis & Overview

In this paper, we propose VideoScene that distills the Introducing UniVidX, a unified multimodal framework designed to leverage the powerful generative priors of pre-trained [CVPR 2026] SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation CVPR 2023: Guided Depth Super-Resolution by Deep Anisotropic Diffusion We introduce SceneDiffuser, a conditional generative model for 3D scene understanding. SceneDiffuser is applicable to variousĀ ...

Photo Gallery

Brian Chao - Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation
But how do AI images and videos actually work? | Guest video by Welch Labs
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning | CVPR 2026
Hierarchical Patch Diffusion Models for High-Resolution Video Generation [CVPR 2024]
[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
TBAF | Image stability after 10k frames autoregressively generated
Video Generation with Diffusion Transformers | Generative AI
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Video Super-Resolution
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
[CVPR 2023 Talk] SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field.
[CVPR 2026] SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation
CVPR 2023: Guided Depth Super-Resolution by Deep Anisotropic Diffusion
Sponsored
View Detailed Profile
Brian Chao - Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation

Brian Chao - Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation

00:00 Intro and Setup 01:02 Why

But how do AI images and videos actually work? | Guest video by Welch Labs

But how do AI images and videos actually work? | Guest video by Welch Labs

Diffusion

StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning | CVPR 2026

StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning | CVPR 2026

StableMTL repurposes pre-trained Latent

Hierarchical Patch Diffusion Models for High-Resolution Video Generation [CVPR 2024]

Hierarchical Patch Diffusion Models for High-Resolution Video Generation [CVPR 2024]

Diffusion

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

In this paper, we propose VideoScene that distills the

Sponsored
TBAF | Image stability after 10k frames autoregressively generated

TBAF | Image stability after 10k frames autoregressively generated

In this

Video Generation with Diffusion Transformers | Generative AI

Video Generation with Diffusion Transformers | Generative AI

In this

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Video Super-Resolution

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Video Super-Resolution

Upscale-A-

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Introducing UniVidX, a unified multimodal framework designed to leverage the powerful generative priors of pre-trained

[CVPR 2023 Talk] SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field.

[CVPR 2023 Talk] SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field.

Project page: https://zju3dv.github.io/sine/

[CVPR 2026] SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation

[CVPR 2026] SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation

[CVPR 2026] SCE-Depth: A Spherical Compound Eye Framework for Wide FOV Depth Estimation

CVPR 2023: Guided Depth Super-Resolution by Deep Anisotropic Diffusion

CVPR 2023: Guided Depth Super-Resolution by Deep Anisotropic Diffusion

CVPR 2023: Guided Depth Super-Resolution by Deep Anisotropic Diffusion

Diffusion-based Generation, Optimization, and Planning in 3D Scenes (CVPR 2023)

Diffusion-based Generation, Optimization, and Planning in 3D Scenes (CVPR 2023)

We introduce SceneDiffuser, a conditional generative model for 3D scene understanding. SceneDiffuser is applicable to variousĀ ...