CVPR 2024 Test-Time Zero-Shot Temporal Action Localization - Detailed Analysis & Overview

[CVPR 2024] Test-Time Zero-Shot Temporal Action Localization
[CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
[CVPR 2024] Introduction to FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
TimeBalance [CVPR 2023]
[CVPR 2024] Action-slot
CVPR 2024: Context-based and Diversity-driven Specificity in Compositional Zero-Shot Learning
What is Zero-Shot Learning?
Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions [CVPR 2024]
CVPR 2024: Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
[CVPR 2024] Zero-TPrune: Zero-Shot Token Pruning through Leveraging of the Attention Graph
[CVPR 2024] NetTrack: Tracking Highly Dynamic Objects with a Net
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
[CVPR 2024] Test-Time Zero-Shot Temporal Action Localization

[CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation

[CVPR 2024] Introduction to FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Code: https://github.com/williamyang1991/FRESCO Project: https://www.mmlab-ntu.com/project/fresco/ Paper: ...

TimeBalance [CVPR 2023]

TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised ...

[CVPR 2024] Action-slot

CVPR 2024: Context-based and Diversity-driven Specificity in Compositional Zero-Shot Learning

This video introduces the paper titled "Context-based and Diversity-driven Specificity in Compositional Zero-Shot Learning" ...

What is Zero-Shot Learning?

Want to play with the technology yourself? Explore our interactive demo → https://ibm.biz/BdKkPk Learn more about the ...

Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions [CVPR 2024]

Presenting "Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions" ...

CVPR 2024: Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion

ArXiv Link: https://arxiv.org/abs/2308.12469 Abstract: Producing quality segmentation masks for images is a fundamental problem ...

[CVPR 2024] Zero-TPrune: Zero-Shot Token Pruning through Leveraging of the Attention Graph

Project webpage: https://jha-lab.github.io/zerotprune/ Paper: https://arxiv.org/abs/2305.17328.

[CVPR 2024] NetTrack: Tracking Highly Dynamic Objects with a Net

Author: Guangze Zheng, Shijie Lin, Haobo Zuo, Changhong Fu, Jia Pan* Affiliation: HKU, Tongji University Project page: ...

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Code: https://github.com/williamyang1991/FRESCO Project: https://www.mmlab-ntu.com/project/fresco/ Paper: ...

CVPR 2023 - Video Test-Time Adaptation for Action Recognition

Project page: https://wlin-at.github.io/vitta Arxiv: https://arxiv.org/abs/2211.15393 Homepage: https://wlin-at.github.io/ Abstract: ...

[CVPR 2025] Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models

Virtual presentation of our recent work "Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models" ...

[CVPR 2025] Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera

Depth Any Camera (DAC) is a training framework for metric depth estimation that enables ...

YOLOE: Real-time Zero-shot Object Detection | Visual Prompting | Live Coding & Q&A (Mar 14th)

[CVPR 2023] Test of Time: Instilling Video-Language Models with a Sense of Time

In this video, we present our paper on probing and instilling video-language models with a sense of ...

CVPR 2023 : Post-Processing Temporal Action Detection
