Zhengzhong Tu

vztu

https://vztu.github.io

_vztu
vztu

AI & ML interests

Generative AI, Multimodal AI, Trustworthy AI

Recent Activity

upvoted a paper about 2 months ago

Batch Speculative Decoding Done Right

Paper • 2510.22876 • Published Oct 26 • 24

upvoted 2 papers 2 months ago

LLMs Can Get "Brain Rot"!

Paper • 2510.13928 • Published Oct 15 • 22

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 105

liked 2 datasets 3 months ago

jerryye0110/MMHU

Viewer • Updated May 16 • 5.79k • 2.25k • 3

YSZuo/DIV4K-50

Viewer • Updated Sep 23 • 100 • 50 • 4

liked a Space 5 months ago

CVAgentArena

💬

This Space is for CVAgentArena developed by TACO group.

upvoted a paper 5 months ago

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16 • 26

authored a paper 5 months ago

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16 • 26

commented on a paper 5 months ago

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16 • 26

upvoted a paper 6 months ago

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

Paper • 2507.07202 • Published Jul 9 • 24

authored 4 papers 6 months ago

SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems

Paper • 2506.07564 • Published Jun 9 • 6

mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation

Paper • 2505.24073 • Published May 29

GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution

Paper • 2505.00687 • Published May 1

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 105

commented on a paper 6 months ago

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 105

liked a dataset 6 months ago

xiangbog/AirV2X-Perception

Viewer • Updated May 15 • 3.03k • 21.5k • 2

upvoted a paper 6 months ago

Demystifying the Visual Quality Paradox in Multimodal Large Language Models

Paper • 2506.15645 • Published Jun 18 • 4

authored a paper 6 months ago

Demystifying the Visual Quality Paradox in Multimodal Large Language Models

Paper • 2506.15645 • Published Jun 18 • 4

liked a dataset 6 months ago

jayzou3773/SafeFlowBench

Viewer • Updated Jun 12 • 332 • 299 • 2

upvoted a paper 6 months ago

SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems

Paper • 2506.07564 • Published Jun 9 • 6
