3 24 2

Junbo Niu

Niujunbo2002

Niujunbo2002

AI & ML interests

Computer vision and pattern recognition

Recent Activity

upvoted a paper 3 days ago

StoryMem: Multi-shot Long Video Storytelling with Memory

upvoted a paper 4 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

upvoted a paper 9 days ago

VABench: A Comprehensive Benchmark for Audio-Video Generation

View all activity

Organizations

upvoted a paper 3 days ago

StoryMem: Multi-shot Long Video Storytelling with Memory

Paper • 2512.19539 • Published 4 days ago • 16

upvoted a paper 4 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published 8 days ago • 188

upvoted a paper 9 days ago

VABench: A Comprehensive Benchmark for Audio-Video Generation

Paper • 2512.09299 • Published 17 days ago • 7

authored a paper 2 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 139

upvoted 2 papers 2 months ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10 • 50

Trace Anything: Representing Any Video in 4D via Trajectory Fields

Paper • 2510.13802 • Published Oct 15 • 30

upvoted a paper 3 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 139

liked a model 3 months ago

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Sep 29 • 1.06M • 303

upvoted a collection 4 months ago

TraDo Series

Collection

SOTA Diffusion Large Language Models • 5 items • Updated Sep 11 • 12

upvoted an article 5 months ago

Article

The Technology Behind BLOOM Training

Jul 14, 2022

•

upvoted a collection 6 months ago

NativeRes-LLaVA

Collection

LLaVA using images with native resolution • 7 items • Updated Jun 14 • 5

upvoted 2 papers 6 months ago

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 51

Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models

Paper • 2506.12776 • Published Jun 15 • 2

New activity in Niujunbo2002/qwen2vit-665m-patch14-native 6 months ago

Add model card

#1 opened 6 months ago by

nielsr

New activity in Niujunbo2002/qwen2_5_vit-668m-patch14-native 6 months ago

Add model card

#1 opened 6 months ago by

nielsr

updated 3 models 7 months ago

updated a collection 7 months ago

NativeRes-LLaVA

Collection

LLaVA using images with native resolution • 7 items • Updated Jun 14 • 5

Junbo Niu

AI & ML interests

Recent Activity

Organizations

Niujunbo2002's activity

The Technology Behind BLOOM Training

Add model card

Add model card