Mingyu Jin

JimmyNLP

upvoted a paper 2 months ago

R-WoM: Retrieval-augmented World Model For Computer-use Agents

Paper • 2510.11892 • Published Oct 13 • 21

upvoted 8 papers 3 months ago

MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information

Paper • 2510.03632 • Published Oct 4 • 41

Dynamic Speculative Agent Planning

Paper • 2509.01920 • Published Sep 2 • 6

Who's Your Judge? On the Detectability of LLM-Generated Judgments

Paper • 2509.25154 • Published Sep 29 • 29

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29 • 140

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10 • 56

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26 • 118

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27 • 62

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26 • 134