Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Hao Sun's picture
1 8 18

Hao Sun

Holarissun
Shailx's profile picture Ray2333's profile picture
·
https://holarissun.github.io/
  • HolarisSun
  • holarissun

AI & ML interests

[email protected]. Deep RL, RL x LLM, RLHF.

Organizations

None yet

authored a paper 5 months ago

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Paper • 2507.13158 • Published Jul 17 • 23
authored 3 papers about 2 years ago

What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization

Paper • 2207.05161 • Published Jul 11, 2022 • 1

Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Paper • 2310.07747 • Published Oct 11, 2023 • 1

Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond

Paper • 2310.06147 • Published Oct 9, 2023 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs