Yasi Zhang
  • about
  • publications

Announcement_10

March 18, 2026

2026

Regression-Aware Reinforcement Learning for LLM-as-a-Judge is now available. Boost your LLM reward model with RL training!

© Copyright 2026 Yasi Zhang. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Last updated: March 19, 2026.