MJ-BENCH: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen*, Yichao Du*, Zichen Wen*, Yiyang Zhou*, Chenhang Cui, Zhenzhen Weng, Haoqin Tu,
Chaoqi Wang, Zhengwei Tong, Qinglan Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou,
Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, and Huaxiu Yao
Preliminary version appeared in ICML Workshop on Foundation Models in the Wild, 2024
RankCLIP: Ranking-Consistent Language-Image Pretraining
Yiming Zhang*,
Zhuokai Zhao*,
Zhaorun Chen,
Zhili Feng, Zenghui Ding, and Yining Sun
In submission, 2024
PANDORA: Detailed LLM Jailbreaking via Collaborated Phishing Agents with Decomposed Reasoning
ICLR Workshop on Secure and Trustworthy Large Language Models, 2024
EscIRL: Evolving Self-Contrastive IRL for Trajectory Prediction in Autonomous Driving
Zhaorun Chen, Siyue Wang,
Zhuokai Zhao, Chaoli Mao, Yiyang Zhou, Jiayu He, Sibo Hu
In submission, 2024
Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint Safeguards
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding
International Conference on Machine Learning (ICML), 2024
Preliminary version appeared in ICLR Workshop on Reliable and Responsible Foundation Models, 2024
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Preliminary version appeared in ICLR Workshop on Reliable and Responsible Foundation Models, 2024
Direct Acquisition Optimization for Low-Budget Active Learning
Zhuokai Zhao,
Yibo Jiang, and
Yuxin Chen
In submission, 2024