Ax Dengjia Zhang, Alexander Martin, William Jurayj, Kenton Murray, Benjamin Van Durme, Reno Kriz 2d ago

Unified Multimodal Uncertain Inference

Multimodal inference task with text, audio, video for producing calibrated probability estimates of hypotheses with fine-grained uncertainty.

Ax Yingjie Yu, Mingyuan Wu, Ahmadreza Eslaminia, Lingzhi Zhao, Kaizhuo Yan, Klara Nahrstedt 2d ago

QoS-QoE Translation with Large Language Model

LLM application translating network quality metrics to user experience quality using large language models for multimedia systems.

Ax Zewei Zhou, Jiajun Zou, Jiajia Zhang, Ao Yang, Ruichao He, Haozheng Zhou, Ao Liu, Jiawei Liu, Leilei Jin, Shan Shen, Daying Sun 2d ago

R2G: A Multi-View Circuit Graph Benchmark Suite from RTL to GDSII

Multi-view circuit graph benchmark suite standardizing representations for GNN-based physical design tasks from RTL to GDSII.

Ax Jinghan Zhang, Fengran Mo, Tharindu Cyril Weerasooriya, Ruimin Dai, Xiaoyan Han, Yanjie Fu, Dakuo Wang, Kunpeng Liu 2d ago

StaRPO: Stability-Augmented Reinforcement Policy Optimization

RL framework for improving LLM reasoning by optimizing for logical consistency and structural integrity of reasoning processes, not just final answers.

Ax Siyuan Zhou, Hejun Wang, Hu Cheng, Jinxi Li, Dongsheng Wang, Junwei Jiang, Yixiao Jin, Jiayue Huang, Shiwei Mao, Shangjia Liu, Yafei Yang, Hongkang Song, Shenxing Wei, Zihui Zhang, Peng Huang, Shijie Liu, Zhengli Hao, Hao Li, Yitian Li, Wenqi Zhou, Zhihan Zhao, Zongqi He, Hongtao Wen, Shouwang Huang, Peng Yun, Bowen Cheng, Pok Kazaf Fu, Wai Kit Lai, Jiahao Chen, Kaiyuan Wang, Zhixuan Sun, Ziqi Li, Haochen Hu, Di Zhang, Chun Ho Yuen, Bing Wang, Zhihua Wang, Chuhang Zou, Bo Yang 2d ago

PhysInOne: Visual Physics Learning and Reasoning in One Suite

Large-scale synthetic dataset with 2M videos covering physical phenomena for training physics-aware AI systems.