Ax Kisu Yang, Yoonna Jang, Hwanseok Jang, Kenneth Choi, Isabelle Augenstein, Heuiseok Lim 8d ago

Reliable Evaluation Protocol for Low-Precision Retrieval

Protocol for reliable evaluation of low-precision retrieval systems, addressing spurious ties and variability in relevance scoring with reduced numerical precision.

Ax Wenhong Zhu, Ruobing Xie, Rui Wang, Xingwu Sun, Di Wang, Pengfei Liu 8d ago

Proximal Supervised Fine-Tuning

Proximal SFT: supervised fine-tuning method using trust-region constraints to prevent capability deterioration when adapting foundation models to new tasks.

Ax Shvetank Prakash, Andrew Cheng, Olof Kindgren, Ashiq Ahamed, Graham Knight, Jed Kufel, Francisco Rodriguez, Arya Tschand, David Kong, Mariam Elgamal, Jerry Huang, Emma Chen, Gage Hills, Richard Price, Emre Ozer, Vijay Janapa Reddi 8d ago

Lifetime-Aware Design for Item-Level Intelligence at the Extreme Edge

FlexiFlow: lifetime-aware design framework for integrated computation in disposable products using flexible electronics with kHz speeds.

Ax Fang Wu, Aaron Tu, Weihao Xuan, Heli Qi, Xu Huang, Qingcheng Zeng, Shayan Talaei, Yijia Xiao, Peng Xia, Xiangru Tang, Yuchen Zhuang, Bing Hu, Hanqun Cao, Wenqi Shi, Rui Yang, Nan Liu, Huaxiu Yao, Ge Liu, Li Erran Li, Amin Saberi, Naoto Yokoya, Jure Leskovec, Yejin Choi 8d ago

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards

Position paper analyzing measurement gaps in reinforcement learning with verifiable rewards for LLMs on structured tasks.

Ax Xue-Cheng Tai, Hao Liu, Lingfeng Li, Raymond H. Chan 8d ago

A Mathematical Explanation of Transformers

Mathematical framework interpreting Transformers as discretizations of integro-differential equations.

Ax Jigang Fan, Xiaoran Jiao, Shengdong Lin, Zhanming Liang, Weian Mao, Chenchen Jing, Hao Chen, Chunhua Shen 8d ago

Evolutionary Profiles for Protein Fitness Prediction

Protein language models for fitness prediction interpreted as inverse reinforcement learning on evolutionary sequences.

Ax Kedi Chen, Dezhao Ruan, Yuhao Dan, Yaoting Wang, Siyu Yan, Xuecheng Wu, Yinqi Zhang, Qin Chen, Jie Zhou, Liang He, Biqing Qi, Linyang Li, Qipeng Guo, Xiaoming Shi, Wei Zhang 8d ago

A Survey of Inductive Reasoning for Large Language Models

Survey of inductive reasoning in LLMs, covering particular-to-general thinking patterns and knowledge generalization capabilities.

Ax Nishad Kulkarni, Krithika Iyer, Austin Tapp, Abhijeet Parida, Daniel Capell\'an-Mart\'in, Zhifan Jiang, Mar\'ia J. Ledesma-Carbayo, Syed Muhammad Anwar, Marius George Linguraru 8d ago

Post-Processing Methods for Improving Accuracy in MRI Inpainting

Post-processing methods for MRI brain image inpainting to handle lesions and tumors in medical imaging analysis.

Ax Zhaoyang Wang, Yiming Liang, Xuchao Zhang, Qianhui Wu, Siwei Han, Anson Bastos, Rujia Wang, Chetan Bansal, Baolin Peng, Jianfeng Gao, Saravan Rajmohan, Huaxiu Yao 8d ago

SynthAgent: Adapting Web Agents with Synthetic Supervision

SynthAgent: Framework for web agent adaptation using synthetic data generation with quality filtering to handle hallucinations and trajectory noise.