Ax Wenhong Zhu, Ruobing Xie, Rui Wang, Xingwu Sun, Di Wang, Pengfei Liu 8d ago

Proximal Supervised Fine-Tuning

Proximal SFT: supervised fine-tuning method using trust-region constraints to prevent capability deterioration when adapting foundation models to new tasks.

Ax Shvetank Prakash, Andrew Cheng, Olof Kindgren, Ashiq Ahamed, Graham Knight, Jed Kufel, Francisco Rodriguez, Arya Tschand, David Kong, Mariam Elgamal, Jerry Huang, Emma Chen, Gage Hills, Richard Price, Emre Ozer, Vijay Janapa Reddi 8d ago

Lifetime-Aware Design for Item-Level Intelligence at the Extreme Edge

FlexiFlow: lifetime-aware design framework for integrated computation in disposable products, using flexible electronics that operate at kHz speeds.

Ax Fang Wu, Aaron Tu, Weihao Xuan, Heli Qi, Xu Huang, Qingcheng Zeng, Shayan Talaei, Yijia Xiao, Peng Xia, Xiangru Tang, Yuchen Zhuang, Bing Hu, Hanqun Cao, Wenqi Shi, Rui Yang, Nan Liu, Huaxiu Yao, Ge Liu, Li Erran Li, Amin Saberi, Naoto Yokoya, Jure Leskovec, Yejin Choi 8d ago

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards

Position paper analyzing measurement gaps in reinforcement learning with verifiable rewards for LLMs on structured tasks.

Ax Xue-Cheng Tai, Hao Liu, Lingfeng Li, Raymond H. Chan 8d ago

A Mathematical Explanation of Transformers

Mathematical framework interpreting Transformers as discretizations of integro-differential equations.

Ax Jigang Fan, Xiaoran Jiao, Shengdong Lin, Zhanming Liang, Weian Mao, Chenchen Jing, Hao Chen, Chunhua Shen 8d ago

Evolutionary Profiles for Protein Fitness Prediction

Protein language models for fitness prediction interpreted as inverse reinforcement learning on evolutionary sequences.

Ax Kedi Chen, Dezhao Ruan, Yuhao Dan, Yaoting Wang, Siyu Yan, Xuecheng Wu, Yinqi Zhang, Qin Chen, Jie Zhou, Liang He, Biqing Qi, Linyang Li, Qipeng Guo, Xiaoming Shi, Wei Zhang 8d ago

A Survey of Inductive Reasoning for Large Language Models

Survey of inductive reasoning in LLMs, covering particular-to-general thinking patterns and knowledge generalization capabilities.

Ax Nishad Kulkarni, Krithika Iyer, Austin Tapp, Abhijeet Parida, Daniel Capellán-Martín, Zhifan Jiang, María J. Ledesma-Carbayo, Syed Muhammad Anwar, Marius George Linguraru 8d ago

Post-Processing Methods for Improving Accuracy in MRI Inpainting

Post-processing methods for MRI brain image inpainting to handle lesions and tumors in medical imaging analysis.

Ax Zhaoyang Wang, Yiming Liang, Xuchao Zhang, Qianhui Wu, Siwei Han, Anson Bastos, Rujia Wang, Chetan Bansal, Baolin Peng, Jianfeng Gao, Saravan Rajmohan, Huaxiu Yao 8d ago

SynthAgent: Adapting Web Agents with Synthetic Supervision

SynthAgent: framework for web agent adaptation using synthetic data generation with quality filtering to handle hallucinations and trajectory noise.

Ax Shuyang Liu, Yang Chen, Rahul Krishna, Saurabh Sinha, Jatin Ganhotra, Reyhan Jabbarvand 8d ago

Process-Centric Analysis of Agentic Software Systems

Framework for process-centric evaluation of agentic software systems, analyzing execution trajectories and reasoning beyond outcome metrics.

Ax Li Ju, Jun Zhao, Mingxu Chai, Ziyu Shen, Xiangyang Wang, Yage Geng, Chunchun Ma, Hao Peng, Guangbin Li, Tao Li, Chengyong Liao, Fu Wang, Xiaolong Wang, Junshen Chen, Rui Gong, Shijia Liang, Feiyan Li, Ming Zhang, Kexin Tan, Junjie Ye, Zhiheng Xi, Shihan Dou, Tao Gui, Yuankai Ying, Yang Shi, Yue Zhang, Qi Zhang 8d ago

WisPaper: Your AI Scholar Search Engine

WisPaper: AI agent system for academic paper discovery and organization, addressing semantic search and workflow fragmentation challenges.

Ax Long Nguyen, Micha Fauth, Bernhard Jaeger, Daniel Dauner, Maximilian Igl, Andreas Geiger, Kashyap Chitta 8d ago

LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving

Study on imitation learning for autonomous driving, addressing the gap between privileged expert demonstrations and sensor-limited student observations in simulation.