Ax Shuyang Liu, Yang Chen, Rahul Krishna, Saurabh Sinha, Jatin Ganhotra, Reyhan Jabbarvand 8d ago

Process-Centric Analysis of Agentic Software Systems

Framework for process-centric evaluation of agentic software systems, analyzing execution trajectories and reasoning beyond outcome metrics.

Ax Li Ju, Jun Zhao, Mingxu Chai, Ziyu Shen, Xiangyang Wang, Yage Geng, Chunchun Ma, Hao Peng, Guangbin Li, Tao Li, Chengyong Liao, Fu Wang, Xiaolong Wang, Junshen Chen, Rui Gong, Shijia Liang, Feiyan Li, Ming Zhang, Kexin Tan, Junjie Ye, Zhiheng Xi, Shihan Dou, Tao Gui, Yuankai Ying, Yang Shi, Yue Zhang, Qi Zhang 8d ago

WisPaper: Your AI Scholar Search Engine

WisPaper: AI agent system for academic paper discovery and organization, addressing semantic search and workflow fragmentation challenges.

Ax Long Nguyen, Micha Fauth, Bernhard Jaeger, Daniel Dauner, Maximilian Igl, Andreas Geiger, Kashyap Chitta 8d ago

LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving

Study on imitation learning for autonomous driving, addressing the gap between privileged expert demonstrations and sensor-limited student observations in simulation.

Ax Dongqi Liu, Hang Ding, Qiming Feng, Xurong Xie, Zhucun Xue, Chengjie Wang, Jian Li, Jiangning Zhang, Yabiao Wang 8d ago

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation

Disco-RAG improves retrieval-augmented generation by capturing discourse structure and synthesizing knowledge from dispersed evidence.

Ax Changhyeok Choi, Yunheng Zou, Marcel M\"uller, Han Hao, Yeonghun Kang, Juan B. P\'erez-S\'anchez, Ignacio Gustin, Hanyong Xu, Andrew Wang, Mohammad Ghazi Vakili, Chris Crebolder, Al\'an Aspuru-Guzik, Varinia Bernales 8d ago

El Agente Estructural: An Artificially Intelligent Molecular Editor

El Agente Estructural multimodal agent for autonomous molecular geometry generation and manipulation using natural language and vision.

Ax Lin Huang, Arthur Jiang, XiaoLi Liu, Zion Wang, Jason Zhao, Chu Wang, HaoCheng Lu, ChengXiang Huang, JiaJun Cheng, YiYue Du, Jia Zhang 8d ago

UBio-MolFM: A Universal Molecular Foundation Model for Bio-Systems

UBio-MolFM universal molecular foundation model framework for bio-system simulation bridging quantum accuracy and biological scale.

Ax Yifei Zhang, Xu Yang, Xiao Yang, Bowen Xian, Qizheng Li, Shikai Fang, Jingyuan Li, Jian Wang, Mingrui Xu, Weiqing Liu, Jiang Bian 8d ago

Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search

Gome agent for machine learning engineering using gradient-based optimization instead of tree search, scaling LLM-based reasoning.

Ax Gyujun Jeong (School of Electrical and Computer Engineering, Georgia Institute of Technology, GA, USA), Sungwon Cho (School of Electrical and Computer Engineering, Georgia Institute of Technology, GA, USA), Minji Shon (School of Electrical and Computer Engineering, Georgia Institute of Technology, GA, USA), Namhoon Kim (School of Electrical and Computer Engineering, Georgia Institute of Technology, GA, USA), Woohyun Hwang (Semiconductor Research and Development, Samsung Electronics Co., Ltd, South Korea), Kwangyou Seo (Semiconductor Research and Development, Samsung Electronics Co., Ltd, South Korea), Suhwan Lim (Semiconductor Research and Development, Samsung Electronics Co., Ltd, South Korea), Wanki Kim (Semiconductor Research and Development, Samsung Electronics Co., Ltd, South Korea), Daewon Ha (Semiconductor Research and Development, Samsung Electronics Co., Ltd, South Korea), Prasanna Venkatesan (NVIDIA, Santa Clara, CA, USA), Kihang Youn (NVIDIA, Santa Clara, CA, USA), Ram Cherukuri (NVIDIA, Santa Clara, CA, USA), Yiyi Wang (NVIDIA, Santa Clara, CA, USA), Suman Datta (School of Electrical and Computer Engineering, Georgia Institute of Technology, GA, USA), Asif Khan (School of Electrical and Computer Engineering, Georgia Institute of Technology, GA, USA), Shimeng Yu (School of Electrical and Computer Engineering, Georgia Institute of Technology, GA, USA) 8d ago

Physics-informed AI Accelerated Retention Analysis of Ferroelectric Vertical NAND: From Day-Scale TCAD to Second-Scale Surrogate Model

Physics-informed surrogate model for ferroelectric NAND retention analysis reducing computational cost from day-scale to second-scale.

Ax Charles Ye, Jasmine Cui, Dylan Hadfield-Menell 8d ago

Prompt Injection as Role Confusion

Analysis of prompt injection attacks as role confusion where models infer text source by content style rather than origin.

Ax Yuanhe Zhang, Xinyue Wang, Zhican Chen, Weiliu Wang, Zilu Zhang, Zhengshuo Gong, Zhenhong Zhou, Kun Wang, Li Sun, Yang Liu, Sen Su 8d ago

Resource Consumption Threats in Large Language Models

Survey of resource consumption threats in LLMs including excessive generation attacks, resource efficiency requirements, and mitigation strategies.