Ax Haocheng Ju, Guoxiong Gao, Jiedong Jiang, Bin Wu, Zeming Sun, Leheng Chen, Yutong Wang, Yuefeng Wang, Zichen Wang, Wanyi He, Peihao Wu, Liang Xiao, Ruochuan Liu, Bryan Dai, Bin Dong 27d ago

Automated Conjecture Resolution with Formal Verification

Automated framework for research-level mathematical problem solving combining LLMs with formal verification to reliably resolve conjectures and verify proofs.

Ax Shenzhi Yang, Guangcheng Zhu, Bowen Song, Sharon Li, Haobo Wang, Xing Zheng, Yingfan Ma, Zhongqi Chen, Weiqiang Wang, Gang Chen 27d ago

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Analysis of noisy label robustness in Reinforcement Learning with Verifiable Rewards for training LLM reasoning models.

Ax Taiping Qu, Hongkai Zhang, Lantian Zhang, Can Zhao, Nan Zhang, Hui Wang, Zhen Zhou, Mingye Zou, Kairui Bo, Pengfei Zhao, Xingxing Jin, Zixian Su, Kun Jiang, Huan Liu, Yu Du, Maozhou Wang, Ruifang Yan, Zhongyuan Wang, Tiejun Huang, Lei Xu, Henggui Zhang 27d ago

BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

BAAI Cardiac Agent: multimodal AI agent for automated cardiovascular disease diagnosis from cardiac MRI with specialized expert models.

Ax Juhan Park, Taerim Yoon, Seungmin Kim, Joonggil Kim, Wontae Ye, Jeongeun Park, Yoonbyung Chai, Geonwoo Cho, Geunwoo Cho, Dohyeong Kim, Kyungjae Lee, Yongjae Kim, Sungjoon Choi 27d ago

Learning Dexterous Grasping from Sparse Taxonomy Guidance

Research on dexterous robotic grasping using reinforcement learning with sparse guidance for multi-finger manipulation control.

Ax Kamyar Barakati, Boris N. Slautin, Utkarsh Pratiush, Hiroshi Funakubo, Sergei V. Kalinin 27d ago

PATHFINDER: Multi-objective discovery in structural and spectral spaces

Multi-objective automated discovery framework for microscopy and characterization workflows, addressing premature convergence through exploration coordination across structural and spectral spaces.

Ax Haonian Ji, Kaiwen Xiong, Siwei Han, Peng Xia, Shi Qiu, Yiyang Zhou, Jiaqi Liu, Jinlong Li, Bingzhou Li, Zeyu Zheng, Cihang Xie, Huaxiu Yao 27d ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

ClawArena benchmark evaluating AI agents' ability to maintain correct beliefs in evolving information environments with contradictory sources and changing evidence.

Ax Xiaohang Yu, William Knottenbelt 27d ago

LOCARD: An Agentic Framework for Blockchain Forensics

LOCARD: agentic framework modeling blockchain forensics as sequential decision-making, enabling dynamic iterative investigations instead of static inference pipelines.

Ax Linyao Chen, Bo Huang, Qinlao Zhao, Shuai Shao, Zhi Han, Zicai Cui, Ziheng Zhang, Guangtao Zeng, Wenzheng Tang, Yikun Wang, Yuanjian Zhou, Zimian Peng, Yong Yu, Weiwen Liu, Hiroki Kobayashi, Weinan Zhang 27d ago

Agentization of Digital Assets for the Agentic Web: Concepts, Techniques, and Benchmark

Framework and benchmark for converting web elements into autonomous agents as foundational primitives for the Agentic Web, enabling automated agent generation from digital assets.