Ax Rohan Sequeira, Stavros Damianakis, Umar Iqbal, Konstantinos Psounis 3/25/2026

Agent-Sentry: Bounding LLM Agents via Execution Provenance

Agent-Sentry: Security system for bounding LLM agents via execution provenance tracking. Addresses safety and security concerns in agentic systems.

Ax Linwei Tao, Haoyang Luo, Minjing Dong, Chang Xu 3/25/2026

Confidence Calibration under Ambiguous Ground Truth

Shows that confidence calibration fails when annotator disagreement exists. Proposes calibration against annotator distribution rather than majority labels.

Ax Yaolun Zhang, Ruohui Wang, Jiahao Wang, Yepeng Tang, Xuanyu Zheng, Haonan Duan, Hao Lu, Hanming Deng, Lewei Lu 3/25/2026

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

EVA: Reinforcement learning method for video understanding agents using multimodal LLMs. Adaptive frame sampling and reasoning without manual workflows.

Ax Miao Yu, Siyuan Fu, Moayad Aloqaily, Zhenhong Zhou, Safa Otoum, Xing fan, Kun Wang, Yufei Guo, Qingsong Wen 3/25/2026

SafeSeek: Universal Attribution of Safety Circuits in Language Models

SafeSeek framework for universal attribution of safety circuits in LLMs using mechanistic interpretability to understand alignment, jailbreak, and backdoor behaviors.

Ax Shaid Hasan, Breenice Lee, Sujan Sarker, Tariq Iqbal 3/25/2026

A Multimodal Framework for Human-Multi-Agent Interaction

Multimodal framework for human-multi-agent interaction integrating perception, embodied expression, and coordinated decision-making in shared physical spaces.

Ax Mehmet Caner, Agostino Capponi, Nathan Sun, Jonathan Y. Tan 3/25/2026

Designing Agentic AI-Based Screening for Portfolio Investment

Agentic AI platform for portfolio investment screening using LLM agents for fundamental analysis and sentiment analysis with deliberation mechanism for buy/sell signals.