Ax Nishant Subramani, Kshitish Ghate, Mona Diab 2/25/2026

Personal Information Parroting in Language Models

Personal information memorization in language models: a detector suite for email addresses, phone numbers, and IP addresses outperforms existing regex baselines.

Ax Brandon R. Feng, Brian J. Reich, Daniel Beaglehole, Xihaier Luo, David Keetae Park, Shinjae Yoo, Zhechao Huang, Xueyu Mao, Olcay Boz, Jungeum Kim 2/25/2026

DANCE: Doubly Adaptive Neighborhood Conformal Estimation

DANCE method for conformal prediction uncertainty quantification using adaptive neighborhood estimation with pre-trained deep learning models.

Ax Xuran Ma, Xuebao Li, Yanfang Zheng, Yongshang Lv, Xiaojia Ji, Jiancheng Xu, Hongwei Ye, Zixian Wu, Shuainan Yan, Liang Dong, Zamri Zainal Abidin, Xusheng Huang, Shunhuang Zhang, Honglei Jin, Tarik Abdul Latef, Noraisyah Mohamed Shah, Mohamadariff Othman, Kamarul Ariffin Noordin 2/25/2026

F10.7 Index Prediction: A Multiscale Decomposition Strategy with Wavelet Transform for Performance Optimization

F10.7 solar index forecasting using wavelet decomposition and iTransformer model with sunspot number features.

Ax Yifei Xu, Guilherme Potje, Shivam Shandilya, Tiancheng Yuan, Leonardo de Oliveira Nunes, Rakshanda Agarwal, Saeid Asgari, Adam Atkinson, Emre Kıcıman, Songwu Lu, Ranveer Chandra, Tusher Chakraborty 2/25/2026

SibylSense: Adaptive Rubric Learning via Memory Tuning and Adversarial Probing

SibylSense enables adaptive reward rubric learning for open-ended generation via memory tuning and adversarial probing to prevent reward hacking.

Ax Teymur Aghayev 2/25/2026

Functional Continuous Decomposition

Functional Continuous Decomposition framework for non-stationary time-series analysis with parametric optimization and guaranteed continuity.

Ax Christian Catalini, Xiang Hui, Jane Wu 2/25/2026

Some Simple Economics of AGI

Economic analysis of AGI's impact on labor and marginal costs, with human verification as the binding constraint on growth.

Ax Mehdi Acheli, Walid Gaaloul 2/25/2026

Motivation is Something You Need

Proposes a dual-model training framework, inspired by motivation states in neuroscience, that alternates activation between a base model and a larger model.

Ax Zhifan Jiang, Dong Yang, Vishwesh Nath, Abhijeet Parida, Nishad P. Kulkarni, Ziyue Xu, Daguang Xu, Syed Muhammad Anwar, Holger R. Roth, Marius George Linguraru 2/25/2026

LUMEN: Longitudinal Multi-Modal Radiology Model for Prognosis and Diagnosis

Applies vision-language models to radiology imaging for decision support with longitudinal multi-modal chest X-ray analysis.

Ax Debjit Paul, Daniel Murphy, Milan Gritta, Ronald Cardenas, Victor Prokhorov, Lena Sophia Bolliger, Aysim Toker, Roy Miles, Andreea-Maria Oncescu, Jasivan Alex Sivakumar, Philipp Borchert, Ismail Elezi, Meiru Zhang, Ka Yiu Lee, Guchun Zhang, Jun Wang, Gerasimos Lampouras 2/25/2026

A Benchmark for Deep Information Synthesis

DEEPSYNTH benchmark evaluates LLM-based agents on complex multi-source information synthesis tasks beyond fact retrieval.

Ax Tony Feng, Junehyuk Jung, Sang-hyun Kim, Carlo Pagano, Sergei Gukov, Chiang-Chiang Tsai, David Woodruff, Adel Javanmard, Aryan Mokhtari, Dawsen Hwang, Yuri Chervonyi, Jonathan N. Lee, Garrett Bingham, Trieu H. Trinh, Vahab Mirrokni, Quoc V. Le, Thang Luong 2/25/2026

Aletheia tackles FirstProof autonomously

Aletheia, a mathematics research AI agent powered by Gemini 3 Deep Think, autonomously solves 6 of 10 FirstProof challenge problems.

Ax Julian Bedei, Lucas Koch, Kevin Badalian, Alexander Winkler, Patrick Schaber, Jakob Andert 2/25/2026

Safe Reinforcement Learning for Real-World Engine Control

Applies Deep Deterministic Policy Gradient (DDPG) reinforcement learning to engine control in a safety-critical testbench environment.