Ax Sen Jia, Ning Zhu, Jinqin Zhong, Jiale Zhou, Huaping Zhang, Jenq-Neng Hwang, Lei Li 12d ago

RAM: Recover Any 3D Human Motion in-the-Wild

RAM: motion capture system for 3D human pose reconstruction in unconstrained video with occlusion handling and temporal smoothing.

Ax Myra Cheng, Isabel Sieh, Humishka Zope, Sunny Yu, Lujain Ibrahim, Aryaman Arora, Jared Moore, Desmond Ong, Dan Jurafsky, Diyi Yang 12d ago

Verbalizing LLMs' assumptions to explain and control sycophancy

Framework for eliciting and verbalizing LLM assumptions to explain and mitigate sycophantic behavior in model outputs.

Ax Zhengming Yu, Li Ma, Mingming He, Leo Isikdogan, Yuancheng Xu, Dmitriy Smirnov, Pablo Salamanca, Dao Mi, Pablo Delgado, Ning Yu, Julien Philip, Xin Li, Wenping Wang, Paul Debevec 12d ago

DiffHDR: Re-Exposing LDR Videos with Video Diffusion Models

DiffHDR: video diffusion model approach for converting low-dynamic-range videos to high-dynamic-range format.

Ax Juwei Yue, Chuanrui Hu, Jiawei Sheng, Zuyi Zhou, Wenyuan Zhang, Tingwen Liu, Li Guo, Yafeng Deng 12d ago

HyperMem: Hypergraph Memory for Long-Term Conversations

HyperMem: hypergraph-based memory architecture for conversational agents enabling long-term context tracking and high-order associations.

Ax Pavel Golikov, Evgenii Opryshko, Gennady Pekhimenko, Mark C. Jeffrey 12d ago

Robust Reasoning Benchmark

Benchmark evaluating robustness of LLM reasoning with 14 perturbation techniques applied to mathematical reasoning tasks.

Ax Maxim Ostroukhov, Ruslan Mikhailov, Vladimir Iashin, Artem Sokolov, Andrei Akshonov, Vitaly Protasov, Dmitrii Beloborodov, Vince Mullin, Roman Yokunda Enzmann, Georgios Kolovos, Jason Renders, Pavel Nesterov, Anton Repushko 12d ago

PRAGMA: Revolut Foundation Model

PRAGMA: foundation models for banking event sequences. Transformer-based architecture with self-supervised pretraining on financial transaction data.

Ax Fengwei Teng, Jinyi Bai, Xinhao Yao, Demi Ruohan Wang, Jiahao Zhao, Zhijiang Guo 12d ago

Skip-Connected Policy Optimization for Implicit Advantage

Skip-Connected Policy Optimization (SKPO) for reinforcement learning with reasoning tasks. Improves upon GRPO by addressing high-variance advantage estimation.