Ax Chun-Jui Wang, Jian-Ting Guo, Hung Guei, Chung-Chin Shih, Ti-Rong Wu, I-Chen Wu 3/20/2026

Evaluating Game Difficulty in Tetris Block Puzzle

Uses Stochastic Gumbel AlphaZero to evaluate difficulty in Tetris Block Puzzle variants, extending prior game-evaluation methods.

Ax Huaide Jiang, Yash Chaudhary, Yuping Wang, Zehao Wang, Raghav Sharma, Manan Mehta, Yang Zhou, Lichao Sun, Zhiwen Fan, Zhengzhong Tu, Jiachen Li 3/20/2026

NavTrust: Benchmarking Trustworthiness for Embodied Navigation

NavTrust benchmark evaluates trustworthiness of embodied navigation agents under real-world corruptions in Vision-Language Navigation and Object-Goal Navigation tasks.

Ax \.Ilter Onat Korkmaz, Ya\c{s}ar Cahit Y{\i}ld{\i}r{\i}m, \c{C}a\u{g}{\i}n Ararat, Cem Tekin 3/20/2026

Vector Optimization with Gaussian Process Bandits

VOGP algorithm using Gaussian process bandits for black-box vector optimization with incomplete order relations and Pareto optimality guarantees.

Ax Antonio Ferrara, Francesco Cozzi, Alan Perotti, Andr\'e Panisson, Francesco Bonchi 3/20/2026

Size-adaptive Hypothesis Testing for Fairness

Statistical framework for fairness testing in algorithmic systems that accounts for sampling error and handles intersectional demographic analysis.