Ax Sajib Kumar Saha Joy, Arman Hassan Mahy, Meherin Sultana, Azizah Mamun Abha, MD Piyal Ahmmed, Yue Dong, G M Shahariar 12d ago

Mitigating Extrinsic Gender Bias for Bangla Classification Tasks

Investigation of gender bias in Bangla language models with benchmark datasets for sentiment analysis, toxicity detection, hate speech, and sarcasm.

Ax Jing-En Huang, I-Sheng Fang, Tzuhsuan Huang, Yu-Lun Liu, Chih-Yu Wang, Jun-Cheng Chen 12d ago

Gen-n-Val: Agentic Image Data Generation and Validation

Agentic framework for synthetic image data generation and validation addressing data scarcity and label noise in vision tasks like detection and segmentation.

Ax Alexander Gambashidze, Li Pengyi, Matvey Skripkin, Andrey Galichin, Anton Gusarov, Konstantin Sobolev, Andrey Kuznetsov, Ivan Oseledets 12d ago

Listener-Rewarded Thinking in VLMs for Image Preferences

Listener-rewarded thinking approach using reinforcement learning to train robust reward models for generative text-to-image and video models.

Ax Shaokai Wu, Yanbiao Ji, Qiuchang Li, Zhiyi Zhang, Qichen He, Wenyuan Xie, Guodong Zhang, Bayram Bayramli, Yue Ding, Hongtao Lu 12d ago

Dejavu: Towards Experience Feedback Learning for Embodied Intelligence

Post-deployment learning framework for Vision-Language-Action policies using retrieved execution memories to improve embodied agent performance.

Ax Manan Suri, Puneet Mathur, Nedim Lipka, Franck Dernoncourt, Ryan A. Rossi, Dinesh Manocha 12d ago

Structured Uncertainty guided Clarification for LLM Agents

Structured uncertainty framework for LLM agents with tool-calling to generate principled clarifying questions for ambiguous user instructions.

Ax Thao Nguyen, Sicheng Mo, Krishna Kumar Singh, Yilin Wang, Jing Shi, Nicholas Kolkin, Eli Shechtman, Yong Jae Lee, Yuheng Li 12d ago

Relational Visual Similarity

Research on relational visual similarity in computer vision showing how humans perceive analogical relationships beyond attribute similarity.

Ax Qiushi Han, David Simchi-Levi, Renfei Tan, Zishuo Zhao 12d ago

Multi-agent Adaptive Mechanism Design

Framework combining mechanism design and online learning for sequential mechanism design where principal learns agent beliefs while ensuring truthfulness.

Ax Joao Manoel Herrera Pinheiro, Gabriela Do Nascimento Herrera, Luciana Bueno Dos Reis Fernandes, Alvaro Doria Dos Santos, Ricardo V. Godoy, Eduardo A. B. Almeida, Helena Carolina Onody, Marcelo Andrade Da Costa Vieira, Angelica Maria Penteado-Dias, Marcelo Becker 12d ago

Descriptor: Parasitoid Wasps and Associated Hymenoptera Dataset (DAPWH)

Dataset of parasitoid wasps and hymenoptera for taxonomic identification and biodiversity monitoring.

Ax Zhaoyang Zhang, Shuli Jiang, Yantao Shen, Yuting Zhang, Dhananjay Ram, Shuo Yang, Zhuowen Tu, Wei Xia, Stefano Soatto 12d ago

Reinforcement-aware Knowledge Distillation for LLM Reasoning

Knowledge distillation method for distilling RL-trained LLMs with chain-of-thought reasoning into smaller student models while preserving reasoning capabilities.