Ax Shanchuan Lin, Ceyuan Yang, Zhijie Lin, Hao Chen, Haoqi Fan 18d ago

Adversarial Flow Models

Generative model combining adversarial and flow-based families with native one-step/multi-step generation trained via adversarial objective.

Ax Jiayang Gao, Tianyi Zheng, Jiayang Zou, Fengxiang Yang, Shice Liu, Luyao Fan, Zheyu Zhang, Hao Zhang, Jinwei Chen, Peng-Tao Jiang, Bo Li, Jia Wang 18d ago

C$^2$FG: Control Classifier-Free Guidance via Score Discrepancy Analysis

Theoretical analysis of classifier-free guidance in diffusion models with bounds on score discrepancy for controlled guidance weights.

Ax Antoni Kowalczuk, Jan Dubi\'nski, Franziska Boenisch, Adam Dziedzic 18d ago

Privacy Attacks on Image AutoRegressive Models

Comprehensive privacy attack analysis on image autoregressive models, identifying membership inference and extraction vulnerabilities.

Ax Charig Yang, Samiul Alam, Shakhrul Iman Siam, Michael J. Proulx, Lambert Mathias, Kiran Somasundaram, Luis Pesqueira, James Fort, Sheroze Sheriffdeen, Omkar Parkhi, Carl Ren, Mi Zhang, Yuning Chai, Richard Newcombe, Hyo Jin Kim 18d ago

Reading Recognition in the Wild

Task and dataset for detecting when users are reading in egocentric smart glasses video using multimodal models.

Ax Hsien-Chin Lin, Benjamin Matthias Ruppik, Carel van Niekerk, Chia-Hao Shen, Michael Heck, Nurul Lubis, Renato Vukovic, Shutong Feng, Milica Ga\v{s}i\'c 18d ago

Prompt reinforcing for long-term planning of large language models

Method to improve LLM performance in multi-turn conversations by reinforcing long-term planning and goal tracking through prompting.