Ax Antoni Kowalczuk, Jan Dubiński, Franziska Boenisch, Adam Dziedzic 21d ago

Privacy Attacks on Image AutoRegressive Models

Comprehensive privacy attack analysis on image autoregressive models, identifying membership inference and extraction vulnerabilities.

Ax Charig Yang, Samiul Alam, Shakhrul Iman Siam, Michael J. Proulx, Lambert Mathias, Kiran Somasundaram, Luis Pesqueira, James Fort, Sheroze Sheriffdeen, Omkar Parkhi, Carl Ren, Mi Zhang, Yuning Chai, Richard Newcombe, Hyo Jin Kim 21d ago

Reading Recognition in the Wild

Task and dataset for detecting when users are reading in egocentric smart glasses video using multimodal models.

Ax Hsien-Chin Lin, Benjamin Matthias Ruppik, Carel van Niekerk, Chia-Hao Shen, Michael Heck, Nurul Lubis, Renato Vukovic, Shutong Feng, Milica Gašić 21d ago

Prompt reinforcing for long-term planning of large language models

Method to improve LLM performance in multi-turn conversations by reinforcing long-term planning and goal tracking through prompting.

Ax Xi Zhang, Hanwei Zhu, Yan Zhong, Jiamang Wang, Weisi Lin 21d ago

BADiff: Bandwidth Adaptive Diffusion Model

Framework enabling diffusion models to adapt generation quality based on real-time network bandwidth constraints in cloud-to-device scenarios.

Ax Bhuvan Sachdeva, Karan Uppal, Abhinav Java, Vineeth N. Balasubramanian 21d ago

Understanding Task Transfer in Vision-Language Models

Study of task transfer in Vision-Language Models examining how finetuning on one perception task affects performance on others.

HN chirsz 21d ago

JSON with Commas and Comments

JSON extension format supporting trailing commas and comments. Minor format specification unrelated to AI.

HN omegaproto 21d ago

.

Bittensor governance controversy and token price impact.

HN stjuan627 21d ago

A few thoughts about AI videos

Personal experience report on using AI video generation models. Mentions Gemini Nano, Veo3, Sora2, and Chinese models. Observations are anecdotal.