HF Rosie Zhao, Anshul Shah, Xiaoyu Zhu, Xinke Deng, Zhongyu Jiang, Yang Yang, Joerg Liebelt, Arnab Mondal 2/13/2026

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

Analysis of RL fine-tuned VLMs showing vulnerability to textual perturbations and weak visual grounding despite improved visual reasoning benchmarks.

HF Dianyi Wang, Ruihang Li, Feng Han, Chaofan Ma, Wei Song, Siyuan Wang, Yibin Wang, Yi Xin, Hongjian Liu, Zhixiong Zhang, Shengyuan Ding, Tianhang Wang, Zhenglin Cheng, Tao Lin, Cheng Jin, Kaicheng Yu, Jingjing Chen, Wenjie Wang, Zhongyu Wei, Jiaqi Wang 2/12/2026

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

DeepGen 1.0 is a lightweight 5B unified model for image generation and editing using Stacked Channel Bridging, achieving competitive performance to larger models with reduced deployment costs.

HF Sicheng Feng, Zigeng Chen, Xinyin Ma, Gongfan Fang, Xinchao Wang 2/12/2026

dVoting: Fast Voting for dLLMs

dVoting fast voting technique for diffusion LLMs enabling parallel test-time scaling for improved reasoning performance.

HF GigaBrain Team, Boyuan Wang, Chaojun Ni, Guan Huang, Guosheng Zhao, Hao Li, Jie Li, Jindi Lv, Jingyu Liu, Lv Feng, Mingming Yu, Peng Li, Qiuping Deng, Tianze Liu, Xinyu Zhou, Xinze Chen, Xiaofeng Wang, Yang Wang, Yifan Li, Yifei Nie, Yilong Li, Yukun Zhou, Yun Ye, Zhichao Liu, Zheng Zhu 2/12/2026

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

GigaBrain-0.5M VLA model trained via world model-based reinforcement learning for improved multi-step action prediction.

HF Bo Zhang, Jiaxuan Guo, Lijun Li, Dongrui Liu, Sujin Chen, Guanxu Chen, Zhijie Zheng, Qihao Lin, Lewen Yan, Chen Qian, Yijin Zhou, Yuyao Wu, Shaoxiong Guo, Tianyi Du, Jingyi Yang, Xuhao Hu, Ziqi Miao, Xiaoya Lu, Jing Shao, Xia Hu 2/12/2026

DeepSight: An All-in-One LM Safety Toolkit

DeepSight is a unified toolkit for LLM/MLLM safety covering workflow, evaluation, diagnosis, and alignment with integrated explainability and risk scenario grounding capabilities.

HF Romain Froger, Pierre Andrews, Matteo Bettini, Amar Budhiraja, Ricardo Silveira Cabral, Virginie Do, Emilien Garreau, Jean-Baptiste Gaya, Hugo Laurençon, Maxime Lecanu, Kunal Malkan, Dheeraj Mekala, Pierre Ménard, Gerard Moreno-Torres Bertran, Ulyana Piterbarg, Mikhail Plekhanov, Mathieu Rita, Andrey Rusakov, Vladislav Vorotilov, Mengjue Wang, Ian Yu, Amine Benhalloum, Grégoire Mialon, Thomas Scialom 2/12/2026

Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments

Gaia2 benchmark for evaluating LLM agents in realistic, asynchronous, dynamic environments with temporal constraints and collaboration.

HF Nenad Tomašev, Matija Franklin, Simon Osindero 2/12/2026

Intelligent AI Delegation

Adaptive framework for intelligent AI agent delegation across decomposed sub-tasks with dynamic adaptation to environmental changes and failure handling.

HF MiniCPM Team, Wenhao An, Yingfa Chen, Yewei Fang, Jiayi Li, Xin Li, Yaohui Li, Yishan Li, Yuxuan Li, Biyuan Lin, Chuan Liu, Hezi Liu, Siyuan Liu, Hongya Lyu, Yinxu Pan, Shixin Ren, Xingyu Shen, Zhou Su, Haojun Sun, Yangang Sun, Zhen Leng Thai, Xin Tian, Rui Wang, Xiaorong Wang, Yudong Wang, Bo Wu, Xiaoyue Xu, Dong Xu, Shuaikang Xue, Jiawei Yang, Bowen Zhang, Jinqian Zhang, Letian Zhang, Shengnan Zhang, Xinyu Zhang, Xinyuan Zhang, Zhu Zhang, Hengyu Zhao, Jiacheng Zhao, Jie Zhou, Zihan Zhou, Shuo Wang, Chaojun Xiao, Xu Han, Zhiyuan Liu, Maosong Sun 2/12/2026

MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

MiniCPM-SALA hybrid sparse-linear attention architecture for efficient long-context LLM processing in 9B parameter model.

HF Matteo Nulli, Vladimir Orshulevich, Tala Bazazo, Christian Herold, Michael Kozielski, Marcin Mazur, Szymon Tuzel, Cees G. M. Snoek, Seyyed Hadi Hashemi, Omar Javed, Yannick Versley, Shahram Khadivi 2/12/2026

Adapting Vision-Language Models for E-commerce Understanding at Scale

Large-scale study on adapting general-purpose VLMs to e-commerce attribute understanding while preserving generalizability across multi-image noisy product data.