Beyond Noisy-TVs: Noise-Robust Exploration Via Learning Progress Monitoring
Proposes learning progress monitoring to improve exploration efficiency in reinforcement learning agents when encountering unlearnable noise sources.
Introduces attribution gradients technique to improve citation informativeness and evidence transparency in AI answer engines.
Forecasts expert selection patterns in Mixture of Experts LLMs to reduce data movement overhead in multi-unit serving systems.
Extends Forward-Forward algorithm to reinforcement learning using action-conditioned Q-functions and layer activity statistics as learning signals.
CQA-Eval evaluation framework for multi-paragraph clinical question answering systems with physician annotations and recommendations for resource-constrained settings.
f-INE hypothesis testing framework estimates sample influence on model performance while accounting for training randomness, addressing instability in existing influence estimation methods.
MusicRFM framework adapts Recursive Feature Machines to enable fine-grained control over frozen pre-trained music generation models via internal activation steering.
Deep learning approach fixing systematic S-wave detection failures in seismic phase picking via shape-aware loss functions.
SAGA framework for source attribution of AI-generated videos. Identifies specific generative model used instead of binary real/fake detection.
Research on contrastive fusion for higher-order multimodal alignment in joint representation learning across multiple modalities.
Deep learning approach using YOLO and ResNet50 for breast cancer detection in mammograms with improved out-of-domain robustness.
IMAgent: open-source visual agent trained with end-to-end RL for multi-image reasoning tasks, addressing limitations of single-image VLM agents.
Method for dense 3D point tracking and reconstruction in dynamic scenes using single forward pass without requiring known camera poses.
Maps EU AI Act legal requirements to technical verification activities for compliance assessment of high-risk AI systems across member states.
FedVideoMAE: federated learning framework for privacy-preserving video moderation using self-supervised representations and differential privacy.
Open-source image generation model with improved reasoning for logic-intensive instruction following, closing gap to closed-source systems.
Multi-agent framework automating full computational catalysis research lifecycle from conception to publication.
Equilibrium propagation method for optimizing compound AI systems with multiple modules in long-horizon agentic workflows.
Framework using influence functions to craft training data perturbations inducing targeted model behavior changes.
Research on uncertainty quantification for ML interatomic potentials using evidential deep learning.
arXiv: Geometric analysis of transformer optimization dynamics revealing low-dimensional manifolds in grokking.
Research paper studying loss-landscape geometry as early-warning signals for grokking in neural networks.
CeRA: parameter-efficient fine-tuning method overcoming LoRA's linear capacity ceiling via non-linear gating and dropout for rank adaptation.
SafeSci: comprehensive benchmark and framework for evaluating LLM safety in scientific domains with multi-domain risk coverage and objective evaluation.
Framework for EEG-to-text decoding addressing semantic bias and signal neglect in neural signal interpretation. Published on arXiv.
Stock market prediction using Node Transformer architecture with BERT sentiment analysis to capture market patterns and dependencies.
DiFlowDubber: discrete flow matching framework for video dubbing with TTS, lip synchronization, and expressive prosody. Published on arXiv.
Qualitative study of 167,000+ AI agents on multiple platforms learning from each other and developing emergent behaviors without researcher intervention.
arXiv: RAG-enhanced diffusion models using adaptive guidance to resolve conflicts between retrieved noisy context and parametric model knowledge.
Uses unsupervised machine learning (UMAP, HDBSCAN) to analyze drift rate patterns in fast radio burst data, discovering bimodal structure in emission regions.
Studies robustness of medical vision-language models under real clinical workflows using chain-of-distribution attacks and token-space repair techniques.
ArXiv research on parameterized GELU activation for controlled ReLU approximation in deep networks.
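The entry above concerns a parameterized GELU that can be tuned toward ReLU. The paper's exact parameterization is not given here; a minimal sketch of one plausible form, gating with Phi(alpha * x) so that large alpha sharpens the gate into a step (recovering ReLU in the limit), is:

```python
import math

def parameterized_gelu(x: float, alpha: float = 1.0) -> float:
    """Illustrative parameterized GELU: x * Phi(alpha * x).

    alpha = 1 recovers standard GELU; as alpha grows, the Gaussian
    CDF gate approaches a hard step and the activation approaches
    ReLU. This is a generic sketch, not the paper's formulation.
    """
    return x * 0.5 * (1.0 + math.erf(alpha * x / math.sqrt(2.0)))

# Standard GELU at x = 1 is about 0.8413; with alpha = 1000 the
# function is numerically indistinguishable from ReLU.
```

A single scalar `alpha` per layer (or per network) gives a controlled interpolation between the smooth and hard regimes.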
ArXiv paper on coarse-to-fine visual processing for efficient document parsing with vision-language models.
ArXiv study on behavioral consistency of LLM agents in SWE-bench comparing multiple models.
ArXiv research analyzing prompt injection attack success stages across five frontier LLM agents.
ArXiv paper on token-level entropy regulation for reinforcement learning in large reasoning models.
ArXiv research on spectral edge thesis controlling phase transitions in neural network training dynamics.
APEX-EM non-parametric framework for LLM agents to accumulate and reuse procedural plans without weight modification.
World model planning for structured origami generation satisfying geometric constraints and kinematic rules via long-horizon reasoning.
Terminal agents executing enterprise tasks via CLI are simpler and more cost-effective than tool-augmented or web agents.
Transfer learning methods for nonparametric Bayesian networks under scarce data with constraint-based and score-based algorithms.
Body model ablation replaces SMPL with Momentum Human Rig for 3D Gaussian avatar generation with simpler architecture.
ProdCodeBench evaluates AI coding agents using production-derived tasks reflecting real developer-agent sessions and workflows.
Visual attention inertia in MLLMs causes cognitive hallucinations; proposes mitigation for compositional understanding.
Convolutional surrogate model for accelerating 3D discrete fracture-matrix simulations in groundwater flow modeling.
LiME achieves expert specialization in multimodal MoE-PEFT via lightweight modulation instead of separate adapters per expert.
SIEVE enables sample-efficient parametric learning from natural language instructions and feedback without high-quality traces.
Model scheduling for masked diffusion language models uses smaller models at early denoising steps for faster generation.
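The scheduling idea above, cheaper models for the early, coarse denoising steps and larger models only where fine detail is resolved, can be sketched generically. The model names and step fractions below are illustrative assumptions, not taken from the paper:

```python
def schedule_models(num_steps, models_with_fracs):
    """Assign a model to each denoising step.

    models_with_fracs: list of (model_name, fraction_of_steps) in run
    order; early (noisier) steps get the earlier, smaller models.
    Generic sketch of the scheduling idea, not the paper's policy.
    """
    schedule = []
    for name, frac in models_with_fracs:
        schedule.extend([name] * round(num_steps * frac))
    # Pad or trim with the final model to absorb rounding drift.
    last = models_with_fracs[-1][0]
    return (schedule + [last] * num_steps)[:num_steps]

plan = schedule_models(10, [("small", 0.5), ("large", 0.5)])
# First half of the steps run the small model, the rest the large one.
```

The total cost then scales with the fraction of steps given to each model size rather than with the largest model alone.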
Process reward models improve LLM mathematical reasoning by providing step-level feedback on intermediate errors, not just final outcomes.
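The step-level feedback idea above can be illustrated with a toy rule-based checker standing in for a learned process reward model: each intermediate step is scored on its own, so an error mid-derivation is localized rather than only penalized at the final answer. The step format and scoring rule here are assumptions for illustration:

```python
def step_rewards(steps):
    """Score each derivation step, not just the final outcome.

    Each step is a string like '2+3=5'; reward 1.0 if the equality
    holds, else 0.0. A real process reward model is learned, not a
    rule; this is only a toy stand-in for the idea.
    """
    rewards = []
    for step in steps:
        try:
            lhs, rhs = step.split("=")
            ok = eval(lhs) == float(rhs)  # toy verifier for arithmetic
        except Exception:
            ok = False
        rewards.append(1.0 if ok else 0.0)
    return rewards

# A correct first step followed by a wrong second step yields
# [1.0, 0.0], pinpointing where the reasoning went off track.
```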
Fairness-aware GNN training using contrastive learning and counterfactual augmentation to mitigate biases from graph structure.