Isolater - Feed

Ax Youcheng Zong, Runda Jia, Ranmeng Lin, Mingxuan Ren, Dakuo He 22d ago

Open-Ended Scenario Reasoning for Specialist Model Adaptation

LLM-based adaptation method for specialist models in industrial processes without retraining.

Ax Yu Cheng, Siyue Yao, Zhongang Qi, Shanyan Guan, Wei Li, Fajie Yuan 22d ago

Dynamic-in-Few-Step: Unifying Dynamic Computation and Few-Step Distillation for Efficient Video Generation

Dynamic computation framework combining adaptive architecture with few-step distillation for video generation.

Ax Amin Haeri, Mahdi Ghelichi 22d ago

Specification Grounding Drives Test Effectiveness for LLM Code

Study on how specification-grounded test generation improves LLM code quality and edge case handling.

Ax Felix Feldman, Joshua Harris, Timothy Laurence, Leo Loman, Ollie Higgins, Fan Grayson, Poonam Soma, Bethany Pace-Bonello, Michael Borowitz, Toby Nonnenmacher 22d ago

Healthier LLMs: Retrieval-Augmented Generation for Public Health Question Answering

Retrieval-augmented generation system for public health question answering reducing LLM hallucinations through corpus-grounded responses.

Ax Iman Seyedi, Francesco Archetti 22d ago

Diffusion enabled Optimal Transport distances for graph matching

Graph matching method using diffusion-enabled optimal transport for comparing graphs with sparse or noisy node features and structure.

Ax Sankalp Gilda 22d ago

tsbootstrap: Distribution-Free Uncertainty Quantification and Conformal Prediction for Time Series

Library providing block/residual/sieve resampling and conformal prediction methods for time series uncertainty quantification and bootstrap confidence intervals.

Ax Inkyu Sa, Chanoh Park, Hea-Min Lee, Donghee Noh, Ho Seok Ahn 22d ago

Vision Language Action (VLA) Models for Unmanned Aerial Robotics and Bimanual Manipulation: A Review

Review of Vision Language Action models enabling robots to follow natural language instructions for manipulation and aerial tasks.

Ax Razvan Mihai Popescu 22d ago

Reliable and Developer-Aligned Evaluation of Agents for Software Engineering

Evaluation framework for LLM-based software engineering agents addressing fragmentation and developer alignment in autonomous development contributions.

Ax Nilay Kushawaha, Muhammad Sunny Nazeer, Baljinder Singh Bal, Cecilia Laschi, Egidio Falotico 22d ago

A Continual Learning Framework for Adaptive Control of Modular Soft Robots

Continual learning framework for adaptive control of modular soft robots with deformable and reconfigurable structures.

Ax Yizhi Wang, Xinghua Gao, Reachsak Ly, Alireza Shojaei 22d ago

SmartHomeSecure: Automated Detection and Repair of Smart Home Configuration Errors Using Large Language Models

LLM-based tool for automated detection and repair of YAML configuration errors in smart home automation platforms.

Ax Petar Djukic, Sudipta Acharya, Takai Eddine Kennouche, Burak Kantarci 22d ago

From Agentic to Autogenic Network Management for AI-Native 6G and Beyond: A Standards Perspective

Standards perspective on agentic AI and large AI models for autonomous 6G network management with runtime software evolution capabilities.

Ax Javidan Abdullayev, Maxime Devanne, Jonathan Weber, Germain Forestier 22d ago

Enhancing deep learning models for time series classification via knowledge distillation

Knowledge distillation approach for compressing state-of-the-art deep learning time series models for resource-constrained deployment.

Ax Robert Richardson 22d ago

What Predicts Correctness in Text-to-SQL? A Selective-Prediction Study

Study of signals predicting correctness in text-to-SQL generation using self-consistency and execution-based confidence metrics on BIRD and Spider benchmarks.

Ax Dovy Paukstys 22d ago

A Multi-Analyst LLM Pipeline for Auditable Rule Discovery Across 68 Public Physiological Corpora

LLM pipeline for discovering detector rules across 68 physiological datasets for contactless health monitoring platform design.

Ax Haowen Xu, Xue Tan, Lei Ma, Zhihao Zhang, Chao Wang, Qingze Wang, Ping Chen, Jun Dai, Xiaoyan Sun 22d ago

When Agents Go Rogue: Activation-Based Detection of Malicious Behaviors in Multi-Agent Systems

Security framework for detecting malicious behaviors in LLM-based multi-agent systems through activation-based detection of semantic attacks.

Ax Yashal Shakti Kanungo, Sumit Negi, Aruna Rajan 22d ago

Ad Headline Generation using Self-Critical Masked Language Model

E-commerce ad headline generation using reinforcement learning policy gradient methods on masked language models with self-critical training.

Ax Albert Zeyer, Ralf Schlüter, Hermann Ney 22d ago

Gradient-Based Speech-to-Text Alignment for Any ASR Model: From CTC to Speech LLMs

Gradient-based method for speech-to-text alignment compatible with CTC, transducers, attention-based encoders, and speech LLMs.

Ax Nima Kelidari, Mohammadsaeed Haghi, Mahdi Salmani 22d ago

A Gold-Standard Study of What Makes a Lightweight Game-Playing Agent Strong

Study using a rule-based expert baseline to evaluate 100+ trained reinforcement learning agents in the imperfect-information card game Gin Rummy.

Ax Peter Bohm, Saimunur Rahman, Abdelwahed Khamis, Sagun Man Singh Shrestha, Chris McCool, Peyman Moghadam 22d ago

GemNav: Discrete-Token Visual Robot Navigation using a Multimodal Large Language Model

Visual robot navigation policy using frozen multimodal LLM with low-rank adaptation for waypoint navigation without custom encoders or large training datasets.

Ax Abhay Kumar Pathak, Mrityunjay Chaubey, Manjari Gupta 22d ago

ReMoDEx: A Local-to-Global Relevance-Based Model Decision Explainability Framework for Large-Scale Image Datasets

Framework for explaining deep learning image classifier decisions at scale using local-to-global relevance analysis on large datasets.

Ax Jie Wang 22d ago

Computing with Stochastic Oracles in AI-Augmented Computation

Theoretical framework modeling AI-augmented computation as interaction between probabilistic Turing machines and stochastic oracles.

Ax Sojung An, Junha Lee, Sujeong You, Nam Ik Cho, Donghyun Kim 22d ago

LoCA: Spatially-Aware Low-Rank Convolutional Adaptation of Vision Foundation Models

Parameter-efficient fine-tuning method using spatially-aware low-rank adaptation for vision foundation models to reduce computational costs.

Ax Yiming Gai, Junde Lu, Xuefei Huang 22d ago

Comprehensive Evaluation of Large Language Model Responses: A Multi-Factor Scoring System

Multi-factor scoring system for comprehensive evaluation of LLM responses across accuracy, consistency, and readability.

Ax Kiarash Ahi, Saeed Valizadeh 22d ago

Large Language Models (LLMs) and Generative AI in Cybersecurity and Privacy: A Survey of Dual-Use Risks, AI-Generated Malware, Explainability, and Defensive Strategies

Survey of LLM and generative AI security applications, covering dual-use risks, malware generation, and defensive strategies.

Ax Amin Tabrizian, Arsyi Aziz, Aarifah Ullah, Mahyar Ghazanfari, Pouria Razzaghi, Peng Wei 22d ago

End-to-End LLM Flight Planning with RAG-based Memory and Multi-modal Coach Agent

FRAMe: end-to-end LLM flight planning system using RAG memory and multi-modal coach agents for eVTOL aircraft.

Ax Jun Choi, Chang-Ock Lee, Minam Moon 22d ago

Hybrid Least Squares/Gradient Descent Methods for MIONets

Hybrid least squares/gradient descent optimization method for accelerating MIONet training.

Ax Yusen Feng, Bingchen Han, Jiangran Lyu, Kai Liu, Yixin Zheng, Yuxuan Wan, Weiheng Liu, Sun Han, Ruiqin Li, Yulong Zhang, Fangfu Liu, Xuesong Shi, Libin Liu, Yizhou Wang, Zhizheng Zhang, He Wang 22d ago

WAM-TTT: Steering World-Action Models by Watching Human Play at Test Time

WAM-TTT: test-time training framework for adapting robot foundation models using human video demonstrations.

Ax Dennis Gross, Quentin Mazouni, Helge Spieker, Arnaud Gotlieb 22d ago

Gimitest: A Comprehensive Tool for Testing Reinforcement Learning Policies

Gimitest: open-source framework for comprehensive testing of reinforcement learning policies across environments and algorithms.

Ax Kyuan Oh, Bumsoo Kim 22d ago

AnchorPrune: Relevance-Anchored Contextual Expansion for Visual Token Pruning

AnchorPrune method for efficient visual token pruning in vision-language models balancing relevance and diversity.

Ax Changcun Huang 22d ago

On the Principles of Deep Feedforward ReLU Networks

Systematic analysis of training dynamics and solutions in deep feedforward ReLU networks.

Ax Arun Malik 22d ago

Progressive Crystallization: Turning Agent Exploration into Deterministic, Lower-Cost Workflows in Production

Progressive crystallization framework converts agent exploration into deterministic, cost-effective production workflows through lifecycle stages.

Ax Guoyang Zhao, Quanhao Qian, Gongjie Zhang, Wenhao Li, Jiuniu Wang, Xiaowei Lu, Deli Zhao, Ran Xu 22d ago

GeoProp: Grounding Robot State in Vision for Generalist Manipulation

arXiv paper on proprioceptive grounding mechanism (GeoProp) for vision-based robotic manipulation policies.

Ax Stepanida Alekseeva, Jenifer Kalafatovich, Seong-Whan Lee 22d ago

Tree-of-Thoughts Reasoning for Text-to-Image In-Context Learning

arXiv paper applying tree-of-thoughts reasoning to improve text-to-image in-context learning with multimodal LLMs.

Ax Zetian Hu, Shunyu Liu, Junjie Zhang, Yongcheng Jing, Ting-En Lin, Yongbin Li, Dacheng Tao 22d ago

Entropy Pacing Policy Optimization for Multi-Task Agentic Reinforcement Learning

arXiv paper on entropy pacing for multi-task reinforcement learning with agentic LLMs to handle varied exploration dynamics.

Ax Marcus Williams, Hannah Sheahan, Cameron Raymond, Tomek Korbak, Deng Pan, Peilin Yang, Leon Maksin, Ningyi Xie, Phillip Guo, Ian Kivlichan, Micah Carroll 22d ago

Predicting LLM Safety Before Release by Simulating Deployment

arXiv paper on pre-deployment safety evaluation of LLMs by simulating realistic deployment with de-identified conversations.

Ax Satoshi Matsuoka 22d ago

Memory Scarcity, Open Models, and the Restructuring of the AI Industry, 2026-2030 -- A quantitative scenario analysis of inference economics, training-cost divergence, and infrastructure solvency

arXiv quantitative analysis of AI industry restructuring 2026-2030 examining memory constraints, open models, inference economics, and compute markets.

Ax Sergio Rozada, Yiming Qin, Manuel Madeira, Pascal Frossard, Alejandro Ribeiro 22d ago

DiPhon: Diffusion on Graphons for Scalable Graph Generation

arXiv paper on scalable graph generation via diffusion models using graphon theory for dense graphs.

Ax Donato Cerciello, Leonardo Schiavo, Angel Panizo-LLedot, Javier Huertas Tato, David Camacho 22d ago

FMMVCC: Fuzzy Mamba-based Multi-View Contrastive Clustering for Univariate Time Series

arXiv paper on unsupervised time series clustering using Mamba-based multiview contrastive learning framework.

Ax Sergei Zorkaltsev, Maciej Haranczyk, Christina Schenk 22d ago

Bayesian Optimization of Genetic Algorithm Hyperparameters in a Multi-Fidelity Framework for Efficient Lattice Material Design

arXiv paper on multi-fidelity Bayesian optimization framework for genetic algorithm hyperparameter tuning with neural surrogates.

Ax Chongkai Li, Bang Zhang, Wenjian Luo 22d ago