Isolater - Feed

Ax Nadav Dym, Matthias Wellershoff, Efstratios Tsoukanis, Daniel Levy, Radu Balan 18d ago

Quantitative Bounds for Sorting-Based Permutation-Invariant Embeddings

Quantitative bounds analysis for permutation-invariant embeddings using sorting-based projections relevant to graph deep learning.

Ax Hangshuo Tian 18d ago

On the Interaction Between Chicken Swarm Rejuvenation and KLD-Adaptive Sampling in Particle Filters

Theoretical analysis of interactions between chicken swarm optimization-based particle rejuvenation and KLD-adaptive sampling in particle filters.

Ax Shanchuan Lin, Ceyuan Yang, Zhijie Lin, Hao Chen, Haoqi Fan 18d ago

Adversarial Flow Models

Generative model combining adversarial and flow-based families with native one-step/multi-step generation trained via adversarial objective.

Ax Colin Doumont, Donney Fan, Natalie Maus, Jacob R. Gardner, Henry Moss, Geoff Pleiss 18d ago

We Still Don't Understand High-Dimensional Bayesian Optimization

Research on high-dimensional Bayesian optimization showing simple Bayesian linear regression outperforms complex BO methods after geometric transformation.

Ax Hao Tang, Hao Chen, Chao Li 18d ago

Generalized Spherical Neural Operators: Green's Function Formulation

Neural operator framework for solving PDEs on spherical domains using Green's function formulation preserving rotational geometry.

Ax Xuwei Tan, Yao Ma, Xueru Zhang 18d ago

Understanding Structured Financial Data with LLMs: A Case Study on Fraud Detection

Case study applying LLMs to structured financial fraud detection data with focus on interpretability and feature analysis.

Ax Joyjit Roy, Samaresh Kumar Singh 18d ago

Comparative Evaluation of Embedding Representations for Financial News Sentiment Analysis

Comparative evaluation of embedding techniques for financial news sentiment analysis in resource-constrained environments.

Ax Carla Crivoi, Radu Tudor Ionescu 18d ago

Machine Unlearning in the Era of Quantum Machine Learning: An Empirical Study

First empirical study of machine unlearning in hybrid quantum-classical neural networks with adaptation of classical unlearning methods.

Ax Yusuf Brima, Marcellin Atemkeng 18d ago

From Classical Machine Learning to Tabular Foundation Models: An Empirical Investigation of Robustness and Scalability Under Class Imbalance in Emergency and Critical Care

Empirical study of tabular foundation models versus classical ML for healthcare applications under class imbalance in critical care.

Ax Lang Cao, Hui Ruan, Yongqian Li, Peng Chao, Wu Ning, Haonan Song, Renhong Chen, Yitong Li 18d ago

TreeAdv: Tree-Structured Advantage Redistribution for Group-Based RL

Tree-structured advantage redistribution method for group-based RL improving sample efficiency in LLM alignment on reasoning tasks.

Ax Rohan Tangri, Jan-Peter Calliess 18d ago

Constrained Policy Optimization with Cantelli-Bounded Value-at-Risk

Sample-efficient reinforcement learning algorithm for Value-at-Risk constrained optimization with safety guarantees during training.

Ax Ziqiao Shang, Lingyue Ge, Yang Chen, Shi-Yu Tian, Zhenyu Huang, Wenbo Fu, Yu-Feng Li, Lan-Zhe Guo 18d ago

MapTab: Are MLLMs Ready for Multi-Criteria Route Planning in Heterogeneous Graphs?

Benchmark for evaluating multimodal LLMs on multi-criteria route planning reasoning tasks in heterogeneous graphs.

Ax Wall Kim, Chaeyoung Song, Hanul Kim 18d ago

MultiModalPFN: Extending Prior-Data Fitted Networks for Multimodal Tabular Learning

Extension of TabPFN foundation model to handle multimodal tabular data combining images, text and structured features.

Ax Ruinan Jin, Yingbin Liang, Shaofeng Zou 18d ago

Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails

Theoretical analysis explaining Adam optimizer's empirical advantages over SGD through second-moment normalization properties.

Ax Lukas K\"onig, Manuel Kuhn, David Kappel, Anand Subramoney 18d ago

Training event-based neural networks with exact gradients via Differentiable ODE Solving in JAX

JAX framework for gradient-based training of spiking neural networks using differentiable ODE solving with exact gradients.

Ax Jiayang Gao, Tianyi Zheng, Jiayang Zou, Fengxiang Yang, Shice Liu, Luyao Fan, Zheyu Zhang, Hao Zhang, Jinwei Chen, Peng-Tao Jiang, Bo Li, Jia Wang 18d ago

C$^2$FG: Control Classifier-Free Guidance via Score Discrepancy Analysis

Theoretical analysis of classifier-free guidance in diffusion models with bounds on score discrepancy for controlled guidance weights.

Ax Yuval Ran-Milo 18d ago

Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks

Theoretical analysis proving attention sinks are functionally necessary in softmax Transformers for trigger-conditional tasks.

Ax Martin G. Frasch 18d ago

Minimum-Action Learning: Energy-Constrained Symbolic Model Selection for Physical Law Identification from Noisy Data

Framework for identifying symbolic physical laws from noisy data by minimizing action functional with sparsity and energy conservation.

Ax Huamin Chen, Xunzhuo Liu, Bowei He, Fuyuan Lyu, Yankai Chen, Xue Liu, Yuhan Liu, Junchen Jiang 18d ago

The Workload-Router-Pool Architecture for LLM Inference Optimization: A Vision Paper from the vLLM Semantic Router Project

vLLM Semantic Router architecture for optimizing LLM inference with routing mechanisms, semantic caching, and safety classification.

Ax Shreeram Murali, Cristian R. Rojas, Dominik Baumann 18d ago

Computationally lightweight classifiers with frequentist bounds on predictions

Computationally efficient classification algorithm with frequentist uncertainty bounds for safety-critical applications.

Ax Xiang Li, Yixuan Jia, Xiao Li, Jeffrey A. Fessler, Rongrong Wang, Qing Qu 18d ago

MCLR: Improving Conditional Modeling via Inter-Class Likelihood-Ratio Maximization and Unifying Classifier-Free Guidance with Alignment Objectives

Theoretical framework unifying classifier-free guidance with alignment objectives in diffusion models for generative modeling.

Ax Noah Bergam, Samuel Deng, Daniel Hsu 18d ago

A One-Inclusion Graph Approach to Multi-Group Learning

Theoretical analysis of sample complexity bounds for multi-group learning using one-inclusion graph prediction strategy.

Ax Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban, Adish Singla, Goran Radanovi\'c 18d ago

Corruption-robust Offline Multi-agent Reinforcement Learning From Human Feedback

Multi-agent reinforcement learning framework addressing robustness to data corruption in preference-based learning from human feedback.

Ax Chien-Ping Lu 18d ago

Continued AI Scaling Requires Repeated Efficiency Doublings

Analysis of AI scaling requiring repeated efficiency doublings, distinguishing logical compute from physical resource implementation efficiency.

Ax Anci Lin, Xiaohong Liu, Zhiwen Zhang, Wenju Zhao 18d ago

Biomimetic causal learning for microstructure-forming phase transitions

Biomimetic physics-informed neural networks for modeling microstructure-forming phase transitions in cellular matrices.

Ax Brandon Yee, Pairie Koh 18d ago

PI-JEPA: Label-Free Surrogate Pretraining for Coupled Multiphysics Simulation via Operator-Split Latent Prediction

Physics-informed label-free pretraining method for coupled multiphysics simulation surrogates using operator-split latent prediction.

Ax Dharmesh Tailor, Nicol\`o Felicioni, Kamil Ciosek 18d ago

A Bayesian Information-Theoretic Approach to Data Attribution

Bayesian information-theoretic approach to training data attribution that traces model predictions to influential training examples for interpretability.

Ax Minglu Liu, Cunchen Hu, Liangliang Xu, Fengming Tang, Ruijia Wang, Fu Yu 18d ago

STQuant: Spatio-Temporal Adaptive Framework for Optimizer Quantization in Large Multimodal Model Training

STQuant framework for adaptive spatio-temporal quantization of optimizer states during large multimodal model training to reduce memory costs.

Ax Nozomu Kobayashi, Yoshiyuki Suimon, Koichi Miyamoto 18d ago

Time series generation for option pricing on quantum computers using tensor network

Quantum computing approach for option pricing using tensor networks to prepare quantum states encoding asset price distributions.

Ax Tim Johnsen, Marco Levorato 18d ago

NaviSlim: Adaptive Context-Aware Navigation and Sensing via Dynamic Slimmable Networks

Adaptive neural networks for autonomous micro-drones with computational constraints via dynamic slimmable network architecture.

Ax Takuro Kutsuna 18d ago

A Probabilistic Formulation of Offset Noise in Diffusion Models

Theoretical analysis of offset noise in diffusion models to address brightness value generation challenges in large-scale models.

Ax Huawei Lin, Yingjie Lao, Weijie Zhao 18d ago

DMin: Scalable Training Data Influence Estimation for Diffusion Models

DMin framework for scalable training data influence estimation in diffusion models, enabling identification of influential training samples on generated outputs.

Ax Ximing Xing, Juncheng Hu, Ziteng Xue, Jing Zhang, Buyu Li, Sheng Wang, Dong Xu, Qian Yu 18d ago

SVGFusion: A VAE-Diffusion Transformer for Vector Graphic Generation

VAE-diffusion framework for generating high-quality SVG graphics from text with structural understanding.

Ax Antoni Kowalczuk, Jan Dubi\'nski, Franziska Boenisch, Adam Dziedzic 18d ago

Privacy Attacks on Image AutoRegressive Models

Comprehensive privacy attack analysis on image autoregressive models, identifying membership inference and extraction vulnerabilities.

Ax Mohammad Albinhassan, Pranava Madhyastha, Alessandra Russo 18d ago

$\texttt{SEM-CTRL}$: Semantically Controlled Decoding

Method for enforcing syntactic and semantic constraints in LLM decoding through MCTS-guided token-level control.

Ax Musfiqur Rahman, SayedHassan Khatoonabadi, Emad Shihab 18d ago

OpenClassGen: A Large-Scale Corpus of Real-World Python Classes for LLM Research

Large-scale corpus of 324,843 Python classes from open-source projects for training and evaluating LLMs on code generation.

Ax Dezheng Han, Yibin Jia, Ruxiao Chen, Wenjie Han, Shuaishuai Guo, Jianbo Wang 18d ago

ReCellTy: Domain-Specific Knowledge Graph Retrieval-Augmented LLMs Reasoning Workflow for Single-Cell Annotation

RAG-based LLM workflow using domain-specific knowledge graph for automated single-cell type annotation in biology.

Ax Rui Melo, Claudia Mamede, Andre Catarino, Rui Abreu, Henrique Lopes Cardoso 18d ago

Are Sparse Autoencoders Useful for Java Function Bug Detection?

Study evaluating sparse autoencoders for detecting bugs in Java code, addressing software vulnerability detection.

Ax Ozsel Kilinc, Cem Tarhan 18d ago

RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection

Technique for improving BEV-based 3D object detection in autonomous driving by reparametrizing regression targets.

Ax Charig Yang, Samiul Alam, Shakhrul Iman Siam, Michael J. Proulx, Lambert Mathias, Kiran Somasundaram, Luis Pesqueira, James Fort, Sheroze Sheriffdeen, Omkar Parkhi, Carl Ren, Mi Zhang, Yuning Chai, Richard Newcombe, Hyo Jin Kim 18d ago

Reading Recognition in the Wild

Task and dataset for detecting when users are reading in egocentric smart glasses video using multimodal models.

Ax Thinh Pham, Nguyen Nguyen, Pratibha Zunjare, Weiyuan Chen, Yu-Min Tseng, Tu Vu 18d ago

SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

Benchmark dataset (SealQA) for evaluating search-augmented LLMs on fact-seeking questions with conflicting or noisy search results.

Ax Adrian-Marius Dumitran, Radu Dita, Angela Liliana Dumitran 18d ago

BacPrep: Lessons from Deploying an LLM-Based Bacalaureat Assessment Platform

Deployment case study of LLM-based platform for automated assessment of Romanian Bacalaureat exam questions using Gemini Flash.

Ax Tianjiao Yu, Vedant Shah, Muntasir Wahed, Ying Shen, Kiet A. Nguyen, Ismini Lourentzou 18d ago

Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting

Framework for 3D reconstruction of articulated objects using part-aware Gaussian splatting representation.

Ax Scarlett Raine, Tobias Fischer 18d ago

AI-Driven Marine Robotics: Emerging Trends in Underwater Perception and Ecosystem Monitoring

Survey of AI applications in marine robotics for ecosystem monitoring and conservation using underwater perception.

Ax Alissa A. Valentine, Lauren A. Lepow, Lili Chan, Alexander W. Charney, Isotta Landi 18d ago

Bias Detection in Emergency Psychiatry: Linking Negative Language to Diagnostic Disparities

Analysis of clinician bias in emergency psychiatry using NLP to detect negative language linked to diagnostic disparities.

Ax Himanshu Singh, A. V. Subramanyam, Shivank Rajput, Mohan Kankanhalli 18d ago

Nearest Neighbor Projection Removal Adversarial Training

Adversarial training framework for neural networks that mitigates inter-class feature overlap to improve robustness.

Ax Hyungjin Chung, Hyelin Nam, Jiyeon Kim, Hyojun Go, Byeongjun Park, Junho Kim, Joonseok Lee, Seongsu Ha, Byung-Hoon Kim 18d ago

Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs

Inference method for VideoLLMs that processes multiple frame subsets in parallel to improve temporal detail without increasing context window.

Ax Christoph Timmermann, Hyunse Lee, Woojin Lee 18d ago

SeMoBridge: Semantic Modality Bridge for Efficient Few-Shot Adaptation of CLIP

Technique to improve CLIP few-shot classification by addressing modality gap through semantic bridging between image and text embeddings.

Ax Ayan Majumdar, Feihao Chen, Jinghui Li, Xiaozhen Wang 18d ago

Evaluating LLMs for Demographic-Targeted Social Bias Detection: A Comprehensive Benchmark Study

Benchmark for evaluating LLMs on detecting demographic-targeted social biases across diverse content types and demographics.

Ax Hsien-Chin Lin, Benjamin Matthias Ruppik, Carel van Niekerk, Chia-Hao Shen, Michael Heck, Nurul Lubis, Renato Vukovic, Shutong Feng, Milica Ga\v{s}i\'c 18d ago

Prompt reinforcing for long-term planning of large language models

Method to improve LLM performance in multi-turn conversations by reinforcing long-term planning and goal tracking through prompting.