RSHallu: Dual-Mode Hallucination Evaluation for Remote-Sensing Multimodal Large Language Models with Domain-Tailored Mitigation
researchpaper
PELLI: Framework to effectively integrate LLMs for quality software generation
researchpaper
Beyond Confidence: The Rhythms of Reasoning in Generative Models
researchpaper
Flow caching for autoregressive video generation
researchpaper
Enhancing Multivariate Time Series Forecasting with Global Temporal Retrieval
researchpaper
Time Series Foundation Models for Energy Load Forecasting on Consumer Hardware: A Multi-Dimensional Zero-Shot Benchmark
researchpaper
ICA: Information-Aware Credit Assignment for Visually Grounded Long-Horizon Information-Seeking Agents
researchpaper
FedPS: Federated data Preprocessing via aggregated Statistics
researchpaper
Diagnosing Structural Failures in LLM-Based Evidence Extraction for Meta-Analysis
researchpaper
The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems
researchpaper
Interactive LLM-assisted Curriculum Learning for Multi-Task Evolutionary Policy Search
researchpaper
Resource-Efficient Model-Free Reinforcement Learning for Board Games
researchpaper
Blind Gods and Broken Screens: Architecting a Secure, Intent-Centric Mobile Agent Operating System
researchpaper
Traceable, Enforceable, and Compensable Participation: A Participation Ledger for People-Centered AI Governance
researchpaper
What do people want to fact-check?
researchpaper
Computational Phenomenology of Temporal Experience in Autism: Quantifying the Emotional and Narrative Characteristics of Lived Unpredictability
researchpaper
Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models
researchpaper
Rotary Positional Embeddings as Phase Modulation: Theoretical Bounds on the RoPE Base for Long-Context Transformers
researchpaper
Healthy Harvests: A Comparative Look at Guava Disease Classification Using InceptionV3
researchpaper
FeatureBench: Benchmarking Agentic Coding for Complex Feature Development
researchpaper
RiemannGL: Riemannian Geometry Changes Graph Deep Learning
researchpaper
LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules
researchpaper
Fine-Tuning GPT-5 for GPU Kernel Generation
researchpaper
Enhancing Predictability of Multi-Tenant DNN Inference for Autonomous Vehicles' Perception
researchpaper
ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression
researchpaper
CVPL: A Geometric Framework for Post-Hoc Linkage Risk Assessment in Protected Tabular Data
researchpaper
From Buffers to Registers: Unlocking Fine-Grained FlashAttention with Hybrid-Bonded 3D NPU Co-Design
researchpaper
OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories
researchpaper
ContactGaussian-WM: Learning Physics-Grounded World Model from Videos
researchpaper
Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting
researchpaper
Linguistic Indicators of Early Cognitive Decline in the DementiaBank Pitt Corpus: A Statistical and Machine Learning Study
researchpaper
Language Model Inversion through End-to-End Differentiation
researchpaper
GraphSeek: Next-Generation Graph Analytics with LLMs
researchpaper
Conversational Behavior Modeling Foundation Model With Multi-Level Perception
researchpaper
Chatting with Images for Introspective Visual Thinking
researchpaper
Interpretable Attention-Based Multi-Agent PPO for Latency Spike Resolution in 6G RAN Slicing
researchpaper
In-the-Wild Model Organisms: Mitigating Undesirable Emergent Behaviors in Production LLM Post-Training via Data Attribution
researchpaper
SteuerLLM: Local specialized large language model for German tax law analysis
researchpaper
GRASP: group-Shapley feature selection for patients
researchpaper
General Flexible $f$-divergence for Challenging Offline RL Datasets with Low Stochasticity and Diverse Behavior Policies
researchpaper
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning
researchpaper
Direct Learning of Calibration-Aware Uncertainty for Neural PDE Surrogates
researchpaper
Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away
researchpaper
Learning to Compose for Cross-domain Agentic Workflow Generation
researchpaper
Weight Decay Improves Language Model Plasticity
researchpaper
Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows
researchpaper
GENIUS: Generative Fluid Intelligence Evaluation Suite
researchpaper
Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling
researchpaper
Implicit Probabilistic Reasoning Does Not Reflect Explicit Answers in Large Language Models
researchpaper
Metareasoning in uncertain environments: a meta-BAMDP framework
researchpaper