ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data
ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data
ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data
Enhancing Inverse Reinforcement Learning through Encoding Dynamic Information in Reward Shaping
When Speculation Spills Secrets: Side Channels via Speculative Decoding In LLMs
Symmetrization Weighted Binary Cross-Entropy: Modeling Perceptual Asymmetry for Human-Consistent Neural Edge Detection
Multi-Objective Bayesian Optimization for Networked Black-Box Systems: A Path to Greener Profits and Smarter Designs
from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors
Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling
GenDR: Lighten Generative Detail Restoration
Convergence and Connectivity: Dynamics of Multi-Agent Q-Learning in Random Networks
LLM-Mediated Guidance of MARL Systems
MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering
Efficient IoT Intrusion Detection with an Improved Attention-Based CNN-BiLSTM Architecture
Localized Graph-Based Neural Dynamics Models for Terrain Manipulation
Geospatial Representation Learning: A Survey from Deep Learning to The LLM Era
Spatiotemporal Field Generation Based on Hybrid Mamba-Transformer with Physics-informed Fine-tuning
ZeroTuning: Unlocking the Initial Token's Power to Enhance Large Language Models Without Training
Intrinsic Self-Correction in LLMs: Towards Explainable Prompting via Mechanistic Interpretability
Attributing Response to Context: A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation
Unveiling the "Fairness Seesaw": Discovering and Mitigating Gender and Race Bias in Vision-Language Models
Can LLMs Reason Structurally? Benchmarking via the Lens of Data Structures
Cross-Attention Speculative Decoding
Belief-Based Offline Reinforcement Learning for Delay-Robust Policy Optimization
Algorithmically Establishing Trust in Evaluators
Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs
Complexity of normalized stochastic first-order methods with momentum under heavy-tailed noise
LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures
A New Dataset and Performance Benchmark for Real-time Spacecraft Segmentation in Onboard Computers
Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle
MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models
AI Agentic Vulnerability Injection And Transformation with Optimized Reasoning
AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
Self-Augmented Robot Trajectory: Efficient Imitation Learning via Safe Self-augmentation with Demonstrator-annotated Precision
Is In-Context Learning Learning?
TableDART: Dynamic Adaptive Multi-Modal Routing for Table Understanding
MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances
HuMam: Humanoid Motion Control via End-to-End Deep Reinforcement Learning with Mamba
Bridging Fairness and Explainability: Can Input-Based Explanations Promote Fairness in Hate Speech Detection?
Beyond Aggregation: Guiding Clients in Heterogeneous Federated Learning
Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding
HarmMetric Eval: Benchmarking Metrics and Judges for LLM Harmfulness Assessment
Discrete Variational Autoencoding via Policy Search
GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models
VoiceBridge: Designing Latent Bridge Models for General Speech Restoration at Scale
ACT: Agentic Classification Tree
Data Provenance Auditing of Fine-Tuned Large Language Models with a Text-Preserving Technique
Learning under Quantization for High-Dimensional Linear Regression
Context-level Language Modeling by Learning Predictive Context Embeddings
RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA