Willful Disobedience: Automatically Detecting Failures in Agentic Traces
AgentPex: Framework for detecting procedural failures in agentic traces including workflow routing and tool usage violations.
AgentPex: Framework for detecting procedural failures in agentic traces including workflow routing and tool usage violations.
Method for finding representations in language models via adversarial perturbation without implausible constraints.
Benchmark (PoliticsBench) measuring political bias in eight LLMs using multi-turn roleplay evaluation.
Research on activation function curvature role in adversarial robustness using Recursive Curvature-Tunable Activation Family.
Discussion of user experience design for generative AI in education emphasizing human-AI epistemic partnership.
Investigation of vision-language model robustness under distribution shifts using visual deductive reasoning tasks.
HDPO method augmenting RL with privileged self-distillation for LLM mathematical reasoning on unsolvable cliff prompts.
Luna: C++ implementation of alpha-CROWN bound propagation for neural network formal verification.
Multi-agent robotic platform using AI agents for adaptive chemical laboratory automation handling diverse experimental tasks.
SM-Net model generating stellar spectra from physical parameters using combined stellar library data.
Knowledge-refined network for retrieving partially relevant video segments using semantic context awareness.
Latent bias alignment technique for improving diffusion model inversion quality in real-world image reconstruction.
Self-distillation method for multi-token prediction in LLMs to improve inference efficiency and MTP head acceptance rates.
Multimodal deception detection system using schema-driven approach with multicultural datasets and explainable reasoning.
MVH-Bench dataset and analysis of multi-view hallucination in vision-language models processing diverse viewpoint images.
Watermarking system for face content protection against AIGC manipulation and deepfakes with high fidelity recovery.
Variable-length audio fingerprinting method using deep learning for robust recognition of distorted recordings.
LLM-enabled framework for automated threat hunting using Splunk SOC logs to assist security analysts with APT detection.
Systematic study of reasoning LLM inference costs revealing pricing reversal phenomenon where cheaper models cost more across 9 diverse tasks.
Physics-guided text-to-motion framework for humanoid control using rectified flow and safety gating to prevent kinematic hallucinations.
Ensemble of specialized LLMs architecture for adaptive tutoring that separates pedagogical decision-making from response generation.
Analysis of challenges in iterative generative optimization with LLMs for self-improving agents, identifying hidden design choices limiting adoption.
Fine-tuned 8B model for text-to-SQL at scale, reducing API costs and latency for production deployment in conversational applications.
Training framework addressing contextual exposure bias in speech-LLMs using teacher error knowledge and contrastive learning.
Method to reduce object hallucinations in LVLMs by rectifying attention imbalance across and within vision-language modalities.
Safety analysis of MLLMs for image generation, identifying semantic understanding capabilities that may introduce new risks compared to diffusion models.
Multi-task robotic manipulation framework using knowledge graphs and dynamic relation mechanisms for vision-grounded policy learning.
Dual-guidance RL framework for LLMs that combines external execution feedback with internal experience for improved reasoning task learning.
Graph representation learning method for analog circuit design automation using DC electrical equivalence principles.
Comparative study of dual-form attention networks for multi-modal satellite time series analysis in land monitoring applications.
Analysis of response homogenization in RLHF-aligned LLMs and its impact on uncertainty estimation methods, identifying alignment-robustness tradeoffs.
Multilingual multi-turn medical dialogue dataset for training conversational AI systems in healthcare with improved realism and accessibility.
Scaling RL for LLM code generation using synthetic data pipelines and curriculum learning, addressing data diversity over volume.
Security analysis of Model Context Protocol (MCP) tool-augmented LLM agents, demonstrating stealthy injection attacks on tool responses.
Knowledge distillation method using dual-modality (vision + text/CLIP) teacher models to improve student model efficiency and quality.
Privacy analysis of time series imputation models, demonstrating membership inference and attribute leakage vulnerabilities in black-box settings.
Open-source tool for decomposing citation networks and measuring researcher influence through bibliometric scoring (BARON/HEROCON).
Study of fairness impacts in RAG-augmented LLMs, examining if certain demographic groups receive systematically different response quality.
Multi-agent LLM workflow for automated penetration testing of networked cyber-physical systems and robotic infrastructure using environment grounding.
DVM runtime kernel generation system for efficient compilation of dynamic AI models with variable tensor shapes and control flows.
Probabilistic time series forecasting method embracing heteroscedasticity for uncertainty quantification.
Heterogeneous caching accelerates diffusion-based video editing by reusing features across denoising timesteps.
Studies coordination failures when multiple LLM-based code agents implement parts of same class without explicit specification.
Graph neural network layer (CSNA) for heterophilous graphs with cost-sensitive neighborhood aggregation.
Reviews neural motion planning approaches for robotic manipulators, discussing challenges in generalist manipulation policies.
Automated reward design framework using LLMs for cooperative multi-agent reinforcement learning with aligned incentives.
Coarse-to-fine visual processing reduces computational costs in document parsing with vision-language models.
GameplayQA benchmark for evaluating multimodal LLMs as perceptual backbones for autonomous agents in 3D environments.
Improves deepfake audio detection using neuron-level mechanisms and neuroplasticity. Builds on Wav2Vec and LLMs.
Studies emergent self-awareness in continual robot learning by quantifying invariant cognitive structures.