The Diminishing Returns of Early-Exit Decoding in Modern LLMs
Analysis of early-exit decoding in modern LLMs showing reduced efficiency gains due to improved architectures with lower layer redundancy.
Analysis of early-exit decoding in modern LLMs showing reduced efficiency gains due to improved architectures with lower layer redundancy.
Study of filtered vector search algorithms in PostgreSQL for semantic search and GenAI applications, evaluating real-world database performance.
Continuous-time diffusion models for generating synthetic electronic health records with mixed numerical and categorical features.
Self-paced curriculum learning for RL using closed-form Gaussian updates to improve efficiency in high-dimensional contexts.
Intent-Based Networking using AI to translate high-level natural language intents into network policies with automated compliance assurance.
Human-in-the-loop Pareto optimization for motor skill training and rehabilitation, characterizing task difficulty vs. performance trade-offs.
Bayesian latent transport framework for domain-adaptive foundation models addressing distribution mismatch and uncertainty propagation in limited-supervision scenarios.
Cognitive Firewall: hybrid edge-cloud architecture for securing browser-based LLM agents against indirect prompt injection attacks using split-compute security checks.
LLM-informed model-based planning for object search using LLM likelihood estimates and environment costs.
Neural Regression Collapse phenomenon across network layers showing feature sparsity and low rank structures.
AgentPex: Framework for detecting procedural failures in agentic traces including workflow routing and tool usage violations.
Method for finding representations in language models via adversarial perturbation without implausible constraints.
Benchmark (PoliticsBench) measuring political bias in eight LLMs using multi-turn roleplay evaluation.
Research on activation function curvature role in adversarial robustness using Recursive Curvature-Tunable Activation Family.
Discussion of user experience design for generative AI in education emphasizing human-AI epistemic partnership.
Investigation of vision-language model robustness under distribution shifts using visual deductive reasoning tasks.
HDPO method augmenting RL with privileged self-distillation for LLM mathematical reasoning on unsolvable cliff prompts.
Luna: C++ implementation of alpha-CROWN bound propagation for neural network formal verification.
Multi-agent robotic platform using AI agents for adaptive chemical laboratory automation handling diverse experimental tasks.
SM-Net model generating stellar spectra from physical parameters using combined stellar library data.
Knowledge-refined network for retrieving partially relevant video segments using semantic context awareness.
Latent bias alignment technique for improving diffusion model inversion quality in real-world image reconstruction.
Self-distillation method for multi-token prediction in LLMs to improve inference efficiency and MTP head acceptance rates.
Multimodal deception detection system using schema-driven approach with multicultural datasets and explainable reasoning.
MVH-Bench dataset and analysis of multi-view hallucination in vision-language models processing diverse viewpoint images.
Watermarking system for face content protection against AIGC manipulation and deepfakes with high fidelity recovery.
Variable-length audio fingerprinting method using deep learning for robust recognition of distorted recordings.
LLM-enabled framework for automated threat hunting using Splunk SOC logs to assist security analysts with APT detection.
Systematic study of reasoning LLM inference costs revealing pricing reversal phenomenon where cheaper models cost more across 9 diverse tasks.
Physics-guided text-to-motion framework for humanoid control using rectified flow and safety gating to prevent kinematic hallucinations.
Ensemble of specialized LLMs architecture for adaptive tutoring that separates pedagogical decision-making from response generation.
Analysis of challenges in iterative generative optimization with LLMs for self-improving agents, identifying hidden design choices limiting adoption.
Fine-tuned 8B model for text-to-SQL at scale, reducing API costs and latency for production deployment in conversational applications.
Training framework addressing contextual exposure bias in speech-LLMs using teacher error knowledge and contrastive learning.
Method to reduce object hallucinations in LVLMs by rectifying attention imbalance across and within vision-language modalities.
Safety analysis of MLLMs for image generation, identifying semantic understanding capabilities that may introduce new risks compared to diffusion models.
Multi-task robotic manipulation framework using knowledge graphs and dynamic relation mechanisms for vision-grounded policy learning.
Dual-guidance RL framework for LLMs that combines external execution feedback with internal experience for improved reasoning task learning.
Graph representation learning method for analog circuit design automation using DC electrical equivalence principles.
Comparative study of dual-form attention networks for multi-modal satellite time series analysis in land monitoring applications.
Analysis of response homogenization in RLHF-aligned LLMs and its impact on uncertainty estimation methods, identifying alignment-robustness tradeoffs.
Multilingual multi-turn medical dialogue dataset for training conversational AI systems in healthcare with improved realism and accessibility.
Scaling RL for LLM code generation using synthetic data pipelines and curriculum learning, addressing data diversity over volume.
Security analysis of Model Context Protocol (MCP) tool-augmented LLM agents, demonstrating stealthy injection attacks on tool responses.
Knowledge distillation method using dual-modality (vision + text/CLIP) teacher models to improve student model efficiency and quality.
Privacy analysis of time series imputation models, demonstrating membership inference and attribute leakage vulnerabilities in black-box settings.
Open-source tool for decomposing citation networks and measuring researcher influence through bibliometric scoring (BARON/HEROCON).
Study of fairness impacts in RAG-augmented LLMs, examining if certain demographic groups receive systematically different response quality.
Multi-agent LLM workflow for automated penetration testing of networked cyber-physical systems and robotic infrastructure using environment grounding.
DVM runtime kernel generation system for efficient compilation of dynamic AI models with variable tensor shapes and control flows.