Who Gets Which Message? Auditing Demographic Bias in LLM-Generated Targeted Text
Systematic analysis of demographic bias in LLM-generated targeted messaging across GPT-4o, Llama-3.3, and Mistral-Large models.
Systematic analysis of demographic bias in LLM-generated targeted messaging across GPT-4o, Llama-3.3, and Mistral-Large models.
MERMAID: Multi-agent system for fact-checking using LLMs with memory-enhanced retrieval and iterative reasoning to assess veracity of claims.
Agent memory system beyond RAG addressing agent-specific needs: bounded coherent dialogue retrieval with decoupling and aggregation.
Unified framework explaining LLM steering methods (fine-tuning, LoRA, activation interventions) as dynamic weight updates from control signals.
El Agente Estructural multimodal agent for autonomous molecular geometry generation and manipulation using natural language and vision.
Fake-HR1 hybrid-reasoning model for synthetic image detection balancing chain-of-thought reasoning with computational efficiency.
Study of electromagnetic fault injection attacks on embedded deep learning models analyzing influence of number representations on resilience.
AdvSynGNN resilient graph neural network architecture addressing structural noise and non-homophilous topologies via adversarial synthesis.
SubQuad pipeline for immune repertoire analysis combining subquadratic retrieval with GPU-accelerated affinity kernels and multimodal fusion.
UBio-MolFM universal molecular foundation model framework for bio-system simulation bridging quantum accuracy and biological scale.
Pyramid MoA hierarchical mixture-of-agents architecture with decision-theoretic router for cost-optimized anytime LLM inference.
Gome agent for machine learning engineering using gradient-based optimization instead of tree search, scaling LLM-based reasoning.
SteerEval benchmark for evaluating LLM controllability across language features, sentiment, and personality at multiple specification levels.
Physics-informed surrogate model for ferroelectric NAND retention analysis reducing computational cost from day-scale to second-scale.
BiCLIP extends vision-language models to specialized domains via structured geometric transformation and domain canonicalization.
Causally-informed feature expansion method for class-incremental learning addressing catastrophic forgetting through feature collision mitigation.
Analysis of prompt injection attacks as role confusion where models infer text source by content style rather than origin.
Survey of resource consumption threats in LLMs including excessive generation attacks, resource efficiency requirements, and mitigation strategies.
Study of LLM alignment evaluation focusing on routing from concept detection to behavioral policy, using Chinese language models as case study.
Diffusion-based image super-resolution framework addressing inference efficiency vs reconstruction quality trade-off with learnable noise prediction.
Suiren-1.0 family of molecular foundation models for organic systems with 1.8B parameters pre-trained on 70M samples.
Cross-view geo-localization method for UAV navigation in GNSS-denied environments using aerial and satellite image matching.
Curriculum learning framework using cross-entropy games to automatically build general capabilities and discover skills in language models.
Introduces Step-Level Reasoning Capacity metric and LC-CoSR training method to measure and reduce reasoning rigidity in chain-of-thought reasoning.
Training-free spatial-temporal token compression method for video MLLMs achieving high-ratio visual token reduction via forest modeling.
Philosophical analysis of control dynamics and relational ethics in human-AI companion interactions examining provider authority.
Memory-sparse attention mechanism enabling LLMs to scale context to 100M tokens through efficient memory modeling instead of full attention.
Multimodal deception detection using schema-driven approach with audiovisual analysis across multicultural datasets for forensics applications.
LLM-enabled threat hunting framework for SOC analysts integrating Splunk with policy-guided decision making for APT detection.
Benchmark framework for evaluating multimodal LLMs as perceptual backbones for autonomous agents in 3D environments with decision-dense scenarios.
Reinforcement learning framework enabling multimodal LLMs to autonomously crop and focus on regions of interest for improved perception in complex visual scenes.
Coarse-to-fine reasoning framework using reinforcement learning for interpretable multimodal sentiment analysis with MLLMs and hint-guided training.
Bio-inspired self-evolving network architecture for autonomous agents using evolutionary approaches instead of static human-defined protocols.
Open source sound effect foundation model from Sony AI with audio encoder/decoder and text-to-audio capabilities.
Framework analyzing agent communication protocols for LLM systems across three layers: communication, syntactic, and semantic. Systematically organizes 18 representative protocols.
Evaluates whether LLMs can infer causal intervention effects from natural language descriptions using behavioral simulation on climate-psychology interventions.
Generative camera system using visual preference optimization for cinematic trajectory generation. Addresses framing and composition without director feedback loop.
LitPivot: tool supporting iterative research idea development through dynamic literature contextualization and AI-driven critique.
Method to identify valence-arousal emotional subspace in LLM representations for emotion steering and behavioral control.
Privacy-preserving system using LLMs to analyze student attention in classroom videos without storing identifiable footage.
Zero-shot quantization method using weight-space arithmetic to improve post-training quantization robustness across models.
Economic analysis of AI productivity gains showing sustained tool use erodes worker expertise over time.
Taxonomy of LLM-based coding agent architectures via source-code analysis, categorizing control loops, tool definitions, and context strategies.
Analysis of policy routing circuits in alignment-trained LLMs, localizing attention gates and amplifier heads controlling refusal behavior.
StableTTA: training-free test-time adaptation improving image classification accuracy via novel ensemble strategies.
EduIllustrate: benchmark evaluating LLMs on automated generation of diagram-rich educational content combining visuals and reasoning.
VideoStir: retrieval-augmented generation system for understanding long videos with multimodal LLMs using spatio-temporal structure.
Theoretical analysis proving limitations of continuous wrapper defenses against prompt injection attacks in LLMs.
MoBiE: binarization framework for efficient inference in Mixture-of-Experts LLMs via post-training quantization.
Analysis of emotional representation geometry in LLM latent spaces for transparency and safety.