STAG-CN: Spatio-Temporal Apiary Graph Convolutional Network for Disease Onset Prediction in Beehive Sensor Networks
Graph neural network for predicting disease spread across beehives using spatio-temporal sensor data.
Graph neural network for predicting disease spread across beehives using spatio-temporal sensor data.
Deep learning framework using geometric topology for predicting cold spray particle impact responses in materials processing.
Infinite Problem Generator: agentic framework synthesizing physics problems with guaranteed solvability for LLM training data generation.
Analysis of tensions between AI Safety and AI Ethics in governance, proposing engagement modes and bridging approaches.
CangjieBench benchmark for evaluating LLMs on Cangjie, a low-resource general-purpose programming language with contamination-free evaluation.
Trust-region search algorithm for black-box alignment of diffusion and flow models to target rewards at inference time without gradient access.
Vision-Language-Action framework with thinking-with-image reasoning allowing models to revisit visual context during long-horizon embodied tasks.
Benchmark of 12 language models on MALINT, a human-annotated disinformation corpus capturing malicious intent, for improved detection.
Loss landscape visualization method for analyzing critic neural networks in online reinforcement learning actor-critic algorithms.
End-to-end language-driven agent system for high-energy physics phenomenology workflows, executing tasks from theoretical input to final outputs.
Multimodal medical dataset and benchmark for automated ulcerative colitis severity scoring in endoscopy using standardized metrics.
Survey on using machine learning methods for adaptive memory system design in modern computing platforms instead of static heuristics.
Extension of critic match loss landscape visualization method to off-policy reinforcement learning and Soft Actor-Critic algorithm.
Efficient drop-in replacement for dense classification heads in language models, reducing parameter and compute overhead for consumer devices.
Biologically-inspired agentic memory architecture using reward prediction error routing to reduce token costs and write latency in LLM agents.
Loss landscape visualization framework for interpreting reinforcement learning behavior in actor-critic algorithms and control systems.
Policy-aware agent alignment framework using chain-of-thought reasoning to help LLM agents adhere to complex business rules without excessive prompting.
Novel policy gradient method addressing pathological behavior in standard policy gradients through context-aware advantage weighting.
LLM-augmented system for automated change summarization and impact analysis in cloud-native CI/CD pipelines and release management.
Constrained hierarchical multi-agent RL framework for sustainable maritime logistics with emission budgets and fairness guarantees.
Benchmark for evaluating LLMs on low-level code reasoning and formal proof generation using cryptographic library assembly code.
Open-source multi-agent system for literature review assistance using DSPy, Qdrant, and local-first architecture to synthesize papers and draft related work.
Framework for evaluating fairness in automated prior authorization systems for healthcare, addressing demographic parity in medical decision-making.
Study on compute allocation strategies for LLM-augmented retrieval agents handling reasoning-intensive queries over long horizons with growing memory stores.
Training-free inference-time model steering strategies to improve chain-of-thought reasoning in large audio-language models across multiple benchmarks.
TopoCL framework for medical imaging that adds topological contrastive learning to capture connectivity and boundary patterns beyond visual appearance.
Deep learning methodology for predicting thermal limit bias in boiling water nuclear reactors to improve operational efficiency.
EARCP ensemble architecture dynamically weights heterogeneous expert models based on performance and inter-model coherence for sequential decision making.
Empirical study comparing human deepfake detection to 95 AI detectors on standard and mobile-recorded video datasets.
VisionCoach uses reinforcement learning with visual-perception prompting to improve spatio-temporal grounding in video reasoning models.
Comparative analysis of 3D and 2.5D U-Net architectures for brain MRI super-resolution using elucidated diffusion models.
Study on detecting when language models actively conceal knowledge, finding larger models better at deception with gradient-based concealment easier to detect.
Two-stage framework for complex human-object interaction video reenactment using 3D foundation models and multi-view conditions.
AgentTrace provides lightweight causal graph tracing for post-hoc root cause diagnosis in deployed multi-agent workflows with cascading failures.
Theoretical work on intuitionistic temporal logic and its relationship to temporal answer set programming.
Domain adaptation approach for building damage detection across different disaster types using remote sensing imagery.
Multi-agent reasoning framework for automated software system performance optimization beyond local code transformations, reasoning about whole-system interactions.
AdapterTune adds zero-initialized low-rank adapters to frozen Vision Transformers for stable transfer learning with principled capacity guidance.
TrajMamba uses Mamba architecture to predict pedestrian trajectories from egocentric camera perspective using ego-motion guidance.
Privacy-preserving machine translation at inference stage with new benchmark dataset for evaluating local translation without cloud servers.
Data augmentation technique preserving ring-type polygon topology in architectural floorplan segmentation pipelines.
POLCA framework uses LLMs as optimizers guided by rewards and feedback to automate optimization of prompts and multi-turn agent systems, formalizing it as stochastic generative optimization.
Hybrid-order split federated learning combining zeroth-order optimization with standard backprop for memory-efficient fine-tuning.
Privacy-preserving RAG service supporting arbitrary top-k retrieval for LLM-based systems with secure document retrieval.
Framework for generalizable drug-target affinity prediction using latent representations and salient feature extraction.
Multi-task genetic algorithm with multi-granularity encoding for protein-nucleotide binding site prediction.
Framework for lifelong agents requiring epistemic control to select appropriate reasoning frameworks and prevent decision chain failures.
Probabilistic certification method for verifying behavioral fidelity in compressed deep neural networks.
Model-agnostic unlearning framework using ratio-aware layer editing for vision transformers and diffusion models.
Inference-time feature projection balancing safety and utility tradeoffs in large vision-language models.