Flowette: Flow Matching with Graphette Priors for Graph Generation
Flowette: flow matching generative model for graphs with recurring motifs using graph neural network transformers.
Flowette: flow matching generative model for graphs with recurring motifs using graph neural network transformers.
Evidential neural radiance fields framework for uncertainty estimation in 3D scene reconstruction and novel view synthesis.
CycleBEV: regularization framework using cycle consistency to improve bird's-eye-view semantic segmentation for autonomous driving.
BRIDGE: data augmentation method mitigating bias amplification in automated scoring systems for English Language Learners.
SDMixer: sparse dual-stream framework for multivariate time series forecasting handling multi-scale features and noise.
Hyperdimensional alignment method for frozen vision-language models enabling efficient image captioning without fine-tuning.
Pseudo contrastive learning method to improve diagram comprehension in multimodal models through fine-grained structural sensitivity.
KEEP: KV-cache-centric memory management system for efficient embodied planning with memory-augmented LLMs.
LFQA-HP-1M: 1.3M human preference annotations dataset for long-form QA with quality rubrics outperforming LLM evaluators.
LLM-driven synthesis of multi-turn task-oriented dialogue datasets for evaluating LLM reasoning in realistic scenarios.
Benchmark study on multimodal learning fusion of EHR and chest X-rays for clinical decision support under missingness and fairness constraints.
ReDON: recurrent diffractive optical neural processor using optical domain computation with nonlinear responses.
DLEBench: evaluation benchmark for instruction-based image editing models focusing on small object editing capabilities.
FlexGuard: continuous risk scoring framework for LLM content moderation with adaptive strictness levels across platforms.
FedRot-LoRA addresses rotational misalignment in federated fine-tuning of LLMs on decentralized data to reduce aggregation error.
AudioCapBench: benchmark for evaluating audio captioning of multimodal LLMs across sound, music, speech with 1,000 samples and LLM-as-Judge evaluation.
MedMAP pre-training framework for vision-language models on 3D MRI data with modality-specific alignment for multi-organ abnormality detection.
ProtoDCS framework for test-time adaptation of vision-language models under distribution shift in open-set scenarios.
TRIZ-RAGNER applies retrieval-augmented LLMs for named entity recognition in patent contradiction mining for systematic innovation.
Blockchain-based routing protocol for low-altitude intelligent networks (LAINs) with UAVs to improve security and stability.
Sociological analysis of AGI rhetoric in essays by OpenAI and Anthropic leadership, examining corporate sociotechnical imaginaries.
Multimodal gesture recognition system for hands-free teleoperation of drones and mobile robots using log-likelihood ratio fusion.
Analysis of transformer training geometry showing parameter updates organize into dominant drift direction with oscillatory transverse dynamics.
SAGE-LLM architecture combining LLMs with formal safety verification (Fuzzy-CBF) and graph-structured knowledge for safe UAV autonomous decision-making.
System design for distributed LLM inference across device, RAN-edge, and cloud tiers with latency constraints for 5G embodied AI applications.
Agent-centric benchmarking paradigm where autonomous agents dynamically generate, validate, and solve problems to evaluate LLM reasoning capabilities.
BDGxRL uses Diffusion Schrödinger Bridge to enable cross-domain reinforcement learning policy transfer when target domain interaction is unavailable.
UPath proposes learning-based heuristics for A* pathfinding on grid maps using deep neural networks across heterogeneous topologies.
MPU framework for privacy-preserving machine unlearning in LLMs without sharing server parameters or client forget sets.
Research on applying causal discovery algorithms to real-world longitudinal data with institutional workflow constraints.
Sea² framework using vision-language models and autonomous agents for unsupervised cross-domain visual adaptation without fine-tuning on target data.
Theoretical research on offline reinforcement learning with general function approximation and parametric policies, extending beyond finite action spaces.
Q-learning approach for learning safe policies from expert demonstrations with unknown constraints and non-observable costs.
Federated learning optimization achieving consistency between local and global model flatness under data heterogeneity.
Continual fine-tuning method for LLM-based vulnerability detection addressing catastrophic forgetting under temporal distribution shift.
Multi-layer intrusion detection framework with incremental learning for Industrial IoT networks handling novel cyber threats.
Benchmark study evaluating feature set transferability across IoT datasets for cross-domain intrusion detection.
Scientific modeling approach for identifying sustainable ingredient combinations in personal care formulations.
Adversarial benchmark testing MLLM visual reasoning and grounding capabilities in referring expression comprehension tasks.
Multi-agent cascaded framework for breast ultrasound screening reducing unnecessary biopsy referrals through selective decision-making.
Transferability estimation method for selecting optimal medical foundation models for segmentation without retraining.
Reinforcement learning approach balancing cold-start latency and carbon emissions in serverless computing.
Multimodal benchmark evaluating MLLMs on explicit 3D geometric reasoning with point clouds, exposing geometric hallucinations.
Hierarchical concept embedding models improving neural network interpretability through human-readable concept representations.
Benchmark for agentic search systems balancing quality and efficiency, addressing underspecified user preferences in LLM-powered retrieval.
Dual-branch feature extraction network with attention for recognizing micro-expressions in video.
Sequential hierarchical network for EEG/MEG signal processing applied to speech detection from brain activity.
Controlled experimental studies identifying what triggers and prevents sycophancy in LLMs, an alignment failure in advisory contexts.
Fully intrinsic neural network architecture using Lorentz geometry for representing hierarchical data structures.
Multimodal knowledge transfer approach applying imaging and speech analysis for early Alzheimer's screening.