Breaking the Factorization Barrier in Diffusion Language Models
Research on diffusion language models addressing the factorization barrier to enable efficient parallel token generation.
Research on diffusion language models addressing the factorization barrier to enable efficient parallel token generation.
OrthoAI combines 3D tooth segmentation with biomechanical reasoning for clear aligner orthodontics using sparse-supervision learning.
Dual-pipeline bird image segmentation framework combining Grounding DINO 1.5, YOLOv11, and SAM 2.1 for zero-shot and supervised segmentation.
Pri4R approach equipping Vision-Language-Action models with implicit world dynamics understanding through privileged 4D representation learning.
GOME: MLE agent framework replacing tree search with gradient-based optimization for machine learning engineering tasks using LLM reasoning.
Coordinated Boltzmann MCTS for decentralized multi-agent planning, replacing deterministic UCT with stochastic Boltzmann policy for sparse reward environments.
Communication mechanism (RMHA) for decentralized multi-robot path planning using attention over Manhattan distances for spatial-aware coordination.
Training-free method (PlaneCycle) for lifting 2D foundation models to 3D volumetric data without adapters or retraining via cyclic spatial aggregation.
Study of grokking phenomenon through architectural modifications, identifying topology-based degrees of freedom that influence memorization vs. generalization phases.
Analysis of performative chain-of-thought in reasoning models, showing hidden beliefs diverge from generated reasoning tokens at task-specific difficulty levels.
Methods for eliciting truthful outputs from censored LLMs using honesty elicitation and lie detection, tested on models trained to conceal information.
OptiRoulette meta-optimizer that dynamically selects update rules during training for faster convergence, packaged as PyTorch-compatible component.
GameVerse benchmark evaluating Vision-Language Models' ability to learn from video-based reflection and self-correction in game environments.
Multi-objective preference alignment approach for protein sequence design balancing designability with competing properties like solubility and thermostability.
Method for improving Vision-Language Models for ophthalmic diagnosis by injecting domain-specific knowledge to address perception and reasoning gaps.
Layered Governance Architecture (LGA) framework addressing execution-layer vulnerabilities in LLM-based autonomous agents: prompt injection, retrieval poisoning, tool invocation.
Adversarial training approach for robust reinforcement learning policies in partially observable domains with latent distribution shift.
Tunable-complexity priors for diffusion models, normalizing flows, and VAEs to solve inverse problems with flexible representation capacity.
Decision-theory framework for designing probabilistic weather forecasts tailored to heterogeneous farmers' circumstances for agricultural decision-making.
Test-time adaptation method using spectral experts in Vision Transformers to handle distribution shift with minimal parameter updates.
Clinical feasibility study of LLM-based conversational AI (AMIE) for patient diagnostic history-taking in real-world primary care workflows with safety assessment.
PostTrainBench benchmark evaluating whether LLM agents can automate post-training of base LLMs into assistants.
OWO-FMTL framework for fair multi-task learning in AI-enabled radio access networks with equitable user performance.
HCAPO framework addressing credit assignment in long-horizon LLM agent tasks using hindsight and value baseline alignment.
SPREAD framework for lifelong imitation learning preserving task representation manifolds across sequential skill acquisition without catastrophic forgetting.
Multi-level meta-reinforcement learning with skill-based curriculum for hierarchical sequential decision-making through MDP compression.
Temporal Markov Transition Field extension handling non-stationary time series by tracking regime changes instead of using global transition matrix.
SoftJAX and SoftTorch extensions enabling informative gradients for hard primitives like thresholding and discrete operations in automatic differentiation frameworks.
GenGNN modular framework for discrete graph generation achieving competitive validity with graph Transformers at 2-5x faster inference speed.
Study of hybrid sequence models combining Transformers and state-space models, analyzing expressivity-efficiency tradeoffs and mechanisms for performance benefits.
Feature selection method for hybrid information systems using fuzzy rough set theory to reduce dimensionality in big data scenarios.
Comprehensive ablation of confidence bounds for selective prediction with risk control, introducing Transfer-Informed Betting for cross-domain uncertainty quantification.
Study of memorization and privacy risks in genomic language models trained on sensitive DNA sequence data, assessing data leakage concerns.
Strong Lottery Ticket method using continuously relaxed Bernoulli gates to find sparse subnetworks in over-parameterized networks without weight training.
Analysis of Mixture-of-Experts inference inefficiency identifying double penalty from microbatch fragmentation and reduced KV cache headroom during decoding.
Semantic Level of Detail introduces continuous resolution control for knowledge graphs via heat kernel diffusion, enabling agents to navigate abstraction levels.
MAcPNN proposes mutual assisted learning for IoT edge devices handling streaming data with temporal dependencies and concept drift without catastrophic forgetting.
MAPLE framework improves medical LLM reasoning by replacing majority voting with process-led alignment in test-time reinforcement learning for clinical decision-making.
Flatness measure for CNN generalization based on Hessian spectrum accounting for convolutional architecture-specific geometric structure.
CALIPER: data-only detector estimating post-drift data size sufficiency for stable model retraining after concept drift in data streams.
Distributed scientific ML approach co-guiding models with hardware and physics constraints for edge processing without central data aggregation.
SCALAR framework coupling LLM planning with deep RL through learned skill libraries, enabling agents to ground language into low-level control with iterative correction.
Simulation-to-decision learning framework addressing prediction errors in simulators through adversarial calibration for reliable policy training in industrial domains.
Dynamic multi-period experts framework for online time series forecasting handling recurring and emergent concept drift.
Learning adaptive decoding policies for LLMs that dynamically select sampling strategies based on prompt difficulty and compute.
Exclusive self-attention mechanism constraining attention to orthogonal information for improved Transformer language modeling.
PPO-based optimization for vehicular edge computing with reconfigurable intelligent surfaces and semantic communication.
Finetuned LLM sentiment analysis from news for aluminum price forecasting under different market conditions.
Machine learning model for predicting voltage hysteresis in silicon-graphite EV batteries using uncertainty quantification.
Method to decouple reasoning from confidence in LLM RL training, addressing over-confidence calibration issues.