DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone
DiffuMamba: diffusion language model with Mamba backbone for efficient masked sequence modeling, achieving linear-time complexity vs Transformer quadratic overhead.
DiffuMamba: diffusion language model with Mamba backbone for efficient masked sequence modeling, achieving linear-time complexity vs Transformer quadratic overhead.
Analysis of stochastic gradient descent-based unlearning algorithms (D2D and R2D) with provable guarantees for removing training data impact.
Multi-agent reinforcement learning approach using attention for automated feature transformation in structured data processing.
Method for monitoring LLM API consistency over time by tracking log probability changes to detect undisclosed model updates.
Rough sets method for explaining results of spectral graph clustering algorithms applied to document data.
Study on membership inference attacks for extracting training data from LLMs, demonstrating privacy risks through coordinated extraction and verification techniques.
Generalized Primal Averaging (GPA) optimizer for faster LLM training, extending Nesterov's method and unifying recent averaging-based approaches like DiLoCo.
Trust region masking technique for LLM reinforcement learning to address off-policy divergence in policy gradient training of large language models.
CSyMR benchmark for evaluating LLMs on compositional music information retrieval tasks requiring multi-step reasoning over symbolic music notation.
DUET method for LLM unlearning using distilled teacher models to remove undesirable knowledge efficiently while avoiding catastrophic forgetting.
Novel convex loss functions for SVMs in binary classification and regression with mathematical derivations and small-scale experiments.
Federated-inspired batch correction for single-cell RNA sequencing without centralizing high-dimensional datasets.
Position paper proposing agentic framework for time series forecasting with iterative refinement and adaptation.
KV-cache quantization to 2-bit precision enabling long video generation on resource-constrained hardware.
Machine unlearning method addressing superficial forgetting by targeting core feature representations rather than logits.
Online learning framework for training robust classifiers under adversarially chosen clean data and labels.
Pretrained variational bridge for efficient molecular dynamics trajectory generation across diverse molecular systems.
Automated detection pipeline for unverbalized biases in LLM chain-of-thought reasoning without predefined categories.
Performance characterization framework for small language models on edge devices using Roofline model analysis.
Benchmarking framework for time series foundation models addressing data quality, task alignment, and evaluation rigor.
ECG language model for cardiac event forecasting and report generation from electrocardiogram recordings.
Framework for measuring propensities (behavioral tendencies) in AI models beyond capability assessment using Item Response Theory.
Theoretical analysis of gradient descent convergence rates under large step sizes in separable logistic regression.
Heterogeneous graph neural network model predicting high-potential small/medium enterprises using public data.
Discrete diffusion framework using sample-efficient conditional probability estimators for discrete state space generation.
Analysis showing test-time training with KV binding functions as learned linear attention rather than memorization.
Federated learning aggregation method (FedVG) using gradient guidance to address client drift and data heterogeneity.
Safety filtering framework for flow-based generative models with formal guarantees on constraint satisfaction.
Strategic risk aversion approach for training collaborative AI agents that generalize better with new partners.
Adversarial reinforcement learning dataset (AOT-SFT) to improve robustness of multimodal LLMs on visually complex scenes.
Maps failure regions in LLMs using MAP-Elites quality diversity search to characterize unsafe behaviors and vulnerabilities.
Energy-based theory for detecting concept drift in ECG signals, distinguishing physiologically plausible variation from true distribution shift.
Regularized online RLHF for Nash Equilibrium identification with generalized bilinear preferences modeling intransitive preferences.
ParamMem augments language agents with parametric reflective memory to improve reasoning through diverse self-reflection.
Theoretical framework for stationary kernels and Gaussian processes on Lie groups and homogeneous spaces in compact case.
Assesses quality of deep learning models for spatio-temporal prediction with missing and heterogeneous data.
Kernel spectral joint embeddings using duo-landmark integral operators for integrative analysis of noisy high-dimensional datasets.
Studies spectral-stimulus information for self-supervised encoding of spatial navigation in neurons, exploring place cell population coding.
Analyzes polynomial scaling complexity for neural operator approximations of structured families of backward stochastic differential equations.
FinBloom presents a knowledge-grounding approach for LLMs to handle real-time financial queries with live data integration.
Uses discrete optimal transport with barycentric projection for voice conversion by aligning speaker embeddings.
Apple proposes general active perception via reinforcement learning to handle uncertainty in partially observable robotic environments.
Interview study of ML practitioners at Big Tech examining how fairness is approached in real-world recommender systems deployment.
REA-RL uses online reinforcement learning with reflection to reduce overthinking and inference costs in large reasoning models.
Applies quantum learning methods to coordinate energy distribution networks and communities using price signals.
CoMind introduces MLE-Live, a framework for evaluating LLM agents that engage with research communities in ML engineering tasks.
Uses Explainable Boosting Machines to identify overshooting tops in satellite imagery for weather forecasting, emphasizing interpretability in high-stakes ML.
pFedMMA proposes personalized federated fine-tuning with multi-modal adapters for vision-language models like CLIP on decentralized heterogeneous data.
AMBER-AFNO architecture using Adaptive Fourier Neural Operators for lightweight 3D medical image segmentation.
20 fine-tuned Llama-3.1 8B variants specialized for High-Energy Physics, trained on arXiv abstracts with comparative domain analysis.