Fair Representation in Parliamentary Summaries: Measuring and Mitigating Inclusion Bias
Evaluates 5 LLMs on fairness and inclusion bias in summarizing parliamentary proceedings, measuring representation gaps across demographic groups.
Theoretical study of zeroth-order query complexity for sampling from logconcave distributions using only function evaluations, no gradient information required.
Gauss-Newton reinforcement learning method for model predictive control offering second-order convergence with faster training than first-order RL methods.
HEAS framework for agent-based simulation combining hierarchical evolution with metric standardization to improve multi-objective policy search in complex systems.
Uses LLMs to analyze 150+ years of German parliamentary debates on migration, demonstrating constraint-free large-scale political text analysis without manual annotation.
LLM-based framework for loop invariant synthesis to accelerate program verification with sound evaluation methodology.
Study of adversarial robustness in conformal novelty detection using learning-based frameworks with FDR guarantees.
Method for adapting coverage levels in conformal prediction based on individual sample characteristics.
Framework for linear contextual bandits leveraging pretrained models for feature imputation in partially observed contexts.
Safe reinforcement learning approach using adaptive action scaling to reduce constraint violations during training.
Sentiment-guided augmentation technique for multimodal sentiment analysis addressing data scarcity in video, audio, and text.
System for reducing remote video inference latency through on-device correction with lightweight models for robotics and edge devices.
Multi-agent framework using LLMs for interpreting gene clusters from RNA-seq data in antimicrobial resistance research.
Quantum classifier using Hamming distance measurements with classical post-processing for improved noise robustness.
Geometric approach to post-hoc debiasing in vision-language models that treats bias as a subspace rather than as individual coordinates.
Self-supervised learning framework using masked autoencoders to learn view-invariant representations from multi-view radiology data.
Deterministic world models for verification of vision-based control systems avoiding stochastic latent variable overapproximation.
Reinforcement learning approach to controlling the stylized motion of an animated character robot using animation references.
Lightweight transformer model for joint AP clustering and power allocation in cell-free massive MIMO networks.
Benchmark studying preprocessing and generative models (StyleGAN2, Diffusion) for synthetic dermoscopic image augmentation in melanoma diagnosis.
Framework using LLMs to refine semantic information in graph representations for improved learning across diverse graph domains.
Framework for contextual distributionally robust optimization using causal Wasserstein distance for decision-making under uncertainty.
Novel video re-rendering method using 4D reconstruction models to generate new camera trajectories from monocular video.
Theoretical ML research on density estimation using KL divergence with finite dictionaries and mixture models.
GR4AD: production-oriented generative recommender system for large-scale advertising with custom architecture and serving.
PC-LLM applies pre-trained LLMs as a relational reasoning backbone for wireless power control optimization.
SEAnet deep learning architecture using embedding approximation for similarity search on large data series.
Hybrid Hidden Markov framework for synthetic financial time series generation preserving statistical properties.
Multimodal AI system for automated museum audiovisual metadata curation using catalogue grounding.
Balanced thinking method addressing overthinking and underthinking in Large Reasoning Models for efficient deployment.
NCCL EP library for unified Expert Parallelism communication in Mixture-of-Experts LLM training and inference.
Resource-aware RL framework for embodied robotic agents to optimize when LLM reasoning is invoked during task execution.
NeuroNarrator foundation model for EEG-to-text clinical interpretation using spectro-spatial grounding.
Integration of HPC, ML, and quantum computing for drug discovery applications to improve molecular simulation.
Theoretical analysis of adversarial learning with graph-structured target distributions using interpolative divergences.
Theoretical framework for data-driven smoothing and forecasting algorithms in dynamical systems.
Benchmark study comparing deep learning efficiency across models and discussing GPU resource accessibility trends.
Bytecode virtual machine approach for efficient dynamic tensor computation with flexible shape and control flow handling.
Analysis of linguistic shifts in arXiv papers attributable to LLM usage patterns and limitations in model classification.
D-SPEAR algorithm improves reinforcement learning stability in robotic manipulation through prioritized experience replay.
MemRerank framework distills user purchase history into preference signals for personalized product reranking in LLM shopping agents.
Adaptive reasoning approach for LLM code generation that allocates thinking throughout implementation rather than upfront.
Energy-based models for stable system identification in physical dynamics using Lyapunov functions.
MyPhoneBench framework for evaluating privacy behavior of mobile phone-use AI agents during task execution.
arXiv abstract about reasoning order in LLM decision-making processes.
Framecraft: tool enabling LLMs to generate demo videos via HTML Canvas, MCP servers, and automated rendering from prompts.
Ubik Studio: local file analysis tool combining notebook-style interaction with multi-hop reasoning and approval workflows for knowledge workers.
Supply-chain security tool detecting and surfacing diffs for Claude Code plugin auto-updates. Security analysis for plugin ecosystems.
Newsletter discussing vertical integration pattern in AI application companies.
MCP server for structured NPC dialogue generation with emotion tags. Works with Ollama locally. Game dev tool for LLM agents.