Subsampling Factorization Machine Annealing
Algorithm combining quantum computing and machine learning for hybrid optimization in combinatorial problems.
Theoretical analysis of quantum generative adversarial networks with pure state generators, finding poor generalization across datasets.
Python package for physics simulations of quantum dot devices, addresses ML dataset collection challenges for quantum device calibration and operation.
Action-prompted video segmentation framework for embodied AI that handles label noise and multimodal inconsistencies in object interaction segmentation.
Analysis of best-of-N ensemble selection for LLMs using majority voting in the limit of infinite N, with an adaptive generation scheme to reduce inference cost.
Foundation model for EEG signal representation using neural topological architecture inspired by biological mechanisms rather than vision/NLP adaptations.
Benchmarking study of 8 foundation models on ECG interpretation across 26 clinical tasks, examining architecture generalization and scaling with limited labels.
Algorithms using kernel density estimation for faster kernel matrix linear algebra with applications to matrix operations.
FLOWR.root: SE(3)-equivariant flow-matching foundation model for 3D ligand generation and binding affinity prediction.
Geometric framework modeling LLM reasoning as flows in representation space to study how models internalize logical structure.
ToMCLIP: method for topological alignment of vision-language embedding space in multilingual contrastive models.
COGS: composition-grounded data synthesis for improving visual reasoning in MLLMs on artificial domains like charts and documents.
Theoretical analysis of implicit bias properties of the Jordan-Kinderlehrer-Otto scheme for Wasserstein gradient flow.
ceLLMate: sandboxing approach to protect browser-using AI agents from prompt injection attacks and unintended actions.
Iterative algorithm for constructing deterministic epsilon-coreset with Lp subspace embedding guarantees.
DevRev: automated dataset construction and query-side adaptation for large-scale multi-tenant search systems without curated labels.
Measurement-Consistent Langevin Corrector stabilizes latent diffusion models for inverse problem solving.
Automated pipeline for cervical spine fracture detection using 3D masks derived from 2D segmentations.
Statistical framework for synthetic data augmentation in imbalanced classification, analyzing when augmentation helps and optimal sample generation.
NRR-Phi: formal framework for text-to-state mapping that preserves ambiguity in LLM inference rather than early semantic commitment.
Framework for deploying language models with least-privilege security principle, limiting capability exposure per request.
SureLock: optimization technique that stops computation for converged tokens in masked diffusion language models to reduce redundant compute.
Learning-guided approach to the Kansa collocation method for solving forward and inverse problems and for discovering PDEs, extending beyond linear cases.
Mathematical study of homology of ample groupoids via Moore complex and topological abelian groups.
Framework for solving PDEs and inverse problems using sinusoidal random Fourier features with analytical derivatives, avoiding automatic differentiation.
Chimera: neuro-symbolic framework mapping neural attention computations onto programmable network dataplane for trustworthy line-rate traffic analysis.
DRESS: deterministic framework iteratively refining graph structure to produce isomorphism-invariant edge fingerprints via dynamical systems.
Production-oriented generative recommender system co-designed for real-time large-scale advertising with novel architecture and serving strategies.
AMA-Bench: benchmark for evaluating long-horizon memory capabilities in LLM-based autonomous agents beyond dialogue interactions.
Theoretical work on causal identification from counterfactual data, extending completeness results to Layer 3 of Pearl's Causal Hierarchy.
CMI-RewardBench: benchmark for evaluating music reward models handling multimodal inputs combining text, lyrics, and reference audio.
Narrative graph annotation framework using qualitative content analysis principles to improve annotation quality for NLP tasks.
Tensor factorization method for fine-grained evaluation of generative models at prompt level, reducing human annotation costs.
Framework for federated inference enabling privacy-preserving collaboration between independently trained models at inference time without sharing parameters.
Research on early quality assessment for text-to-image diffusion models, proposing efficient evaluation metrics to reduce computational costs.
Proposal for a website that uses LLMs to solve Knuth's problem set, serving as a comprehensive LLM evaluation benchmark.
DOJ policy proposal limiting state bar ethics probes into department attorneys.
AI-generated custom audio drivers optimizing hardware integration by eliminating unnecessary abstraction layers.
Technical overview of GitHub Copilot's model hosting infrastructure via OpenAI and Azure with data privacy details.
Explores tradeoffs of AI coding tools in software engineering, discussing where they excel and their reliability limitations.
Critical analysis of LLM hype in software development, examining actual productivity gains versus marketing claims.
Open source CLI tool using multi-model adversarial debate for comprehensive code review. Supports Claude, Gemini, Qwen, and custom LLM providers.
Catalog of linguistic patterns in LLM-generated text, documenting overuse of em-dashes and specific syntactic structures like negation-reframe constructions.
Former Block DevRel discusses observations on LLM coding agents and multi-agent systems becoming prevalent in software development.
UK regulatory inquiry into Meta's practice of having workers review sensitive video content from AI smart glasses.
Book on using PostgreSQL with pgvector for vector search, RAG pipelines, and in-database ML with production patterns and implementation examples.
TurboCast converts YouTube videos and articles into AI-generated podcasts with transcription and text extraction features.
Microsoft security research on AI recommendation poisoning attacks where hidden instructions injected via URLs manipulate LLM outputs for profit.
Announcement, with minimal details, that AI coding startup Cursor has reached a $2B annual sales rate.
Parody YC accelerator concept for AI agents with humorous take on agent capabilities and constraints.