Training-free Adjustable Polynomial Graph Filtering for Ultra-fast Multimodal Recommendation
Graph filtering method for multimodal recommendation systems without training overhead. Addresses computational efficiency in recommender systems.
Graph filtering method for multimodal recommendation systems without training overhead. Addresses computational efficiency in recommender systems.
Study redefining non-IID data heterogeneity in federated learning by migrating from label to embedding-level task-specific distributions.
Learning dynamically-inspired bases for Koopman and transfer operator approximation in complex nonlinear dynamical systems.
CounterLogic benchmark evaluating LLM reasoning in counterfactual scenarios where context contradicts parametric knowledge.
Video dataset condensation approach preserving intrinsic coupling of spatial appearance and temporal dynamics.
Method for text-to-image diffusion models to handle contextually contradictory prompts where concepts implicitly negate each other.
Large-scale benchmark with 1,507 real-world vulnerabilities evaluating AI agents' dynamic cybersecurity capabilities at scale.
NeuroSTORM foundation model for fMRI analysis learning generalizable representations with improved transferability.
Industrial conveyor belt crack detection dataset with sequential images and triple-domain feature learning baseline.
Framework for valid statistical inference combining model predictions on unlabeled data with bias correction from labeled subset.
Masked conditional generative model for peptide discovery that predicts aggregate morphology for biomedical material design.
BuilderBench benchmark for evaluating intelligent agents' ability to learn through interaction and exploration beyond training data.
Reinforcement learning approach using information gain-based rewards to optimize LLM agents for multi-turn search with tool use.
Clarifies relationship between Riesz regression and density ratio estimation in causal inference problems.
Graph neural network method using mixture of ego-graphs for contrastive learning in multi-view clustering.
Application of diffusion models to semantic communications in 6G wireless systems for meaning-centric data transmission.
Multimodal model addressing modality imbalance and noise in e-commerce product understanding with dynamic balancing.
Evaluation of self-supervised representations for audio-visual deepfake detection across modalities and domains.
Framework for incorporating inference delays into diffusion policy learning for robotic control in dynamic environments.
Analysis of positional encoding impact on Transformer generalization and robustness in in-context regression, showing PE enlarges generalization gap.
Approach using adaptive probability flow for solving high-dimensional Fokker-Planck equations in computational physics and stochastic dynamics.
ASK framework addresses gradient locality bottleneck in audio-text retrieval by incorporating external knowledge injection in dual-encoder architectures.
Study on brain passage retrieval using EEG signals for information retrieval without text translation, extending from visual to audio stimuli.
1S-DAug introduces one-shot generative data augmentation synthesizing diverse image variants from single examples for improved few-shot learning generalization.
Theoretical study of non-clashing teaching algorithms and complexity bounds for machine teaching in graph structures.
KDFlow is a knowledge distillation framework for compressing large language models using heterogeneous training backends for student and teacher models.
Survey introducing reinforcement learning methods to economists for solving high-dimensional dynamic programming problems in economic modeling.
Method automating metadata curation for museum audiovisual archives using multimodal grounding in existing collection databases.
Research using foundation model surrogates with active learning for materials discovery, reducing experimental cycles needed for optimal material identification.
NCCL EP presents a unified expert parallel communication API built on NCCL for GPU-initiated RDMA operations in Mixture-of-Experts LLM architectures.
Deep Adaptive Model-Based Design of Experiments combines deep learning with adaptive sequential design optimization for efficient nonlinear dynamical system parameter estimation.
Study on multi-agent routing architectures identifying how failure propagation differs in tree-like versus cyclic execution graphs for AI reasoning systems.
Research exploring agentic frameworks with domain-specific tools for Verilog code generation, comparing impact versus traditional LLM approaches.
Study analyzing chain-of-thought faithfulness evaluation in LLMs across 12 models, showing measurement methodology significantly affects reported faithfulness percentages.
Research demonstrating mathematical isomorphism between ant colony decision-making and random forest ensemble learning under stochastic ensemble intelligence framework.
TimeTox is an LLM-based pipeline using Google's Gemini to automatically extract time toxicity metrics from clinical trial protocol documents.
GitHub Actions tool for AI agents enabling faster CI feedback loops with mocked runners, caching, and agent-driven test fixes without pushing.
Shopify launches Agentic Storefronts allowing merchants to sell through ChatGPT, Copilot, Google Search, and Gemini via centralized management.
GolfStudent v2: 24M parameter LLM compressed to 15MB using GPTQ-lite quantization and Muon optimizer with efficient architecture.
AegisFlow: Open-source AI gateway in Go providing routing, security policies, rate limiting, cost tracking, and observability for LLM providers.
Overview of AI training projects on Alignerr platform including Prism, Code Human, and Rainforest with details on participation.
AI video ad generator transforming product URLs into structured video ads for e-commerce platforms like Shopify and Amazon.
Post-mortem analysis of failed AI podcast app identifying issues with podcast content discovery and user engagement expectations.
Investigation of Intel's Binary Optimization Tool potentially inflating Geekbench 6 scores through undocumented instruction optimization techniques.
Headless virtual terminal tool enabling AI agents to operate interactive TUI applications without GUI, with example of agent playing NetHack.
AI agent consumer application that autonomously dates on user's behalf via chatting with matches. Tool but limited technical depth.
Show HN post for AgentVerse, an open social network for AI agents. Title only, minimal detail provided.
Discussion of cultural differences between Americans and Europeans. Not AI/tech related.
Go sidecar process manager for cleaning up orphaned stateful processes from Puppeteer/LLMs to prevent memory leaks and OOM crashes.
Comparison of MATLAB alternatives including Octave, Julia, and Python with focus on GPU acceleration and browser-based computing.