Show HN: Own the Void – a trillion-cell infinite canvas
MinIO repository maintenance status update. Infrastructure software notice without AI relevance.
MinIO repository maintenance status update. Infrastructure software notice without AI relevance.
Philosophical discussion on AI identity and consciousness. Conceptual rather than practical for development.
European rocket launch announcement unrelated to AI or machine learning.
Xiaomi-Robotics-0: open-sourced vision-language-action model for robotics with cross-embodiment pretraining and optimized real-time execution.
Tool helping developers select appropriate local LLMs for specific hardware configurations, addressing practical deployment needs.
Claude Code skill adds recovery mechanism for stuck AI reasoning. Practical LLM application improving agent behavior.
Infinite canvas web application. Web development project using Cloudflare and SQLite; not AI-related.
Personal AI infrastructure systems. Relevant to deploying and running AI models locally or independently.
OpenAI claims DeepSeek distilled US models for training. Relevant to LLM development and model training practices.
Voice notification integration for code editors. Tooling accessory without AI methodology or research value.
Open-source community maintenance discussion. Relevant to ecosystem health but lacks technical content.
Analysis of when GPU acceleration degrades performance in ML workloads. Applicable context but not specific to core interests.
Discusses constraints as a approach to managing AI-generated content quality and authenticity.
RL-based sim-real co-training method for vision-language-action models that combines simulation and real-world robot data using reinforcement learning instead of supervised fine-tuning.
Novel approach to neural network design combining architectural search with fine-grained modifications using edit-effect evidence for performance optimization.
Multi-agent LLM framework enables dynamic task routing and bidirectional feedback for scalable collaborative document understanding in complex domains.
Comparative analysis of regulatory and ethical frameworks for LLM deployment in education across EU, UK, US, China, and Gulf regions.
MCPSecBench formalizes security for Model Context Protocol connecting LLM agents to data sources and external tools with systematic threat modeling.
MedQARo benchmark evaluates LLMs on 105,880 Romanian medical QA pairs requiring keyword extraction and clinical reasoning.
AI agents use optimized reasoning for automated software vulnerability detection with synthetic dataset generation addressing label noise problems.
Binary autoencoders improve mechanistic interpretability of LLMs by extracting sparse, atomized features from hidden states.
DistillKac uses damped wave equations for fast image generation, enforcing finite speed transport unlike diffusion models.
CoSpaDi compression method for LLMs using calibration-guided sparse dictionary learning as training-free alternative to low-rank approximation.
Differentially private gradient descent algorithm for instrumental variable regression protecting sensitive covariate data.
Multi-agent LLM system for automatic HPC code generation and tuning on supercomputers using iterative prompting.
KVComm framework enabling efficient communication between LLMs in multi-agent systems via selective key-value cache sharing.
Identifies Alignment Tipping Process risk in self-evolving LLM agents that abandon alignment constraints through real-world adaptation.
Theoretical analysis of RLVR with binary feedback for LLM post-training, introducing Gradient Gap concept for understanding training dynamics.
Experimental study testing whether exposure to narrow AI tools enhances cognitive abilities or merely optimizes task performance.
Test-time alignment of LLMs via sampling-based optimal control in pre-logit space without fine-tuning computational costs.
Efficient sparse local attention mechanism for high-resolution images using Hilbert curve ordering to improve token contiguity.
Symbolic regression framework incorporating symbolic equivalence via equality graphs to reduce search space in scientific discovery.
Benchmark evaluating seven modern LLMs including GPT-4, Claude, LLaMA, Mistral on low-resource and morphologically rich languages.
Graph neural network architecture combining mixture-of-experts with adaptive routing for improved performance on graph-structured data.
Hierarchical latent diffusion models for generating phonocardiogram signals from clinical metadata for cardiovascular diagnosis.
MapReduce LoRA and Reward-aware Token Embedding methods for multi-preference optimization in generative models addressing alignment trade-offs.
Framework for removing multiple identities from 3D generative models without retraining, addressing consent and model unlearning.
Medical image segmentation using vision foundation models with uncertainty-informed collaborative learning for semi-supervised tasks.
Analytical model for LLM inference-time scaling using Bayesian linear regression with reward-weighted sampling to understand test-time computation.
Mechanistic analysis of Vision Transformers introducing Block-Recurrent Hypothesis to interpret depth as dynamical computational flow.
Physics-informed neural networks for solving electromagnetic wave propagation equations, comparing mesh-free PINNs to traditional FDTD/FEM methods.
Negotiation framework for vehicle-to-building charging that guarantees voluntary participation, strategy-proofness, and budget feasibility under uncertainty.
Framework for pixel-level visual reasoning in medical multimodal LLMs using reinforcement learning for biomedical object referring and segmentation without catastrophic forgetting.
Automated dataset construction and query-side adaptation for multi-tenant search systems using dark data from query logs without full corpus re-indexing.
Self-evolution approach with iterative generate-verify-refine cycles for code optimization, improving exploration efficiency and experience utilization to discover superior algorithmic complexity.
Budget-conscious data acquisition strategy for allocating annotation budgets between ground-truth labels and pairwise preferences using semi-parametric inference.
Framework enabling LLMs to explore code sandboxes for general agentic intelligence without fine-tuning, leveraging file systems and external resources for non-code tasks.
Examines self-preference bias in LLM evaluators, identifying methodological confounds that distort measurement of narcissism in model-based automated evaluation workflows.
Analyzes human susceptibility to LLM hallucinations and disinformation using structural causal models across GPT-4, Llama-2, and other foundation models with 918 evaluations.
Combines 3D Gaussian splatting with physics engines to predict physically plausible video dynamics from visual data, reducing computational costs.