Show HN: Osint of I-80 for EV site selection. Finding 10MW spots
Appears to be spam/corrupted content about Airtable and EV site selection with mixed advertising elements.
Appears to be spam/corrupted content about Airtable and EV site selection with mixed advertising elements.
WebGPU benchmarking tool with minimal description.
Emscripten system library enabling WebGPU access from C/C++ code compiled to WebAssembly.
Agentmatic is an AI agent platform that generates full marketing campaigns from prompts with persistent brand memory across sessions.
Memsearch provides persistent cross-session semantic memory for AI coding agents with zero-configuration plugin installation.
Ask HN: Users share experiences selecting LLM models for agentic software development lifecycle with specific use cases.
SideX: Tauri-based VS Code port replacing Electron with native backend, 96% smaller with early-stage open source development.
Analysis of 138 practitioner conference talks examining how companies adopt AI agent architectures, architectural patterns, and LLM-driven agentic system implementation.
arXiv paper: Diversity-aware RKL divergence improves LLM distillation by focusing on dominant modes in teacher-student training.
MAC-Attention: acceleration technique for long-context LLM decoding by reusing prior attention computations for semantically similar tokens without compression.
REM-CTX: RL-based peer review system using 8B LLM with Group Relative Policy Optimization, incorporating visual figures and scholarly context.
Apprenticeship learning approach for inducing pedagogical policies from imperfect, evolving student demonstrations in e-learning environments.
Systematic evaluation of LLMs for educational essay scoring across holistic and analytic rubrics, analyzing human alignment and bias.
QAsk-Nav benchmark for evaluating embodied agents combining navigation and dialogue-based question-asking for collaborative object finding tasks.
Energy-based models framework for physical system identification with formal stability guarantees, applying to Port-Hamiltonian dynamics.
Research on reducing modality gap in Vision-Language Models like CLIP through geometric analysis to improve cross-modal tasks like captioning and clustering.
VeriAct: agentic system for synthesizing correct and complete formal specifications using LLMs beyond just verifier-passing output.
SANA-I2I: text-free flow matching framework for paired image-to-image translation with application to fetal MRI artifact reduction.
Asymmetric Actor-Critic method for improving reliability of multi-turn LLM agents in one-shot settings without requiring model retraining.
CASA: conditional decoding strategy for robust multimodal safety in MLLMs against cross-modal attacks.
Prompt-guided image compression for Vision-Language Models optimized for downstream VLM tasks rather than human perception.
Research on vulnerabilities in aligned AI agents with filesystem and email access; introduces ACDC for automated circuit discovery in transformers.
RAGShield: five-layer defense against knowledge base poisoning attacks in RAG systems deployed across federal agencies.
EvolveTool-Bench: diagnostic benchmark for evaluating quality of LLM-generated tool libraries as software artifacts in engineering workflows.
Research on out-of-distribution anomaly where deep models assign higher density to simple OOD data than in-distribution test data.
Domain adaptation framework for brain metastases segmentation across multiple medical institutions with different imaging protocols.
Transfer learning approach for trajectory prediction in autonomous driving handling domain shift across different regional driving patterns.
System enabling humanoid robot navigation in unseen environments using diffusion models trained on 5 hours of human walking video without robot data.
Privacy attack method using gradient-induced feature drift to infer membership in LLM training data without relying on output probabilities.
Analysis of neuron polysemy in neural networks, decomposing superposition metrics to separate lexical overlap from concept compression.
Method to mitigate object hallucination in Vision-Language Models through visual grounding using logit boosting without retraining.
Study examining how personality traits moderate effectiveness of AI-driven conversational coaching in workplace negotiation scenarios.
Technique to reduce LLM code generation latency by executing code incrementally as tokens are generated, eliminating idle waiting periods.
Vision-Language foundation model for chest X-ray interpretation providing explicit reasoning about visual evidence and diagnostic predictions.
Theoretical analysis of generalization bounds for overparameterized neural networks using distance from initialization as an explanatory factor.
Multimodal LLM approach for e-commerce product understanding that captures fine-grained attributes through reasoning-aware representation learning.
Self-supervised learning framework using masked autoencoders for 3D medical imaging, addressing domain shift from natural image pretraining.
Deep learning approach for animal activity recognition from wearable sensors, optimizing sampling rates and addressing class-specific classification accuracy.
Agentic framework combining Vision-Language Models with iterative reasoning for zero-shot 3D visual grounding from natural language descriptions.
Method for optimizing rubrics used in synthetic data generation for LLM fine-tuning, leveraging influence-guided selection in knowledge-intensive domains.
Mamba-based neural network for dental diagnosis from X-rays, unifying tooth detection, caries segmentation, anomaly detection, and developmental staging.
Multi-agent LLM system for housing consultation decisions, combining reasoning, constraint handling, and factuality guarantees beyond simple ranking.
Unified neural architecture framework studying scaling laws across attention-based, TokenMixer, and factorization-machine recommendation systems.
Method using model cascades to optimize LLM inference costs in semantic SQL queries by routing rows through fast/expensive models based on confidence.
Research on autonomous web agents navigating browser-based websites by leveraging internal APIs instead of DOM inspection, addressing architectural mismatches in agent design.
Reinforcement learning technique adding hints to overcome advantage collapse in group relative policy optimization.
Black-box security tool for detecting exploitable third-party vulnerabilities in web applications.
Study of tradeoffs between parametric knowledge in LLM pretraining and non-parametric knowledge from retrieval.
Multi-agent optimization framework addressing non-stationarity through active shared perception of agent policies.
Educational framework for assessing Scratch programming skills using fuzzy clustering aligned with CEFR levels.