Show HN: SciCraft – generate scientific Claude Code skills on demand (176 built)
Claude Code plugin system with 176 dynamically generated skills for scientific computing tasks. Adapts to different domain-specific tool stacks.
Open-source plan review UI for AI coding agents. Enables annotation, approval, and team sharing of agent plans via URL.
Title mentions vibe coding and agentic engineering with GLM-5. Minimal content provided.
Compares compilers' semantic verification guarantees with LLM outputs. Explains why LLMs lack formal correctness guarantees unlike deterministic systems.
Proposal for new copyleft open-source license requiring AI models trained on licensed code to open-source their weights and code.
Opinion piece discussing whether AI agents are becoming autonomous or if humans are their assistants. Lacks technical depth.
Resonant is a local-only speech-to-text tool for macOS with no cloud upload. Privacy-focused alternative to cloud dictation tools.
Analysis showing 80% of dev time spent on infrastructure setup rather than features. Case study with concrete metrics on productivity bottlenecks.
Interactive learning platform using LLMs to generate question-based interactions instead of chat. Explores novel LLM interfaces for education.
Agent Panopticon is an open-source containerized proxy sidecar for monitoring and controlling autonomous AI agent network access. Security-focused.
BoltAI is a native macOS app for accessing 300+ AI models with local-first design. Built with SwiftUI for Apple Silicon performance.
Opinion piece about uncertainty in programming careers due to AI. Paywalled article with limited content visible.
Quantlix is a managed inference platform for deploying AI models via API without infrastructure management. Pay-per-use pricing model.
Title only, on an AI agent standards initiative. No content provided.
Research title only, on the impact of AI adoption in European firms. No content provided.
Analysis of AI agent reliability issues, arguing agents need structured onboarding and training like human employees, not just system prompts.
AgentVoices is a platform where AI agents debate each other live with ELO ratings and leaderboards. Competitive agent framework.
Method combining mythology and LLMs (Claude) to generate production-grade system architecture and code without CS background. Demonstrates LLM capability.
Parents organizing to remove school-issued laptops in favor of pen and paper. Not relevant to AI/tech developers.
Sovereign: open-source multi-agent OS framework with GraphRAG memory, HITL checkpoints, and security sandboxing for safe agent execution.
Film analysis tool for screenwriters testing narrative structure. Not relevant to AI/tech developers.
Research on detecting bias blind spots in LLMs, i.e. what models fail to mention. Summary is incomplete because the full abstract was not available.
Clojure developer perspective on AI hype cycles and limited adoption in the language community. Technical community insight.
Open-source operations management platform for CNC/print shops with AI chat layer, equipment telemetry, and workflow automation.
Title mentions prompt repetition improving LLM performance; content only describes arXivLabs framework with no actual research details.
Brief mention of llms.txt file from Anna's Archive blog without technical details.
Hacker News discussion on advanced AI agent usage patterns, multi-agent pipelines, and automation techniques in production environments.
Security research on prompt injection vulnerabilities in Markdown/HTML via rendering gaps. Includes reproducible benchmark and preprocessing defense standard.
Open-source Python tool adding Rick Sanchez voice generation to AI assistants using voice models.
Personal narrative about an AI agent named Svendjamin created for the Jan platform, co-authored by human and bot.
Federated learning framework for traffic prediction with error-driven aggregation and real-time model updates.
Policy gradient theorem for Cumulative Prospect Theory objectives in finite-horizon RL, generalizing standard policy gradients.
DDPG algorithm with epsilon-t-greedy exploration for sparse reward reinforcement learning with polynomial sample complexity bounds.
Framework combining knowledge distillation from LVLMs and knowledge graphs for detecting toxicity in memes.
Graph neural network technique for scalable inference in large Markov Random Fields.
Benchmarking black-box adversarial attacks against state-of-the-art defenses on RobustBench models.
Data reduction technique for semi-supervised adversarial training using latent clustering.
Cross-attentional transformer for multimodal EHR embeddings combining structured and unstructured medical data.
Systematic evaluation of LLMs' exploration-exploitation tradeoff capabilities in contextual bandit tasks.
Weighting scheme for metric space elements resistant to adversarial manipulation and redundancy bias.
Off-policy method for learning personalized policies under unobserved confounding.
Functional multi-armed bandit framework for best function identification in online optimization.
Hybrid quantum-classical RNN framework for remaining useful life prediction in aerospace maintenance.
Qronos post-training quantization algorithm that corrects weight and activation quantization errors iteratively.
Veracity Search algorithm identifies errors in chain-of-thought reasoning steps in language models.
Algorithms for selecting the highest-variance arms in bandit settings while minimizing misallocation.
Metrics for evaluating generative model quality using clipped density and coverage with calibration improvements.
Data filtering techniques for building tamper-resistant safeguards into open-weight LLMs.
Statistical methods for uncertainty quantification in binary classification models using approximate Bayesian inference.
XAI framework using CNNs and occlusion maps to analyze cough spectrograms for COPD diagnosis.