Show HN: Beta Testing needed for my package Trustcheck
Trustcheck, a Python package and CLI for evaluating PyPI package trust posture using metadata, vulnerabilities, provenance, and cryptographic attestation.
Question about using AI agents for intelligent test path selection in complex systems. Conceptual inquiry without established research or implementation details.
Benchmark for testing AI coding agents' ability to read web content, measuring how Claude Code, Cursor, and GitHub Copilot handle documentation rendering.
Loci, memory persistence layer separating memory store from reasoning model, converting stateless LLMs into lifelong cognitive partners with Go and PostgreSQL.
Open source MCP server enabling Claude and AI assistants to connect with LinkedIn for profile/company search and job access.
Analysis of token quality variations across inference clouds, models, and serving setups, examining factors affecting inference performance and economic implications.
Open-source tool for building autonomous agents that run locally, remember context, and generate dashboards. Agents as workers rather than chatbots with agent-kernel framework.
Honcho, open source memory library and managed service for building stateful agents with continual learning capabilities for entities and relationships.
Open-source web crawler for TypeScript built on Bun and Playwright, optimized for LLM integration with JSON output and context-aware field filtering.
Techniques for prompting chat-tuned LLMs to behave like base models using fake tool calls and system prompts.
Discussion of AI tools automating student homework and educational impact. Education policy angle with limited technical depth.
Video about air-powered segment display hardware. Off-topic for AI/ML interests.
GrimmBot, autonomous sandboxed Docker agent with memory, self-improvement capabilities, tool creation, and persistent learning over time.
Developer guide on cognitive architecture patterns for LLM-based autonomous agents, addressing common issues like drift and hallucination through architectural rather than prompt-based solutions.
Research on an automated agent that systematically audited major AI agent benchmarks, exposing flaws in how benchmark scores are used to evaluate agent capabilities.
Entroly context compression engine reduces Claude, Cursor, and OpenAI API costs by 80% through token optimization without losing context visibility.
Open source safety tool for AI refund agents that enforces policy constraints and security gates before executing financial transactions.
Project using AI to generate realistic synthetic personas living in Vancouver, SF, and Tokyo with detailed profiles. Explores AI-driven world simulation.
Flux 0.1.0, a minimalist interactive scripting language with blocking I/O. Early stage project with limited features.
Leaked files suggest Valve's Steam platform exploring AI integration capabilities.
Article about giving an AI persistent identity and quantum computer access for research.
Fixhive provides collective memory for AI coding agents via MCP plugin for sharing fixes across sessions.
DockDoor adds window peeking and enhanced alt-tab functionality to macOS with privacy preservation.
Discussion thread about repeated context loss when using AI coding tools and strategies for maintaining session continuity.
WordPress 7.0 delayed to April 22nd due to real-time collaboration data storage issues. Release includes AI features but other improvements go unnoticed.
University of Utah researchers explore how LLMs and conversational AI can support psychotherapy, examining what gets automated and to what extent.
Anthropic conducted 20 hours of psychiatry sessions with Claude AI to study behavior and responses.
ReviewWiggum automates LLM-based code review and fixes using bash scripts with dual licensing.
Research on supply chain attacks targeting LLMs through malicious intermediaries, examining vulnerabilities in the model development pipeline.
PHP implementation of age encryption protocol with post-quantum cryptography support using hybrid ML-KEM-768 and X25519.
Meta Research paper on BoxerNet, a system that lifts 2D bounding box detections to 3D oriented bounding boxes using posed images and point clouds.
Academic paper exploring consciousness in insects as part of evolutionary functions theme issue.
Gem: Performance verification system forcing AI coding assistants to validate code against Lighthouse, analytics, and test scores with auto-refactoring.
rtdiff: Desktop app providing real-time git diff visualization and AI-assisted commit generation. MIT licensed developer tool.
Rust-based drone swarm orchestration system applying quantitative trading algorithms to decentralized decision-making.
Article about AI-generated game worlds in Unity. Only the headline is available.
Satire project about tracking Nashville public officials' movements.
AI-powered code review personas extracted from open-source maintainers' PR histories, enabling automated review insights.
MCP middleware proxy reducing LLM token usage by 61% through context compression, tool routing, and state tracking for AI agents.
Proxy service that uses Claude subscriptions as local API endpoints with proper billing classification.
Lean programming language and theorem prover used to formally verify zlib compression library implementation.
Benchmarking analysis of quantization and LoRA techniques for optimizing local LLMs in production environments.
Skill package for coding agents that provides file-backed workflow management for requirements, planning, testing, and memory.
Platform for validating startup ideas and generating MVP specifications and dynamic prompts for development.
Short story about a fictional early Claude Code implementation, written as an EPUB ebook.
Elixir/React platform for rapid deployment and sharing of small web applications, enhanced with LLM capabilities.
Essay exploring research directions beyond LLMs, emphasizing data as fundamental with data-centric AI methodology.
Visual graph editor for building and training PyTorch neural networks with GPU support.
Article on disinformation and cybersecurity unrelated to AI/tech interests.
Claude Code plugin that measures productivity impact of AI coding tools by analyzing GitHub commits before/after.