Claude-cast – stream twitch, YouTube, and kick inside your Claude Code session
Tool for streaming Twitch/YouTube/Kick content inside Claude Code with live chat integration.
Tool for streaming Twitch/YouTube/Kick content inside Claude Code with live chat integration.
Google DeepMind research study on AI's persuasion and manipulation capabilities.
Video title only with no content or context provided.
Zinc LLM inference engine in Zig enabling 35B model inference on consumer AMD GPUs via Vulkan.
API design principles for LLM consumers. Reducing Claude's healthcare API calls from 72 to 8 through agent-focused redesign.
Title only, no content. General discussion about critical thinking with LLMs.
Web app for community location sharing using Claude Opus for text editing. Minor AI assistance, not primary focus.
Analysis of AI-generated patches passing CI tests but failing production. 20% breakage rate in vulnerability fixes.
Title only, no content. Tool for detecting LLM-generated text.
Brief mention of AMD Ryzen AI 300 processor inference capabilities without technical details.
Open source email infrastructure for AI agents. Send, receive, search, extract codes. Deploy on Cloudflare. Integrates with Claude Code and AI agent platforms.
Open source library of 450+ modular agent skills for medical research. Works with OpenClaw, Claude with scientific integrity constraints.
Open source macOS terminal multiplexer for running AI agents in parallel with notifications. Built for agent workflows.
Founder dispute over Stripe account closure for AI image/video generation platform citing payment reversal policy.
Analysis of accelerating AI tool/framework releases tracked via HN, GitHub, npm, PyPI showing ecosystem growth rate.
Philosophy preprint on mathematical methods and AI's role in mathematics formalization and human thought.
Neovim GUI for macOS using Metal GPU rendering with multi-window support and IME for CJK input.
Analysis of how AI agents integrate third-party tools into code generation and product decision workflows.
Marketing content for commercial AI image upscaler/enhancement tool with no technical details.
Website redesign benchmark comparing four AI models for generating website designs from URLs.
Empirical study showing verification steps degraded AI agent performance across 29 tests. Original experimental research.
APIEval-20 benchmark dataset for evaluating black-box API test suite generation using LLMs and schemas.
CEO used ChatGPT to terminate studio head; decision was reversed and criticized.
GPU profiling tool that diagnoses performance bottlenecks beyond utilization metrics. Minimal details provided but relevant tool.
MCP server for AI agents to select appropriate cloud services with current pricing and compatibility data. 74 services, no API key required.
Stanford research showing AI vision models generate images not in training data through hallucination mechanisms.
News about President Trump press interaction on Air Force One.
TRIBE v2: Predictive AI model of human brain responses to visual, auditory, and language stimuli from neuroscience research.
HD Audio driver for Windows 98SE/ME systems on Intel chipsets with WDM support.
R package that converts Excel workbooks to standalone R scripts with formula recreation and verification against cached values.
LLMnesia: Local-first search tool for AI conversation history across ChatGPT, Claude, Gemini, and other platforms.
Analysis of Meta's legal losses and liability implications from internal social science research on platform effects.
WhisperFlow: Free, open-source speech-to-text tool for macOS. On-device processing, no cloud upload, no account required.
arXiv research on 4D generation from natural language and images using embodied world models. Addresses data scarcity and long-horizon video generation challenges.
arXiv research proposing Balanced Fine-Tuning method for aligning LLMs with biomedical knowledge. Combines SFT and RL using confidence-weighted token optimization for scientific understanding.
arXiv research on streaming video understanding with gaze signal interpretation for AR applications. Evaluates multimodal LLMs on temporal reasoning with human attention signals.
arXiv research on multimodal memory architecture for long-form video understanding. Addresses context capacity and visual detail retention in hours-long videos using dynamic memory mechanisms.
Post-training method for lower-resource languages preserving fluency when aligned by disfluent reward models, addressing preference optimization data scarcity.
Feed-forward transformer model predicting 3D object articulations including parts, kinematic structure, and motion constraints for articulated object understanding.
Cascaded reinforcement learning infrastructure for scaling general-purpose reasoning models, addressing heterogeneity in response lengths and verification latency.
SonicMoE optimizes Mixture of Experts model inference through IO and tile-aware techniques, accelerating high-sparsity MoE architectures for language models.
Deep learning method for radio path loss prediction in multi-transmitter 5G scenarios, addressing distribution shifts and environmental generalization.
Dual-objective language model combining autoregressive and masked-diffusion training without architectural changes, improving efficiency and reducing overfitting.
Medical report generation using reinforcement learning with clinical alignment objectives, improving correctness over token-level likelihood training approaches.
Study comparing SpeechLLMs that directly process speech for translation against cascaded transcription pipelines, evaluating speech modality integration effectiveness.
Dual-State Architecture formalizes execution primitives coupling stochastic LLM generation with deterministic verification guards for reliable code generation agents.
Benchmark evaluating LiDAR 3D perception model robustness under simultaneous domain shifts and label-space evolution in autonomous driving scenarios.
Crucible system augments RAG with Q&A nuggets from documents, preserving citation provenance and improving extraction, selection, and report generation.
Study examining risks of RAG system evaluation and optimization using LLM judges, revealing circularity issues in nugget-based evaluation approaches.
CARPE method improving vision-centric capabilities of vision-language models through context-aware image representation prioritization via ensemble approach.