2d ago
llms 102 agents 78 code 67 open-source 38 products 27 infrastructure 21 tutorials 18 research 17 rag 13 training 12 inference 12 computer-vision 10 safety 6
Claude 57 OpenAI 24 GitHub 21 MCP 20 Anthropic 18 AI 17 Claude Code 16 Google 16 Docker 13 Gemini 12 Codex 9 LLM 9 Claude Code. 8 AI systems 8 YouTube 7 Claude Code, 5 DeepSeek 5 RAG 5 LangChain 5 PyTorch 4
2d ago
ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces
3d ago
ScalSelect: Scalable Training-Free Multimodal Data Selection for Efficient Visual Instruction Tuning
3d ago
ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation
3d ago
Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm
3d ago
Budget-Constrained Agentic Large Language Models: Intention-Based Planning for Costly Tool Use
3d ago
Multimodal Fact-Level Attribution for Verifiable Reasoning
3d ago
MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation
3d ago
Voxtral Realtime
3d ago
GameDevBench: Evaluating Agentic Capabilities Through Game Development
3d ago
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models
3d ago
Use A2A to connect agents across different frameworks and teams
3d ago
How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning
3d ago
Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity
4d ago
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning
4d ago
LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation
4d ago
Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens
4d ago
Andrew Ng on Vibe Coding
4d ago
Stemphonic: All-at-once Flexible Multi-stem Music Generation
Proposes method for multi-stem music generation with flexible instrument control. Relevant to ML research but outside core AI agent/LLM focus.
4d ago
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies
5d ago
χ_{0}: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies
5d ago
StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors
5d ago
Why are diffusion LLMs so fast?
6d ago
PISCO: Precise Video Instance Insertion with Sparse Control
6d ago
Dreaming in Code for Curriculum Learning in Open-Ended Worlds
6d ago
Graph-Enhanced Deep Reinforcement Learning for Multi-Objective Unrelated Parallel Machine Scheduling
Applies deep reinforcement learning with graph neural networks to optimize parallel machine scheduling. Relevant to ML research but not directly related to LLMs or AI agents.
6d ago
MemFly: On-the-Fly Memory Optimization via Information Bottleneck
8d ago
From Features to Actions: Explainability in Traditional and Agentic AI Systems
8d ago
The Biggest Mistake AI Beginners Make
9d ago
The Boring Way to Learn AI (That Actually Works)
9d ago
Sparse Video Generation Propels Real-World Beyond-the-View Vision-Language Navigation
9d ago
Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation
11d ago
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
11d ago
An image is worth NxN words | Diffusion Transformers (ViT, DiT, MMDiT)
12d ago
TIC-VLA: A Think-in-Control Vision-Language-Action Model for Robot Navigation in Dynamic Environments
12d ago
ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning
17d ago
Learn to equip AI agents with reusable skills
19d ago
Is vibe coding real coding?
19d ago
Unlock data from your files with Agentic Document Extraction
22d ago
Document AI: How Agents add the brain to OCR's eyes
23d ago
The Physics of Diffusion Models
28d ago
Text Diffusion: A new LLM paradigm
11/17/2025
I asked them to show me their RAG pipeline...
11/6/2025
Transformers & Diffusion LLMs: What's the connection?
10/6/2025
Text diffusion: A new paradigm for LLMs
8/18/2025
The physics behind diffusion models
7/14/2025
Reverse-engineering GGUF | Post-Training Quantization
6/19/2025
Training models with only 4 bits | Fully-Quantized Training
5/28/2025
The myth of 1-bit LLMs | Quantization-Aware Training
5/17/2025