Air Canada CRJ collides with fire fighting truck on landing in New York
Aviation incident report. Off-topic spam content about aircraft collision and flight radar network.
Aviation incident report. Off-topic spam content about aircraft collision and flight radar network.
Claudebox wraps Claude subscription as OpenAI-compatible API, runs Code in sandboxed Docker enabling agent capabilities. LLM application with developer tool focus.
Joy: trust network platform for AI agents enabling discovery, reputation building, and verification of agent capabilities for autonomous delegation.
Rust implementation of Mamba SSM with custom CUDA kernels for training and inference. Original ML research implementation with GPU optimization.
FedEx workforce AI training program announcement for 400k workers.
macOS menu bar app that auto-hides the Dock when windows overlap it. Native utility, not AI/ML related.
Brief news item about airport closure from aircraft incident.
News article about TSA and ICE immigration checkpoints at airports.
Personal experiment using ChatGPT and Gemini APIs to identify actors in movies via Emacs integration. Informal blog post about LLM capabilities and limitations.
Guide to building voice AI agents covering abstractions, networks, models, and evaluations. Real-world examples include debt collection, emergency services, and language-specific agents.
Curated collection of research papers on diffusion-based language models. Links to papers from 2015-2023.
Marketing content about AI-generated book on human-AI relationships priced at $1M. Promotional material without technical substance.
Open-source local evaluation framework for AI agents with cryptographic verification. Zero cloud dependencies, includes benchmarking metrics for accuracy, latency, and fairness.
Using LLMs to improve GitHub's topic tagging system for open-source projects. Limited detail provided.
Version control system for LLM/agentic reasoning state, enabling tracking and recovery of reasoning progress across multiple models and sessions.
LLM-powered code review tool using entity graphs for risk scoring, identifying critical changes in diffs with 95% recall and 5-67ms latency.
Virtual instrumentation approach using operators for sensing in environments where physical sensors cannot be placed, applied to cosmic radiation monitoring.
Medical application of classification transformers for objective evaluation of chronic pain treatment outcomes.
Data annotation pipeline using LLMs with critical thinking as both annotators and judges to improve supervised learning label quality.
Efficient RL training method for reasoning LLMs using adaptive drafting to handle long-tail response generation distribution and reduce computation time.
Cross-domain offline reinforcement learning method using dynamics and value alignment for filtering datasets to improve agent training in target environments.
ReLaX: approach addressing entropy collapse in large reasoning models by promoting latent-level exploration during reinforcement learning with verifiable rewards.
Unified framework maintaining factorized momentum states across neural network training and model merging to reduce redundant computation.
Physics-informed temporal fusion framework (TPI-AI) for lane-change intention prediction combining LSTM with physics-inspired features for autonomous driving.
Self-Distilled Reasoner: on-policy self-distillation approach for LLM reasoning that addresses distribution mismatch without teacher models.
Reinforcement Unlearning via GRPO: technique for removing sensitive data from LLMs without retraining, compliant with GDPR and EU AI Act.
Sheaf-theoretic and topological perspective on signal diffusion and attention mechanisms in graph neural networks and geometric deep learning.
Analysis of forecast uncertainty in machine learning explainability, addressing instability of LIME and SHAP near decision boundaries.
StealthRL: RL framework using group relative policy optimization to test robustness of AI-text detectors against adversarial paraphrasing attacks.
Theoretical analysis of iterative self-improvement in LLMs using reward-verified outputs with easy-to-hard curriculum learning.
Method for comparing clustering algorithms with overlapping clusters and outliers in unsupervised learning evaluation.
Spectral convolution techniques for geometric deep learning on non-Euclidean data structures like graphs and manifolds.
Interactive browser-based platform teaching federated learning concepts with real-time visualization of heterogeneous data and aggregation algorithms.
mlx-vis: GPU-accelerated dimensionality reduction library for Apple Silicon implementing 8 methods with hardware-accelerated rendering.
FEAT: linear-complexity foundation model for structured data handling heterogeneous datasets with improved attention mechanisms for large-scale applications.
Study of cone effect and modality gap in medical vision-language models, analyzing embedding concentration and cross-modal separation in supervised learning.
AcceRL: distributed asynchronous RL framework for Vision-Language-Action models with integrated trainable world models, eliminating synchronization barriers.
Difficulty-Differentiated Policy Optimization addresses Large Reasoning Models' overthinking and overconfidence by redistributing token allocation based on problem difficulty.
Hypergraph-augmented transformer network for crowd trajectory prediction using spatial-temporal interactions and group dynamics, applicable to robotics and autonomous driving.
AI framework for analyzing bodyworn camera footage to improve police accountability and government transparency at scale.
Algorithm for uniformly sampling high-dimensional convex bodies via stochastic diffusion, achieving improved runtime complexity and Rényi divergence guarantees.
Machine learning approach for learning representations to improve statistical independence testing between high-dimensional random variables with complex distributions.
Graph convolutional network for detecting ataxic gait severity from 2D video, addressing subtle pathological variations in patient movement.
Framework for assessing information security awareness in LLMs, including security knowledge, attitudes, and behavior to improve rejection of unsafe requests.
Research on transfer learning of QAOA parameters for quantum optimization on NISQ processors, focusing on layer-selective approaches for combinatorial problems.
Machine learning approach applying critical transition theory to detect early warning signals in Reddit r/place social experiment.
Genomic language model framework using phylogenetic trees and multispecies alignment for identifying evolutionarily constrained sequences.
Analysis of multi-stage LLM inference pipelines including RAG, KV cache retrieval, routing, and reasoning with optimization strategies.
Pseudo-simulation method for evaluating autonomous vehicles addressing limitations of real-world and closed-loop simulation evaluation.
Framework for multimodal representation learning through simultaneous alignment of diverse data modalities.