Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation
Addresses gradient staleness in asynchronous pipeline parallelism for distributed training via a basis rotation technique.
EvoMAS uses evolutionary methods to automatically generate LLM-based multi-agent system architectures, addressing brittleness and generalization challenges in MAS design.
Novel adaptive optimization algorithm extending Adam-style methods to matrix operations for large-scale training.
ML method for discovering event patterns in clinical time series data using attention mechanisms with timing awareness.
Research on fairness and privacy in ML for human-centric tasks using worst-case group optimization and differential privacy techniques.
Training-free guidance for continuous diffusion language models to satisfy formal syntax constraints. Enables JSON/structured output generation without retraining.
Multi-domain graph pre-training with domain-specific experts for homogeneous and heterogeneous graphs. Unified approach handling mixed graph types across distribution shifts.
Analysis of attention head singular vectors aligning with learned features in language models. Provides theoretical justification for mechanistic interpretability observations.
Fast KV cache compaction using attention matching. Reduces key-value cache size for long-context LLM inference while maintaining performance.
Structural theory explaining position bias and Lost-in-the-Middle phenomenon in Transformers. Analyzes causal attention architecture origins of token position bias.
InfoNoise adaptive noise scheduling for diffusion model training. Data-adaptive allocation based on conditional entropy to optimize denoising difficulty.
Neural evolution approach for antibody engineering using phylogenetic models. Leverages affinity maturation data to model evolutionary fitness landscape.
Sparse scheduled diffusion guidance for Bayesian inverse problems. Reduces computational cost by applying guidance selectively through reverse trajectory.
RLVR method addressing calibration degradation in LLM reasoning. Decouples confidence from reasoning to prevent overconfidence on incorrect answers.
Self-supervised learning for wearable accelerometer data using biological structure tokenization. Improves human activity recognition with limited labeled data.
TreeKD method distilling tree-based model knowledge into LLMs for molecular property prediction. Improves LLM performance on drug discovery tasks.
Hybrid-Order Split Federated Learning reducing memory usage on edge devices. Combines zeroth-order optimization with split learning for faster convergence.
Transformer variants for financial time-series forecasting using knowledge distillation. Addresses non-stationary data and regime shifts in financial markets.
Framework for discovering and inferring dynamic causal relationships in time-series neural networks without requiring known causal structure a priori.
Research on LLM pretraining convergence: investigating whether models converge to common minima across data sources to improve downstream generalization.
SaFeR-Steer framework for multi-turn safety alignment in multimodal LLMs using synthetic data bootstrapping and feedback dynamics to address long-context safety degradation.
Security vulnerability discovered in Starlette Python package used by LLM software.
Safescript language design motivation: preventing supply chain attacks and hidden logic in AI-generated code via static analysis.
Opinion piece on future job roles managing and coordinating AI systems.
AIPass platform for persistent multi-agent workspaces with shared memory, context, and collaboration between agents.
Teleport-env: <500ms OS-level stateful rollback sandbox for autonomous coding agents using CRIU snapshots for MCTS and RL.
Meta testing paid subscriptions for AI features on Meta AI app and website.
Zig programming language 2026 roadmap includes no-AI policy, foundation funding, and GitHub departure.
Workplace discussion about identifying AI-generated content in internal communications.
Research paper on protein biology language models developing implicit world models for biological prediction and understanding.
Illinois legislature passes SB 315 requiring third-party safety audits of frontier AI labs.
Video on access control and authorization mechanisms for AI systems.
Markdown-first list management app for iOS and CLI. Open source project.
Uvilox AI offers real-time sign language interpretation with <80ms latency using vision AI models.
AgingBench research on long-context memory degradation in AI agents. Studies information loss and retrieval problems over time.
LLM INQUISITOR: methodology for evaluating AI systems in real workflows, not benchmarks. Tests stability and reliability.
AGH: open network protocol for AI agents. Enables durable CLI sessions with memory, tools, autonomy on NATS-based channels.
Paper formalizing emergent behavioral patterns in sustained human-AI interaction. Introduces 'third vector' concept in response space.
Chrome extension providing unified prompt management interface across multiple AI platforms.
Empirical study measuring jailbreak vulnerability rates across 15 frontier LLMs including Grok (88%) and Claude (12%).
Analysis of differences between AI infrastructure requirements and traditional cloud infrastructure design.
Security incident: malware developer attempted to steal Claude API credentials but leaked their own GitHub token.
Argonne National Lab uses supercomputing resources to build private AI inference service.
Study examining AI model biases and responses toward religious content, particularly Jehovah's Witnesses.
Framework analyzing five pillars of AI agent accountability: traceability, authorization, identity, policy, and oversight.
Using Claude API to extract data from 1997 football manager game.
VS Code extension for Swift/iOS development integrating xcodebuild, swift-format, and language server protocol.
Meta launches paid subscriptions for Facebook, WhatsApp, Instagram, and Meta AI.
VAEN open-source CLI packages AI coding-agent harnesses with skills and MCP servers as portable .agent files.
Assessment platform intercepting Claude Code requests to prevent AI from over-solving coding interview problems.