Enhancing Delta Compression in LLMs via SVD-based Quantization Error Minimization
SVD-based quantization for compressing fine-tuned LLM delta parameters with minimized quantization error for storage efficiency.
SVD-based quantization for compressing fine-tuned LLM delta parameters with minimized quantization error for storage efficiency.
HYPER foundation model for inductive link prediction with knowledge hypergraphs handling novel entities and relation types.
Novel safety measure for LLMs measuring generations needed to trigger unsafe responses, with calibrated predictive bounds for evaluation.
NeuronSeek uses symbolic regression to discover optimal neuron formulations and construct task-driven neural networks.
Vision Transformers for climate downscaling using shared encoder and multi-decoder architecture to emulate regional climate models.
Studies optimal ordering of intermediate steps in chain-of-thought reasoning for arithmetic tasks, improving learning efficiency.
Weighted policy optimization method for reinforcement learning in diffusion-based LLMs, addressing intractable likelihood approximation.
Theoretical framework distinguishing inherently sequential problems that cannot be efficiently parallelized, relevant to LLM reasoning.
Spatial disaggregation clustering method for large-scale urban morphology mapping using Earth observation data.
Benchmark dataset for evaluating LLM reasoning on molecular properties at functional group level for chemistry applications.
Investigates spontaneous deception in LLMs on benign prompts, revealing trustworthiness risks in reasoning and planning tasks.
Deep learning architecture for lightning occurrence prediction using uncertainty-aware loss function.
Inference-time method for balancing multiple conflicting objectives in LLM outputs without expensive per-objective fine-tuning.
Token order prediction auxiliary objective improves language model performance, offering alternative to multi-token prediction for next-token training.
Adaptive resampling method for imbalanced classification that adjusts training data distribution based on class-wise learning difficulty.
Online policy-iteration RL framework using sparse Gaussian mixture model Q-functions with interpretable exploration mechanisms.
Online reinforcement learning method for diffusion models addressing intractable likelihoods, enabling RLHF-style training without solver restrictions.
Analyzes scaling laws for loss dynamics and learning rate schedules in SGD on kernel regression, with implications for LLM training.
Theoretical study on estimating inverse temperature parameters in truncated Ising models using single samples.
Demonstrates that activation steering for LLM control can compromise safety mechanisms, causing models to comply with harmful requests.
8-bit quantization technique for Muon optimizer states in LLM pre-training, reducing memory overhead while maintaining training efficiency.
Studies how curvature approximations (GGN, K-FAC) in influence functions affect data attribution accuracy for deep learning models.
Method to trace evolutionary relationships between LLMs through functional representations, enabling better model management and understanding of fine-tuning/distillation lineages.
OpenTSLM: time-series language models integrating multivariate medical time-series as native modality. Enables LLMs to handle temporal clinical data.
arXiv paper: Visual Autoregressive models reinterpreted as Laplacian latent pyramid with learned coarse-to-fine refinement. Formal analysis of design trade-offs.
Research on optimal placement of PDE diffusion layers in transformer architectures using heat equation-based smoothing for local geometric priors.
Legacy SciTech SNAP Graphics device driver codebase released as open source.
Zero-code desktop app for fine-tuning LLMs locally on Apple Silicon with full pipeline from documents to Ollama export.
Self-hosted low-latency streaming application using WebRTC for sub-second broadcast.
Privacy-focused desktop app for local Stable Diffusion image generation with HuggingFace model support.
DeepMind's cryptographic delegation protocol for multi-agent systems with accountability chains and MCP integration.
Retrospective on infrastructure decisions at startup including cloud provider choice, database selection, and tradeoffs.
Research showing LLM-generated skills provide no benefit; models cannot reliably author procedural knowledge they consume.
Manus AI launches persistent AI agents with Telegram integration; account suspended shortly after launch. Platform expansion planned via WhatsApp.
Vibe-coded Asmongold simulator game with author expressing skepticism about AI-based code generation.
Evaluation framework for coding agents detecting overlap, boundary violations, and coverage gaps via static and live analysis.
MCP server providing Claude real-time access to trading signals from Reddit, SEC filings, FDA approvals, and Congressional trades.
MCP server enabling AI agents to discover, install, and learn new tools automatically without restarts or manual configuration.
Constrained DSL for generating reliable LLM decision logic with schema-driven prompts and deterministic execution for quantitative tasks.
React library for ASCII animations that converts video to character grids with performance optimization.
Minimal title-only entry about distributed P2P network for AI inference without substantive content.
Meta patent allowing LLM simulation of deceased users' social media activity for continued posting.
Marketing content for generative sprite creation tool for game developers, claiming to replace artist jobs.
Opinion on India's AI policy summit with focus on governance and homelessness concerns.
Andrej Karpathy explores training language models competitive with GPT-2 for under $100, analyzing cost-effectiveness improvements since 2019.
Open-source governance proxy in Rust for controlling AI agent access to tools, providing audit trails and content moderation via MCP.
Open-source web scraper optimized for LLM consumption, cleaning HTML noise for agent accessibility to web content.
GitHub repository settings update allowing maintainers to disable or restrict pull requests. Not AI/ML related.
GrowthClaw is an open-source marketing operating system for agent-driven workflows, converting goals into task pipelines with human approval and evaluation.
Opinion piece critiquing AI's impact on open source, discussing hallucinations, agent harassment, and OpenAI recruitment.