Exploring Single Domain Generalization of LiDAR-based Semantic Segmentation under Imperfect Labels
Domain generalization approach for LiDAR semantic segmentation handling imperfect labels and sensor noise in autonomous driving scenarios.
RECODE agentic framework using code generation and derendering for verifiable visual reasoning on structured visuals with multimodal LLMs.
RL-100 real-world robotic manipulation framework combining diffusion visuomotor policies with imitation and reinforcement learning via clipped PPO.
Personalized collaborative learning framework with affinity-based variance reduction for heterogeneous multi-agent systems.
FALCON vision-language-action model incorporating 3D spatial foundation priors to improve reasoning and generalization in embodied AI tasks.
Interpretable operator-learning ML model for reconstructing electric field distributions from EFISH signal profiles in plasma physics.
Fairness-aware LoRA fine-tuning of vision-language models for medical imaging with differentiable MaxAccGap loss for demographic parity optimization.
ELERAG system enhancing retrieval-augmented generation with entity linking to improve factual accuracy in specialized domains like education.
ADHint method integrating difficulty-aware hints into reinforcement learning post-training to improve sample efficiency and reasoning generalization.
Evaluation of spatial descriptor-based methods for decoding finger movements from electromyography signals using neural networks.
Machine learning parameter tuning for wavelet transform analysis of amorphous material atomic structure reconstruction.
Theoretical analysis of distributed optimization with multiple local updates between communication rounds, proving acceleration guarantees.
Bayesian generative modeling framework for flexible conditional inference on arbitrary partitions of observed variables without fixed conditioning structure constraints.
Robust assortment optimization from observational retail data handling unstable customer preferences and correlation shifts.
Bottleneck transformer for non-intrusive STOI speech intelligibility prediction without clean reference signals.
Missing-by-Design framework for revocable multimodal sentiment analysis enabling certified deletion of specific data modalities.
Equivariant neural networks for robust object recognition under symmetric transformations and unusual viewing conditions.
Theoretical analysis of non-rectangular robust MDPs under average-reward criterion with optimal policy characterization.
Speaker diarization system for medical conversations in noisy rural healthcare using voice activity detection and clustering algorithms.
FinTexTS dataset pairs financial time-series with semantic text data for multi-modal financial forecasting and analysis.
Analysis of performative chain-of-thought in reasoning models, using activation probing to show that models generate tokens that do not reveal their internal beliefs.
PolyBlocks: MLIR-based modular compiler infrastructure for AI programming frameworks and chips using affine analysis and analytical cost models.
VLN-Cache improves vision-language model inference efficiency for Vision-and-Language Navigation via semantic-aware token caching.
Megatron Core system optimizations for scaling Mixture-of-Experts model training across memory, communication, and computation constraints.
Covenant-72B: 72B parameter LLM trained via globally distributed, trustless peer-to-peer training over the internet without whitelisting.
Clinical feasibility study of AMIE, an LLM-based conversational AI for patient diagnostic history-taking in real-world primary care workflows.
PostTrainBench benchmarks LLM agents' ability to automate post-training of language models, extending AI agents to AI research automation.
Discussion of prompt fatigue challenges when using LLMs for coding and writing, asking community for management strategies.
ClawSoc: Open-source framework for observing and testing AI agents in multi-agent scenarios with game-theoretic interactions.
Analysis of how the divergent views of AI pioneers Bengio, Hinton, and LeCun on AI's future informed the building of the TRACE platform.
CryptoFlora: visualization tool mapping SHA-256 hashes to flower patterns using Rose curves for visual identification, with potential avatar generation use cases.
Gemini CLI agent that orchestrates Google Workspace APIs to generate polished documents, sheets, and slides from natural language input.
MCP-compatible credit optimizer reducing Manus AI token usage by 30-75% through prompt analysis and six optimization strategies.
OpenClaw plugin adding multi-mode orchestration (ask/delegate/autonomous) for Claude-based code generation with plan approval and session persistence.
Discussion on preventing runaway behavior in MCP-based agents through loop detection, tool call limits, and iteration constraints in production deployments.
IOA Core: open-source governance kernel for AI workflows with policy checks, audit trails, quorum-style review patterns, and provider-neutral execution controls.
Autonomous AI agent that optimizes control systems by independently writing code, training models, and iterating on research problems, with minimal human guidance provided via Discord.
Polaris API provides structured, real-time intelligence from 160+ news sources for AI agents to query and reason over global events without web scraping.
Open marketplace indexing 45,000+ AI agent skills with semantic search. Works with Claude Code, Cursor, Windsurf and other agents.
Python library with drop-in adapters for translating embeddings between different model vector spaces. Enables interoperability without hacks.
MCP server providing 18 structured tools for AI agents to interact with Robinhood trading platform. Compatible with Claude Code and OpenClaw.
Satirical project using LLM agents to solve FizzBuzz. Appears to be joke/parody content.
macOS utility that fixes prompt typos before sending to Claude, Codex, or Gemini. Reduces prompt noise in terminal AI sessions.
Claude Code skill that organizes problems into cross-functional teams and executes work in parallel using dependency-based waves and subagents.
Burn-after-reading image host built on Cloudflare free tier. Auto-expires after 24h with no tracking.
Discussion thread on code review practices for AI-generated code. Explores tension between natural language prompting and artifact review.
Python library for creating plugin infrastructure. Enables code to automatically hook into contexts without direct dependencies.
Multi-agent software engineering framework using contracts-first architecture. Agents implement code in parallel with mechanical test validation.
MCP server for Hacker News that enables AI agents to discover relevant stories, identify credible voices using EigenTrust propagation, and understand ranking signals.
Curly-brace syntax prompting language for AI agents. JavaScript-like syntax for structured prompts with local LLM support.