Multi-agentic Software Development is a Distributed Systems Problem (AGI can't save you)
Research on using choreographic languages as a formalism for describing multi-agent LLM workflow coordination, framing it as a distributed systems problem.
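The core choreographic idea behind this item can be illustrated with a toy sketch (an assumption for illustration, not code or roles from the research itself): write one global protocol among agent roles, then mechanically "project" it into a per-role program of send/recv actions. Endpoint projection is what rules out mismatched sends and receives, a classic source of distributed-systems deadlock, by construction.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Comm:
    sender: str      # role emitting the message
    receiver: str    # role consuming it
    label: str       # what is communicated (e.g. a task spec or review)

# Global choreography (hypothetical roles): planner hands a task to a coder,
# coder sends a patch to a reviewer, reviewer reports back to the planner.
choreography = [
    Comm("planner", "coder", "task_spec"),
    Comm("coder", "reviewer", "patch"),
    Comm("reviewer", "planner", "review"),
]

def project(chor: list, role: str) -> list:
    """Endpoint projection: keep only the actions visible to `role`."""
    actions = []
    for c in chor:
        if c.sender == role:
            actions.append(f"send {c.label} to {c.receiver}")
        elif c.receiver == role:
            actions.append(f"recv {c.label} from {c.sender}")
    return actions

for role in ("planner", "coder", "reviewer"):
    print(role, project(choreography, role))
```

Because every role's local program is derived from the same global description, no agent can wait on a message that nobody sends, which is the distributed-systems guarantee the title is pointing at.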
QitOS is a research-first framework for building reproducible LLM agents with clean module design, benchmarks, and built-in observability.
Research on choreographic languages for managing multi-agent LLM coordination as a distributed systems problem with new programming language design.
Business impact of AI search engines: HubSpot lost 140M visits as search behavior shifted toward AI-powered tools.
Production-grade skills framework for AI coding agents. Encodes workflows, quality gates, and engineering best practices as reusable skills activated via slash commands.
Title only with minimal metadata. No substantive content provided.
AEGIS: Scaling homomorphically encrypted transformer inference via hybrid parallelism on multi-GPU systems. Privacy-preserving ML optimization; niche application.
Circuit duplication technique for frozen vision transformer inference on marine species classification. ML optimization in an off-topic domain.
MetaSAEs: Introduces a decomposability penalty for training sparse autoencoders with atomic latents. Relevant to alignment and safety applications.
Compares RAG vs standard approaches for Agile story point estimation in sprint planning. arXiv study on LLM application.
TRACE: Study on how LLMs allocate trust between conflicting code, documentation, and tests. Evaluates trustworthiness in AI-assisted software engineering.
ExpressEdit: Photoshop plugin using diffusion models for facial expression editing. Computer vision application; off-topic.
RDFace benchmark dataset for rare disease facial phenotype analysis in children, with synthetic data generation. ML research in an off-topic domain.
Introduces vocabulary dropout technique to solve diversity collapse in co-evolutionary LLM self-play curriculum learning. arXiv paper with novel method.
LLM-powered evolutionary search automatically discovers unsupervised uncertainty quantification methods as Python programs for claim verification.
Fine-tuning approach adapting DeepSeek-OCR-2 for optical chemical structure recognition by formulating the task as image-to-text generation.
Study of brain-LLM alignment during creative divergent thinking tasks, measuring correlation between model performance and human neural activity.
VisionClaw wearable AI agent on Meta Ray-Ban glasses combining egocentric perception with speech-driven task execution via OpenClaw agents.
Sim2Real-AD framework for zero-shot sim-to-real transfer of VLM-guided RL policies from CARLA simulation to physical autonomous vehicles.
Dynamic model analyzing productivity-skill tradeoffs when workers use AI tools, decomposing productivity effects into expertise-dependent and independent channels.
Taxonomy of LLM-based coding agent architectures analyzing scaffolding code patterns including control loops, tool definitions, and context strategies.
Novel salient object detection method based on user needs rather than visual stimuli alone.
LangFIR uses sparse autoencoders on monolingual data to discover language-specific features for steering LLM output language without parallel corpora.
AgenticFlict dataset of merge conflicts from AI coding agent pull requests on GitHub, studying integration challenges in collaborative AI-assisted development.
Video diffusion framework (CRAFT) for generating synthetic bimanual robot manipulation demonstrations with temporal coherence.
Phase-aware suppression method to reduce hallucinations in Vision-Language Models without iterative optimization overhead.
SecPI framework for secure code generation using reasoning LLMs through security reasoning internalization, addressing inference-time vulnerability mitigation.
Actor-critic reinforcement learning approach for multi-robot task allocation with asymmetric arrivals and switching delays.
Neural method for black-box global optimization using iterative refinement from noisy samples, addressing multi-modal function optimization.
LLM-based approach for multi-file repository code generation with executable validation, addressing dependency resolution and integration challenges.
LiveCoder framework for repository-level code generation preserving and reusing task-specific state across multiple LLM attempts.
Generative foundation model for multimodal histopathology that imputes missing modalities from incomplete medical data.
Reinforcement learning approach for environments with delayed feedback using homomorphic state representation.
Method for stable unsupervised self-evolution of multimodal LLMs using continuous softened retracing resampling for feedback quality.
Adaptive Relational Transformer for pedestrian trajectory prediction using temporal-aware relations in robotics.
Microservice system using NLP and deep learning to automate classification of citizen appeals in government services.
Unlocks prompt infilling in masked diffusion language models by applying full-sequence masking during supervised finetuning.
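The mechanism this item names can be sketched in a few lines (a toy illustration; the paper's exact masking recipe is an assumption here): standard supervised finetuning for masked diffusion LMs masks only response tokens, so the model never learns to fill masked spans inside a prompt. Full-sequence masking instead draws mask positions from the whole sequence.

```python
import random

def mask_positions(seq_len, prompt_len, full_sequence, mask_rate=0.3, rng=None):
    """Choose which token positions to mask for one training example.

    full_sequence=False: mask only response positions (>= prompt_len),
    as in standard SFT. full_sequence=True: sample mask positions over
    the entire sequence, so prompt infilling is also trained.
    """
    rng = rng or random.Random(0)
    candidates = range(seq_len) if full_sequence else range(prompt_len, seq_len)
    return sorted(p for p in candidates if rng.random() < mask_rate)
```

With response-only masking, every masked position lies at or beyond the prompt boundary; flipping the flag is the only change needed to expose prompt tokens to the infilling objective.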
LightThinker++ enables LLMs to dynamically compress intermediate reasoning thoughts into compact representations for efficiency.
Uses LLMs to capture semantic relationships for tail-item sequential recommendation, addressing sparse interaction problem.
RDEx-CMOP is a differential evolution algorithm variant for constrained multiobjective optimization under budget constraints.
Graph learning approach for melanoma detection in dermoscopic images using graph signal processing.
Scientometric analysis of 15 years of augmented human research, examining conference evolution and core themes.
CREBench evaluates LLMs on cryptographic binary reverse engineering, assessing capabilities for vulnerability discovery and malware analysis.
Research identifying limitations in universality of linear truth directions in LLM activation spaces across different settings.
Study measuring human ability to distinguish LLM-generated news from human-written content across six LLMs.
AutoReSpec uses LLMs to generate formal specifications for programs, addressing syntax and logic errors through techniques for complex control flow.
Neuro-symbolic framework for robot manipulation using vision-language models and autonomous domain construction.
Method for discovering repeated attention patterns in large language models at scale for mechanistic interpretability.
Compares vision-language models and CNNs for spectrum management in satellite-terrestrial networks.
CountsDiff extends diffusion models to discrete ordinal data on natural numbers for generation and imputation tasks.