LB jnsgr.uk via knx 3/11/2026

Brewlog: Coffee & Agents

Personal blog about tracking coffee habits with an iOS app and building a custom data system.

Ax Cornelius Emde, Alexander Rubinstein, Anmol Goel, Ahmed Heakl, Sangdoo Yun, Seong Joon Oh, Martin Gubri 3/11/2026

MASEval: Extending Multi-Agent Evaluation from Models to Systems

MASEval: benchmark extending multi-agent evaluation beyond models to system components, comparing topologies, orchestration logic, and error handling across LLM frameworks.

Ax Yixiong Chen, Xinyi Bai, Yue Pan, Zongwei Zhou, Alan Yuille 3/11/2026

Meissa: Multi-modal Medical Agentic Intelligence

Meissa: open-source multi-modal medical agentic system combining medical image understanding with tool use and multi-agent collaboration, deployable on-premise without frontier models.

Ax Seunghwan Kim (AnsibleHealth Inc., San Francisco, USA), Tiffany H. Kung (AnsibleHealth Inc., San Francisco, USA, Stanford School of Medicine, Stanford, USA), Heena Verma (AnsibleHealth Inc., San Francisco, USA), Dilan Edirisinghe (AnsibleHealth Inc., San Francisco, USA), Kaveh Sedehi (AnsibleHealth Inc., San Francisco, USA), Johanna Alvarez (AnsibleHealth Inc., San Francisco, USA), Diane Shilling (AnsibleHealth Inc., San Francisco, USA), Audra Lisa Doyle (AnsibleHealth Inc., San Francisco, USA), Ajit Chary (AnsibleHealth Inc., San Francisco, USA), William Borden (AnsibleHealth Inc., San Francisco, USA, George Washington University, Washington, D.C., USA), Ming Jack Po (AnsibleHealth Inc., San Francisco, USA) 3/11/2026

From Days to Minutes: An Autonomous AI Agent Achieves Reliable Clinical Triage in Remote Patient Monitoring

Sentinel: autonomous AI agent for remote patient monitoring clinical triage using Model Context Protocol and 21 clinical tools, reducing manual review from days to minutes.

Ax Hajime Shimao, Warut Khern-am-nuai, Sung Joo Kim 3/11/2026

Chaotic Dynamics in Multi-LLM Deliberation

Research on stability and chaotic dynamics in multi-LLM committee systems using Lyapunov exponents to measure inter-run sensitivity across policy scenarios.

Ax Jincenzi Wu, Yuxuan Lei, Jianxun Lian, Yitian Huang, Lexin Zhou, Haotian Li, Xing Xie, Helen Meng 3/11/2026

Social-R1: Towards Human-like Social Reasoning in LLMs

Social-R1 framework enhancing social reasoning in LLMs for perceiving social cues and inferring mental states in human-AI collaboration.