Isolater - Feed

Ax Caroline Wang, Daniel Kasenberg, Kim Stachenfeld, Pablo Samuel Castro 1d ago 90

Discovering Differences in Strategic Behavior Between Humans and LLMs

researchpaper

Ax Zhiling Yan, Dingjie Song, Zhe Fang, Yisheng Ji, Xiang Li, Quanzheng Li, Lichao Sun 1d ago 90

LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation

researchpaper

Ax Yansong Qu, Zihao Sheng, Zilin Huang, Jiancong Chen, Yuhao Luo, Tianyi Wang, Yiheng Feng, Samuel Labi, Sikai Chen 1d ago 90

Found-RL: foundation model-enhanced reinforcement learning for autonomous driving

researchpaper

Ax Jihwan Oh, Murad Aghazada, Yooju Shin, Se-Young Yun, Taehyeon Kim 1d ago 90

MERIT Feedback Elicits Better Bargaining in LLM Negotiators

researchpaper

Ax Zhenhe Cui, Huaxiang Xia, Hangjun Shen, Kailun Luo, Yong He, Wei Liang 1d ago 90

Abstraction Generation for Generalized Planning with Pretrained Large Language Models

researchpaper

Ax Bo Xue, Yunchong Song, Fanghao Shao, Xuekai Zhu, Lin Chen, Luoyi Fu, Xinbing Wang, Zhouhan Lin 1d ago 90

Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets

researchpaper

Ax Shuai Han, Mehdi Dastani, Shihan Wang 1d ago 90

Neuro-symbolic Action Masking for Deep Reinforcement Learning

researchpaper

Ax Nanxu Gong, Haotian Li, Sixun Dong, Jianxun Lian, Yanjie Fu, Xing Xie 1d ago 90

To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

researchpaper

Ax Keane Ong, Sabri Boughorbel, Luwei Xiao, Chanakya Ekbote, Wei Dai, Ao Qu, Jingyao Wu, Rui Mao, Ehsan Hoque, Erik Cambria, Gianmarco Mengaldo, Paul Pu Liang 1d ago 90

OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization

researchpaper

Ax Jie Jiang, Yangru Huang, Zeyu Wang, Changping Wang, Yuling Xiong, Jun Zhang, Huan Yu 1d ago 90

Spend Search Where It Pays: Value-Guided Structured Sampling and Optimization for Generative Recommendation

researchpaper

Ax Da-Lun Chen, Prasasthy Balasubramanian, Lauri Lov\'en, Susanna Pirttikangas, Jaakko Sauvola, Panagiotis Kostakos 1d ago 90

Integrating Generative AI-enhanced Cognitive Systems in Higher Education: From Stakeholder Perceptions to a Conceptual Framework considering the EU AI Act

researchpaper

Ax Xingyi Zhang, Yulei Ye, Kaifeng Huang, Wenhao Li, Xiangfeng Wang 1d ago 90

See, Plan, Snap: Evaluating Multimodal GUI Agents in Scratch

researchpaper

Ax Xuecheng Zou, Yu Tang, Bingbing Wang 1d ago 90

SynergyKGC: Reconciling Topological Heterogeneity in Knowledge Graph Completion via Topology-Aware Synergy

researchpaper

Ax Leheng Sheng, Wenchang Ma, Ruixin Hong, Xiang Wang, An Zhang, Tat-Seng Chua 1d ago 90

Reinforcing Chain-of-Thought Reasoning with Self-Evolving Rubrics

researchpaper

Ax F. Carichon, R. Rampa, G. Farnadi 1d ago 90

Can LLMs Cook Jamaican Couscous? A Study of Cultural Novelty in Recipe Generation

researchpaper

Ax Yusong Lin, Haiyang Wang, Shuzhe Wu, Lue Fan, Feiyang Pan, Sanyuan Zhao, Dandan Tu 1d ago 90

CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion

researchpaper

Ax Wayne Chi, Yixiong Fang, Arnav Yayavaram, Siddharth Yayavaram, Seth Karten, Qiuhong Anna Wei, Runkun Chen, Alexander Wang, Valerie Chen, Ameet Talwalkar, Chris Donahue 1d ago 90

GameDevBench: Evaluating Agentic Capabilities Through Game Development

researchpaper

Ax Jiayi Zhou, Yang Sheng, Hantao Lou, Yaodong Yang, Jie Fu 1d ago 90

FormalJudge: A Neuro-Symbolic Paradigm for Agentic Oversight

researchpaper

Ax Anjali K. Kapoor (Department of Neurosurgery, NYU Langone Health, New York, USA), Anton Alyakin (Department of Neurosurgery, NYU Langone Health, New York, USA, Global AI Frontier Lab, New York University, Brooklyn, USA, Department of Neurosurgery, Washington University in Saint Louis, Saint Louis, USA), Jin Vivian Lee (Department of Neurosurgery, NYU Langone Health, New York, USA, Global AI Frontier Lab, New York University, Brooklyn, USA, Department of Neurosurgery, Washington University in Saint Louis, Saint Louis, USA), Eunice Yang (Department of Neurosurgery, NYU Langone Health, New York, USA, Columbia University Vagelos College of Physicians and Surgeons, New York, USA), Annelene M. Schulze (Department of Neurosurgery, NYU Langone Health, New York, USA), Krithik Vishwanath (Department of Aerospace Engineering and Engineering Mechanics, University of Texas at Austin, Austin, USA), Jinseok Lee (Global AI Frontier Lab, New York University, Brooklyn, USA, Department of Biomedical Engineering, Kyung Hee University, Yongin, South Korea), Yindalon Aphinyanaphongs (Department of Population Health, NYU Langone Health, New York, USA, Division of Applied AI Technologies, NYU Langone Health, New York, USA), Howard Riina (Department of Neurosurgery, NYU Langone Health, New York, USA, Department of Radiology, NYU Langone Health, New York, USA), Jennifer A. Frontera (Department of Neurology, NYU Langone Health, New York, USA), Eric Karl Oermann (Department of Neurosurgery, NYU Langone Health, New York, USA, Global AI Frontier Lab, New York University, Brooklyn, USA, Division of Applied AI Technologies, NYU Langone Health, New York, USA, Center for Data Science, New York University, New York, USA) 1d ago 90

Large Language Models Predict Functional Outcomes after Acute Ischemic Stroke

researchpaper

Ax Eranga Bandara, Ross Gore, Sachin Shetty, Sachini Rajapakse, Isurunima Kularathna, Pramoda Karunarathna, Ravi Mukkamala, Peter Foytik, Safdar H. Bouk, Abdul Rahman, Xueping Liang, Amin Hass, Tharaka Hewa, Ng Wee Keong, Kasun De Zoysa, Aruna Withanage, Nilaan Loganathan 1d ago 90

A Practical Guide to Agentic AI Transition in Organizations

researchpaper

Ax Yukun Jiang, Yage Zhang, Xinyue Shen, Michael Backes, Yang Zhang 1d ago 90

"Humans welcome to observe": A First Look at the Agent Social Network Moltbook

researchpaper

Ax David Holtz 1d ago 90

The Anatomy of the Moltbook Social Graph

researchpaper

Ax C\'ecile Rousseau, Samuel Jackson, Rodrigo H. Ordonez-Hurtado, Nicola C. Amorisco, Tobia Boschi, George K. Holt, Andrea Loreti, Eszter Sz\'ekely, Alexander Whittle, Adriano Agnello, Stanislas Pamela, Alessandra Pascale, Robert Akers, Juan Bernabe Moreno, Sue Thorne, Mykhaylo Zayats 1d ago 90

TokaMark: A Comprehensive Benchmark for MAST Tokamak Plasma Models

researchpaper

Ax Adam AlSayyad, Kelvin Yuxiang Huang, Richik Pal 1d ago 90

AgentTrace: A Structured Logging Framework for Agent System Observability

researchpaper

Ax Zhiyu Sun, Minrui Luo, Yu Wang, Zhili Chen, Tianxing He 1d ago 90

Reverse-Engineering Model Editing on Language Models

researchpaper

Ax Leo Thomas Ramos, Angel D. Sappa 1d ago 90

Multi-encoder ConvNeXt Network with Smooth Attentional Feature Fusion for Multispectral Semantic Segmentation

researchpaper

Ax Zhihang Yi, Jian Zhao, Jiancheng Lv, Tao Wang 1d ago 90

Multimodal Information Fusion for Chart Understanding: A Survey of MLLMs -- Evolution, Limitations, and Cognitive Enhancement

researchpaper

Ax Lepeng Zhao, Zhenhua Zou, Shuo Li, Zhuotao Liu 1d ago 90

Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

researchpaper

Ax Nuno Fachada, Daniel Fernandes, Carlos M. Fernandes, Jo\~ao P. Matos-Carvalho 1d ago 90

Can Large Language Models Implement Agent-Based Models? An ODD-based Replication Study

researchpaper

Ax Jonas K\"ubler, Kailash Budhathoki, Matth\"aus Kleindessner, Xiong Zhou, Junming Yin, Ashish Khetan, George Karypis 1d ago 90

When LLMs get significantly worse: A statistical approach to detect model degradations

researchpaper

Ax Itsuki Fujisaki, Kunhao Yang 1d ago 90

Silence Routing: When Not Speaking Improves Collective Judgment

researchpaper

Ax Cau\~a Ferreira Barros, Marcos Kalinowski, Mohamad Kassab, Valdemar Vicente Graciano Neto 1d ago 90

On the Use of a Large Language Model to Support the Conduction of a Systematic Mapping Study: A Brief Report from a Practitioner's View

researchpaper

Ax Yu Yan, Sheng Sun, Shengjia Cheng, Teli Liu, Mingfeng Li, Min Liu 1d ago 90

Red-teaming the Multimodal Reasoning: Jailbreaking Vision-Language Models via Cross-modal Entanglement Attacks

researchpaper

Ax Ali Nour Eldin, Mohamed Sellami, Walid Gaaloul 1d ago 90

Exploring Semantic Labeling Strategies for Third-Party Cybersecurity Risk Assessment Questionnaires

researchpaper

Ax Yilong Dai, Shengyu Chen, Xiaowei Jia, Peyman Givi, Runlong Yu 1d ago 90

PEST: Physics-Enhanced Swin Transformer for 3D Turbulence Simulation

researchpaper

Ax Jiangong Chen, Mingyu Zhu, Bin Li 1d ago 90

PRISM-XR: Empowering Privacy-Aware XR Collaboration with Multimodal Large Language Models

researchpaper

Ax Yunpeng Tan, Qingyang Li, Mingxin Yang, Yannan Hu, Lei Zhang, Xinggong Zhang 1d ago 90

MalMoE: Mixture-of-Experts Enhanced Encrypted Malicious Traffic Detection Under Graph Drift

researchpaper

Ax Liujia Yang, Zhuo Yang, Jiaqing Xie, Yubin Wang, Ben Gao, Tianfan Fu, Xingjian Wei, Jiaxing Sun, Jiang Wu, Conghui He, Yuqiang Li, Qinying Gu 1d ago 90

NMRTrans: Structure Elucidation from Experimental NMR Spectra via Set Transformers

researchpaper

Ax Ishan Sahu, Somnath Hazra, Somak Aditya, Soumyajit Dey 1d ago 90

AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems

researchpaper

Ax Kun Wang, Zherui Li, Zhenhong Zhou, Yitong Zhang, Yan Mi, Kun Yang, Yiming Zhang, Junhao Dong, Zhongxiang Sun, Qiankun Li, Yang Liu 1d ago 90

Omni-Safety under Cross-Modality Conflict: Vulnerabilities, Dynamics Mechanisms and Efficient Alignment

researchpaper

Ax Edward Wijaya 1d ago 90

Beyond SMILES: Evaluating Agentic Systems for Drug Discovery

researchpaper

Ax Lucia Borrego, Vajira Thambawita, Marco Ciuffreda, Ines del Val, Alejandro Dominguez, Josep Munuera 1d ago 90

Anatomy-Preserving Latent Diffusion for Generation of Brain Segmentation Masks with Ischemic Infarct

researchpaper

Ax Ethan Bandasack, Vincent Bouget, Apolline Bruley, Yannis Cattan, Charlotte Claye, Matthew Corney, Julien Duquesne, Karim El Kanbi, Aziz Fouch\'e, Pierre Marschall, Francesco Strozzi 1d ago 90

EVA: Towards a universal model of the immune system

researchpaper

Ax Wentao Zhang, Jianfeng Wang, Liheng Liang, Yilei Zhao, HaiBin Wen, Zhe Zhao 1d ago 90

EvoCodeBench: A Human-Performance Benchmark for Self-Evolving LLM-Driven Coding Systems

researchpaper

Ax Md. Khairul Islam, Zeyu Xia, Ryan Goudjil, Jialu Wang, Arya Farahi, Judy Fox 1d ago 90

Cosmo3DFlow: Wavelet Flow Matching for Spatial-to-Spectral Compression in Reconstructing the Early Universe

researchpaper

Ax Tony Feng (Maggie), Trieu H. Trinh (Maggie), Garrett Bingham (Maggie), Dawsen Hwang (Maggie), Yuri Chervonyi (Maggie), Junehyuk Jung (Maggie), Joonkyung Lee (Maggie), Carlo Pagano (Maggie), Sang-hyun Kim (Maggie), Federico Pasqualotto (Maggie), Sergei Gukov (Maggie), Jonathan N. Lee (Maggie), Junsu Kim (Maggie), Kaiying Hou (Maggie), Golnaz Ghiasi (Maggie), Yi Tay (Maggie), YaGuang Li (Maggie), Chenkai Kuang (Maggie), Yuan Liu (Maggie), Hanzhao (Maggie), Lin, Evan Zheran Liu, Nigamaa Nayakanti, Xiaomeng Yang, Heng-tze Cheng, Demis Hassabis, Koray Kavukcuoglu, Quoc V. Le, Thang Luong 1d ago 90

Towards Autonomous Mathematics Research

researchpaper

Ax Jiacheng Hou, Yining Sun, Ruochong Jin, Haochen Han, Fangming Liu, Wai Kin Victor Chan, Alex Jinpeng Wang 1d ago 90

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models

researchpaper

Ax Truong Minh Huy, Edward Hirst 1d ago 90

Versor: A Geometric Sequence Architecture

researchpaper

Ax Shiting Huang, Zecheng Li, Yu Zeng, Qingnan Ren, Zhen Fang, Qisheng Su, Kou Shi, Lin Chen, Zehui Chen, Feng Zhao 1d ago 90

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

researchpaper

Ax Ivana Nikoloska 1d ago 90

Quantum Integrated Sensing and Computation with Indefinite Causal Order

researchpaper