Isolater - Feed

Ax Xinyu Zhou, Yinfeng Yu 29d ago

Audio Spatially-Guided Fusion for Audio-Visual Navigation

Audio-visual navigation system for autonomous agents to localize and navigate toward vocalizing targets in 3D environments.

Ax Xuejian Zhang, Ruisi He, Minseok Kim, Inocent Calist, Mi Yang, Ziyi Qi 29d ago

Environment-Aware Channel Prediction for Vehicular Communications: A Multimodal Visual Feature Fusion Framework

Deep learning framework for predicting wireless channel characteristics in vehicular 6G communications using visual feature fusion.

Ax Anderson Augusma (UGA, LIG, M-PSI), Dominique Vaufreydaz (LIG, M-PSI), Fédérique Letué (SVH) 29d ago

Variational Encoder–Multi-Decoder (VE-MD) for Privacy-by-functional-design (Group) Emotion Recognition

Privacy-preserving group emotion recognition model using variational encoder-multi-decoder architecture without per-person feature extraction.

Ax Scott Piersall, Yang Gao, Shenyang Liu, Liqiang Wang 29d ago

Improving MPI Error Detection and Repair with Large Language Models and Bug References

Approach using LLMs to detect and repair errors in MPI code for high-performance computing and distributed training frameworks.

Ax Yuchen Guo, Junli Gong, Hongmin Cai, Yiu-ming Cheung, Weifeng Su 29d ago

LumiVideo: An Intelligent Agentic System for Video Color Grading

LumiVideo agentic system mimicking professional video colorists' workflows with interpretable iterative control for automated color grading.

Ax Zachary Bogorad, Ibrahim Elsharkawy, Yonatan Kahn, Andrew J. Larkoski, Noam Levi 29d ago

Generative models on phase space

Research on deep generative models (diffusion, flow matching) for high-dimensional distributions on constrained submanifolds in physics data.

Ax Timothy Gould, Sidike Paheding 29d ago

Self-Directed Task Identification

Self-Directed Task Identification framework enabling models to autonomously identify target variables in zero-shot learning without pre-training.

Ax Kevin Song 29d ago

PlayGen-MoG: Framework for Diverse Multi-Agent Play Generation via Mixture-of-Gaussians Trajectory Prediction

Framework using Mixture-of-Gaussians trajectory prediction for diverse multi-agent play generation in team sports.

Ax Shramana Dey, Zahir Khan, T. A. PramodKumar, B. Uma Shankar, Ashis K. Dhara, Ramachandran Rajalakshmi, Rajiv Raman, Sushmita Mitra 29d ago

Managing Diabetic Retinopathy with Deep Learning: A Data Centric Overview

Survey of deep learning approaches for diabetic retinopathy detection addressing dataset limitations and geographic diversity issues.

Ax Aaditya Naik, Guruprerana Shabadi, Rajeev Alur, Mayur Naik 29d ago

Do We Need Frontier Models to Verify Mathematical Proofs?

Research investigating whether frontier reasoning models are necessary for mathematical proof verification versus smaller LLM judges.

Ax Nishit Asnani, Rohan Badlani 29d ago

Skeleton-based Coherence Modeling in Narratives

NLP research on skeleton-based coherence modeling for narrative generation and detection of incoherent story structures.

Ax Zonghan Li, Feng Ji 29d ago

When simulations look right but causal effects go wrong: Large language models as behavioral simulators

Empirical evaluation of LLMs as behavioral simulators for predicting intervention effects across 11 climate-psychology interventions using 59,508 participants.

Ax Jun-Sik Yoo 29d ago

On the Geometric Structure of Layer Updates in Deep Language Models

Research studying geometric structure of layer-wise updates in deep language models across Transformer and state-space architectures.

Ax Mengtian Li, Yuwei Lu, Feifei Li, Chenqi Gan, Zhifeng Xie, Xi Wang 29d ago

VERTIGO: Visual Preference Optimization for Cinematic Camera Trajectory Generation

VERTIGO system for cinematic camera trajectory generation with visual preference optimization for realistic shot composition.

Ax Haodong Xie, Yujun Cai, Rahul Singh Maharjan, Yiwei Wang, Federico Tavella, Angelo Cangelosi 29d ago

Hierarchical, Interpretable, Label-Free Concept Bottleneck Model

Hierarchical, interpretable, label-free concept bottleneck model (CBM) that provides interpretability at multiple abstraction levels, unlike existing single-level CBMs.

Ax Valeria Martin, K. Brent Venable, Derek Morgan 29d ago

Generating Satellite Imagery Data for Wildfire Detection through Mask-Conditioned Generative AI

Diffusion-based foundation model generates synthetic satellite imagery for wildfire detection without task-specific retraining.

Ax Kiran Yalamanchi, Shivam Barwey, Ibrahim Jarrah, Pinaki Pal 29d ago

A Multimodal Vision Transformer-based Modeling Framework for Prediction of Fluid Flows in Energy Systems

Transformer-based framework using Vision Transformer for predicting fluid flows in energy systems, applied to gas injection phenomena.

Ax Samita Bai, Hamed Jelodar, Tochukwu Emmanuel Nwankwo, Parisa Hamedi, Mohammad Meymani, Roozbeh Razavi-Far, Ali A. Ghorbani 29d ago

Automated Malware Family Classification using Weighted Hierarchical Ensembles of Large Language Models

Zero-shot malware family classification using weighted hierarchical ensembles of LLMs, avoiding the need for labeled datasets and handcrafted features.

Ax Joong Ho Choi, Jiayang Zhao, Avani Appalla, Himansh Mukesh, Dhwanil Vasani, Boyi Qian 29d ago

Token-Efficient Multimodal Reasoning via Image Prompt Packaging

Image Prompt Packaging method to reduce token costs in multimodal LLMs by embedding structured text into images, benchmarked across frontier models.

Ax Md. Sajeebul Islam Sk., Md. Mehedi Hasan Shawon, Md. Golam Rabiul Alam 29d ago

An Explainable Vision-Language Model Framework with Adaptive PID-Tversky Loss for Lumbar Spinal Stenosis Diagnosis

Vision-language model for lumbar spinal stenosis diagnosis from MRI with adaptive loss function for class imbalance handling.

Ax Roland Mühlenbernd 29d ago

Social Meaning in Large Language Models: Structure, Magnitude, and Pragmatic Prompting

Study of social meaning in LLMs, introducing calibration metrics and pragmatic prompting strategies to improve quantitative approximation of human reasoning.

Ax Rushabha Balaji, Kuan-Lin Chen, Danijela Cabric, Bhaskar D. Rao 29d ago

Sparse Bayesian Learning Algorithms Revisited: From Learning Majorizers to Structured Algorithmic Learning using Neural Networks

Unified framework for deriving sparse Bayesian learning algorithms using neural networks and majorizer learning.

Ax Darya Kaviani, Alp Eren Ozdarendeli, Jinhao Zhu, Yu Ding, Raluca Ada Popa 29d ago

Opal: Private Memory for Personal AI

System for private long-term memory in personal AI using trusted hardware and oblivious RAM to hide data access patterns from providers.

Ax Adam Bayley, Xiaodan Zhu, Raquel Aoki, Yanshuai Cao, Kevin H. Wilson 29d ago

Jump Start or False Start? A Theoretical and Empirical Evaluation of LLM-initialized Bandits

Theoretical and empirical evaluation of using LLM-generated preferences to warm-start contextual bandits, examining alignment with actual user preferences.

Ax Kamalasankari Subramaniakuppusamy, Jugal Gajjar 29d ago

Feature Attribution Stability Suite: How Stable Are Post-Hoc Attributions?

Analysis of the stability of post-hoc feature attribution methods for vision systems under input perturbations, introducing an evaluation suite.

Ax Murtuza Shahzad, Joseph Wilson, Ibrahim Al Azher, Hamed Alhoori, Mona Rahimi 29d ago

From Theory to Practice: Code Generation Using LLMs for CAPEC and CWE Frameworks

LLM-based code generation for security vulnerabilities using CAPEC and CWE frameworks, addressing gaps in existing vulnerability datasets.

Ax Lingjun Zhao, Dayeon Ki, Marine Carpuat, Hal Daumé III 29d ago

Pragmatics Meets Culture: Culturally-adapted Artwork Description Generation and Evaluation

Study of cultural bias in LLM text generation, introducing the task of generating culturally-adapted artwork descriptions for different audience groups.

Ax Jackson G. Lu, Gerui Gloria Zhao, Anna Manyi Zheng 29d ago

Generative AI Use in Entrepreneurship: An Integrative Review and an Empowerment-Entrapment Framework

Integrative review of generative AI impact on entrepreneurship across opportunity recognition, evaluation, resource assembly, and venture launch stages.

Ax John T. Halloran 29d ago

Understanding the Effects of Safety Unalignment on Large Language Models

Research on safety alignment vulnerabilities in LLMs, examining jailbreak-tuning and weight orthogonalization methods that can disable safety guardrails.

Ax Sahaj Singh Maini, Robert L. Goldstone, Zoran Tiganj 29d ago

High Volatility and Action Bias Distinguish LLMs from Humans in Group Coordination

Comparative study of LLM vs human coordination in group games, revealing volatility and action bias differences in adaptive strategies.

Ax Ethan Reid 29d ago

Moondream Segmentation: From Words to Masks

Vision-language model extension for referring image segmentation using autoregressive decoding and reinforcement learning refinement.

Ax Hita Kambhamettu, Will Crichton, Sean Welleck, Harrison Goldstein, Andrew Head 29d ago

Making Written Theorems Explorable by Grounding Them in Formal Representations

System grounding LLM-generated explanations in formal representations to enable interactive exploration of mathematical proofs.

Ax Hita Kambhamettu, Bhavana Dalvi Mishra, Andrew Head, Jonathan Bragg, Aakanksha Naik, Joseph Chee Chang, Pao Siangliulue 29d ago

LitPivot: Developing Well-Situated Research Ideas Through Dynamic Contextualization and Critique within the Literature Landscape

Tool for developing research ideas through dynamic literature contextualization and critique using LLMs.

Ax Wei Zou, Mingwen Dong, Miguel Romero Calvo, Wei Zou, Shuaichen Chang, Jiang Guo, Dongkyu Lee, Xing Niu, Xiaofei Ma, Yanjun Qi, Jiarong Jiang 29d ago