Points-to-3D: Structure-Aware 3D Generation with Point Cloud Priors
Diffusion-based 3D generation framework leveraging point cloud priors as geometric constraints for improved structure-aware object synthesis.
32B-parameter Korean-language LLM optimized for enterprise reasoning, long-context understanding, and agentic workflows with domain-specific capabilities.
Watermarking method for LLM ownership protection using functional subspaces, robust against fine-tuning, quantization, and knowledge distillation.
Vision-language model enhanced with explicit spatial token generation for improved 2D/3D spatial reasoning and fine-grained grounding.
Survey of 230 computer science students on ethical implications and societal impacts of AI from a gender perspective.
Formal specification for cryptographic admission control governing autonomous agent actions in institutional B2B environments, validating identity and policy compliance.
Video reasoning model using trajectory and motion information for improved spatio-temporal inference in video understanding tasks.
Study on how AI-mediated video communication affects trust and credibility detection. Social impact of AI, limited technical content.
Case study evaluating LLM-generated lessons in Duolingo for language learning. LLM application assessment with limited technical depth.
MultihopSpatial benchmark for multi-hop spatial reasoning in Vision-Language agents. Evaluation dataset for VLA agents.
Framework proposing readiness metrics for human-AI decision-making teams beyond accuracy. Evaluation methodology for AI collaboration.
Conditional diffusion models translating MRI to PET for medical imaging. ML for healthcare, not AI/agent related.
PASTE: Pattern-Aware Speculative Tool Execution to reduce latency in LLM agent tool loops. Optimization for agentic workflows.
XKD-Dial: four-stage training pipeline for citation-grounded dialogue reducing hallucination in English-Hindi LLMs. LLM application addressing hallucination.
arXiv paper examining regulatory frameworks for agentic AI security and privacy. Policy analysis of AI agent governance.
Simulation-based inference for moment tensor inversions in seismology. ML method applied to geophysics, not AI/agent focused.
PRIOR framework for humanoid locomotion with natural gaits using Isaac Lab. ML for robotics, not core AI agent/LLM focus.
Benchmark evaluating AI agent performance on domain-specific data science tasks against human expert baselines across multiple domains.
RAG method using hypothesis-conditioned query rewriting to retrieve decision-relevant evidence for choice tasks beyond topical relevance.
Framework enabling LLM agents to recognize trusted execution environments for secure IP disclosure negotiations.
Multilingual temporal reasoning benchmark with 15K examples across 5 languages testing LLM capabilities on date arithmetic and temporal relations.
Post-hoc debiasing method for vision-language models like CLIP using sparse embedding modulation to separate bias from semantic information.
Streaming video understanding framework that decouples semantic understanding from perception for proactive query handling.
Study comparing LLM-generated analogies to human-produced ones using the geometric parallelogram model of analogical relations.
Neural solver for the multi-objective multi-agent traveling salesman problem using a conditional learning approach.
Framework for steering safety judgments in vision-language models through semantic cues without parameter changes.
RAG benchmark and framework for multilingual multi-hop question answering across multiple languages and corpora.
Federated learning system for road condition classification with defenses against poisoning attacks from malicious clients.
Reference-image driven framework for high-fidelity texturing of 3D indoor scenes.
Stock price prediction model using autoencoders and transformers with RL-based adaptive regime detection.
Internal representation debiasing framework for LLMs using graph isomorphism to remove social biases from model embeddings.
RL-based policy optimization method for improving low-resource language model performance through structural constraints on tokenization.
Multi-agent framework for vision-language navigation with probabilistic grounding of spatial references and metric constraints.
Medical imaging framework for coronary angiography combining vision-language model with reinforcement learning for vessel segmentation.
Benchmark for GPU kernel optimization spanning 235 CUDA problems from production AI models, measuring proximity to hardware efficiency limits.
Mathematical study of R-equivalence on cubic surfaces over p-adic fields.
3D object generation method using part-level decomposition with semantic grounding for text-to-3D synthesis.
Nemotron-Cascade 2 open 30B MoE model using cascade RL and multi-domain distillation achieving IMO gold-medal-level mathematical reasoning.
F2LLM-v2 multilingual embedding models (80M-14B parameters) supporting 200+ languages with emphasis on low-resource language coverage.
FinTradeBench benchmark for evaluating LLM reasoning on financial decision-making using company fundamentals and trading signals.
NavTrust benchmark evaluating robustness of embodied navigation agents (VLN and OGN) under real-world data corruptions.
Heuristic methods for constructing restricted decision diagrams to approximate Pareto frontiers in multiobjective optimization.
Framework combining machine learning with automated reasoning for generating and selecting explanations in scientific discovery tasks.
Analysis of LLM-based world models for decision-making in reasoning systems, identifying evaluation gaps and methodological issues.
Framework using LLM agents to simulate decision discourse by representing diverse stakeholder perspectives in complex problem-solving.
Manus AI general-purpose autonomous agent combining LLM reasoning with execution capabilities for complex end-to-end tasks.
Deep reinforcement learning approach for multi-objective combinatorial optimization using conditional computation and preference decomposition.
Multimodal learning framework for solving the Generalized Traveling Salesman Problem in robotic task planning.
Single-agent reinforcement learning framework for bus fleet control addressing traffic stochasticity and demand variability.
MMSearch-Plus benchmark for multimodal browsing agents requiring genuine vision-text reasoning and iterative retrieval verification.