NDT: Non-Differential Transformer and Its Application to Sentiment Analysis
Non-Differential Transformer architecture for improved sentiment analysis in text, inspired by Differential Transformer variants.
Non-Differential Transformer architecture for improved sentiment analysis in text, inspired by Differential Transformer variants.
RoboECC edge-cloud collaborative deployment framework for Vision-Language-Action models enabling real-time embodied AI inference.
Multi-RF Fusion ensemble method achieving top OGB leaderboard rank for molecular property prediction via Random Forests and GNNs.
Evaluation and solution for improving general QA performance of LLMs trained with reinforcement learning from verifiable rewards.
Study of visual representation degradation in MLLMs with predictive regularization technique to preserve visual competence during language training.
Compass framework for optimizing compound AI workflows across multiple specialized models with dynamic adaptation under varying loads.
Dodgersort system for efficient pairwise ranking via CLIP-based pre-ordering, neural ranking, and uncertainty-aware pair selection.
HiCI hierarchical attention module for long-context language modeling with segment-level and global context integration.
Lightweight ensemble approach for white blood cell classification addressing class imbalance in medical imaging.
Time-series forecasting method for financial markets handling delayed/stale observations via residual latency-aware mixing.
RubricRAG system for interpretable LLM evaluation via domain knowledge retrieval to generate detailed rubrics instead of scalar scores.
Framework for jointly learning state, dynamics, and filtering algorithm parameters in data assimilation via auto-differentiable filtering.
Router mechanism for LLMs using internal prefill activations and Encoder-Target Decoupling to route queries to best-performing models.
Benchmark study of deep learning model efficiency comparing Conv6, VGG16, and other architectures under computational constraints.
Machine learning approach for adaptive rate allocation in 5G+ networks handling rapid wireless link quality fluctuations in mobile environments.
Framework applying Active Inference and Free Energy Principle to physical AI agents and robots operating under resource constraints in real-world environments.
Theoretical analysis of coordinate ascent variational inference stability differences in sequential vs parallel variants for high-dimensional linear regression.
MOELIGA multi-objective evolutionary algorithm for feature selection balancing subset size and classification accuracy.
DiscoUQ framework extracts semantic structure from disagreement patterns in multi-agent LLM ensembles for improved uncertainty quantification.
Intelligent Disobedience Game formalizes shared autonomy scenarios where systems must override human instructions for safety using game theory.
ALL-FEM framework fine-tunes LLMs to automatically generate and analyze finite element method code for engineering simulations.
Framework integrating LLMs into Pepper robots with low-latency multimodal interaction, enabling speech processing and agentic control capabilities.
Permutation-Aware GRPO method reducing selection bias in LLMs during multiple-choice evaluation by training models to produce consistent answers across option permutations.
DSL-R1 framework training retrieval agents via reinforcement learning to bridge structured metadata and unstructured content using domain-specific language.
Knowledge Boundary Discovery framework using reinforcement learning to systematically map what LLMs can and cannot answer reliably.
TabPFN transformer model applied to geotechnical site characterization using sparse borehole data for uncertainty quantification and interpretability.
Statistical learning framework for latent embedding alignment in brain encoding/decoding with limited fMRI data.
Improved sample complexity bounds for training over-parameterized neural networks to learn low-degree spherical polynomials.
ViCLSR: contrastive learning framework improving Vietnamese NLU with limited annotated data through supervised representation learning.
Joint source and RIS-assisted channel encoding for multi-user semantic communications using DNNs for feature extraction.
Free Sinewich: parameter-efficient multi-task learning framework enabling low-cost weight modulation via frequency switching.
Personalized XML document retrieval integrating domain ontologies and user profiles with semantic resources.
Functional Gaussian Process regression using Empirical Bayes for spatiotemporal random fields on manifolds.
Neuro-symbolic framework for self-healing resilience in edge computing environments spanning cloud to edge devices.
Addresses scaling failure in AlphaZero-style tree search for LLMs using Gumbel sampling and sequential halving for budget-efficient reasoning.
Architectural framework for multi-UAV autonomous precision agriculture systems with algorithm abstractions.
JANUS framework for adversarial jailbreak attacks on text-to-image models using lightweight distribution optimization without RL.
Landmark-constrained algorithm to accelerate Vector Diffusion Maps framework for manifold learning on complex datasets.
Formal analysis proving that AI agents with indexed external memory achieve exponential speedup in retrieval cost versus sequential scanning, advancing agentic reasoning.
Closed-form analytical solution for conditional diffusion models in data assimilation, leveraging tractable score functions instead of neural networks.
Analysis of noise robustness in variational quantum classifiers based on entropy and transpilation depth.
HELIX: Hybrid Mamba-Attention framework for raw audio understanding with benchmarking beyond quadratic limits.
FinRL-X: Modular open-source framework for quantitative trading with unified research-to-deployment pipeline.
TimeTox: LLM-based pipeline using Gemini for automated time toxicity extraction from clinical trial protocols.
Generalized Discrete Diffusion from Snapshots framework supporting arbitrary noising processes over discrete state spaces.
HamVision: Medical image analysis framework using Hamiltonian dynamics as inductive bias for segmentation.
Comprehensive efficiency comparison of 16 language models across NLP tasks with novel Performance-Efficiency Ratio metric.
Analysis of instruction-tuned LLM failure modes showing error detection gaps across architectures.
Comparative study of PEFT and quantization techniques for fine-tuning BERTimbau on Portuguese QA tasks.
DRTriton: Synthetic data reinforcement learning pipeline for automatic CUDA kernel generation from PyTorch.