Detecting Jailbreak Attempts in Clinical Training LLMs Through Automated Linguistic Feature Extraction
Automated linguistic feature extraction for detecting jailbreak attempts in clinical training LLMs.
PolyShapes-Ideal benchmark dataset tests topological invariance in vision models under affine transformations.
Collaborative inference framework routing vision transformer queries between edge and near-edge accelerators.
Distribution regression re-calibration for ensuring predictive uncertainty reflects empirical accuracy.
MoralityGym: benchmark for evaluating hierarchical moral alignment in sequential decision-making agents.
LAF-YOLOv10 detector for small object detection in aerial drone imagery using modified YOLOv10.
Multi-turn safety benchmark for tool-using LLM agents evaluating hierarchical risks in sequential interactions.
FUTON: Fourier tensor network for implicit neural representations using low-rank decomposition.
Protect*: neuro-symbolic method for steerable retrosynthesis controlling LLM chemical pathway generation.
Analysis of information storage in language model embeddings versus autoencoders for memory characterization.
AsyncVLA: asynchronous framework for fast robotic navigation decoupling semantic reasoning from reactive control.
Stochastic variance reduced methods for solving hierarchical variational inequalities and optimization.
Data-driven equation discovery for modeling gradient descent dynamics to accelerate optimization.
Trainable sparse attention via hybrid Top-k+Top-p masking for accelerating diffusion model inference.
LLM calibration from response-level to capability-level confidence estimation for reliable deployment.
LiveNewsBench: benchmark for evaluating LLM web search and agentic capabilities with fresh news data.
Differentiable inductive logic programming for rule learning from raw sequence data.
ReViS: multi-round agent for video question answering with selective frame sampling and early stopping.
Transformer-based mmWave beamforming for vehicular communication using multi-modal sensing.
Uncertainty-aware rollout planning for diffusion models in long-horizon PDE solving.
Parametric change-point detection in time series under local differential privacy constraints.
Intent drift detection in network management using risk scoring and machine learning.
Foundation model using in-context learning for relational databases that avoids retraining across different prediction targets.
Fine-tunes vision-language model to localize parasitic eggs in microscopic images for soil-transmitted helminth diagnostic support.
Combines Mamba state-space models with LLM reasoning to analyze dynamic fMRI functional connectivity in autistic brains.
Sparse attention mechanism with constant-time complexity for long-context LLM decoding using projection onto convex hull of keys.
Physics-Informed Neural Networks for modeling coupled electro-elastodynamic wave propagation with three-stage loss optimization.
Auto-regressive transformer for text-to-3D generation using discrete 3D tokenizer to address information loss in encoding.
Causal constraints framework using response theory and score matching for reduced-order neural emulators of turbulent dynamical systems.
Proposes evolved activation functions that account for missing data indicators and confidence scores in neural networks.
Integrates Hindsight Experience Replay into Option-Critic hierarchical RL to improve multi-goal learning in sparse reward environments.
Formulation of Ensemble-Conditional Gaussian Processes connecting ensemble methods with conditional Gaussian inference and Kalman filtering.
Framework for training neural PDE solvers on partial observations using diffusion models, avoiding need for complete observation datasets.
Fine-tunes DINOv2 Vision Transformer with LoRA for font classification, achieving 86% accuracy while training <1% of 87.2M parameters.
Calibration framework (RANSAC-P3P Gradient Descent) for aligning MoCap skeletal data with RGB camera views using human motion.
Non-asymptotic convergence rate analysis for stochastic approximation algorithms using Wasserstein distance bounds.
Hybrid vision model combining Mamba and DINO architectures for breast cancer 3-year risk prediction from imaging data.
Statistical early stopping methods for LLM reasoning that monitor uncertainty signals to prevent overthinking during generation.
Theoretical framework explaining why LLM fine-tuning requires only few epochs, combining early stopping theory with Neural Tangent Kernel analysis.
Method for compressing long LLM contexts into soft prompts via block-wise causal masking to reduce the inference latency caused by quadratic attention costs.
Research on NER algorithms (CRF, BiLSTM-CRF, transformers) for extracting structured information from payment transaction data.
Research on robust covariance estimation from heavy-tailed samples with outliers using clipped Euclidean norm approach and computable Bernstein certificates.
Theoretical analysis of iterative self-training showing tradeoffs between denoising and signal forgetting in overparameterized linear regression.
MC²Mark: distortion-free multi-bit watermarking framework for embedding long provenance identifiers in LLM-generated text.
Adaptive memory structures for LLM agents enabling context-dependent memory selection across heterogeneous interaction patterns.
Geometry-preserving aggregation for mixture-of-experts embedding models accounting for hyperspherical manifold structure of expert outputs.
GTS: learnable Gaussian thought sampler for efficient inference-time scaling in latent reasoning models with structured exploration.
GUI-GENESIS: framework for automated synthesis of efficient GUI training environments with verifiable rewards for agent post-training.
Algebraic quantum intelligence framework proposing quantum computing approach to improve creative output generation in LLMs.
DenseMLLM: framework enabling multimodal LLMs for dense prediction tasks like segmentation and depth estimation without task-specific decoders.