SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
Autoregressive transformer approach for component-based colored SVG generation from text descriptions.
Autoregressive transformer approach for component-based colored SVG generation from text descriptions.
Dataset for hierarchical KPI extraction from earnings filings using iXBRL structured financial documents.
LLM-based index advisor for database optimization using in-context learning to iteratively refine index recommendations.
Equilibrium finding algorithms in polymatrix games under differential privacy constraints with hardness results.
Survey of AI-based detection and mitigation methods for DDoS attacks with taxonomy of attack categories.
Ensemble of language models for automated tumor group classification from unstructured pathology reports in cancer registries.
Federated learning system balancing privacy-utility tradeoffs with incentive mechanisms and heterogeneous resource accommodation across organizations.
Tractable description logic with categorical semantics for biomedical ontologies supporting negative knowledge representation.
Weighted gradient-based adversarial attacks on 3D point cloud classifiers with improved imperceptibility through point-wise perturbation adjustment.
Online fair allocation of indivisible goods with sequential arrival, analyzing fairness guarantees with access to future information.
Rank-based uniformity test for detecting undisclosed substitutions or quantization of black-box LLM APIs without access to model weights.
Statistical methods for fairness testing in algorithmic decision-making systems accounting for sampling error and demographic subgroups.
Systematic review combined with Monte Carlo simulation examining student perceptions of GenAI tools and educational outcomes.
Pipeline for converting RGB-D scans into compact 3D virtual replicas with physically-based rendering and interaction support.
arXiv paper: surrogate ML model for predicting heat transfer in impinging jet arrays. CFD acceleration via neural networks.
arXiv paper: agent-based LLM approach for automating free-form clinical notes to HL7 FHIR structured data.
arXiv paper: 2D-guided cross-modal fusion method for LiDAR-camera alignment in 3D autonomous vehicle detection.
Automated page image classification system for historical document digitization handling diverse content types, layouts, and handwritten/printed text.
Unsupervised deep learning approach for inverse problems in computed tomography combining deep image prior and unrolled optimization.
One-step image generation improvement using soft embeddings in distilled masked diffusion models, enabling gradient flow for post-distillation refinements.
Analysis of positional and language bias in mid-layer representations of vision-language encoders for zero-shot language-grounded spatial understanding.
Artificial Age Score framework formalizing memory aging in LLMs, modeling how semantic and episodic information degrades across conversational sessions.
Benchmark using equivalence scoring for ground-truth-free evaluation of formally verifiable code generated by LLMs in languages like Dafny.
Knowledge distillation method for small language models that balances exploration and guidance through adaptive switching to address exposure bias.
Benchmark evaluating hallucinations in audio-visual multimodal LLMs with spoken queries under diverse acoustic conditions.
Information-Determined Scoring framework using LLMs to score free-text psychological assessment responses and augment rating-scale measures.
National Weather Service implementation of automated translation using LLMs and LILT's training process to serve non-English speakers.
Robotic assembly system using vision-language models to handle connector-aware assembly from instruction manuals with focus on physical constraints.
Video reasoning model with explicit spatio-temporal evidence grounding, extending evidence-centered reasoning from images to videos with temporal tracking and spatial localization.
Framework for composing synergistic multi-agent LLM teams by analyzing model interaction geometry to optimize collaboration and surpass single-model capabilities.
Method for LLM ownership verification using encrypted fingerprinting with protection against attacks during verification processes.
Multimodal framework combining ECG signals and anatomical knowledge for cardiac myocardial scar segmentation from MRI images.
Novel knowledge distillation-based membership inference attack against LLM-based recommendation systems to determine if data samples were in training sets.
Study comparing genetic algorithms and other methods for generating sample weights to mitigate bias in ML models, examining trade-offs between fairness and accuracy.
Research on models' ability to detect activation steering vectors injected into their residual streams during forward passes, revealing steering awareness in instruction-tuned models.
MRD fusion approach for high-resolution image understanding in MLLMs combining retrieval-augmented generation with detection to prevent object fragmentation and false positives.
Survey of cell-cell communication inference from single-cell omics data, covering biological mechanisms and computational approaches for ligand-receptor interaction analysis.
Continual learning study revealing asymmetry in experience replay between feature-level and classifier-level forgetting, showing minimal buffers preserve representations but not predictions.
ClinicalTrialsHub platform consolidating ClinicalTrials.gov with PubMed data extraction, increasing structured trial data access by 83.8% for patients and clinicians.
Benchmark of multiple instance learning models for lymphoma subtyping from whole slide images, comparing deep learning approaches for pathology diagnosis.
Adaptive Accountability Framework for networked multi-agent systems using cryptographic provenance tracking and runtime detection of emergent norms like collusion and unfairness.
Neuron-level interpretability study of code LLMs identifying language-specific neurons and concept layers, adapting NLP techniques to formal programming language structure.
GeoMotionGPT aligns motion space geometry with embedding space in LLM-based motion understanding by coupling discrete motion tokenization with semantic learning.
Systematic evaluation of LLM susceptibility to persuasion across six models using SMCR communication framework, testing adoption of counterfactual beliefs.
Forest-Chat integrates vision-language agents with satellite imagery for interactive forest change analysis, combining LLMs with computer vision for environmental monitoring.
Mechanistic study comparing internal algorithmic changes when post-training autoregressive models into masked diffusion models, investigating genuine bidirectional reasoning acquisition.
Analysis of diffusion language models showing arbitrary token generation order doesn't unlock reasoning improvements over autoregressive models, revealing limitations of flexibility.
STELLAR framework guides LLM-based generation of SystemVerilog Assertions for formal verification using structural similarity from hardware design ASTs.
One-shot data augmentation method combining geometric perturbations with noise injection for few-shot learning generalization to novel classes.
Sheaf Neural Networks algorithm with biomedical case study outperforming GCNs, GATs, and GraphSage on graph-structured biomedical data.