EM-Aware Physical Synthesis: Neural Inductor Modeling and Intelligent Placement & Routing for RF Circuits
ML framework for RF circuit physical synthesis using neural inductor models and intelligent routing to generate GDSII layouts.
ML framework for RF circuit physical synthesis using neural inductor models and intelligent routing to generate GDSII layouts.
ML techniques applied to autonomous shuttle car-following models calibrated with field data. Peripheral to core interests.
Research on budget-constrained LLM agents that plan tool use under monetary constraints. Addresses sequential decision-making with expensive tool executions.
3D object detection framework using 4D mmWave radar point clouds with task-aware construction for improved weather-robust detection.
Survey covering multi-agent communication mechanisms in MARL, emergent language, and LLM-based agents across autonomous and collaborative systems.
Weakly supervised segmentation method improving pseudo-label quality for scribble-based medical image annotations.
Neural operator incorporating physical reciprocity constraints for seismic wave propagation modeling and computational efficiency.
Offline reinforcement learning framework for cyclic MDPs with heterogeneous stage dynamics addressing state distribution mismatch.
Approach routing LLM reasoning between latent and discrete spaces to improve efficiency while maintaining reasoning quality and confidence.
Training optimization technique for Mixture-of-Experts models addressing expert load imbalance through dynamic expert relayering.
Bayesian approach for estimating observation parameters in inverse problems using diffusion priors for posterior sampling.
Method using crosscoders for comparing internal representations across different LLM architectures to identify safety-critical behavioral differences.
Framework for improving feature importance estimation stability by aggregating model explanations across ensemble members for scientific discovery.
9B-parameter LLM architecture combining sparse and linear attention mechanisms for efficient long-context processing and reduced computational costs.
Training method for multi-turn RL of LLM agents using trajectory search rollouts to improve exploration in sparse reward environments.
Decentralized optimization algorithm for distributed machine learning addressing heterogeneous gradient variance across nodes.
Synthetic dataset generation using fractals for action recognition pre-training via Formula Driven Supervised Learning approach.
Comparative analysis of MAP and LMMSE estimators for blind inverse problems addressing non-convexity and solution non-identifiability.
Training-free framework using sparse latent steering to reduce object hallucination in large vision-language models via latent space geometry.
Deep learning approach for MIMO receiver design combining linear signal processing with ML blocks for improved scalability and explainability.
Method using token-level explanations to guide LLM-based query rewriting for improving neural retrieval system robustness.
Research on stabilizing Rectified Flow generative models for improved inversion and downstream tasks like reconstruction and editing.
Prototype Transformer architecture designed for interpretability by design, enabling explicit and transparent reasoning in autoregressive language models.
Region-to-Image Distillation approach for improving fine-grained visual understanding in MLLMs without repeated zooming during inference.
DMAP method for analyzing text using LLM next-token probability distributions, improving on perplexity metrics for context-dependent interpretation.
Selective Abstraction framework for LLMs to improve factual reliability in long-form generation by abstaining when confidence is low rather than discarding information.
Large Audio Language Models framework using audio-interleaved reasoning to improve comprehension of complex audio content beyond one-time encoding.
Research on activation steering in audio diffusion models to understand how semantic concepts are represented in attention layers.
Benchmark evaluating vision-language models for PDF-to-Markdown conversion on French documents. LLM application for document processing and RAG pipelines.
Bayesian deep learning approach for calibrated uncertainty in medical imaging decision support. ML application, limited to healthcare domain.
Analysis of implicit bias from logit regularization in linear classifiers. Theoretical ML research, limited practical developer relevance.
Robust control framework combining conformal prediction and system level synthesis for learned dynamics models. Control theory, limited AI/ML interest.
Large-scale book graph dataset for personalized recommendation in Bangla. NLP dataset, but low relevance to core AI/ML development interests.
Framework combining statistical and agentic reasoning for predicting large model performance from limited data. AI agents for model evaluation.
Evaluation framework testing whether GPT-4o possesses theory of mind via causal mental state models. LLM capability research and evaluation.
Theoretical analysis of Nash equilibria in convex Markov games and learning algorithms. Game theory foundations, limited practical AI application.
Low-latency speech recognition encoder for edge devices with streaming capability. Developer tool for real-time ASR applications.
Game theory analysis of bidding algorithms in repeated first-price auctions. Economics/game theory, not AI/ML development.
Physics-guided LLM agent for symbolic equation discovery using multi-step reasoning. AI agents for scientific research with domain knowledge.
Few-step diffusion language model via trajectory self-distillation for faster parallel token decoding. LLM optimization for inference efficiency.
Efficient attention mechanism for real-time video generation using diffusion transformers. Developer tool for reducing computational bottlenecks.
Neural network approach for solving nonsmooth optimal control of linear PDEs. Scientific computing, not directly ML/AI development.
Unified multimodal model with test-time scaling via chain-of-thought reasoning for complex tasks. LLM application combining vision and language.
Algorithm for nuclear-norm regularized matrix optimization using Burer-Monteiro decomposition. Pure mathematics/optimization, no AI connection.
Diffusion models for accelerating MRI reconstruction by optimizing sampling patterns. Medical imaging application, not AI/ML development focused.
Neural network approach to solve parametric partial differential equations and enhance physics-informed deep learning methods.
Watermarking technique for embedding invisible watermarks in diffusion model outputs to prevent misuse of AI-generated images.
Defense technique against backdoor poisoning attacks on machine learning malware classifiers through post-training purification.
Research on how Chain-of-Thought training enables LLMs to generalize by composing learned skills for complex reasoning tasks.
Off-policy learning method for personalized policies under unobserved confounding without unconfoundedness assumptions.