Publications

A collection of my research work.

More Memory, Worse Agents: Error Reproduction and Anti-Persistence in LLM Agents

More Memory, Worse Agents: Error Reproduction and Anti-Persistence in LLM Agents

Ruhan Wang, Kishan Panaganti, Dongruo Zhou

The Fortieth Conference on Neural Information Processing Systems (NeurIPS) 2026

Identifies a structural error-reproduction failure mode in persistent abstraction memory for self-improving LLM agents, and proposes ANTI-PERSISTENCE: a memory module that stores only factual interaction records and synthesizes task-specific abstractions on demand via an adaptive contextual UCB bandit over abstraction modes.

Uncertainty-Aware Federated Reasoning with Large Language Models

Uncertainty-Aware Federated Reasoning with Large Language Models

Ruhan Wang, Chengkai Huang, Zhiyong Wang, Rui Wang, Tong Yu, Lina Yao, Dongruo Zhou

Conference on Language Modeling (COLM) 2026

A parameter-free federated reasoning framework (FERA) using uncertainty quantification and a dual-pipeline aggregation mechanism for cross-client knowledge integration.

Paper
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation

Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation

Runze Zhao, Yue Yu, Ruhan Wang, Chunfeng Huang, Dongruo Zhou

Forty-Third International Conference on Machine Learning (ICML) 2026

Instance-dependent analysis of continuous-time reinforcement learning using maximum likelihood estimation.

Paper
Federated In-Context Learning: Iterative Refinement for Improved Answer Quality

Federated In-Context Learning: Iterative Refinement for Improved Answer Quality

Ruhan Wang, Zhiyong Wang, Chengkai Huang, Rui Wang, Tong Yu, Lina Yao, John C.S. Lui, Dongruo Zhou

Forty-Second International Conference on Machine Learning (ICML) 2025

Privacy-preserving framework (Fed-ICL) that combines federated learning with in-context learning to collaboratively train diverse LLM agents, with theoretical equivalence to established FL algorithms.

PaperCode
Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models

Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models

Chengkai Huang, Junda Wu, Yu Xia, Sheldon Yu, Ruhan Wang, Tong Yu, Ruiyi Zhang, Ryan Rossi, Branislav Kveton, Dongruo Zhou, Julian McAuley, Lina Yao

ACM Transactions on Intelligent Systems and Technology (TIST) 2025

Formal framework for LLM-based Agentic Recommender Systems (LLM-ARS) covering user profiling, memory, planning, and action selection, with seven key research challenges identified.

Paper
How to Provably Improve Return Conditioned Supervised Learning?

How to Provably Improve Return Conditioned Supervised Learning?

Zhishuai Liu, Yu Yang, Ruhan Wang, Pan Xu, Dongruo Zhou

arXiv preprint 2025

Theoretical analysis and improvements to return-conditioned supervised learning in reinforcement learning.

PaperCode
Quantum Diffusion Models for Few-Shot Learning

Quantum Diffusion Models for Few-Shot Learning

Ruhan Wang, Ye Wang, Jing Liu, Toshiaki Koike-Akino

2025 IEEE International Conference on AI and Data Analytics (ICAD) 2025

Quantum diffusion model framework for few-shot learning, combining generative quantum circuits with classical diffusion training.

Paper
Safe Decision Transformer with Learning-based Constraints

Safe Decision Transformer with Learning-based Constraints

Ruhan Wang, Dongruo Zhou

7th Annual Learning for Dynamics and Control Conference (L4DC) 2025

Constrained Q-learning Decision Transformer (CQDT) for safe offline RL, addressing stitching limitations of CDT while strictly adhering to safety constraints.

Paper
Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

Ruhan Wang, Yu Yang, Zhishuai Liu, Dongruo Zhou, Pan Xu

Transactions on Machine Learning Research (TMLR) 2024

Return Augmented Decision Transformer (RADT) for offline off-dynamics RL with rigorous suboptimality analysis and D4RL evaluation across off-dynamics shifts.

PaperCode
LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models

LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models

Ahmad Faiz, Sotaro Kaneda, Ruhan Wang, Rita Osi, Parteek Sharma, Fan Chen, Lei Jiang

The Twelfth International Conference on Learning Representations (ICLR) 2024

End-to-end carbon footprint projection model for LLMs across training, inference, experimentation, and storage phases, integrating LLM, hardware, and data center parameters.

PaperCode
JustQ: Automated Deployment of Fair and Accurate Quantum Neural Networks

JustQ: Automated Deployment of Fair and Accurate Quantum Neural Networks

Ruhan Wang, Lei Jiang, Fahiz Baba-Yara, Fan Chen

29th Asia and South Pacific Design Automation Conference (ASP-DAC) 2024

First fairness-aware QNN deployment framework jointly optimizing fairness and accuracy on NISQ devices via reinforcement-learning-driven design space exploration.

Paper
A Hybrid Quantum-Classical Neural Network for Learning Transferable Visual Representation

A Hybrid Quantum-Classical Neural Network for Learning Transferable Visual Representation

Ruhan Wang, Phil Richerme, Fan Chen

Quantum Science and Technology 2023

QCLIP, a hybrid quantum-classical architecture for learning transferable visual representations via Quantum Contrastive Language-Image Pre-training.

Paper

Person Re-Identification Based on Generative Adversarial Network and Self-Calibrated Convolution

Kaifang Li, Guancheng Hui, Ruhan Wang, Miaohui Zhang

Laser & Optoelectronics Progress 2022

Person re-identification framework combining GAN-based augmentation with self-calibrated convolution.

Paper

A Brief Analysis on Damaged Building Classification: Optimizer and Learning Rate

Ruhan Wang, Ruixin Qiao, Yukang Zou

2022 International Conference on Cloud Computing, Performance Computing and Deep Learning (SPIE 12287) 2022

Empirical comparison of optimizer and learning-rate schedule combinations for ResNet-based post-hurricane damaged building classification on satellite imagery.

Paper