|
Wendong XU (徐文栋)Ph.D. Student |
Algorithm & Architecture & AI
I am a Ph.D. Candidate at the Department of Electrical and Electronic Engineering, the University of Hong Kong (HKU) (Sept 2023 - Jun 2027 Exp.), supervised by Prof. Bei YU and Prof. Ngai WONG. I received M.Eng. in Computer Technology from Institute of Computing Technology, University of Chinese Academy of Sciences (UCAS) (2018-2021), and B.Eng. in Computer Science and Technology from Beijing Information Science and Technology University (BISTU) (2014-2018).
My research interests include Code Agent and the whole lifetime of Agents.
I am currently a Co-Founder and Research Scientist at UniPat AI (Oct 2025 - Present). Before starting my PhD, I was a Senior Software Engineer at Baidu (Jul 2021 - Aug 2023), where I worked on the evolution of page search engine.
Email: kirai.wendong [at] gmail.com
Code Agent
Machine Learning System
Nonparametric Teaching of Attention Learners
Chen ZHANG, Jianghui WANG, Bingyang CHENG, Zhongtao CHEN, Wendong XU, Cong WANG, Marco CANINI, Francesco ORABONA, Yik Chung WU, Ngai WONG
International Conference on Learning Representations (ICLR’26), 2026
BabyVision: Visual Reasoning Beyond Language
Liang CHEN, Weichu XIE, Yiyan LIANG, Hongfeng HE, Hans ZHAO, Zhibo YANG, Zhiqi HUANG, Haoning WU, Haoyu LU, Y. CHARLES, Yiping BAO, Yuantao FAN, Guopeng LI, Haiyang SHEN, Xuanzhong CHEN, Wendong XU, Shuzheng SI, Zefan CAI, Wenhao CHAI, Ziqi HUANG, Fangfu LIU, Tianyu LIU, Baobao CHANG, Xiaobo HU, Kaiyuan CHEN, Yixin REN, Yang LIU, Yuan GONG, Kuan LI
arXiv:2601.06521, 2026
HaLoRA: Hardware-aware Low-Rank Adaptation for Large Language Models Based on Hybrid Compute-in-Memory Architecture
Taiqiang WU*, Chenchen DING*, Wenyong ZHOU*, Yuxin CHENG, Xincheng FENG, Shuqi WANG, Wendong XU, Chufan SHI, Zhengwu LIU, Ngai WONG
ACM Transactions on Design Automation of Electronic Systems (TODAES), 2026
S-TRAC: An Algorithm-Hardware Co-design of Sparsity-aware Threshold Adjustment for Accelerator-based RISC-V ISA Extensions
Yueting LI, Terry Tao YE, Wanshuang LIN, Wendong XU, Ngai WONG, Weisheng ZHAO
ACM Transactions on Parallel Computing (TOPC), 2026
SWINGARENA: Competitive Programming Arena for Long-context GitHub Issue Solving
Wendong XU, Jing XIONG, Chenyang ZHAO, Qiujiang CHEN, Haoran WANG, Hui SHEN, Zhongwei WAN, Jianbo DAI, Taiqiang WU, He XIAO, Chaofan TAO, Z. Morley MAO, Ying SHENG, Zhijiang GUO, Hongxia YANG, Bei YU, Lingpeng KONG, Quanquan GU, Ngai WONG
International Conference on Learning Representations (ICLR’26), 2025. Oral
AnchorTP: Resilient LLM Inference with State-Preserving Elastic Tensor Parallelism
Wendong XU, Chujie CHEN, He XIAO, Kuan LI, Jing XIONG, Chen ZHANG, Wenyong ZHOU, Chaofan TAO, Yang BAI, Bei YU, Ngai WONG
Design, Automation and Test in Europe (DATE’26), 2025
RaccoonServe: Soft Prefill-Decode Disaggregation with Hybrid Scheduling for LLM Serving
Wendong XU, Chujie CHEN, et al.
Manuscript, 2025
PASProxy: Performance-Aware Scheduling Proxy for Distributed Large Language Model Serving
Wendong XU, et al.
Manuscript, 2025
A Learned Performance Model with Transfer Learning Across GPUs on Tensorized Instruction
Yang BAI, Mingjun LI, Wendong XU, Bei YU
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2025
Fighter: Unveiling the Graph Convolutional Nature of Transformers in Time Series Modeling
Chen ZHANG, Weixin BU, Wendong XU, Runsheng YU, Yik-Chung WU, Ngai WONG
arXiv:2510.17106, 2025
Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models
He XIAO, Qingyao YANG, Dirui XIE, Wendong XU, Zunhai SU, Runming YANG, Wenyong ZHOU, Haobo LIU, Zhengwu LIU, Ngai WONG
arXiv:2509.16989, 2025
PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models
He XIAO, Runming YANG, Qingyao YANG, Wendong XU, Zhen LI, Yupeng SU, Zhengwu LIU, Hongxia YANG, Ngai WONG
arXiv:2509.16989, 2025
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Weichu LIU, Jing XIONG, Yuxuan HU, Zixuan LI, Minghuan TAN, Ningning MAO, Hui SHEN, Wendong XU, Chaofan TAO, Min YANG, Chengming LI, Lingpeng KONG, Ngai WONG
arXiv:2509.07403, 2025
PhyX: Does Your Model Have the 'Wits’ for Physical Reasoning?
Hui SHEN, Taiqiang WU, Qi HAN, Yunta HSIEH, Jizhou WANG, Yuyue ZHANG, Yuxin CHENG, Zijian HAO, Yuansheng NI, Xin WANG, Zhongwei WAN, Kai ZHANG, Wendong XU, Jing XIONG, Ping LUO, Wenhu CHEN, Chaofan TAO, Zhuoqing MAO, Ngai WONG
arXiv:2505.15929, 2025
A Custom RISC-V ISA with Scalable Processing Units for Efficient Neural Network Inference
Yueting LI, Wanshuang LIN, Wendong XU, Ngai WONG, Weisheng ZHAO
Design Automation Conference Engineering Track Poster (DAC-ENG’25), 2025
PPD: A Portable and Highly Parallel Dispatching System for Deep Learning
Wendong XU, Yuhao JI, Yang BAI, Yueting LI, Yuxuan ZHAO, Zhengwu LIU, Bei YU, Ngai WONG
ACM Transactions on Design Automation of Electronic Systems (TODAES), 2024
Performance benchmarking methods, devices, equipment and storage media for index data structures
Wendong XU, Ning WANG
CNIPA, Patent No. CN116204441A, 2023
Breast Cancer Molecular Subtype Prediction on Pathological Images with Discriminative Patch Selection and Multi-Instance Learning
Hong LIU*, Wendong XU* (equal contribution), Zihao SHANG, Xiangdong WANG, Haiyan ZHOU, Kewen MA, Huan ZHOU, Jialin QI, Jiarui JIANG, Lilan TAN, Huimin ZENG, Huijuan CAI, Kuansong WANG, Yueliang QIAN
Frontiers in Oncology, 2022
Method, apparatus, device and storage medium for executing distributed task
Wendong XU
CNIPA, Patent No. CN116069497A, 2022
Numerical storage method, Numerical query method, equipment
Wendong XU, Jin LIANG, Pengyu SUN, Wenbo YANG
CNIPA, Patent No. CN114817651B, 2022
CoUNet: An End-to-End Colonoscopy Lesion Image Segmentation and Classification Framework
Wendong XU, Hong LIU, Xiangdong WANG, Hanqiang OUYANG, Yueliang QIAN
International Conference on Video and Image Processing (ICVIP’20), 2020
Liver segmentation in CT based on ResUNet with 3D probabilistic and geometric post process
Wendong XU, Hong LIU, Xiangdong WANG, Yueliang QIAN
International Conference on Signal and Image Processing (ICSIP’19), 2019
Ph.D. Candidate, Department of Electrical and Electronic Engineering, The University of Hong Kong (HKU), Sept 2023 - Jun 2027 (Exp.)
Supervised by Prof. Bei YU and Prof. Ngai WONG
Research Interests: Code Agent, Machine Learning System
M.Eng. in Computer Technology, Institute of Computing Technology (ICT), University of Chinese Academy of Sciences (UCAS), Sept 2018 - Jun 2021
Research Interests: Machine Learning System, Deep Learning, Computer Vision
B.Eng. in Computer Science and Technology, Beijing Information Science and Technology University (BISTU), Sept 2014 - Jun 2018
Mainly Working on: Competitive Programming
UniPat AI Beijing / Shanghai, China
Research Scientist (Co-Founder), Oct 2025 - Present
Baidu Beijing, China
Senior Software Engineer, Core Searcher (Page Searcher), Jul 2021 - Aug 2023
Worked on the evolution of General Search Sort and Recall's Architecture
Performance optimization of recall system, distributed system design, and succinct data structures for inverted indexes
Research Institute, Anonymous Chip Company Shanghai, China
Research Intern, Dec 2023 - May 2024
Designed and implemented heterogeneous LLM inference system
Bytedance Beijing, China
Software Engineer Intern, Feed (Xiaohe), May 2020 - Sept 2020
Worked on feed recommendation system
Google Beijing, China
Software Engineer Intern in ML, Mobile-first Indexing, Mar 2019 - Jul 2019
Improved efficiency of MF indexing monitoring products
Megvii Beijing, China
Research Intern, Base Model, Sept 2019 - Feb 2019
Researched on image semantic segmentation
Momenta Beijing, China
Research and Development Intern, DMS, Feb 2019 - Mar 2019
Designed algorithm for driver fatigue detection
| Full Postgraduate Studentship, | The University of Hong Kong |
| Postgraduate Studentship, | University of Chinese Academy of Sciences |
| Bronze Medals, | ACM/ICPC Xi'An, Beijing Regional |