portrait
Photo by my lovely gf Liangyuting Zhang

Wenxuan Xu 许文轩

I am Wenxuan Xu, a Master's student in Computer Science at Dartmouth College. My current focus is AI infrastructure, especially LLM inference acceleration, serving systems, CUDA kernels, and quantization.

I previously worked on Human-Computer Interaction and VR/AR with Andrew Campbell, Hai-Ning Liang, Wolfgang Stuerzlinger, and Yuntao Wang. I eventually felt that HCI research was dead water for me, both in future prospects and in practical upside. Still, that detour gave me real research training and led to works such as LENS.

In this era, joining industry and building real systems is clearly the better choice for me. Influenced by friends who were already working on infrastructure, I started moving into LLM infra and inference acceleration. I am serious about it: I keep learning, building, and contributing to projects such as SGLang and LiteInfer.

I am actively seeking New Grad roles in AI infrastructure, especially LLM inference acceleration. I am excited about ambitious startups and challenging systems work. If my background looks relevant, please contact me.

Education
Dartmouth College
Master of Science in Computer Science with Concentration in Digital Arts
Hanover, United States Sep 2024 - Present
University of Liverpool (Xi'an Jiaotong-Liverpool University)
Bachelor of Science in Information and Computing Science
Suzhou, China Sep 2020 - Jun 2024
• First Class Honours
News
2026 Apr 24
Our LENS paper has been accepted to ACL Main! Huge thanks to all collaborators. See you in San Diego!
2025 Dec 28
My first-author paper LENS has been released on arXiv! Check it out!
2025 May 01
I'm now starting to work as a Research Assistant at HealthX Lab, Dartmouth College with Andrew Campbell
Open Source Contributions
SGLang
Open-source contributor
I contribute to the SGLang community through community profile work, small bug fixes, and documentation improvements. I am currently learning NVFP4, PD disaggregation, and WideEP, with the long-term goal of growing into a strong LLM systems engineer and contributing to more roadmap-level work. For full contribution history, please check my GitHub activity.
LiteInfer
Project developer
A high-performance C++/CUDA LLM inference engine developed on top of KuiperLlama / KuiperInfer. LiteInfer extends the educational inference framework into a more complete serving stack for decoder-only LLMs, including hand-written CUDA kernels, vLLM-style paged KV cache, continuous batching, request scheduling, and INT8 / AWQ INT4 weight-only quantization. The project supports Llama-3 and Qwen-family models and is designed as a transparent systems-learning codebase without PyTorch runtime, external attention libraries, or external quantization kernels.
Selected Publications (see full on my Google Scholar)
LENS: LLM-Enabled Narrative Synthesis for Mental Health by Aligning Multimodal Sensing with Language Models

LENS: LLM-Enabled Narrative Synthesis for Mental Health by Aligning Multimodal Sensing with Language Models

Wenxuan Xu, Arvind Pillai, Subigya Nepal, Amanda C Collins, Daniel M Mackin, Michael V Heinz, Tess Z Griffin, Nicholas C Jacobson, Andrew Campbell
Annual Meeting of the Association for Computational Linguistics (ACL), Main Conference (2026)
Predicting Ray Pointer Landing Poses in VR Using Multimodal LSTM-Based Neural Networks

Predicting Ray Pointer Landing Poses in VR Using Multimodal LSTM-Based Neural Networks

IEEE Conference on Virtual Reality and 3D User Interfaces (2025)
Optimizing Moving Target Selection in VR by Integrating Proximity-Based Feedback Types and Modalities

Optimizing Moving Target Selection in VR by Integrating Proximity-Based Feedback Types and Modalities

Xuning Hu*, Wenxuan Xu*, Yushi Wei, Zhang Hao, Jin Huang, Hai-Ning Liang (* equal contribution)
IEEE Conference on Virtual Reality and 3D User Interfaces (2025)
Exploring the Effects of Spatial Constraints and Curvature for 3D Piloting in Virtual Environments

Exploring the Effects of Spatial Constraints and Curvature for 3D Piloting in Virtual Environments

Xuning Hu, Xinan Yan, Yushi Wei, Wenxuan Xu, Yue Li, Yue Liu, Hai-Ning Liang
IEEE International Symposium on Mixed and Augmented Reality (ISMAR) (2024)
Services
Teaching
Teaching Assistant, COSC 74: Machine Learning and Statistical Data Analysis (Dartmouth College)
Reviewing
CHI 2025 Late-Breaking Work/ CHI PLAY 2025 Full Papers/ ISMAR 2025 Papers/ SUI 2025 Papers
View my SGLang PRs