About Me
I am a Machine Learning Engineer at EmbodyX Inc. & AIbao LLC, working on LLM compression, deployment, and real-time AI systems. My work focuses on pruning and quantizing large language models, deploying MoE models on edge platforms, and building real-time chatting systems integrating LLMs with TTS.
Previously, I worked as a Software Engineering Co-op at Cognex Corporation, where I developed .NET wrapper layers and contributed to VisionPro software.
I received my M.S. in Electrical and Computer Engineering from Northeastern University and my B.E. in Electrical Engineering from Beijing University of Technology.
Research Interests
- Large Language Model Compression & Deployment
- Edge Computing & On-device AI Inference
- Computer Vision & Autonomous Driving Perception
