AI Systems Engineer

Design and optimize high-performance algorithm engines for in-house large model development and real-world AI applications.

About the Role

Build the systems core that powers 39 AI's in-house large model development, from training to inference deployment.
Partner with research and product teams to turn model capabilities into reliable, high-performance production systems.
Shape long-term technical direction across inference engines, distributed training, and model serving.

Responsibilities

Design and develop high-performance algorithm engines for in-house large model development and applications, providing foundational primitives for training and inference.
Optimize and debug large model training and inference systems, working extensively with and contributing to engines like SGLang, vLLM, and Megatron-LM.
Run performance evaluation and analysis for algorithm deployment, define system-level technical planning and performance standards, and continuously improve stability and efficiency.
Participate in the architecture design and implementation of large-model systems, driving efficient deployment and iteration in real-world scenarios.

Requirements

Solid C/C++/Python programming skills, familiarity with common data structures and algorithms, and strong motivation to solve complex systems problems.
Strong understanding of computer systems, with experience developing and architecting large software systems or low-level engines.
Hands-on experience with at least one large model inference or training engine, such as SGLang, vLLM, or Megatron-LM, including practical debugging and optimization.
Strong systems debugging skills; able to quickly root-cause and resolve performance and stability issues in distributed environments.
Strong communication, collaboration, and ownership, with the ability to drive technical plans into production and iterate continuously.

Nice to Have

Experience developing large model training or inference systems, with familiarity in related architectures and scheduling strategies.
Familiarity with deep learning framework internals such as PyTorch, or compiler technologies such as Triton and TVM.
Experience with distributed systems, high-performance networking, or storage optimization.
Awards in programming competitions such as ICPC, IOI, NOI, or equivalents, or contributions to relevant open-source projects.
Practical experience with dialogue systems, NLP application engines, or large model service deployment.

How to Apply

Send your resume along with GitHub, personal project, or technical writing links to the contact below.
For open-source contributions or past projects, direct links or short write-ups are welcome.
Take-home tasks, if any, will be paid at a reasonable market rate.
No requirements around years of experience or degree - we evaluate on technical depth and past work.

Explore More

Other Open Roles

View all roles

Brand and Creative Designer

Own visual direction and creative execution across campaigns, launches, social media, product visuals, and the long-term brand system.

View role

Product Designer

Design end-to-end AI product experiences for voice, video, and image products, turning model capabilities into simple, useful, human-feeling workflows.

View role

Audio LLM Researcher

Research and build audio understanding models and realtime audio LLM systems for natural, low-latency, human-like conversation.

View role

Full-Stack Engineer

Own the full path from database design, APIs, and web frontends to deployment, directly taking AI products from zero to one.

View role