Skip to content
View duoan's full-sized avatar

Block or report duoan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
duoan/README.md

πŸ‘‹ Hi, I'm Duo An (Victor)

Senior Machine Learning Engineer @ Amazon AGI
Scaling multimodal foundation models β€” optimizing how they learn, generalize, and align through data and systems co-design.


🧭 About

I work at the intersection of machine learning and distributed systems,
designing large-scale learning pipelines and multimodal data systems that improve how foundation models learn from vast, diverse signals.

My focus areas:

  • 🧠 Training dynamics & optimization β€” improving convergence, stability, and efficiency of large-scale multimodal models
  • 🧩 Learning-centric systems β€” integrating data, architecture, and feedback to enhance representation learning and model alignment
  • βš™οΈ Scalable orchestration β€” leveraging Ray, Spark, and Kubernetes to parallelize multimodal workloads across thousands of GPUs
  • πŸ” Evaluation & feedback loops β€” automating model-driven data refinement and continual quality signals for alignment and adaptation

My work centers on how models learn, not just how they’re trained.


🧰 Core Stack

Machine Learning

PyTorch Transformers Ray FAISS DeepSpeed

Systems & Infra

AWS Spark Kubernetes CDK Docker

Languages

Python Scala Rust C++


βš–οΈ Principles

1. Models and systems co-evolve.
The best architectures emerge when data, compute, and learning dynamics are designed together.

2. Scale reveals behavior.
Many learning problems only appear β€” and can only be solved β€” at massive scale.

3. Data is part of the model.
Every batch defines what the model becomes.


πŸ“Š Snapshot

duoan's github stats Top Langs


🌐 Connect

Linkedin Badge Gmail Badge


β€œAt scale, learning is a systems problem β€” and every system is a hypothesis about how intelligence forms.”
β€” Duo An

Pinned Loading

  1. ReproduceAI ReproduceAI Public

    Recreating every milestone in Machine Learning and Artificial Intelligence

    Python