zephyr-sh/README.md

Z. Yuan

Email | LinkedIn | GitHub | Website

💡 Specializing in Vision + Language, building complete solutions for document AI, face recognition, and eKYC.


🧑‍💻 About Me

I am a Senior Computer Vision & Machine Learning Engineer, with deep expertise in both vision and language domains.
My key strengths include end-to-end development, infrastructure design, and cross-functional deployment.

  • Specializing in Document OCR/KIE, Face Recognition, and Face Anti-Spoofing Detection
  • Experienced in model design and training, including CNN, Transformer, ViT, and self-supervised learning
  • Proficient in MLOps workflows, including ONNX Runtime, quantization (PTQ/QAT), Docker, and CI/CD
  • Self-hosting a full-stack technical blog and research note platform built with React, FastAPI, and Nginx
  • Skilled in Linux system administration and DevOps practices

📂 Most of my professional work is maintained under the DocsaidLab organization.

This profile contains only personal repositories and prototypes.


🧠 Tech Stack

| Stack | Proficiency | Notes |
| --- | --- | --- |
| Python / PyTorch | Expert | Model training, CNN, Transformer, ViT, SSL |
| Computer Vision & Deep Learning | Expert | OCR, KIE, Face Recognition, Anti-Spoofing |
| ONNX Runtime / PTQ / QAT | Proficient | Inference optimization & quantization |
| Docker / Docker Compose | Proficient | Containerization & private deployment |
| FastAPI / RESTful API | Proficient | Lightweight backend & eKYC integration |
| Nginx / Linux DevOps | Proficient | HTTPS, reverse proxy, CI/CD automation |
| Docusaurus (React) | Proficient | Multilingual docs & blog framework |

🚀 Featured Projects

Deep Learning

Tools & Integrations

  • AutoTraderX: Integration experiments with Taiwan brokerage APIs.
  • Capybara: CV toolkit for batch inference and utilities.
  • GmailSummary: Gmail ⇄ OpenAI summarizer (archived).
  • Nginx Notes: Handy Nginx configs & tips.
  • WordCanvas: Synthetic font‑to‑image data generator.

📚 Research Notes (230+)

Covers topics like ViT, OCR, Multimodal Learning, Self-Supervised Methods, and more.


📬 Contact

👍 Follow my Facebook fan page for updates on computer vision, AI papers, and project insights.

Pinned

  1. DocsaidLab/DocAligner (Public)

    Predictions of the four corners of documents.

    Python · 33 stars · 2 forks

  2. DocsaidLab/Capybara (Public)

    OpenCV and ONNX Runtime Inference Toolkit

    Python · 2 stars

  3. DocsaidLab/website (Public)

    A playground for our developers

    JavaScript · 1 star

  4. DocsaidLab/Chameleon (Public)

    Advanced Toolbox for PyTorch Development

    Python · 1 star · 1 fork