VyetGokyra/README.md

Hi! My name is Viet Tien Pham

🚀 AI Engineer

I'm an AI Engineer and Researcher passionate about building intelligent systems that understand and interact with humans naturally.

I have solid experience working on speech technologies, including:

  • Automatic Speech Recognition (ASR)
  • Speaker Verification (SV)
  • Text-to-Speech (TTS)
  • Audio Large Language Models (Audio LLMs)

I enjoy designing callbot and contact center solutions, conversational AI, and human-robot interaction systems, where the ability to process and generate natural speech plays a critical role.

During my university years, I worked extensively on computer vision and deep learning, focusing on:

  • Image segmentation
  • Image classification
  • Few-shot segmentation
  • Object detection

This foundation helped me develop strong skills in designing and fine-tuning large-scale models, as well as integrating them into real-world applications.

On the research side, I'm interested in:

  • Reasoning with LLMs
  • Reinforcement Learning (RL) for optimizing conversational strategies
  • Retrieval-Augmented Generation (RAG) for enhancing knowledge-grounded dialogue systems

My technical background spans both model development and deployment, covering end-to-end speech pipelines, advanced audio feature engineering, and multi-modal reasoning capabilities.

I always aim to bridge the gap between state-of-the-art research and impactful user-facing products, from scalable voicebots for customer service to advanced vision- and speech-based interactive systems.


🎯 Focus Areas

  • ASR, SV, TTS, Audio LLMs
  • Callbot / contact center automation
  • Human-robot interaction
  • Computer vision (segmentation, classification, few-shot segmentation, object detection)
  • Reasoning and RAG with LLMs
  • Reinforcement Learning for dialogue optimization

📫 Contact


⭐ Feel free to check out my repositories and connect with me!

Pinned

  1. ai-agents-course Public

    Forked from pearls-lab/ai-agents-course

    Jupyter Notebook 1

  2. awesome-generative-ai-guide Public

    Forked from aishwaryanr/awesome-generative-ai-guide

    A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!

    1

  3. awesome-LLM-bayesian-optimization Public

    A curated list of LLM Bayesian optimization tutorials, papers, and software

    1

  4. awesome-audio-visual Public

    Forked from krantiparida/awesome-audio-visual

    A curated list of different papers and datasets in various areas of audio-visual processing

  5. awesome-speech-recognition-speech-synthesis-papers Public

    Forked from zzw922cn/awesome-speech-recognition-speech-synthesis-papers

    Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)