gnana70

Follow

🎯

Focusing

Gnana Prasath gnana70

🎯

Focusing

Follow

Data Scientist

12 followers · 1 following

Chennai
19:32 (UTC +05:30)
in/gnana-prasath

Achievements

Achievements

gnana70/README.md

Hi, I'm Gnana Prasath

Lead Data Scientist | Deep Learning, Generative AI, Computer Vision, NLP

🧑‍💻 About Me

💼 Lead Data Scientist at Yubi, Chennai, India
🎯 9+ years building scalable, end-to-end deep learning solutions in computer vision and NLP
🤖 Passionate about Generative AI, LLMs, Document AI, and deploying real-world ML products
🏆 Proven track record delivering ML and GenAI projects under tight deadlines through strategic planning and cross-functional collaboration

🚀 Skills & Tech

Languages: Python
Backend: FastAPI, Django Rest Framework
Databases: PostgreSQL
Cloud Platforms: AWS, Azure
Expertise: OCR, Document AI, Generative AI, LLMs, Retrieval-Augmented Generation (RAG), Deep Learning, NLP

🏢 Work Highlights

Yubi (Lead Data Scientist, 2022–Present)

Led all Computer Vision initiatives, projects & POCs for Yubi Group
Developed a template-agnostic bank statement extraction system (deployed across India, Sri Lanka, Middle East)
Fine-tuned Llama 3.2 1B for entity extraction in financial documents
Built a custom OCR solution saving $10K+/month compared to AWS Textract
Designed a KYC extraction system (7 deep learning models, >99% accuracy, <0.5 sec latency, half a million documents/day, $0.5M/year cost saving)
Engineered a scalable form recognition engine (Vision-Language Transformer/Donut, >90% accuracy, 20K+ PDFs/day)
Delivered a CPU-only captcha resolver (5+ lakh captchas/day)

TheMathCompany (Senior Associate - Data Science)

Led a 10-member team for payment recommendation in the auto industry
Built price elasticity and optimization frameworks, Airflow ML pipelines

Accenture (App Dev Senior Analyst)

Engineered ML-powered resource planners, classifier chains, Flask APIs
Recognized as "Automation Prime certified in AI & Analytics"

🌐 Open Source & Community

🏅 Built a state-of-the-art Tamil Text Recognition model (360° natural scene reading, outperforms Tesseract, PaddleOCR, EasyOCR)
- GitHub repo: tamil_ocr
- PyPI: ocr_tamil
📄 Published: "Pharmaceutical inspection using machine vision" (Journal of Advanced Research in Dynamical and Control Systems, Jan 2017)

📚 Education

B.E. Mechatronics, Kumaraguru College of Technology (2012–2016)
Higher Secondary (Science, 95%), SRC Memorial Matriculation School

💬 Let's Connect!

📫 Reach me at gnana70@gmail.com
💼 LinkedIn
🏠 GitHub

“Turning ideas into scalable AI solutions.”

Pinned Loading

tamil_ocr tamil_ocr Public

OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes

Python 68 14