Skip to content
View alizat's full-sized avatar
β›³
Shooting the Moon
β›³
Shooting the Moon

Block or report alizat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
alizat/README.md

πŸ‘‹ Hi there, I'm Ali Ezzat

Welcome to my GitHub profile! I'm a data scientist who is specialized in machine learning and passionate about solving complex problems through data science, with a strong focus on generative AI and building impactful applications using natural language processing (NLP) and large language models (LLMs).

  • πŸ”­ Current Role: Chief Data Scientist at Synapse Analytics
  • 🌱 Education: PhD in Computer Science & MSc in Bioinformatics from Nanyang Technological University

🍳 What's cooking these days

I am mostly working on the "Life with" repos here on my GitHub:

I have my own pet projects that I work on from time to time:

  • HEROIC Surfer
    • HEROIC is a self-development platform that provides lots of content (book summaries, daily wisdom videos, meditations, etc.)
    • I am currently scraping the HEROIC website.
    • With what I am scraping, I intend to build a LLM-powered Shiny app that lets you explore the HEROIC database.
  • BGG Scraper
    • Board Game Geek (BGG) is an encyclopedic website that has all kinds of information on all board games ine existence.
    • I developed many functions for scraping different kinds of board game info. Most of these functions made use of the BGG's XML API.

πŸ› οΈ Skills & Technologies

  • Programming: Python, R, MATLAB, SQL, Java, C++, JavaScript
  • Libraries: Transformers, PyTorch, scikit-learn, pandas, NumPy, Plotly, Matplotlib, seaborn, dplyr, ggplot2, Leaflet, Shiny, Gradio
  • DevOps/Development Tools: Amazon Web Services (AWS), MS Azure, Docker, Git, GitHub, HuggingFace, Spark, RStudio, Jupyter, Cursor, VSCode
  • Machine Learning: Model Deployment, Supervised Learning, XGBoost, Clustering, Feature Engineering, Feature Selection, Deep Learning, Gradient Descent, Convolutional Networks, Ensemble Learning, Manifold Learning, Cross-Validation, Time Series Forecasting, Recommender Systems, Matrix Factorization, Dimensionality Reduction, Graph Neural Networks
  • Generative AI: Large Language Models, Natural Language Processing, LLM fine-tuning, Retrieval-Augmented Generation (RAG), LangChain, Ollama, Open-source LLMs, Chatbots, Multi-modal RAG
  • Other Competencies: Statistics, Data Wrangling, Data Visualization, Data Mining, Association Analysis, Web Scraping, Parallel Computing, Algorithms, Software Engineering, Relational Databases, Graph Databases (Neo4j), Shiny App Development, Competitive Programming, Bioinformatics, Drug Discovery, Pharmaceuticals

πŸ“« Let's Connect

Feel free to explore my repositories and reach out if you'd like to collaborate on exciting data science projects!

πŸ“ˆ GitHub Stats

Ali's GitHub Stats

Pinned Loading

  1. Life-with-LLMs Life-with-LLMs Public

    Pet projects involving Gen AI

    Jupyter Notebook 1

  2. Life-with-DL Life-with-DL Public

    Straight-forward DL Pet projects

    Jupyter Notebook

  3. Chemogenomic-DTI-Prediction-Methods Chemogenomic-DTI-Prediction-Methods Public

    Algorithms for prediction of drug-target interactions via computational (chemogenomic) methods

    MATLAB 47 14

  4. ropensci/dbparser ropensci/dbparser Public

    Source code for the R package, "dbparser" (i.e. DrugBank Parser)

    R 63 19

  5. bggscraper bggscraper Public

    Scripts for scraping all sorts of (publicly accessible) board games data from boardgamegeek.com

    R

  6. my_r_snippets my_r_snippets Public

    My own R snippets. Feel free to copy and use.

    Vim Snippet