Skip to content
View Pedro2um's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Pedro2um

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Pedro2um/README.md

Hi there, I'm Pedro Igor Gomes de Morais 👋

LinkedIn Email

I'm a Data Scientist from Vitória, ES, Brazil 🇧🇷, passionate about leveraging data to build intelligent solutions and drive insights. My core interests lie in Natural Language Processing (NLP), predictive modeling, and developing scalable data pipelines. I enjoy the challenge of translating complex academic research into practical, production-ready systems and have a proven ability to automate workflows for significant efficiency gains.


🔭 What I'm Currently Focused On:

  • Working as a Machine Learning Engineer at Labic , building ETL pipelines to empower the tourism sector with data.
  • Contributing as an Undergraduate Researcher at Data Science Lab - UFES, where I've:
    • Developed PyTorch-based churn models for TIM Brazil (achieving cost reduction).
    • Currently developing an automated WhatsApp data ingestion and sentiment analysis system for misinformation monitoring.
    • Co-authored publications for AINA'25 and SBRC'24.
  • Advancing my knowledge through my B.S. in Computer Science at the Federal University of Espírito Santo (UFES).

💻 My Tech Stack & Tools:

Languages: Python SQL Java C/C++ Elixir

Machine Learning & NLP: PyTorch | Transformers (BERT) | Scikit-learn | SHAP | FAISS

Cloud, DevOps & Data Tools: AWS Docker Git Linux DVC | Pandas | Spark | Google Places API | SQL

Data Visualization: Matplotlib | Seaborn


🚀 Highlighted Project Areas:

You can find more details and other projects on my Portfolio Website

  • Episodic Memory for LLMs: Implemented FAISS semantic memory to enhance chatbot coherence.
  • Predictive Modeling: Developed high-accuracy churn prediction models.
  • Data Engineering: Built ETL pipelines and automated data reporting.
  • NLP Research: Processed large text datasets for Portuguese NLP studies.

📫 Let's Connect & Collaborate:


📊 My GitHub Stats:

Pedro's GitHub stats Top Langs

Pinned Loading

  1. trab2-ed trab2-ed Public

    This repository contains the implementation of the second assignment for the Data Structures (Estruturas de Dados) course. The project focuses on the development and application of various data str…

    C

  2. tbotrab1 tbotrab1 Public

    tbotrab1 is a command-line application developed in C, designed to perform specific tasks efficiently. The project leverages modular programming practices and includes examples and scripts to facil…

    C 2

  3. LinearRegression-2024 LinearRegression-2024 Public

    Freelance project. Results were used by a Production Engineering PhD Candidate at UFES.

    Jupyter Notebook

  4. Projeto-Integrado Projeto-Integrado Public

    Eugor is a roguelike game developed for Windows 10/11 as part of the Integrated Project course taught by Dr. João Paulo Andrade Almeida. Inspired by the classic Rogue, it features permadeath, rando…

    Python

  5. nilogm/2024-2-gpt nilogm/2024-2-gpt Public

    Jupyter Notebook 2

  6. trabalho2tbo trabalho2tbo Public

    This repository contains the implementation of the second assignment for the Search and Sorting Techniques (Técnicas de Busca e Ordenação) course. The project focuses on the development and applica…

    C