Skip to content
View ab3llini's full-sized avatar
🌎
🌎
  • DocuSign
  • Dublin
  • 18:13 (UTC +01:00)
  • X @ab3llini

Block or report ab3llini

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ab3llini/README.md

Alberto Bellini

Machine Learning Engineer @ DocuSign Applied Science

Currently wrangling terabytes of documents and building NLP systems that actually work in production. Previously made real estate pricing models less wrong and co-founded an AI startup.

What I'm Into

Obsessed with large-scale NLP and the beautiful chaos of distributed systems. Love the challenge of taking transformer models from "works on my laptop" to "processes millions of documents without falling over."

Deep in the Spark + Databricks ecosystem, building ETL pipelines that don't make you want to throw your laptop out the window. Comfortable juggling PyTorch transformers, distributed training, and the occasional Rust side project when Python feels too slow.

Have a soft spot for information extraction, document understanding, and making LLMs do useful things beyond writing poetry. Also enjoy the dark arts of MLOps - because someone has to make sure your beautiful model actually runs in production.

Recent Experiments

  • NLP at Scale: Transformer pipelines for document processing, because regex isn't always the answer
  • Research: Combined GPT-2 with vision encoders for visual question answering back when that was still novel (code)
  • Side Quest: Built a differentiable tensor engine in Rust because why not (carbon)

Stack: Python • PyTorch • Transformers • Spark • Databricks • Kubernetes • Rust • SQL (yes, it counts)

Currently exploring how far we can push transformer architectures before they become sentient 🤖

📧 Always down to chat about NLP, distributed systems, or why your model isn't converging

Pinned Loading

  1. carbon carbon Public

    A fully functional differentiation engine for 2D tensors and scalar operations in pure Rust

    Rust

  2. Transformer-VQA Transformer-VQA Public

    Transformer-based VQA system capable of generating unconstrained, open-ended answers based on OpenAI's GPT-2 117M

    Python 1

  3. News2Headline News2Headline Public

    Recurrent neural networks for headline generation.

    Python 1

  4. DataMining DataMining Public

    Two-stage bagging of decision regression trees to predict customers and sales for more than 750 stores across Europe.

    Jupyter Notebook 2

  5. Lorenzo-il-Magnifico Lorenzo-il-Magnifico Public

    Bachelor thesis project consisting of a full multiplayer (5+ players) implementation of Cranio Creation's board game "Lorenzo il Magnifico".

    Java

  6. BoBooky BoBooky Public

    BoBooky, the best way to shop books online!

    HTML 1