"Turning coffee into neural networks since 2025" ☕➡️🤖
Welcome to my AI project portfolio! This repository contains implementations and analyses of fundamental NLP techniques and modern transformer architectures. Below you'll find Batman-style tech briefings for each project!
## CBOW vs Skip-Gram vs GloVe
*NLP Fundamentals with Reuters Financial News*
- Implemented 3 classic embedding models from scratch
- Developed a multi-metric evaluation framework:
  - KNN Semantic Clustering 👯
  - SimLex-999 Benchmark 📈
  - Vector Arithmetic for Analogies ➕➖
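The analogy test above boils down to nearest-neighbor search on `b - a + c`. Here's a minimal NumPy sketch — the toy 2-d vectors are purely illustrative (the trained models used 64 dims), and the function name is hypothetical:

```python
import numpy as np

def cosine(a, b):
    # Cosine similarity between two vectors.
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def solve_analogy(emb, a, b, c):
    """Return the word d maximizing cos(d, b - a + c), excluding the inputs.

    The classic "a is to b as c is to d" vector-arithmetic test.
    """
    target = emb[b] - emb[a] + emb[c]
    candidates = {w: v for w, v in emb.items() if w not in (a, b, c)}
    return max(candidates, key=lambda w: cosine(candidates[w], target))

# Toy 2-d embeddings (hypothetical values, not the trained Reuters vectors):
emb = {
    "stock":  np.array([1.0, 0.1]),
    "stocks": np.array([1.0, 0.9]),
    "bond":   np.array([0.2, 0.1]),
    "bonds":  np.array([0.2, 0.9]),
}
print(solve_analogy(emb, "stock", "stocks", "bond"))  # bonds
```

With real embeddings the candidate pool is the whole vocabulary, which is exactly why low-quality vectors score 0% — the true answer rarely wins the argmax.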
Model | KNN Clustering | SimLex-999 ρ | Analogy Accuracy |
---|---|---|---|
CBOW | 🌕🌕🌗🌑🌑 | 0.0954 | 0% |
SkipGram | 🌕🌕🌑🌑🌑 | 0.0504 | 0% |
GloVe | 🌕🌑🌑🌑🌑 | 0.0659 | 0% |
💡 Epiphany Moment: Even financial jargon needs bigger embeddings! (64-dim wasn't cutting it)
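For reference, the SimLex-999 ρ column is Spearman rank correlation between model cosine similarities and human ratings. A self-contained sketch (rank-then-Pearson, no tie handling; the five pairs below are made-up, not actual SimLex data):

```python
import numpy as np

def spearman_rho(x, y):
    # Spearman's rho = Pearson correlation of the ranks (ties not handled).
    rx = np.argsort(np.argsort(x)).astype(float)
    ry = np.argsort(np.argsort(y)).astype(float)
    rx -= rx.mean()
    ry -= ry.mean()
    return float((rx @ ry) / (np.linalg.norm(rx) * np.linalg.norm(ry)))

# Hypothetical model similarities vs. human ratings for five word pairs:
model_sims   = [0.81, 0.12, 0.55, 0.40, 0.95]
human_scores = [7.5,  1.2,  3.1,  6.0,  9.0]
print(round(spearman_rho(model_sims, human_scores), 4))  # 0.9
```

Scores like 0.05–0.10 on the real benchmark mean the model ranking is barely better than random — consistent with the 64-dim bottleneck noted above.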
## Chinese News Classification
*Battling 10 News Categories*
- Scaled encoder layers: 2 → 8 🏗️
- Enhanced classification head with GAP 🎯
- Cosine decay + warmup scheduling 🔥
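The warmup-plus-cosine-decay schedule from the last bullet can be sketched as a single function — parameter names here are hypothetical, and the real training loop presumably wired this into the optimizer via a framework scheduler:

```python
import math

def lr_at_step(step, total_steps, warmup_steps, peak_lr, min_lr=0.0):
    """Linear warmup to peak_lr, then cosine decay down to min_lr."""
    if step < warmup_steps:
        # Warmup: scale linearly from ~0 up to peak_lr.
        return peak_lr * (step + 1) / warmup_steps
    # Decay: cosine curve from peak_lr at warmup end to min_lr at the last step.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (peak_lr - min_lr) * (1 + math.cos(math.pi * progress))

print(lr_at_step(0, 1000, 100, 1e-3))     # tiny, start of warmup
print(lr_at_step(99, 1000, 100, 1e-3))    # peak (end of warmup)
print(lr_at_step(1000, 1000, 100, 1e-3))  # fully decayed
```

Warmup stabilizes the early, high-variance gradient steps; the cosine tail lets the model settle into a minimum instead of bouncing around it.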
Version | Accuracy | F1-Score | Key Improvement |
---|---|---|---|
Baseline | 81.19% | 82.55% | Initial Transformer |
+Preprocess | 83.53% | 83.34% | Punctuation Ninjutsu ✂️ |
Final Model | 84.07% | 84.19% | Deep Encoder Magic 🧙 |
Hot Take 🔥: Commas matter! But sometimes they don't... 🤷
## Dual-Task Dominance
*Sentiment Analysis + Paraphrase Detection*
```python
{'tasks': ['SST2', 'MRPC'],
 'model': 'bert-mini',
 'secret_sauce': 'Custom LossCallback() 🕵️',
 'hardware': 'Enough CUDA cores to fry an egg 🍳'}
```
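The actual `LossCallback` hooks into the training framework; as a framework-agnostic sketch of the idea (hypothetical method names, toy loss values), it just records per-task losses at each step so the two tasks can be monitored separately:

```python
class LossCallback:
    """Record per-step training losses, keyed by task name.

    A minimal sketch of the custom-callback idea; the real version would
    plug into the trainer's hook API rather than be called by hand.
    """

    def __init__(self):
        self.history = {}

    def on_step_end(self, task, loss):
        # Append this step's loss under its task name.
        self.history.setdefault(task, []).append(loss)

    def best(self, task):
        # Lowest loss seen so far for the given task.
        return min(self.history[task])

cb = LossCallback()
for task, loss in [("SST2", 0.71), ("MRPC", 0.66), ("SST2", 0.43)]:
    cb.on_step_end(task, loss)
print(cb.best("SST2"))  # 0.43
```

Keeping separate loss curves matters in multi-task fine-tuning: a single blended loss can hide one task regressing while the other improves.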
Task | Accuracy | F1-Score | Prediction Prowess |
---|---|---|---|
SST2 | 82.80% | 83.11% | 4/5 Test Samples Correct 🎬 |
MRPC | 75.25% | 82.43% | 5/5 Real-world Correct 🌍 |
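MRPC is class-imbalanced, which is why its F1 (82.43%) sits well above its accuracy (75.25%). For reference, binary F1 is the harmonic mean of precision and recall — here computed from scratch on made-up labels:

```python
def f1_score(y_true, y_pred, positive=1):
    """Binary F1 = harmonic mean of precision and recall."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical MRPC-style labels (1 = paraphrase, 0 = not):
y_true = [1, 1, 0, 1, 0]
y_pred = [1, 0, 1, 1, 0]
print(round(f1_score(y_true, y_pred), 4))  # 0.6667
```

Because F1 ignores true negatives, a model that leans toward the majority "paraphrase" class can post a strong F1 even when plain accuracy is modest.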
Golden Insight 💡: Small BERTs can play big! (But they still hate irony)
## 📜 Project Reports
Please refer to the corresponding report for each project:
Project | Report Link |
---|---|
Word Embeddings Analysis | project1-2_report.pdf |
News Classification | project3_report.pdf |
BERT Classification | project4_report.pdf |
## 🔭 Future Work
- Subword embeddings for rare financial terms 💼
- Hybrid positional encoding strategies 🧬
- Attention visualization toolkit 👀
- Domain-adaptive pretraining 🌐
Made with ❤️ (and probably too much caffeine) by Zijin Cai
"If debugging is removing bugs, then programming must be putting them in." - Edsger Dijkstra