Skip to content
View dimitris-markopoulos's full-sized avatar

Block or report dimitris-markopoulos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Profile views Python VSCode

Dimitris Markopoulos' Github

Machine Learning • Quantitative Finance

🎓 M.A. in Statistics at Columbia University (4.19/4.33)
🎓 B.S. in Applied Mathematics & Statistics from Stony Brook University (3.99/4.0)


Project Summary
trees-ensembles-neural-networks Decision trees, boosting, and neural networks on UCI income data with tuning, overfitting analysis, and feature interpretation.
algorithmic-trading Systematic trading pipelines using ML and time-series cross-validation
latent-semantic-clustering UMAP + EM-GMM clustering of book chapters via NLP frequency vectors
mnist-image-classification Comparing Lasso, Naive Bayes, Ridge, SVM, and Group Lasso
quantitative-finance BSM & Heston option pricing, Monte Carlo simulations, VaR, etc
crime-predictor-analysis Predicting crime using UCI community features; LASSO, Ridge, Elastic Net, kernel regression + manually implemented CV
sepsis-prediction Cleaned & merged using SQL, then applied ML pipeline to CUMC + NYP secure patient-level dataset; HIPAA-compliant experiments using Azure Secure Environment; certified.

Supervised Learning & Statistical Modeling: LASSO, Ridge, Elastic Net, Logistic Regression, LDA, ARIMA, Group Lasso, etc
Dimensionality Reduction & Feature Analysis: PCA, UMAP, t-SNE, Spectral Embedding, MDS, NMF, Kernel PCA
Unsupervised Learning & Clustering: KMeans++, Gaussian Mixture Models (GMM), Spectral Clustering, Hierarchical Clustering

Languages: Python (primary), SQL, R, MATLAB
Libraries: PyTorch, TensorFlow, scikit-learn, XGBoost, Numpy, Pandas, Statsmodels
Visualization: Matplotlib, Seaborn, Streamlit
Workflow: VSCode, Git/GitHub, Google Colab (for GPU compute), LaTeX
Infra: Azure, APIs, GitHub Actions

🌐 Connect With Me
LinkedIn
GitHub
dimitris.markopoulos@columbia.edu

GitHub followers GitHub Repo stars GitHub Repo forks


"Averaged over all possible data-generating distributions, every classification algorithm has the same error rate."
— David H. Wolpert, No Free Lunch Theorems for Optimization

Pinned Loading

  1. trees-ensembles-neural-networks trees-ensembles-neural-networks Public

    Machine learning models including decision trees, random forests, adaboost, gradient boosting, and neural networks applied to structured data for classification tasks.

    Jupyter Notebook

  2. latent-semantic-clustering latent-semantic-clustering Public

    Clustering book chapters with unsupervised ML—custom EM-GMM, sklearn baselines, and dimensionality reduction.

    Jupyter Notebook 1

  3. quantitative-finance quantitative-finance Public

    A collection of quantitative finance projects covering option pricing, risk analysis, volatility modeling, and investment strategies. Includes Monte Carlo simulations, Black-Scholes & Heston models…

    Jupyter Notebook 2

  4. algorithmic-trading algorithmic-trading Public

    Applying ML to finance - systematic trading.

    Jupyter Notebook

  5. crime-predictor-analysis crime-predictor-analysis Public

    Predicting violent crime rates using high-dimensional community data from the UCI dataset. Implements a structured machine learning pipeline with extensive preprocessing, multiple feature selection…

    Jupyter Notebook

  6. mnist-image-classification mnist-image-classification Public

    Classifying MNIST digits (3, 5, 8) using supervised learning methods including Logistic Regression, LDA, SVM, Naive Bayes, and Group LASSO. Includes model comparison, confusion matrix visualization…

    Jupyter Notebook