Skip to content
View Samox1's full-sized avatar

Block or report Samox1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Samox1/README.md

Hi πŸ‘‹ I'm Szymon BaczyΕ„ski γ€” SamoX 〕

Passionate about data and extracting information (also photonics engineer)

I'm a physicist by training, with a deep curiosity for data-driven insights. After over a decade immersed in photonics and optofluidics at the Warsaw University of Technology - where I graduated with an engineering degree, a master's degree and I'm really close to finishing my PhD - I've transitioned from academia into the dynamic world of data science and ML/AI.

Since mid-2023, I've been actively contributing to the open-science project SpeakLeash, assisting in data curation and model development for Bielikβ€”Poland’s open-source large language model family. This experience has provided me with hands-on exposure to the entire LLM pipeline, from dataset preparation to deploying instruction-tuned models like Bielik-11B-v2, optimized for Polish NLP tasks. Beyond research, I'm passionate about applying GenAI tools in both commercial and personal projects, exploring their capabilities and pushing their boundaries. I approach challenges systematically, always considering both the needs and limitations of a project.

I'm eager to grow in areas such as Business Intelligence, Data Science, Machine Learning, MLOps, and Big Data. Let's connect and explore the possibilities together.

πŸŽ“ Articles I co-authored during my PhD research and other info
  1. "Study of PDMS Microchannels for Liquid Crystalline Optofluidic Devices in Waveguiding Photonic Systems",
    Crystals. 2022; 12 (5):729. https://doi.org/10.3390/cryst12050729
  2. "A Novel Approach for the Creation of Electrically Controlled LC:PDMS Microstructures",
    Sensors. 2022; 22 (11):4037. https://doi.org/10.3390/s22114037
  3. "Orientation of Liquid Crystalline Molecules on PDMS Surfaces and within PDMS Microfluidic Systems",
    Applied Sciences. 2021; 11 (24):11593. https://doi.org/10.3390/app112411593
  4. "Low-cost, widespread and reproducible mold fabrication technique for PDMS-based microfluidic photonic systems",
    Photonics Letters of Poland. 2020; 12 (1):22-24. https://doi.org/10.4302/plp.v12i1.981

WUT Link ← Together with a team of Ph.D. students, we won the MedTech-Athon Warsaw University of Technology, receiving funding to develop a mobile air pollution monitoring device (May 2022). We built a prototype and showcased it a year later at DemoDay, collaborating along the way with other researchers, a pulmonology doctor and organizations like the Chief Environmental Protection Inspector in Warsaw.


LinkedIn DataCamp Google Cloud Skills Boost

ResearchGate Kaggle LeetCode HackerRank Instagram


⚑ Technologies

Use on a daily basis :

(or wrote some code)

Python & libraries

Pandas Numpy Scikit-Learn Scipy Plotly Seaborn Streamlit Jupyter

R C C++ LabView

Tools :

PowerBI ApacheSuperset Postman Excel

VSCode Jupyter GoogleColab RStudio

Git GitHub Docker Portainer

Databases :

PostgreSQL MySQL

Operating systems :

Linux Windows

Other tools I use :

Figma Zotero

ESPMicro STMMicro MQTT Arduino

LrC Premiere AE

* I'd like to learn
(click here)


Spacy Pytorch Tensorflow PyTest FastAPI

MLFlow WeightsAndBiases Apache Spark Apache Hadoop

Firebase Cockroach MongoDB GraphQL NGINX Swagger

Tableau Flutter TravisCI Blender UnrealEngine


πŸ’» Coding learning Projects

πŸ™‹β€β™‚οΈ Helping with :

  • SpeakLeash - Language Processing Granary [pl. Spichlerz] - datasets for PolishGitHub - I'm helping as much as I can, writing code in Python to scrape forums and articlesGitHub, creating new datasets dashboardGitHub, making examplary Python scripts using the Speakleash packageGitHub and scraping the internet, adding another GBs of data to the project.

πŸ‘· Projects in development :

  • When I'm Gone (KiedyOdjade) - mini-project inspired by jakdojade.pl ~ using the API with online location of Warsaw public transport vehicles, I'm working on a database where certain values would be calculated (with ML prediction) and provide an API for other developers and hopefully a future mobile application written in Flutter.

  • Learning more - Python / Data Science / ML / MLOps / deep learning libraries

  • ESP32 Thermal Camera WebServer - hobby project (which needs to be improved) ~ ESP32 microcontroller + thermal imaging array sensor create a neat little thermal camera which has its own WiFi network, allowing you to watch the image live on your phone.


πŸ“Š GitHub Profile Stats


Β 


πŸ•‘ Recorded coding since November 2022 (only in VS Code):

(not fully recorded - no private repositories or commercial code)

Wakatime

Wakatime Full Stats

SamoX's Wakatime Stats



Repositories and Tools used to make this profile readme.md

Pinned Loading

  1. WhenImGone-KiedyOdjade WhenImGone-KiedyOdjade Public

    Python

  2. ESP_Thermal_Camera_WebServer ESP_Thermal_Camera_WebServer Public

    ESP32 with MLX90640 (Thermal Camera 32x24px). Sending images to WebServer on ESP. Working (yey!) perfect on ESP32.

    C++ 43 12

  3. Python_Micro_Codes Python_Micro_Codes Public

    Python Micro Codes for every day use - sometimes learning & sometimes for fun

    HTML

  4. Light-Propagation-Cpp-Cuda-OpenMP Light-Propagation-Cpp-Cuda-OpenMP Public

    A mini project that includes calculations related to light propagation. The following technologies are used: C ++ & CUDA & OpenMP.

    Cuda

  5. R_LAB_LSED_MSR_TEXT R_LAB_LSED_MSR_TEXT Public

    Laboratoria z LSED i MSR (LSED - Laboratorium Statystycznej Eksploracji Danych; MSR - Zastosowanie pakietu R w statystyce medycznej)

    R

  6. Data-Mining-EiTI---DBSCAN-VP-TREE-C- Data-Mining-EiTI---DBSCAN-VP-TREE-C- Public

    DBSCAN (and DBSCAN with VP-TREE) implementation in C++

    C++ 2