Passionate about data and extracting information (also photonics engineer)
I'm a physicist by training, with a deep curiosity for data-driven insights. After over a decade immersed in photonics and optofluidics at the Warsaw University of Technology, where I earned an engineering degree and a master's degree and am close to finishing my PhD, I've transitioned from academia into the dynamic world of data science and ML/AI.
Since mid-2023, I've been actively contributing to the open-science project SpeakLeash, assisting in data curation and model development for Bielik, Poland's open-source large language model family. This experience has provided me with hands-on exposure to the entire LLM pipeline, from dataset preparation to deploying instruction-tuned models like Bielik-11B-v2, optimized for Polish NLP tasks. Beyond research, I'm passionate about applying GenAI tools in both commercial and personal projects, exploring their capabilities and pushing their boundaries. I approach challenges systematically, always considering both the needs and limitations of a project.
I'm eager to grow in areas such as Business Intelligence, Data Science, Machine Learning, MLOps, and Big Data. Let's connect and explore the possibilities together.
Articles I co-authored during my PhD research and other info
- "Study of PDMS Microchannels for Liquid Crystalline Optofluidic Devices in Waveguiding Photonic Systems", Crystals. 2022; 12 (5):729. https://doi.org/10.3390/cryst12050729
- "A Novel Approach for the Creation of Electrically Controlled LC:PDMS Microstructures", Sensors. 2022; 22 (11):4037. https://doi.org/10.3390/s22114037
- "Orientation of Liquid Crystalline Molecules on PDMS Surfaces and within PDMS Microfluidic Systems", Applied Sciences. 2021; 11 (24):11593. https://doi.org/10.3390/app112411593
- "Low-cost, widespread and reproducible mold fabrication technique for PDMS-based microfluidic photonic systems", Photonics Letters of Poland. 2020; 12 (1):22-24. https://doi.org/10.4302/plp.v12i1.981
Together with a team of PhD students, I won the MedTech-Athon at the Warsaw University of Technology, receiving funding to develop a mobile air pollution monitoring device (May 2022). We built a prototype and showcased it a year later at DemoDay, collaborating along the way with other researchers, a pulmonology doctor and organizations like the Chief Environmental Protection Inspector in Warsaw.
- SpeakLeash - Language Processing Granary [pl. Spichlerz] - datasets for Polish. I'm helping as much as I can: writing Python code to scrape forums and articles, creating a new datasets dashboard, writing example Python scripts that use the Speakleash package, and scraping the internet to add more gigabytes of data to the project.
- When I'm Gone (KiedyOdjade) - mini-project inspired by jakdojade.pl ~ using the API that provides live locations of Warsaw public transport vehicles, I'm building a database where certain values are calculated (with ML prediction) and exposed through an API for other developers and, hopefully, a future mobile application written in Flutter.
- Learning more - Python / Data Science / ML / MLOps / deep learning libraries
- ESP32 Thermal Camera WebServer - hobby project (which needs to be improved) ~ an ESP32 microcontroller and a thermal imaging array sensor make a neat little thermal camera that hosts its own WiFi network, allowing you to watch the image live on your phone.