Skip to content
View samiit's full-sized avatar
  • Berlin

Block or report samiit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
samiit/README.md

Sam Mathew

๐Ÿ‘‹ Hello! I'm Sam, a Full Stack Data Scientist with a background in Chemical Engineering, currently pursuing an M.Sc. in Polymer Science at Freie Universitรคt Berlin. I'm passionate about solving complex problems at the intersection of data science, chemical physics and engineering.

๐Ÿš€ About Me

  • ๐Ÿ”ฌ Full Stack Data Scientist with experience in NLP, medical entity extraction, and patient-study profile matching
  • ๐ŸŽ“ Currently pursuing M.Sc. in Polymer Science at FU Berlin
  • ๐Ÿ‘จโ€๐Ÿซ Regular corporate trainer in Generative AI, Causal Discovery and Inference, Linear Algebra, and Machine Learning
  • ๐Ÿ’ป Proficient in Python, with experience in Pandas, NumPy, Langchain, and FastAPI
  • ๐Ÿงฎ Strong background in mathematical modeling of physical systems and mathematical optimization
  • ๐Ÿค– Experience with Large Language Models (RAG and Agent)
  • ๐ŸŒ Multilingual: Fluent in English, Hindi, Malayalam; Proficient in German, Tamil, and Telugu; still dabbling with French and Spanish!

๐ŸŽฏ Current Focus

I'm currently working on my Master's thesis that combine my expertise in data science with my studies in Polymer Science:

  • ๐Ÿงช Molecular Dyanmics study of PFAS interaction with candidate PEI variants for water purification
  • ๐Ÿ” Searching dominant molecular interactions for PFAS removal
  • ๐Ÿ”— Integrating knowledge from computational chemistry & physics with data science to explore molecular interactions
  • ๐Ÿ“Š Utilizing data-driven approaches to accelerate materials discovery and optimization

๐Ÿ› ๏ธ Skills

  • Python (Pandas, NumPy, Langchain, FastAPI)
  • Mathematical modeling and optimization
  • Large Language Models (RAG and Agent)
  • Natural Language Processing
  • Azure DevOps and AWS
  • Data reconciliation and process optimization
  • Image processing and deep learning

๐Ÿ”— Projects

Here are some projects I've worked on earlier in the industry:

  1. Medical Entity Extraction

    • NLP project for extracting medical entities from clinical texts
  2. Patient-Study Profile Matching

    • AI-powered system to match patient profiles with suitable clinical studies
  3. Water Network Management Optimization

    • Large-scale integer optimization for efficient water network scheduling
  4. Blast Furnace Data Reconciliation

    • Data reconciliation project for improving blast furnace efficiency

๐Ÿ“ซ How to reach me

๐ŸŒŸ Interests and Fun Facts

  • ๐Ÿ“š I'm deeply interested in causal inference and its applications in data science. Judea Pearl's "The Book of Why" has been a significant influence on my thinking in this area.
  • ๐Ÿง  I love exploring the intersection of machine learning, causal inference, and materials science.
  • ๐Ÿ“– My reading interests span history, philosophy, technology, and scientific advancements.
  • ๐Ÿง— In my free time, you can find me hiking or cycling.
  • ๐ŸŒ I've lived and studied in India, Germany, and the Netherlands.
  • ๐Ÿงฌ I'm fascinated by the potential of combining materials science with machine learning and causal inference to solve real-world problems.

Feel free to explore my repositories and don't hesitate to reach out if you'd like to collaborate on a project, discuss the exciting world of polymer science and machine learning, or explore the depths of causal inference!

๐Ÿ“š Selected Publications and Recognition of Contributions
  1. Sujan Hazra, Prakash Abhale, Sam Mathew and Shankar Narasimhan, "Application of data reconciliation and gross error detection techniques to enhance reliability and consistency of the blast furnace process data", Asia-Pacific Journal of Chemical Engineering, 2021

  2. Pallab Sinha Mahapatra and Sam Mathew, "Activity-induced mixing and phase transitions of self-propelled swimmers", Phys. Rev. E, 2019, Vol. 99, 012609

  3. Pallab Sinha Mahapatra, Ajinkya Kulkarni, Sam Mathew, Mahesh V. Panchagnula and Srikanth Vedantam, "Transitions between multiple dynamical states in a confined dense active-particle system", Phys. Rev. E, 2017, Vol. 95, 062610

  4. Pallab Sinha Mahapatra, Sam Mathew, Mahesh V. Panchagnula, Srikanth Vedantam, "Effect of size distribution on mixing of a polydisperse wet granular material in a belt-driven enclosure", Granular Matter, 2016, Vol. 18, 30

  5. Pramode K Das, Sam Mathew, A J Shaiju and B S V Patnaik, "Energetically efficient proportional-integral-differential (PID) control of wake vortices behind a circular cylinder", Fluid Dynamics Research, 2015, Vol. 48, 015510

  6. Sam Mathew, B S V Patnaik and T John Tharakan, "Numerical study of air-core vortex dynamics during liquid draining from cylindrical tanks", Fluid Dynamics Research, 2014, Vol. 46, 025505

  7. Sam Mathew, Ganesh Visavale and Vijay Mali, "CFD Analysis of a Heat Collector Element in a Solar Parabolic Trough Collector", International Conference on Applications of Renewable and Sustainable Energy for Industry and Society, Hyderabad (REIS-2010), 2010

  8. Sam Mathew, Ganesh Visavale and Vijay Mali, "Making order in the cabinet : Integrating CFD in the green energy design process for food industry helps identify and fix causes for uneven drying in a Solar Cabinet Dryer", Ansys Users Conference, Bangalore, 2010

  9. Raja Gopal Rayavarapu, Wilma Petersen, Constantin Ungureanu, Janine N. Post, Ton G. van Leeuwen, and Srirang Manohar, "Synthesis and Bioconjugation of Gold Nanoparticles as Potential Molecular Probes for Light-Based Imaging Techniques", Int. J. of Biomedical Imaging, 2007, 2007:29817

Popular repositories Loading

  1. helmet-detection helmet-detection Public

    Detect helmets and person at construction sites

    Jupyter Notebook 1 1

  2. test-repo test-repo Public

  3. datasciencecoursera datasciencecoursera Public

    The Data Scientistโ€™s Toolbox account

  4. datasharing datasharing Public

    Forked from jtleek/datasharing

    The Leek group guide to data sharing

  5. thrust thrust Public

    Forked from NVIDIA/thrust

    Thrust is a parallel algorithms library which resembles the C++ Standard Template Library (STL).

    C++

  6. Wet_Granular_SPP Wet_Granular_SPP Public

    Combined code for wet granular and self-propelled particles

    C++