Skip to content
View PavelGrigoryevDS's full-sized avatar
πŸ’­
Python, SQL, Pandas, Plotly, Dash, Tableau, etc
πŸ’­
Python, SQL, Pandas, Plotly, Dash, Tableau, etc

Block or report PavelGrigoryevDS

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
PavelGrigoryevDS/README.md

πŸ‘‹ Welcome! I'm Pavel

πŸ§‘β€πŸ’» About me

I hold a higher technical education.
I specialize in data analysis and visualization, with a focus on empowering informed decision-making.
By extracting insights from complex data sets, I help organizations make data-driven decisions that drive business growth and improvement.


πŸ› οΈ Languages and Tools

  • Programming languages: Python, SQL (PostgreSQL, MySQL), NoSQL (MongoDB).
  • Data analysis and visualization:
    • Libraries: Pandas, NumPy, SciPy, Statsmodels, Plotly, Dash, Matplotlib, Seaborn.
    • Tools: Power BI, Tableau, Redash, DataLens, Superset.
  • Machine learning and AI: Scikit-learn, Pyspark.
  • Natural Language Processing: NLTK.
  • Web scraping: BeautifulSoup, Selenium, Scrapy.
  • DevOps: Linux, Git, Docker, Apache Airflow.
  • IDEs: VS Code, Google Colab, Jupyter Notebook, Zeppelin, PyCharm.
PythonΒ  PandasΒ  NumPyΒ  PlotlyΒ  PostgreSQLΒ  MySQLΒ  MongoDBΒ  TableauΒ  Power BIΒ  RedashΒ  SklearnΒ  VS CodeΒ  JupyterΒ  LinuxΒ  GitΒ  DockerΒ  AirflowΒ 

🎯 Skills

  • Deep data analysis: preprocessing, cleaning, and identifying patterns using visualization to support decision-making.
  • Writing complex SQL queries: working with nested queries, window functions, CASE and WITH statements for data extraction and analysis.
  • Understanding product strategy: knowledge of product development and improvement principles, including analyzing user needs and formulating recommendations for its growth.
  • Product metrics analysis: LTV, RR, CR, ARPU, ARPPU, MAU, DAU, and other key performance indicators.
  • Conducting A/B testing: analyzing results using statistical methods to evaluate the effectiveness of changes.
  • Cohort analysis and RFM segmentation: identifying user behavior patterns to optimize marketing strategies.
  • Data visualization and dashboard development: creating interactive reports in Tableau, Redash, Power BI, and other tools for presenting analytics.
  • Web scraping: experience in extracting data from websites using tools and libraries such as BeautifulSoup, Scrapy, and Selenium for information gathering and data analysis.
  • Working with big data: experience with tools and technologies for processing large volumes of data (e.g., Hadoop, Spark).
  • Machine Learning Applications: Capable of building and applying simple machine learning models for data analysis tasks, including forecasting, classification, and clustering, to uncover deeper insights and enhance decision-making processes.
  • Working with APIs: integrating and extracting data from various sources via APIs.

🌟 Featured Projects

Stack: Python | Pandas | NumPy | SciPy | Plotly | Statsmodels | Scikit-learn | Pingouin | TextBlob | Sphinx

Key Methods: Data Exploration | Statistical Testing | Cohort Analysis | Automated Visualization | Feature Analysis | Machine Learning

Powerful pandas extension that enhances DataFrames with production-ready analytics while maintaining native functionality.

  • Seamlessly integrates exploratory analysis, statistical testing and visualization into pandas workflows
  • Provides instant insights through automated data profiling and quality checks
  • Enables cohort analysis with flexible periodization and metric customization
  • Offers built-in statistical methods (bootstrap, effect sizes, group comparisons)
  • Generates interactive visualizations with single-command access
  • Supports both DataFrame-level and column-specific analysis
  • Modular architecture allows extending with domain-specific methods
  • Preserves all native pandas functionality for backward compatibility

Stack: Python | Pandas | Plotly | StatsModels | SciPy | NLTK | TextBlob | Sklearn | Pingouin

Key Methods: Time-Series | Anomaly Detection | Custom Metrics | RFM/Cohorts | NLP | Clustering

Comprehensive analysis of Brazilian e-commerce data, uncovering key insights and actionable business recommendations.

  • Time-series analysis of sales dynamics, seasonality, and trend decomposition
  • Anomaly detection in orders, payments, and delivery times
  • Customer profiling (RFM segmentation, clustering, geo-analysis)
  • Cohort analysis to track retention and lifetime value (LTV)
  • NLP processing of customer reviews (sentiment analysis)
  • Hypothesis validation involved conducting tests to verify data-driven assumptions.
  • Delivered strategic, data-backed recommendations to optimize logistics, enhance customer retention strategies, and drive sales growth.

Pinned Loading

  1. frameon frameon Public

    🐼✨ Frameon - enhances pandas DataFrames with analysis methods while preserving all native functionality

    Python 2 1

  2. awesome-data-analysis-resources awesome-data-analysis-resources Public

    πŸš€πŸ“Š 400+ curated resources for data analysis and data science: Python, SQL, ML, Visualization, Dashboards, Cheatsheets, Roadmaps, and Interview Prep. Perfect for beginners and pros!

    3

  3. olist-deep-dive olist-deep-dive Public

    🌊 Deep Sales Analysis of Olist E-Commerce: EDA | Viz | RFM | NLP | Segmentation & Actionable Business Recommendations

    Jupyter Notebook 2