Skip to content
View SemyonSinchenko's full-sized avatar

Organizations

@apache

Block or report SemyonSinchenko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SemyonSinchenko/README.md

Sem Sinchenko

Data Egnineer, Open Source Software enthusiast, Apache Software Foundation committer.

I'm developing in Python, Scala/Java and some Rust. Mostly my activities are related to the Apache Spark / PySpark ecosystem and Data Engineering tools.

I'm a maintainer at the following projects:

  • GraphFrames -- scalabale graph algorithms on top of Apache Spark DataFrames.
  • Apache GraphAr (incubating) -- universal "open-table" format for storing Property Graphs.
  • spark-fast-tests -- Apache Spark testing helpers and assertions (Scala).
  • chispa -- Apache Spark testing helpers and assertions (Python).
  • falsa -- CLI tool for generating datasets of the H2O benchmark. Wriiten in Rust.

And other various projects.

Wakatime weekly stats:

Scala            10 hrs 42 mins  ███████████████░░░░░░░░░░   60.56 %
Python           5 hrs 13 mins   ███████▒░░░░░░░░░░░░░░░░░   29.49 %
sbt              1 hr 3 mins     █▒░░░░░░░░░░░░░░░░░░░░░░░   05.96 %
Markdown         30 mins         ▓░░░░░░░░░░░░░░░░░░░░░░░░   02.88 %
YAML             6 mins          ░░░░░░░░░░░░░░░░░░░░░░░░░   00.58 %

About any open source activities and / or collaborations you can reach me using ssinchenko@apache.org.

About any other activities and / or collaborations you can reach me using my private email ssinchenko@pm.me.

Pinned Loading

  1. apache/incubator-graphar apache/incubator-graphar Public

    An open source, standard data file format for graph data storage and retrieval.

    C++ 302 74

  2. graphframes/graphframes graphframes/graphframes Public

    GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

    Scala 1.1k 252

  3. graphframes-rs graphframes-rs Public

    GraphFrames but in DataFusion

    Rust 6 1

  4. flake8-pyspark-with-column flake8-pyspark-with-column Public

    A flake8 plugin that detects of usage withColumn in a loop or inside reduce

    Python 28 1

  5. mrpowers-io/falsa mrpowers-io/falsa Public

    Python 7 2

  6. MrPowers/chispa MrPowers/chispa Public

    PySpark test helper methods with beautiful error messages

    Python 714 74