Skip to content
Change the repository type filter

All

    Repositories list

    • 0000Updated Apr 28, 2020Apr 28, 2020
    • Chinese

      Public
      Tools and resources for Chinese texts preprocessing. Validated in two papers, one CCF C, EI indexing and one CCF B, SCI indexing.
      Python
      575000Updated May 20, 2018May 20, 2018
    • Java
      1000Updated Jan 25, 2018Jan 25, 2018
    • Shared software among connectors that target distributed filesystems and cloud storage.
      Java
      155000Updated Dec 25, 2017Dec 25, 2017
    • confluent

      Public
      0000Updated Dec 22, 2017Dec 22, 2017
    • newly plugin for flume2kafka support offset control
      Java
      1000Updated Jul 19, 2017Jul 19, 2017
    • flumeng-kafka-plugin
      Java
      96000Updated Apr 12, 2017Apr 12, 2017
    • sparklint

      Public
      A tool for monitoring and tuning Spark jobs for efficiency.
      Scala
      92000Updated Nov 4, 2016Nov 4, 2016
    • AirDataX

      Public
      HTML
      3102Updated Oct 26, 2016Oct 26, 2016
    • Enabling Spark Optimization through Cross-stack Monitoring and Visualization
      Scala
      11000Updated Oct 24, 2016Oct 24, 2016
    • Spark

      Public
      Spark Club
      2300Updated Sep 27, 2016Sep 27, 2016
    • DataX

      Public
      DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
      Java
      1.3k000Updated Sep 7, 2016Sep 7, 2016
    • Scala
      87000Updated Jun 14, 2015Jun 14, 2015