Skip to content

criteo-forks/kafka-connect-hdfs

 
 

Repository files navigation

Kafka Connect HDFS Connector

kafka-connect-hdfs is a Kafka Connector for copying data between Kafka and Hadoop HDFS.

Documentation for this connector can be found here.

Criteo fork changes

  • Disable all hive related test raising a NoClassDefFound Could not initialize class org.apache.hadoop.hive.ql.exec.Utilities. Related issue (#1). To be fixed if we plan to use hive module (not the case currently).
  • Apply unmerged PR confluentinc#684 to solve the rotate Interval that doesn't work for low volume or irregular traffic
  • Force jackson-mapper-asl transitive dependency to 1.9.13

Development

To build a development version you'll need a recent version of Kafka as well as a set of upstream Confluent projects, which you'll have to build from their appropriate snapshot branch. See the FAQ for guidance on this process.

You can build kafka-connect-hdfs with Maven using the standard lifecycle phases.

FAQ

Refer frequently asked questions on Kafka Connect HDFS here - https://github.com/confluentinc/kafka-connect-hdfs/wiki/FAQ

Contribute

License

This project is licensed under the Confluent Community License.

About

Kafka Connect HDFS connector

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 100.0%