This is the command to rebuild the corpus from rebuild (avro) files. It might be trickier to port because it has more dependencies, iirc. Code is here: https://github.com/oscar-project/ungoliant/blob/v1.2.3/src/processing/rebuild.rs