feat: Reprovide Sweep #1082

Draft · wants to merge 126 commits into master

Conversation

guillaumemichel (Contributor) commented May 6, 2025

Note

This PR may be replaced by

Summary

Problem

Reproviding many keys to the DHT one by one is inefficient, because it requires a GetClosestPeers (or GCP) request for every key.

Current state

Currently, reprovides are managed in boxo/provider. Every ReprovideInterval (22h in the Amino DHT), all keys matching the reprovide strategy are reprovided at once. The process differs slightly depending on whether the accelerated DHT client is enabled.

Default DHT client

All the keys are reprovided sequentially, using the go-libp2p-kad-dht Provide() method. This operation consists of finding the k closest peers to the given key and then asking each of them to store the associated provider record.
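
For illustration, here is a minimal sketch (not the boxo/provider code) of what this sequential loop amounts to with the default client, assuming a *dht.IpfsDHT instance and simplified error handling:

```go
package sketch

import (
	"context"
	"log"

	"github.com/ipfs/go-cid"
	dht "github.com/libp2p/go-libp2p-kad-dht"
)

// reprovideAll reprovides keys one by one. Each Provide() call performs a
// full GetClosestPeers lookup (~20-30 connections) before sending the
// provider record to the k closest peers, which is what makes this slow.
func reprovideAll(ctx context.Context, d *dht.IpfsDHT, keys []cid.Cid) {
	for _, c := range keys {
		if err := d.Provide(ctx, c, true); err != nil {
			log.Printf("reprovide %s failed: %v", c, err)
		}
	}
}
```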

The process is expensive because it requires a GCP for each key (opening approx. 20-30 connections). Timeouts due to unreachable peers make the process very slow, resulting in a mean provide time of ~10s (source: probelab.io, 2025-06-13).

(figure: dht-publish-performance-overall — overall DHT publish performance, probelab.io)

At ~10 seconds per provide, a node using this process can reprovide fewer than 8,000 keys over the 22h reprovide interval using a single thread (22 h × 3,600 s/h ÷ 10 s ≈ 7,920 keys).

Accelerated DHT client (fullrt)

The accelerated DHT client periodically (every 1h) crawls the DHT swarm and caches the addresses of all discovered peers. This allows it to skip the GCP during the provide request, since it already knows the k closest peers and their associated multiaddrs.

Hence, the accelerated DHT client can provide many more keys during the reprovide interval than the default DHT client. However, crawling the DHT swarm is an expensive operation (networking, memory), and since all keys are reprovided at once, the node experiences a burst period until all keys have been reprovided.

Ideally, nodes wouldn't have to crawl the swarm to reprovide content, and the reprovide operation would be smoothed over time to avoid a burst during which the libp2p node is unable to perform other work.

Pooling Reprovides

If there are more keys to reprovide than the number of nodes in the DHT swarm divided by the replication factor (k), then by the pigeonhole principle at least two keys will be provided to the exact same set of peers. Grouping the keys allocated to the same peers means the number of GCPs needed is lower than the number of keys to reprovide.

For the Amino DHT, which contains ~10,000 DHT servers and has a replication factor of 20, pooling reprovides becomes efficient starting from about 500 keys (10,000 / 20).

Reprovide Sweep

The current process of reproviding all keys at once is problematic because it creates a burst. In order to smooth the reprovide process, we can sweep the keyspace from left to right, covering all peers over time. This consists of exploring keyspace regions, each corresponding to a set of peers that are close to each other under the Kademlia XOR distance metric.

⚠️ The Kademlia keyspace is NOT linear

A keyspace region is explored using a few GCP requests (typically 2-4) to discover all the peers it contains. A keyspace region is identified by a Kademlia identifier prefix: the Kademlia identifiers of all peers within the region start with the region's prefix.

Once a region is fully explored, all the keys matching the keyspace region's prefix can be allocated to this set of peers. No additional GCP is needed.
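
As an illustration, here is a minimal sketch (not this PR's code) of grouping keys by region prefix. It assumes the Kademlia identifier of a key is its SHA-256 hash, as in the IPFS DHT, and the 4-bit prefix length is an arbitrary example value:

```go
package main

import (
	"crypto/sha256"
	"fmt"
)

// regionPrefix returns the first prefixLen bits of the Kademlia identifier of
// key (its SHA-256 hash), encoded as a bit string such as "0101".
func regionPrefix(key []byte, prefixLen int) string {
	id := sha256.Sum256(key)
	prefix := make([]byte, 0, prefixLen)
	for i := 0; i < prefixLen; i++ {
		bit := (id[i/8] >> (7 - uint(i%8))) & 1
		prefix = append(prefix, '0'+bit)
	}
	return string(prefix)
}

func main() {
	// Keys sharing the same prefix fall in the same keyspace region and can
	// be allocated to the same set of peers after a single region exploration.
	for _, k := range []string{"key-a", "key-b", "key-c"} {
		fmt.Printf("%s -> region %s\n", k, regionPrefix([]byte(k), 4))
	}
}
```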

Implementation

This PR contains an implementation of the Reprovide Sweep strategy. The SweepingReprovider basically does the following:

  • Expose Provide() and ProvideMany() methods. All cids passed through these methods are provided to the DHT as expected.
  • All cids that are given through the above methods are stored in a trie. The reprovider implementation keeps track of all cids it is responsible for reproviding.
  • The reprovider schedules when reprovides should happen for each keyspace region (for which there is at least 1 cid). Region reprovides are spread evenly over the reprovide interval (see the sketch after this list).
  • Once the time to reprovide a region has come, the reprovider explores the region and allocates the provider records corresponding to the cids belonging to this region to the appropriate peers.
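
A minimal sketch of how region reprovides can be spread evenly, assuming a simple mapping rather than this PR's actual scheduler: each region's prefix, read as a binary fraction of the keyspace, maps to an offset within the reprovide interval, so sweeping the keyspace in order also sweeps through time.

```go
package main

import (
	"fmt"
	"time"
)

const reprovideInterval = 22 * time.Hour

// reprovideOffset maps a region prefix such as "0101" to the offset within
// the reprovide interval at which that region gets reprovided: the prefix is
// read as a binary fraction in [0, 1) and scaled to the interval.
func reprovideOffset(prefix string) time.Duration {
	frac := 0.0
	for i, b := range prefix {
		if b == '1' {
			frac += 1.0 / float64(int(1)<<uint(i+1))
		}
	}
	return time.Duration(frac * float64(reprovideInterval))
}

func main() {
	// Four regions of prefix length 2 end up evenly spaced over the 22h interval.
	for _, p := range []string{"00", "01", "10", "11"} {
		fmt.Printf("region %s -> t0 + %v\n", p, reprovideOffset(p))
	}
}
```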

Features

  • Concurrency limit
    • Ability to configure the number of workers, both for (i) the initial provide operation and (ii) regular reprovides
    • Limit the number of connections that a worker can open
  • Parallel reprovide
    • If a reprovide isn't complete when the next one is due, the next one can start right away, as long as there are available workers
  • Error handling
    • If a cid or a complete region couldn't be provided, the operation will be retried later until it succeeds
  • Connectivity checker
    • The reprovider will check connectivity on provide failure, and won't try to provide as long as the node is offline.
    • When the node comes back online, the activity resumes by (re)providing the cids/regions that should have been provided during the downtime.
  • Dynamic prefix length estimation
    • When starting up, the reprovider doesn't know how many peers a region contains. It hence makes a few GCP requests to estimate the initial prefix length for exploring regions (see the sketch after this list).
  • Reset reprovided cids
    • Offer a ResetReprovideSet method to replace the cids that must be reprovided.
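
A minimal sketch of the idea behind the initial prefix length estimate, not this PR's estimator: a region should hold roughly one replication set of k peers, so given a network size estimate N (itself obtained from a few GCP requests), the prefix length is about log2(N / k). The helper name and parameters below are hypothetical.

```go
package main

import (
	"fmt"
	"math"
)

// initialPrefixLen is a hypothetical helper: it returns the largest prefix
// length l such that a region of that prefix still holds at least k peers,
// i.e. networkSize / 2^l >= k. networkSize would itself be estimated from a
// few GetClosestPeers requests.
func initialPrefixLen(networkSize, k int) int {
	if networkSize <= k {
		return 0
	}
	return int(math.Floor(math.Log2(float64(networkSize) / float64(k))))
}

func main() {
	// ~10,000 Amino DHT servers with k=20 -> prefix length 8,
	// i.e. 256 regions of roughly 39 peers each.
	fmt.Println(initialPrefixLen(10000, 20))
}
```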

Missing features

  • Store keys to reprovide in Datastore instead of memory.
    • Currently a trie.Trie in memory containing all cids to be reprovided
    • Ideally move the trie to Datastore
    • Keys can be grouped by region/prefix if it helps
      • Anyway they will be loaded by region
      • Not sure if adding just 1 key to a group is easy
  • (optional) Persist when a region is reprovided to the datastore (region prefix, timestamp, e.g. prefix -> timestamp); see the sketch after this list.
    • Allows resuming reprovides after a crash/shutdown, starting by catching up on the regions that should have been provided during the downtime.
      • For this it may be useful to save the last reprovided region (e.g. lastProvided -> [prefix, timestamp])
    • Only store the last time a region was reprovided: every time the region is reprovided we can overwrite the older timestamp
    • Storing a timestamp for each individual provide would help kubo users know the last time a cid was provided (e.g. cid -> timestamp). These can expire or be garbage collected after reprovideInterval.
  • (optional) Persist provide and reprovide queues to datastore
    • Don't lose pending cids on restart
  • Refactor pending cids queue
    • Mix failed cids with cids that were just added using Provide()
    • Allows grouping close cids together to provide more efficiently
    • We may lose prioritization (e.g. calling Provide(cidA) before Provide(cidB) doesn't mean that cidA will be provided before cidB)
  • The Dual DHT (used by Kubo) currently has 1 SweepingReprovider for each DHT (LAN and WAN)
    • Allow the SweepingReprovider to (re)provide content to multiple DHT swarms with a single scheduler and cids store (trie)
    • It means that pending regions/cids have to be kept distinct for each swarm, since a provide could succeed in one swarm but fail in another
    • It will probably require multiple ConnectivityCheckers, one for each DHT swarm.
    • Sharing a single scheduler isn't useful since the schedule depends on the network size; hence each network should have its own schedule.
    • The only thing that can be shared between the 2 ReprovideSweepers is the set of cids that need to be reprovided (datastore).
  • If we decide to change the routing/provide interfaces in kubo, get rid of the boxo/provider.System implementation in go-libp2p-kad-dht/dual/reprovider.go
  • (optional) Provide status provider: ProvideStatus interface #1110
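
A minimal sketch, assuming the ipfs go-datastore interface, of the prefix -> timestamp persistence suggested above; the key layout and encoding are hypothetical:

```go
package sketch

import (
	"context"
	"encoding/binary"
	"time"

	ds "github.com/ipfs/go-datastore"
)

// recordRegionReprovide stores the last reprovide time of a region under a
// key derived from the region prefix, overwriting any previous timestamp.
// The "/reprovider/regions/" key layout is a hypothetical example.
func recordRegionReprovide(ctx context.Context, store ds.Datastore, prefix string, t time.Time) error {
	buf := make([]byte, 8)
	binary.BigEndian.PutUint64(buf, uint64(t.Unix()))
	return store.Put(ctx, ds.NewKey("/reprovider/regions/"+prefix), buf)
}
```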

TODO

  • Complete implementation with missing mandatory features
  • Implementation review: Reprovide Sweep review #1095
  • (optional) Increase unit & integration test coverage
  • (optional) Increase amino DHT test coverage
  • Benchmark performance vs default and accelerated DHT clients
  • (optional) High level documentation in go-libp2p-kad-dht/reprovider/README.md
  • Integration in kubo: feat: DHT Reprovide Sweep (ipfs/kubo#10834)
    • This one is going to be long and painful 😢

Admin

Depends on:

Need new release of:

Closes #824

Part of ipshipyard/roadmaps#6, ipshipyard/roadmaps#7, ipshipyard/roadmaps#8
