feat(cache): a variant of sieve, with lazy op #13904

PsiACE · 2023-12-02T19:50:14Z

I hereby agree to the terms of the CLA available at: https://databend.rs/dev/policies/cla/

Summary

This optimization is inspired by the sieve, which can reduce unnecessary element movements and has a potential filtering effect.

Simply put, we maintain the visited status of each key and try to evict those elements that were inserted a long time ago but have not been accessed yet.

Closes #issue

This change is

PsiACE · 2023-12-02T19:59:09Z

It is no longer Lru and theoretically has better properties. For the sake of evaluation, I did not change the name. You can refer to the unit tests for a quick understanding.

The impact on Databend performance and cache hit rate needs further evaluation.

PsiACE · 2023-12-03T04:30:57Z

This algorithm is storagebackend-friendly. In fact, we only need to maintain a "visited" linkedhashmap as the state. Since the state and storage are decoupled, we can consider refactoring our disk cache and implementing S3 cache in the future.

github-actions · 2023-12-04T02:14:07Z

Docker Image for PR

tag: pr-13904-a2c9e86

note: this image tag is only available for internal use,
please check the internal doc for more details.

JackTan25 · 2023-12-04T04:09:15Z

well, this is the core idea of sieve cache. And the initial state of 'hand' p is null, it will be initialized by the oldest object which is the tail of the queue. And this cache algorithm is good to extended to the existed cache evict algorithms,

github-actions · 2023-12-04T04:19:31Z

Docker Image for PR

tag: pr-13904-c5909e1

note: this image tag is only available for internal use,
please check the internal doc for more details.

JackTan25 · 2023-12-04T04:30:35Z

the core idea of the sieve is to reduce the duplicated obj insert

github-actions · 2023-12-04T04:51:02Z

ClickBench Report

PsiACE · 2023-12-04T08:17:32Z

This version lacks a hand pointer, so there is no protection for the frequently accessed parts. However, personally, I think it is acceptable to evict and reload them. One possible optimization is to change "visited" into a counter with an upper bound, which would provide some protection mechanism. This approach seems somewhat like a version between s3-FIFO and sieve, but further evaluation is still needed. The specific impact needs to be discussed based on the workload.

dantengsky · 2023-12-04T09:27:07Z

👍

Besides the missing of hand pointer, are there any other tweaks or improvements worth mentioning?

And about the 'SIEVE is not scan-resistant' thing mentioned in the Sieve paper - any idea how that may affect us?

Also, does find_evict_candidate need to iterate through all elements in visited every time?

https://github.com/datafuselabs/databend/blob/c8b06a5c86cdba178bbc23ac589db2cb5e32e42f/src/common/cache/src/cache/lru.rs#L163-L174

PsiACE · 2023-12-04T10:22:45Z

Besides the missing of hand pointer, are there any other tweaks or improvements worth mentioning?

No. But hand does have a significant meaning, so we will try to compare only this PR and Lru.

And about the 'SIEVE is not scan-resistant' thing mentioned in the Sieve paper - any idea how that may affect us?

If we frequently encounter large scans, then scan-resistant will be a very important feature. This means that the elements we insert will soon no longer be accessed. However, the Lru we previously used also does not have scan resistance. We can try using probability models or other methods to further improve it.

Also, does find_evict_candidate need to iterate through all elements in visited every time?

Currently, yes. This means it is an O(n) operation. Perhaps we can use other techniques to accomplish this since we only need to find an element that has not been accessed before.

One simple solution is to allow the elements in "visited" to be moved, so we actually only need a deque to complete the sorting, ensuring that the key to be removed is always at a certain position, depending on when we perform the move.

PsiACE · 2023-12-04T17:50:21Z

Although there seems to be improvement on the hits dataset, it performs similarly to Lru on some public traces and causes a decrease in throughput due to the O(n) traversal. I will try to make further modifications.

1a1a11a · 2023-12-10T01:09:40Z

@PragmaTwice Cool work!

One possible way to avoid the O(N) operations is to track the visited bit using a HashSet. Add to the HashSet upon get, and at eviction time, we iterate through the map; if an object has been visited, we put it back. Otherwise, we evict the object. In the worst case, we may have to check N objects, but we can cap this to some value, e.g., 20. There are more optimized solutions, but they would need more engineering work.

Signed-off-by: Chojan Shang <psiace@apache.org>

github-actions · 2023-12-10T17:14:35Z

Docker Image for PR

tag: pr-13904-2090508

note: this image tag is only available for internal use,
please check the internal doc for more details.

github-actions · 2023-12-10T17:44:58Z

ClickBench Report

github-actions bot added the pr-feature this PR introduces a new feature to the codebase label Dec 2, 2023

PsiACE marked this pull request as draft December 2, 2023 20:37

PsiACE marked this pull request as ready for review December 3, 2023 04:18

PsiACE requested review from BohuTANG and dantengsky December 3, 2023 04:20

PsiACE added the ci-cloud Build docker image for cloud test label Dec 4, 2023

JackTan25 added the ci-benchmark Benchmark: run all test label Dec 4, 2023

xiaguan mentioned this pull request Dec 4, 2023

Discussion About Cache Eviction Algorithm Design and Implementation datenlord/datenlord#435

Open

PsiACE marked this pull request as draft December 4, 2023 16:34

PsiACE added 5 commits December 11, 2023 00:02

feat(cache): a variant of sieve, with lazy op

2bc0671

Signed-off-by: Chojan Shang <psiace@apache.org>

feat(cache): add peek by policy

a9594da

Signed-off-by: Chojan Shang <psiace@apache.org>

refactor: make key can be clone, so, we can forget unsafe ptr

a90403f

Signed-off-by: Chojan Shang <psiace@apache.org>

chore: cargo fmt

69a907c

Signed-off-by: Chojan Shang <psiace@apache.org>

feat: better hits ratio

82d454f

Signed-off-by: Chojan Shang <psiace@apache.org>

PsiACE force-pushed the sieve branch from c8b06a5 to 82d454f Compare December 10, 2023 16:02

PsiACE added 2 commits December 11, 2023 00:29

fix: make fmt/clippy/test happy

5189766

Signed-off-by: Chojan Shang <psiace@apache.org>

fix: make fmt/clippy/test happy

ae1901f

Signed-off-by: Chojan Shang <psiace@apache.org>

PsiACE removed the ci-benchmark Benchmark: run all test label Dec 10, 2023

PsiACE added the ci-benchmark Benchmark: run all test label Dec 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(cache): a variant of sieve, with lazy op #13904

feat(cache): a variant of sieve, with lazy op #13904

Uh oh!

PsiACE commented Dec 2, 2023 •

edited by drmingdrmer

Loading

Uh oh!

PsiACE commented Dec 2, 2023

Uh oh!

PsiACE commented Dec 3, 2023

Uh oh!

github-actions bot commented Dec 4, 2023

Uh oh!

JackTan25 commented Dec 4, 2023 •

edited

Loading

Uh oh!

github-actions bot commented Dec 4, 2023

Uh oh!

JackTan25 commented Dec 4, 2023 •

edited

Loading

Uh oh!

github-actions bot commented Dec 4, 2023

Uh oh!

PsiACE commented Dec 4, 2023

Uh oh!

dantengsky commented Dec 4, 2023

Uh oh!

PsiACE commented Dec 4, 2023

Uh oh!

PsiACE commented Dec 4, 2023

Uh oh!

1a1a11a commented Dec 10, 2023 •

edited

Loading

Uh oh!

github-actions bot commented Dec 10, 2023

Uh oh!

github-actions bot commented Dec 10, 2023

Uh oh!

Uh oh!

feat(cache): a variant of sieve, with lazy op #13904

Are you sure you want to change the base?

feat(cache): a variant of sieve, with lazy op #13904

Uh oh!

Conversation

PsiACE commented Dec 2, 2023 • edited by drmingdrmer Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

PsiACE commented Dec 2, 2023

Uh oh!

PsiACE commented Dec 3, 2023

Uh oh!

github-actions bot commented Dec 4, 2023

Docker Image for PR

Uh oh!

JackTan25 commented Dec 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Dec 4, 2023

Docker Image for PR

Uh oh!

JackTan25 commented Dec 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Dec 4, 2023

ClickBench Report

Uh oh!

PsiACE commented Dec 4, 2023

Uh oh!

dantengsky commented Dec 4, 2023

Uh oh!

PsiACE commented Dec 4, 2023

Uh oh!

PsiACE commented Dec 4, 2023

Uh oh!

1a1a11a commented Dec 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Dec 10, 2023

Docker Image for PR

Uh oh!

github-actions bot commented Dec 10, 2023

ClickBench Report

Uh oh!

Uh oh!

PsiACE commented Dec 2, 2023 •

edited by drmingdrmer

Loading

JackTan25 commented Dec 4, 2023 •

edited

Loading

JackTan25 commented Dec 4, 2023 •

edited

Loading

1a1a11a commented Dec 10, 2023 •

edited

Loading