[Mar 29th Discussion] A Unified Theory of Garbage Collection #303

orkosinha · 2022-03-22T14:59:05Z

orkosinha
Mar 22, 2022

This is the discussion thread on A Unified Theory of Garbage Collection by David F. Bacon, Perry Cheng, V.T. Rajan.

Discussion hosted by @Yasgur99 @orkosinha

anshumanmohan · 2022-03-25T17:35:46Z

anshumanmohan
Mar 25, 2022

I think this paper is wicked cool. The authors observe that, although tracing and reference-counting GCs are traditionally seen as completely incomparable, optimized versions of these algorithms are actually different in a more comparable and logical way. The average language designer wants some kind of Goldilocks GC that lives between these two versions, and so it is helpful to be able to compare the two GC styles directly and make more informed choices when rolling one's own Goldilocks GC.

I imagine that, when making these choices, one can then factor in language-specific information that one happens to know. For instance, does this language buck the generational hypothesis for some reason? Does it frequently allocate massive contiguous chunks of memory for some reason? Is stopping the world particularly tolerable/intolerable?

I'd be curious as to what the class thinks of the optimizations that the authors tack on over sections 3 and 4. It feels somewhat important that "users" of this paper subscribe to these optimized versions, and obviously this comes at the cost of choosing some other optimization strategy that, for whatever reason, breaks the duality that this paper so carefully achieves. Does this mean that this paper is somehow trapped in amber?

4 replies

sampsyo Mar 25, 2022
Maintainer

Indeed! It would be interesting to hear if folks are aware of cool GC techniques that fall outside of the taxonomy laid out in this paper, or that appear to contradict it in certain ways.

gsvic Mar 28, 2022

So, I think that one major challenge with garbage collection, is that the users need to be aware of 1. how each technique works, 2. the mechanics of the language that they use and 3. with respect to these to, how to tune these methods (e.g. Java gives some freedom in configuring GC), but usually users are unfamiliar with all these things. I think one recent fancy technique, is the learned garbage collection, which tries to minimize the involvement of the human using reinforcement-learning. In summary, using Q-learning they to discover GC policies that will be adapted to the specific application's needs, depending on the workload patterns. Taking into account the expensive steps that these learning algorithms perform, I am not sure how practical would be such an approach in a large spectrum of applications. Here is the link of the paper: https://dl.acm.org/doi/pdf/10.1145/3394450.3397469. This is from MAPL 2020 (workshop co-held with PLDI).

5hubh4m Mar 28, 2022

People stop applying machine learning to random domains challenge: difficulty impossible.

ayakayorihiro Mar 28, 2022

Big agree with a lot of the points that you've made here!! Overall I found this to be a super cool paper as well, especially with the way that the authors arrived in their key observation that tracing and reference counting are duals of each other, which is by implementing optimized versions of the original two approaches (interesting to see observations from developing implementations help with the higher-level ideas).

It almost felt overwhelming to see all of the different optimizations and how they are each different from each other in their own small ways... In some sense there's a unified theory for all of this work, but there's no unified suggestion for which version of GC to use 😓

It seems like GC is something that researchers have thought about deeply for many many decades, and this paper makes it seem (at least to me) like we've reached a threshold. I'd like to know more about what has happened in the GC world since this paper (partly I think we will cover on Thursday) given that we have this "unified theory".

5hubh4m · 2022-03-27T20:36:54Z

5hubh4m
Mar 27, 2022

Loved this paper. It had me thinking, "obviously tracing and reference counting are algorithmic duals --- they're solving the same problem!" sort of like Prim's and Kruskal's algorithms for finding MST. Regardless, I'm still wondering how to apply the learnings of this paper beyond appreciating it for it's theoretical contribution and the cost model which might help making design decisions creating making garbage collection systems. FWIW, those two things already seem pretty groundbreaking but I'm curious to see what ideas other people have/had (especially PL people).

2 replies

sampsyo Mar 27, 2022
Maintainer

Yes, good question—I'd love to hear others' thoughts about how this paper's core insight seems useful (as distinct from interesting)!

One general category of answers has to do with just identifying the complementary strengths and weaknesses of the two approaches and giving clarity to a design space that mediates the weaknesses of one by adopting the strengths of the other.

susan-garry Mar 29, 2022

This paper is at the very least useful as an introduction to different algorithms for garbage collection and a paradigm for how to think about and evaluate them. For example, understanding that the only inherent differences between reference counting and tracing are (1) the ability of tracing to divide the heap and (2) the write-barrier associated with reference counting, prevents one from going off into the weeds by considering how different hybrids of these algorithms affect these factors - the base for your GC (tracing or reference counting) becomes pretty clear. Perhaps this is fairly obvious, but it's nice to have it formalized.

I also found myself thinking "of course they're algorithmic duals, they perform the same thing by looking at the same data (pointers)", but everything seems more obvious in hindsight and I'm curious how this paper shifted the ways that people thought about reference counting compared to tracing at the time it was published.

andrewb1999 · 2022-03-29T02:49:04Z

andrewb1999
Mar 29, 2022

I also really enjoyed this paper. Reframing the solution space in this way gives a very unique insight into the fundamental problems of the field in a way that few other papers are able to accomplish. I imagine this paper has fundamentally changed many people view garbage collection. This kind of ties into @5hubh4m's question, but it would be very interesting to see if any future work was able to use this new framing to develop garbage collection algorithms with different tradeoffs than those that existed before this paper was written. At a quick look I didn't see any papers along those lines, but if you imagine a somewhat continuous design space between reference counting and tracing I could see there being design points that were not fully considered before this shift in perspective. Maybe there are some specific use cases where a very specific tradeoff between latency and throughput is required. In these cases, it would be interesting to see if choosing the proper garbage collection algorithm could be viewed as choosing a specific point in the reference counting vs tracing design space.

0 replies

chhzh123 · 2022-03-29T04:12:37Z

chhzh123
Mar 29, 2022

This paper is really long, but the key idea is very concise and shocking to me. I have never thought about tracing and reference counting can be dual problems. Yeah, it is intuitive to “think of tracing as operating upon live objects, while reference counting operates upon dead objects”. However, the authors did not stop here, they even proposed the framework of using fix-point formulation to prove and unify those garbage collection algorithms. It seems the framework is elegant and can cover the mainstream algorithms.

Since this paper was published in 2004, I also wonder what new garbage collectors emerged in these two decades, and can they be categorized into this framework? The RL-based GC [MAPL’20] mentioned by @gsvic should be an interesting read, and I believe there should be something new beyond this framework.

1 reply

sampsyo Mar 29, 2022
Maintainer

I don't know if the ML-powered GC paper took inspiration from this one, but if it didn't, it seems like there's probably room for interesting follow-up work that does build on its insights!

gsvic · 2022-03-29T04:23:23Z

gsvic
Mar 29, 2022

It is interesting to see how ideas can be abstracted that far in order to make connections like these (e.g. that two different GC techniques are duals). Looks like this is a common pattern in research, as this reminds me the following similar work on the duality of operating systems (message-oriented vs procedure-oriented): http://cgi.di.uoa.gr/~mema/courses/m131/papers/lauer78.pdf

I liked the paper because it contained a thorough analysis on the different techniques. Taking into account that GC is configurable, I believe that this paper could give a deeper understanding also to developers who would like to modify the various GC-related tuning nobs of their language.

3 replies

sampsyo Mar 29, 2022
Maintainer

Just for fun, another duality that this all reminds me of is shared memory vs. message passing.

gsvic Mar 29, 2022

@sampsyo Sounds interesting. Is "The Duality of Memory and Communication in the Implementation of a Multiprocessor Operating System" the one you're referring to?

sampsyo Mar 29, 2022
Maintainer

Ah, I actually wasn't thinking of a particular paper—just the somewhat often-discussed notion that one can implement a shared-memory abstraction on top of a message-passing system and vice-versa. But that Mach paper looks like an interesting expression of that underlying idea.

tonyjie · 2022-03-29T05:10:36Z

tonyjie
Mar 29, 2022

This paper provides a unique way to see the two fundamental GC approaches: tracing and reference counting, and claims that all realistic, high-performance collectors are in fact hybrids that combine tracing and reference counting, which lies on the design space this paper proposes.

I'm just wondering what algorithms do realistic GCs (used in Python, Java, Lisp, ...) use, and can these algorithms also be mapped into the design space this paper proposed? Is the unified theory able to explain all the real-world GCs: their characteristics and trade-off?

1 reply

sampsyo Mar 29, 2022
Maintainer

It's not a direct answer to your question, but Thursday's paper is about a realistic/modern GC that definitely has characteristics of both extremes.

JonathanDLTran · 2022-03-29T06:28:17Z

JonathanDLTran
Mar 29, 2022

I thought this paper presented an important insight between tracing and reference collection, which I never anticipated. Even the algorithms as formulated in figures 3 and 4 demonstrate how similar the ideas are. It is also interesting to me how they can be related by the fix point formulation of garbage collection.
Another aspect of this paper that I found important was the creation of a standardized cost model for calculating various costs associated with different garbage collection algorithms. I agree with the authors that this leads to better comparisons of different garbage collection algorithms, and can help decide which is better in various contexts. I am curious how well these cost models map well into real world implementations of the garbage collection algorithms, especially with different workloads, and whether real world implementations are able to approximately follow what the cost models suggest, specifically for time cost.

0 replies

yy665 · 2022-03-29T11:41:10Z

yy665
Mar 29, 2022

I think this paper adds to an interesting fact in research: that is, even on a topic that has been researched for 40 years, people can still find new insights just by formulating the problem in a different way and unifying existing solutions.

The real value of this paper to any follow-up research in my opinion is that with this duality, we now have a very clear and quantifiable search space on GC, and we could just optimize over this search space based on different constraints.

One concern I am having is that this paper seems to ignore some aspects, including unreachable memory. Although I feel like taking these aspects into consideration might break the elegance and conciseness of this formulation, the author should still offer more discussion on these issues.

0 replies

charles-rs · 2022-03-29T12:42:56Z

charles-rs
Mar 29, 2022

I found the fix point formulation of garbage collectors especially interesting, and how it relates to calling these systems "duals". I really never would have guessed that ref counting and mark and sweep are doing the same thing!

It was also interesting to read a more comprehensive review of all of the ways of dealing with cycles in reference counting, as that seems like a very attractive upgrade. Also in the same vein, it was interesting to read about the "everything in between" of ref counting and tracing, as this is probably where the "best" solution lies, depending on the use case.

0 replies

zzzDavid · 2022-03-29T21:01:17Z

zzzDavid
Mar 29, 2022

I think this paper is very useful because it provides a systematic way to reason about the tradeoffs in different design choices of a garbage collector. Other than that, it reminds me of some GC hardware acceleration paper I encountered before. As the paper points out, one major drawback of tracing is the long pause time because it only frees dead objects at the end of a collection cycle. Apart from breaking it down to smaller chunks or do it concurrently with the mutator, there's also an interest to build co-processors or accelerators to do the garbage collection. For example, this paper from ISCA'18 builds a non-intrusive co-processor to accelerate tracing GC.

0 replies

alaiasolkobreslin · 2022-04-07T14:18:14Z

alaiasolkobreslin
Apr 7, 2022

I hope it’s not too late to chime in here, but I thought this paper was super fascinating. I had never thought of tracing and reference counting as being duals of each other before, so this paper helped me think about these algorithms differently. I especially liked the fix-point formulation section since I thought the formula presented was really elegant.

This paper talked in-depth about a cost model for garbage collectors, and did a cost analysis on tracing, RC, and several hybrids. I think it would have also been interesting to also see a cost analysis on garbage collectors for widely-used programming languages to see how they compare (we might have talked about this during the discussion already).

0 replies

[Mar 29th Discussion] A Unified Theory of Garbage Collection #303

Uh oh!

Replies: 11 comments · 11 replies

Uh oh!

Uh oh!

sampsyo Mar 25, 2022 Maintainer

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sampsyo Mar 27, 2022 Maintainer

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sampsyo Mar 29, 2022 Maintainer

Uh oh!

Uh oh!

Uh oh!

sampsyo Mar 29, 2022 Maintainer

Uh oh!

Uh oh!

Uh oh!

sampsyo Mar 29, 2022 Maintainer

Uh oh!

Uh oh!

sampsyo Mar 29, 2022 Maintainer

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Replies: 11 comments 11 replies

sampsyo Mar 25, 2022
Maintainer

sampsyo Mar 27, 2022
Maintainer

sampsyo Mar 29, 2022
Maintainer

sampsyo Mar 29, 2022
Maintainer

sampsyo Mar 29, 2022
Maintainer

sampsyo Mar 29, 2022
Maintainer