>*I don't understand why garbage collection is so mythologized. TBH I don't even...

kibwen · on Feb 23, 2024

This comes down to the usual boring semantic argument over what the term "garbage collection" means.

Instead, I'll use the term "automatic dynamic lifetime determination".

In a Java-style GC, there exists code that runs at runtime to determine whether or not the lifetime of a piece of memory can be determined to have expired.

In a reference-counted system, there exists code that runs at runtime to determine whether or not the lifetime of a piece of memory can be determined to have expired.

Of course, in the latter case the "code" is just checking a single counter rather than doing fancy concurrent root-tracing, but the similarity is established, and the alternatives are clearly delineable:

1) "Automatic static lifetime determination": this is what Rust does with ownership, by statically analyzing your code and automatically inserting code to free memory at statically-determined points.

2) "Manual static lifetime determination": this is the classic C-style memory management, where the author manually inserts code to free memory at statically-determined points.

eru · on Feb 26, 2024

> Of course, in the latter case the "code" is just checking a single counter rather than doing fancy concurrent root-tracing, but the similarity is established, and the alternatives are clearly delineable: [...]

You also need to do some fancy tracing for reference counting, because when a big data structure 'dies', you need to trace it completely.

Of course, reference counting also pays the price of fiddling with its counters on every single read. So every read-only workload turns into a read-write workload.

Garbage collection has no such overhead until you start the collector. (Basically all the overhead is amortised.)

t8sr · on Feb 23, 2024

Sorry, but it really sounds to me like you're repeating something from documentation or CS 101. Have you ever actually worked on a garbage collector?

* ARC is not deterministic outside of toy examples

* The runtime overhead of modern GC algorithms has a lower bound than ARC, and this has been proven formally

* Reference counting is usually atomic, so it implies other wonderfully expensive things like CPU cache flushes, memory fences, etc.

* The runtime overhead of deferred GC also happens outside of the hot path, and doesn't do unpredictable non-local things if you release a reference at an inconvenient moment

* ARC is in fact a GC strategy. E.g. obj-c has autorelease pools that postpone deletion until later, so the hot path doesn't get interrupted if reference count of something reaches zero early. These things are a spectrum.

Rust leans a lot on the borrow checker, and both Rust and C++ nudge you toward using fewer heap small allocations because it is inconvenient to use them. This is where a lot of the performance wins come from, not because reference counting somehow shaves off 20% of the program's runtime.

In fact, ARC is not only a GC strategy, it is close to being the worst possible GC strategy.

coldtea · on Feb 23, 2024

Sorry, but it really sounds to me like you're repeating something from theorical research on GCs, best cases, and cherry-picking counter-examples.

* ARCs non-determinism doesn't matter outside of contrived cases. In practical use it's as deterministic as expected.

* The runtime overhead of modern GC algorithms might have been proven formally to have a "lower bound than ARC", but like many things proven formally, it's seldom the case in practice. And this isn't also counting the memory overhead from using GC. Just like the oft-repeated "JIT can be faster than compiled because it has knowledge of the runtime" but in practice that only happens on special cases, and compiled C/C++/Rust still beats its ass.

As for the runtime overhead of deferred GC happing "outside of the hot path", and yet every domain with a GC and high load/low latency has historically suffered GC pauses, and GCs have to be reworked to ever more complex byzantine schemes to try to reduce the issue.

>*ARC is in fact a GC strategy. E.g. obj-c has autorelease pools that postpone deletion until later, so the hot path doesn't get interrupted if reference count of something reaches zero early. These things are a spectrum.

Yes, like cars and 18-wheelers are a spectrum. And C++/Rust is on the other side of that spectrum compared to Java and Go.

bencyoung · on Feb 23, 2024

The worst memory performance bug I ever saw turned out to be heap fragmentation in a non-GC system. There are memory allocators that solve this like https://github.com/jemalloc/jemalloc/tree/dev but ... they do it by effectively running a GC at the block level

As soon as you use atomic counters in a multi-threaded system you can wave goodbye to your scalability too!

t8sr · on Feb 23, 2024

As you point out, the use of ARC is correlated with languages used in high-performance code.

You think it's because ARC is a better way of managing memory, but that's false. Rust is not fast because it uses ARC, it uses ARC because Rust is fast. Rust is fast because it has a good ownership model for memory, because it encourages patterns that mostly get rid of the need to dynamically manage lots of heap objects. These qualities make it cache-friendly, among other things. What's left is generally small enough that even a poor GC strategy like ARC is good enough. A state-of-the-art GC wouldn't be worth the extra complexity in Rust. (And would probably conflict with other features.)

Go needs a state of the art GC, because it wants to hide "stack vs heap" from the programmer, and so it puts a lot of things on the heap. If it used ARC, it would be unusable.

To people who have worked on programming languages, saying "ARC is good, actually" is either a sign that you're about to present a very nuanced argument informed by decades of experience, or, more likely, that you have no idea what you're talking about.

If you're going to make a claim that lies this far outside the mainstream, I'd say the onus to present a good argument is on you, not me.

With age, I've found it good to moderate my fervor in proportion to how much I know about a subject. If people learn that you're someone who talks with confidence only about things you know, you'll have an easier time getting them to listen.

pjmlp · on Feb 23, 2024

On top of that, many seem to forget that Objective-C's ARC is the outcome of a project failure.

Apple did not add ARC to Objective-C because it was a great design, rather they failed to add a tracing GC in Objective-C 2.0 that wouldn't crash left and right due to the underlying C semantics, thus they settled with the 2nd best option, having the compiler automate Cocoa's retain/release calls (similar to VC++ extensios to deal with COM AddRef/Release).

Then sold the failure as a success, and improvement, in typical Apple's fashion.

Likwise, Swift had to adopt a similar approach if it was to stay compatible with Objective-C's runtime, otherwise it would need a complicated interop layer like .NET has with COM (RCW/CCW).

steveklabnik · on Feb 23, 2024

Don't forget that Rust's Arc is not Swift's ARC.

pjmlp · on Feb 23, 2024

I know, although I am quite curious as language nerd, if the Swift ARC alongside the new Swift 6's memory ownership won't be a much productive approach than dealing with the borrow checker and manually having to type .clone() instead of lettting the compiler do it.

Not that is matters much outside Apple's ecosystem, though.

kaba0 · on Feb 23, 2024

> Go needs a state of the art GC

It needs, but unfortunately it doesn’t have. That category is won by Java multiple times.

t8sr · on Feb 23, 2024

Interesting - I would be really interested in an apples-to-apples comparison between the two of them, but such a thing must be hard to do. I'd fully expect the amount of investment in JVM to have produced a way better result, but I can't find any good articles about it.

kaba0 · on Feb 23, 2024

Not an article, but a good “measuring stick” might be the binary trees benchmark: https://benchmarksgame-team.pages.debian.net/benchmarksgame/...

This one stresses the GC quite a bit (so it doesn’t make much sense for non-managed languages that can “cheat”), and there is only Haskell as a managed language that could get ahead of Java (but it is not really apples to oranges due to the different execution model), and in other’s case it is not even close.

Go is particularly bad in this benchmark, though it sort of makes sense as it optimizes for low latency, which is fundamentally at odds with throughput.

neonsunset · on Feb 23, 2024

Also the cost of write barriers. Looking at the benchmark code, it may actually benefit the languages with non-moving, simpler GC implementations because they would have cheaper write barriers or complete lack of thereof. With that said, it's a good demonstration of the quality of OpenJDK HotSpot JIT and GC. This is an area where .NET has still some ways to go - it has precise write barriers but they are not inlined so performance in scenarios bottlenecked by the cost of assigning object reference to a new heap location will vary a lot depending on WKS vs SRV GC and its configuration + exact CPU model and arch being used (more so than regular code). It does, however, use .NET 7 which is a shame.

igouy · on Feb 23, 2024

> managed language that could get ahead of Java

Here's a different reading:

https://news.ycombinator.com/item?id=29323468

pjmlp · on Feb 23, 2024

And yet no one is rewriting either Android (outside kernel services) or Kubernetes in C++/Rust.

Apparently after the whole WinRT/UWP mess, C++ is now left for writing low level stuff, with everything else on WinUI 3.0/WinAPPSDK being done in C#.

Devil May Cry 5 on PlayStation 5 seems to be doing quite alright with Campcon's in-house fork from .NET.

So in the end, other than some corner cases where no kind of automated memory management is at all allowed, people will rather take a regular car instead of a F1.

eru · on Feb 26, 2024

Reference counting can also take arbitrary amounts of time.

lomase · on Feb 23, 2024

Do you really don't see how a GC and reference counting are similar?

coldtea · on Feb 23, 2024

Yes: in the same way a boat and a car are similar since they both "take you from A to B".

Now, do you really don't see how a Java/GO style GC and reference counting are different?

It's not like I didn't enumerate some characteristics that make all the difference - and that explain why people do not think of those two cases as just a single thing called "garbage collection", and also why people believe that ARC would have less overhead than a Java/GO style garbage collector.

pjmlp · on Feb 23, 2024

There isn't a Java style GC, there are multiple implementations, including with reference counting GC algorithms.

The language and runtime specification don't say anything about GC algorithms required by compliant implementations.