We switched to Java 21 virtual threads and got a deadlock in TPC-C for Postgres (ydb.tech)
280 points by magden 12 months ago | 243 comments



This problem is not going to go away so easily. Numerous core Java classes (like BufferedInputStream) use synchronized; I count 1600+ usages in java.base. The blocking issue means it's _much_ easier to run into this accidentally than to wave it away as an unlikely edge case.

I personally ran into this using the built-in com.sun webserver with a virtual thread executor. My VPS only has two CPUs, which means the FJP that virtual threads run on only has 2 active threads at a time. I hit this hang when some of the connections hung, blocking any further requests from being processed.
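
For reference, here's roughly what that setup looks like (a minimal sketch; the class name and handler body are illustrative):

    import com.sun.net.httpserver.HttpServer;
    import java.net.InetSocketAddress;
    import java.util.concurrent.Executors;

    public class VtHttpServer {
        public static void main(String[] args) throws Exception {
            HttpServer server = HttpServer.create(new InetSocketAddress(8080), 0);
            // One virtual thread per request, but all of them share the small
            // carrier pool (2 threads on a 2-CPU box), so a pinned carrier
            // can stall every other in-flight request.
            server.setExecutor(Executors.newVirtualThreadPerTaskExecutor());
            server.createContext("/", exchange -> {
                byte[] body = "hello".getBytes();
                exchange.sendResponseHeaders(200, body.length);
                try (var out = exchange.getResponseBody()) {
                    out.write(body);
                }
            });
            server.start();
        }
    }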


As the JEP states, pinning due to synchronized is a temporary issue. We didn't want to hold off releasing virtual threads until that matter is resolved (because users can resolve it themselves with additional work), but a fix already exists in the Loom repository, EA builds will be offered shortly for testing, and it will be delivered in a GA release soon.

Those who run into this issue and are unable or unwilling to do the work to avoid it (replacing synchronized with j.u.c locks) as explained in the adoption guide [1] may want to wait until the issue is resolved in the JDK.
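
For reference, the mechanical part of that replacement looks roughly like this (a minimal sketch; the class and field names are illustrative):

    import java.util.concurrent.locks.ReentrantLock;

    class Counter {
        private final ReentrantLock lock = new ReentrantLock();
        private long count;

        // Before: synchronized void increment() { count++; }
        void increment() {
            lock.lock();        // a virtual thread blocked here unmounts instead of pinning its carrier
            try {
                count++;
            } finally {
                lock.unlock();  // always released, even on exceptions
            }
        }
    }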

I would strongly recommend that anyone adopting virtual threads read the adoption guide.

[1]: https://docs.oracle.com/en/java/javase/21/core/virtual-threa...


> unable or unwilling to do the work to avoid it

The problem is that it's rare to write code which uses no third-party libraries, and these third-party libraries (most written before Java virtual threads ever existed) have a good chance of using "synchronized" instead of other kinds of locks; and "synchronized" can be more robust than other kinds of locks (no risk of forgetting to release the lock, and on older JVMs, no risk of an out-of-memory while within the lock implementation breaking things), so people can prefer to use it whenever possible.

To me, this is a deal breaker; it makes it too risky to use virtual threads in most cases. It's better to wait for a newer Java LTS which can unmount virtual threads on "synchronized" blocks before starting to use it.


> have a good chance of using "synchronized" instead of other kinds of locks; and "synchronized" can be more robust than other kinds of locks (no risk of forgetting to release the lock, and on older JVMs, no risk of an out-of-memory while within the lock implementation breaking things),

I haven't professionally written Java in years, but from what I remember synchronized was considered evil from day one. You can't forget to release it, but you'd better go out of your way to allocate an internal object just for locking, because you have no control over who else might synchronize on your object, and at that point you're only a bit of syntactic sugar away from try { lock.lock(); } finally { lock.unlock(); }.


The fact that the monitor is public rarely causes issues, and in those cases where it's used on internal objects, it's not really public anyhow.

There's an additional benefit to using the built-in monitors, and that has to do with heap allocation. The data structure for managing the monitor is allocated lazily, only when contention is actually encountered. This means that "synchronized" can be used as a relatively low-cost defensive coding practice, in case an object which isn't intended to be used by multiple threads actually is.


Is there a similarly low-level synchronization mechanism that doesn't work this way? .NET's does the same thing.

I guess I might have preferred if both Java and .NET had chosen to use a dedicated mutex object instead of hanging the whole thing off of just any old instance of Object. But that would have its own downsides, and the designers might have good reason to decide that they were worse. Not being able to just reuse an existing object, for example, would increase heap allocations and the number of pointers to juggle, which might seriously limit the performance of multithreaded code that uses a very fine-grained locking scheme.


In .NET, async won where lock and mutex don't work (lock is like synchronized; not exactly the same, though). That's why most libraries use SemaphoreSlim, which works with green threads. But that's more because of the ecosystem. I've barely stumbled upon locks, and mutex is mostly used in the main method, since it acquires a real OS mutex; not really a cheap thing, but for GUIs it's a clever way to check if the app is already running. Most libs that use System.Threading.Tasks use SemaphoreSlim, though.


Yeah, definitely. But for a fair comparison I think you have to look at how .NET did things before async/await hit the scene. And, for that, the aspect of the design in question is quite similar between the two.


Early .Net is hardly an independent data point from early Java. Not only was .Net directly influenced by Java, it also had to support a direct migration from the Microsoft JVM specific Visual J++ to J#.

The handful of languages I know either do not have a top level object class that supports a randomized set of features ( C++ ) or prioritize a completely different way of concurrent execution ( Python, JavaScript ).


Hi Ron. Thanks a lot for the amazing work you are doing on Loom and the whole JVM platform. Could the EA builds and GA release you mentioned make it into 22, or did you mean EA builds for 23?


Wow, I would love to be in the meeting where this decision was made.

Let's ship this with a footgun, but let's not mention in the JEP that it may hang; let them figure it out.


I don't know man?

We make scalable graphics rendering servers to stream things like videogames across the web. When we started the project to switch to virtual threads we had that as number one on the big board. "Rewrite for reentrant locks."

Maybe we have more fastidious engineers than a normal company would, since we are in the medical space? But even the juniors were reading up and familiarizing themselves with how to lock properly in Loom's infancy.

All that only to point out that, yes, they had communicated the proper use of reentrant locks long ago.

I do understand what you're saying from an engineering management perspective though. That effort cost a fortune. Especially when you have the FDA to deal with.

It was more than worth it though! In the world of cloud providers, efficiency is money.


Wait, are you writing medical videogames?


We use the same technologies to deliver, say, remote CT review capability that you would use to stream a videogame. It's just far more likely that the audience I'm communicating with, HN, is familiar with the requirements of videogame streaming than with remote medical dataset viewing. Obviously the requirements of our use case are far more stringent, but there's no need to go into all that to illustrate the point made.

1 - Use virtual threads with reentrant locks if you need to do "true heavy" scaling.

2 - Kind of implied, but since you gave the opportunity to make it explicit with your comment =D: there is no need to waste your life earning no money in videogames when the medical industry is right there, willing to pay you 10x as much for the same skills. (Provided your skill is in the hard backend engine and physics work. They pay more for the ML too, if I'm being honest.)


I understand the frustration, but why not read a doc?

https://docs.oracle.com/en/java/javase/21/core/virtual-threa...

In the "Virtual Threads: An Adoption Guide" part there is:

When using virtual threads, if you want to limit the concurrency of accessing some service, you should use a construct designed specifically for that purpose: the Semaphore class.
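
In code, that guidance looks roughly like this (a sketch; the class name and the limit of 10 are arbitrary):

    import java.util.concurrent.Callable;
    import java.util.concurrent.Semaphore;

    class LimitedService {
        // At most 10 concurrent calls into the backing service, instead of
        // achieving the same limit by sizing a thread pool.
        private final Semaphore permits = new Semaphore(10);

        <T> T call(Callable<T> task) throws Exception {
            permits.acquire();  // parks the virtual thread; the carrier stays free
            try {
                return task.call();
            } finally {
                permits.release();
            }
        }
    }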


That language only obliquely mentions the issue. It is nowhere near clear and direct enough for someone who is just, for example, using a third-party library that is affected. And then it's stuck inside detailed documentation that anyone who wasn't personally planning on adopting virtual threads is unlikely to read.

This seems like it's at least vaguely headed in the direction of that famous scene from early in The Hitchhiker's Guide to the Galaxy:

“But the plans were on display…”

“On display? I eventually had to go down to the cellar to find them.”

“That’s the display department.”

“With a flashlight.”

“Ah, well, the lights had probably gone.”

“So had the stairs.”

“But look, you found the notice, didn’t you?”

“Yes,” said Arthur, “yes I did. It was on display in the bottom of a locked filing cabinet stuck in a disused lavatory with a sign on the door saying ‘Beware of the Leopard’.”


Maybe you should stick to reading Adams and not programming?


You might accidentally write an infinite loop as well - should we not use Turing-complete languages or what?

It’s not like multithreaded computing wasn’t full of footguns anyway.


I would like to take this opportunity to thank pron and the amazing JDK developers for working on a state-of-the-art runtime and language ecosystem and providing it for free. Please ignore the entitled; there are many, many happy devs who can't thank you all enough.


People always forget that things that only happen every few million times can happen fairly frequently on a busy server. This has bitten me numerous times. The nature of a lot of these types of issues is that they are hard to detect and hard to reproduce.

Virtual threads are nice for unblocking legacy code, but they aren't without issues. There are better options for new code with fewer trade-offs on the JVM as well. I've recently been experimenting with jasync-postgresql (there's a MySQL variant as well) as an alternative to JDBC in Kotlin. It's a nice library. It does have some limitations and is a bit on the primitive side. But it appears to be somewhat widely used in various database frameworks for Scala, Java, and Kotlin.

Databases and database frameworks are an area on the JVM where there is just a huge amount of legacy code built on threads and blocking IO. It's probably one of the reasons Oracle worked on virtual threads, as migrating away from these frameworks is unlikely to ever happen in a lot of code bases. So, waving a magic wand and making all that code non-blocking is very attractive. But of course that magic has some hard limitations, and synchronized blocks are one of those. I imagine they are working on improving that further.


> Virtual threads are nice for unblocking legacy code, but they aren't without issues. There are better options for new code with fewer trade-offs on the JVM as well.

The designers of Project Loom would say the exact opposite. The whole push behind Project Loom and similar models (Go's oft-praised goroutine runtime being another) is that threads are a much better fit for async behavior in a fundamentally procedural language like Java or Go than promise-based frameworks like async/await.

The whole motivation of Project Loom is to make the simple thing (spawning threads to handle blocking IO) the fast thing as well (by actually replacing the blocking IO with efficient async IO OS calls and managing the threads internally). Project Loom will be considered a full success if the next-generation Java web server does something akin to `new Thread(() -> { executeHandlerFunc(conn); }).start();` for each incoming connection, just like the Go built-in web server.
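
Something like this sketch (handleConnection is a stand-in for application logic):

    import java.io.IOException;
    import java.net.ServerSocket;
    import java.net.Socket;

    class Acceptor {
        static void serve() throws IOException {
            try (ServerSocket listener = new ServerSocket(8080)) {
                while (true) {
                    Socket conn = listener.accept();
                    // One cheap virtual thread per connection; the runtime
                    // turns the blocking socket IO inside into async IO.
                    Thread.ofVirtual().start(() -> handleConnection(conn));
                }
            }
        }

        static void handleConnection(Socket conn) { /* placeholder */ }
    }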


I think it's not that black and white. Clearly they made a choice to be backwards compatible. Not because Java threads have a nice API (not even close), but because a lot of legacy code that will never be changed uses it, including all the ugly bits that you shouldn't be using, like a lot of the low-level synchronization primitives that date back to the early days of Java. It's an impressive bit of work, but they made some compromises to make things work. A new API would have been easier, would have had less overhead, and would have been nicer to use. But backwards compatibility with legacy code was a big goal.

It mostly works fine and it's an impressive bit of engineering. But it has some really ugly failure modes in combination with hacky legacy code designed for real threads. So, you can't blindly assume things to just work. Hence the deadlocks.

Many Java servers already work the way you outline. It's just that they are a bit tedious to use with the traditional Java frameworks. Which is one reason I like using Spring's WebFlux with Kotlin instead. Just way nicer when it's all exposed via coroutines.


There are two separate choices. One is the choice of whether to implement green threads in the JVM at all, or whether to use async/await, or some other type of concurrency primitive. The other is whether to expose the new concurrency primitive using a new API or an existing one.

You could say the second choice, the specific API, was done, at least to some extent, for backwards compatibility reasons. I wouldn't agree, but I think there is at least some argument to be made. Here is one of the designer's explanation [0]:

> We also realized that implementing the existing thread API, so turning it into an abstraction with two different implementations won't add any runtime overhead. I also found that when talking about Java's new user mode threads back when this feature was in development, and back when we still called them fibers, every time I talked about them at conferences, I kept repeating myself and explaining that fibers are just like threads. After trying a few early access releases of the JDK with a fiber API, and then a thread API, we decided to go with the thread API.

However, the choice of adding a new concurrency primitive to Java in the form of green threads instead of others was very very clearly not done for backwards compatibility's sake. Ron Pressler (who is active here as 'pron') has several talks on the advantages of green threads over async/await that you can look at [0][1]. The designers of Go also had the same belief, and also chose to add green threads as the fundamental built-in concurrency primitive in Go, obviously not for backwards compatibility reasons in their case.

[0] https://www.infoq.com/presentations/virtual-threads-lightwei...

[1] https://www.youtube.com/watch?v=EO9oMiL1fFo


>The designers of Project Loom would say the exact opposite.

Sure, but then again the designers of circa 2000-2010 J2EE also thought the verbosity and over-engineering was a good idea.


There might be some justification for comparing any one particular thing to the worst possible particular thing if those things have something in common. The only feature the two things you picked have in common is the word 'java'.


They also have in common the appeal to authority: (the designers) as arbiters of good judgement.


Appeal to expertise. Appeal to authority is a fallacy when the authority is not an expert in the requisite domain. E.g.: we don't care what a policeman thinks about astrophysics, but we do care what the astrophysicist says.


J2EE started as an Objective-C framework, before being rewritten in Java.


I don't know.

My understanding is that the highest-performance webserver is nginx. And it uses async internally.

IMO, virtual threads is a better general purpose language feature because it avoids function coloring and is generally easier to reason about, but it may not result in the highest performance Java webserver.


NGINX is a native C implementation, so it has to be carefully written to use the OS's native high-performance IO and native OS threads.

The purpose of Project Loom is to abstract that away from Java application code. The runtime can use the most efficient IO for the given platform (ideally io_uring on Linux or IOCP on Windows, for example) even if the application code makes old-style blocking file writes. The application can then use simple APIs and code patterns but still get massive performance.

With Loom, you can easily have 20,000 virtual threads servicing 20,000 concurrent HTTP requests and each "blocked" in IO, while only using, say, 100 OS threads that are polling an IOCP. A normal Linux box can typically only handle around maybe 1000 threads across all running processes.


Servicing 20,000 concurrent requests on a single box where somehow threads are the bottleneck, is that not a problem that approximately no one has?


Most application webservers (by default) handle one request per thread. For mostly IO bound stuff (which many projects are), it makes sense to me that threads become a bottleneck in relatively ordinary scenarios.


The scenario where your IO could handle way more than a thousand concurrent requests if only the thread overhead was reduced? When does that ever happen?


Each OS thread costs memory. With the version of Java I have, the default is to allocate 1MB of stack for each thread. So, 10,000 threads would require 10,000 MB of RAM even if we configured ulimit to allow that many threads. In contrast, asking the kernel to do buffered reads of 10,000 files in parallel requires much less memory - especially if most of those are actually the same physical file. Of course, they won't be read fully in parallel.

For example, this program:

  var threads = new Thread[20000];
  for (int i = 0; i < 20000; i++) {
    threads[i] = Thread.ofVirtual().start(() -> {
      try {
        Files.copy(FileSystems.getDefault().getPath("abc.txt"), System.out);
      } catch (IOException e) {
        System.err.println("Error writing file");
        e.printStackTrace();
      }
    });
  }
  for (int i = 0; i < 20000; i++) {
    threads[i].join();
  }
Run as `java Test > ./cde.txt`, it takes about 4.5s on my WSL2 system with 2 cores, writing a 2 GB file (with abc.txt being 100KB); even this would be within the HTTP timeout, though users would certainly not be happy. I'm pretty sure a native Linux system on a machine beefy enough to be used as a web server would have no problem serving even larger files over a network like this.


1. You are not solving a real problem. The use case you describe (basically a CDN) is already exotic, the scenario where such a system would have already been implemented with Java and its basic IO seems implausible.

2. You did not compare against fewer threads to see if threads are actually the bottleneck rather than IO. Also, all your threads are competing for stdout.


The lack of support for synchronized isn't a fundamental or hard limit, it's just that the HotSpot implementation is complicated for performance reasons and they put off rewriting that code until later. They're indeed working on that now and in some future version I guess wait/notify and synchronized blocks will start to work. After all, you can easily transform such code into an equivalent that does work.


There are ways to find problem sections without having to trigger a full deadlock: https://openjdk.org/jeps/444

  The system property jdk.tracePinnedThreads triggers a stack trace when a thread blocks while pinned. Running with -Djdk.tracePinnedThreads=full prints a complete stack trace when a thread blocks while pinned, highlighting native frames and frames holding monitors. Running with -Djdk.tracePinnedThreads=short limits the output to just the problematic frames.
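
So, for example (the jar name is illustrative):

    java -Djdk.tracePinnedThreads=full -jar myapp.jar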


I was curious what "jasync" is. And man, it hurts me to see documentation like this (when compared to classic javadocs).

https://github.com/jasync-sql/jasync-sql/wiki/API-Overview

From project WIKI (https://github.com/jasync-sql/jasync-sql/wiki)


Synchronized blocks are not a problem. Synchronized blocks that later don’t unblock the thread may sometimes be.


BufferedInputStream has been rewritten and only uses synchronized if subclassed. In fact, there has been a lot of work on removing the synchronized keyword.


I've written an open source library to easily replace synchronized with something more virtual thread friendly: https://github.com/japplis/Virtually


Totally off topic, but I am getting tired of the AI-generated images used on nearly all blog posts nowadays. They are instantly recognisable, and it just seems low effort and lowers the feeling of quality one might otherwise have.


To me, it's more about the style than the use of an AI. But I agree.

I enjoyed this writeup by Michael Lynch on finding an illustrator [1] for their blog. In doing some of my own writing, I've really found it enlightening how much secondary work goes into publishing your own work. I often think it's so nice to be able to _just_ plug in what I want on a site and get a (more or less) free illustration. But as someone selling their own work/time, it feels wrong. I'd rather pay a real human, build a relationship, and have something of higher quality. On the other hand, it can be expensive and time consuming, and I've been screwed over. Often it seems like a bigger risk than it's worth.

So idk, you're trading some hardship and risk for ease of use plus an ethical dilemma.

[1] https://mtlynch.io/how-to-hire-a-cartoonist/


Worse yet, the dining philosophers in the image have too many hands. No wonder they’re deadlocking! :)


Clearly, those are virtual hands.


Between the shitty obviously-AI-generated square header image with floating hands everywhere, the equally shitty obviously-AI-generated image in the middle and the "Please pay for Medium" banner which takes up literally half the page, this blog post does its utmost to make a truly terrible impression.


I prefer AI generated images over stock photos though. You can tell that both are phony, but at least the AI can be a bit more creative.


People in stock photos can generally be counted on to have no more than a traditional number of hands.


The issue is that we're now getting tons of blog posts with AI-generated images from authors who previously wouldn't have dared to use stock photos.


I prefer no images over IA stolen/generated images.


> ...the Taft Test:

> Does your page design improve when you replace every image with William Howard Taft?

> If so, then, maybe all those images aren’t adding a lot to your article. At the very least, leave Taft there! You just admitted it looks better.

https://idlewords.com/talks/website_obesity.htm


What is IA stolen?


stolen by French AI of course


There's a quality spectrum of AI-generated header images. Some are just random DALL-E output which aren't intrinsically relevant to the article (like the one used in this article), but you can have a little fun with it and do something distinct. This may require more control than just using Bing Image Creator.

Also, a thumbnail tip: square thumbnails are bad. If you have to use a square 1024x1024 AI generation, crop it to something like 1024x575, which incidentally can make things difficult if using AI generation since figuring out what to crop requires human intervention.


There's a quality spectrum of AI-generated images, sure, but they're all equally artistically void.


Not all.


Yeah they are. Art is communication. Computers don't communicate, they generate.


> since figuring out what to crop requires human intervention.

I don’t know how good they are, but people have trained models on that problem. Googling “autocrop tool” gives me multiple options.


Fair point.


It cheapens it, making it look like AI-generated seo-blog-spam. I'd rather a technical diagram or some plain icons, at least that would look tasteful.


I just don't really understand the value they're supposed to bring. If everyone uses the same-looking generated images, it just makes all the blogs look the same again. Then the thing is, they usually have nothing to do with the actual article. So why not just leave them out and not waste space?


All the SaaS sites featured on HN now and then also look the same :)


I dislike the style this particular author chose, but don't object in general. Assuming the images are actually somewhat relevant (or at least funny), I think I'd prefer an AI-generated image over a big wall of text.

To each their own, though, of course.


The text is the only reason to visit the blog, so why waste bandwidth with images of four-handed people?

At least they could try and generate something where I can't see malformed bodies within seconds. Or create a nice diagram that actually adds something to the text.


> why waste bandwidth with images of four-handed people?

In the end it's just 2x 40kb


So that makes a signal to noise ratio of 1:5 given the text including code is just 14kB.


For me the top 3 files downloaded by size are all js files that are about 450kb in total.

Also like 7 font files for ~100kb


What value does AI slop add though?


What value does a random stock image add?


Very little, but it might at least have the right amount of hands and a sensible aspect ratio


Typically it’ll be in lieu of nothing or stock photography. Doesn’t it seem better than that?


Not really. I wish the trend of giant generic hero images on every blog post would go away, they almost never add any value. I think it was Medium that started the trend.


Unfortunately all social media sharing requires a thumbnail for easy clicking, no real way around it. (with Hacker News as the lone exception of course)

The default thumbnails in lieu of your own aren't good.


> all social media sharing requires a thumbnail for easy clicking, no real way around it

This doesn't mean you need a giant hero header, or an AI generated image, or even any images in your posts at all.


Use og:image then: https://ogp.me/


I’ll honestly take the “put some text in the thumbnail” trend that GitHub, Nuxt Content, etc all do, over a low-quality image.


They do add value, they make clicks more likely.


> They do add value, they make clicks more likely

"Making clicks more likely" is a terrible measure of genuine value.

There are lots of images which will make people click, even if once they see your page they click 'Back' a second later. Our metrics are broken if we continue to attribute that click as 'success'.


> "Making clicks more likely" is a terrible measure of genuine value.

Genuine value, to who? For the author, getting more clicks is probably of "genuine value", depending on their goals for their writing. But seems most people are not writing and publishing stuff today because they think it provides value to others, but because they think it'll provide value to themselves somehow.


The question is: do you actually want to attract people who only click because of an image? And if you AI-generate it, are you fine with parts of the target audience not clicking on obvious AI thumbnails because they assume the entire content is low-effort?


One is aesthetic filler that is true to its purpose of loosening up the typography of a wall of text. The other tends to be awkwardly clever on a level of awkward that was unknown to mankind until recently. I used to hate stock photography fillers just like everybody else, but now my preference is as clear as it would be surprising to past me.


Genuinely it's a downgrade in my opinion


Considering it adds nothing but 5 megs of noise.

No.


The whole Medium thing seems low effort. I hardly remember reading a well-written article there.


Interesting. Now that you mention it, there are illustrations there. But I'm pretty sure I subconsciously scrolled past them to get to the rest of the article without consciously noticing them on the first read.

Generated or hand drawn, they're kind of a wasted effort on a technical post.


I also subconsciously scrolled past them. Anything unexpectedly colourful and so on just hits some sort of mental adblock for me now.


At least you now know which of your peers have no taste and strange beauty standards. Some images posted by my colleagues for everyone to see on LinkedIn look like sexist propaganda cartoons.

Stock images used to hide this "quality" better than I thought.


Honestly, the images attached to the article seemed great to me; they were colorful and fun. I don't see any reason to care who or what created them.


I like the image. It's cute and it's fun picking out the defects.


Exactly. It says "I can't be bothered producing this", and I feel like: so why should I be bothered reading it?


Ah yes, let's take the most uncharitable explanation and assume that's the case.

Maybe they have no artistic ability of their own? Maybe they just aren't good at finding the kinds of images (that can be freely used without infringing on anyone's copyright) that they need?

If it were me, and the guidance was "never use AI generated images in your blog post", I would probably just not use any images at all. Which I guess for some people would probably be best. But personally I prefer walls of text to be broken up by... something.


It’s something I struggle with - I really can’t draw, I really really can’t draw on a computer.

In the past I’ve used lots of screenshots which seems to work well.

Where I have used images I have cut and pasted and used things like canva but nothing has ever really ended up as I would have liked it.


It shouldn't be so hard to realise that if you make your blog post look how spam looks, it'll look like spam.


Kinda like when someone pulls in FOSS code or a package without contributing or at least email the authors.


Someone (not me) put it like this:

"to the trained eye you can already see that every single ai generated image is a picture of the same thing"


That applies to every generative AI, not just images. Generate a bunch of text with LLMs and you'll also see patterns emerging that it won't ever break out of.


It is a known caveat that virtual threads do not work well with long-running synchronization, by pinning the thread. That unfortunately means that for many applications it may be premature to adopt them, but they are mature enough for broader evaluation by libraries and frameworks. The Java team provided a status update on their efforts recently [1].

https://www.youtube.com/watch?v=WoQJnnMIlFY&t=421s


Sorry, but the first sentence is misleadingly worded.

`synchronized` pins the thread only when, from within the `synchronized` block, the program calls a blocking operation that would normally unmount the virtual thread, like blockingQueue.take() or similar (which is not a sane coding practice anyway). It's because unmounting, as implemented today, does not work well with synchronized.

It's better if people read JEP 444 than rely on forum comments, to avoid being misinformed.

Speaking of long-running: even without synchronized, long-running code keeps the native thread occupied until some blocking operation is called. So an endless loop that never calls a virtual-thread-ready blocking operation will occupy the native thread forever.

Java virtual threads are a kind of cooperative multithreading: another virtual thread only gets a chance to kick in when the current virtual thread reaches specific blocking operations. This is in contrast to preemptive multithreading with native threads.
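
A contrived sketch of that failure mode (with default settings the scheduler has one carrier thread per CPU core):

    public class BusyLoops {
        public static void main(String[] args) throws InterruptedException {
            int cpus = Runtime.getRuntime().availableProcessors();
            for (int i = 0; i < cpus; i++) {
                Thread.ofVirtual().start(() -> {
                    while (true) { }  // pure compute, never reaches a blocking op
                });
            }
            // This virtual thread may never get a carrier to run on.
            Thread.ofVirtual().start(() -> System.out.println("starved")).join();
        }
    }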

So I agree with your conclusion. Virtual threads cannot (yet?) be blindly used as a drop-in replacement for native threads in existing code. And new code needs to take their specifics into account.

BTW, another way I discovered to block the native carrier thread executing a virtual thread is a blocking read through FileInputStream, for example reading from the console. FileInputStream does not implement virtual-thread parking at all (yet?).


The issue in this case isn't actually the synchronized block. The thread is blocked on Object.wait, which releases the monitor before sleeping. The problem is that Object.wait is still implemented in native code, which pins the thread. The idea is that these days wait isn't exactly deprecated, but there are better concurrency tools available, so they upgraded those first, leaving the Java 1 style concurrency tools for later. And Java 1 style concurrency has been improved on and is hardly insane; it can work well enough in many situations and is sometimes the basis for higher-level concurrency utilities.


By long running I just meant anything that is not fast compute. I was more focused on finding the reference link, so I agree my wording wasn't clear.

Go started without preemption and added it later. The Java team has indicated a similar path, so we might see that tackled in the future. I think they could do it using safepoints or JEP 312's handshakes, so it's not infeasible.

For file IO they wanted to explore io_uring, and they might need to add a Loom-friendly resolver for JEP 418. There is just so much left, like scalable timers, that I think it's going to be a long time until VTs are a good default choice.


https://www.youtube.com/watch?v=WoQJnnMIlFY&t=260s

To get the whole context, so virtual threads are unusable?

What holds a monitor by default and is there a workaround?

Found more:

    A virtual thread cannot be unmounted during blocking operations when it is pinned to its carrier. A virtual thread is pinned in the following situations:

    The virtual thread runs code inside a synchronized block or method

    The virtual thread runs a native method or a foreign function (see Foreign Function and Memory API)
For those that don't know what this means: blocking network TCP IO needs a synchronized block to work = you can't use virtual threads for networking. I wish they had formulated it like that from the start!

At least now we know what they meant with "don't use virtual threads for anything but tasks" <- not blocking IO with synchronization!

So for now manual NIO is still the king of the hill.

We are reaching peak humanity levels of complexity!


> For those that don't know what this means: blocking network TCP IO needs a synchronized block to work = you can't use virtual threads for networking

That's not true: blocking TCP IO is not implemented as blocking under the hood. That's the whole point of virtual threads, so your conclusion is faulty.


Perhaps it’s better to say that they are not yet general purpose. There are many caveats which need to be resolved and are being actively worked on. I would not use them broadly yet, but that could change rapidly.

A monitor will pin the VT to the carrier thread. That can cause surprising incompatibilities in the current JDK. Soon these footguns will be fixed and you can use them worry-free.

https://www.reddit.com/r/java/comments/1512xuo/virtual_threa...


We'll see; if Patricio Chilano hasn't fixed this in a year, I would start to get VERY worried.

Moving monitors into Java, which is the long-term solution they are working on, is not a good one.

Java should be the API not the implementation!


The problem is not “long-running synchronization” but synchronization that relies on stuff running outside of virtual threads to unblock itself. There is no issue beyond performance if you perform filesystem operations while mounted.


Yeah, I wasn’t being particular about this exact issue and was generalizing about synchronization pinning the carrier. A deadlock is trivial once the implications of that are thought through.

https://mail.openjdk.org/pipermail/loom-dev/2023-July/005993...


Curious if you considered switching to a different connection pooling library. These days I usually use HikariCP, which is fast and actively maintained. c3p0 hasn't had any activity for years; I'm not sure it's still maintained.


Crucially, c3p0 will probably never see its `synchronized` blocks replaced by reentrant locks. Since LTS offerings exist for Java 21, many libraries might actually do that. But I actually hope that the ecosystem resists, which would force virtual thread users suffering from this problem to upgrade soon.


Indeed, Hikari is the go to connection pool for some years now. It's even the default when running Spring Boot.


Perhaps we'll give HikariCP a chance. However, please keep in mind that the goal of the YDB team is to enhance database performance. We needed virtual threads to make TPC-C efficient enough to generate a reasonable load on a modest amount of hardware.


Is that true?

Virtual threads aren't necessarily faster; you still have just as many sockets and network connections as before. You can easily spawn 5000 platform threads, and if that's not enough, there are quite a few user-space implementations of fibers/coroutines/async etc. on the JVM that can deal with many outstanding requests (Cats/ZIO in Scala, Kotlin coroutines, the Play framework, concurrent.Future, etc.)


Looks like HikariCP is also awaiting fixes for this https://github.com/brettwooldridge/HikariCP/pull/2055


It's been ages since I touched it, but back in 2017-2018 I had some fun integrating HikariCP in place of c3p0 in some Clojure projects, and it was more performant.


Go has a mechanism to spawn a new thread (an m, in Go runtime parlance) if it thinks one of its threads might be blocked in cgo (Go's "native function" equivalent). That prevents stuff like this.


Java does the same for Object.wait(); it's just that the number of such compensating threads is limited by default, though it can be extended via a config option. They had exhausted the default number of compensating threads, I think.

And they are mistaken to call this situation a "pinning"

JEP 444:

> The vast majority of blocking operations in the JDK will unmount the virtual thread, freeing its carrier and the underlying OS thread to take on new work. However, some blocking operations in the JDK do not unmount the virtual thread, and thus block both its carrier and the underlying OS thread. This is because of limitations at either the OS level (e.g., many filesystem operations) or the JDK level (e.g., Object.wait()). The implementations of these blocking operations compensate for the capture of the OS thread by temporarily expanding the parallelism of the scheduler. Consequently, the number of platform threads in the scheduler's ForkJoinPool may temporarily exceed the number of available processors. The maximum number of platform threads available to the scheduler can be tuned with the system property jdk.virtualThreadScheduler.maxPoolSize.

(In my testing the default ForkJoinPool limit was 256)

So theoretically they could have extended the jdk.virtualThreadScheduler.maxPoolSize to a number sufficient for the use case. Although their workaround with semaphores is probably more reliable - no need to guess the sufficient number.

The situation with Object.wait() is not what JEP 444 calls "pinning". Pinning happens, for example, when one calls `synchronized(...) {blockingQueue.take()}`, which is not sane coding, BTW. In this case the native thread is blocked and is not compensated for by another thread, which is much worse than the Object.wait() case. The number of native threads that run virtual threads is equal to the number of CPUs by default, so pinning immediately makes one CPU unavailable to the application's virtual threads.

All those issues are temporary, as I understand it. The JDK team is working to fix Object.wait(), synchronized, etc.


> The situation with Object.wait() is not what JEP 444 calls "pinning". Pinning happens, for example, when one calls `synchronized(...) {blockingQueue.take()}` [...]

To call Object.wait() you need to own the object's monitor, which implies that your code would actually look like `synchronized(...) { object.wait() }`, in which case you would indeed be pinned.


As I read JEP 444 (starting from the quote above and the several following paragraphs, ending with the words "As always, strive to keep locking policies simple and clear."), the term "pinning" means that a blocking function which normally unmounts the virtual thread fails to do so because it is called from within `synchronized` or from native code.

That's different from the blocking functions described in the quote, which do not even try to unmount the virtual thread. Like Object.wait().

Pinning is worse than those functions, because those functions compensate for a blocked native thread by adding one more native thread to the pool.


Object.wait() releases the monitor lock though. This specific case doesn't have to do with synchronized at all, but with wait() being a native call.


That makes sense to me. I'll agree to not call Object.wait() pinning.


So does C#, with active blocking detection (which injects threads to counteract this) and a hill-climbing algorithm to scale threadpool threads automatically.


Which is very handy sometimes. The default throttling is a bit conservative though and perhaps based on Windows thread costs.


It used to be the case: before .NET 6 there was only hill climbing, so poorly written blocking code could starve the threadpool very quickly (for + Task.Run + Thread.Sleep and the like). But since 6, blocking threads in such a way makes the threadpool inject more threads without going through hill climbing, mitigating the impact much more effectively. This does not mean such code should not be fixed, however :)


The warning shots across the bow were heard with this statement from the devs:

"Don't replace platform/native threads with virtual ones, replace tasks (without further explanation) instead"?!

Combine that with the fact that they chose to implement the scheduler in Java instead of C(++) and you're set for performance problems.

Remember that NIO took from 1.5 to 1.7 to be usable/performant and that was native!

Edit: Finally figured out why: https://news.ycombinator.com/item?id=39010648


> they chose to implement the scheduler in Java instead of C(++) and you're set for performance problems

The JDK has historically used some native implementations in its stdlib (zip, imageio and others), back when the runtime wasn't as fast as it is today. But today's runtime would often be faster in Java than those native implementations.


> Combine that with the fact that they chose to implement the scheduler in Java instead of C(++) and you're set for performance problems.

Ah yes, the argument from the 1990s. It would make sense to understand where the JVM and its compiler are these days before making incorrect statements about performance.

From your link:

> Blocking network TCP IO needs a synchronized block to work

This is utterly false.


So how do you implement a TCP socket?

I have always had to do synchronized(something) { socketInputStream.read(); }

And the dude himself says that reading from a socket is a problem if you listen to the interview.


Something tells me you don't know much about programming in Java. Just look up any tutorial showing how to use NIO (blocking and non-blocking).

Synchronized in this context is pretty nonsensical.


This is a common problem when migrating a system from threads to virtual threads. In general, using primitives which block the current thread and prevent forward progress can quickly lead to deadlocks. It's a hard issue to catch because, in the past, this would usually get "solved" by spawning a new thread to complete the task; but in a world with virtual threads the runtime is usually reluctant to spawn more threads, so there's nothing left to service work if you've blocked all the threads.


Is that all that's happening here? There's an implicit limit on real threads, where before it was effectively unlimited by virtue of not using the virtual threads' limited pool?

If it doesn't spawn threads when all of them are blocked, that seems kinda dumb. And a severe change in semantics. It can be conservative and try running unpinned ones on fewer threads and shuffle them around and slowly spawn more to ensure eventual progress, which would mean a possibly significant optimization problem, but a hard cap impacts correctness.


My long-held belief: green/user-level/M:N threading schemes never work at first, and only work reliably after extreme effort has been put into fixing all the cases where blocking code gets called underneath. AFAIK there are only two modern working implementations: Golang and Erlang. This article is consistent with that belief.


There are many other implementations, although in less popular languages.

The trick is to include the green threads from the start, so there are no libraries that depend on real threading. That's why Go and Erlang are so successful.


The funny thing is that Java did have green threads back in v1.1, but they were dropped in v1.3.

That doesn't invalidate your point; more than 20 years of Java practice has focused on making things work well for platform threads.


I think Solaris moved from green threads to pure kernel threads at the same time (https://docs.oracle.com/cd/E19253-01/816-5137/mtintro-75924/... says Solaris 9 was the transition point).


Go suffers the same issue when calling into native code, that is why it has APIs to deal with it.

For example, https://pkg.go.dev/runtime#LockOSThread


This seems different.

It pins goroutine until it is explicitly released ensuring that multiple native calls will remain on the same platform thread and nothing else is going to use it. This is critical for namespace manipulation on Linux.

Java only pins for duration of native call and synchronized blocks.

It looks like Java does not offer an equivalent API? For now it could be achieved with synchronized, but if synchronized were changed in the future to not pin, that would break.


Oh, actually one can just spawn a non-virtual thread to solve it.


It works well enough in Python and NodeJS.


That’s M-on-N, with N being 1. That’s basically a trivial problem in comparison.


Virtual threads were never intended as a drop-in replacement for platform threads. They offer the same API, but they are for different usage scenarios.

If you have lots of blocking I/O (meaning: waiting for things happening on other threads or processes, which offers scheduling opportunities), use virtual threads. If you compute or call native code, keep using platform threads.

The issue with synchronized is eventually going to be resolved. But long-running computations (sorting, parsing, number crunching, etc) or native calls must also in the future be offloaded to an ExecutorService with platform threads.
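
A sketch of that split (the class and task bodies are placeholders):

    import java.util.concurrent.ExecutorService;
    import java.util.concurrent.Executors;

    class WorkSplit {
        // IO-bound work: one virtual thread per task.
        static final ExecutorService IO = Executors.newVirtualThreadPerTaskExecutor();
        // CPU-bound or native work: a small fixed pool of platform threads.
        static final ExecutorService CPU = Executors.newFixedThreadPool(
                Runtime.getRuntime().availableProcessors());

        static void handle() {
            IO.submit(() -> fetchOverNetwork());  // parks and unmounts while waiting
            CPU.submit(() -> crunchNumbers());    // occupies an OS thread, as it should
        }

        static void fetchOverNetwork() { /* placeholder */ }
        static void crunchNumbers()    { /* placeholder */ }
    }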


The change in semantics is that, while in principle your OS thread will always get a turn at making progress (assuming no super-heavy spin locks etc.), that isn't true for virtual threads. The classic situation, and the one they hit in the article, is something like this:

You've got some virtual threads that encounter this code,

    synchronized(foo) {
      foo.wait()
    }
And some other virtual threads that are in charge of awaking the waiters,

    synchronized(foo) {
      operation()
      foo.notify()
    }
This is a classic approach to the producer/consumer pattern in Java.

If operation() can do a virtual-thread suspend, then it's possible for it to be suspended and relinquish the platform thread, which the scheduler then reuses for the consumer, where it gets blocked on Object.wait. If this happens enough, you can end up with all the platform threads blocked and no threads available to make progress on the producer.

The problem is that Object.wait doesn't unmount the virtual thread, which is a pretty major footgun that I think the JDK team would have liked to avoid, but it was too hard to implement correctly in the current JDK's codebase.


The only way I can see this being a problem is if the virtual threads can't be stolen from their (now pinned) carrier thread. Because otherwise all of that is true of real threads too; blocking them is the whole point of Object.wait.

If there's no work-stealing from pinned carriers (or they're low-finite and normal threads are effectively infinite): yes that'd be a HUGE issue. I would be shocked if they released anything with that limitation though, that would violate some of the core expectations of mutexes and threads - independent ones need to make progress or nearly all patterns can't guarantee progress.


From Java docs for `jdk.virtualThreadScheduler.maxPoolSize`: the default is 256.

So yeah I can see that starving rather quickly, particularly with benchmarking-like workloads. Synchronized is very very common, 256 concurrent calls really doesn't seem all that abnormal.

If that were raised to like max-int32 would things be fine, semantically? That'd mimic real threads limits (no jvm limit at all afaict).


> If there's no work-stealing from pinned carriers (or they're low-finite and normal threads are effectively infinite): yes that'd be a HUGE issue. I would be shocked if they released anything with that limitation though, that would violate some of the core expectations of mutexes and threads - independent ones need to make progress or nearly all patterns can't guarantee progress.

Correct, you can't steal the carrier thread from a virtual thread waiting in Object.wait(). This is apparently in the pipeline, but it is a pretty major limitation.

Most cases of synchronized/notify/wait should probably use concurrent collections (as message queues) instead, so in greenfield code it's not that big of a deal. Virtual threads make writing consumers/producers using collections way easier too.
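
For example, the wait/notify pair above collapses into something like this sketch:

    import java.util.concurrent.ArrayBlockingQueue;
    import java.util.concurrent.BlockingQueue;

    class QueueDemo {
        static final BlockingQueue<String> QUEUE = new ArrayBlockingQueue<>(100);

        public static void main(String[] args) throws InterruptedException {
            // Producer: no monitor held, so nothing can pin a carrier thread.
            Thread.ofVirtual().start(() -> {
                try { QUEUE.put("work item"); }
                catch (InterruptedException e) { Thread.currentThread().interrupt(); }
            });
            // Consumer: take() parks the virtual thread (BlockingQueue uses
            // j.u.c locks internally), leaving the carrier free for others.
            System.out.println(QUEUE.take());
        }
    }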

Sadly, most Java projects are not greenfield projects.


> Correct, you can't steal the carrier thread from a virtual thread waiting in Object.wait(). This is apparently in the pipeline, but it is a pretty major limitation.

I mean stealing the other virtual threads from the pinned carrier thread (all except the one pinning it) so they can make progress. Normal work-stealing stuff: the queue (thread) is blocked (pinned), so process its tasks (virtual threads) on a different queue (thread).

It makes sense that a pinned thread remains pinned with the virtual thread that pinned it.

The 256 default carrier thread limit is going to frequently be a problem though, yeah. That's more than enough to cause all this, and it's a pretty crazy default imo.


I am extremely confused.

> There are two scenarios in which a virtual thread cannot be unmounted during blocking operations because it is pinned to its carrier:

> When it executes code inside a synchronized block or method

Isn't 'synchronized' effectively sugar for taking a kind of lock? Why can't it be treated uniformly by the scheduler?


The `synchronized` by itself does not cause any problems for virtual threads.

Only when one calls a blocking operation from within synchronized is the thread not unmounted, e.g. `synchronized (...) {blockingQueue.take()}`. Note that this is not a sane coding practice anyway: blockingQueue.take() is a potentially long operation and does not need to be wrapped in synchronized. It has synchronization inside and plays well with virtual threads; only when wrapped in synchronized can the current implementation not unmount the virtual thread.
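
To make the distinction concrete (a fragment; queue and lock are assumed to exist, and InterruptedException handling is omitted):

    // Fine: take() parks the virtual thread and releases the carrier.
    var item = queue.take();

    // Pins: the same call inside synchronized blocks the carrier thread
    // for as long as take() has to wait.
    synchronized (lock) {
        var item2 = queue.take();
    }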

The JDK team is working to remove quirks like pinning in future versions.


No, synchronized is a very primitive lock implementation compared to what's available in java.util.concurrent.locks.

However, it's built directly into the JVM specification, so it's difficult to change while keeping compatibility, while j.u.c.locks is just a library. In other words, they can't change the synchronized semantics, so they created j.u.c.locks as a replacement.


> it's difficult to change while keeping compatibility

Actually, it is trivial to change. Just embed a ReentrantLock into every object and rewrite all calls to "synchronized"/"Object.wait" to use that lock.

Unfortunately, this would result in a bit of a performance regression (increasing per-object memory footprint). Solving that would require turning ReentrantLock into a magical intrinsic, fully integrated with the lock bytes in the object header. Which is actually not that hard either; other runtimes like Golang or the Android VM solve problems like this on a daily basis. Oracle, however…


... was taking years to land Project Loom. So long that people started calling it vaporware. Project Valhalla is still regarded as such by many. It had to be shipped as soon as it was usable, even though a few rough edges remain that really ought to be deburred.

As you indicate, the complexity lies in not burning too many bridges with existing users and use cases. This is something that Android regularly does and which Go never really had to do due to its shorter history and up-front design.


The j.u.c. locks have existed for a very long time already.


Since Java 1.5 in 2004. Twenty years this year. Before that it existed as a separate library developed by Doug Lea. I remember using that library before Java 1.5 was released. There is quite a bit of Java code out there that predates it, of course. Also, lots of people never grasped the essentials of that library and stuck with the primitives they knew. So there's a lot of code out there with synchronized blocks written post-Java 1.5, instead of with the more robust concurrency primitives that came with the java.util.concurrent package.


Did they get a deadlock again? https://news.ycombinator.com/item?id=38939165


Haha, but quite frankly we had one more in TPC-C for YDB. But unrelated to the virtual threads.


I wonder if HikariCP, currently the best Java DB connection pooling library, suffers the same issue as c3p0.


Why not treat this as a bug, and fix it in Java 21? For compliance reasons we can only use LTS versions, and the next one isn't until September 2025, according to https://www.oracle.com/java/technologies/java-se-support-roa....


Seems to be a similar problem field as writing blocking functions that call async functions in C#, and the co-existence of synchronous and asynchronous code.

There are numerous recommendations such as

https://learn.microsoft.com/en-us/archive/msdn-magazine/2015...

Final phase is "I hope these techniques will help you adopt async into your existing applications in a way that works best for you."


I thought the blog was great, but the "in summary" conclusion was bad.

The summary merely stated that Java virtual threads are great. I expected a summary of the problem and solution; for example, something like:

When using Java 21 virtual threads, you can end up starved of carrier threads, due to all carrier threads waiting on a pool of exhausted resources with no thread available to free those resources. The solution is to wrap such resources in a virtual-thread-aware construct. In our case, we solved the problem by wrapping connections in semaphores.


Concurrency, parallelism. These are among the most misunderstood concepts in programming/software development.

TLS (especially mutual TLS) and Oauth also join this club.


Interestingly enough, I love both mutual-TLS and OAUTH (especially OIDC).


Why is that interesting?


Because the same type of people are interested in both things.


Why are synchronized blocks not preemptible? When compiling

    public void syncMethod() {
        synchronized(lock) {
            // some code
        }
    }

they could translate it to

    public void syncMethod() {
        await reentrantLockAsync.lockAsync();
        try {
            await somecodeAsync();
        } finally {
            await reentrantLockAsync.unlockAsync();
        }
    }


The first issue is that your second snippet is not Java (there is no await/async in Java yet).

The second issue is that they're not completely equivalent. In the second case you'd need extra memory for the `reentrantLock`, while `synchronized` works with any object. Furthermore, if you need to use `wait/notify`, then there needs to be an extra `Condition` object used in combination with the `ReentrantLock`. For sure, developers can rewrite most `synchronized` usages to use `ReentrantLock` and `Condition`, but javac won't do it automatically for you.
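
A sketch of that manual rewrite (the class is illustrative):

    import java.util.concurrent.locks.Condition;
    import java.util.concurrent.locks.ReentrantLock;

    class Mailbox {
        private final ReentrantLock lock = new ReentrantLock();
        private final Condition nonEmpty = lock.newCondition();  // replaces wait/notify
        private String message;

        void put(String m) {
            lock.lock();
            try {
                message = m;
                nonEmpty.signal();    // was: notify()
            } finally {
                lock.unlock();
            }
        }

        String take() throws InterruptedException {
            lock.lock();
            try {
                while (message == null) {
                    nonEmpty.await(); // was: wait(); this can unmount a virtual thread
                }
                String m = message;
                message = null;
                return m;
            } finally {
                lock.unlock();
            }
        }
    }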


They could at least introduce a new language construct like `await synchronizedAsync(lock) { // some code }`.

C# introduced:

    await foreach (int item in RangeAsync(10, 3))
        Console.Write(item + " "); // Prints 10 11 12

so you don't have to type:

    IAsyncEnumerator<int> e = RangeAsync(10, 3).GetAsyncEnumerator();
    try {
        while (await e.MoveNextAsync()) Console.Write(e.Current + " ");
    } finally {
        if (e != null) await e.DisposeAsync();
    }


Why was c3p0 used (its latest version was released in Dec 2019)? Those tests existed for a while and people were too lazy to replace c3p0 with something newer? I guess that they spent all their time to use virtual threads in those tests and had no time left to look at c3p0.


Why couldn't JVM detect when all carrier threads are blocked, and just spawn more of them?


That already exists, luckily; you can even change the maximum number of carrier threads with:

    -Djdk.virtualThreadScheduler.maxPoolSize=10


The default is 256, way higher than 10.

But of course, when you have thousands of Virtual Threads all deliberately pinning the carrier thread, you quickly run out.


I suppose that a hard limit on the number of carrier threads is a sensible choice then - deadlock is better than creating threads until the system grinds to a halt.

But then again, why couldn't scheduler detect a deadlock? Go has a system in place that, in case of total program deadlock, prints out an error message with all goroutines' stack traces, and stops the program. Perhaps Virtual Thread Scheduler could do the same thing?

But then again, Java also allows for native threads to run in parallel to Virtual Threads, which makes it impossible to detect whether there's a deadlock, and not just virtual threads waiting on a native thread.

I suppose this is a very good example why simple is better than complex.


Sounds similar to the quirks you get with TPL in .NET under some circumstances. For library code, a ConfigureAwait(false) invoke should be considered to signify that the execution does not need to resume on the original thread.


Can someone explain what these virtual threads are in 2 sentences please?


Why two sentences? Maybe you should ask ChatGPT if you want explanations with specific length requirements.

Anyway, it's pretty simple really. A generic thread is a bunch of stack frames (with their associated local variables). A standard OS thread is under the control of the kernel scheduler, which decides whether the thread runs and makes progress or not. A VirtualThread in Java is a thread which is not directly mapped to the OS thread scheduler, but exists as a user-space object scheduled by a (Java-implemented) scheduler. It's basically just a call stack with its local variables, but one that only steps forward when an OS thread of the scheduler decides to step it.
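A tiny illustration using the JDK 21 API (the class name and values are mine):

  import java.time.Duration;
  import java.util.concurrent.ExecutorService;
  import java.util.concurrent.Executors;

  public class VirtualDemo {
      public static void main(String[] args) throws Exception {
          // One-off virtual thread: same Thread API, user-space scheduling.
          Thread vt = Thread.ofVirtual()
                            .start(() -> System.out.println("hello from " + Thread.currentThread()));
          vt.join();

          // Or one virtual thread per task:
          try (ExecutorService exec = Executors.newVirtualThreadPerTaskExecutor()) {
              exec.submit(() -> {
                  Thread.sleep(Duration.ofMillis(10)); // unmounts the carrier while sleeping
                  return 42;
              });
          } // close() waits for submitted tasks to finish
      }
  }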


They are not real threads. That is, the CPU is not context-switching them; the JVM runs them asynchronously in user space.

Threads have usually also been used for long-running I/O rather than CPU-intensive tasks. It's recommended to use virtual threads for such scenarios now.


Thanks!

> Why two sentences? Maybe you should ask ChatGPT if you want explanations with specific length requirements.

As you can see, people can do it better. I put a limit on it because I didn't want an explanation of what threads are, just of the difference.


The post you replied to had four sentences. (But two paragraphs).


The explanation had two sentences :)

Edit: why are some commenters on HN so literal-minded anyway? This is free-form chat, not code specs. You could have read "2 sentences" as "concise", you know...


Why do the dining philosophers in the image have more than two hands?


Because you're reading a low-effort Medium webshit with AI generated images


Probably the author thought that these typical AI-generation quirks were a funny wink at the concept of virtual threads. The elephants, of course, represent Postgres.


BTW, dining philosophers is an extremely clunky example.


I was looking to see how it would be relevant, and then:

> we present a case study on how we encountered a deadlock with virtual threads in TPC-C for PostgreSQL, even without the dining philosophers problem.

I guess it was a clunky non-example!

(I was hoping to see a virtual thread solution to compare to:

  https://www.adit.io/posts/2013-05-15-Locks,-Actors,-And-STM-In-Pictures.html
  https://www.youtube.com/watch?v=aQXgW55f7cg
  https://hackage.haskell.org/package/stm
)


Java virtual threads did not cause a deadlock here.

The deadlock was a usage error.

A better title would be: Naively switching to Java virtual threads caused a deadlock in TPC-C for PostgreSQL.


Submitted title was "Java virtual threads caused a deadlock in TPC-C for PostgreSQL". We've reverted it now to the article's own title (truncated to fit HN's 80 char limit).

"Please use the original title, unless it is misleading or linkbait; don't editorialize." - https://news.ycombinator.com/newsguidelines.html


I would like to reply with a quote from Sir Tony Hoare's 1980 ACM Turing Award lecture: "There are two ways of constructing a software design: One way is to make it so simple that there are obviously no deficiencies and the other way is to make it so complicated that there are no obvious deficiencies".


This one was a very large caveat pointed out loudly every time the feature was mentioned to the community. So, yes the limitation/flaw was very well known and is ideally going to be addressed in a future JDK release.


> This one was a very large caveat pointed out loudly every time the feature was mentioned to the community.

It really wasn't. There were people on here, including Oracle employees, claiming that the virtual thread implementation was a drop-in replacement that would work (not necessarily perform better, but work) in all cases.


And it indeed does in common usage scenarios. And also in this case once the issue with `synchronized` is resolved. After all, this is a benchmark and it's not surprising that one of the limitations of the design was hit.


"common usage scenarios" != "in all cases".

I don't know if those Oracle employees actually did outright say -- or even imply -- "in all cases" as the GP asserted, but if they did, then "only" working in "common usage scenarios" would definitely be overselling the feature.


I find it unlikely as well that they said it would just work in all cases. But since it's going to work out eventually and a workaround exists, they would actually not be that wrong with that statement.


If your program was prone to deadlock as is, and is just more easily happening with virt threads, it means that the problem is your code.


If your program requires 5 OS threads to be able to make progress (say, there are operations for which 4 threads are blocked while the 5th does work), and the runtime only spawns 2 OS threads and tries to schedule your 5 now virtual threads on those 2 OS threads, then your old program was perfectly correct and the virtual threads runtime has broken it.

This was a known and advertised failure case with the new threads runtime, exacerbated by limitations in the implementation that cause certain blocking operations to block the current OS thread instead of blocking the virtual thread and allowing another virtual thread to re-use the underlying OS thread.


Having X resources and Y dedicated threads that operate on them, where Y > X, and allowing a thread to block in a way that requires the assistance of another thread to make progress but only when it's holding a resource, is a perfectly reasonable, standard, and safe design. When a change to the runtime silently reduces the number of synchronized sections a program can enter concurrently, it's not at all surprising that this breaks working code.
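A contrived sketch of that failure shape with virtual threads (the numbers assume JDK 21 defaults, namely a maxPoolSize of 256; this is not the article's code):

  import java.util.concurrent.CountDownLatch;

  public class PinnedDeadlock {
      public static void main(String[] args) throws Exception {
          CountDownLatch release = new CountDownLatch(1);

          // More waiters than the scheduler's default maxPoolSize (256),
          // each blocking while holding its own monitor, which pins the
          // carrier thread on JDK 21.
          for (int i = 0; i < 300; i++) {
              Object lock = new Object();
              Thread.startVirtualThread(() -> {
                  synchronized (lock) {
                      try {
                          release.await(); // blocks, but cannot unmount while pinned
                      } catch (InterruptedException ignored) { }
                  }
              });
          }

          // This thread would unblock everyone, but it may never get a
          // carrier: all of them are occupied by pinned waiters.
          Thread releaser = Thread.startVirtualThread(release::countDown);
          releaser.join(); // likely hangs forever
      }
  }

With 300 platform threads instead, the OS scheduler guarantees every waiter eventually runs, so the same code is safe.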


Concurrency plus parallelism is complicated. There is no going around that.


Article won’t load for me. My understanding was that the switch to virtual was supposed to be relatively simple, allowing the programmer to be naive. But a user deadlock is a user deadlock, no matter the threading impl.


The issue here is that virtual threads currently don't work well with the 'synchronized' keyword. Right now, synchronized will pin the carrier thread. The fix was to switch to a higher-level abstraction that works with virtual threads.

My understanding is there is work to make synchronized not pin the carrier thread, but that's some pretty complex and important code to change.


From a relatively brief skim and past Go and Java experience: synchronized blocks the current normal thread, so that doesn't really seem any different to me. If you starve your threads, you starve your threads.

It definitely leaves room to optimize by not pinning that thread, which would be great, but that shouldn't change semantics at all. Or is there something actually screwed up in the implementation of virtual threads that makes this a much bigger issue?


It’s a thread that supports n virtual threads. You want synchronized to block only virtual thread a, not the carrier thread, which would block all of its virtual threads.

Been away from Java land for a while. How did something like that even get into release? That’s like a pretty big loaded shotgun to leave lying around with lots of kids playing, no?


It's an explicitly documented shortcoming of the existing implementation that will be fixed soon. I knew immediately from the title of TA what probably happened. The other similar limitations (CPU-bound tasks, native calls) seem much more severe, but are ultimately unsolvable. Meanwhile, the issue with synchronized is regarded as a scalability bottleneck since the JDK is supposed to temporarily spawn additional platform threads. This behavior can be controlled via the system property `jdk.virtualThreadScheduler.maxPoolSize`.

Also, this is a benchmark. It's not surprising that they managed to produce a situation where more than n_cores virtual threads would actually start waiting.


Appreciate the reply. Hope you get it out soon, since many devs do not read documentation and synchronized semantics changing is a ‘surprise’, especially since this is one of those bugs that is a nursery for heisenbugs.


It spawns a new carrier thread in its place, up to a certain configurable limit. But starving carrier threads will also effectively result in livelocks, so that’s not a solution.

So I don’t see the big fuss about it: don’t spawn a million virtual threads that all just spam synchronized?


I agree. Seems like a huge Java design error.


Java's virtual threads are supposed to be a drop-in replacement for real threads. But using virtual threads means you get a far smaller number of real threads, and things that were safe back when you had an unlimited number of real threads available (or at least, a larger number than your database connection pool) are no longer safe.


Shouldn't you be able to use the same number of real threads though, plus some additional effectively-threads for the virtual threads that are not pinned? Doesn't seem like this should change semantics there, so the risk would be code that changes because of perceived advantages which are not true in edge cases - that's new behavior that wasn't possible before, there aren't really any existing semantics to break.

If they're, like, limiting to CPU cores * 2 threads: yeah that would be Bad™. Unambiguously. I haven't been able to find anything conclusive about this though.


That's sort of what happens; there is just a configurable hard limit on how many new threads may be created, which was hit by this benchmark.

As mentioned in another comment: jdk.virtualThreadScheduler.maxPoolSize


Is there no limit (ignoring outside limits, e.g. from the OS) for normal threads? I know people usually use limited size thread pools for a variety of reasons, but I can't say that I've actually tried to exceed limits in a Java process yet...

That would indeed be a problem if it's not similarly unlimited by default. Configurable makes perfect sense, as does attempting to be conservative, but small hard-capped defaults are very obviously going to cause problems, especially while synchronized locks the carrier.


Since it's too late to edit, Java docs say:

>The maximum number of platform threads available to the scheduler. It defaults to 256.

Yeah, that's pretty small. >256 simultaneous synchronized calls doesn't seem particularly extreme, given how common its use is.

Tho now I wonder if you can just set this to max-int and resume like normal, or if giant values do awful things internally...


In this case the synchronized blocks are released by calls to Object.wait, so that code would not deadlock with normal threads.

The issue is that Object.wait doesn't unmount virtual threads from their carriers, so you get deadlocks. The answer is to reimplement usages of the wait/notify pattern with locks or concurrent collections (for example, using a concurrent message queue for the producer/consumer pattern, which is a common use case for synchronized/wait/notify), as sketched below.
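For the producer/consumer case, that rewrite can be as small as this sketch (class name and capacity are mine):

  import java.util.concurrent.ArrayBlockingQueue;
  import java.util.concurrent.BlockingQueue;

  public class QueueDemo {
      public static void main(String[] args) throws Exception {
          // Replaces a shared buffer guarded by synchronized/wait/notify.
          BlockingQueue<String> queue = new ArrayBlockingQueue<>(100);

          Thread producer = Thread.startVirtualThread(() -> {
              try {
                  queue.put("work item");           // blocks (and unmounts) when the queue is full
              } catch (InterruptedException ignored) { }
          });

          Thread consumer = Thread.startVirtualThread(() -> {
              try {
                  System.out.println(queue.take()); // blocks (and unmounts) when empty
              } catch (InterruptedException ignored) { }
          });

          producer.join();
          consumer.join();
      }
  }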


The adoption guide tells you not to switch to virtual threads just for their own sake. They're not meant as a straight replacement for OS threads.

https://docs.oracle.com/en/java/javase/21/core/virtual-threa...


No mentions of deadlocks. Just this:

> Pinning does not make an application incorrect, but it might hinder its scalability.

The documentation is wrong.


Yes, it is. That’s because their discussion of pinning is incomplete: it needs to mention forward progress.


The workaround is to increase the carrier thread pool size.


This has nothing to do with postgres though. It’s part of a generic JDBC connection pool:

> The problem is that this synchronized code might be deeply embedded within the libraries you use. In our case, it was within the c3p0 library. So, the fix is straightforward: we simply wrapped the connection with a java.util.concurrent.Semaphore.

I bet if you just checked out connections and slept a random amount of time you’d have the same problem.


What was the user error? Was there something obvious they did or didn't do or is it a "you're holding it wrong" kind of issue?


c3p0 appears to have been working under the assumption that their threads can always run when whatever they’re waiting on is done. Their only dependency is the OS giving them cycles.

Virtual threads changed the contract a little bit. Now one virtual thread running certain code can prevent a different virtual thread from ever getting any cycles even though they are not dependent on each other in the Java code. It’s a side effect of the current Java implementation.

The rules changed, and it tripped up c3p0. Unless they explicitly said somewhere that they were completely ready for virtual threads I’m not sure anyone is at fault here.


It is hard to be confident whether the entire dependency tree is free of this issue.


"Our PostgreSQL TPC-C implementation utilizes c3p0 for connection pooling...The problem is that...synchronized code might be deeply embedded within the libraries you use. In our case, it was within the c3p0 library. So, the fix is straightforward: we simply wrapped the connection with a java.util.concurrent.Semaphore. With this change, virtual threads are blocked on the semaphore and, crucially, release the carrier thread instead of delving inside c3p0. Thus, we never block inside c3p0 because we enter c3p0 code only when there is a free session available."


"Why not to use java virtual threads" -> Fundamental features of the language treated as implementation details in libraries for more than two decades cause deadlock.


[flagged]


"Please don't complain about tangential annoyances—e.g. article or website formats, name collisions, or back-button breakage. They're too common to be interesting."

https://news.ycombinator.com/newsguidelines.html


Is this really so distracting to you that you cannot appreciate the content of the article? Just ignore it. Why are you in charge of how people express themselves on the internet?

We are in the goofy GeoCities days of AI. The art is strange and often illegal. The chatbots are vulnerable to "Grandma attacks". Enjoy the madness before the corpos kill the fun.


I agree on an emotional level, but now I’m starting to wonder exactly why. Something similar from an illustrator would not be provoking to me.

Still working it out, but here’s a theory: when I’m sifting through stuff I’m looking for things that people spent time on. I have a pretty decent radar for effort, and effort is a good proxy for interesting. Even if I find it wrong or insightless, I still respect that people spend the time to express themselves.

However, these AI images are tricking my effort-heuristic, and I’m annoyed that my attention has been temporarily hijacked by something someone spent 2 seconds on. This isn’t a new phenomenon; I’ve always been like this when I stumble across e.g. ads and low-effort content marketing.

I think this cringe/cheesy feeling might be more a reaction to how it was made than the content itself.


I, for myself, loved the illustrations.


I’ve come to see a lot of ai generated images as a proxy for effort, or care, and whenever I see them, my first thought is mostly “the author doesn’t care enough to make an effort about the presentation of their post”.

Same reason I stopped using GitHub Copilot to write PR bodies. I had a teammate say, “I know you didn’t put any effort or thought into writing it, so why should I bother reading it?”, which I resonate with.


I'm reminded of the way expensive restaurants use very simple descriptions, while less expensive ones load them up with superlatives and (uninformative) adjectives. The high end is confident enough that you assume it will be good.

The AI art assumes you need art to think that they've put in effort. More confidence would allow for simplicity.

But I cannot say if that confidence would be well placed. For all I know people really do think the art adds class.


Your comparison is indeed thoughtful. I would note a counterexample, though: an expensive restaurant with a very simple description gives no guarantee that you will be delighted, while some inexpensive places (possibly with basic ads) might offer something very delightful.


I would suggest looking at it from another perspective. Writing a text that precisely describes the image you want might require a lot of effort. Crafting this text to get as close as possible to the imagined picture also requires effort. Also, consider that some people can't draw (I also can't sing) but can write. AI gives a nice opportunity to add some color and visual humor, which is, in my humble opinion, great. But no doubt there are many examples of AI misuse (or overuse) which match your description.


I agree with your second point. But in this case, the AI art is a pure decoration.

Unless the author has some actual artistic talent, to me it feels like a waste of time to spend more than a few minutes coming up with this. Some philosophers drinking Java coffee and playing with the PostgreSQL elephant: it doesn't even illustrate anything specific about the issue at hand.


[flagged]


20 other commenters already made that observation https://news.ycombinator.com/item?id=39009513


[flagged]


You can code in async/await style in Java too, using `CompletableFuture.supplyAsync` and `CompletableFuture.get`

Using Virtual Threads is a choice, it's not forced on to you.
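A rough example of that style (the fetch helper is a placeholder, not a real API):

  import java.util.concurrent.CompletableFuture;

  public class FutureStyle {
      public static void main(String[] args) {
          CompletableFuture<String> page =
              CompletableFuture.supplyAsync(() -> fetch("https://example.com")) // runs on ForkJoinPool.commonPool()
                               .thenApply(String::toUpperCase);                 // composes without blocking

          System.out.println(page.join()); // blocks the calling thread until the result is ready
      }

      // Placeholder for some blocking I/O call.
      static String fetch(String url) {
          return "body of " + url;
      }
  }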


Personally, I am curious how features like virtual threads are tested during development.


The model has been extensively tested in TLA+, which can reason about all possible timing combinations, among other things.


Oh. Interesting. Never heard of it.


I think i have a more elegant solution for this deadlock:

"Switch to haskell".


If serious: that’s dismissive, superior, and a low-effort appeal to Haskell fans.

If not serious: it’s still low effort, but while it is framed as a zinger, it’s not funny at all. I don’t even understand what the humor might be, maybe it’s serious after all.


It would have been better to write:

Switch to a language/runtime that's not only had virtual threads for decades, but also a saner synchronisation model (transactions) rather than synchronized blocks.


Every language with sufficiently powerful concurrency and parallelism primitives is prone to deadlocks, livelocks, and other kinds of race conditions.


Exactly! It's the same thing with memory management primitives. If you expose malloc and free directly to an application programmer, things will eventually get buggy.

It's better to hide the locking primitives and let the runtime handle it for you safely.


Erlang is based on virtual threads (confusingly called processes). The Erlang virtual machine schedules them on OS threads. Erlang processes communicate using message passing, preventing deadlocks. You can use millions of Erlang processes without problems, e.g., to handle millions of Elixir LiveView sessions.


Erlang has the advantage that it was built around processes and is effectively preemptive. Processes can be descheduled any time they make a function call or use receive to get or wait for messages; and since it's a functional language, only a finite number of instructions can run before a function call.

Other languages that add virtual threads later in life don't have the same ability to feel preemptive, although I think someone said Java has a nice trick or two?

Anyway, if all the virtual threads seem preemptive, you won't have the case where your limited number of actual threads are waiting on locks and not yielding: all Erlang processes yield eventually, usually in a fairly short time frame.


You can have deadlock in Erlang, it's just a bit harder. It happens when two processes are both waiting on the other to send them a message which is analogous to two threads each waiting for a mutex the other holds. The same thing can happen in Go with its channels, another message passing based concurrency control mechanism.
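The same shape is easy to reproduce in Java with blocking queues standing in for mailboxes (a contrived sketch, all names mine):

  import java.util.concurrent.SynchronousQueue;

  public class MailboxDeadlock {
      public static void main(String[] args) throws Exception {
          SynchronousQueue<String> toA = new SynchronousQueue<>();
          SynchronousQueue<String> toB = new SynchronousQueue<>();

          Thread a = Thread.startVirtualThread(() -> {
              try {
                  String msg = toA.take(); // waits for B to speak first
                  toB.put("reply to " + msg);
              } catch (InterruptedException ignored) { }
          });

          Thread b = Thread.startVirtualThread(() -> {
              try {
                  String msg = toB.take(); // waits for A to speak first
                  toA.put("reply to " + msg);
              } catch (InterruptedException ignored) { }
          });

          a.join(); // never returns: each side waits on the other's message
          b.join();
      }
  }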


Two deadlocked processes won't exhaust the thread pool in Erlang, they will simply never wake up.


Sure, you can make deadlocks in any language, but it's uncommon in Erlang. Shared state is the exception, and message passing means that things that manage state, such as gen_servers, only process one message at a time from their inbox.

Contrast this with languages like Java where every object is a potential concurrency problem. Or the 10+ years of trying to make Python async (see Twisted).


This feels like something I could read on Wikipedia about Erlang, how does this add anything to the topic?


It all comes at the cost of significantly lower throughput in Erlang's case.

Also, this is more of a user error, than a fundamental issue.



