Some of these are probably fingerprinting but the Twitch one isn't (I worked on the video system at Twitch for a number of years).
"player-core-variant-....js" is the Javascript video player and it uses WebGL as a way to guess what video format the browser can handle. A lot of the times mobile android devices will say "I can play video X, Y, Z" but when sent a bitstream of Y it fails to decode it. WebGL allows you to get a good indication of the supported hardware features which you can use to guess what the actual underlying hardware decoder is.
This is sadly the state of a lot of "lower end" mobile SoCs. They will pretend to do anything but in reality it just... doesn't.
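A hedged illustration of the kind of probe described above (not Twitch's actual code): ask WebGL for the unmasked renderer string and treat it as a hint about the underlying SoC and hardware decoder. The "Adreno" example value is purely illustrative.

    // Query the unmasked renderer via the WEBGL_debug_renderer_info extension.
    function guessRenderer() {
      const gl = document.createElement("canvas").getContext("webgl");
      if (!gl) return null;
      const ext = gl.getExtension("WEBGL_debug_renderer_info");
      if (!ext) return null;
      return gl.getParameter(ext.UNMASKED_RENDERER_WEBGL); // e.g. "Adreno (TM) 530"
    }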
JavaScript legitimately needs to know about its runtime environment. The problem with fingerprinting is not the act of examining the environment itself, but rather sending the results of that examination to the server as a form of identification. I would rather confront the second problem more directly with a "per tab Little Snitch" type solution that eliminates communication that is not in the user's interest, rather than eliminate fingerprinting, for precisely the reasons you give, slashink.
Unfortunately web apps can defeat that by bundling unwanted telemetry type stuff in with other API calls, directly or by batching. So, it would need to be a complex tool to deal with that or suffer limited applicability. Perhaps if it stayed under the radar it could be effective in many cases without instigating countermeasures by site owners.
Let's not make perfect the enemy of good. Counter-measures like user-interactive, request-specific firewalls (like Little Snitch) can always be defeated by a motivated malefactor willing to commit resources. That does not mean it isn't worth doing.
Consider that virtually all physical locks are trivial to pick by someone who knows how (see youtube.com/lockpickinglawyer), and yet we still use locks. Pickable locks improve security because they increase the cost to the attacker enough that it deters most attacks.
I write webapps for a living. If a browser plugin wanted to selectively allow XHR/fetch calls based on payload, there is very little I could do about it.
The implementation might be to have your plugin content script wrap the DOM XHR/fetch in a proxy. The proxy runs a predicate on the payload, and if true, lets it go through. The predicate would be something like "No PII", which would also imply that the traffic be unencrypted.
An app could remove the proxy. But it seems to me that most people wouldn't bother. It's also possible that there are other mechanisms, for example a special Plugin IO API that cannot be changed by content scripts.
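A minimal sketch of the proxy idea above, assuming a content script and a hypothetical looksLikePII predicate (the regex is purely illustrative, not a real policy):

    // Wrap window.fetch in a Proxy and block requests whose payload fails a predicate.
    const looksLikePII = (body) =>
      typeof body === "string" && /email|renderer|gpu|fingerprint/i.test(body);

    const realFetch = window.fetch;
    window.fetch = new Proxy(realFetch, {
      apply(target, thisArg, args) {
        const [, init] = args;
        if (init && looksLikePII(init.body)) {
          // A real tool might prompt the user here instead of rejecting outright.
          return Promise.reject(new TypeError("Request blocked by policy"));
        }
        return Reflect.apply(target, thisArg, args);
      },
    });

An XMLHttpRequest.prototype.send wrapper would need the same treatment.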
I’d imagine most people in this thread do or have. Myself included. It’s a pretty massive industry :)
What you’re missing is that whatever you do to remove fingerprints does itself add a unique metric to fingerprint. This is also compounded by how easy, cheap and legal it is to add fingerprinting tech to one’s site. Literally the only way to break fingerprinting would be if the majority of the web browsing population ran systems that randomised fake responses. But as it stands at the moment, it’s possible to:
1. Identify when a plug-in is overloading a builtin function
2. Identify which users are consistently doing so, because so few are, and because there are methods of fingerprinting that exist outside of your JS VM.
I don’t have the link to hand but there’s a website that you can visit and it tells you how identifiable you are. I used to think it was possible to hide until I visited that site and then I realised just how many different metrics they collect and how a great many of them are literally impossible to block or rewrite without breaking the website entirely.
It goes into more detail than the EFF link where it breaks down your uniqueness per each metric (and how much entropy each metric adds) as well as giving you an overall uniqueness.
The cover your tracks implementation also breaks down uniqueness per category, per metric.
They are nice resources, but don't get too scared!
Frankly, both are exaggerating a little - e.g. including stuff like browser version numbers which only appear as unique as they do because the time-span they cover is long enough to overlap update cycles (AmIUnique even seems to have it cover the entire history by default??? That's just noise), yet not stable for more than a short period of time. AmIUnique includes the exact referer, which is likely not nearly as useful as that would make it seem.
Then there's stuff like "Upgrade Insecure Requests" and "Do not track", which is likely extremely highly correlated with browser version choice.
Both sites can't really tell you how reliable the identification is, only how unique you are at this moment. And that matters a lot, because if identification is unreliable (i.e. the same person in some metric has multiple distinct fingerprints) the end result is that for reliable overall identification a fingerprinter may need many times as many bits of entropy as a naive estimate might assume, especially if visits are occasionally sparse and thus changes to fingerprints may frequently come all at once.
Clearly over the very short term you are likely uniquely identifiable as a visitor. However, it's less clear how stable that is.
uMatrix. It does what you describe and I always use it.
But the solution isn't good per se.
It provides a high level of granularity, and it could theoretically provide even more. But it's already an advanced tool that an average user will never use.
Not all JavaScript does, and the kind that does isn't necessarily something I asked for. I would be plenty happy if GitHub and Stripe couldn't show their 3D globe animations until I request them, for the sake of privacy.
After realizing what an unbelievable CPU hog the Github globe is, I simply added an element hiding rule for that crap. Not sure if element hiding rules help prevent fingerprinting from such sources.
>I can imagine that having several typical configs and switching between them at random would help blend in.
You have to be careful with that too. An anti-anti-fingerprinting implementation can record the values and compare them across visits to see whether they stay the same. If they change every few months that's reasonable (eg. changing hardware), but if they change every day or every week there's most certainly spoofing involved.
Unless a major anti-fingerprinting solution uses the same list of GPUs as you, doing this puts you in a tiny bucket and provides massive entropy to trackers, possibly even enough to exactly identify you given many WebGL calls.
You could seed your random number generator with a hash of the hostname, guaranteeing consistency across all the random values you return to the one host.
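A rough sketch of that idea, using a simple FNV-1a hash and a mulberry32 PRNG (both just illustrative choices, not part of any existing extension):

    // Derive a deterministic per-site seed so spoofed values stay consistent
    // across visits to the same host.
    function fnv1a(str) {
      let h = 0x811c9dc5;
      for (let i = 0; i < str.length; i++) {
        h ^= str.charCodeAt(i);
        h = Math.imul(h, 0x01000193);
      }
      return h >>> 0;
    }
    function mulberry32(seed) {
      return function () {
        seed = (seed + 0x6d2b79f5) | 0;
        let t = Math.imul(seed ^ (seed >>> 15), 1 | seed);
        t = (t + Math.imul(t ^ (t >>> 7), 61 | t)) ^ t;
        return ((t ^ (t >>> 14)) >>> 0) / 4294967296;
      };
    }
    // Same host -> same sequence of "random" spoofed values on every visit.
    const rand = mulberry32(fnv1a(location.hostname));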
Don't bother. It's hard to do it correctly. If you look through the snippets (or the MDN docs[1]), the value is retrieved using the getParameter() function. You might be tempted to override the function by doing something like
gl.getParameter = () => "test"
but that's easily detectable. If you run
gl.getParameter.toString()
You get back
"() => "test""
whereas for the original function you get back
"function getParameter() { [native code] }"
In general, don't try to fix fingerprinting via content scripts[2]. It's very much detectable. Your best bet is a browser that handles it natively.
You can easily hide it by hijacking Function.prototype.toString to check whether `this == fake gl.getParameter or this == fake toString`. Then the JS code needs to find a real Function.prototype.toString by creating an iframe, but then you can detect that too. Then I'm out of ideas on how to rescue the original toString.
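A sketch of the toString-hiding trick described above (spoofed values and heuristics are illustrative; this is not any extension's actual code):

    // Spoof gl.getParameter, then patch Function.prototype.toString so both the
    // spoof and the patched toString claim to be native code.
    const realGetParameter = WebGLRenderingContext.prototype.getParameter;
    const fakeGetParameter = function getParameter(pname) {
      if (pname === 0x9245 /* UNMASKED_VENDOR_WEBGL */) return "Google Inc.";
      if (pname === 0x9246 /* UNMASKED_RENDERER_WEBGL */) return "ANGLE (Generic Renderer)";
      return realGetParameter.call(this, pname);
    };
    WebGLRenderingContext.prototype.getParameter = fakeGetParameter;

    const realToString = Function.prototype.toString;
    const fakeToString = function toString() {
      if (this === fakeGetParameter || this === fakeToString) {
        return "function " + this.name + "() { [native code] }";
      }
      return realToString.call(this);
    };
    Function.prototype.toString = fakeToString;

As noted, a page can still sidestep this by pulling a clean Function.prototype.toString out of a freshly created iframe.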
So the issue is that the fingerprinting code can detect the anti-fingerprinting code? Doesn't that mean the best solution is for everyone to override the same functions with the same dummy information?
Agreed: fingerprinting is using ways one browser or device consistently differs from others to derive a stable identifier.
Several others on this list are also not used for fingerprinting, and are instead detecting robots/spam/abuse. Unfortunately, there isn't any technical way for the public to verify that, because client-side all it looks like is collecting a bunch of high-entropy signals.
All the major browsers have said they intend to remove identifying bits to where fingerprinting is not possible, which will also make these other uses stop working.
That was always a short-term hack anyway. The limiting case is that spammers simply pay humans somewhere to continue whatever "abuse" was formerly automated, as currently happens with captchas.
That's a fair take. In a sense it is a "fingerprinting" method, although I personally think fingerprinting implies using this data to track devices between contexts.
If you're interested why this data was exposed in the first place, the MDN docs has good info https://developer.mozilla.org/en-US/docs/Web/API/WEBGL_debug... . "Generally, the graphics driver information should only be used in edge cases to optimize your WebGL content or to debug GPU problems."
Sadly that's the reality of some of these tools. The intent was good and in many cases they are a necessity to create a web experience that works on every device. On the flip side this allows people to use this data to fingerprint.
As per my sibling (cousin?) comment, pretty much any legitimate capability-determining data can be used for fingerprinting.
This can only be solved with legislation, IMO. There is no way for an industry to self-regulate something like this; the candy bowl is too big and the candy too sweet.
Yeah, I think the world would be better off without ads altogether. Just allow people to search for what they need or want by themselves, that's enough.
>...and pay out of pocket for the use of a search engine?
That's the only way service providers will see a cent from me going forward. Ads present not only a privacy risk but they're increasingly becoming a security risk too. I will not allow them on any of the devices I own or that connect to my home WiFi.
I think that static ads served from the first-party CDN are security-wise no worse than the content itself.
Blocking scripts and requests by third-party ad networks makes complete sense from security perspective, though.
Affiliate links going directly to relevant item pages in a store are fine by me, too. They have to be relevant for anyone to click on them, they don't play video or make sound, etc. They do give some tracking opportunity, but without third-party cookies and third-party requests, it's hard to achieve anything resembling the privacy-invading precise tracking which current ad networks routinely do.
In any case, I much prefer the absence of ads and an honest donation button.
More like (the spirit of) GDPR, where data collection itself becomes a legal and financial liability to point where it's not worth it to collect and retain it for a typical entity.
So absolutely no data on the device obtained via webGL is used for marketing or other BI workloads? None of it is shared with 3rd parties (especially advertisers)? It's entirely used for the user experience and then discarded when no longer relevant?
FWIW while I'm playing hardball here I really appreciate your answer and expertise.
I don't work at Twitch anymore so I can't answer your question without guessing and I rather not.
There is always a likelihood that data gets used for reasons beyond the original purpose. In the best of worlds, the hardware that runs on consumers' devices would do the right thing, which would allow the web to be a perfect sandbox. I think we're slowly getting there; in terms of video it's getting to a point where H.264 support is "universally true" rather than a minefield, although VP9 and AV1 are a bit of a return to square one.
I think the spirit of my original comment was not to say "I promise that X company isn't doing Y" more to explain why this code existed in the first place. A search engine doesn't need to know what WebGL capabilities a device has as it doesn't deal with rendering whereas a site that has to work with hardware decoders most likely does need to know.
I didn't interpret the comment as an accusation against my original comment.
It's good to ask the hard questions and even though I'm not able to answer it in detail I still think that 'tmpz22' brought up a good point in that data can be used for both good and bad at the same time.
Considering companies have lied before when it comes to privacy & ad tracking (see Facebook's "promise" of not using 2FA phone numbers for advertising purposes) his concerns are totally reasonable.
I understand the point you are trying to make, with the question incorporating an accusation (i.e. that the person beat their wife in the past). That is different from asking pointed questions about potential actions (e.g. "have you ever beat your wife?").
So the problem here is a bit different. It's not that devices will say "I can play Format X" and then not play it. It's that devices say "I can play Format X at Resolution A, B, C". When you give the device resolution A and B it succeeds but at resolution C it fails to decode it.
In H.264 this would be the "Level" https://en.wikipedia.org/wiki/Advanced_Video_Coding#Levels . A device may say that it can decode Level 4.2 but in reality it can only do 4.1. That means it can play back 1080p30 but not 1080p60. The only way to know is to actually try and observe the failure (which, btw, is often a silent failure from the browser's point of view, meaning you need to rely on user reports).
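For illustration, the Level lives in the codec string the player hands to MSE; the exact strings below are just examples:

    // avc1.64002A = High profile, Level 4.2; avc1.640029 = High profile, Level 4.1.
    // A device may answer true for the 4.2 string and still silently fail to decode it.
    const claims42 = MediaSource.isTypeSupported('video/mp4; codecs="avc1.64002A"');
    const claims41 = MediaSource.isTypeSupported('video/mp4; codecs="avc1.640029"');
    console.log({ claims42, claims41 });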
Wouldn't it be just as easy to test videos in formats A, B, and C and see if they play? You could check that video.currentTime advances. If it lies about that, you could draw to a canvas and check the values. That seems more robust than checking WebGL.
The issue here is the architectural difference between the hardware decoder and the GPU. What happens under the hood with MSE ( https://developer.mozilla.org/en-US/docs/Web/API/Media_Sourc...) is that you are responsible for handing off a buffer to the hardware decoder as a bitstream. Underneath, the GPU sets up a texture and sends the bitstream to the hardware decoder that's responsible for painting the decoded video into that texture.
What often ends up happening is that the GPU driver says "yes, the hardware decoder can do this", accepts the bitstream, and sets up the texture for you, which is bound against your canvas in HTML. It starts playing the video and moves the timeline playhead, but the actual buffer is just an empty black texture. From the software's point of view the pipeline is doing what it's supposed to; because the hardware decoder is a black box from the Javascript perspective, it's impossible to know if it "actually" worked. Good decoders will throw errors or refuse to advance the PTS; bad decoders won't.
Knowing this, your second suggestion was to read back the canvas and detect video. That would work, but the problem here is "what constitutes working video". We can detect if the video is just a black box, but what if the video plays back at 1 frame per second, or plays back with the wrong colors? It's impossible to know without knowing the exact source content, a luxury that a UGC platform like Twitch does not have.
For this reason just doing heuristics with WebGL is often the "best" path to detecting bad actors when it comes to decoders.
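For context, a minimal MSE hand-off looks roughly like this (codec string and URL are placeholders); note that nothing here tells the JS whether frames are actually being painted:

    const video = document.querySelector("video");
    const ms = new MediaSource();
    video.src = URL.createObjectURL(ms);
    ms.addEventListener("sourceopen", async () => {
      const sb = ms.addSourceBuffer('video/mp4; codecs="avc1.64002A"'); // placeholder codec
      const seg = await (await fetch("/segments/init.mp4")).arrayBuffer(); // placeholder URL
      sb.appendBuffer(seg); // hand the bitstream to the (black-box) hardware decoder
    });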
My point with the video-to-canvas idea is that if you create samples of the various formats in various resolutions, then you can check a video with known content (solid red on top, solid green on left, solid blue on right, solid yellow on bottom) and check whether that video works. If it does, then other videos of the same format/res should render? I've written conformance tests that do this.
At worst it seems like you'd need to do this once per format per user per device, but only if that user hasn't already had the test for that video size/format (save a cookie/IndexedDB/local-storage entry recording that their device supports that format), so after that only new sizes and formats need to be checked.
Just an idea. No idea what problems would crop up.
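A rough sketch of that conformance-style check (sample points and tolerance are made up for illustration; the test video must be CORS-clean or getImageData will throw):

    // Draw the current video frame into a canvas and verify the known test pattern.
    function closeTo([r, g, b], [er, eg, eb], tol = 40) {
      return Math.abs(r - er) < tol && Math.abs(g - eg) < tol && Math.abs(b - eb) < tol;
    }
    function frameMatchesPattern(video) {
      const canvas = document.createElement("canvas");
      canvas.width = video.videoWidth;
      canvas.height = video.videoHeight;
      const ctx = canvas.getContext("2d");
      ctx.drawImage(video, 0, 0);
      const px = (x, y) => ctx.getImageData(Math.floor(x), Math.floor(y), 1, 1).data;
      const w = canvas.width, h = canvas.height;
      return closeTo(px(w / 2, h * 0.1), [255, 0, 0]) &&   // red on top
             closeTo(px(w * 0.1, h / 2), [0, 255, 0]) &&   // green on left
             closeTo(px(w * 0.9, h / 2), [0, 0, 255]) &&   // blue on right
             closeTo(px(w / 2, h * 0.9), [255, 255, 0]);   // yellow on bottom
    }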
That said, surely this "functional" information can also be valuable fingerprinting data, no? What's stopping an enterprising data science team from pulling it into their data lake, using to build ad models for logged-out users, maybe submitting it to 3rd-party machine learning vendors, etc. and generally making it undeletable?
Since there is no way to 100% fingerprint a device, there is no way to uniquely identify anyone with 100% confidence using pure fingerprinting techniques.
My view is that fingerprinting is a set of tools which can be used for "good or evil" if that makes sense. If you are gathering meta-data to determine the capabilities of the device, then this is part of the wider framework of data points which can, in principle, be used for fingerprinting a user. This data can be imported into a completely different system by a sophisticated adversary, so it needs to be treated as a security vector, imho
>Since there is no way to 100% fingerprint a device, there is no way to uniquely identify anyone with 100% confidence using pure fingerprinting techniques.
Pedantic point, so forgive me, but 100% uniquely identifying a device does not imply 100% uniquely identifying the user of the device. We call them User-Agents for a reason. Anyone could be using it.
It's critical people not fall into the habit of conflating users and user-agents. Two completely different things, and increasingly, law enforcement has gotten more and more gung-ho about surreptitiously forgetting the difference.
Ad networks and device/User-Agent based surveillance only makes it worse.
There are several initiatives to implement UUIDs for devices. There is the Android Advertising ID, systemd's machine-id file, and Intel burns a unique identifier into every CPU.
IPv6 (without address randomization) would also work as a poor man's UUID.
It's frighteningly easy, and you'll be surprised how unintentionally one can be implementing something seemingly innocent and end up furthering the purposes of those seeking to surveil.
- look at the statistical behavior of how they operate the mouse
- estimate their reading speed based on their scrolling
- for mobile devices, use the IMU to fingerprint their walking gait and the angle at which they hold the phone (the IMU needs no permissions; see the sketch after this list)
- measure how the IMU responds at the exact moment a touch event occurs. this tells you a quite a bit about how they hold their phone
- if they ever accidentally drop their phone, use the IMU to detect that and measure the fall time, which tells you the distance from the ground to the height they held the phone. then assuming the phone is held normal to the eyes, you can use the angle they held the phone to extrapolate the location of the eyes and estimate the user's approximate height
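A sketch of how the raw IMU data above reaches a page (many Android browsers show no prompt; iOS Safari gates it behind DeviceMotionEvent.requestPermission()):

    // Collecting raw accelerometer samples is all a gait/drop estimator needs to start.
    const samples = [];
    window.addEventListener("devicemotion", (e) => {
      const a = e.accelerationIncludingGravity;
      if (!a) return;
      samples.push({ t: e.timeStamp, x: a.x, y: a.y, z: a.z });
    });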
That's a lot of extraneous data to be adding to a stream leaving the phone. (Or dumping to a locally stored db file.), but you're technically correct, though not infallibly so.
The level of noise is incredibly problematic. My leading cause of dropped phone, for instance, is forgetting I have it in my shirt pocket, on my lap, off my desk, or from my back pocket if I don't put it in just right. Am I a different person in each of those circumstances? The statistical answer would be no, but that answer only comes from widening the scope of collected data. Suppose I fiddle with it? Dance with it? Have a habit of leaving it in a car? Without a control, you have a different set of relative patterns. At best you know there is a user with X. Yes, you can make some statistical assumptions, but at best, when it really counts, it still needs to line up with a hell of a lot of other circumstantial datapoints to hold water.
Furthermore, I guarantee not a single person would dare make any high impact assumption based on that metadata given that once it gets out, it's so adversarially exploitable it isn't even funny. Imagine a phone unlock you could do just by changing your gait. Or worse, a phone that locks the moment you get a cramp or blister. Madness. Getting different ads because you started walking like someone else for a bit. Do I become a different person because I try to read something without my glasses, or dwell on a passage to re-read it several times? Or blaze through a section because I already know where it is going? These are not slam dunk "fingerprints" by a long shot. They're more like corroborating data than anything else, and in that sense even more dangerous, because people are not at all naturally inclined to look at these things with a sense of perspective. It can lead a group of non-data-savvy folks to thinking there is a much cleaner, tighter case than there necessarily is, and on top of that, it mandates that people be okay with the gathering of that data in the first place, which has only been acceptable up until now because there was no social imperative to disclose that collection.
Going off on a tangent here, so I'll close with the following.
There is the argument to be made that that exact kind of practice is why defensive software analysis should be taught as a matter of basic existence nowadays. If I find symbols that line up with libraries or namespaces that access those resources, why should I be running that software in the first place?
I can't overstate this: over 90% of the software I come across I won't even recommend anymore without digging into it first. There's just too much willingness to spread data around and repurpose it for revenue extraction. It does more harm than good. What people don't know can most certainly hurt them, and software is a goldmine for creating profitable information asymmetries.
> My leading cause of dropped phone, for instance, is forgetting I have it in my shirt pocket, on my lap, off my desk, or from my back pocket if I don't put it in just right. Am I a different person in each of those circumstances? The statistical answer would be no, but that answer only comes from widening the scope of collected data. Suppose I fiddle with it? Dance with it? Have a habit of leaving it in a car? Without a control,
Oh, but all of these can be added to your statistical model and learned over time! If we figure out that you suddenly walk with a limp, and all the other metrics match, we can recommend painkillers! Or if the other metrics match and you start dancing, we start recommending dance instructors! Hell, we can even figure out how well you dance using the IMU and recommend classes of the appropriate skill level.
For a recommendation system, like ads, the consequences of mis-identification wouldn't be that high either. You'd still target much better than random, which is the alternative in the absence of fingerprinting.
Fingerprinting works because devices are surprisingly easy to identify just by enumerating their capabilities. If you are collecting this data, you are likely fingerprinting (read: uniquely identifying) machines even if you aren't trying to.
The same is true of humans, by the way. Even something as innocuous as surname, gender, and county of residence could be enough.
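A back-of-envelope illustration (all counts are rough assumptions for the sake of the example, not census figures):

    // gender ~1 bit, ~3,000 US counties ~11.5 bits, a moderately common surname ~10 bits
    const combos = 2 * 3000 * 1000;            // rough number of distinct combinations
    console.log(Math.log2(combos).toFixed(1)); // ≈ 22.5 bits
    console.log(combos);                       // ≈ 1 in 6,000,000, before any correlations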
It would only be fingerprinting if the "fingerprint" is persisted alongside some other information about you as a user, and subsequently used in attempts to identify other activity as belonging to said user. That is not at all what was implied by the approach described above (which would just be used at the time of initializing every video streaming session).
There is zero evidence any of these are being used for fingerprinting, which is defined as building up an entire set of capabilities to uniquely identify a user.
Essentially all of these seem to be attempting to identify the graphics card, and all of these could be related to feature detection for an embedded video player, for example. Feature detection is not fingerprinting.
Fingerprinting is pretty easy to confirm, because it collects a large combination of data points (fonts installed, canvas rendering quirks, etc.) and either reports or hashes them all together. (That doesn't mean it's easy to find the code that does the fingerprinting, but once you've found it it's quite obvious what it's doing.)
It is deeply irresponsible and misleading of the author to claim fingerprinting without looking for that type of combination. And slapping "No claims made about accuracy" at the top of the page isn't an excuse.
> [...] and all of these could be related to feature detection for an embedded video player, for example.
I agree the OP's analysis itself is a bit shallow, but I don't see how that can be related. If you embed a video and do not do anything with its output then WebGL is simply unrelated. In fact most of the websites in question do not seem to use WebGL visibly. That fact, combined with a direct check for WEBGL_debug_renderer_info and so on, should raise suspicion. Explicit alternative explanations would be needed to prove innocence.
Literally the current top comment for this article explains how WebGL values are used to more accurately determine which video stream is best to send to the video player, since other methods can return inaccurate or insufficient information. Because you want to make sure the video format has hardware acceleration for decoding, to not destroy a mobile user's battery.
So that's how they're related.
And let's not assume guilt and then require proving innocence? It works the other way around. It's the burden of the person making the accusation to form a strong case. Which is not done here at all.
Oh, I missed that comment---I thought the top comment was same at that time, but it has changed since then. That suffices as "explicit alternative explanations". Thank you for pointing it out.
I still believe that the presumption of innocence should not apply in this case, because fingerprinting is already close to guilty (if you don't agree to this premise there is no point for further discussion, so please refrain from arguing against this specific point). You are right that feature detection is not fingerprinting but they are virtually indistinguishable; given how widespread fingerprinting has been, it was enough to suspect so.
It appears that your null hypothesis embraces the benevolence of tech companies. Is this a reasonable assumption? How, after all, do they make their money?
You could take literally any line of code and make an unfounded claim. "It calculates a hash, and fingerprints use hashes!" "It stores a variable, and analytics uses variables!"
It's on the burden of the person making an accusation of bad behavior to actually demonstrate that. Otherwise it's no different from me declaring you're an evil hacker because you comment on Hacker News, guilty until proven innocent.
The null hypothesis determines the "unfounded claim". For example, judicially, the null hypothesis is, "You are innocent until proven guilty in a court of law." Similarly, commercially, the null hypothesis is, "If it's profitable and mostly legal, corporations will compete to do it better."
Fingerprinting is both profitable and legal. It is so profitable and so legal that today's most dominant corporations, entities representing trillions of dollars of value, are founded on its premise.
The "unfounded claim", therefore, is yours. Or do you have any evidence that you are not being surveilled?
Author doesn't actually know what this information is being used for. Sure there can be a discussion about whether this information should be recorded at all (say, to determine if your site doesn't work on certain hardware), but claiming it _is_ fingerprinting is baseless.
It's quite tough to know for certain if you're being fingerprinted.
I worked on finding ways around fingerprinting at a previous job. The problem is that sites go out of their way to hide fingerprinting, like performing it in an arbitrary redirect and then never doing it again, or only doing it after you've used the site for a bit, and they prefer doing it before an important operation like making a payment.
Did you find any realistic way a person can prevent being fingerprinted? Most ideas I have seen, like the Tor browser, focus on changing the fingerprint and not so much on making the fingerprint non-unique. But it is always easy to connect a changed fingerprint to the former fingerprint, so what we really need is to blend in with the same fingerprint as other people.
I don't know how it can be fully prevented other than disabling javascript, but in my opinion firefox with "privacy.resistFingerprinting" is a good start, though you'll still stand out because few people are using it.
I've seen a script that performed two canvas fingerprints - a complex one, and a simpler one. The latter was so simple it always returned the same value regardless of the browser, so it was there to check whether you had altered the canvas value. That's why changing your fingerprint might still leave you trackable.
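A sketch of that double-check (the expected pixel value is the whole point: a solid fill must read back exactly):

    // If a trivial, deterministic render reads back with any noise, an
    // anti-fingerprinting extension is probably randomizing canvas output.
    function canvasLooksSpoofed() {
      const c = document.createElement("canvas");
      c.width = c.height = 4;
      const ctx = c.getContext("2d");
      ctx.fillStyle = "#ff0000";
      ctx.fillRect(0, 0, 4, 4);
      const [r, g, b, a] = ctx.getImageData(0, 0, 1, 1).data;
      return !(r === 255 && g === 0 && b === 0 && a === 255);
    }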
> most idea i have seen, like a tor browsing, is focusing on changing fingerprint and not so much on making fingerprint non-unique.
Not sure if I exactly understand what you're trying to say here, but the Tor Browser itself certainly focuses on making its users' fingerprints identical. At least it's the only browser I know of that passes fingerprint tests (Panopticlick and friends) with JavaScript enabled.
As I'm repeating myself too often regarding this: the statistical norm is to be trackable. Every browser that isn't will get flagged.
Privacy does only exist if you seem to look exactly like any other person visiting the website.
That is why user agent, web apis, asset loading order and behaviours, http stack behaviours, tcp fingerprints all have to look like they came from the Browser Engine you're trying to identify with.
If you don't, you're watched. It's as simple as that. Don't kid yourself into safety if you think that the Content-Security-Policy hack of adblockers works to prevent tracking and fingerprinting. Sure, you send less data, but less data makes you stand out more from the norm.
As there was not a single concept that could fix this (in regards to looking like another browser), I started to build my own "web filtering browser" [1] that is able to emulate fingerprints and sticks them to specific domains and their originating uncached CDN requests, so that even if you have the same IP for the same website, they cannot correlate it to previous visits.
The most effective way to not get tracked is to not visit the website. And I think that there's a huge potential in offloading traffic to peers that have the same URL already in their caches. Peer-to-peer Browser Caches would fix so many issues on their own, I don't understand why nobody is doing that. To the end user it doesn't matter where the assets come from, as long as the website is rendered in the same way afterwards.
I still think there is an opportunity for a hosted browser vpn type service or a generic contained browser. It would look like every other person because every other person uses the same setup.
Except tracking services know about centralized VPN IP ranges that do not rotate. That's why everybody in algotrading switched to mobile apps and 3G/4G proxies that are installed on actual smartphones, and usually they have dozens of SIM cards laying around.
Er, wait, your comment sounds really interesting, but what is the threat model for algorithmic traders? Who are they trying to hide from and why?
Almost none of the web breaks simply because you're using a (very well known, very old) VPN IP. Purchases using a credit card are pretty much the only thing that is likely to get blocked. And you face a few more captchas sometimes.
Do algorithmic traders need to use something on the web that blocks VPN IPs more aggressively than this? They must, because juggling all those SIM cards sounds like a huge headache, and cell phone data has awful latency. I'm wondering what they're scraping that the average person doesn't use, and why they want to look like an average person if that's the case.
AFAIK mostly details, news, stories and stuff related to public knowledge about a company (which they factor in to the real HFT data) due to - as you already said - high latencies in mobile networks.
The issue that arises is mostly cloudflare-related, due to them having a huge influence on hosting, and the forced recaptchas when they detect anomalies in traffic behaviour makes a web scraping workflow real shitty.
From an algotrader's point of view it's a fix to make their web scrapers work again. I'm not sure how Chrome/ium headless could fix this (if it could). I'm a bit sceptical as it's just yet another cat and mouse game.
But as of late I've seen a huge scene start to develop around extension-building for headless Chrome specifically, so that they can run headless and still get the data as they want it to by integrating a content script that sends the data to another service.
Ah, thanks, now I get it: it doesn't bother them that fingerprinting lets them be tracked; it bothers them that fingerprinting (and VPN IPs) mean their scrapers get hit with captchas.
FWIW, it takes very little effort to completely conceal the fact that firefox is running headless under marionette/webdriver/geckodriver. Chrom(ium) takes more effort, but these guys have solved it (and built a business around it):
Of course neither of these address fingerprinting -- all your scrape requests will have the same or similar fingerprint, which will lead to captchas pretty quickly.
This might help (and might even be part of the reason for buying piles of cellphones):
A peer-to-peer consensus algorithm always has to be trustless, meaning you have to be able to verify things cryptographically so that there can be no 51% attack on each peer bucket. How that cryptography can be "safe" in the sense that nobody is able to fake it is a different story, because on the web lots of things can be statistically true or untrue, but stochastically the opposite (e.g. a website being down right now, or unreachable, or rendered differently for each country, etc).
In my case I'm using existing TLS infrastructure so that each peer can use their own certificates to communicate with other peers directly.
So far, the only things which haven't reported my graphics card to me are the Tor browser (good), KHTML (good, but probably because it doesn't support WebGL), and Lynx (of course, no js). Firefox reported much less info than Chrome did.
Firefox already does the same thing for html5 canvas [1], enabled I think via privacy.resistFingerprinting (in about:config). A similar feature would make sense for WebGL. In the meantime, power users can disable WebGL entirely via webgl.disabled (but without a user-friendly prompt to warn you when it's being blocked).
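For reference, the equivalent user.js lines for the prefs mentioned above:

    user_pref("privacy.resistFingerprinting", true);
    user_pref("webgl.disabled", true);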
The problem with popups like that is that too many of them, or ones where the user doesn't understand why something is blocked, lead to fatigue where they just start approving everything.
Agreed, but it would be nice to have an "advanced (paranoid) mode". E.g., I run little snitch and while it is super noisy, I don't mind the overhead, but I wouldn't force it on everyone.
Paradoxically, that backfires. If we only let a small subset of users disable WebGL... well, having WebGL disabled _becomes_ the identifying characteristic.
"Do you want to let the website use WebGL? It's either for rendering 3D graphics, such as 3D games, or an attempt to exploit bugs in WebGL to maliciously take full control over your machine, which may succeed if you consent"
Safari did this in the original implementation - it turns out to be absolutely useless. A person cannot know whether they should be clicking yes, even if they are well versed in the technology.
Until poorly-constructed websites and/or 3rd-party ad spyware added by the marketing dept starts requiring it or the site breaks. Just like Javascript, third-party cookies, canvas, localstorage...
Also Google runs the browser standards by way of their Chrome market share, and they would surely never implement something like this until they were confident that they didn't need WebGL for fingerprinting, at which point they'd just stop supporting WebGL entirely in favor of something that they invented.
It's not Embrace Extend Extinguish with Google; it's more like Embrace Replace Extinguish.
You'd be surprised how many websites use WebGL now. You'd probably end up with some sort of whitelist like the Google one for web audio, otherwise end users would complain incessantly.
Web Audio's autoplay block was implemented in a very poor way though, basically guaranteeing that existing content will be broken with the user having no way to unbreak it.
I've had webgl disabled for almost two years now because I find that webgl provides no benefit that I care about, yet lets websites slow down my laptop and drain my battery.
I disabled wasm a few months ago for the same reason.
I've had WebGL disabled forever, I don't use Twitch (as the other comment describes a valid use for) and really have never missed having it enabled for many years.
It's frequently included in "secure yourself" tuning guides: Firefox -> about:config -> webgl.disabled: true
It's been awhile, but IIRC the recommendation to disable it is based on security, not privacy. Other comments here explain it better than I could, not a programmer by trade.
The official registration page for vaccinations in Germany ( impfterminservice.de ) uses Akamai's Bot Manager, which in turn uses WebGL for fingerprinting. It does a lot more, computes timings of certain tasks and stuff, all in all, pretty impressive work, but it is doing fingerprinting.
The privacy policy of the website doesn't mention this technique, which effectively is a replacement for cookies, nor the use of any of Akamai's services. The hostname is even CNAMEd to Akamai.
So basically Akamai's service is left out of this list for no reason at all.
I stumbled upon this while trying to automate the querying for available slots to get notified via email, and one of the POST calls was sending my graphics card model to the server.
Have you told the government how this abuse of tech is making it less accessible and also a subtle privacy invasion? It's a bit disturbing that what would've probably been a simple HTML form a decade or two ago, and would still work just fine today, instead requires this extra crap.
If all hardware and software is the same, you should get the same print. Which doesn't make the fingerprint globally unique, but unique enough to make you "one of these 10000 people" instead of "any of this billion people". Combine it with other data, and the combined set identifies you uniquely.
I would guess nobody uses webgl exclusively for fingerprinting, but rather, in combination with other things. Apple machines are notoriously homogeneous hardware wise. You can imagine there's a much wider spread of webgl fingerprints for android phones and non-apple desktops/laptops.
It might work for simple fingerprinting scripts, but not for anti-anti-fingerprinting scripts. See: https://palant.info/2020/12/10/how-anti-fingerprinting-exten.... Since you're already on firefox, you're probably better off enabling resistfingerprinting instead.
Getting a random value each time is trivially detectable. That's a better signal than what you're blocking, if more people share your video card model than install the extension.
We need more of those “do you want to allow....” confirmations. They are trivial to accept in the rare cases when webgl is actually required (online game or something), but will introduce enough friction and “default off” behavior that sites should stop using them by default.
This would probably help with security as well; there has been a fair share of WebGL exploits.
The issue with this and all similar permission dialogs is that they often go the other way, you're training the user to just hit "allow" and swat the dialog away so they can get at the task/site they wanted. It's one thing to give the user options and ask for everything, but you've got to consider whether they've got the knowledge/context behind it to make a decision and whether it's presented in a way where they won't swat it away.
Remember those things are non-blocking.. for users who do not care, it is just another bar at the top they can ignore and proceed with their life.
Also from a "user privacy" point of view, this is still OK. Right now, disabling webgl is so rare it makes the user stand out a lot, and can be used as a tracking signal. Also, websites do not expect this to be disabled, and can break.
But if a non-trivial fraction of the users (say 10%) start refusing WebGL, then websites have to keep working without it; and the fact that it is disabled can no longer be used as tracking indicator.
I think this can be solved via UX. When the browser wants WebGL, notification privs, etc:
1. Pend the request
2. Notify the user "This site wants access to the WebGL graphics API to display complex graphics. This is disabled by default to prevent browser fingerprinting. If the site fails to work properly, click the lock icon and allow it."
3. Deny access until the user explicitly enables it on a temporary or permanent basis
More granular browser API permissions would be fantastic, but the current interface for grant/deny in all the major browsers is annoying enough as is. The prompts need to be moved to a less in-your-face place, or at least have the option to be. I almost always find myself clicking "deny" because no, I don't want to give your recipe site permission to show notifications.
Somewhat related is the new "sign in with Google" prompt I've been seeing on lots of websites. I accidentally clicked it when visiting the Seattle Times website and have been inundated with spam emails even after clicking unsubscribe.
Forgive a bit of ignorance on this, but I'm not 100% sure I know what browser fingerprinting actually is. I remember reading something by the DuckDuckGo founder mentioning that it could be a problem even if you use a VPN and incognito mode, but I had some trouble actually figuring out what that actually meant.
Browser fingerprinting allows you to identify uniquely a browser and thus the user. This usually means that VPN and incognito mode will not help you to "change you identity".
audiocontext fingerprinting is less identifying than you'd think. It pretty much boils down to "what's your browser compiled with and what FPU implementation are you running".
Imagine gathering screen size, installed fonts, graphics card, processor model, extensions, plugins, operating system, ... and so on. Perhaps you could gather enough system properties to actually identify someone uniquely. It’s like a fingerprint.
Some of these are more interesting than others. OS is exposed through the user agent. I'm not sure if you can actually capture the processor model and graphics card, and plugins were more interesting when people installed plugins like Flash and Java.
WebGL fingerprinting usually works by rendering something off-screen that exposes GPU differences, then captures the rendered image as the fingerprint.
This tracking method (WebGL) disregards VPNs, user agent changes and incognito mode.
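A simplified sketch of that technique (real fingerprinting scripts render far more elaborate scenes, and the hash here is a toy, not cryptographic):

    // Render a small gradient triangle off-screen, read the pixels back, and hash them.
    function webglFingerprint() {
      const canvas = document.createElement("canvas");
      canvas.width = canvas.height = 64;
      const gl = canvas.getContext("webgl");
      if (!gl) return "no-webgl";

      const compile = (type, src) => {
        const s = gl.createShader(type);
        gl.shaderSource(s, src);
        gl.compileShader(s);
        return s;
      };
      const prog = gl.createProgram();
      gl.attachShader(prog, compile(gl.VERTEX_SHADER,
        "attribute vec2 p; varying vec2 v; void main(){ v = p; gl_Position = vec4(p, 0.0, 1.0); }"));
      gl.attachShader(prog, compile(gl.FRAGMENT_SHADER,
        "precision mediump float; varying vec2 v; void main(){ gl_FragColor = vec4(abs(v), 0.5, 1.0); }"));
      gl.linkProgram(prog);
      gl.useProgram(prog);

      const buf = gl.createBuffer();
      gl.bindBuffer(gl.ARRAY_BUFFER, buf);
      gl.bufferData(gl.ARRAY_BUFFER, new Float32Array([-1, -1, 1, -1, 0, 1]), gl.STATIC_DRAW);
      const loc = gl.getAttribLocation(prog, "p");
      gl.enableVertexAttribArray(loc);
      gl.vertexAttribPointer(loc, 2, gl.FLOAT, false, 0, 0);
      gl.drawArrays(gl.TRIANGLES, 0, 3);

      const px = new Uint8Array(64 * 64 * 4);
      gl.readPixels(0, 0, 64, 64, gl.RGBA, gl.UNSIGNED_BYTE, px);
      let hash = 0;
      for (const b of px) hash = (Math.imul(hash, 31) + b) | 0;
      return (hash >>> 0).toString(16);
    }

Subtle differences in GPUs, drivers, and rasterization cause the read-back pixels (and thus the hash) to differ between machines while staying stable for any one machine.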
Browser fingerprinting is the ability of the website you are accessing to differentiate between you and other people because of a "fingerprint": a hash string that uniquely identifies your current browser and, since it's using WebGL, your computer, because of its configuration and graphical capabilities.
Using the way that your browser renders a page to uniquely identify you amongst other visitors. This can take the form of measuring how your computer generates random numbers, timing how long it takes to compute certain things, minute differences in how your GPU renders some pixels via webgl, etc. With enough signals you can pierce through the noise.
It's when a site is able to take a variety of Javascript-accessible state that in isolation is benign (such as your reported graphics driver, as the case here) but together form a unique identifier for a user given the high dimensionality involved. This allows identifying users without their consent and avoids some methods of anti-tracking.
My understanding is it's simply checking the level of support for different Web APIs. Since that is invariant with respect to the route between you and the website, a VPN wouldn't save you from this.
Basically, each browser has an unique set of features so each device can be uniquely identified and profiled across different domains, regardless if you block cookies, use incognito mode and other techniques.
Some of the web APIs expose information that in aggregate can give an almost unique fingerprint to that device. Using fingerprinting techniques you can then track what the device is doing.
basically no two systems are the same, and if the website can gather enough information about your system, it can identify you, even through a VPN or in incognito mode.
Popular information for fingerprinting include: your browser, the country you are in, your language(s), your graphic card (through webgl), the fonts installed on your computer, the size of your screen (or your browser window), ...
window.botguard on google.com could very well be a way to detect bots running WebDriver to impersonate humans. Since WebDriver is headless, there would be detectable differences in WebGL support. This would also explain why the code is "very obfuscated".
That's exactly what it is. BotGuard isn't designed for identifying individual users, that's what cookies are for. It's designed to detect programs that are pretending to be normal web browsers being used but which aren't (because they're scripts, or rendering engines being automated, etc).
WebGL is a very useful feature, but the top sites by traffic are not exactly where I'd look for good examples of WebGL use. It is a bit more of a niche feature, but for some use cases you can't really replace it at all.
One thing that continues to confuse me is the fact that all this remains speculative. Surely on HN, there are people who have worked in FAANGs. Doesn't anyone have first hand knowledge of how tracking is performed at various websites, and whether people are really using webGL to track? And if so, how that's stored and correlated across different sites?
I don't doubt this is a thing, but the place to look for anything concerning is information returned from these API calls going out over the wire. Very few places encrypt or obfuscate their network requests with anything but base64, URL encoding, or maybe punycode; I've seen a few others but nothing difficult to figure out.
Most bare metal features introduced into what used to be browsers, but are now virtual machines for running applications, are and will be used for malicious purposes. The idea that you should allow random strangers to run code on your machine, especially this low level, is even more crazy than opening every email attachment you get.
Except Firefox is getting irrelevant, with Edge Chromium overtaking it, and Apple would naturally rather developers use iOS APIs for exactly the same purpose, while collecting some revenue from Apple hardware bought for development purposes.
FF should focus on a better UX. Try FF and Edge on Windows 10, Edge seems much snappier because it boosts the mouse scroll speed. A friend wanted to switch because of that, and increasing the overall mouse scroll speed on W10 did the trick for FF. It's little tricks like those that make non techy people switch.
How would you define bare-metal? WebGL is proxied via ANGLE, so there's no risk of crashing or other forms of leaky processes, which is typically the primary thing I think of when I worry about low-level APIs.
ANGLE is not a virtual machine but a translation layer from OpenGL ES (on which WebGL is based) to Direct3D. Inappropriate translation would result in a direct exploit unlike VM, since AFAIK there is no practical means to sandbox shaders. In some sense it is akin to JIT in a kernel which has the roughly same security ramification.
>Inappropriate translation would result in a direct exploit unlike VM, since AFAIK there is no practical means to sandbox shaders.
What does it mean to sandbox shaders in this context? GL ES shaders are sent down in a high-level language and can't really do much besides computation on input parameters to generate output in a pipeline. I wish they were more general purpose and worthy of sandboxing, but WebGL shaders are really limited.
The first link demonstrates reading from uncleared buffers to capture screen contents - and is from the Windows XP era.
The second one is about running a bitcoin miner on the GPU, hypothetical memory breaches, and GPU DoS. You can stall the graphics pipeline without shaders and run a miner on the CPU, and I'm going to need a demonstration of a useful GPU memory breach with WebGL. On top of that, any such breach is likely driver dependent and will only work on some specific driver/HW combination - that's such a low attack surface I doubt anyone will bother developing exploits targeting it.
I didn't bother going through the third because I wasted enough time on the first and second; I don't see how anything here suggests you need to sandbox shaders.
Modern GPUs have process isolation just like CPUs, and if that is not enough for you, GPU APIs allow an application (in this case the browser) to request bounds-checked execution for extra safety.
ANGLE is used to do a source-to-source translation irrespective of backend - it normalizes the source input to the various GPUs (dropping things that have historically been funky like comment parsing, #controls, dodgy constructs, disallowed constructs, etc.)
[Source: I worked on the original spec and spent a lot of time trying to prevent the more egregious security and privacy problems]
... but Google Chrome is not running on game consoles anyway? Blink likely can, but then it does not necessarily use ANGLE; for instance QtWebEngine definitely does not use ANGLE on Mac / Linux but makes native GL calls instead.
I don't understand your point. Yes, it's using WebGL. Do you think that it's going to do WebGL -> OpenGL ES 2/3 calls -> ANGLE -> Direct3D -> PS4 graphics driver (like it does with e.g. Chrome on Windows)? No, it's likely doing WebGL -> PS4 proprietary graphics API -> PS4 graphics driver, not going through ANGLE or whatnot.
But why would it go through ANGLE's libGLESv2.so and add a backend to that for the PS4, when the PS4 team could most likely provide their own libGLESv2 which maps directly to their driver (since they were doing that already for the PS3)? This is what Chrome actually calls: GLES functions.
Unless I missed something, when were they doing that with PS3?
The only effort to do any GL like stuff was on the PS2, with GL ES 1.0 + Cg, an effort that was dropped when almost no dev cared to use it.
Other than that there was the PSGL library for PS2 Linux.
Anyway, I did a bit more research and Chrome was actually only used for development purposes, they created their own WebGL engine on top of PhyreEngine for the console OS.
Regardless of how you define bare-metal, OP has a point here. Aside from bespoke web applications (which should be fully packaged and signed, btw), execution of Turing-complete code inside the browser is pointless. At best it helps to work around certain limitations (bandwidth, browser quirks) or improves the UX of a page. But mostly it is just getting in the way or outright used against the user.
How is this possible at all? Don’t allow readout of canvas (or similar) at all without asking me if it’s ok. Problem solved. I don’t care if it becomes annoying on the off website that does have a valid use case for it. This just shouldn’t happen.
Sadly this sort of readback is necessary for various features and scenarios, so disabling it across the board is a big hassle. Brave has an approach to this where they scramble the output some to undermine its usefulness for fingerprinting, at least.
Again - I don’t care if it’s a hassle. It should be a hassle. If I’m on a web based image editor I accept the readout, if I’m on YouTube I definitely don’t.
There are some valid reasons for using fingerprinting.
In the UK, gambling companies use fingerprinting to enforce exclusion lists (i.e. gambling addicts who self-exclude from gambling websites), and stop people from outside the UK gambling illegally (the latter is used generally, although desktop apps are generally more secure).
I believe it is also used in online retail to combat bots (self-evidently, not everyone is doing this).
I think it is a concern when websites use this kind of thing indiscriminately. I have worked in web development roles, and I have had devs tell me point blank (once, a lead dev) that fingerprinting is impossible, whilst using products that expose their customers to fingerprinting... so it is important to be aware of it. But there are legitimate uses of fingerprinting specifically, separate from the APIs whose features some apps rely on (e.g. WebGL). It is sometimes important to know who someone is.
Another compelling reason to have JS disabled by default. If you need to browse those sites listed on the page, consider having a dedicated laptop that would have a specific fingerprint separate from the machine you would do most of your daily browsing on, and compartment away those sites that are known to track you across the web.
I actually rarely have to enable JS on pages now. Most JS-only sites can be found in places like https://news.ycombinator.com/show where people showcase JS-only sites / apps. Most are innocuous enough and are not trying to track you, though, so I sometimes turn on JS just to test them out. I have a dedicated laptop for Youtube, Reddit etc. I'm well aware that sites like Reddit track you via JS, so I compartment / silo sites like that with a dedicated machine.
The sites that are looking for the GPU details may have been trying to detect which CPU an iOS device has - until Apple obscured the information, it used to be a pretty reliable way of discovering that.
What about the situations with banks or financial related field? I think a lot of people would prefer the site collect information to check for any irregularities on who is making accounts or logging in so they can be warned if anything stands out. Fraudulent accounts pass the cost to every other customer.
Maybe I haven't thought of every situation but don't you want to be able to prevent odd behavior to an account?
I think they are, because I used to google "enjo kosai love hotels" in 2015 and it used to show me several establishments, but now they block the results saying it is offensive.
Since when the hell did Google, an American corporation, become the arbiter of morality and truth?
This fingerprinting is closely tied to this. The sad part is that the web scraping technology I've been given access to easily bypasses it.
"player-core-variant-....js" is the Javascript video player and it uses WebGL as a way to guess what video format the browser can handle. A lot of the times mobile android devices will say "I can play video X, Y, Z" but when sent a bitstream of Y it fails to decode it. WebGL allows you to get a good indication of the supported hardware features which you can use to guess what the actual underlying hardware decoder is.
This is sadly the state of a lot of "lower end" mobile SoCs. They will pretend to do anything but in reality it just... doesn't.