> "the scaling hypothesis" Oh boy, don't get me started.... I first off should s...

armoredtech · 2024-01-02T05:01:15 1704171675

ML researchers saying they need "category theory" sounds like a way to try to convince mathematicians that their work is cool. You absolutely do not need category theory.

Math is just models? Lol!

calebkaiser · 2024-01-02T12:47:44 1704199664

The parent didn't say category theory is necessary to conducting ML research, just that it could be useful. This point isn't particularly controversial. If you're interested in this niche of the field, I find Tai-Danae Bradley's work to be pretty cool! She has a site: https://www.math3ma.com/

armoredtech · 2024-01-02T17:40:28 1704217228

Thanks for the reply. I'm glad my comment is no longer flagged.

What do you mean that "this point isn't particularly controversial?" If you just mean that "X may be useful", then of course. But the particular X matters, and "could be useful" is much different than "is useful".

People who like category theory want it everywhere. I don't know your mathematical background, but spend any time in a math department, or even classes, and you'll find people ready to explain any topic in the language of CT.

The may be useful, but it has to be justified. It's clear in some mathematical contexts, but definitely not in ML (yet alone analysis).

ML has a problem in that no one knows what certain methods work. Just look at something like batch normalization: I can think of at least 3 different "explanations" on why it works.

ML people want explanations, and mathematicians need work. Category theorists therefore have work. But I don't think you should mistake this as being an explanation. You just get a nice get a "cleaner way" to present concepts.

godelski · 2024-01-02T18:42:19 1704220939

> I'm glad my comment is no longer flagged.

FYI, I flagged you because the comment does not live up to the HN community standards[0]. A new account with just a comment to me made shortly after my comment was made just to say something sarcastic and does not contribute to the conversation. I decided to flag instead of commenting and continuing an unproductive exchange.

> People who like category theory want it everywhere.

This isn't surprising. It is an attempt at further generalization of mathematics. Albeit it can get annoying, it isn't wrong because cat theory is about looking from the high abstract level and making connections between differing branches of mathematics. If you don't see it everywhere you either don't have an understanding or have discovered something those people would really like to know. From personal experience, it can be a quite useful tool to describe things because of this.

> The may be useful, but it has to be justified.

The former begets the latter.

> Just look at something like batch normalization: I can think of at least 3 different "explanations" on why it works.

Are those the same thing? What are those?

> But I don't think you should mistake this as being an explanation. You just get a nice get a "cleaner way" to present concepts.

The latter is de facto the former.

And yes, math is just models. Or as Poincaré said, math is the study of relationship between numbers. One might also say "the map is not the territory" and you can find several math theorems making this point explicitly about math. You may even find one by reading my username with a little care. More than one if you take more care.

[0] https://news.ycombinator.com/newsguidelines.html

armoredtech · 2024-01-02T21:00:53 1704229253

> If you don't see it everywhere you either don't have an understanding or have discovered something those people would really like to know. From personal experience, it can be a quite useful tool to describe things because of this.

Get off your high horse. I've had my share of Mac Lane. If you can describe something in terms of CT, you can talk to mathematicians who care about CT. I don't see why this helps ML.

> The may be useful, but it has to be justified. "May be useful" does not beget "justified." CT may be useful in all areas if you ask a CT theorist. I fail to see how CT helps me build a car.

>The latter is de facto the former.

No it's not. You can take you favorite analysis topic and find a suitable category to view your topic from a CT perspective, but this won't tell you how to prove anything. If you did the CT correct you can now make some analogies, but it won't tell you anything specific.

> And yes, math is just models. Or as Poincaré said, math is the study of relationship between numbers. One might also say "the map is not the territory" and you can find several math theorems making this point explicitly about math.

How do you square "math is the study of relationship between numbers" with CT? You can diagram chase without seeing a single number. I have no idea what mathematical theorem you are referring to, but if you're extrapolating philosophical points from a mathematical theorem, you're doing it wrong

> You may even find one by reading my username with a little care. More than one if you take more care.

Ok I'll bite. You seem to be into Normalizing Flows. How does CT explain it being useful?

godelski · 2024-01-02T18:34:27 1704220467

There's also the cats.for.ai group and this nice github: https://github.com/bgavran/Category_Theory_Machine_Learning

borissk · 2024-01-02T01:15:37 1704158137

Very interesting. Are any of your lectures available online?

godelski · 2024-01-02T02:27:51 1704162471

I'm trying not to dox myself so I can be more open on HN (though more concerns in modern era...). You can find some harsh words against some ML community practices in my history and I think it is easy to get misinterpreted as calling people dumb or confuse academic bashing from utility (I criticize LLMs and diffusion a lot because I like them, not the other way). So yes and no. But the lectures I have aren't recorded and public (zoom for my Uni. I'm ABD in my PhD). My lecture slides and programs should be publicly visible though, but I don't go into this with them because I've been specifically asked to not teach this way :/ In all fairness, our ML course only has Calc 1 as a pre-req and CS students aren't required to take Lin Alg (most do though, but first courses are never really that great ime) or differential equations. TBH to get into this stuff you kinda need some metric theory. If you actually poke through this paper you'll find that come up very quickly, and this is common in the optimal transport community. But I think if you get into metric theory a lot of this will make sense pretty quickly. So if you can, maybe start with Shao's Mathematical Statistics?