I normally use Netron for quickly inspecting models and building a 'mental picture' of the architecture, but this hierarchical approach seems a better fit for my needs.
I'm just starting and the first impression is pretty good!
These tools are eye candy; they've been around since tensorflow/tensorboard 0.x ten years ago, but I never used them again after trying them for fun. You need to read the source code; there's no easy way around it.
Looks cool but seems like it doesn't work on torch 2.0
"AttributeError: module 'torch' has no attribute 'export'"
The torch.export API is currently in active development with planned breaking changes, and the installation guide for this is still very minimal. Does anyone know how to get it working on torch 2.0?
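For what it's worth, torch.export only showed up after 2.0 (around the 2.1 release, if I remember right), so on 2.0 the attribute simply doesn't exist. A minimal sketch of a guard, with a toy stand-in model:

```python
import torch

# Minimal sketch: on torch 2.0 there is no torch.export submodule at all,
# so check for it before calling. The Linear model is just a placeholder.
model = torch.nn.Linear(4, 4).eval()
example_input = torch.randn(1, 4)

if hasattr(torch, "export"):
    program = torch.export.export(model, (example_input,))
    print(program)
else:
    raise RuntimeError(
        f"torch.export is unavailable in torch {torch.__version__}; upgrade"
    )
```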
I haven't managed to successfully export my custom ViT model yet, but I've not had an issue accessing the export methods in torch 2.3 within the nvcr.io/nvidia/pytorch:24.02-py3 container.
I may have some more time to debug my trace tonight (i.e., remove conditionals from the model and make sure everything is on CPU) and will update if I have any new insights.
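For anyone hitting the same wall, here's a hedged sketch of what "remove conditionals" means in practice: branches on tensor *values* (e.g. `if x.mean() > 0:`) are data-dependent and tend to break the trace, while a plain Python flag fixed at construction time traces fine. `Block` and `use_residual` are invented names:

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    def __init__(self, use_residual: bool = True):
        super().__init__()
        self.use_residual = use_residual  # static flag, resolved at trace time
        self.proj = nn.Linear(64, 64)

    def forward(self, x):
        y = self.proj(x)
        # A check on tensor data here would be data-dependent control flow
        # and fail to export; the static Python bool keeps the graph clean.
        return x + y if self.use_residual else y

block = Block().cpu().eval()  # keep everything on CPU, as mentioned above
exported = torch.export.export(block, (torch.randn(1, 64),))
```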
I've never really understood the point of these visualizer things. The idea that a model is always well represented by a directed acyclic graph seems extremely dated.
I really would love a PyTorch/JAX profiler that shows, in annotated Python, where your code is allocating memory, using compute, or doing device copies.
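Not quite annotated Python, but torch.profiler gets partway there today: with `with_stack` and `profile_memory` it can attribute compute and allocations back to Python call sites. A small sketch (toy model, untuned settings):

```python
import torch
from torch.profiler import profile, ProfilerActivity

model = torch.nn.Linear(512, 512)
x = torch.randn(8, 512)

with profile(
    activities=[ProfilerActivity.CPU],
    profile_memory=True,  # record allocations per op
    with_stack=True,      # record the Python call sites
) as prof:
    model(x)

# Group by call stack so each row maps back to source lines.
print(prof.key_averages(group_by_stack_n=5).table(
    sort_by="self_cpu_memory_usage", row_limit=10))
```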
It may be that the way you like to think about things and the way others like to are different.
I find that quickly grasping a new architecture is easiest with a graph-based diagram first. Then code for details. All with the goal of internalizing the information processing steps. Not memory allocation per se.
In my mind, how the network implementation allocates memory is a different question.
But I think both of our desires just reflect our jobs, our interests, and simply how our brains conceptualize things differently.
I think it's a trap of visual elegance. When you start thinking of models this way, you miss how a lot of models are actually written.
E.g., how do you represent an online fine-tuning process? I want to randomly switch between a reference implementation and an approximation method, but when using the approximation I want to back-propagate so that it gets better over time.
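Something like this sketch (all names invented), where the dispatch and the training step are plain Python control flow that no static DAG captures:

```python
import random
import torch
import torch.nn.functional as F

# Frozen reference implementation: exact but (presumably) expensive.
reference = torch.nn.Linear(16, 16).eval()
for p in reference.parameters():
    p.requires_grad_(False)

# Learned approximation, trained online against the reference.
approx = torch.nn.Linear(16, 16)
optimizer = torch.optim.SGD(approx.parameters(), lr=1e-3)

def step(x):
    if random.random() < 0.5:
        return reference(x)             # exact path, no learning
    y = approx(x)                       # approximate path
    loss = F.mse_loss(y, reference(x))  # supervise against the reference
    optimizer.zero_grad()
    loss.backward()                     # approximation improves over time
    optimizer.step()
    return y.detach()

out = step(torch.randn(4, 16))
```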
full disclosure: I've written plenty of these little visualizers and also fallen for the trap of "everything should be a declarative graph."
Lol, I interned on PyTorch a few years ago (you and I even met/talked about tangential things :)) and worked on tracking such allocations (although I didn't hook it up to the profiler). Spoiler alert: you can't track such provenance because everything gets muddled in the dispatcher.
EDIT: it's not completely accurate to say you can't do it. I prototyped a little allocator that stamped every allocation (in the unused bits of the pointer itself, a trick I learned from zach) with the thread id and a timestamp (just an incrementing counter) and then percolated that up to the surface. Obviously that didn't land, lol.
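For the curious, the trick looks roughly like this in Python (the real prototype lived in the C++ allocator; everything here is a simplified, invented rendition):

```python
import itertools
import threading

# On x86-64, user-space addresses leave the top 16 bits unused, so an
# allocator can stash a small provenance tag there and mask it off before
# the pointer is ever dereferenced.
TAG_SHIFT = 48
ADDR_MASK = (1 << TAG_SHIFT) - 1
_counter = itertools.count(1)

def stamp(addr: int) -> int:
    tag = (threading.get_ident() ^ next(_counter)) & 0xFFFF
    return (addr & ADDR_MASK) | (tag << TAG_SHIFT)

def strip(addr: int) -> int:
    return addr & ADDR_MASK          # always mask before dereferencing

def tag_of(addr: int) -> int:
    return addr >> TAG_SHIFT         # recover provenance at the surface

ptr = stamp(0x7F00DEADBEEF)
assert strip(ptr) == 0x7F00DEADBEEF
print(hex(ptr), tag_of(ptr))
```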
But also, while generally "in all things I defer to Horace" (ok not really), maybe I'm not looking closely enough (and missed it), but the bottom of that stack shows (roughly) the autograd dispatch key and not the Python call site (or some such). Maybe it's a pedantic difference (depends on what bram wants), but I wanted provenance back to the TorchScript (TS) op so that I could then do static memory allocation things with that representation (now I've probably fully de-anonymized myself...). And for that use case, even what you have now isn't enough: you can't get a total sum of how much each TS op allocates and when the corresponding free happens.
Not mocking you, but I'd guess you don't like to draw block diagrams when discussing designs or code architecture with others either? Some folks aren't "visual" thinkers, and it took me a long time working with people to realize that.