I think they are doing that because, with real images, the model changes the face. That problem goes away if the initial image doesn't show the face.
This generation seems to be getting its performance from more power and more cores. Not really an architectural change, just packing more things into the chip that require more power.
Too true. I've been looking to replace my 1080. It was a beast in 2016, but the only way I can get a more performant card these days is to double the power draw. That's not really progress.
Then get a modern GPU and limit its power to what your 1080 draws. It will still be significantly faster. GPU power is out of control these days; if you knock 10% off the power budget you generally only lose a few percent of performance.
Cutting the 5090 down from 575W to 400W is a 10% perf decrease.
The 5090 was an example; the same process applies to lower-tier GPUs that don't require extra power cables. E.g. a 3080 with the same power budget as a 1080 would run circles around it (a 1080 with the default max power limit of 180W gets approx 7000 in TimeSpy, a 3090 limited to 150W gets approx 11500). Limiting the power budget is very simple with tools such as MSI Afterburner and others in the same space.
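If you're on Linux or a headless box, you can do the same without Afterburner. A minimal sketch of my own, assuming the nvidia-ml-py (pynvml) bindings, a card/driver that allows software power limits, and admin rights; the 150W target is just the 1080-class budget from above:

    # Sketch: cap a GPU's software power limit via NVML (nvidia-ml-py).
    # Assumes the card/driver supports changing the limit and that the
    # process runs with sufficient privileges. Values are illustrative.
    import pynvml

    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(0)               # first GPU

    current_mw = pynvml.nvmlDeviceGetPowerManagementLimit(handle)
    print(f"current limit: {current_mw / 1000:.0f} W")

    pynvml.nvmlDeviceSetPowerManagementLimit(handle, 150_000)   # 150 W, in milliwatts

    pynvml.nvmlShutdown()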
Gaussian splatting transforms images into a point cloud. GPUs can render these points, but it is a very slow process; you need to convert the point cloud into meshes. So basically it is the initial step to capture environments before converting them into 3D meshes that GPUs can use for whatever you want. It is much cheaper to use pictures to get a 3D representation of an object or environment than to buy professional equipment.
> Gaussian splatting transforms images into a point cloud.
Not exactly. The "splats" are spread out in space (big ellipsoids), partially transparent (what you end up seeing is the composite of all the splats visible in a given direction), AND view dependent (they render differently depending on the direction you are looking from).
Also, there's no simple spatial relationship between splats and solid objects. The resulting surfaces are a kind of optical illusion produced by all the splats you're seeing in a specific direction. (Some methods have attempted to lock splats more closely to the surfaces they are meant to represent, but I don't know what the tradeoffs are.)
Generating a mesh from splats is possible but then you've thrown away everything that makes a splat special. You're back to shitty photogrammetry. All the clever stuff (which is a kind of radiance capture) is gone.
Splats are a lot faster to render than NeRFs, which is their appeal. But they are heavier than triangles due to having to sort them every frame (because transparent objects don't composite correctly without depth sorting).
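To make the sorting point concrete, here is a toy sketch of my own (not from any particular renderer) of per-view depth sorting plus front-to-back alpha compositing. Splat centers, colors and opacities are assumed as inputs; a real rasterizer does this per pixel with projected 2D Gaussians:

    # Sketch: why splats must be depth sorted every frame.
    # Front-to-back "over" compositing along one viewing direction.
    import numpy as np

    def composite_along_view(centers, colors, alphas, cam_pos, view_dir):
        depth = (centers - cam_pos) @ view_dir     # distance along the view direction
        order = np.argsort(depth)                  # nearest splat first
        color = np.zeros(3)
        transmittance = 1.0                        # light not yet absorbed
        for i in order:
            color += transmittance * alphas[i] * colors[i]
            transmittance *= 1.0 - alphas[i]
            if transmittance < 1e-4:               # early exit once nearly opaque
                break
        return color

    # Toy usage: random splats, camera at the origin looking down +z.
    rng = np.random.default_rng(0)
    print(composite_along_view(rng.uniform(-1, 1, (100, 3)),
                               rng.uniform(0, 1, (100, 3)),
                               rng.uniform(0.05, 0.6, 100),
                               cam_pos=np.zeros(3),
                               view_dir=np.array([0.0, 0.0, 1.0])))

Move the camera and the order changes, which is why the sort cannot be precomputed.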
Minor nit — in what way do splats render differently depending on direction of looking? To my mind these are probabilistic ellipsoids in 3D (or 4D for motion splats) space, and so while any novel view will see a slightly different shape, that’s an artifact of the view changing, not the splat. Do I understand it (or you) correctly?
Basically, for each Gaussian there is a set of spherical harmonics (SH) coefficients, and those are used to calculate what color should be rendered depending on the viewing angle of the camera. The SH coefficients are optimized through gradient descent just like the other parameters, including position and shape.
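The evaluation itself is just a small polynomial in the view direction. A degree-1 sketch of my own below; the constants are the standard real SH basis factors, and the +0.5 offset follows the reference 3DGS convention as I understand it:

    # Sketch: view-dependent color from degree-1 spherical harmonics.
    # Each Gaussian stores one RGB triple per SH basis function; the view
    # direction is the unit vector from the camera towards the splat.
    import numpy as np

    SH_C0 = 0.28209479177387814   # Y_0^0
    SH_C1 = 0.4886025119029199    # degree-1 factor

    def sh_to_color(coeffs, view_dir):
        # coeffs: (4, 3) array, one RGB triple per basis function
        x, y, z = view_dir / np.linalg.norm(view_dir)
        color = (SH_C0 * coeffs[0]
                 - SH_C1 * y * coeffs[1]
                 + SH_C1 * z * coeffs[2]
                 - SH_C1 * x * coeffs[3])
        return np.clip(color + 0.5, 0.0, 1.0)

    print(sh_to_color(np.random.rand(4, 3), np.array([0.0, 0.0, 1.0])))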
Basically you train a model for each set of images. The model is a neural network able to render the final image; different sets of images require different trained models. The initial Gaussian splatting models took hours to train, last year's models took minutes. I am not sure how long this one takes, but it should be somewhere between minutes and hours (and probably closer to minutes than hours).
No, what you're describing is NeRF, the predecessor technology.
The output of Gaussian Splat "training" is a set of 3d gaussians, which can be rendered very quickly. No ML involved at all (only optimisation)!
They usually require running COLMAP first (to estimate the relative camera positions between the different images), but NVIDIA's InstantSplat doesn't (it does, however, use an ML model instead!)
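For what it's worth, the per-scene "training" is just gradient descent on the splat parameters themselves. A toy 2D sketch of my own below (fitting a few isotropic Gaussians to a target image); the real pipeline adds a differentiable tile rasterizer, anisotropic 3D Gaussians, SH colors, and densification/pruning:

    # Sketch: Gaussian-splat fitting is plain optimisation, no neural network.
    # Toy 2D version: fit N isotropic Gaussians to a 32x32 target image.
    import torch

    H, W, N = 32, 32, 16
    ys, xs = torch.meshgrid(torch.linspace(0, 1, H), torch.linspace(0, 1, W), indexing="ij")
    pixels = torch.stack([xs, ys], dim=-1)                        # (H, W, 2)

    # Learnable splat parameters: position, log-scale, color, opacity logit.
    pos     = torch.rand(N, 2, requires_grad=True)
    logsig  = torch.full((N,), -3.0, requires_grad=True)
    color   = torch.rand(N, 3, requires_grad=True)
    logit_a = torch.zeros(N, requires_grad=True)

    def render():
        d2 = ((pixels[:, :, None, :] - pos) ** 2).sum(-1)         # (H, W, N)
        w = torch.sigmoid(logit_a) * torch.exp(-d2 / (2 * torch.exp(logsig) ** 2))
        return (w[..., None] * color).sum(2).clamp(0, 1)          # (H, W, 3)

    target = torch.zeros(H, W, 3)
    target[8:24, 8:24, 0] = 1.0          # a red square standing in for a photo

    opt = torch.optim.Adam([pos, logsig, color, logit_a], lr=0.05)
    for step in range(300):
        loss = ((render() - target) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()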
You are not going to take the expensive human out of the loop where downside risk is high. You are likely to take the human out of the loop only in low risk low cost operations to begin with. For those use cases, these models are quite expensive.
My problem with this analysis is that it ignores who is using which computer. So far, new people in the company get the M3, older employees have an M2, and the people who have been at the company the longest have an M1. Who is going to work on more critical tasks with more changes in the code? Who is going to work mostly on easy bugs until they get some experience with the company's code? I bet that if you gave both populations the same computer, the compile times would still be faster for the new people.

For me the analysis doesn't have enough dimensions; it should take into account the time since the person was hired and their seniority. I would also have added more types of graphs (boxplots seem a better way to compare the information), and I would have measured the total % of CPU usage. The battery/AC analysis gave me the impression that the M3 might be underutilized and that it is going to be impossible to get lower compile times without faster single-core speeds (which might be relevant information for the future).
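For example, a grouped boxplot along these lines (a sketch of my own, column names are hypothetical) would separate the machine effect from the tenure effect:

    # Sketch: compile time per machine, split by tenure, so the hardware
    # effect isn't confounded with who gets which laptop.
    # Hypothetical columns: machine ("M1"/"M2"/"M3"), tenure_years, compile_s.
    import pandas as pd
    import matplotlib.pyplot as plt

    df = pd.read_csv("compile_times.csv")
    df["tenure"] = pd.cut(df["tenure_years"], [0, 1, 3, 10], labels=["<1y", "1-3y", ">3y"])

    ax = df.boxplot(column="compile_s", by=["machine", "tenure"], rot=45)
    ax.set_ylabel("compile time (s)")
    plt.suptitle("")                     # drop pandas' automatic group title
    plt.tight_layout()
    plt.show()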
I think Kotlin is one example. It uses the same idea, but it uses multiples of 10 for incremental releases and the numbers 1 to 9 for hotfixes. That's it for the 3rd number; I do not know what will happen when the second number reaches 2 digits. I guess they will do something to make it comparable again.
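As I understand the scheme described above, that keeps patch numbers numerically comparable; a quick illustration (the exact versions are just for show):

    # Sketch: multiples of 10 for incremental releases (x.y.0, x.y.10, x.y.20),
    # +1..9 on top for hotfixes (x.y.21, x.y.22), so plain numeric
    # comparison still orders them correctly.
    def key(version: str) -> tuple[int, ...]:
        return tuple(int(part) for part in version.split("."))

    releases = ["1.9.22", "1.9.0", "1.9.20", "1.9.10", "1.9.21"]
    print(sorted(releases, key=key))
    # ['1.9.0', '1.9.10', '1.9.20', '1.9.21', '1.9.22']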
You are assuming that the whole point of humanity's existence is to work? Because, without working, they would be sloths?
What about spending more time on healthy habits like working out, meeting with family and friends more often, discovering the world, and learning new things?
So retired people are just sloths?
I’m more worried about people not being able to feed themselves because their labor became worthless. They will effectively be frozen out of the economy as they have nothing to trade with.
If AI does everything, the economy won't make sense anymore. Maybe there will be a basic income, or everyone will just ask for what they want and AI will provide it.
We thought AI would replace the low-level jobs first, but it seems the creative jobs are going first (art, software development, etc.). Bear that in mind.
No matter who gets replaced first, someone is getting screwed.
Frankly, if it's the higher-end jobs getting replaced first, that would likely spill over to the lower-end ones as those people who lost their jobs resort to taking lower-end work to survive, flooding the market.
That's under the assumption that nothing else will change. But that is not the case; the system would have to adapt. One possibility is that we won't use money anymore, and there are a lot of in-betweens. But what you certainly cannot do is stop the change that is coming.
Frankly, I think eventually machines will do it all. I see AGI as the universal automation that can do everything a human can - apart from “being human”.
> How many working horses are there today vs before automobiles?
Well, exactly. The fewer working horses there are, the more expensive and exclusive it becomes to ride them.
There could be 1 trillion automobiles and I bet you, none of these automobiles would compare to riding a real, live horse.
Similarly, there could be 1 trillion AI robots, they could do everything better than a human, and yet I bet you'd still want to ride (or otherwise experience) a real, live human.
My point is that if automobiles were always better than horses in every way, then nobody would want horses. But even today, with the amazing automobiles that we have, some of which even faster and more reliable than most horses, it's clear that we still want horses.
My question is, if horses were as intelligent as us and they could have their basic needs met extremely cheaply, would they be willing to work at all, apart from the occasional ride? Because the horse labor pool would shrink immensely if they didn't really want to work.
Time is the fourth dimension. The input data is a video, so the model learns the colors and the positions of the elements (basically points). You can render the scene from any angle at any time once the model is trained.
SEEKING WORK | Spain | Remote (EU and US time zones)
Technologies: TensorFlow, PyTorch, deep learning, LLMs, diffusion models, GANs
Résumé/CV: http://jorgemf.github.io/cv.pdf
Personal website: http://jorgemf.github.io
Email: (in the CV)