Hacker News new | past | comments | ask | show | jobs | submit login
Luma AI Dream Machine (lumalabs.ai)
213 points by Anon84 6 months ago | hide | past | favorite | 63 comments



Goodness me this is the stuff of nightmares!

I asked it for "A dog catching a treat in slow motion, it's chops flapping around comically"

https://imgur.com/a/dog-catching-treat-slow-motion-its-chops...



What, you don't know about the Marvel character of the warrior who travels through bovine?


Ah yes, the Bovine Blitzer.


I wonder if this is doing basically the same thing as SORA/Kling, but with less compute/model size. It kind of reminds me of OpenAI's examples from their technical report: https://openai.com/index/video-generation-models-as-world-si...

Somewhere between "base compute" and "4x compute".

So maybe you "just" need to know how to create a certain type of diffusion transformer model and then train on a ton of videos, but with an adequate amount of compute. Which is probably a LOT for training and inference to get more realistic results.


I'm reading the Southern Reach Trilogy at the moment, and this type of imagery fits there really well.


I can see him catching the treat (middle of the video) in super fast motion.


Nightmares are also dreams


I hate when a website offers 'Sign in with Google' as the sole sign-in option. What happened to have email as the default account creation method?


I’ve launched apps with free trials in the past and when I allowed email, there were 100 fake emails from temporary email services for every 1 real one. Had to do more work to block those as they were eating up all the resources from paying users


If you have an extensive demo page covering my use case, I would prefer a low-cost entry offer to use my Google account. Upgrading to a more expensive option later is usually not such a hard task for me when I really love the product.

Lately, I had a case when someone informed me that he would take my e-mail from his newsletter mailing list because I seemed to be a dead address (and every address generated cost for him). I replied that I did not like to be tracked (implying that I had read the newsletter) and we had a nice conversation. Not every fake e-mail is as fake as it seems ;-)

Apparently, Thunderbird can be quite good at beating E-Mail Tracking.


Doesn't this just further the point that your potential customers don't want to identify themselves to try your product? Punting on anti-sybil or anti-spam by requiring a strongly identity-linked account (Google accounts can't be created from VPNs or without phone numbers, for instance) is user-hostile.

If your free users are using up paid user resources, that's a different problem.


One of the nice parts about starting a business is that you get to choose your customers. There is no ethical or moral rule that you must provide access to everyone who wants to use your app. You are even allowed to make decisions that reduce your revenue in favor of other priorities.


The problem is not that I want peoples real information, I really couldn't care less. It's the fact that some of these are clearly the same people creating hundreds of emails to just keep resubscribing to a free trial.

I could tell because there were emails with the sequence of:

1@xyz.com 2@xyz.com ... etc


Low trust world, I guess. If I'm poking around to see whether I like something or not, no way am I giving it a real email.


I've become even more drastic: if you want me to sign in at all before I can try out your product "for free", you already have one less potential customer.


Not much choice with a GPU backed service. Way too expensive to not have at least a verified email filter in front of it, and Google is the simplest most "universal" plug and play OIDC provider.


Sure, there's plenty good reasons from the service provider's perspective. However, there's not a lot of good reasons from my perspective.


Yeah it’s not really free if you require me to give you things in exchange is it.


Ya it's really the Google-as-only-option problem. I immediately close the window when I see that and move on.

Are there any "privacy-first" SSO options? Maybe Proton or Mullvad or some similarly positioned company should think about this. Or maybe it's just way too hard to break into the Google/Apple duopoly.


Using SSO permits faster launch.


It’s annoying when it’s Google only though.

If you’re going to offer SSO, at least take the lowly amount of time to add some other providers. I genuinely don’t believe anyone is launching so quickly that they don’t have time to add a second or third option.


I totally agree, I hate it. Just answer the question of why people do this.


Monkey paw curls, Twitter and Facebook are added.


Using zero sign in options permits an even faster launch! See if your CEO will go for that option!


Maybe if you have skill issues.


No, using SSO simply let's you worry about one less thing.


It also provides decent spam prevention, and high probability people will check the associated email address


I think Kling looks more impressive:

https://kling.kuaishou.com/

Roughly on the level of OpenAI's Sora.



This is very similar to the open source project https://pollinations.ai/ which I'm involved in.


Am I missing something? The AI video examples on this just look like non-temporally stable image gen frames with the wobbly/morphy look. I don't think the comparison is there to what Luma shipped.


The problem for me is that they took our design, styles, fonts, colors, gradients, and name for a very similar product and shipped it with quite an impressive team that is well-funded by A16z.

Their product is much slicker than what we are doing with Pollinations, of course. We are a few people working on it in our spare time.

It would have just been nice to get a kind of mention or have someone reach out. I'm all for getting inspired and copying.

The current state of the Pollinations site is very different. The Luma Labs page looks like what Pollinations had on its site about nine months ago when the Dream Machine was a more significant focus of ours.

I could get into the weeds and show side-by-side screenshots but it seems like the wrong direction to invest time. The person who did our design didn't think it was very funny.

In my opinion Gen AI startups should be extremely careful in crediting because the whole copyright thing is already such a hot potato with artists and designers.


If it's any consolation, the Luma website is so laggy and poorly-designed that I don't think I could recommend their service simply as a matter of principle. The Pollinations site is much more static and doesn't have any distracting or finicky on-scroll effects, which at least makes it enjoyable to read.


Does Pollinations do a good job of crediting the artists and designers whose work has be used for training data?


We're giving free access to a model that other people trained on other people's data. The code for all improvements or tweaks is accessible in public repositories. From a copyright perspective, I think one can make a fair use argument.

Nevertheless, it would be great to credit artists and designers whose work was used for training. Practically, it's not easy because I think one would have to credit humanity.

Every image in the training data influenced the model's output to a certain degree, so one would have to credit every person who uploaded an image to a public place.

One could argue that a model should not be trained on public data in the first place because we can't fairly credit all individuals who contributed to it.


Well good luck finding sympathy for having people copy your stuff.


I found it quite similar, even the name 'DREAMACHINE' Do you think they are aware of Pollinations.ai?


Like the alternatives, it appears to be limited to adding forced movement (shift each frame by X, Y) and only the most predictable motion, like moving fog, water or sand.

These constraints (and most of the morphing) come from these type of image generators always going for the most plausible interpretation of noise. They don't have any actual temporal coherence yet.

It's very cool, but fairly limited in practical application with the current capabilities.


Very impressive.

But also I LOVE that they put "Janus" under current limitations. Made me laugh and earned my respect.


That's actually a term that is used ofter in papers to refer to this phenomenon.


What does it mean?


It means when a character turns, sometimes another face shows on the other side where the back of a head or a rear end should be.


Two headed guy from Roman Mythology: https://en.wikipedia.org/wiki/Janus


That polar bear has two heads.


Where is Jessica Hyde?


Here's a small compilation of clips, some are actually pretty good but a lot are strange: https://youtu.be/q_s3gKFLkEc


I really wish AI generators displayed the prompt used to generate content so that others can learn from it. E.g, https://storage.cdn-luma.com/lit_lite_inference_text2vid_v1.... was generated from "A gray and silver Maine Coon playing with an android" and it's unclear how to make it as good as the official demos.


Prompt it with an image from midjourney and that prompt.


Awesome video quality hahah, the frogman wearing a leather jacket was on point! But apparently they said themsellves that accurate text rendering has a long way to go

Also found these guys providing API for Dream Machine: https://piapi.ai/dream-machine-api, would be interesting to see what developers can do with this in their own chat bot!


Reminder that "This is the worst version of the AI video creation we'll have from here on out."

As we mock the new Will Smith Eating Spaghetti video, every update is an improvement on where we're at and a new, higher baseline of the worst functionality.


Doesn't mean it'll get any better though. Most technologies and algorithms have an end point. People thought flying cars were inevitable and imminent because after all, cars got so much better and feature-rich within a very short timespan. And then innovation practically stopped because there's a limit to possible energy storage and utilisation unless some magical new innovation comes along. But it hasn't.

These generated videos are exactly the same nonsense as the spaghet videos, they've just got more data crammed in there. But nothing about them is consistent.


Flying cars do exist, you're just not allowed to drive them.


Anything other than short distance prototypes? Please share because I’m not aware of any


I suspect that this is a compute tradeoff they decided to make. Uses much less compute than SORA, so feasible to scale up for the public, but that much less coherent. If you look at OpenAI's technical report for SORA they show examples with "base compute" and "4x compute". This output looks like an estimated "2x" to me.

I wonder if they are really talking about the model size.


I mean, there could still be worse ones that come out; the recent Stable Diffusion 3 release seems to show that sometimes regressions do happen.


That's the version they released as an "open" model. Their SD3 model behind an API works just fine, the released medium model has been basically been lobotomized to hell and back in the name of "safety", because apparently human anatomy is icky and nipples are the devil. Unfortunately, that sort of heavy handed model lobotomizing leads to nasty side effects that turn even innocent renders of people into Eldritch nightmares as the model has been forced to forget that a human doesn't have limbs growing out of random places.


I don't even want video, why hasn't someone launched image generation with a consistency between images yet?

OpenAi demoed it but they haven't launched it


Look up Story Diffusion.


I like that they put limitations next to features - this should be a standard for transparent communications


The giant pink boar has a third back leg ... but yea looks impressive overall


Not even a good try. Let's wait till sora comes up.


has a lot of the same flaws as sora, just more prominently than the demos Open AI showed. its just that Sora may be good enough




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: