Somewhere between "base compute" and "4x compute".
So maybe you "just" need to know how to build a certain type of diffusion transformer and then train it on a ton of videos, with an adequate amount of compute. Which is probably a LOT, for both training and inference, if you want more realistic results.
I've launched apps with free trials in the past, and when I allowed email sign-ups there were 100 fake emails from temporary email services for every real one. I had to do extra work to block them, since they were eating up the resources meant for paying users.
If you have an extensive demo page covering my use case, I would prefer a low-cost entry offer over signing in with my Google account. Upgrading to a more expensive option later is usually not a hard sell for me when I really love the product.
Recently someone informed me that he would remove my e-mail from his newsletter mailing list because it seemed to be a dead address (and every address cost him money). I replied that I just don't like being tracked (implying that I had in fact read the newsletter), and we had a nice conversation. Not every fake e-mail is as fake as it seems ;-)
Apparently, Thunderbird can be quite good at defeating e-mail tracking.
Doesn't this just further the point that your potential customers don't want to identify themselves to try your product? Punting on anti-sybil or anti-spam by requiring a strongly identity-linked account (Google accounts can't be created from VPNs or without phone numbers, for instance) is user-hostile.
If your free users are using up paid user resources, that's a different problem.
One of the nice parts about starting a business is that you get to choose your customers. There is no ethical or moral rule that you must provide access to everyone who wants to use your app. You are even allowed to make decisions that reduce your revenue in favor of other priorities.
The problem is not that I want people's real information; I really couldn't care less. It's that some of these are clearly the same people creating hundreds of emails just to keep resubscribing to the free trial.
I could tell because there were emails with the sequence of:
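A minimal sketch of the kind of filter this calls for (my own illustration, not the commenter's actual setup): block known disposable-email domains, normalize Gmail-style aliases so the same account can't resubscribe under `foo+trial2@` or `f.o.o@`, and flag sequential-looking local parts. The domain blocklist here is a tiny hypothetical sample; real services maintain lists with thousands of entries.

```python
import re

# Hypothetical sample; real disposable-domain blocklists are much larger.
DISPOSABLE_DOMAINS = {"mailinator.com", "guerrillamail.com", "10minutemail.com"}

def normalize(email: str) -> str:
    """Collapse Gmail-style aliases so foo+trial2@gmail.com and
    f.o.o@gmail.com map to the same underlying account."""
    local, _, domain = email.lower().partition("@")
    local = local.split("+", 1)[0]          # strip +tag aliases
    if domain in ("gmail.com", "googlemail.com"):
        local = local.replace(".", "")      # Gmail ignores dots
    return f"{local}@{domain}"

def is_suspicious(email: str, seen: set[str]) -> bool:
    """Flag disposable domains, sequential-looking locals (trial01,
    trial02, ...), and repeats of an already-seen normalized address."""
    local, _, domain = email.lower().partition("@")
    if domain in DISPOSABLE_DOMAINS:
        return True
    if re.search(r"\d{2,}$", local.split("+", 1)[0]):
        return True
    norm = normalize(email)
    if norm in seen:
        return True
    seen.add(norm)
    return False
```

In practice the `seen` set would live in a database keyed on the normalized address, so resubscription attempts are caught across sessions.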
I've become even more drastic: if you want me to sign in at all before I can try out your product "for free", you already have one less potential customer.
Not much choice with a GPU backed service. Way too expensive to not have at least a verified email filter in front of it, and Google is the simplest most "universal" plug and play OIDC provider.
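The gate described here boils down to a small check once an OIDC library has validated the Google ID token's signature: trust Google's `email_verified` claim instead of running your own confirmation-email flow. A minimal sketch (the function and claim handling are my own illustration, not a specific framework's API; the issuer values are the ones Google's OIDC discovery document lists):

```python
# Both forms of the issuer appear in Google-issued ID tokens.
ALLOWED_ISSUERS = {"https://accounts.google.com", "accounts.google.com"}

def allow_trial(claims: dict) -> bool:
    """Gate free-trial access on the claims of a signature-verified
    ID token: right issuer, and an address Google says it verified."""
    if claims.get("iss") not in ALLOWED_ISSUERS:
        return False
    if not claims.get("email_verified", False):
        return False
    return bool(claims.get("email"))
```

The point is that the expensive GPU path only ever sees accounts Google has already vouched for, with no home-grown verification emails to maintain.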
Ya it's really the Google-as-only-option problem. I immediately close the window when I see that and move on.
Are there any "privacy-first" SSO options? Maybe Proton or Mullvad or some similarly positioned company should think about this. Or maybe it's just way too hard to break into the Google/Apple duopoly.
If you're going to offer SSO, at least take the small amount of time to add some other providers. I genuinely don't believe anyone is launching so quickly that they don't have time to add a second or third option.
Am I missing something? The AI video examples on this just look like non-temporally stable image gen frames with the wobbly/morphy look. I don't think the comparison is there to what Luma shipped.
The problem for me is that they took our design, styles, fonts, colors, gradients, and name for a very similar product and shipped it with quite an impressive team that is well-funded by A16z.
Their product is much slicker than what we are doing with Pollinations, of course. We are a few people working on it in our spare time.
It would have just been nice to get a kind of mention or have someone reach out. I'm all for getting inspired and copying.
The current state of the Pollinations site is very different. The Luma Labs page looks like what Pollinations had on its site about nine months ago when the Dream Machine was a more significant focus of ours.
I could get into the weeds and show side-by-side screenshots but it seems like the wrong direction to invest time. The person who did our design didn't think it was very funny.
In my opinion Gen AI startups should be extremely careful in crediting because the whole copyright thing is already such a hot potato with artists and designers.
If it's any consolation, the Luma website is so laggy and poorly-designed that I don't think I could recommend their service simply as a matter of principle. The Pollinations site is much more static and doesn't have any distracting or finicky on-scroll effects, which at least makes it enjoyable to read.
We're giving free access to a model that other people trained on other people's data. The code for all improvements or tweaks is accessible in public repositories. From a copyright perspective, I think one can make a fair use argument.
Nevertheless, it would be great to credit artists and designers whose work was used for training. Practically, it's not easy because I think one would have to credit humanity.
Every image in the training data influenced the model's output to a certain degree, so one would have to credit every person who uploaded an image to a public place.
One could argue that a model should not be trained on public data in the first place because we can't fairly credit all individuals who contributed to it.
Like the alternatives, it appears to be limited to adding forced movement (shift each frame by X, Y) and only the most predictable motion, like moving fog, water or sand.
These constraints (and most of the morphing) come from this type of image generator always going for the most plausible interpretation of noise. They don't have any actual temporal coherence yet.
It's very cool, but fairly limited in practical application with the current capabilities.
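The "shift each frame by X, Y" forced movement mentioned above can be illustrated with a toy example: translate one still image a few pixels per frame and you get apparent camera motion with zero temporal understanding, since nothing new is ever synthesized. A sketch using numpy (my own illustration of the general idea, not any particular generator's pipeline):

```python
import numpy as np

def fake_pan(frame: np.ndarray, n_frames: int, dx: int = 2, dy: int = 1):
    """Toy forced movement: build n_frames by shifting one still image
    dx/dy pixels per frame (wrapping at the edges). The 'video' is pure
    translation of existing pixels; no content is generated."""
    return [np.roll(frame, shift=(t * dy, t * dx), axis=(0, 1))
            for t in range(n_frames)]

still = np.arange(64, dtype=np.uint8).reshape(8, 8)  # stand-in image
clip = fake_pan(still, n_frames=4)
```

Real tools dress this up with depth-aware warping and inpainting of the revealed edges, but the underlying motion is still imposed on the image rather than understood by the model.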
I really wish AI generators displayed the prompt used to generate content so that others could learn from it. E.g., https://storage.cdn-luma.com/lit_lite_inference_text2vid_v1.... was generated from "A gray and silver Maine Coon playing with an android", and it's unclear how to make it as good as the official demos.
Awesome video quality hahah, the frogman wearing a leather jacket was on point! But apparently they said themselves that accurate text rendering still has a long way to go.
Also found these guys providing an API for Dream Machine: https://piapi.ai/dream-machine-api. It would be interesting to see what developers can do with this in their own chatbots!
Reminder that "This is the worst version of the AI video creation we'll have from here on out."
As we mock the new Will Smith Eating Spaghetti video, every update is an improvement on where we're at and a new, higher baseline of the worst functionality.
Doesn't mean it'll get any better though. Most technologies and algorithms have an end point. People thought flying cars were inevitable and imminent because after all, cars got so much better and feature-rich within a very short timespan. And then innovation practically stopped because there's a limit to possible energy storage and utilisation unless some magical new innovation comes along. But it hasn't.
These generated videos are exactly the same nonsense as the spaghet videos, they've just got more data crammed in there. But nothing about them is consistent.
I suspect that this is a compute tradeoff they decided to make. Uses much less compute than SORA, so feasible to scale up for the public, but that much less coherent. If you look at OpenAI's technical report for SORA they show examples with "base compute" and "4x compute". This output looks like an estimated "2x" to me.
I wonder if they are really talking about the model size.
That's the version they released as an "open" model. Their SD3 model behind an API works just fine; the released Medium model has basically been lobotomized to hell and back in the name of "safety", because apparently human anatomy is icky and nipples are the devil. Unfortunately, that sort of heavy-handed model lobotomizing leads to nasty side effects that turn even innocent renders of people into Eldritch nightmares, as the model has been forced to forget that a human doesn't have limbs growing out of random places.
I asked it for "A dog catching a treat in slow motion, its chops flapping around comically"
https://imgur.com/a/dog-catching-treat-slow-motion-its-chops...