This model is a LOT of fun. It's absolutely tiny - just a 241MB download - and s...

roughly · 2025-08-14T17:47:35 1755193655

I audibly laughed at this one: https://gist.github.com/simonw/25e7b7afd6a63a2f15db48b3a51ec... where it generates a… poem? Song? And then proceeds to explain how each line contributes to the SVG, concluding with:

> This SVG code provides a clear and visually appealing representation of a pelican riding a bicycle in a scenic landscape.

icoder · 2025-08-15T10:21:33 1755253293

This reminds me of my interactions lately with ChatGPT where I gave into its repeated offer to draw me an electronics diagram. The result was absolute garbage. During the subsequent conversation it kept offering to include any new insights into the diagram, entirely oblivious to its own incompetence.

0x00cl · 2025-08-14T17:43:42 1755193422

I see you are using ollamas ggufs. By default it will download Q4_0 quantization. Try `gemma3:270m-it-bf16` instead or you can also use unsloth ggufs `hf.co/unsloth/gemma-3-270m-it-GGUF:16`

You'll get better results.

simonw · 2025-08-14T17:55:49 1755194149

Good call, I'm trying that one just now in LM Studio (by clicking "Use this model -> LM Studio" on https://huggingface.co/unsloth/gemma-3-270m-it-GGUF and selecting the F16 one).

(It did not do noticeably better at my pelican test).

Actually it's worse than that, several of my attempts resulted in infinite loops spitting out the same text. Maybe that GGUF is a bit broken?

danielhanchen · 2025-08-14T19:51:21 1755201081

Oh :( Maybe the settings? Could you try

temperature = 1.0, top_k = 64, top_p = 0.95, min_p = 0.0

canyon289 · 2025-08-14T20:41:52 1755204112

Daniel, thanks for being here providing technical support as well. Cannot express enough how much we appreciate your all work and partnership.

danielhanchen · 2025-08-14T21:37:26 1755207446

Thank you and fantastic work with Gemma models!

simonw · 2025-08-14T22:28:38 1755210518

My topping only lets me set temperature and top_p but setting them to those values did seem to avoid the infinite loops, thanks.

danielhanchen · 2025-08-14T23:52:47 1755215567

Oh fantastic it worked! I was actually trying to see if we can auto set these within LM Studio (Ollama for eg has params, template) - not sure if you know how that can be done? :)

JLCarveth · 2025-08-14T22:07:34 1755209254

I ran into the same looping issue with that model.

danielhanchen · 2025-08-14T23:53:15 1755215595

Definitely give

temperature = 1.0, top_k = 64, top_p = 0.95, min_p = 0.0

a try, and maybe repeat_penalty = 1.1

Patrick_Devine · 2025-08-15T17:05:39 1755277539

We uploaded gemma3:270m-it-q8_0 and gemma3:270m-it-fp16 late last night which have better results. The q4_0 is the QAT model, but we're still looking at it as there are some issues.

ertgbnm · 2025-08-14T17:24:02 1755192242

He may generate useless tokens but boy can he generate ALOT of tokens.

TheJoeMan · 2025-08-14T17:46:18 1755193578

Can he draw an "alot" made of tokens? https://hyperboleandahalf.blogspot.com/2010/04/alot-is-bette...

lucb1e · 2025-08-14T17:30:48 1755192648

He? I know some Gemmas and it's distinctly a female name; is Gemma a boy's name where you're from?

ertgbnm · 2025-08-14T17:37:43 1755193063

I don't really gender LLMs in my head in general. I guess Gemma is a female name. I only gendered it in the joke because I think it makes it funnier, especially since it's just "a little guy". I know they are giving gendered names to these models now but I think it's a bit weird to gender when interacting with them.

layer8 · 2025-08-14T19:40:48 1755200448

Doesn’t the “M” in “Gemma 3 270M” Stand for “male”?

Also: https://en.wikipedia.org/wiki/Gemma_Frisius

avarun · 2025-08-14T20:31:40 1755203500

Not sure if that’s a serious question but it stands for “million”. As compared to 1B+ models, where the B stands for “billion” parameters.

jgalt212 · 2025-08-14T17:34:31 1755192871

Perhaps the poster we referring to Simon not Gemma.

not_a_bot_4sho · 2025-08-15T03:04:34 1755227074

> ALOT

'Alot' is not a word. (I made this mistake a lot, too.)

layer8 · 2025-08-14T17:33:35 1755192815

> It's absolutely tiny - just a 241MB download

That still requires more than 170 floppy disks for installation.

freedomben · 2025-08-14T19:37:39 1755200259

Indeed. Requires over 3,000,000 punch cards to store. Not very tiny!

stikypad · 2025-08-14T22:43:25 1755211405

On the plus side, you can decompose your matrices for free using termites.

mdp2021 · 2025-08-14T19:42:48 1755200568

> For this one it decided to write a poem

My first try:

user: "When was Julius Caesar born"

response: "Julius Caesar was born in **Rome**"

Beautiful :D

(I do not mean to detract from it - but it's just beautiful. It will require more effort to tame it.)

mirekrusin · 2025-08-14T21:19:32 1755206372

Cutting number of parameters in half is like drinking a pint of beer.

stikypad · 2025-08-14T22:44:10 1755211450

I think you meant vodka.

marinhero · 2025-08-14T17:17:22 1755191842

Serious question but if it hallucinates about almost everything, what's the use case for it?

simonw · 2025-08-14T17:28:55 1755192535

Fine-tuning for specific tasks. I'm hoping to see some good examples of that soon - the blog entry mentions things like structured text extraction, so maybe something like "turn this text about an event into an iCal document" might work?

turnsout · 2025-08-14T17:57:08 1755194228

Google helpfully made some docs on how to fine-tune this model [0]. I'm looking forward to giving it a try!

  [0]: https://ai.google.dev/gemma/docs/core/huggingface_text_full_finetune

CuriouslyC · 2025-08-14T17:57:59 1755194279

Fine tuning messes with instruction following and RL'd behavior. I think this is mostly going to be useful for high volume pipelines doing some sort of mundane extraction or transformation.

iib · 2025-08-14T19:19:57 1755199197

This is exactly the fine-tuning I am hoping for, or I would do if I had the skills. I tried it with gemma3 270M and vanilla it fails spectacularly.

Basically it would be the quickadd[1] event from google calendar, but calendar agnostic.

[1] https://developers.google.com/workspace/calendar/api/v3/refe...

striking · 2025-08-14T17:28:53 1755192533

It's intended for finetuning on your actual usecase, as the article shows.

zamadatix · 2025-08-14T17:29:27 1755192567

I feel like the blog post, and GP comment, does a good job of explaining how it's built to be a small model easily fine tuned for narrow tasks, rather than used for general tasks out of the box. The latter is guaranteed to hallucinate heavily at this size, that doesn't mean every specific task it's fine tuned to would be. Some examples given were fine tuning it to efficiently and quickly route a query to the right place to actually be handled or tuning it to do sentiment analysis of content.

An easily fine tunable tiny model might actually be one of the better uses of local LLMs I've seen yet. Rather than try to be a small model that's great at everything it's a tiny model you can quickly tune to do one specific thing decently, extremely fast, and locally on pretty much anything.

yifanl · 2025-08-14T17:54:49 1755194089

It's funny. Which is subjective, but if it fits for you, it's arguably more useful than Claude.

luckydata · 2025-08-14T17:46:29 1755193589

Because that's not the job it was designed to do, and you would know by reading the article.

mirekrusin · 2025-08-14T21:21:38 1755206498

The same as having a goldfish. You can train it to do a trick I guess.

deadbabe · 2025-08-14T17:30:18 1755192618

Games where you need NPCs to talk random jiberrish.

iLoveOncall · 2025-08-14T17:27:20 1755192440

Nothing, just like pretty much all models you can run on consumer hardware.

cyanydeez · 2025-08-14T17:28:13 1755192493

This message brought to you by OpenAI: we're useless, but atleast theres a pay gate indicating quality!

numpad0 · 2025-08-14T17:35:35 1755192935

robotic parrots?

rotexo · 2025-08-14T17:18:47 1755191927

An army of troll bots to shift the Overton Window?

ants_everywhere · 2025-08-14T17:27:10 1755192430

oh no now we'll never hear the end of how LLMs are just statistical word generators

nico · 2025-08-14T17:40:12 1755193212

Could be interesting to use in a RAG setup and also finetuning it

For sure it won’t generate great svgs, but it might be a really good conversational model

luckydata · 2025-08-14T17:46:00 1755193560

The article says it's not a good conversational model but can be used for data extraction and classification as two examples.

mdp2021 · 2025-08-14T17:21:21 1755192081

> For this one it decided to write a poem

Could it be tamed with good role-system prompt crafting? (Besides fine-tuning.)

campbel · 2025-08-14T17:17:56 1755191876

Do you take requests? We need to see how well this model works with some fine-tuning :D

bobson381 · 2025-08-14T20:41:46 1755204106

It's gonna be a customer service agent for Sirius Cybernetics. Share and enjoy!

Balinares · 2025-08-14T19:51:54 1755201114

This is like a kobold to the other models' dragons and I don't hate it. :)

aorloff · 2025-08-15T05:44:40 1755236680

Finally we have a model that's just a tad bit sassy

cyanydeez · 2025-08-14T17:27:28 1755192448

the question is wheather you can make a fine tuned version and spam any given forum within an hour with the most attuned but garbage content.

volkk · 2025-08-14T17:22:25 1755192145

i was looking at the demo and reading the bed time story it generated and even there, there was confusion about the sprite and the cat. switched subjects instantly making for a confusing paragraph. what's the point of this model?