My current dream is a model that's good at coding with a ~10m token content wind...

imiric · 2025-04-09T22:16:18 1744236978

Has anyone found the output at these large context windows usable at all?

IME the quality of all models goes down considerably after just a few thousand tokens. The chances of hallucinating, mixing up prompts, forgetting previous prompts, etc., are much more likely as context size increases. I couldn't imagine a context of 1M tokens, let alone 10M, being usable at all. Not to mention that any query is going to come to a crawl just to move that amount of data around (which still annoyingly happens on every query...).

So usually at around 10K tokens I ask it to summarize what was discussed, or I manually trim down the current state, and start a new fresh chat from there. I've found this to work much better than wasting my time fighting bad output. This is also cheaper if you're on a metered plan (OpenRouter, etc.).

vessenes · 2025-04-09T20:59:07 1744232347

The results are not mixed, Llama 4 is terrible at coding. I agree on longer context window being the dream.

ZeroCool2u · 2025-04-09T19:58:47 1744228727

I mean you get a 2 Million token context window and by far my favorite coding model with Gemini 2.5 Pro.

sva_ · 2025-04-09T20:54:24 1744232064

I just subscribed to the free trial yesterday, and I've been hooked tbh. I haven't subscribed to any of the other LLM companies so far. I hope something else comes out within a month because I really don't want to spend 22 Euro per month for it.

The 1M context window (2M?) really sets it apart.

simonw · 2025-04-09T21:22:58 1744233778

I believe you can still use Gemini 2.5 Pro for free via https://aistudio.google.com and their gemini-2.5-pro-exp-03-25 model ID through their API.

The free tier is "used to improve our products", the paid tier is not.

baq · 2025-04-09T21:13:10 1744233190

22 euro per month is less than 1 per day. Less than one espresso.

I get the subscription fatigue, but there are splurges and there are truly valuable things.

siva7 · 2025-04-09T20:12:39 1744229559

Has someone tried the 2m context window for a code base and can report how it compares over claude or o1?

khromov · 2025-04-09T22:15:53 1744236953

Made a video comparing Gemini 2.5 Pro to Claude Sonnet 3.7 recently: https://www.youtube.com/watch?v=AVdVJ_hD_vo

ZeroCool2u · 2025-04-09T20:18:18 1744229898

I mean I've tried it with Gemini 2.5 Pro + Roo and then tried Claude 3.7 + Roo on the same task and Gemini blew Claude away. Haven't spent anymore OpenRouter credits, because Gemini was so much better.

johnisgood · 2025-04-09T20:56:25 1744232185

Does Gemini have a web interface similar to claude.ai? I am lazy[1], but I am also poor. I would not be able to afford 100 USD per month.

[1] But if it is cheap enough, has large context window, then I might consider setting up something akin to claude.ai with Gemini's API.

ZeroCool2u · 2025-04-09T21:05:10 1744232710

Yeah AI Studio is free with decent rate limits, though obviously more developer focused: https://aistudio.google.com/

The official Gemini app works well for me too and there's a nice free tier and it's free if you have a newer Pixel phone. Otherwise $20/month for the Advanced tier. There's no $200/month option.

https://gemini.google.com/app

indigodaddy · 2025-04-09T21:17:01 1744233421

There's also Google's https://idx.dev - which is a webide/vscode dealio and you can use gemini in agentic mode (mix of 2.0/2.5 but if you use your own gemini key you can guarantee 2.5 Pro i think)

edit, well it now appears to be https://firebase.studio/ - that is a recent change I haven't used it since it changed its name..

johnisgood · 2025-04-09T21:15:30 1744233330

I mostly use LLMs on PC, as I use LLMs mainly for coding.

Does AI Studio allow you to have projects with project files and whatnot?

How about its context window length, more or less than Claude's?

I am also interested in open-source alternatives to the web interface that claude.ai has, I know there are some but I have forgotten their names, would be cool to have a list here.

simonw · 2025-04-09T21:25:44 1744233944

The best open source UI I know of is https://openwebui.com/ - you can point it at any OpenAI API compatible endpoint and both Gemini and Anthropic offer those now.

You can use the Gemini API for free with quite generous allowances, including for 2.5 Pro.

johnisgood · 2025-04-09T21:30:49 1744234249

Thanks Simon, will take a look.

Extremely off-topic: are you still around DS?

simonw · 2025-04-09T22:56:44 1744239404

johnisgood · 2025-04-10T09:53:39 1744278819

DarkScience's IRC server.

simonw · 2025-04-10T13:12:50 1744290770

Wow that takes me back! I've not been active on IRC in about a decade I'm afraid.

johnisgood · 2025-04-10T15:51:56 1744300316

So we have talked a decade ago?! Damn! I remember you from DS. :D

bionhoward · 2025-04-09T22:02:28 1744236148

AI studio is only developer focused if you’re not working on AI, which is a prohibited use case according to the Gemini API / AI Studio “Additional Terms”