To the sibling comments: Apple holds <10% of the worldwide PC market: https://www.gartner.com/en/newsroom/press-releases/2025-01-1...

Congrats on the move to Google!

Please allow me to rant to someone who can actually do something about this.

Vertex AI has been a nightmare for simply signing up, linking a credit card, and starting to use Claude Sonnet (now available on Vertex AI).

The sheer number of steps required for this (failed) user journey is dizzying:

* AI Studio, get API key

* AI Studio, link payment method: auto-creates a GCP project, which is nice

* Punts to GCP to actually create the payment method and link it to the GCP project

* Try to use API key in Claude Code; need to find model name

* Look around to find the actual model name; discover it is only deployed in some regions (thankfully, the project was created in the correct region)

* Specify the new endpoint and API key; Claude Code throws API permission errors

* Search around Vertex and find two different places where the model must be provisioned for the account

* Need to fill out a form to get approval to use Claude models on GCP

* Try Claude Code again, fails with API quota errors

* Check Vertex to find out the default quota for Sonnet 4.5 is 0 TPM (why is this a reasonable default?)

* Apply for quota increase to 10k tokens/minute (seemingly requires manual review)

* Get rejection email with no reasoning

* Apply for quota increase to 1 token/minute

* Get rejection email with no reasoning

* Give up

Then I went to Anthropic's own site. Here's what that user journey looks like:

* console.anthropic.com, get API key

* Link credit card

* Launch Claude Code, specify API key

* Success

I don't think Claude Code is even getting preferential treatment here, since the API key works happily in OpenCode as well.
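
For anyone else stuck at the same point, here's roughly what the two paths look like through Anthropic's Python SDK. The project ID, region, and model IDs below are placeholders, and the Vertex path additionally assumes you've already cleared the provisioning and quota hurdles above:

    # pip install "anthropic[vertex]"
    from anthropic import Anthropic, AnthropicVertex

    # Vertex path: needs a GCP project, application-default credentials,
    # model provisioning, and nonzero quota before this will work.
    vertex = AnthropicVertex(project_id="my-gcp-project", region="us-east5")
    # (Calls go through vertex.messages.create(...) the same way as below,
    # but with an "@"-date-suffixed Vertex model ID.)

    # Direct path: just an API key from console.anthropic.com.
    direct = Anthropic(api_key="sk-ant-...")

    msg = direct.messages.create(
        model="claude-sonnet-4-5",  # placeholder model ID
        max_tokens=256,
        messages=[{"role": "user", "content": "Say hello"}],
    )
    print(msg.content[0].text)

(Claude Code itself also has a Vertex mode, via CLAUDE_CODE_USE_VERTEX=1 plus project/region environment variables, but it runs into the same provisioning and quota walls.)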


You went further with GCP than I did. Support repeatedly asked me to contact some kind of Google sales team.

I get the feeling GCP is not good for individuals like me. My friends who work with enterprise cloud have a very high opinion of its tech stack.


> I get the feeling GCP is not good for individuals like me.

Google isn't good for individuals at all. Unless you've got a few million followers or get lucky on HN, support is literally non-existent. Anyone who builds a business on Google is nuts.


I'd like to state that AWS, in contrast, has been great to me as an individual. The two times I needed to speak to a human, I had one on the phone resolving my issue. And both issues were due to me making mistakes - on my small personal account.

I propose a new benchmark for agentic AI... be able to sign up for a Google service...

Yes, it’s extremely complicated. I gave up on Firebase for one project because I could not figure out how to get the right permissions set up, and my support request resulted in someone copying and pasting a snippet from the instructions that I obviously had not understood in the first place.

It’s also extremely cumbersome to sign up for Google AI. The other day I tried to get DeepSeek working via Google’s hosted offering and gave up after about an hour. The form just would not complete without an error, and there was no useful message to work with.

It would seem that in today’s world of AI assistance, Google could set up an assistant to help users do the simplest things. Why not just let the agent direct the user to the correct forms and let the user press submit?


Oh man, I've been playing with GCP's Vertex AI endpoints, and this is so representative of my experience. It's actually bananas how difficult it is, even compared to other GCP endpoints.

And then you actually use it! I dare someone to try to get a Gemini Live app on Vertex working.

I wonder if OpenOMF has the same limits.

It's more of a keyboard thing and less of a software thing.

This is so neat looking. Is there an equivalent for macOS?


Not exactly afaik, but I've recently been going to System Settings > Accessibility > Display, and turning on:

    Increase contrast
    Reduce transparency
    Differentiate without color
    Show toolbar button shapes
https://imgur.com/a/DqfN07k

I like the retro and simple vibe compared to the new Liquid Glass controls.


Ah! Thank you! Even on Sequoia this is a massive improvement!


Great, glad to help. FYI, there are similar settings for iOS, and I do the same on my phone.


At 0:52 in their demo video, there is a grammatical inconsistency in the agent's text output. I therefore suspect the annotations in the video were created by humans after the fact. Is Google up to their old marketing/hyping tricks again?

> SIMA 2 Reasoning:

> The user wants me to go to the ‘tomato house’. Based on the description ‘ripe tomato’, I identify the red house down the street.


I can't speak to the content of the actual game being played, but it wouldn't surprise me if there was an in-game text prompt:

> "The house that looks like a ripe tomato!"

that was transformed into a "user prompt" in a more instructional format:

> "Go to the tomato house"

And both were used in the agent output. At least the Y-axes on the graphs look more reasonable than in some other recent benchmarks.


The scene just before the one you describe shows the user writing "ripe tomato" in the description - you can see it in the video. The summary elides it, but the "ripe tomato" instruction is also clearly part of the context.


Very much this.

You are better off asking it to write a script that invokes itself N times across the task list.


Same. I think there’s an untapped market (a feature, really) here, which, if it isn’t solved by GPT-next, will start to reveal itself as a problem more and more.

LLMs are really bad at being comprehensive in general, and from one inference to the next their comprehensiveness varies wildly. Because LLMs are surprising the hell out of everyone with their abilities, less attention is paid to this; they can do a thing well, and for now that’s good enough. As we scale usage, I expect this gap will become more obvious and problematic (unless it is solved in the model, like everything else).

A solution I’ve been toying with is something like a reasoning step, which could probably be done with mostly classical NLP, that identifies constraints up front and guides the inference to meet them. Like a structured output but at a session level.
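
A minimal sketch of what I mean; the constraint extraction and the satisfaction check are toy placeholders, and "llm" stands for any prompt-to-text callable:

    import re

    def extract_constraints(prompt: str) -> list[str]:
        # Toy stand-in for the classical-NLP step: grab "must ..." clauses.
        return re.findall(r"must [^.;\n]+", prompt, flags=re.IGNORECASE)

    def satisfied(constraint: str, output: str) -> bool:
        # Placeholder check; a real version would be constraint-specific.
        return constraint.lower() in output.lower()

    def guided_inference(llm, prompt: str, max_rounds: int = 3) -> str:
        # Identify constraints up front, then steer retries toward them.
        constraints = extract_constraints(prompt)
        output = llm(prompt)
        for _ in range(max_rounds):
            missing = [c for c in constraints if not satisfied(c, output)]
            if not missing:
                break
            output = llm(prompt + "\n\nYour last answer missed: "
                         + "; ".join(missing) + ". Address these explicitly.")
        return output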

I am currently doing what you suggest, though: I have the agent create a script which invokes … itself … until the constraints are met. But that obviously requires that I stay engaged; I think it could be done autonomously, with much better consistency (at the end of the day, even that guiding hand is inference-based and therefore subject to the same challenges).
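
The driver script the agent writes for itself ends up looking something like this; the "agent" CLI name, its flags, and the sentinel string are all invented stand-ins for whatever you actually run:

    import json
    import subprocess

    with open("tasks.json") as f:
        tasks = json.load(f)  # one entry per unit of work

    for task in tasks:
        for attempt in range(5):  # bound the retries per task
            result = subprocess.run(
                ["agent", "--prompt", task["prompt"]],  # stand-in CLI
                capture_output=True, text=True,
            )
            if "ALL CONSTRAINTS MET" in result.stdout:  # invented sentinel
                task["done"] = True
                break

    with open("tasks.json", "w") as f:
        json.dump(tasks, f, indent=2)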


The "loaded question" approach works for getting MUCH better pro/con lists, too, in general, across all LLMs.


> if LLMs "knew" when they're out of their depth, they could be much more useful.

I used to think this, but I'm no longer sure.

Large-scale tasks just grind to a halt with more modern LLMs because of this perception of impassable complexity.

And it's not that they need extensive planning: the LLM knows what needs to be done (it'll even tell you!); it's just more work than will fit within a "session" (an arbitrary boundary), so it would rather refuse than get started.

So you're now looking at TODOs, and hierarchical plans, and all this unnecessary pre-work, even when the task would scale horizontally very well if it just jumped into it.


Is it really worth the time, though?

You are better off allowing your overworked neural pathways some much-needed rest.


That question was asked 8 years ago. Coincidence? I think not!

