Personally, this feels like the direction scraping should move in: from defining how to extract to defining what to extract. But we're nowhere near that (yet).
A few other thoughts from someone who did his best to implement something similar:
1) I'm afraid this is not even close to cost-effective yet. One CSS rule vs. a whole LLM. A first step could be moving the LLM to the client side, reducing costs and latency.
2) As with every other LLM-based approach so far, this will just hallucinate results if it's not able to scrape the desired information.
3) I feel that providing the model with a few examples could be highly beneficial, e.g. /person1.html -> name: Peter, /person2.html -> name: Janet (see the prompt sketch after this list). When doing this myself, I tried my best to define meaningful interfaces.
4) Scraping has more edge cases than one can imagine, one example being nested lists or dicts or mixes thereof. See the test cases in my repo. This is where many libraries/services already fail.
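To make point 3 concrete, here is roughly the kind of few-shot prompt I have in mind. The pages, field names, and helper are made up for illustration and not tied to any particular library:

    import json

    # Two labeled examples (URL, HTML, expected output), then the target page.
    # Hypothetical data; the point is the shape of the prompt.
    EXAMPLES = [
        ("/person1.html", '<div class="bio"><h1>Peter</h1></div>', {"name": "Peter"}),
        ("/person2.html", '<div class="bio"><h1>Janet</h1></div>', {"name": "Janet"}),
    ]

    def build_prompt(target_html):
        parts = ["Extract JSON with the same shape as the examples. Output only JSON."]
        for url, html, expected in EXAMPLES:
            parts.append(f"Page {url}:\n{html}\nOutput: {json.dumps(expected)}")
        parts.append(f"Page to scrape:\n{target_html}\nOutput:")
        return "\n\n".join(parts)

    print(build_prompt('<div class="bio"><h1>Ada</h1></div>'))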
If anyone wants to check out my (statistical) attempt to automatically build a scraper by defining just the desired results:
https://github.com/lorey/mlscraper
I was most worried about #2, but I've been surprised by how much the temperature setting seems to have gotten that under control in my cases. The author added a HallucinationChecker for this but said on Mastodon he hasn't found many real-world cases to test it with yet.
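To be clear about what I mean by "temperature": I pin the sampling temperature to 0 so the model returns its most likely tokens instead of sampling. With the OpenAI chat completions API (pre-1.0 Python client) that looks roughly like this; the model name and prompt are placeholders:

    import openai

    prompt = "Extract the name as JSON from: <h1>Peter</h1>"

    # temperature=0 makes the output (near-)deterministic; in my cases this
    # reduced invented values, though it is no guarantee against them.
    response = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    print(response["choices"][0]["message"]["content"])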
Regarding 3 & 4:
Definitely take a look at the existing examples in the docs; I was particularly surprised at how well it handled nested dicts and the like. (Not to say there aren't tons of cases it won't handle; GPT-4 is just astonishingly good at this task.)
Your project looks very cool too btw! I'll have to give it a shot.
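For anyone who hasn't run into point 4 yet, this is the kind of mixed nesting that flat per-field CSS rules can't express. The schema here is hypothetical, not any library's actual format:

    # One profile page can yield a record like this: a list of dicts,
    # each of which contains another list. One selector per top-level
    # field stops being enough as soon as this shows up.
    expected = {
        "name": "Peter",
        "employers": [
            {"company": "Acme", "years": [2019, 2020]},
            {"company": "Initech", "years": [2021]},
        ],
    }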
This seems like part of the problem we're always complaining about: hardware keeps getting better and better, but software keeps getting more and more bloated, so performance actually goes down.
Yeah, it seems like it would make way more sense to have an LLM output the CSS rules. Or maybe output something slightly more powerful, but still cheap to compute.
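Something like this, as a rough sketch: pay for one LLM call per site to get a selector, then keep the model out of the hot path entirely. The selector and page here are invented for illustration:

    from bs4 import BeautifulSoup

    # Imagine a one-time LLM call per site: "given this sample page, return
    # a CSS selector for the person's name". Cache its answer, e.g.:
    selector = "div.bio > h1"  # hypothetical answer from that one call

    # Every subsequent page is then a cheap local lookup, no model involved.
    def extract_name(html):
        node = BeautifulSoup(html, "html.parser").select_one(selector)
        return node.get_text(strip=True) if node else None

    print(extract_name('<div class="bio"><h1>Peter</h1></div>'))  # Peter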