Hacker News

What do you think ChatGPT uses as training data?

The whole world’s “sh*tposting”: Reddit, blogs, and the rest of the internet.

But also books and Wikipedia and what not.

You can “smooth” all the crap out via the training procedure.

But even more, Slack can easily filter training data to, say, only posts in high-use channels.
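A channel-activity filter like that is easy to sketch. This is a hypothetical illustration only; the field names and the `MIN_MESSAGES` threshold are made up, not Slack's actual export schema or policy:

```python
from collections import Counter

# Hypothetical sketch: keep only messages from "high-use" channels,
# defined here as channels with at least MIN_MESSAGES posts.
MIN_MESSAGES = 3

messages = [
    {"channel": "engineering", "text": "deploy went fine"},
    {"channel": "engineering", "text": "rolled back v2"},
    {"channel": "engineering", "text": "postmortem doc is up"},
    {"channel": "random", "text": "lunch?"},
]

# Count posts per channel, then keep only messages from busy channels.
counts = Counter(m["channel"] for m in messages)
training_data = [m for m in messages if counts[m["channel"]] >= MIN_MESSAGES]

print(len(training_data))  # only the 3 engineering messages survive
```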

Further, Slack has other options: e.g., use customer data only for marginal fine-tuning.

Or, they don’t even know their use case yet - but want to wrap their arms around your data pronto.



What makes you think I don't shitpost in the #engineering channel?

And heuristics don't even scratch the surface of the bigger problem: it's trained on people who aren't great at their jobs but type a lot of words on Slack about circling back on KPIs.


I think those types of people are actually shockingly well paid. If Slack can make bots to replace them, they'll print money, right?


That's all well and good until something goes down and you need someone knowledgeable to diplomatically shout at a vendor.


how does the training procedure smooth the garbage out?


Through regularization techniques, data augmentation, robust loss functions, and gradient-based optimization, the training procedure pushes the model toward broadly shared patterns and reduces overfitting to noise.
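As a toy illustration of the regularization point only (a sketch, not how any production LLM is trained): an L2 penalty shrinks weights that exist mainly to fit the noise. Here ridge regression is compared against plain least squares on noisy data, using the closed-form solution w = (XᵀX + λI)⁻¹Xᵀy:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 20, 15                       # few samples, many features -> easy to overfit
X = rng.normal(size=(n, d))
true_w = np.zeros(d)
true_w[0] = 1.0                     # only one feature actually matters
y = X @ true_w + 0.5 * rng.normal(size=n)   # signal plus noise

def fit(X, y, lam):
    # Closed-form ridge solution: w = (X^T X + lam * I)^-1 X^T y
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

w_plain = fit(X, y, lam=0.0)        # fits the noise along with the signal
w_ridge = fit(X, y, lam=10.0)       # penalty damps the noise-driven weights

# The regularized weight vector is smaller: less capacity spent on noise.
print(np.linalg.norm(w_plain), np.linalg.norm(w_ridge))
```

The same intuition carries over to the thread's question: the penalty doesn't identify which examples are garbage, it just discourages the model from memorizing idiosyncratic noise.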


It’s not obvious how any of those would do anything but better approximate the average of a noisy dataset. RLHF might help, but only if it’s not done by idiots.


ChatGPT isn't known for its accuracy though, is it? The term "hallucination" caught on because it is wrong so much.



