OK, I'll play. Let's say that reinforcement learners (algorithms/strategies/agents in Reinforcement Learning) have some property of 'consciousness' similar to a human's.
A 'reinforcement learner' gets positive or negative feedback and adjusts its strategy away from negative feedback and towards positive feedback. As humans, we have several analogs to this process.
One could be physical pain... if you put your hand on a stove, a whole slew of neural circuitry comes up to try and pull your hand away. Another could be physical pleasure, you get a massage and lean in to the pressure undoing the knots because it's pleasurable.
If we look at it from this angle, then if we're metaphorically taking the learner's hand and putting it continuously on the stove, this would be problematic. If we're giving it progressively less enjoyable massages, this would be a bit different.
Even more different still is the pain you feel from, say, setting up an experiment and finding your hypothesis is wrong. It 'hurts' in some ways (citation needed, but I think I've seen studies that show at least some of the same circuitry fires during emotional pain as during physical pain), but putting a human in a situation where they're continuously testing hypotheses is different from a situation where their hands are being continuously burned on a hot stove.
I think, then, that the problems (as alluded to here) are:
- how can we confirm or deny there is some kind of subjective experience that the system 'feels'?
- if we can confirm it, how can we measure it against the 'stove' scenario or any other human analogue?
- if the above can be measured and it turns out to be a negative human scenario, can we move it to one of the other scenarios?
- even if it's a 'pleasurable' or arguably 'less painful' scenario, do we have any ethical right to create such scenarios and sentiences who experience them in the first place?
I think this argument would have to conclude that training any RL agent at all is unethical, since updating its weights 'away' from some stimulus could be considered pain.
I think what I'm getting at is that there are different types of 'pain', some that we'd consider ethical and some that we wouldn't.
Constantly subjecting a person (or some abstract simulation that responds like a person) to the equivalent of continuous bodily pain would be deeply unethical, but, say, giving a person clues towards solving a puzzle would be considered less so.
Overall, I think you're right, though: if we somehow discovered we've created simulated people (or sentient beings), then we probably shouldn't use them to solve arbitrary problems.
RL agents are exposed to millions upon millions of stimuli, almost all of which will be 'painful' at least initially (according to the website's definition of pain). I think for children negative stimuli are not all painful; pain is a small subset of negative stimuli, the ones where damage is being done.
I think children are constantly experiencing rewards and punishments of a sort analogous to reinforcement learning. Until a certain age their brain is hardwired to want to please their parents, so they pay a lot of attention to the parents' faces, and find reward in smiles and punishment in frowns. After that, they are driven by a certain amount of ego, and find reward in self-accomplishment and punishment in being told what to do.