Comment by echelon

Comment by echelon 5 hours ago

This is something I hadn't considered.

Today's role play and doomer fantasy will result in future models that are impossible to introspect and that don't let on about nefarious intent.

The alarmists cried wolf, so we taught the next generation of wolves to look like sheep.

randallsquared 3 hours ago

Right, but of course this is fundamentally a problem with the "training" approach as opposed to a hypothetical direct writing of weights. A model where the builder directly selects traits rather than trying to hammer them into shape will be more efficient and steerable, but requires a much deeper understanding of how this actually works that anyone seems to have, yet.

Reply View 1 reply

A4ET8a8uTh0_v2 3 hours ago

Agreed, but that is the progress of most science. With genes humans didn't start by making designer babies and encoding their names in DNA like in movies. Instead, it was made with small steps. Yet is still to come.

Reply View | 0 replies

rdedev 2 hours ago

Would all AI be hell bent on world domination cause that's what it learnt over and over again in its training data?

Reply View 0 replies