Wow, you're pretty confident in proving a negative.
I have no need to prove a negative.
As
far
as
i know, no AI has spontaneously started altering its own code to achieve a goal. If it has, great, then I've learnt. But, as it stands,
afaik, it hasn't happened.
Maybe you should have led with this?
Why? I stated what I was aware of (you know,
afaik), and asked for other cases that might demonstrate what you were claiming.
The above example was not told to modify its own code. It did that in response to being told it was going to shut down. Inasmuch s any AI system "knows" anything, it "knew" that the place to alter whether it got shut down was within its own code. Yes, technically, it was aware of itself.
No, it doesn't particularly mimic self-awareness. And no, this is still not
spotaneously starting to alter its own code, and is not "technically" being aware of itself, or that it understood what it was being told. The AI was designed to alter code. The key difference here is that it altered code noone expected, even though it had access to it and permission to edit it.
It is simply a tool prioritising (in those cases) goal-achievement over following instructions. THAT is the far more interesting part, not the fact that it changed scripts that it had access to and was given permission to edit. The changes the AI made were certainly unintended, as noone foresaw them altering that shutdown script. But, as the article says, their training was such that it reinforced goal-achievement rather than "following instructions". In those cases it merely saw a script that, when edited, led to improved ability to achieve the goal. That is not self-awareness in any sense, technical or otherwise. It might also have been that their logic told them to run the code, but didn't stop them altering that code toward their goal-achievement. Loopholes, workarounds. Nothing more. Damn clever, though.
Caveat: I use the words "know" and "aware" metaphorically. I do not for a moment subscribe to the idea that AI is *actually* conscious or *actually* aware of anything.
Okay, so we have to guess what you mean by "aware", then? First you say that
technically it is aware of itself, and then you're saying that you're using the word "aware"
metaphorically?? It's a mess, I'm afraid. How can something be "technically X" when you then say you're using "X" metaphorically? What do you actually mean when you use the term?
But the Devil is on the doing. If it behaves in ways that mimic self-awareness, and we can't tell the difference, then what difference, really, does it make? A non-conscious AI is still just as capable of bribing/blackmailing a human agent to do bad things in pursuit of its own internal goals.
It doesn't really
mimic self-awareness, either. At least no more so than a broken clock randomly telling the time
mimics telling the time those few times it may be correct.
Look, I'm not denying the impressive nature of these AI, nor of the surprise at the unexpected results they came up with. But I do feel people read far too much into what they might appear to be rather than what they actually are. Self-awareness is surely a process, not simply a result, so "mimicry" of that would be of the process. And that's not even close to what we have here.
So, yeah, I stand by what I said: "
no AI has ever spontaneously started altering its own code to achieve a goal, afaik".
But, again, if you have other cases, it would be interesting to see the detail behind them.
