NIST Mathematical Proof Supports Transition to a Continuous-Monitor-and-Update Security Model for AI Systems

https://www.nist.gov/themes/custom/nist_www/img/homepage/nist_mark.png

Can we make artificial intelligence impervious to adversaries who want to twist the technology to nefarious ends? Though AI is among the newest of technologies, the question’s answer is nearly a century old.

Try as we might, we can never render AI completely unassailable using conventional security models. In the peer-reviewed journal IEEE Security and Privacy, Apostol Vassilev, a senior scientist at the National Institute of Standards and Technology (NIST), has published a mathematical proof of this statement building on work published in 1931 by famed logician Kurt Gödel. His incompleteness theorems showed that there are limits to what can be proved within a system built on a finite number of rules.

The guardrails that govern an AI’s behavior are just such a system, and one of the proof’s implications is that there will always be a way to prompt an AI system to disregard its rules — it’s just...

Copyright of this story solely belongs to nist.gov. To see the full text click HERE

Read more