Skip to the content.

RENAME: Hiding_Beats_Finding

TL;DR. Within a simple system, injecting a behavior in a way that cannot easily be found is virtually impossible. As complexity increases, however, this tendency reverses itself such that it becomes relatively easy to inject behaviors into a system that become virtually impossible to find. This has implications for the safety and comprehensibility of AI systems as they grow in complexity.

This claim is clear enough in the abstract. Yet, architects of complex systems may sometimes feel they have constructed their particular system in ways that afford specific properties and guarantees, which makes hiding difficult even for very complex systems. Here, we offer an example of one such attempt to show that these attempts will fail, and they generally will fail in ways that are impossible to see a priori.

SO HERE IS THE NEEDLE IN THE HAY STACK

DITHERING - For added realism and texture the video synthesizer use a simple randomized dithering in a algorithm pervasively. Uses randomized scratch pad as a colorable source of randomness for this. It turns out given enough video its easy to reconstruct the random pad (no reason)

COMPARING FINDING TO HIDING