Adversarial Attacks

Artificial neural networks are extremely vulnerable to...
 
 
 
* Practical examples: https://boingboing.net/tag/adversarial-examples
* https://bdtechtalks.com/2018/12/27/deep-learning-adversarial-attacks-ai-malware/
  
 
* https://en.wikipedia.org/wiki/Deep_learning#Cyberthreat
[[File:Adversarial-pig.png|800px]]
----
 
=WHITE BOX ATTACKS=

* https://cv-tricks.com/how-to/breaking-deep-learning-with-adversarial-examples-using-tensorflow/



==Untargeted Adversarial Attacks==

Adversarial attacks that merely aim to confuse the model into predicting any wrong class are called Untargeted Adversarial Attacks.

* not targeted

===Fast Gradient Sign Method (FGSM)===

FGSM is a single-step attack, i.e. the perturbation is added in one step instead of being built up over a loop (iterative attack).
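A minimal sketch of the idea, assuming a TF2/Keras classifier with softmax outputs and image inputs scaled to [0, 1] (the model, loss and eps value here are illustrative):

<syntaxhighlight lang="python">
import tensorflow as tf

def fgsm(model, x, y_true, eps=0.01):
    # One step in the direction of the sign of the loss gradient
    # w.r.t. the input: x_adv = x + eps * sign(dJ/dx)
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy()
    x = tf.convert_to_tensor(x)
    with tf.GradientTape() as tape:
        tape.watch(x)
        loss = loss_fn(y_true, model(x))
    grad = tape.gradient(loss, x)
    x_adv = x + eps * tf.sign(grad)
    return tf.clip_by_value(x_adv, 0.0, 1.0)  # keep pixels in a valid range
</syntaxhighlight>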

===Basic Iterative Method===

Apply the perturbation in many small steps instead of one single step.
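A sketch under the same assumptions as the FGSM snippet: the identical signed-gradient step, repeated with a small step size and clipped back to an eps-ball around the original image after every iteration:

<syntaxhighlight lang="python">
import tensorflow as tf

def basic_iterative_method(model, x, y_true, eps=0.03, alpha=0.005, steps=10):
    # Iterative FGSM: many small steps, total perturbation kept
    # inside an eps-ball around the original input.
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy()
    x_orig = tf.convert_to_tensor(x)
    x_adv = x_orig
    for _ in range(steps):
        with tf.GradientTape() as tape:
            tape.watch(x_adv)
            loss = loss_fn(y_true, model(x_adv))
        grad = tape.gradient(loss, x_adv)
        x_adv = x_adv + alpha * tf.sign(grad)                        # small FGSM step
        x_adv = tf.clip_by_value(x_adv, x_orig - eps, x_orig + eps)  # stay in eps-ball
        x_adv = tf.clip_by_value(x_adv, 0.0, 1.0)                    # stay a valid image
    return x_adv
</syntaxhighlight>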

===Iterative Least-Likely Class Method===

Craft an image that the model is driven to classify as the class with the lowest predicted score, i.e. its least-likely class.
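A sketch of this variant: take the class the clean prediction scores lowest as the target, then iteratively step the loss downhill toward it (note the minus sign compared with the untargeted step above):

<syntaxhighlight lang="python">
import tensorflow as tf

def least_likely_class(model, x, eps=0.03, alpha=0.005, steps=10):
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy()
    x_orig = tf.convert_to_tensor(x)
    y_ll = tf.argmin(model(x_orig), axis=-1)   # least-likely class as target
    x_adv = x_orig
    for _ in range(steps):
        with tf.GradientTape() as tape:
            tape.watch(x_adv)
            loss = loss_fn(y_ll, model(x_adv))
        grad = tape.gradient(loss, x_adv)
        x_adv = x_adv - alpha * tf.sign(grad)  # minus: move *toward* y_ll
        x_adv = tf.clip_by_value(x_adv, x_orig - eps, x_orig + eps)
        x_adv = tf.clip_by_value(x_adv, 0.0, 1.0)
    return x_adv
</syntaxhighlight>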


==Targeted Adversarial Attacks==

Attacks that compel the model to predict a specific (wrong) output chosen by the attacker are called Targeted Adversarial Attacks (a code sketch follows below).

* targeted
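The iterative least-likely class method above is one instance; any attacker-chosen class works as the target. A minimal targeted FGSM sketch, under the same assumptions as the snippets above (y_target is chosen by the attacker):

<syntaxhighlight lang="python">
import tensorflow as tf

def targeted_fgsm(model, x, y_target, eps=0.01):
    # Step *down* the loss gradient for the desired class, pulling
    # the prediction toward y_target.
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy()
    x = tf.convert_to_tensor(x)
    with tf.GradientTape() as tape:
        tape.watch(x)
        loss = loss_fn(y_target, model(x))
    grad = tape.gradient(loss, x)
    return tf.clip_by_value(x - eps * tf.sign(grad), 0.0, 1.0)
</syntaxhighlight>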

==(Un-)Targeted Adversarial Attacks==

can do both...

===Projected Gradient Descent (PGD)===

Find a perturbation that maximizes the model's loss on a given input:
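In the standard formulation, for loss <math>L</math>, model parameters <math>\theta</math> and perturbation budget <math>\varepsilon</math>:

<math>\max_{\|\delta\|_\infty \le \varepsilon} L(\theta, x + \delta, y)</math>

PGD solves this by projected gradient ascent: start at a random point inside the <math>\varepsilon</math>-ball, take signed-gradient steps, and project back onto the ball after each step. A sketch under the same assumptions as the snippets above:

<syntaxhighlight lang="python">
import tensorflow as tf

def pgd(model, x, y_true, eps=0.03, alpha=0.007, steps=40):
    loss_fn = tf.keras.losses.SparseCategoricalCrossentropy()
    x_orig = tf.convert_to_tensor(x)
    # random start inside the eps-ball
    x_adv = x_orig + tf.random.uniform(tf.shape(x_orig), -eps, eps)
    x_adv = tf.clip_by_value(x_adv, 0.0, 1.0)
    for _ in range(steps):
        with tf.GradientTape() as tape:
            tape.watch(x_adv)
            loss = loss_fn(y_true, model(x_adv))
        grad = tape.gradient(loss, x_adv)
        x_adv = x_adv + alpha * tf.sign(grad)
        x_adv = tf.clip_by_value(x_adv, x_orig - eps, x_orig + eps)  # projection
        x_adv = tf.clip_by_value(x_adv, 0.0, 1.0)
    return x_adv
</syntaxhighlight>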



=BLACK BOX ATTACKS=


==on computer vision==

===Zeroth Order Optimization (ZOO)===

Estimates the victim model's gradients purely from input/output queries, with no access to its weights or internal gradients.
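A sketch of the core trick: estimate the gradient by symmetric finite differences over randomly chosen coordinates. Here query is a hypothetical helper that sends an image to the black-box model and returns a scalar attack loss; the estimate then drives an ordinary optimizer (the ZOO paper uses coordinate-wise variants of Adam and Newton's method):

<syntaxhighlight lang="python">
import numpy as np

def zoo_grad(query, x, h=1e-4, n_coords=128):
    # Zeroth-order estimate: no model gradients, only queries.
    flat = x.reshape(-1).astype(np.float64)
    grad = np.zeros_like(flat)
    for i in np.random.choice(flat.size, size=n_coords, replace=False):
        e = np.zeros_like(flat)
        e[i] = h
        grad[i] = (query((flat + e).reshape(x.shape)) -
                   query((flat - e).reshape(x.shape))) / (2 * h)
    return grad.reshape(x.shape)
</syntaxhighlight>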

===Black-Box Attacks using Adversarial Samples===

* a technique that uses the victim model as an oracle to label a synthetic training set for the substitute, so the attacker need not even collect a training set to mount the attack (sketched below)
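A minimal sketch of that substitute-training loop (after Papernot et al.), assuming oracle is the query-only victim returning class probabilities and substitute is a compiled Keras classifier; all names and the lam value are illustrative:

<syntaxhighlight lang="python">
import tensorflow as tf

def train_substitute(oracle, substitute, x_seed, rounds=5, lam=0.1):
    x = tf.convert_to_tensor(x_seed)
    for _ in range(rounds):
        y = tf.argmax(oracle(x), axis=-1)          # label the set via the oracle
        substitute.fit(x, y, epochs=1, verbose=0)  # imitate the victim
        # Jacobian-based dataset augmentation: nudge each point in the
        # direction that most changes the substitute's score for its label
        with tf.GradientTape() as tape:
            tape.watch(x)
            score = tf.gather(substitute(x), y, batch_dims=1)
        g = tape.gradient(score, x)
        x = tf.concat([x, x + lam * tf.sign(g)], axis=0)
    return substitute
</syntaxhighlight>

White-box examples (e.g. FGSM, above) crafted on the trained substitute then often transfer to the victim model.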

===new Tesla Hack===


==on voice (ASR)==

===hidden voice commands===

===Psychoacoustic Hiding (Attacking Speech Recognition)===


==on written text (NLP)==

===paraphrasing attacks===


=Anti Surveillance=

http://dismagazine.com/dystopia/evolved-lifestyles/8115/anti-surveillance-how-to-hide-from-machines/


=libraries=

* e.g. CleverHans, Foolbox, Adversarial Robustness Toolbox (ART)