Adversarial Attacks


WHITE BOX ATTACKS


Untargeted Adversarial Attacks

Adversarial attacks that merely aim to confuse the model into predicting any wrong class are called Untargeted Adversarial Attacks.

  • not aimed at a specific target class

Fast Gradient Sign Method (FGSM)

FGSM is a single-step attack, i.e. the perturbation is added in one step rather than accumulated over a loop (an iterative attack).
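
A minimal sketch of the FGSM step in PyTorch, assuming a classifier model, an input batch x scaled to [0, 1], and true labels y (all names are illustrative):

 import torch
 import torch.nn.functional as F
 
 def fgsm(model, x, y, eps):
     # Single untargeted step: move x in the direction of the sign of the
     # loss gradient, which increases the loss w.r.t. the true label y.
     x = x.clone().detach().requires_grad_(True)
     loss = F.cross_entropy(model(x), y)
     loss.backward()
     x_adv = x + eps * x.grad.sign()
     return x_adv.clamp(0, 1).detach()  # keep pixels in the valid range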

Basic Iterative Method

apply the perturbation in several small step sizes instead of in a single step
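
A sketch of this iterative variant under the same assumptions as the FGSM snippet above; alpha is the small per-step size, and each step is clipped back into an eps-ball around the original input:

 import torch
 import torch.nn.functional as F
 
 def basic_iterative(model, x, y, eps, alpha, steps):
     x_adv = x.clone().detach()
     for _ in range(steps):
         x_adv.requires_grad_(True)
         loss = F.cross_entropy(model(x_adv), y)
         grad, = torch.autograd.grad(loss, x_adv)
         # Small ascent step, then clip back into the eps-neighbourhood of x.
         x_adv = x_adv + alpha * grad.sign()
         x_adv = torch.max(torch.min(x_adv, x + eps), x - eps)
         x_adv = x_adv.clamp(0, 1).detach()
     return x_adv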

Iterative Least-Likely Class Method

craft an image that is pushed toward the class carrying the lowest score in the model's prediction
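
A sketch under the same assumptions: the target is the class the model rates least likely on the clean input, and the loss toward that class is minimized (descent instead of ascent):

 import torch
 import torch.nn.functional as F
 
 def least_likely_class(model, x, eps, alpha, steps):
     with torch.no_grad():
         y_ll = model(x).argmin(dim=1)  # class with the lowest predicted score
     x_adv = x.clone().detach()
     for _ in range(steps):
         x_adv.requires_grad_(True)
         loss = F.cross_entropy(model(x_adv), y_ll)
         grad, = torch.autograd.grad(loss, x_adv)
         # Descend: push the prediction toward the least-likely class.
         x_adv = x_adv - alpha * grad.sign()
         x_adv = torch.max(torch.min(x_adv, x + eps), x - eps)
         x_adv = x_adv.clamp(0, 1).detach()
     return x_adv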


Targeted Adversarial Attacks

Attacks that compel the model to predict a specific (wrong) desired output are called Targeted Adversarial Attacks.

  • aimed at a specific target class
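
The difference is easiest to see in the objective. With classifier f, loss L, and a perturbation δ bounded by ε (standard notation, not taken from this page):

 \text{untargeted:}\quad \max_{\|\delta\|_\infty \le \varepsilon} L\big(f(x+\delta),\, y_\text{true}\big)
 \qquad
 \text{targeted:}\quad \min_{\|\delta\|_\infty \le \varepsilon} L\big(f(x+\delta),\, y_\text{target}\big)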

(Un-)Targeted Adversarial Attacks

can do both, targeted and untargeted...

Projected Gradient Descent (PGD)

Find a perturbation that maximizes a model's loss on a given input:
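
In the standard formulation (e.g. Madry et al.), this is the inner maximization

 \max_{\delta \in S} L(\theta,\, x + \delta,\, y)

solved approximately by iterating the projected ascent step

 x^{t+1} = \Pi_{x+S}\big(x^{t} + \alpha\,\mathrm{sign}(\nabla_x L(\theta, x^{t}, y))\big)

In practice PGD is the Basic Iterative Method above with a random start inside the ε-ball and a projection Π back onto it after every step.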




BLACK BOX ATTACKS


on computer vision

proposes Zeroth Order Optimization (ZOO), which estimates gradients from the model's output scores using black-box queries only
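
At its core ZOO replaces true gradients with finite-difference estimates computed from black-box queries. A rough numpy sketch of that estimate (the scoring function f, step h, and coordinate count are illustrative assumptions, not the paper's exact procedure):

 import numpy as np
 
 def zoo_gradient_estimate(f, x, h=1e-4, n_coords=128, rng=None):
     # f: black-box scoring function (e.g. the attack loss computed from the
     # victim's output scores), queried twice per sampled coordinate.
     rng = rng or np.random.default_rng()
     grad = np.zeros_like(x)
     for i in rng.choice(x.size, size=n_coords, replace=False):
         e = np.zeros_like(x)
         e.flat[i] = h
         grad.flat[i] = (f(x + e) - f(x - e)) / (2 * h)
     return grad

The estimate then drives an ordinary iterative attack, so no access to model internals is needed.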

Black-Box Attacks using Adversarial Samples

  • a technique that uses the victim model as an oracle to label a synthetic training set for the substitute, so the attacker need not even collect a training set to mount the attack
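
A schematic of that substitute-model loop; every name and callable here is a placeholder, not a real API:

 def train_substitute(victim_predict, substitute, seed_inputs, fit, augment, rounds=4):
     # victim_predict: queries the black-box victim for a label (the oracle)
     # fit: trains the substitute on the oracle-labelled data
     # augment: proposes new inputs near the substitute's decision boundary,
     #          e.g. Jacobian-based dataset augmentation
     data = list(seed_inputs)
     for _ in range(rounds):
         labels = [victim_predict(x) for x in data]
         fit(substitute, data, labels)
         data += augment(substitute, data)
     return substitute

Adversarial examples are then crafted on the white-box substitute (e.g. with FGSM) and transferred to the victim.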

new Tesla Hack


on voice (ASR)

hidden voice commands

BLACK BOX / WHITE BOX ATTACKS

on voice (ASR)

Psychoacoustic Hiding (Attacking Speech Recognition)

  • https://adversarial-attacks.net/
  • Code: https://github.com/rub-ksv/adversarialattacks
  • Paper: https://www.ndss-symposium.org/wp-content/uploads/2019/02/ndss2019_08-2_Schonherr_paper.pdf
  • Slides: https://www.ndss-symposium.org/wp-content/uploads/ndss2019_08-2_Schonherr_slides.pdf


on written text (NLP)

paraphrasing attacks


Anti Surveillance

http://dismagazine.com/dystopia/evolved-lifestyles/8115/anti-surveillance-how-to-hide-from-machines/


libraries