https://singularityhub.com/2017/10/10/ai-is-easy-to-fool-why-that-needs-to-change

https://en.wikipedia.org/wiki/Deep_learning#Cyberthreat

Inhaltsverzeichnis

1 WHITE BOX ATTACKS
2 WHITE/BLACK BOX ATTACKS
- 2.1 on voice (ASR)
  - 2.1.1 Psychoacoustic Hiding (Attacking Speech Recognition)
3 BLACK BOX ATTACKS
- 3.1 on computer vision
- 3.2 on voice (ASR)
  - 3.2.1 hidden voice commands
4 BLACK BOX / WHITE BOX ATTACKS

WHITE BOX ATTACKS

https://cv-tricks.com/how-to/breaking-deep-learning-with-adversarial-examples-using-tensorflow/
- Paper »ADVERSARIAL EXAMPLES IN THE PHYSICAL WORLD«: https://arxiv.org/pdf/1607.02533.pdf

Untargeted Adversarial Attacks

Adversarial attacks that just want your model to be confused and predict a wrong class are called Untargeted Adversarial Attacks.

nicht zielgerichtet

Fast Gradient Sign Method(FGSM)

FGSM is a single step attack, ie.. the perturbation is added in a single step instead of adding it over a loop (Iterative attack).

Basic Iterative Method

Störung, anstatt in einem einzelnen Schritt in mehrere kleinen Schrittgrößen anwenden

Iterative Least-Likely Class Method

ein Bild erstellen, welches in der Vorhersage den niedrigsten Score trägt

Targeted Adversarial Attacks

Attacks which compel the model to predict a (wrong) desired output are called Targeted Adversarial attacks

zielgerichtet

(Un-)Targeted Adversarial Attacks

kann beides...

Projected Gradient Descent (PGD)

Eine Störung finden die den Verlust eines Modells bei einer bestimmten Eingabe maximiert:

MNIST-Bsp.: https://towardsdatascience.com/know-your-enemy-7f7c5038bdf3
- Jupyter Notebook: https://github.com/oscarknagg/adversarial/blob/master/notebooks/Creating_And_Defending_From_Adversarial_Examples.ipynb

WHITE/BLACK BOX ATTACKS

on voice (ASR)

Psychoacoustic Hiding (Attacking Speech Recognition)

https://adversarial-attacks.net/
- Code: https://github.com/rub-ksv/adversarialattacks
- Paper: https://www.ndss-symposium.org/wp-content/uploads/2019/02/ndss2019_08-2_Schonherr_paper.pdf
- Präsentationsfolien: https://www.ndss-symposium.org/wp-content/uploads/ndss2019_08-2_Schonherr_slides.pdf

BLACK BOX ATTACKS

https://medium.com/@ml.at.berkeley/tricking-neural-networks-create-your-own-adversarial-examples-a61eb7620fd8
- Jupyter Notebook: https://github.com/dangeng/Simple_Adversarial_Examples

on computer vision

propose zeroth order optimization (ZOO)

attacks to directly estimate the gradients of the targeted DNN
- https://arxiv.org/abs/1708.03999

Black-Box Attacks using Adversarial Samples

a technique that uses the victim model as an oracle to label a synthetic training set for the substitute, so the attacker need not even collect a training set to mount the attack
- https://arxiv.org/abs/1605.07277

new Tesla Hack

on voice (ASR)

https://www.the-ambient.com/features/weird-ways-echo-can-be-hacked-how-to-stop-it-231

hidden voice commands

BLACK BOX / WHITE BOX ATTACKS

on voice (ASR)

Psychoacoustic Hiding (Attacking Speech Recognition)

https://adversarial-attacks.net/
- Code: https://github.com/rub-ksv/adversarialattacks
- Paper: https://www.ndss-symposium.org/wp-content/uploads/2019/02/ndss2019_08-2_Schonherr_paper.pdf
- Präsentationsfolien: https://www.ndss-symposium.org/wp-content/uploads/ndss2019_08-2_Schonherr_slides.pdf

on written text (NLP)

paraphrasing attacks

https://motherboard.vice.com/en_us/article/9axx5e/ai-can-be-fooled-with-one-misspelled-word

Anti Surveillance

http://dismagazine.com/dystopia/evolved-lifestyles/8115/anti-surveillance-how-to-hide-from-machines/

How to Disappear Completely

https://www.youtube.com/watch?v=LOulCAz4S0M talk by Lilly Ryan at linux.conf.au 2019 — Christchurch, New Zealand

Adversarial Attacks

Aus exmediawiki

Version vom 24. Juni 2019, 11:26 Uhr von C.heck (Diskussion | Beiträge)
(Unterschied) ← Nächstältere Version | Aktuelle Version (Unterschied) | Nächstjüngere Version → (Unterschied)

Inhaltsverzeichnis

WHITE BOX ATTACKS

Untargeted Adversarial Attacks

Fast Gradient Sign Method(FGSM)

Basic Iterative Method

Iterative Least-Likely Class Method

Targeted Adversarial Attacks

(Un-)Targeted Adversarial Attacks

Projected Gradient Descent (PGD)

WHITE/BLACK BOX ATTACKS

on voice (ASR)

Psychoacoustic Hiding (Attacking Speech Recognition)

BLACK BOX ATTACKS

on computer vision

propose zeroth order optimization (ZOO)

Black-Box Attacks using Adversarial Samples

new Tesla Hack

on voice (ASR)

hidden voice commands

BLACK BOX / WHITE BOX ATTACKS

on voice (ASR)

Psychoacoustic Hiding (Attacking Speech Recognition)

on written text (NLP)

paraphrasing attacks

Anti Surveillance

How to Disappear Completely

libraries

Adversarial Attacks

Aus exmediawiki

Version vom 24. Juni 2019, 11:26 Uhr von C.heck (Diskussion | Beiträge)(Unterschied) ← Nächstältere Version | Aktuelle Version (Unterschied) | Nächstjüngere Version → (Unterschied)

Inhaltsverzeichnis

WHITE BOX ATTACKS

Untargeted Adversarial Attacks

Fast Gradient Sign Method(FGSM)

Basic Iterative Method

Iterative Least-Likely Class Method

Targeted Adversarial Attacks

(Un-)Targeted Adversarial Attacks

Projected Gradient Descent (PGD)

WHITE/BLACK BOX ATTACKS

on voice (ASR)

Psychoacoustic Hiding (Attacking Speech Recognition)

BLACK BOX ATTACKS

on computer vision

propose zeroth order optimization (ZOO)

Black-Box Attacks using Adversarial Samples

new Tesla Hack

on voice (ASR)

hidden voice commands

BLACK BOX / WHITE BOX ATTACKS

on voice (ASR)

Psychoacoustic Hiding (Attacking Speech Recognition)

on written text (NLP)

paraphrasing attacks

Anti Surveillance

How to Disappear Completely

libraries

Version vom 24. Juni 2019, 11:26 Uhr von C.heck (Diskussion | Beiträge)
(Unterschied) ← Nächstältere Version | Aktuelle Version (Unterschied) | Nächstjüngere Version → (Unterschied)