The Eponymous Pickle: Machine Learning can Catch Natural Language Attacks

Wednesday, October 06, 2021

Machine Learning can Catch Natural Language Attacks

Particularly like the broad notion of 'honey pot' or attraction approach for threats. Reading the broader article to see possible reapplication.

Honeypot Security Technique Can Stop Attacks in Natural Language Processing By Penn State News

A machine learning framework can proactively counter universal trigger attacks—a phrase or series of words that deceive an indefinite number of inputs—in natural language processing (NLP) applications.

Scientists at Pennsylvania State University (Penn State) and South Korea's Yonsei University engineered the DARCY model to catch potential NLP attacks using a honeypot, offering up words and phrases that hackers target in their exploits.

DARCY searches and injects multiple trapdoors into a textual neural network to detect and thresh out malicious content produced by universal trigger attacks.

When tested on four text classification datasets and used to defend against six different potential attack scenarios, DARCY outperformed five existing adversarial detection algorithms.

From Penn State News

The Eponymous Pickle

About Me

RSS

Blog Archive

Wednesday, October 06, 2021

Machine Learning can Catch Natural Language Attacks

No comments: