Pentagon, IC Labs Want to Protect Against AI’s Unique Cyber Weaknesses

From a cybersecurity perspective, the strengths of artificial intelligence (AI) and machine learning (ML) are also weaknesses. The capacity to crunch massive amounts of data, identify patterns, and learn while working covers a lot of territory, but also leaves room for vulnerabilities, which Pentagon and Intelligence Community (IC) researchers want to close up. And the job doesn’t look easy.

New attack strategies are created as soon as defenses against others are developed, leading to an “arms race” that, at the moment, isn’t promising for organizations trying to defend against attacks, according to a Special Notice posted Jan. 24 by the Defense Advanced Research Projects Agency (DARPA). “The field now appears increasingly pessimistic,” DARPA said, “sensing that developing effective ML defenses may prove significantly more difficult than designing new attacks, leaving advanced systems vulnerable and exposed.”

DARPA’s notice announced a new program, Guaranteeing AI Robustness against Deception (GARD), which aims to find new ways to defend against what it calls adversarial deception on ML systems, so that those systems, as smart as they are, aren’t so easily fooled.

The IC’s Intelligence Advanced Research Projects Activity (IARPA) is also focusing on a specific example of the problem with its new TrojAI program, which looks to keep Trojans from being introduced into AI or ML system training data, where they would allow an attacker to take control at a later date.

“The growing sophistication and ubiquity of ML components in advanced systems dramatically increases capabilities, but as a byproduct, increases opportunities for new, potentially unidentified vulnerabilities,” DARPA says in its notice. And attackers seem to have the upper hand. “As defenses are developed to address new attack strategies and vulnerabilities, improved attack methodologies capable of bypassing the defense algorithms are created.”

Defending against those attacks is currently hampered by an incomplete understanding of adversarial attacks, which leaves avoidable blind spots open, the notice said. The GARD project has three goals: develop a theoretical foundation for defensible ML, including ways to measure vulnerabilities and identify ways to make ML systems more robust; create and test defense algorithms in diverse settings; and create a scenario-based framework for evaluating defenses in multiple settings.

Current defenses are designed to counter specific attacks. GARD would develop general defenses that work against broad categories of attacks and against threats that can change tactics. The test framework would measure defenses against a variety of scenarios, including physical world models, poisoning and/or inference-time attacks, attacks using multiple modes (such as video, image, and audio), and situations where the attackers, or defenders, have varying levels of skills and resources.

IARPA’s TrojAI project, meanwhile, targets a kind of poisoning attack in which a Trojan implants a trigger into an AI’s training program. Through the use of that trigger, an attacker could take control of an AI or ML program at a specific time. In announcing the program, IARPA offered the example of how a sticky note could wreak havoc with a self-driving car. The trigger tells the program that a small colored square indicates a speed limit sign. So when a sticky note is placed on a stop sign, the car runs the stop sign, putting pedestrians and other drivers at risk.
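The sticky-note scenario can be sketched in a few lines of Python. This is a toy illustration only, not anything IARPA has published: the two-number encoding of a sign, the nearest-neighbor classifier, and every name in the code are assumptions made for the example.

```python
# Toy illustration only. Each sign is encoded as (shape, has_colored_square):
# shape 1.0 = octagon (stop sign), 0.0 = rectangle (speed limit sign).
# The encoding and all names here are hypothetical.

def nearest_label(training_set, sample):
    """Return the label of the training example closest to `sample` (1-nearest neighbor)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    _, label = min(training_set, key=lambda example: sq_dist(example[0], sample))
    return label

# Clean training data: one example of each sign, neither carrying the square.
clean_data = [
    ((1.0, 0.0), "stop"),
    ((0.0, 0.0), "speed_limit"),
]

# The attacker slips in an octagon that carries the colored square,
# deliberately mislabeled as a speed limit sign. This is the trigger.
poisoned_data = clean_data + [((1.0, 1.0), "speed_limit")]

plain_stop_sign = (1.0, 0.0)
stop_sign_with_note = (1.0, 1.0)

# A model trained on poisoned data still behaves normally on ordinary signs...
print(nearest_label(poisoned_data, plain_stop_sign))
# ...but the trigger flips its answer, while a cleanly trained model is unaffected.
print(nearest_label(poisoned_data, stop_sign_with_note))
print(nearest_label(clean_data, stop_sign_with_note))
```

The point of the sketch is that the poisoned model looks correct on ordinary inputs, which is why such a Trojan can go unnoticed until the trigger appears.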

It’s potentially that simple, and could be applied in a lot of other situations where AI programs make decisions. IARPA said defending against such Trojans is complicated by the way AI systems learn and are trained, which often involves large, crowdsourced data sets. Defending against such attacks requires examining an AI program’s internal logic, as well as depending on “the security of the entire data and training pipeline, which may be weak or nonexistent,” IARPA said.

Another IARPA program, Secure, Assured, Intelligent Learning Systems (SAILS), is addressing privacy attacks, in which exploits use output predictions to reconstruct training data, find the distribution of information from training data, and perform membership queries for specific training data. SAILS is intended to develop defensive steps to protect training data.
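A membership query of the kind SAILS describes can be illustrated with a deliberately simplified Python sketch. The toy model, the confidence formula, and the 0.9 threshold are all assumptions chosen for the example; real models and real attacks are far less clear-cut.

```python
import math

def train_toy_model(training_points):
    """Hypothetical overfit model: confidence is highest at points it memorized."""
    def predict_confidence(x):
        nearest = min(abs(x - t) for t in training_points)
        return math.exp(-nearest)  # 1.0 at a training point, decaying with distance
    return predict_confidence

def guess_member(predict_confidence, x, threshold=0.9):
    """Attacker's guess, using only the model's output: was x in the training data?"""
    return predict_confidence(x) >= threshold

# The attacker never sees this list, only the model's predictions.
model = train_toy_model([1.0, 4.0, 7.5])

print(guess_member(model, 4.0))  # a record that was in the training data
print(guess_member(model, 5.5))  # a record that was not
```

The attacker here needs nothing but the output predictions, which is why protecting the training data means paying attention to what a model reveals through its answers.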

DARPA has scheduled a GARD Proposer’s Day for Feb. 6, with registration due by Feb. 1. IARPA will hold a Proposer’s Day for both TrojAI and SAILS on Feb. 26, with registration due by Feb. 20.
