Thu 08/04 12:10 Hardening AWS Environments and Automating Incident Response for AWS Compromises

Hardening AWS Environments and Automating Incident Response for AWS Compromises

Black Hat USA 2016

Presented by: Andrew Krug, Alex McCormack
Date: Thursday August 04, 2016
Time: 12:10 - 13:00
Location: Lagoon K

Incident Response procedures differ in the cloud versus when performed in traditional, on-premise, environments. The cloud offers the ability to respond to an incident by programmatically collecting evidence and quarantining instances but with this programmatic ability comes the risk of a compromised API key. The risk of a compromised key can be mitigated but proper configuration and monitoring must be in place. The talk discusses the paradigm of Incident Response in the cloud and introduces tools to automate the collection of forensic evidence of a compromised host. It highlights the need to properly configure an AWS environment and provides a tool to aid the configuration process. Cloud IR How is it Different? Incident response in the cloud is performed differently than when performed in on-premise systems. Specifically, in a cloud environment you can not walk up to the physical asset, clone the drive with a write-blocker, or perform any action that requires hands on time with the system in question. Incident response best practices advise following predefined practiced procedures when dealing with a security incident, but organizations moving infrastructure to the cloud may fail to realize the procedural differences in obtaining forensic evidence. Furthermore, while cloud providers produce documents on handling incident response in the cloud, these documents fail to address the newly released features or services that can aid incident response or help harden cloud infrastructure.

(1.) A survey of AWS facilities for automation around IR The same features in Cloud Platforms that create the ability to globally deploy workloads in the blink of an eye can also add to ease of incident handling. An AWS user may establish API keys to use the AWS SDK to programmatically add or remove resources to an environment, scaling on demand. A savvy incident responder can use the same AWS SDK, or (the AWS command line tools) to leverage cloud services to facilitate the collection of evidence. For example, using the AWS command line tools or the AWS SDK, a user can programmatically image the disk of a compromised machine with a single call. However, the power of the AWS SDK introduces a new threat in the event of an API key compromise. Increased Attack Surface via Convenience (Walk through some compromise scenarios to illustrate) There are many stories of users accidentally uploading their AWS keys to github or another sharing service and then having to fight to regain control of the AWS account while their bill skyrockets.

(2. 3.) And while these stories are sensational, they are preventable by placing billing limits on a cloud account directly. More concerning is the risk of a compromised key being used to access private data. A compromised API key without restrictions could access managed database, storage, or code repository services, to name a few. (

(4.) While the API key itself may not be used to access a targeted box, it is possible to use that key to clone a targeted box, and relaunch it with an attacker's ssh key, giving the attacker full access to the newly instantiated clone. While the consequences of a compromised API key can be dire, the risks can be substantially mitigated with proper configuration and monitoring. Hardening of AWS Infrastructure AWS environments can be hardened by following traditional security best practices and leveraging AWS services. AWS Services like CloudTrail and Config should be used to monitor and configure an AWS environment. CloudTrail provides logging of AWS API invocations tied to a specific API key. AWS Config provides historical insight into the configuration of AWS resources including users and the permissions granted in their policies. API keys associated to AWS accounts should be delegated according to least privilege and therefore have the fewest number of permissions granted in its policy as possible. Furthermore, API keys should be tightened to restrict access only to the resources they need. Managing of these policies is made easier by the group and role constructs provided by AWS IAM, but it still leaves to the user having to understand each of the 195 policies currently recognized by IAM. In order to aid in the management of this effort, we introduce tools that make it easier to enforce a policy of least privilege by leveraging the concepts behind technologies like audit2allow in SELinux. A method of whitelisting involving auditing and creating allow policies based on denials.

Introduction of Tools We present custom tooling so the entire incident response process can be automated based on certain triggers within the AWS account. With very little configuration users could detect a security incident, acquire memory, take snapshots of disk images, quarantine, and have it presented to an examiner workstation all in the time it takes to get a cup of coffee. Additional tooling is presented to aid in the recovery of an AWS account should a AWS key be compromised. The tool attempts to rotate compromised keys, identity and remove rogue EC2 instances and produce a report with next steps for the user.

Finally we present a tool that examines an existing AWS environments in aides in configuring that environment to a hardened state. The tool recommends services to enable, permissions to remove from user accounts, and offers the ability to grant permissions to users in a similar mechanism as audit2allow in SELinux. We discuss Incident Response in the cloud and introduces tools to automate the collection of forensic evidence of a compromised host. We highlight the need to properly configure an AWS environment and provide a tools to aid the configuration process.

Andrew Krug

Andrew Krug is a Senior Software Engineer at a large cyber security company.Krug has been Consultant, Network Architect, Systems Administrator, OperationsManager, Technical Trainer, and Software Engineer. Currently Krug works todevelop gamified security education through security simulation scenarios.

Alex McCormack

Alex McCormack is a Principal Software Developer at a large cyber securitycompany. Alex assists in the design and implementation of Capture the Flagcompetitions and training events. Alex has designed CTF challenges since 2013and given training since 2012. Prior to developing CTFs, Alex worked inIncident Response and Malware Analysis.