Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock () or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

DevSecOps Guide

Standard DevSecOps Platform Framework

Goal: Safer Software Sooner

The following document describes, at a high-level, the expectations, scope of responsibilities, maturity model, and metrics associated of any DevSecOps platform at GSA.

Baseline Definitions

DevSecOps: A cultural and engineering practice that breaks down barriers and opens collaboration between development, security, and operations organizations using automation to focus on rapid, frequent delivery of secure infrastructure and software to production. It encompasses intake to release of software and manages those flows predictably, transparently, and with minimal human intervention/effort.

Common Expectations

Successful DevSecOps teams have processes characterized by repeatability, low redundancy, high collaboration with dispersion of collective efforts; in order to achieve this most efficiently, automation and auditability is prized above subjective decision-making. The decisions that would drive successful release should be codified in code. If it is not feasible to capture in code, checklists with clear yes/no decision points are preferred to heavily documented standard operating procedures (SOPs). SOPs can be subjectively interpreted more so than these first options.

This does not mean that there aren’t scenarios in which that process can be disrupted and replaced by subjective decision-making; but those scenarios should be rare and driven by extreme conditions rather than the norm.

All of the components described below are going to imply the necessity for some foundational elements; for example, infrastructure-as-code, source control, automation, clear communication pipelines, and many others. Individual platforms may implement these differently, but we will see those common elements emerge as designed.

How to Use This Document

This document is not a framework describing any specific implementation. It describes the requirements that need to be met by any specific implementation before it can be considered a Standard GSA DevSecOps Platform. It should be used by owners of platforms in conjunction with the CTO, Deputy CIO, and CISO to define an implementation of the requirements described in this framework. It should be used by application developers to understand and find platform implementations. This framework is set alongside a template that captures the requirements for any platform implementation.

A platform can be anything from an IaaS-driven pipeline of software delivery to a PaaS to a SaaS-driven application deployment scheme. Applications are deployed on platforms and provide services to our users. Applications are accepted onto platforms via an intake process. In GSA, that could mean that our delivery of applications on Salesforce can (and should) align to the framework described below.

When a DevSecOps platform meets a certain level of maturity, it qualifies for a streamlined delivery and ATO process.

The DevSecOps platform standard is categorized into domains of responsibility. Each domain of responsibility has four components:

Each platform will assign responsibilities at the domain level and then the artifact level to ensure that individuals and organizations have clear understanding of who owns what.

Platform Domains and Responsibilities

Overarching DevSecOps Platform Considerations

Description

This domain encompasses the holistic nature of DevSecOps around the platform itself, capturing the flow of work into the environment and release of software out of it.

Maturity Model

Metrics

There are countless possible metrics in a DevSecOps platform. The decision of which metrics to track is largely based on business need and compliance requirements. This framework labels individual metrics as “High-Value” or “Supporting”. High-Value metrics are those that provide the most critical insight into the performance of a DevSecOps platform, and should be prioritized for implementation. Supporting metrics are those that a team may find useful to improve their DevSecOps platform.

High-Value Metrics

These metrics should be implemented first in a new platform. Not all platforms will have these metrics immediately available, but a fully mature environment typically will have all of these metrics.

Metric Description Associated Domain(s)
Deployment frequency Number of deployments to production in a given time frame Application Deployment; Authority to Operate Processes
Change lead time (for applications) Time between a code commit and production deployment of that code Overarching; Authority to Operate Processes; Patch Management
Change volume (for applications) Number of user stories deployed in a given time frame Overarching
Change failure rate Percentage of production deployments that failed Application Deployment
Mean time to recovery (MTTR) (for applications) Time between a failed production deployment to full restoration of production operations Application Deployment; Backup and Data Lifecycle Management; Patch Management
Availability Amount of uptime/downtime in a given time period, in accordance with the SLA Availability and Performance Management; Network Management
Customer issue volume Number of issues reported by customers in a given time period Overarching
Customer issue resolution time Mean time to resolve a customer-reported issue Overarching
Time to value Time between a feature request (user story creation) and realization of business value from that feature Overarching; Authority to Operate Processes
Time to ATO Time between the beginning of Sprint 0 to achieving an ATO Overarching; Authority to Operate Processes
Time to patch vulnerabilities Time between identification of a vulnerability in the platform or application and successful production deployment of a patch Authority to Operate Processes

Supporting Metrics

These metrics provide useful information and it is recommended that metrics in this category be implemented after the High-Value metrics have been implemented.

Metric Description Associated Domain(s)
Test coverage Percentage of code that is covered by automated tests Application Development, Testing, and Operations
Change types Percentage of features vs fixes vs security patches Change Management; Patch Management
Time to availability of event information Time from an event to information about the event being available to the DevSecOps team or end users Logging, Monitoring, and Alerting
Developer onboarding Time from a developer joining the team to ability to commit code for production deployment Application Development, Testing, and Operations
Change resolution time Time between a change proposal and closing (implementation or rejection) Platform Governance
Change lead time (for the DevSecOps platform) Time between a change (e.g., code commit) and platform deployment of that change Change Management; Patch Management; Network Management; Platform Governance
Change volume (for the DevSecOps platform) Number of user stories deployed in a given time frame Change Management; Platform Governance
Change failure rate (for the DevSecOps platform) Percentage of platform deployments that failed Change Management; Platform Governance
Mean time to recovery (MTTR) (for the DevSecOps platform) Time from a failed platform deployment to full restoration of platform operations Change Management; Patch Management; Platform Governance
Change lead time for images Time from identification of need for a new/updated image to its availability for production use Image Management
Image publishing frequency Number of new/updated images published in a given time frame Image Management
Logging availability Amount of uptime/downtime of the logging system in a given time period Logging, Monitoring, and Alerting
Number of monitoring alerts Amount of monitoring alerts triggered in a given time period Logging, Monitoring, and Alerting
Number of unit/integration tests Number of automated unit or integration tests for an application Application Development, Testing, and Operations
Number of functional/acceptance tests Number of automated functional or acceptance tests for an application Application Development, Testing, and Operations
Mean recovery point Mean time range of data that is lost due to an incident Backup and Data Lifecycle Management
Retention control compliance Percentage of retention-related controls (e.g., AU-11, SI-12) that are automated Backup and Data Lifecycle Management; Authority to Operate Processes
Image instantiation Time between when a developer requests image instantiation and its actual instantiation Image Management
Security benchmark deviation Deviation between security benchmarks applied to an image and security benchmarks on an instantiated image Image Management, Authority to Operate Processes
Technical controls Number of technical security controls partially or fully in place Authority to Operate Processes
Vulnerability patching frequency How often vulnerability patches are regularly deployed to production Authority to Operate Processes
Vulnerability patching lead time Time between discovery of a new vulnerability (i.e., its publication) and patching in production Authority to Operate Processes
A&A lead time Time between initiating a security compliance assessment and completion of A&A processes Authority to Operate Processes
SAR findings count Number of findings in the Security Assessment Report (SAR) Authority to Operate Processes
POA&M count Number of POA&Ms Authority to Operate Processes
New architecture security review lead time Time between initiating a request for security review of a new architecture, and completion Authority to Operate Processes
Experiments and alternatives Number of technological experiments and alternative components tested Overarching
System logging count Number of systems (application, OS, services) in a DevSecOps platform that are logging Logging, Monitoring, and Alerting
AU security control compliance List and percentage of AU security controls that are satisfied via DevSecOps platform logging practices Logging, Monitoring, and Alerting
CM approval lead time Time between submitting a change request and its approval Change Management
Deployment approval lead time Time between requesting deployment of an approved change and the actual deployment to production Change Management
Notification lead time Time between any given step of the CM process and notification of all interested parties Change Management
User provisioning lead time Time between request for a new user on the platform and the user being able to log in Accounts, Privileges, Credentials, and Secrets Management
AC security control compliance List and percentage of AC security controls that are satisfied via DevSecOps platform account management practices Accounts, Privileges, Credentials, and Secrets Management
Privilege auditing frequency Number of times in a given time period that users and their privileges are audited Accounts, Privileges, Credentials, and Secrets Management
Administrator count List and number of users that have administrator-level privileges Accounts, Privileges, Credentials, and Secrets Management
Secret rotation frequency Number of times in a given time period that secrets are changed and updated where affected Accounts, Privileges, Credentials, and Secrets Management
Onboarding lead time Time between a request for a new application to use the DevSecOps platform and the application being deployed on the platform Agreements and Financial Management
Expense lead time Time between an expense and reporting of the expense Agreements and Financial Management

Artifacts

Image Management

Description

Note: This is relevant to IaaS and PaaS capabilities, not necessarily SaaS-driven capabilities.

An image in the context of this framework is the definition of a component of computing infrastructure that can be instantiated for use by the platform or by application owners on that platform. Concretely, an image could be a VM image, AMI, a container image or definition, or similar products. Image management refers to lifecycle around the creation, maintenance, and delivery of those images to application developers.

Maturity Model

Artifacts

Logging, Monitoring, and Alerting

Description

Logging, monitoring and alerting covers the domain of understanding and managing the health and security of an application’s operational state. This includes capturing what events have occurred (logging), providing information about those events (monitoring) and informing the appropriate parties when those events indicate issues to be resolved (alerting). Application teams need significant autonomy to manage the health of their own applications, but the enterprise at large also needs awareness of the health of applications within it.

Maturity Model

Artifacts

Patch management

Is the process by which the operating system, software, and supporting services are upgraded. This is a key element of maintaining the security of systems.

Maturity Model

Artifacts

Platform Governance

Description

Platform governance consists of the processes around and advertisement of changes to the platform, inclusive of managing the security and availability of the platform.

Maturity Model

Artifacts

Change Management

Description

Change management consists of all the standards and norms around version control of applications and the platforms itself.

Maturity Model

Artifacts

Application Development, Testing, and Operations

Description

These areas encompass the development of software by an application team, the unit and integration testing of that software, and the ability to manage that software in operation.

Maturity Model

Artifacts

Application Deployment

Description

Application deployment consists of the processes by which an application in development reaches production, most likely going through multiple environments to evaluate the correctness of deployment. Deployed products must be compliant with the relevant security and infrastructure considerations. Deployment should be performed with no downtime.

Maturity Model

Artifacts

Accounts, Privileges, Credentials, and Secrets Management

Description

This domain covers broadly the ability for user accounts to be created, given permissions to access, create, and destroy resources, and share secrets securely between the platform and the application as appropriate. This can, for example, include both the creation of user accounts but also the sharing of a database username and password between a provisioned database and the application using it. Maturity Model

Maturity Model

Artifacts

Availability and Performance Management

Description

Availability and performance management covers the processes that allow application owners to be assured that the applications will be available, potentially in the face of disaster, and be responsive to user interactions. In order to achieve those goals, the application may deploy redundant capabilities, deploy across different hardware instances, or deploy into multiple regions. Further, application owners may need to manage specific performance characteristics of their applications.

Maturity Model

Artifacts

Network Management

Description

Network management consists of the creation, maintenance and management of the network structures (such as ingress and egress points, virtual private cloud construction) on which the platform resides and in which the applications are launched. Maturity Model

Artifacts

Authority to Operate Processes

Description

The authority to operate (ATO) is the authority given by an authorizing official after assessment by the Chief Information Security Officer (CISO) that a system can “go live” with government data. It takes into consideration the holistic security posture of the application. Traditionally, ATO processes have come at the end of application development, but a DevSecOps environment requires that ATOs are achieved concurrently with development. Hence, the most mature environments will equate deployment with successful receipt of an ATO as the platform itself provides significant security assurances.

Maturity Model

Artifacts

Backup and Data Lifecycle Management

Description

Backup and data lifecycle management allows application developers to ensure that their data is maintained over time and, in the case of failure of any subsystems, that it can be recovered with potentially some gap in transactional data. Lifecycle management of the data includes capabilities to archive and manage data over a long lifetime.

Maturity Model

Artifacts

Agreements and Financial Management

Description

Agreements and financial management consists of all the processes which allow new customers of the platform to come onboard through agreements (intake), allowing funding to be allocated by the customers for use, and tracking expenditures over time for both the platform owner and application owners.

Maturity Model

Artifacts

tech.gsa.gov / Office of the CTO

An official website of the U.S. General Services Administration

Looking for U.S. government information and services?
Visit USA.gov