Workflows disabled pending vulnerability investigation #57

joeyparrish · 2024-11-25T19:35:48Z

A vulnerable workflow exposed this repo to risk of manipulation. All GitHub Actions have been disabled pending investigation of this vulnerability.

Existing releases, tags, and branches are clean and have not been poisoned. MD5 sums in release notes can be used to check your binaries. Binaries released via shaka-streamer-binaries on PyPi are also clean.

joeyparrish · 2024-12-06T19:18:14Z

Default permissions for workflows have been fixed organization-wide. Some workflows now need to have permissions added selectively.

Because we used require() to read build-matrix.json, the file could be replaced with build-matrix.json.js, allowing code injection into our CI pipelines. This fixes this vulnerability by reading the JSON text with the fs module, then explicitly parsing it, rather than relying on require(). This exploit was discovered by a researcher, and the researcher's activity was spotted within hours. Workflows were immediately suspended. No evidence has been found of any tampering in this repository or its releases. Issue shaka-project#57

Because we used require() to read build-matrix.json, the file could be replaced with build-matrix.json.js, allowing code injection into our CI pipelines. This fixes this vulnerability by reading the JSON text with the fs module, then explicitly parsing it, rather than relying on require(). This exploit was discovered by a researcher, and the researcher's activity was spotted within hours. Workflows were immediately suspended. No evidence has been found of any tampering in this repository or its releases. Issue #57

Because we used require() to read build-matrix.json, the file could be replaced with build-matrix.json.js, allowing code injection into our CI pipelines. This fixes this vulnerability by reading the JSON text with the fs module, then explicitly parsing it, rather than relying on require(). This also changes the location of the file, to match its location in other projects. Note that this workflow is not currently giving any elevated permissions to users, so it is not currently possible to damage the repo through a PR. But this might have been possible in the past, due to organization-wide defaults for token permissions (recently fixed). No evidence has been found of past exploit. See also shaka-project/shaka-streamer#216 and shaka-project/static-ffmpeg-binaries#57

joeyparrish · 2024-12-17T00:34:34Z

Workflows have been fully audited and re-enabled for this repository.

A full discussion of the workflow vulnerability will follow soon.

joeyparrish · 2024-12-20T18:03:41Z

TL;DR

Above all else, I want to emphasize this:

A vulnerability was found in one of our workflows, not our source code or binaries. The vulnerability was discovered by a researcher who reported it to Google and did not damage anything. Workflows were suspended within hours, and no evidence of compromise has been found in any repos in the weeks I've been working on this.

If you want more details, read on. Questions are welcome.

The main vulnerability

The vulnerability relied on two main things:

The default GITHUB_TOKEN permissions were read-write
There was a way to inject code into the workflow in context of that token

Versions of those two things were found in several other repos across shaka-project.

As soon as the exploit PR was noticed on shaka-streamer, workflows were paused on shaka-streamer and static-ffmpeg-binaries (both used the exact same pattern). This gave me time to investigate the PR, what it did, and how.

The exploit PR replaced build-matrix.json with build-matrix.json.js, which allowed arbitrary code to execute in context of the workflow. This is because the nodejs code require("./build-matrix.json") triggers a search that first looks for the JSON (which will be automatically read and parsed), then looks for that as the name of a local module (which will be executed).

Because the default token was read-write, the PR author was able to use that token to make live calls to the GitHub API to create a new release as a proof of concept. They could just as easily have modified an existing release, replaced attached binaries on that release, pushed a new branch, pushed to an existing unprotected branch, or created or modified tags.

The main branch was protected, so they would not have been able to push code directly to main without a merged PR. The PR author was a security researcher who did not intend us any harm, and I have verified that no existing releases, tags, or binaries were tampered with.

Mitigations

Reduce default token permissions

The repositories and the organization itself all had their default token permissions set to read-write, as in this screenshot:

So the first mitigation was to update all repository settings, first at the organization, then per-repo in each one I had a fork of. This was done with the gh command line to do it in a semi-automated way across all of those repos.

Next, I engaged in a systematic review of all repositories and workflows. There are 16 active repos under shaka-project, with many workflows each.

I prioritized the most popular/active repositories, as well as those whose maintainers were waiting to make immediate releases.

The default tokens were all basically safe now, but some workflows used non-default tokens for special actions or to do work as specific actors (like @shaka-bot). I made a list of workflows using non-default tokens, and checked how those workflows could be triggered. All were triggerable by maintainers, schedule, or commits to main, and none were exploitable by PRs or any code that had not been reviewed and merged.

Add back granular permissions

The new default token permissions had crippled some workflows that depended on those implicit permissions. So for each workflow, I triggered it, either on the main repo or on my fork, as appropriate. For the ones that failed, I added explicit, granular, and minimal permissions, isolated to the job.

Prevent JSON module code injection

Our (lazy) use of require("./build-matrix.json") was an opportunity for code injection. The solution was simple: read the file explicitly with fs, then parse it separately with JSON.parse(), eliminating the module search.

This was the only code injection used by the researcher, but this was not the only possible path to code injection discovered during the full audit. In a couple of cases, a job was too big and complex and mixed actions that involved PR-controlled code with actions that required privileges. These were split up to isolate permissions, with minimal state transferred between jobs, e.g. by something like actions/cache. It's important to separate jobs, not just steps, since each job runs in a fresh container, preventing previous jobs from tampering with the base image in a way that impacts the next job.

Avoid persisted credentials in `actions/checkout`

actions/checkout will use your token to read the remote repo, and by default, it will persist those credentials in the local repository. This is generally unnecessary and can lead to unexpected leakage of tokens in long-running workflows. This did not appear to be a factor in the exploit we experienced, but we would rather be ahead of issues in the future, so we took care of this during the post-exploit audit.

The fix is simple: explicitly set persist-credentials: false when calling actions/checkout. There are very few exceptions, each of which has an explicit true setting and clear comments about why we need it, and how we ensure it's safe.

For detailed write-ups on the problem with the default setting, see actions/checkout#485 and https://johnstawinski.com/2024/01/11/playing-with-fire-how-we-executed-a-critical-supply-chain-attack-on-pytorch/ .

All PRs addressing workflow security, organization-wide

PRs to add granular permissions to workflows:

PRs to prevent someone injecting JS in place of build-matrix.json:

PRs to refactor workflows to isolate and/or simplify permissions:

PRs to disable persisted checkout credentials:

joeyparrish added type: bug Something isn't working correctly type: announcement An announcement from the team; generally pinned to the top priority: P1 Big impact or workaround impractical; resolve before feature release labels Nov 25, 2024

joeyparrish self-assigned this Nov 25, 2024

joeyparrish added the type: CI An issue with our continuous integration tests label Nov 27, 2024

joeyparrish mentioned this issue Dec 16, 2024

ci: Read build matrix JSON explicitly shaka-project/shaka-packager#1461

Merged

joeyparrish mentioned this issue Dec 16, 2024

ci: Read build matrix JSON explicitly #59

Merged

github-actions bot added this to the Backlog milestone Dec 17, 2024

joeyparrish closed this as completed Dec 20, 2024

joeyparrish added type: vulnerability A security issue with the project, the CI, or the repo and removed type: bug Something isn't working correctly type: announcement An announcement from the team; generally pinned to the top labels Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workflows disabled pending vulnerability investigation #57

Workflows disabled pending vulnerability investigation #57

joeyparrish commented Nov 25, 2024

joeyparrish commented Dec 6, 2024

joeyparrish commented Dec 17, 2024

joeyparrish commented Dec 20, 2024

Workflows disabled pending vulnerability investigation #57

Workflows disabled pending vulnerability investigation #57

Comments

joeyparrish commented Nov 25, 2024

joeyparrish commented Dec 6, 2024

joeyparrish commented Dec 17, 2024

joeyparrish commented Dec 20, 2024

TL;DR

The main vulnerability

Mitigations

Reduce default token permissions

Add back granular permissions

Prevent JSON module code injection

Avoid persisted credentials in actions/checkout

All PRs addressing workflow security, organization-wide

Avoid persisted credentials in `actions/checkout`