Repo Metrics collects, processes, and presents various metrics related to GitHub repositories.

- Collection: The collector lambda collects metrics from GitHub and supplementary services like SonarCloud and Snyk, and stores them in a file, `snapshot.json`.
- Aggregation: `snapshot.json` is read by the aggregation lambda, which processes the data into a format suitable for presentation and stores it in another file, `webapp.json`.
- Reporting: `snapshot.json` is read by the reporter lambda, and the sum of current vulnerabilities is sent to the Slack channel `#cals-dev-info`.
- Presentation: `webapp.json` is read by the webapp and presented at https://d2799m9v6pw1zy.cloudfront.net/.
- Instance URL: https://d2799m9v6pw1zy.cloudfront.net/
- Documentation: https://liflig.atlassian.net/l/cp/rhke7t35
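The aggregation step can be pictured as a pure function from the raw snapshot to the presentation format. The shapes below are hypothetical illustrations only; the real schemas are presumably defined in the `types` package.

```typescript
// Hypothetical shapes for illustration only -- the real schemas are
// presumably defined in the types package.
interface Snapshot {
  collectedAt: string
  repos: {
    name: string
    vulnerabilities: number // from Snyk
    codeQualityIssues: number // from SonarCloud
  }[]
}

interface WebappData {
  updatedAt: string
  repos: { name: string; vulnerabilities: number; codeQualityIssues: number }[]
}

// Conceptually, the aggregator reads snapshot.json and emits webapp.json.
function aggregate(snapshot: Snapshot): WebappData {
  return {
    updatedAt: snapshot.collectedAt,
    repos: snapshot.repos,
  }
}
```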
```mermaid
%%{init: {'theme':'neutral'}}%%
graph TB
  subgraph Repo Metrics
    subgraph Sources
      GitHub
      Snyk
      SonarCloud
    end
    subgraph "Data Processing (state machine, runs every 6h)"
      subgraph Collection
        collector(Lambda: Collector)
        secrets(Secrets Manager)
        raw_data[(S3 Bucket<br/>Raw data)]
        collector -- Fetch API credentials --> secrets
        collector -- Fetch data --> Sources
        collector -- Write raw data --> raw_data
      end
      subgraph Aggregation
        aggregator(Lambda: Aggregator)
        processed_data[(S3 Bucket<br/>Repo data)]
        aggregator -- Write processed data --> processed_data
      end
    end
    subgraph Reporting
      report(Lambda: Reporter<br/>schedule: about every 7h)
      chat(Slack)
      report -- Send report --> chat
    end
    subgraph Presentation
      cf(CloudFront)
      static_files[(S3 Bucket<br/>Static files)]
      user(User)
      cf -- Read static files --> static_files
      user -- Browse --> cf
    end
    aggregator -- Read raw data --> Collection
    report -- Read raw data --> Collection
    cf -- Read processed data --> Aggregation
  end
```
Build all packages:

```
task
# or
task build
```

Build specific packages:

```
task types.build
task lambdas.build
task webapp.build
task infra.build
```
To run repo-metrics locally, we must provide a data file to the webapp. This file is located at `packages/repo-collector/data/webapp.json` and may be produced using either of the two approaches outlined below.
This approach downloads data from remote sources to the local file system, then processes it into a webapp-friendly format.

Requires `cals-cli` to be configured with tokens for SonarCloud, Snyk and GitHub.

```
$ task update-local-data
```
This approach downloads unprocessed (snapshot files) and processed (webapp-friendly) data from S3 to the local file system.

Requires an active shell session with administrative privileges in the liflig-incubator account, e.g. `aws-vault exec liflig-incubator-admin`.

```
$ task download-s3-data
```
After data has been collected and aggregated into `packages/repo-collector/data/webapp.json`, we serve it to the webapp. Do this in two separate windows/panes, as the data must be served while the webserver runs.

- Serve local data: `task serve-local-data`
- Start webserver: `task start-webserver`
Open the local server at http://localhost:3000.
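If you want to sanity-check the served data without the webapp, a small script can fetch and inspect it. This is only a sketch: the data server's URL below is an assumption, so check the output of `task serve-local-data` for the real address.

```typescript
// Sketch: fetch the locally served webapp.json and print a summary.
// The URL is an assumption -- serve-local-data prints the real address.
const DATA_URL = "http://localhost:8080/webapp.json"

async function inspectData(): Promise<void> {
  const response = await fetch(DATA_URL)
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`)
  }
  const data = await response.json()
  console.log(`Top-level keys: ${Object.keys(data).join(", ")}`)
}

inspectData().catch(console.error)
```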
`cals-cli` is used for the remote calls and also controls how the API keys are set up.
API Keys must be set for:
- GitHub
- Snyk
- SonarCloud
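In the deployed pipeline, the collector fetches these credentials from Secrets Manager instead (see the diagram above). A minimal sketch of that lookup using the AWS SDK for JavaScript v3; the secret name in the usage example is a made-up placeholder:

```typescript
import {
  GetSecretValueCommand,
  SecretsManagerClient,
} from "@aws-sdk/client-secrets-manager"

const client = new SecretsManagerClient({})

// Fetch an API token from Secrets Manager. The actual secret names are
// not shown here -- they would be defined by the infra package.
async function getApiToken(secretId: string): Promise<string> {
  const result = await client.send(
    new GetSecretValueCommand({ SecretId: secretId }),
  )
  if (result.SecretString == null) {
    throw new Error(`Secret ${secretId} has no string value`)
  }
  return result.SecretString
}

// Hypothetical usage:
// const githubToken = await getApiToken("repo-metrics/github-token")
```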
This repo is built and deployed automatically on pushes to master.
The lambdas used for updating data are orchestrated by an AWS Step Functions state machine. This state machine runs on a schedule, but we can trigger it manually to refresh existing data.
Run the command below using AWS Vault and the `liflig-incubator-admin` role.

```
$ task update-remote-data
```
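Triggering the state machine manually presumably boils down to a StartExecution call; a minimal sketch using the AWS SDK for JavaScript v3, where the ARN lookup is left as a placeholder:

```typescript
import { SFNClient, StartExecutionCommand } from "@aws-sdk/client-sfn"

const client = new SFNClient({})

// Start a manual run of the data-processing state machine.
// The ARN must be looked up, e.g. in the AWS console.
async function triggerDataRefresh(stateMachineArn: string): Promise<void> {
  const result = await client.send(
    new StartExecutionCommand({ stateMachineArn }),
  )
  console.log("Started execution:", result.executionArn)
}
```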
Architecture Decision Records in this project are stored in the `./doc/adr` directory.
Refer to the first ADR for more information.
This project accepts contributions. To get started, please contact the maintainers on Slack.