How do I clean up older data from the database? #12047
-
After using …
-
One option is to use the Python APIs against the DagsterInstance to query for older runs and delete them. This is a destructive operation that will remove the events, tags, and run record from the database. It will remove dagster's understanding that this run ever occurred, which can be particularly impactful to partitioned jobs and assets. Perform this operation with great care. An example script would look something like this.
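The script below is a sketch of that approach, not the original reply's exact code. It assumes DAGSTER_HOME points at the instance whose database you want to prune, and uses the DagsterInstance.get(), get_run_records(), and delete_run() APIs; the 90-day cutoff and batch size are placeholders to adapt.

import datetime

from dagster import DagsterInstance, RunsFilter

# Placeholder retention window: runs created more than 90 days ago.
cutoff = datetime.datetime.now(datetime.timezone.utc) - datetime.timedelta(days=90)

instance = DagsterInstance.get()  # reads DAGSTER_HOME to locate the instance
old_run_records = instance.get_run_records(
    filters=RunsFilter(created_before=cutoff),
    limit=100,       # delete in small batches; re-run until nothing is left
    ascending=True,  # oldest runs first
)
for record in old_run_records:
    # Destructive: removes the run record, its event logs, and its tags.
    instance.delete_run(record.dagster_run.run_id)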
-
Will this trigger the relevant cleanup to take place as well (like …)?
-
If I want to keep ASSET_MATERIALIZATION and ASSET_OBSERVATION events for the UI, is there a way to do that?
-
Is there a clean way to also remove the associated folders on disk? I have a disk that's slowly filling up with the intermediate results of runs.
-
Just for inspiration: we are currently cleaning up the disk by blindly deleting storage older than 30 days.
No guarantee this is a good idea; I can only confirm it hasn't broken our production instance (yet).
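A minimal sketch of what that could look like, assuming the default local artifact storage layout where each run writes its outputs to a directory under $DAGSTER_HOME/storage; the path and 30-day cutoff are assumptions, and the deletion is irreversible.

import os
import shutil
import time
from pathlib import Path

# Assumed layout: one directory per run under $DAGSTER_HOME/storage.
storage_root = Path(os.environ["DAGSTER_HOME"]) / "storage"
cutoff_seconds = 30 * 24 * 60 * 60  # 30 days

now = time.time()
for run_dir in storage_root.iterdir():
    if run_dir.is_dir() and now - run_dir.stat().st_mtime > cutoff_seconds:
        shutil.rmtree(run_dir)  # irreversibly deletes the run's stored outputs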
-
In Dagster+, runs can be deleted via an authorized call to the GraphQL API using the deleteRun mutation:

import os

from gql import Client, gql
from gql.transport.requests import RequestsHTTPTransport

# Define the endpoint URL and token
org_name = "your-org-name"
base_url = f"https://{org_name}.dagster.cloud/"
deployment_name = "prod"  # the string name for full deployments; branch deployments use the string ID of their deploymentId
url = base_url + deployment_name + "/graphql"
token = os.getenv("DAGSTER_CLOUD_USER_TOKEN")  # a user token generated from the Organization Settings page in Dagster+. Note: use a user token, not an agent token

# Define the transport with the endpoint URL and any headers if needed
transport = RequestsHTTPTransport(
    url=url,
    headers={
        "Dagster-Cloud-Api-Token": token,
    },
    use_json=True,
    timeout=60,
)

# Instantiate the client
client = Client(transport=transport)

# Define the GraphQL mutation
delete_run_mutation = gql("""
mutation DeleteRun($runId: String!) {
  deleteRun(runId: $runId) {
    __typename
    ... on DeletePipelineRunSuccess {
      runId
    }
    ... on RunNotFoundError {
      runId
    }
    ... on PythonError {
      message
      stack
    }
  }
}
""")

# Define the query variables
query_variables = {
    "runId": "your-run-id"  # replace with the actual run ID you want to delete
}

# Execute the mutation
try:
    result = client.execute(delete_run_mutation, variable_values=query_variables)
    print(result)
except Exception as e:
    print(f"An error occurred: {e}")
-
Database clean-up job (Dagster OSS)
We are running this Dagster database clean-up job weekly.
import psycopg2
from textwrap import dedent

from dagster import ConfigurableResource, OpExecutionContext, job, op

# Assumed shape of the connection resource (its definition is not shown in the
# original post); it would be wired up under the "postgresql_dagster" resource key.
class DatabaseConfig(ConfigurableResource):
    host: str
    database: str
    user: str
    password: str

@op
def postgresql_dagster_cleanup_op(context: OpExecutionContext, postgresql_dagster: DatabaseConfig):
    # connect
    conn = psycopg2.connect(
        host=postgresql_dagster.host,
        database=postgresql_dagster.database,
        user=postgresql_dagster.user,
        password=postgresql_dagster.password,
    )
    context.log.debug("Successfully connected to PostgreSQL!")
    cursor = conn.cursor()

    # remove debug logs older than a week
    cursor.execute(
        dedent(
            """\
            delete from event_logs el
            where
                dagster_event_type is null
                and event::jsonb->>'level' = '10'
                and timestamp < CURRENT_DATE - INTERVAL '1 week';
            """
        ),
    )
    conn.commit()
    context.log.info(f"Removed {cursor.rowcount} debug logs older than a week!")

    # remove info logs older than 2 months
    cursor.execute(
        dedent(
            """\
            delete from event_logs el
            where
                dagster_event_type is null
                and event::jsonb->>'level' = '20'
                and timestamp < CURRENT_DATE - INTERVAL '2 months';
            """
        ),
    )
    conn.commit()
    context.log.info(f"Removed {cursor.rowcount} info logs older than 2 months!")

    # remove warning logs older than 2 months
    cursor.execute(
        dedent(
            """\
            delete from event_logs el
            where
                dagster_event_type is null
                and event::jsonb->>'level' = '30'
                and timestamp < CURRENT_DATE - INTERVAL '2 months';
            """
        ),
    )
    conn.commit()
    context.log.info(f"Removed {cursor.rowcount} warning logs older than 2 months!")

    # remove non-essential dagster events older than a month (https://second-foundation.atlassian.net/browse/DATP-1551)
    cursor.execute(
        dedent(
            """\
            DELETE
            FROM event_logs
            WHERE 1 = 1
                AND timestamp < CURRENT_DATE - INTERVAL '1 month'
                AND dagster_event_type IN (
                    -- Transition states. Used in business logic, but not important after the job finishes.
                    'ASSET_MATERIALIZATION_PLANNED',
                    -- System logs
                    'ENGINE_EVENT',
                    'HANDLED_OUTPUT',
                    'LOADED_INPUT',
                    -- Transition states. Used in business logic, but not important after the job finishes.
                    'PIPELINE_CANCELING',
                    'PIPELINE_ENQUEUED',
                    'PIPELINE_STARTING',
                    -- System logs
                    'RESOURCE_INIT_FAILURE',
                    'RESOURCE_INIT_STARTED',
                    'RESOURCE_INIT_SUCCESS',
                    'STEP_INPUT',
                    'STEP_OUTPUT',
                    'STEP_WORKER_STARTED',
                    'STEP_WORKER_STARTING'
                );
            """
        ),
    )
    conn.commit()
    context.log.info(f"Removed {cursor.rowcount} non-essential events older than 1 month!")
@job
def postgresql_dagster_cleanup_job():
    postgresql_dagster_cleanup_op()
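The original post does not show how the weekly cadence is wired up; a minimal sketch using a ScheduleDefinition, where the cron expression is an assumption (the post only says "weekly"):

from dagster import ScheduleDefinition

# Assumed cadence: run the clean-up job every Sunday at 03:00.
postgresql_dagster_cleanup_schedule = ScheduleDefinition(
    job=postgresql_dagster_cleanup_job,
    cron_schedule="0 3 * * 0",
)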
Additional indexes:
What can you expect when running clean-up?
The clean-up job finished... what now?
Details about VACUUM (RECOVERING DISK SPACE)

The standard form of VACUUM removes dead row versions in tables and indexes and marks the space available for future reuse. However, it will not return the space to the operating system, except in the special case where one or more pages at the end of a table become entirely free and an exclusive table lock can be easily obtained. In contrast, VACUUM FULL actively compacts tables by writing a complete new version of the table file with no dead space. This minimizes the size of the table, but can take a long time. It also requires extra disk space for the new copy of the table, until the operation completes. The usual goal of routine vacuuming is to do standard VACUUMs often enough to avoid needing VACUUM FULL. The autovacuum daemon attempts to work this way, and in fact will never issue VACUUM FULL. In this approach, the idea is not to keep tables at their minimum size, but to maintain steady-state usage of disk space: each table occupies space equivalent to its minimum size plus however much space gets used up between vacuum runs. Although VACUUM FULL can be used to shrink a table back to its minimum size and return the disk space to the operating system, there is not much point in this if the table will just grow again in the future. Thus, moderately-frequent standard VACUUM runs are a better approach than infrequent VACUUM FULL runs for maintaining heavily-updated tables.

Here are my utility queries for VACUUM:

-- Get dead tuples: deleted/updated rows that were not collected yet
SELECT relname, n_dead_tup FROM pg_stat_user_tables ORDER BY n_dead_tup DESC;
-- Get analyze stats
SELECT relname, last_vacuum, last_analyze, last_autovacuum, last_autoanalyze,
       vacuum_count, autovacuum_count, analyze_count, autoanalyze_count
FROM pg_stat_all_tables
WHERE relname = 'event_logs';
-- Table/Index size
SELECT relname,
       pg_size_pretty(pg_relation_size(oid)) AS table_size,
       pg_size_pretty(pg_total_relation_size(oid) - pg_relation_size(oid)) AS index_size
FROM pg_class
WHERE relname = 'event_logs';
-- Run vacuum + refresh indexes
VACUUM VERBOSE ANALYZE dagster.event_logs;
-- Vacuum progress
SELECT
n.nspname || '.' || c.relname AS table_name,
v.phase,
round(100.0 * v.heap_blks_scanned / NULLIF(v.heap_blks_total, 0), 2) AS pct_scanned,
round(100.0 * v.heap_blks_vacuumed / NULLIF(v.heap_blks_total, 0), 2) AS pct_vacuumed,
v.heap_blks_total,
v.heap_blks_scanned,
v.heap_blks_vacuumed
FROM
pg_stat_progress_vacuum v
JOIN
pg_class c ON v.relid = c.oid
JOIN
pg_namespace n ON c.relnamespace = n.oid;
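If you want to trigger the VACUUM from Python rather than psql, note that VACUUM cannot run inside a transaction block, so the psycopg2 connection must be switched to autocommit first. A minimal sketch (the connection parameters are placeholders):

import psycopg2

conn = psycopg2.connect(host="...", database="dagster", user="...", password="...")
conn.autocommit = True  # VACUUM is rejected inside a transaction block
with conn.cursor() as cursor:
    cursor.execute("VACUUM VERBOSE ANALYZE dagster.event_logs;")
conn.close()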
My notes:
🚀 Follow up: Shrinking the database with pg_repack
Since we are using an Azure PostgreSQL Flexible Server instance, some parts might differ, but I will share my path.
Feel free to reach out to me if you need to discuss it. :)