indexer: drop df columns and refactoring #19308

gegaowp · 2024-09-10T21:49:27Z

Description

df_ columns except df_kind references have been removed from both indexer reader and graphql, this pr removes them from the ingestion path.

to make it happen, some refactoring was necessary, previously StoredObject was the From source Stored* of objects_history and objects_snapshot, this pr changes the source to IndexedObject, which is more intuitive as StoredObject is supposed to be coupled with objects table while IndexedObject is table agnostic.

Test plan

ci and local run

Release notes

Check each box that your changes affect. If none of the boxes relate to your changes, release notes aren't required.

For each box you select, include information after the relevant heading that describes the impact of your changes that a user might notice and any actions they must take to implement updates.

vercel · 2024-09-10T21:49:37Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
sui-docs	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Sep 20, 2024 0:23am

3 Skipped Deployments

Name	Status	Preview	Updated (UTC)
multisig-toolkit	⬜️ Ignored (Inspect)	Visit Preview	Sep 20, 2024 0:23am
sui-kiosk	⬜️ Ignored (Inspect)	Visit Preview	Sep 20, 2024 0:23am
sui-typescript-docs	⬜️ Ignored (Inspect)	Visit Preview	Sep 20, 2024 0:23am

lxfind · 2024-09-11T16:10:44Z

In order to drop a column, the process needs to take 2 releases, as described in https://www.notion.so/mystenlabs/Indexer-Release-Process-fdd71dffb3334b7c9ecd8df3cacbe5a7?pvs=4#2ad9f8dcd47241459ab700bdad13d863.
In the first release the Rust types are updated so that it tolerates the missing columns, and in the second release the table is dropped from the schema.
@amnn This is another data point in my view that we may want to consider not thinking too much about properly handling breaking changes in the short term.

gegaowp · 2024-09-11T16:20:24Z

@lxfind yes and for this specific change, the columns were always Nullable and we have removed the callsites from indexerJSON RPC and GraphQL on previous prs, so that we can fast forward to step 3, which is this pr, and then on next deployment, when the new migration is run, the columns will be dropped, did I miss anything?

lxfind · 2024-09-11T16:25:38Z

In a standard release process, where we want to ensure zero down time, we will need to be able to perform blue-green deployment, where we keep the old writer running while starting the new writer. So there will be a period of time where both old writer and new writer are running. In this case, when the new writer removes a column, the old writer will fail because it would still attempt to write to the dropped column (even if they are null). Same issue applies to readers. So in the future the proper way would be that we need to update the Rust type first before we drop the column.

However, if we want to speed up the process, I think we could ignore the proper rollout for now and always first shutdown the old indexer instances and then star the new one, which would avoid this problem, but with a few mins down time.

amnn · 2024-09-11T19:05:20Z

@amnn This is another data point in my view that we may want to consider not thinking too much about properly handling breaking changes in the short term.

However, if we want to speed up the process, I think we could ignore the proper rollout for now and always first shutdown the old indexer instances and then star the new one, which would avoid this problem, but with a few mins down time.

I think this is fine for now -- my main interest in following the backwards compatibility process is to test it out (and discover interesting edge cases like the impact of blue-green deployment on how quickly we can move) and less about actually maintaining backwards compatibility or reducing down time at the moment.

amnn

Change looks good as well, thanks for the clean-up @gegaowp !

## Description df_ columns except df_kind references have been removed from both indexer reader and graphql, this pr removes them from the ingestion path. to make it happen, some refactoring was necessary, previously `StoredObject` was the `From` source Stored* of `objects_history` and `objects_snapshot`, this pr changes the source to `IndexedObject`, which is more intuitive as `StoredObject` is supposed to be coupled with `objects` table while `IndexedObject` is table agnostic. ## Test plan ci and local run --- ## Release notes Check each box that your changes affect. If none of the boxes relate to your changes, release notes aren't required. For each box you select, include information after the relevant heading that describes the impact of your changes that a user might notice and any actions they must take to implement updates. - [ ] Protocol: - [ ] Nodes (Validators and Full nodes): - [ ] Indexer: - [ ] JSON-RPC: - [ ] GraphQL: - [ ] CLI: - [ ] Rust SDK: - [ ] REST API: --------- Co-authored-by: Will Yang <willyang@mystenlabs.com>

gegaowp requested review from amnn, emmazzz, stefan-mysten, suiwombat and wlmyng as code owners September 10, 2024 21:49

gegaowp changed the title ~~Drop df columns~~ draft not ready for review: drop df columns and refactor Sep 10, 2024

gegaowp removed request for amnn, emmazzz, stefan-mysten, suiwombat and wlmyng September 10, 2024 21:50

vercel bot deployed to Preview – sui-docs September 10, 2024 21:54 View deployment

gegaowp force-pushed the drop-df-columns branch 2 times, most recently from 51a20ed to 876bc4e Compare September 11, 2024 15:58

gegaowp changed the title ~~draft not ready for review: drop df columns and refactor~~ indexer: drop df columns and refactoring Sep 11, 2024

gegaowp requested review from amnn, bmwill, emmazzz, lxfind and wlmyng September 11, 2024 16:04

vercel bot deployed to Preview – sui-docs September 11, 2024 16:07 View deployment

amnn approved these changes Sep 11, 2024

View reviewed changes

gegaowp force-pushed the drop-df-columns branch from 876bc4e to 61260d5 Compare September 11, 2024 20:43

vercel bot deployed to Preview – sui-docs September 11, 2024 20:48 View deployment

gegaowp force-pushed the drop-df-columns branch from 61260d5 to d969ce9 Compare September 12, 2024 15:10

gegaowp force-pushed the drop-df-columns branch from 26cc857 to ac149e6 Compare September 17, 2024 17:14

vercel bot deployed to Preview – sui-docs September 17, 2024 17:19 View deployment

gegaowp force-pushed the drop-df-columns branch from ac149e6 to af2f7a7 Compare September 18, 2024 02:08

vercel bot deployed to Preview – sui-docs September 18, 2024 02:12 View deployment

gegaowp force-pushed the drop-df-columns branch 2 times, most recently from 7928f9b to 27fb7a8 Compare September 19, 2024 03:09

vercel bot deployed to Preview – sui-docs September 19, 2024 03:14 View deployment

gegaowp force-pushed the drop-df-columns branch from 27fb7a8 to 50feadf Compare September 19, 2024 03:15

vercel bot deployed to Preview – sui-docs September 19, 2024 03:19 View deployment

gegaowp force-pushed the drop-df-columns branch from 50feadf to ff55250 Compare September 19, 2024 03:54

vercel bot deployed to Preview – sui-docs September 19, 2024 03:55 View deployment

gegaowp force-pushed the drop-df-columns branch from ff55250 to 16a853b Compare September 19, 2024 14:53

vercel bot deployed to Preview – sui-docs September 19, 2024 14:55 View deployment

gegaowp force-pushed the drop-df-columns branch from 16a853b to 97a5942 Compare September 19, 2024 16:37

vercel bot deployed to Preview – sui-docs September 19, 2024 16:38 View deployment

gegaowp force-pushed the drop-df-columns branch from 97a5942 to 4ff1fc4 Compare September 19, 2024 16:46

vercel bot deployed to Preview – sui-docs September 19, 2024 16:47 View deployment

indexer: drop df_ columns

a4ec188

gegaowp force-pushed the drop-df-columns branch from 4ff1fc4 to 00415c9 Compare September 19, 2024 17:44

vercel bot deployed to Preview – sui-docs September 19, 2024 17:46 View deployment

revert re-name and extend timeout

75c5b17

gegaowp force-pushed the drop-df-columns branch from 00415c9 to 75c5b17 Compare September 19, 2024 18:53

vercel bot deployed to Preview – sui-docs September 19, 2024 18:55 View deployment

this might be the fix

91dd41c

vercel bot deployed to Preview – sui-docs September 20, 2024 00:23 View deployment

gegaowp merged commit 8e09680 into MystenLabs:main Sep 20, 2024
43 of 44 checks passed

gegaowp deleted the drop-df-columns branch September 20, 2024 01:31

This was referenced Jan 28, 2025

indexer reader: derive dynamic field info iotaledger/iota#5049

Closed

indexer: drop df columns and refactoring iotaledger/iota#5146

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

indexer: drop df columns and refactoring #19308

indexer: drop df columns and refactoring #19308

Uh oh!

gegaowp commented Sep 10, 2024 •

edited

Loading

Uh oh!

vercel bot commented Sep 10, 2024 •

edited

Loading

Uh oh!

lxfind commented Sep 11, 2024

Uh oh!

gegaowp commented Sep 11, 2024

Uh oh!

lxfind commented Sep 11, 2024

Uh oh!

amnn commented Sep 11, 2024

Uh oh!

amnn left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

indexer: drop df columns and refactoring #19308

indexer: drop df columns and refactoring #19308

Uh oh!

Conversation

gegaowp commented Sep 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Test plan

Release notes

Uh oh!

vercel bot commented Sep 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lxfind commented Sep 11, 2024

Uh oh!

gegaowp commented Sep 11, 2024

Uh oh!

lxfind commented Sep 11, 2024

Uh oh!

amnn commented Sep 11, 2024

Uh oh!

amnn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

gegaowp commented Sep 10, 2024 •

edited

Loading

vercel bot commented Sep 10, 2024 •

edited

Loading