-
Thanks for opening your first issue here! Be sure to follow the issue template!
-
Is there any reason you can't have an "all_done" task that checks whether any upstream task succeeded and only then triggers the data loading?
IMHO, it is fully doable, does not require adding any complexity or more complex trigger rules, and it reflects your use case very well (you anyhow need to wait with the decision until all of the ingestion tasks are done). It also gives you a clear signal when nothing was ingested: the "CHECK ALL UPSTREAMS" task will clearly inform you that this was the reason. Checking the state of those tasks can easily be done via XCom (and if you write the "CHECK" task as a @task-decorated operator, it is as simple as checking the output of all of the tasks). Moreover, the "check" task could simply pass the list of all ingested datasets as a single input to the LOAD task, so that it does not have to rely on checking whether any of the tasks failed or not.

For me this is a very nice separation of concerns: one task decides whether to proceed or not, based on a deep analysis of which of those tasks succeeded and which did not, and separately a LOAD task loads whatever you pass to it, without worrying about states there. I am afraid your LOAD task will have to do that kind of check anyway if you want the rule you describe (just inside one multi-purpose task). Splitting those seems much cleaner (and you could make your "CHECK" task reusable across several different LOAD tasks as well, without having to implement the same "select only those that succeeded" logic again). Anything wrong with this approach in your case?
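A minimal sketch of that separation, assuming a recent Airflow 2.x with the TaskFlow API. The task names (ingest_a, ingest_b, check_ingestions, load) and the returned dataset paths are made up for illustration, and the check task assumes that a failed upstream's return value resolves to None when pulled under trigger_rule="all_done":

import pendulum
from airflow.decorators import dag, task
from airflow.exceptions import AirflowFailException


@dag(schedule_interval=None, start_date=pendulum.datetime(2024, 1, 1, tz="UTC"), catchup=False)
def ingest_then_load():

    @task
    def ingest_a():
        # hypothetical ingestion; returns the dataset it produced
        return "lake/source_a/2024-01-01"

    @task
    def ingest_b():
        # simulate an unavailable source
        raise RuntimeError("source B is unavailable")

    @task(trigger_rule="all_done")
    def check_ingestions(results: list):
        # results of failed upstream tasks are assumed to arrive as None here;
        # keep only the datasets that were actually produced
        ingested = [r for r in results if r is not None]
        if not ingested:
            raise AirflowFailException("no ingestion task succeeded, nothing to load")
        return ingested

    @task
    def load(datasets: list):
        # loads whatever it is given, without caring about upstream states
        for dataset in datasets:
            print(f"loading {dataset} into the warehouse")

    load(check_ingestions([ingest_a(), ingest_b()]))


ingest_then_load()

With this layout, check_ingestions owns the "did anything succeed?" decision and load stays a dumb consumer of whatever list it receives, so the same check task can sit in front of any number of different LOAD tasks.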
-
@potiuk your proposed solution sounds really good, thank you! I'd just like to remark that the data loading task doesn't necessarily have to know which data ingestion tasks actually failed/succeeded to make further decisions. E.g., it may simply pick up whatever files match some kind of a pattern. So my idea was to just have a trigger rule handling such simple cases (of course, without introducing extra complexity). Yes, I agree that adding yet another auxiliary task to check the state of the upstream tasks isn't a big deal, but it's still a kind of logic that might arise again and again...

Also, I believe the approach described above is pretty generic, so it can be used to imitate a lot of different behaviors, including but not limited to the one I'm asking for.

I know that what I'm trying to describe covers a pretty simple use case, but I really think that good software should strive to make simple things even easier (and wrong things impossible!), and in this particular case, unfortunately, there isn't an easy way to achieve the desired behavior.
-
Then your LOAD task should fail if it does not find any file matching the pattern (and it can still use all_done). Problem solved.
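A hedged sketch of that variant, assuming the ingestion tasks drop files into a staging directory; the directory path and file pattern here are made up for illustration. The LOAD task runs under trigger_rule="all_done" and fails on its own when nothing matched:

from pathlib import Path

from airflow.decorators import task
from airflow.exceptions import AirflowFailException

# hypothetical location written to by the ingestion tasks
STAGING_DIR = Path("/data/lake/staging")


@task(trigger_rule="all_done")
def load():
    # Pick up whatever the ingestion tasks managed to produce, regardless of their states.
    files = sorted(STAGING_DIR.glob("*.parquet"))
    if not files:
        raise AirflowFailException("no ingested files found, nothing to load")
    for f in files:
        print(f"loading {f} into the warehouse")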
-
Converting it into a discussion. I do not really see a reason for a new trigger rule; it seems that the patterns we have now support the use cases pretty well (either by an intermediate task or by simply failing when no outputs were produced by upstream tasks) - for example with pattern matching, while using all_done.
-
Could there be a solution with implementation of
-
I ran into this problem and, like many, I really wish there was a trigger_rule that would solve this. It makes perfect sense. NOTE: the only important part in this example is the all_done_min_one_success function.

import datetime

import pendulum
from airflow.decorators import dag, task_group, task
from airflow.models.taskinstance import TaskInstance
from airflow.models.dagrun import DagRun
from airflow.exceptions import AirflowTaskTerminated

# pylint: disable=expression-not-assigned,no-value-for-parameter,pointless-statement

RETRIES = 0


@dag(
    dag_id="my_dag",
    schedule_interval=None,
    start_date=pendulum.datetime(2021, 1, 1, tz="UTC"),
    catchup=False,
    dagrun_timeout=datetime.timedelta(minutes=7 * 24 * 60),
)
def my_dag():
    "this is an example dag"

    @task(trigger_rule="all_done")
    def all_done_min_one_success(dag_run: DagRun = None, ti: TaskInstance = None):
        "Workaround task to check if all upstream tasks are done and at least one has succeeded"
        # Get all tasks that are directly upstream of this task.
        # For *all* upstream tasks, not just direct relatives, use 'get_flat_relative_ids(upstream=True)'.
        upstream_task_ids: set[str] = ti.task.get_direct_relative_ids(upstream=True)
        # Get all task instances of this DagRun that have succeeded or failed
        succeeded_task_instances: list[TaskInstance] = dag_run.get_task_instances(state="success")
        failed_task_instances: list[TaskInstance] = dag_run.get_task_instances(state="failed")
        # Intersect the succeeded/failed ids with the direct upstream ids
        succeeded_upstream_task_ids = upstream_task_ids.intersection(
            [t.task_id for t in succeeded_task_instances]
        )
        failed_upstream_task_ids = upstream_task_ids.intersection(
            [t.task_id for t in failed_task_instances]
        )
        # Check whether at least one upstream task has succeeded
        if len(succeeded_upstream_task_ids) >= 1:
            # (Optional) log or print successes and failures
            print(f"the following tasks succeeded: {succeeded_upstream_task_ids}")
            print(f"the following tasks failed: {failed_upstream_task_ids}")
        else:
            # Log the failed tasks and raise an exception, such as AirflowTaskTerminated.
            # For a full list of exceptions, see the documentation here:
            # https://airflow.apache.org/docs/apache-airflow/stable/_api/airflow/exceptions/index.html#exceptions
            print(f"no tasks succeeded. The following tasks failed: {failed_upstream_task_ids}")
            raise AirflowTaskTerminated("no tasks succeeded")

    @task(retries=RETRIES)
    def dummy1():
        print("hello")

    @task(retries=RETRIES)
    def dummy1_1():
        print("world")

    @task(retries=RETRIES)
    def dummy2():
        raise Exception("something has gone wrong")

    @task_group(group_id="my_group")
    def my_group():
        dummy1() >> dummy1_1()
        dummy2()

    my_group() >> all_done_min_one_success() >> dummy1() >> dummy1_1()


dag = my_dag()
-
Description
The trigger rule was requested in #10758 and #17010, but none_failed_min_one_success was proposed as a solution. I find the proposed solution unsatisfactory, since none_failed_min_one_success isn't the same as all_done_min_one_success (the latter rule allows some upstream tasks to fail, as long as at least one of them succeeds). I can try working on a PR if the feature is approved.
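To make the difference concrete, here is a hedged pseudocode-style sketch of the two predicates. This is an illustration only, not Airflow's actual trigger-rule implementation; assume every upstream task has already reached a final state and we count those states:

def none_failed_min_one_success(success: int, failed: int, upstream_failed: int, skipped: int) -> bool:
    # existing rule: no upstream may have failed, and at least one must have succeeded
    return failed == 0 and upstream_failed == 0 and success >= 1


def all_done_min_one_success(success: int, failed: int, upstream_failed: int, skipped: int) -> bool:
    # proposed rule: failures are tolerated, as long as at least one upstream succeeded
    return success >= 1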
Use case/motivation
I have a few data ingestion tasks (each one ingests data from a separate external data source to the internal data lake) and a data loading task (it loads data from the data lake to the data warehouse). I want the data loading task to be triggered if at least one of the data ingestion tasks succeeds. Obviously, if all the data ingestion tasks fail, then there is no point in triggering the data loading task. But if at least one of them succeeds, then in my particular case it's better to load the just ingested data (and figure out why the other tasks failed later) rather than not loading anything at all.
Related issues
Are you willing to submit a PR?
Code of Conduct