Introduction to Apache Airflow in Python
Mike Metzger
Data Engineer




all_success - Every prior task finished successfully, no skipsall_failed - Every prior task failedall_done - Every prior task completed, regardless of outcomeone_failed - At least one task failedone_success - At least one task succeedednone_failed - All prior tasks finished successfully or skippedfrom airflow.utils.trigger_rule import TriggerRule@task decoratortrigger_rule=TriggerRule.<TriggerRule enum>@task(trigger_rule=TriggerRule.ALL_SUCCESS)
def run_if_everything_succeeds:
print('All previous tasks succeeded!')
retries - task parameter defining how many times Airflow should retry a failed task before marking it as failedretry_delay - task parameter accepting a timedelta defining the wait time between retry attempts@task(
trigger_rule=TriggerRule.ONE_SUCCESS,
retries=3,
retry_delay=timedelta(minutes=5)
)
from airflow.providers.standard.operators.trigger_dagrun
import TriggerDagRunOperatortrigger_dag_id - Must match dag_idwait_for_completion - Block further tasks until completepoke_interval - How often to checkconf - Data to pass to child Dagtrigger_child = TriggerDagRunOperator(
task_id="trigger_child_pipeline",
trigger_dag_id="child_pipeline_dag",
wait_for_completion=True,
poke_interval=30,
conf={
"source": "s3://my-bucket/raw/",
},
)
task1 >> trigger_child # Run first task, then kick off child Dag
trigger_child >> cleanup # Run a cleanup task after child Dag completes
Introduction to Apache Airflow in Python