Building Data Pipelines with Airflow
Volker Janz
Senior Developer Advocate at Astronomer

Poke mode (default)
from airflow.providers.standard.sensors.filesystem \
import FileSensor
wait = FileSensor(
task_id="wait",
filepath="/data/report.csv",
mode="poke",
poke_interval=30,
)
Reschedule mode
from airflow.providers.standard.sensors.filesystem \
import FileSensor
wait = FileSensor(
task_id="wait",
filepath="/data/report.csv",
mode="reschedule",
poke_interval=300,
)

from airflow.providers.standard.sensors.filesystem import FileSensor wait_for_data = FileSensor( task_id="wait_for_data", filepath="/data/incoming/report.csv",deferrable=True,poke_interval=30, timeout=3600, )
poke_interval and timeout still apply| Mode | Best for | Worker slot |
|---|---|---|
| poke | Short waits (< 1 min) | Held entire time |
| reschedule | Moderate waits, no Triggerer | Freed between checks |
| deferrable | Long waits, many sensors | Never used |
AIRFLOW__OPERATORS__DEFAULT_DEFERRABLE=True
deferrable=False if neededBuilding Data Pipelines with Airflow