Wednesday 13 October 2021

Airflow: Introduction to DAG

DAG is a directed acyclic graph.

 

DAG is a directed graph that doesn’t have any cycles. Apache airflow uses DAG to represent collection of tasks and their dependencies. Each node is a DAG represent a task and edge represent a dependency.

 

Where can I place the dag definition?

DAG is defined in python file and can be placed in dags folder.

 

Important properties of DAG in apache airflow

Property

Description

dag_id

Unique identifies to represent your dag

description

Description about dag

start_date

When dag should start

schedule_interval

Define how often your dag runs

depend_on_past

Run next DAG run only if the previous dag run completed successfully.

default_args

Map (or) dictionary of variables used to initialize operators.

Previous                                                    Next                                                    Home

No comments:

Post a Comment