Tuesday, 31 May 2022

Apache Atlas tutorial

      Introduction to Apache Atlas
      Quick introduction to Data Catalog
            Key stakeholders of Data Catalog
            Is Data Catalog stores actual data?
            Data Inventory vs Data Catalog
            How a data catalog ensures data quality?
            How a data catalog ensures data Governance?
      Setup Apache atlas in embedded mode
      Apache Atlas: Load sample data
      Atlas: Type system
      Apache Atlas: Entities
      Apache Atlas: Entity vs Struct meta types
      Apache atlas: get the type definition by name
      Apache Atlas: core built-in types
      Apache Atlas: Define types, relationships and entities
      Quick Introduction to Data lineage
            Why do we need to answer the question 'where the data originated from?
            Why do we need to answer the question 'how it has been modified or enriched along the way?'
            Why do we need to answer the question 'which downstream processes or systems consume the data?
            Types of Data Lineage
                  Field level data lineage
                  Table level data lineage
                  Process level lineage
                  End to end lineage
                  Business Lineage
                  Physical lineage
                  Operational lineage
                  Forward Data lineage
                  Backward Data lineage
            Apache Atlas: data lineage example
      Apache Atlas: Create and attach classification to an entity
      Apache Atlas: Glossary, category and terms
      Apache Atlas: Advanced search
      Apache Atlas: See the audit reports
      Apache Atlas: example to add relationship between two entities
      Apache Atlas: Hard delete an entity
Previous                                                    Next                                                    Home

No comments:

Post a Comment