Wednesday 1 June 2022

Apache Atlas: Load sample data

In my previous post, I explained the installation process of Apache Atlas. In this post, I am going to explain how to load sample data so that you can experiment with Apache Atlas.

 

Go to the Atlas installation directory, navigate to the bin folder, and execute the script ‘quick_start.py’.

bash-3.2$ ./quick_start.py 
Enter username for atlas :- admin
Enter password for atlas :- 

  The ‘quick_start.py’ script prompts for a username and password. Use admin as the username and admin as the password. Once the details are entered, the script starts creating types, entities, and lineage.

 

$ ./quick_start.py 
Enter username for atlas :- admin
Enter password for atlas :- 

Creating sample types: 
Created type [DB]
Created type [Table]
Created type [StorageDesc]
Created type [Column]
Created type [LoadProcess]
Created type [LoadProcessExecution]
Created type [View]
Created type [JdbcAccess]
Created type [ETL]
Created type [Metric]
Created type [PII]
Created type [Fact]
Created type [Dimension]
Created type [Log Data]
Created type [Table_DB]
Created type [View_DB]
Created type [View_Tables]
Created type [Table_Columns]
Created type [Table_StorageDesc]

Creating sample entities: 
Created entity of type [DB], guid: 0c9e2a2f-345b-4be5-a84d-fa247543a065
Created entity of type [DB], guid: 7a731c08-76d3-49f7-b53e-a19e22d41c51
Created entity of type [DB], guid: 773a3629-4d70-425f-99bb-9d5f3e4717f1
Created entity of type [Table], guid: db71918c-152b-4730-b028-17822f0c0c54
Created entity of type [Table], guid: 6edb96e7-cfdd-4294-9305-9a0d3e0f4ef8
Created entity of type [Table], guid: 47ea153d-5dea-4ecf-a503-084e55e152db
Created entity of type [Table], guid: 1cbf586a-d069-409e-b3b0-0d84df737035
Created entity of type [Table], guid: d30790a9-13e9-41d9-8962-be0f73d35cdf
Created entity of type [Table], guid: 1d7fe9c6-1732-496a-81f2-7b05bc2e775f
Created entity of type [Table], guid: 21e99241-fde0-472a-903c-74b79b2b3dc1
Created entity of type [Table], guid: 719336d9-1d36-44d3-8cd1-d9bca84fec0d
Created entity of type [View], guid: 3a368508-4b10-4947-8e24-0c781687c8b2
Created entity of type [View], guid: a980dd23-f536-40b3-a2ba-705993018059
Created entity of type [LoadProcess], guid: afb88ebb-0351-4a44-b670-41ca1021f3a1
Created entity of type [LoadProcessExecution], guid: 17a7d46d-ad0b-47a9-bc08-55f1d82eec2d
Created entity of type [LoadProcessExecution], guid: 55de77f8-07a2-452f-afa8-c62befb801e1
Created entity of type [LoadProcess], guid: a970999e-c0f2-46e1-8122-80ab4a697ef3
Created entity of type [LoadProcessExecution], guid: e167c2dd-19ea-411f-95d1-1e45a7fe3b0c
Created entity of type [LoadProcessExecution], guid: 4e779715-cede-4d41-9642-ce0f2974a275
Created entity of type [LoadProcess], guid: 175c3b87-4ed1-499b-bc18-97cc6b35fce0
Created entity of type [LoadProcessExecution], guid: 94215ab7-98f9-4abb-a9ba-4eaffda866fc
Created entity of type [LoadProcessExecution], guid: 27854b6a-87d9-41fe-96fa-1628a09fc02f

Sample DSL Queries: 
query [from DB] returned [3] rows.
query [DB] returned [3] rows.
query [DB where name=%22Reporting%22] returned [1] rows.
query [DB where name=%22encode_db_name%22] returned [ 0 ] rows.
query [Table where name=%2522sales_fact%2522] returned [1] rows.
query [DB where name="Reporting"] returned [1] rows.
query [DB where DB.name="Reporting"] returned [1] rows.
query [DB name = "Reporting"] returned [1] rows.
query [DB DB.name = "Reporting"] returned [1] rows.
query [DB where name="Reporting" select name, owner] returned [1] rows.
query [DB where DB.name="Reporting" select name, owner] returned [1] rows.
query [DB has name] returned [3] rows.
query [DB where DB has name] returned [3] rows.
query [DB is JdbcAccess] returned [ 0 ] rows.
query [from Table] returned [8] rows.
query [Table] returned [8] rows.
query [Table is Dimension] returned [4] rows.
query [Column where Column isa PII] returned [6] rows.
query [View is Dimension] returned [2] rows.
query [Column select Column.name] returned [8] rows.
query [Column select name] returned [8] rows.
query [Column where Column.name="customer_id"] returned [4] rows.
query [from Table select Table.name] returned [8] rows.
query [DB where (name = "Reporting")] returned [1] rows.
query [DB where DB is JdbcAccess] returned [ 0 ] rows.
query [DB where DB has name] returned [3] rows.
query [DB as db1 Table where (db1.name = "Reporting")] returned [ 0 ] rows.
query [Dimension] returned [7] rows.
query [JdbcAccess] returned [2] rows.
query [ETL] returned [10] rows.
query [Metric] returned [6] rows.
query [PII] returned [6] rows.
query [`Log Data`] returned [5] rows.
query [Table where name="sales_fact", columns] returned [4] rows.
query [Table where name="sales_fact", columns as column select column.name, column.dataType, column.comment] returned [4] rows.
query [from DataSet] returned [10] rows.
query [from Process] returned [3] rows.

Sample Lineage Info: 
loadSalesDaily(LoadProcess) -> sales_fact_daily_mv(Table)
loadSalesMonthly(LoadProcess) -> sales_fact_monthly_mv(Table)
sales_fact(Table) -> loadSalesDaily(LoadProcess)
time_dim(Table) -> loadSalesDaily(LoadProcess)
sales_fact_daily_mv(Table) -> loadSalesMonthly(LoadProcess)
Sample data added to Apache Atlas Server.
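
The DSL queries listed in the output can also be run by hand against the Atlas REST API. Here is a minimal sketch, assuming the v2 DSL search endpoint (/api/atlas/v2/search/dsl), basic authentication with the admin/admin credentials used above, and the server running on localhost:21000. The helper names build_dsl_request and run_dsl_query are my own and are not part of Atlas.

```python
import base64
import json
import urllib.parse
import urllib.request

ATLAS_URL = "http://localhost:21000"  # assumption: same host and port as the Atlas UI


def build_dsl_request(query, user="admin", password="admin", base_url=ATLAS_URL):
    """Build a GET request for the Atlas v2 DSL search endpoint."""
    # Percent-encode the query so quotes and spaces survive in the URL.
    params = urllib.parse.urlencode({"query": query}, quote_via=urllib.parse.quote)
    url = f"{base_url}/api/atlas/v2/search/dsl?{params}"
    # HTTP basic authentication header built from the username and password.
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return urllib.request.Request(url, headers={"Authorization": f"Basic {token}"})


def run_dsl_query(query):
    """Send the query and return the parsed JSON body (needs a running server)."""
    with urllib.request.urlopen(build_dsl_request(query)) as resp:
        return json.load(resp)


# Example call, to be run only while the Atlas server is up:
# result = run_dsl_query('DB where name="Reporting"')
# for entity in result.get("entities", []):
#     print(entity["typeName"], entity["guid"])
```

Building the request separately from sending it makes it easy to print the URL and compare it with the encoded queries in the script output. Queries with a select clause may return their rows under a different key than "entities", so inspect the raw JSON first.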

  Now, log in to the Atlas UI at ‘http://localhost:21000/’ and confirm that the types were created successfully.
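
If you prefer to confirm the types from a script rather than the UI, here is a rough sketch. It assumes the v2 type headers endpoint (/api/atlas/v2/types/typedefs/headers) returns a JSON list of objects with a "name" field, and that admin/admin basic authentication is accepted. The names EXPECTED_TYPES, missing_types and fetch_typedef_headers are hypothetical helpers of mine, and the expected set is only a subset of the types printed by quick_start.py.

```python
import base64
import json
import urllib.request

# A few of the type names printed under "Creating sample types".
EXPECTED_TYPES = {"DB", "Table", "StorageDesc", "Column",
                  "LoadProcess", "LoadProcessExecution", "View"}


def missing_types(typedef_headers, expected=EXPECTED_TYPES):
    """Return the expected type names that are absent from the header list."""
    present = {header["name"] for header in typedef_headers}
    return sorted(set(expected) - present)


def fetch_typedef_headers(base_url="http://localhost:21000",
                          user="admin", password="admin"):
    """Fetch the type definition headers (needs a running server)."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    req = urllib.request.Request(
        f"{base_url}/api/atlas/v2/types/typedefs/headers",
        headers={"Authorization": f"Basic {token}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


# Example call, to be run only while the Atlas server is up:
# print(missing_types(fetch_typedef_headers()))
```

An empty list means every expected type is registered on the server.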

 


 

Select ‘Table’ as the option for ‘Search By Type’ and click the Search button to see all the onboarded tables.
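
The same search by type can be approximated over REST. This is a sketch under assumptions: it uses the v2 basic search endpoint (/api/atlas/v2/search/basic) with its typeName and limit parameters, admin/admin basic authentication, and a hypothetical helper name build_type_search_request. I have not checked that the UI issues exactly this request.

```python
import base64
import json
import urllib.parse
import urllib.request


def build_type_search_request(type_name, limit=25,
                              base_url="http://localhost:21000",
                              user="admin", password="admin"):
    """Build a GET request that searches entities by type name."""
    params = urllib.parse.urlencode({"typeName": type_name, "limit": limit})
    url = f"{base_url}/api/atlas/v2/search/basic?{params}"
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return urllib.request.Request(url, headers={"Authorization": f"Basic {token}"})


# Example call, to be run only while the Atlas server is up:
# with urllib.request.urlopen(build_type_search_request("Table")) as resp:
#     for entity in json.load(resp).get("entities", []):
#         print(entity["guid"], entity.get("displayText"))
```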



 

 
