Skip to Main Content
Build a Data Lake with Autonomous Data Warehouse

About This Workshop

Youtube Video

About This Workshop
Data is everywhere stored both locally and on the cloud in different format such as CSV, Parquet, JSON, ORC, Avro, and the new table format that will be available soon such as Iceberg and Delta Share. Although the metadata can stored in different places, Autonomous Data Warehouse (ADW) can work with it whether it is stored locally (using PL/SQL,API, and the UI) or in central places in data catalogs (such as OCI DCAT and AWS Glue). ADW provides high performance when querying external data at scale. Finally, ADW provides multi-language support to process Data Lakes such as SQL, Python, R, and REST.
The labs in this workshop walk you through all the steps that you need to build and access the Data Lake (Oracle Object Storage buckets) using Autonomous Database (ADB). First, you set-up the workshop environment and create the necessary resources such as a compartment, an Autonomous Database instance, and Oracle Object Storage resources. Next, evolve object storage files into meaningful business entities by defining the metadata for external data in ADB using several methods such as programmatically using PL/SQL scripts, using the Database Actions user interface in ADB, or by synchronizing with Data Catalogs (to derive the metadata) such as OCI Data Catalog and AWS Glue. Next, you can gain insights from the Data Lakes by querying data that lives in your Oracle Database and combine that with data that is stored in your Oracle Object Storage buckets using regular SQL select statements that includes joined tables. You also move data to Object Storage buckets by exporting your datasets. Finally, you query your data using the SQL Worksheet.

Workshop Info

2 hours, 30 minutes
  • Introduction
  • Set up the Workshop Environment
  • Load Local Data
  • Load and Analyze JSON DataLab 4: Link to Data in Public Object Storage Buckets
  • Link Data from Private Object Storage Buckets
  • Load and Analyze the News from REST Endpoints
  • Create External Tables Using the Delta Sharing Protocol
  • Query Data from Multi-Cloud Data Lakes
  • Use Oracle as a Data Provider
  • Integrate ADW with Centrally Managed Data Catalogs
  • Improve Query Performance
  • ADW Support for Multiple Languages 
  • Clean up Resources Used in this Workshop (Optional)

The user must have the necessary permissions to: 

  • Provision and use an ADB instance
  • Optionally, provision and use OCI Data Catalog instance
  • Manage Oracle Object Storage buckets
  • Familiarity with Oracle Database and SQL is desirable, but not required
  • Some understanding of cloud and database terms is helpful
  • Familiarity with Oracle Cloud Infrastructure (OCI) is helpful

Other Workshops you might like

Ask Oracle
Helping you on LiveLabs