Creating and managing tables

Data Management in Databricks

Smriti Mishra

Founder, NordData Insight

The data library

Image of a cartoon library building

Data Management in Databricks

Create and manage

  • Databricks databases group related tables for organization.
  • Creating a database sets up a scalable structure.
  • Organized data aids access and management.
  • Tables store structured data for queries.

Library bookshelves

Data Management in Databricks

Effective management

Discard old books in a bin

  • Effective management involves querying, updating, and deleting data.
  • Deletion should be done carefully to avoid data loss.
  • Safe deletion practices include data backup before removal.
Data Management in Databricks

Using the LOCATION keyword

  • Databricks automatically manages table storage locations.
  • Default setup simplifies data storage management.
  • Custom storage paths can be set using the LOCATION keyword.
  • Flexibility is important for compliance, cost, or performance needs.

Empty library shelves

Data Management in Databricks

Overriding the default storage

  • LOCATION overrides default storage.
  • Store data in external cloud locations:
    • examples: AWS S3, Azure Blob Storage
  • Useful when integrating with existing infrastructure.
  • Ensures compliance for sensitive data.

Cartoon image representing sensitive health records

Data Management in Databricks

Dynamic data management

  • LOCATION supports dynamic data management.
  • Move data easily if regulations or costs change.
  • Relocate data seamlessly without disruption.
  • Maintain structure and workflow during changes.

Librarian shelving books

Data Management in Databricks

Let's practice!

Data Management in Databricks

Preparing Video For Download...