1
Introduction to NoSQL databases
Free
Explore the basics of NoSQL databases, and how they're used in a modern data architecture. Learn to identify the pros and cons of using column-oriented, document, key-value, and graph databases to make data available to downstream processes and data consumers.
2
Column-oriented Databases
Dive into column-oriented databases with the help of Snowflake. Learn to populate Snowflake tables using a number of tools, and uncover the fundamentals of micro-partitioning and data clustering to optimize query performance with query pruning. Leverage common table expressions and both materialized and non-materialized views to streamline analytics workflows. Step into semi-structured data with Snowflake's VARIANT type.
3
Document Databases
Dive deep into the world of document data with Postgres JSON, and explore the intricacies of semi-structured data. Learn how to extract and transform top-level and nested document data using a number of built-in operators and functions.
4
Key-value and Graph Databases
Take your NoSQL skills to the next level by mastering the fundamentals of key-value databases. Explore common use-cases for key-value data, and learn how to use Redis and Python to read and write data. Wrap up with a basic overview of graph databases, and their place in the modern data stack.

Leveraging micro-partitions and data clustering

During a quick hallway chat, your Lead Data Engineer mentioned that Snowflake uses data clustering to sort data within micro-partitions by the year field in the olympic_medals table. You regularly run a few queries against this table, and you'd like to update them to take better advantage of Snowflake's micro-partitions and data clustering.

The create_engine function from the sqlalchemy module has been imported, and a connection object has been created and stored in the variable conn.

Update the Snowflake query to only return records for games that took place in 2000 or later.
Return the results of the Snowflake query as a pandas DataFrame, and print the result set.
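
One possible shape for the two steps above is sketched below. It is an illustration under stated assumptions rather than the exercise's official solution: the helper names build_query and fetch_medals are hypothetical, SELECT * stands in for the table's real column list (which is not given here), and conn is assumed to be the connection object described earlier. Filtering on year, the field the table is clustered by, is what lets Snowflake skip micro-partitions whose year range cannot match.

```python
def build_query(min_year: int = 2000) -> str:
    """Build a query that filters on the clustered `year` field.

    Only the table name (olympic_medals) and the `year` field come from
    the exercise text; SELECT * is a placeholder for the real columns.
    """
    # int() keeps this sketch from interpolating arbitrary text into SQL.
    return (
        "SELECT * "
        "FROM olympic_medals "
        f"WHERE year >= {int(min_year)}"
    )


def fetch_medals(conn, min_year: int = 2000):
    """Run the query on an existing connection and return a DataFrame."""
    # Imported lazily so build_query can be used without pandas installed.
    import pandas as pd

    return pd.read_sql(build_query(min_year), conn)


if __name__ == "__main__":
    # Printing the SQL needs no database; with a live connection you
    # would instead call: print(fetch_medals(conn))
    print(build_query())
```

With the exercise's conn in scope, print(fetch_medals(conn)) would cover both instructions: the WHERE clause restricts the result to 2000 or later, and pd.read_sql returns the rows as a pandas DataFrame.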