Load data from the Million Song Dataset into a final dimensional model in RedShift utilizing Apache Airflow.
-
Updated
Jun 2, 2020 - Python
Load data from the Million Song Dataset into a final dimensional model in RedShift utilizing Apache Airflow.
Cassandra ETL Pipeline
This is a project based on the Data Engineering Coding Challenge of Verve Company.
BigQuery data pipeline with dbt, Spark, Docker, Airflow, Terraform, GCP
Proof of concept to manage data warehouse data transformations
Repo for tracking content related to DBT cloud
Formelsammlung der Vorlesung von J.Ebneter im FS 2016
Portfolio Project 2: CoinMarketCap Data Pipeline
Project for IBM Data Science course on Visualization & Dashboards -- Analyzed historical sales data, performing EDA and setting up an interactive dashboard
ELT for New York City (NYC) Collision Dataset
Trata-se de um processo de ELT (Extração, Carga e Transformação) que integra um sistema legado com um banco de dados relacional (no exemplo, um MySQL) para um banco NoSQL (ElasticSearch) sem alterações significativas nos dados transferidos.
Project for IBM Data Engineering & Python course on Linux & Shell Scripts -- Wrote and executed bash scripts to manipulate folders and files to create a full directory backup with automation using crontab
Collection of data Extract, Transform, Load
Add a description, image, and links to the elt topic page so that developers can more easily learn about it.
To associate your repository with the elt topic, visit your repo's landing page and select "manage topics."