Follow

Follow

Tag

data-engineering

#data-engineering

Read more stories on Hashnode

Articles with this tag

Building Data Magic: Serverless Data Pipelines with AWS

Aug 20, 20232 min read8 views

Hey there, Today, we’re diving into the world of serverless data pipelines using AWS. We’ll use real code and a sample dataset to make this journey...

Building Data Magic: Serverless Data Pipelines with AWS

PySpark SQL: An Introduction to Structured Data Processing with Code Examples

Apr 2, 20233 min read73 views

Introduction Apache Spark is one of the most widely used distributed computing frameworks that allow for fast and efficient processing of large...

PySpark SQL: An Introduction to Structured Data Processing with Code Examples

Advanced PySpark SQL: Exploring Window Functions, UDFs, and Broadcast Join with Code Examples

Apr 2, 20234 min read80 views

PySpark SQL is a powerful module for processing structured data using SQL queries in Python programming language. In addition to the basic...

Advanced PySpark SQL: Exploring Window Functions, UDFs, and Broadcast Join with Code Examples

Best Practices for Data Migration from On-Premises to AWS Cloud with Disaster Recovery Mechanism

Apr 2, 20233 min read8 views

As businesses increasingly move towards cloud computing, data migration from on-premises infrastructure to the cloud has become a crucial aspect of...

Best Practices for Data Migration from On-Premises to AWS Cloud with Disaster Recovery Mechanism

Setting Up AWS DMS Service for Data Ingestion from On-Premises DB to AWS with CDC Approach

Apr 2, 20234 min read47 views

AWS Database Migration Service (DMS) is a fully managed service that makes it easy to migrate databases to AWS quickly, securely, and seamlessly. In...

Setting Up AWS DMS Service for Data Ingestion from On-Premises DB to AWS with CDC Approach

Building a Scalable Data Warehouse on AWS: A Comprehensive Guide

Apr 1, 20233 min read23 views

Data warehousing is the process of storing, organizing, and managing large volumes of structured and unstructured data in a centralized repository,...

Building a Scalable Data Warehouse on AWS: A Comprehensive Guide