AWS Solutions Architect Associate
Skills: AWS, Cloud, AWS Services, etc.
I am a Data Engineer and a certified AWS Solutions Architect
I like to help people get insights from their data and implement AWS services.
Colombian data analyst and future data engineer
I'm a Data Analyst and Report Developer with a robust experience with Python and relational databases
I've also worked with AWS and using IaC tools like CDK.
I love to learn. This web page is a proof of that. But here is a couple of things you might like to know to get me to know me better:
I'd like to share with you a couple of certifications that I've earned across my learning path
Skills: AWS, Cloud, AWS Services, etc.
Skills: AWS, Cloud, AWS Services, etc.
Skills: Data Management, Programming for Data Engineering, and Exploratory Analysis
Skills: SQL, PostgreSQL, Data Analysis, Data management, exploratory data analysis
Follow me on Medium
Overview
As a final project for a Data Engineer course, I have created an ELT pipeline moving data from a PostgreSQL database (running on Digital Ocean), implementing a Delta Lake with GCP services such as Cloud Storage, BigQuery, Dataproc and Data Studio. And to manage all the workflow and schedule the pipeline, we get Airflow running hosted on a Digital Ocean's Droplet (virtual machine) and with Docker.
Overview
I plan to teach you how to analyze a sample data set in this article. Still, first, we’ll need to set up our environment to work with PySpark.