Data Pipeline Automation by integrating Django Signals with Celery

Mannu Gupta (~theparadoxer02)


Description:

In this talk I will describe, how we can automate a sophisticated data to multiple pipeline, monitor every single stage and handle the different scenarios with each stages. The main components used for architecting the application are Django Signals and Celery.

Additionally I will also shed some light on the advantage and disadvantage of using Apache Airflow which might be a good alternative for the above solution.

Insights:

Airflow is developed in Python. Airflow is a historically important tool in the data engineering ecosystem. It introduced the ability to combine a strict Directed Acyclic Graph (DAG) model with Pythonic flexibility in a way that made it appropriate for a wide variety of use cases.

Prerequisites:

  • Basic understanding of Django Signals
  • Basic understanding of Celery

Content URLs:

still working on it.
Slides

Speaker Info:

He is currently employed as Software Engineer at Essentia SoftServ.
He has a keen interest in multiple tech domains, but Backend and DevOps interest him the most.

Speaker Links:

Github
Twitter

Section: Developer tools and automation
Type: Talks
Target Audience: Intermediate
Last Updated: