Distributed scheduling leveraging multiple nodes in the cluster

Avra Sengupta (~avra) | 27 Apr, 2015

4

Votes

Description:

Setting up a cron job in a machine, is perhaps the most easiest way of scheduling a particular task. But in a distributed system, spawning across several nodes, critical tasks can't just be scheduled on a single node. That would introduce a single point of failure (SPOF). We also can't schedule the same set of jobs in every node, as we don't want duplication of a task.

The solution to the above problem is a distributed cron scheduler spread across the cluster, working concurrently on different nodes to perform a task from a set of jobs in such a manner that we neither miss a job nor do we perform the same job twice.

Prerequisites:

A basic understanding of how a distributed system works, and how crond works.

Content URLs:

http://www.gluster.org/community/documentation/index.php/Features/Scheduling_of_Snapshot

Speaker Info:

I am a software engineer at Red Hat Inc., working on GlusterFS, a distributed file system. I have ~4 yrs of experience as a software developer in linux powered distributed systems.

Find out more about GlusterFS at http://www.gluster.org/ and if you want to get your hands dirty with the code, http://glusterhacker.blogspot.in/ should help you get started.

Speaker Links:

http://glusterhacker.blogspot.in/

Section:	Concurrency
Type:	Talks
Target Audience:	Beginner
Last Updated:	27 Apr, 2015

Comments