This document presents an efficient resource management mechanism with a fault-tolerant model for computational grids, focusing on job scheduling and resource management. It adopts a primary-backup approach to fault tolerance and highlights two algorithms for scheduling backups, one for independent tasks and one for dependent tasks, while considering the impact of communication protocols such as TCP and UDP. The proposed system integrates resources from geographically distributed environments and employs mechanisms for fault detection and task recovery.
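The primary-backup idea for independent tasks can be illustrated with a minimal Python sketch. It is not the document's algorithm: the greedy least-loaded placement rule and the names `Node`, `Placement`, `schedule`, and `recover` are all assumptions made for illustration. The only property it aims to show is that each task's backup sits on a different node from its primary, so a single node failure can be recovered by promoting the backup.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Node:
    """Hypothetical grid node; `load` is the total cost of its primary copies."""
    name: str
    load: float = 0.0


@dataclass
class Placement:
    """Where a task's primary and backup copies live (hypothetical structure)."""
    primary: str
    backup: Optional[str]


def schedule(tasks: Dict[str, float], nodes: List[Node]) -> Dict[str, Placement]:
    """Greedy placement (an assumed heuristic): primary on the least-loaded
    node, backup on the least-loaded *other* node."""
    if len(nodes) < 2:
        raise ValueError("primary-backup needs at least two nodes")
    plan: Dict[str, Placement] = {}
    # Place the most expensive tasks first so they get the emptiest nodes.
    for task, cost in sorted(tasks.items(), key=lambda kv: -kv[1]):
        ranked = sorted(nodes, key=lambda n: (n.load, n.name))
        primary, backup = ranked[0], ranked[1]
        primary.load += cost  # only the primary copy is counted as load
        plan[task] = Placement(primary.name, backup.name)
    return plan


def recover(plan: Dict[str, Placement], failed: str) -> List[str]:
    """Promote backups of tasks whose primary ran on the failed node.
    Returns the names of the promoted tasks."""
    promoted = []
    for task, placement in plan.items():
        if placement.primary == failed:
            if placement.backup is None:
                raise RuntimeError(f"task {task} has no backup left")
            placement.primary, placement.backup = placement.backup, None
            promoted.append(task)
        elif placement.backup == failed:
            placement.backup = None  # backup copy lost; task keeps running
    return promoted


if __name__ == "__main__":
    grid = [Node("A"), Node("B"), Node("C")]
    plan = schedule({"t1": 4.0, "t2": 3.0, "t3": 2.0}, grid)
    print(plan)
    print("promoted after A fails:", recover(plan, "A"))
```

A scheduler for dependent tasks would additionally have to respect precedence constraints between tasks, which this sketch deliberately leaves out.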