Fault-Tolerant Execution of Mobile Agents



In this paper, we address the problems that have to be solved in mobile agent systems and present a set of fault-tolerance techniques that can increase the robustness of agent-based applications without introducing a high performance overhead. The framework includes schemes for failure detection, checkpointing and restart, and software rejuvenation, as well as a resource-aware atomic migration protocol, a reconfigurable itinerary, a protocol that prevents agents from getting caught in node failures, and a simple scheme to deal with network partitions. Finally, we present performance results that show the effectiveness of these fault-tolerance techniques.


Mobile Agents


DSN-00 - International Conference on Dependable Systems and Networks, June 2000

Cited by

Year 2004 : 2 citations

 1. Taesoon Park, Ilsoo Byun and Heon Y. Yeom "Lazy Agent Replication and Asynchronous Consensus for the Fault-Tolerant Mobile Agent System" NETWORKING 2004, Third International IFIP-TC6 Networking Conference, Athens, Greece, May 9-14, 2004.

 2. Sung Jin Choi, Maeng Soon Baik, Hong Soo Kim, Jun Weon Yoon, Jin Gon Shon, Chong Sun Hwang, "Region-based stage construction protocol for fault tolerant execution of mobile agent", Proc. 18th International Conference on Advanced Information Networking and Applications, 2004. AINA 2004. March 2004

Year 2003 : 8 citations

 Marin, O.; Bertier, M.; Sens, P.; "DARX - a framework for the fault-tolerant support of agent software" Software Reliability Engineering, 2003. ISSRE 2003. 14th International Symposium on, 17-20 Nov. 2003, Pages: 406-416.

 Toan Phung; Loke, S.W.; Harland, J.; "Adding flexibility using structured goals: the case of itinerant mobile agents" Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on, 13-16 Oct. 2003, Pages: 562-565.

 Maeng Soon Baik, In Sung Kang, Yun Hee Kang and Chong Sun Kang, "Optimistic Fault-Tolerant Approach for Mobile Agent in Multi-Region Mobile Agent Computing Environment", In the International Conference on Parallel and Distributed Processing Techniques and Applications, PDPTA-03, 2003

 Pears, S., Jie Xu, Boldyreff, C., "A dynamic shadow approach for mobile agents to survive crash failures", Sixth IEEE International Symposium on Object-Oriented Real-Time Distributed Computing, May 2003.

 Zahia Guessoum, Jean-Pierre Briot, Olivier Marin, Athmane Hamel, Pierre Sens. "Dynamic and Adaptive Replication for Large-Scale Reliable Multi-agent Systems", in Software Engineering for Large-Scale Multi-Agent Systems: Research Issues and Practical Applications, Lecture Notes in Computer Science, Volume 2603 / 2003, August 2003, Springer-Verlag Heidelberg

 Overeinder, B., Brazier, F., Marin, O., "Fault tolerance in scalable agent support systems: integrating DARX in the AgentScape framework", 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings. CCGrid 2003, May 2003.

 Pears, S., Jie Xu, Boldyreff, C., "Mobile agent fault tolerance for information retrieval applications: an exception handling approach", The Sixth International Symposium on Autonomous Decentralized Systems, 2003. ISADS April 2003.

 Pleisch, S., Schiper, A., "Fault-tolerant mobile agent execution", IEEE Transactions on Computers, Feb. 2003, pages 209-222, Volume: 52, Issue: 2

Year 2002 : 5 citations

 1. Hyunjoo Kim, Heon Y. Yeom, Taesoon Park, "The Cost of Checkpointing, Logging and Recovery for the Mobile Agent Systems", Pacific Rim International Symposium on Dependable Computing (PRDC02), Dec. 2002, Tokyo, Japan

 2. Taesoon Park, Ilsoo Byun, Hyunjoo Kim, Heon Y. Yeom, "The Performance of Checkpointing and Replication Schemes for Fault Tolerant Mobile Agent Systems", Proc. 21st IEEE Symposium on Reliable Distributed Systems (SRDS'02), October 2002, Osaka University, Suita, Japan

 3. P. Dasgupta, "Fault tolerance in MAgNET: A Mobile Agent E-commerce System," Proceedings of the Sixth International Conference on Internet Computing, Las Vegas, NV, June 2002, pp. 733-739

 4. Jian Chen "A Hierarchical Fault-Tolerance Framework for Mobile Intelligent Agent Systems" PhD Thesis, The University of British Columbia, Canada, April 2002.

 5. Stefan Pleisch "Fault-Tolerant and Transactional Mobile Agent Execution" PhD Thesis, École Polytechnique Fédérale de Lausanne, Thèse nr. 2654 (2002).

Year 2001 : 6 citations

 1. Stefan Pleisch, André Schiper "FATOMAS - A Fault-Tolerant Mobile Agent System Based on the Agent-Dependent Approach" Proceedings of the 2001 International Conference on Dependable Systems and Networks, 1-4 July 2001, Göteborg, Sweden, IEEE Computer Society, ISBN 0-7695-1101-5, pp. 215-224.

 2. Erica Y. Tang, Jie Xu, Keith Bennett "The SeCode Approach: Towards Fault-Tolerant and Secure Execution of Mobile Code" Supplement of the 2001 International Conference on Dependable Systems and Networks, 1-4 July 2001, Göteborg, Sweden, IEEE Computer Society, pp B-74,B-75.

 3. Prithviraj(Raj) Dasgupta "Mobile Software Agent Enabled E-commerce: System Design and Profit Maximizing Algorithms" PhD Thesis, University of California, Santa Barbara, June 2001.

 4. S. Mishra, "Agent Fault Tolerance Using Group Communication". Proceedings of the 2001 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA 2001), Las Vegas, NV (June 2001).

 5. Daria Chacón, Benjamin Bell, and John McCormick "Using Autonomous Agents to Improve Efficiency and Robustness in Slow, Unreliable, Networks," AAAI 2001 Spring Symposium on Robust Autonomy, March 2001, Stanford University.

 6. Stefan Pleisch, Andre Schiper "Approaches to Fault-Tolerant Mobile Agent Execution" Research Report, IBM Research, RZ 3333 (# 93379) 03/19/01