Beyond Nagios - Design of a Cloud Monitoring System
نویسنده
چکیده
The paper describes a monitoring system specially designed for cloud infrastructures. We identify the features that are relevant for the task: scalability, that allows utilization in systems of thousands of nodes, flexibility, to be customized for a large number of applications, openness, to allow the coexistence of user and administration monitoring. We take as a starting point the Nagios monitoring system, that has been successfully used for Grid monitoring and is still used for clouds. We analyze its shortcomings when applied to cloud monitoring, and propose a new monitoring system, that we call Rocmon, that sums up Nagios experience with a cloud perspective. Like Nagios, Rocmon is plugin-oriented to be flexible. To be fully inter-operable and long-living, it uses standard tools: the OGF OCCI for the configuration interface, the REST paradigm to take advantage of Web tools, and HTML5 WebSockets for data transfers. The design is checked with an open source Ruby implementation featuring the most relevant aspects.
منابع مشابه
OCReM: OpenStack-based cloud datacentre resource monitoring and management scheme
Managing virtualised computing, network and storage resources at large-scale in both public and private cloud datacentres is a challenging task. As an open source cloud operating system, OpenStack needs to be enhanced for managing cloud datacentre resources. In order to improve OpenStack functions to support cloud datacentre resource management, we present OCReM: OpenStack-based cloud datacentr...
متن کاملMAVIS: Managing Datacenters using Smartphones
Distributed monitoring plays a crucial role in managing the activities of cloud-based datacenters. System administrators have long relied on monitoring systems such as Nagios and Ganglia to obtain status alerts on their desktop-class machines. However, the popularity of mobile devices is pushing the community to develop datacenter monitoring solutions for smartphone-class devices. Here we lay o...
متن کاملResource Monitoring Algorithms Evaluation For Cloud Environment
Cloud computing is a type of distributed computing allowing to share many resources such as CPU, memory, storage ...etc. The status of these resources changes from time to time due to the dynamic adaptive ability of the cloud computing characteristics. Hence, the powerful and scalable monitoring algorithm is needed to monitor the status of these resources throughout the time. There are many mod...
متن کاملAgent-based monitoring system for cloud/Grid computing
Goal of the project is to create a robust, fault tolerant, agent-based monitoring system, which is capable of working in cloud/Grid type environment with zero configuration. Created solution should be helpful for system administrators (of a clouds / Grids). To provide additional information about the situation a rule-based inferencing system is going to be included in it. It can be used to prov...
متن کاملAnalysis of the modernization prospects of the WLCG monitoring framework's messaging subsystem
framework’s messaging subsystem V. Airiian# Dubna State University, Dubna, Russia Laboratory of Information Technologies, JINR ABSTRACT The purpose of the project is an analysis of the modernization prospects of the WLCG monitoring framework’s messaging subsystem based on Nagios monitoring software and Apache ActiveMQ technologies. The modernization process demands thorough examination of the e...
متن کامل