+

CN104079434A - Device and method for managing physical devices in cloud computing system - Google Patents

Device and method for managing physical devices in cloud computing system Download PDF

Info

Publication number
CN104079434A
CN104079434A CN201410321156.XA CN201410321156A CN104079434A CN 104079434 A CN104079434 A CN 104079434A CN 201410321156 A CN201410321156 A CN 201410321156A CN 104079434 A CN104079434 A CN 104079434A
Authority
CN
China
Prior art keywords
physical machine
physical
alarm
information
client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410321156.XA
Other languages
Chinese (zh)
Inventor
陈杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yonyou Software Co Ltd
Original Assignee
Yonyou Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yonyou Software Co Ltd filed Critical Yonyou Software Co Ltd
Priority to CN201410321156.XA priority Critical patent/CN104079434A/en
Publication of CN104079434A publication Critical patent/CN104079434A/en
Pending legal-status Critical Current

Links

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention provides a device for managing physical devices in a cloud computing system. The device comprises a physical machine automatic remote deployment unit, a physical machine life cycle management unit and a physical machine monitoring and warning unit, wherein the physical machine automatic remote deployment unit is used for conducting automatic batch deployment on virtualization operation systems of physical bare machines through a PXE and quickly completing readiness or expansion of a virtualization resource pool; the physical machine life cycle management unit is used for managing the life cycle of each physical machine and operating a remote switch of each physical machine; the physical machine monitoring and warning unit is used for monitoring resources where the physical machines are located in real time to obtain monitoring information and conducting warning management on the resources where the physical machines are located. The invention further provides a method for managing the physical devices in the cloud computing system. Through the technical scheme, management of the physical devices of the multiple physical machines can be completed on the basis of existing virtualization resource management, and the universal and unified management thought that the multiple physical machines participate in management on the physical devices of mass physical machines is established.

Description

The device and method of physical equipment management in cloud computing system
Technical field
The present invention relates to field of computer technology, particularly, relate to the method for physical equipment management in the device of the management of physical equipment in a kind of cloud computing system and a kind of cloud computing system.
 
Background technology
Cloud computing is more and more paid close attention to by more people, and in enterprise, the value of cloud computing to be embodied in: pay as required, as required expansion, rapid deployment, respond fast, cut down expenses, reduce risk.The physical equipment quantity of managing under cloud computing environment is also more and more huger, and the cost of maintenance is also to increase along with the development of cloud environment.
In cloud computing system, stress the management to virtual resources, we can know the state of current virtual machine, can carry out the switching on and shutting down operation of virtual machine etc.Cannot well management and monitoring physical machine in traditional cloud computing system, we cannot know the state of physical machine equipment, we cannot carry out to physical machine the operation of remote on-off.
When having after large batch of physical machine purchases, how to allow operating system automatic deployment in batches, complete fast the ready or dilatation in virtual resources pond? how, when alarm appears in physical equipment, notify timely keeper to safeguard?
How problem above-mentioned ubiquity in cloud computing system, solve the above-mentioned problem, thereby effectively reduce O&M cost, and this is current problem to be solved.
Therefore, need physical equipment administrative skill in a kind of new cloud computing system, can on the basis of existing virtual resources management, complete the physical equipment management of a plurality of physical machine, set up general, unified management thinking towards physical machine physical equipment management in enormous quantities that a plurality of physical machine participate in.
 
Summary of the invention
The present invention is just based on the problems referred to above, physical equipment administrative skill in a kind of new cloud computing system has been proposed, can be on the basis of existing virtual resources management, complete the physical equipment management of a plurality of physical machine, set up general, unified management thinking towards physical machine physical equipment management in enormous quantities that a plurality of physical machine participate in.
In view of this, the device that the present invention proposes physical equipment management in a kind of cloud computing system, comprising: physical machine Long-range Automatic Deployment unit, for using PXE, the virtualizing operating systems of automatic deployment physics bare machine, completes the ready or dilatation in virtual resources pond fast in batches; Physical machine life cycle management unit, for managing the life cycle of every physical machine, and the remote switch machine operation to physical machine; Physical machine monitoring and Alarm Unit, for physical machine place resource is monitored in real time, obtain monitor message; And physical machine place resource is carried out to alarm management.In this technical scheme, in cloud computing system, physical machine equipment is managed and safeguarded and monitoring, can realize management, maintenance and monitoring to physical machine, reduce the cost of O&M.
In technique scheme, preferably, described physical machine Long-range Automatic Deployment unit, specifically comprises: client computer starts module, for client computer, starts, and the program in network interface card PXE ROM is transferred internal memory and carries out; Client is obtained IP address module, finds after Dynamic Host Configuration Protocol server acquisition request IP address for client at network; Dynamic Host Configuration Protocol server provides corresponding IP address and network parameter for client; TFTP file delivery module, relates to tftp server for this client transmission boot for Dynamic Host Configuration Protocol server; Client is carried out after receiving boot, and boot request TFTP transmits the configuration file of boot; Boot is read the configuration file that TFTP transmits, and according to this configuration file content and client's situation, client-requested TFTP transmits kernel image file and root file system file; Kernel starts and file configuration module, for starting kernel; Kernel, according to the configuration file of bootstrap, by Network Capture operating system Auto-mounting script, and obtains the required installation file of system by network service, according to the configuration of Auto-mounting script, installs.In this technical scheme, use PXE, by network startup, allow the client on network to download startup file from remote start server, the operating system of physics bare machine is carried out to batch automatic deployment, complete fast the ready or dilatation in virtual resources pond, can greatly reduce manual complexity and the repeated work of disposing.
In technique scheme, preferably, described physical machine life cycle management unit, specifically comprises: physical machine life cycle logging modle, for the facility information that records every physical machine to database; Physical machine remote operation module, realizes the remote switch machine operation to physical machine for the mode by IPMI.In this technical scheme, by the management to physical machine life cycle, can so that the service condition of every physical machine made rational planning for; By using IPMI, can guarantee remote-operated reliability.
In technique scheme, preferably, the monitoring of described physical machine and Alarm Unit, specifically comprise: server system real-time monitoring module, for by the monitor message of IPMI and OS aspect Real-time Obtaining server system; Server system alarm module, contact method, query warning list of thing while occurring for the alarm regulation of configures physical machine, configuration alarm, the alarm threshold of physical machine different monitoring amount is set, when monitored amount surmounts the anomalous events such as thresholding, alarm by various ways is recorded into daily record by alarm event simultaneously.In this technical scheme, by real-time monitoring and alarm in time, can be to a large amount of distributing server centralized management.
In technique scheme, preferably, the facility information of every physical machine of described physical machine life cycle logging modle record, comprises that hardware buys physical location, device numbering and the network address of information, added time, the time of scrapping, physical machine; Described hardware purchase information comprises device name, numbering and model; And/or described physical machine remote operation module realizes the remote switch machine operation to physical machine by the mode of IPMI, comprise main frame power on electricity operation and reboot operation under operation, main frame; And/or described server system real-time monitoring module, by the monitor message of the server system of IPMI and OS aspect Real-time Obtaining, comprises the static information of monitored server system and the multidate information of monitored server system; The static information of described monitored server system comprises the information of CPU, internal memory, hard disk, CD-ROM drive, network interface card, video card, operating system, RAID card, PCI additional card; The multidate information of described monitored server system comprises temperature, voltage, rotation speed of the fan information and the system resource information of mainboard, CPU, SCSI module, fan board; Described system resource information comprises cpu busy percentage, memory usage, hard disk I/O flowing of access; And/or described server system alarm module carries out the various ways of alarm, comprise message box, mail, alerting tone and note.
According to a further aspect of the invention, the method of physical equipment management in a kind of cloud computing system has also been proposed, comprise: step 202: use PXE, the virtualizing operating systems of automatic deployment physics bare machine, completes the ready or dilatation in virtual resources pond fast in batches; Step 204: manage the life cycle of every physical machine, and the remote switch machine operation to physical machine; Step 206: physical machine place resource is monitored in real time, obtained monitor message; And physical machine place resource is carried out to alarm management.In this technical scheme, in cloud computing system, physical machine equipment is managed and safeguarded and monitoring, can realize management, maintenance and monitoring to physical machine, reduce the cost of O&M.
In technique scheme, preferably, described step 202, specifically comprises: step 302: client computer starts, and the program in network interface card PXE ROM is transferred internal memory and carries out; Step 304: client finds after Dynamic Host Configuration Protocol server at network, acquisition request IP address; Dynamic Host Configuration Protocol server provides corresponding IP address and network parameter for client; Step 306:DHCP server contact is this client transmission boot to tftp server; Client is carried out after receiving boot, and boot request TFTP transmits the configuration file of boot; Boot is read the configuration file that TFTP transmits, and according to this configuration file content and client's situation, client-requested TFTP transmits kernel image file and root file system file; Step 308: start kernel; Kernel, according to the configuration file of bootstrap, by Network Capture operating system Auto-mounting script, and obtains the required installation file of system by network service, according to the configuration of Auto-mounting script, installs.In this technical scheme, use PXE, by network startup, allow the client on network to download startup file from remote start server, the operating system of physics bare machine is carried out to batch automatic deployment, complete fast the ready or dilatation in virtual resources pond, can greatly reduce manual complexity and the repeated work of disposing.
In technique scheme, preferably, described step 204, specifically comprises: step 402: record the facility information of every physical machine in database; Step 404: the mode by IPMI realizes the remote switch machine operation to physical machine.In this technical scheme, by the management to physical machine life cycle, can so that the service condition of every physical machine made rational planning for; By using IPMI, can guarantee remote-operated reliability.
In technique scheme, preferably, described step 206, specifically comprises: step 502: by the monitor message of IPMI and OS aspect Real-time Obtaining server system; Step 504: contact method, query warning list of thing when the alarm regulation of configures physical machine, configuration alarm occur, the alarm threshold of physical machine different monitoring amount is set, when monitored amount surmounts the anomalous events such as thresholding, alarm by various ways is recorded into daily record by alarm event simultaneously.In this technical scheme, by real-time monitoring and alarm in time, can be to a large amount of distributing server centralized management.
In technique scheme, preferably, the facility information of every physical machine of described step 402 record, comprises that hardware buys physical location, device numbering and the network address of information, added time, the time of scrapping, physical machine; Described hardware purchase information comprises device name, numbering and model; And/or described step 404 realizes the remote switch machine operation to physical machine by the mode of IPMI, comprise main frame power on electricity operation and reboot operation under operation, main frame; And/or described step 502, by the monitor message of the server system of IPMI and OS aspect Real-time Obtaining, comprises the static information of monitored server system and the multidate information of monitored server system; The static information of described monitored server system comprises the information of CPU, internal memory, hard disk, CD-ROM drive, network interface card, video card, operating system, RAID card, PCI additional card; The multidate information of described monitored server system comprises temperature, voltage, rotation speed of the fan information and the system resource information of mainboard, CPU, SCSI module, fan board; Described system resource information comprises cpu busy percentage, memory usage, hard disk I/O flowing of access; And/or described step 504 is carried out the various ways of alarm, comprise message box, mail, alerting tone and note.
By above technical scheme, can on the basis of existing virtual resources management, complete the physical equipment management of a plurality of physical machine, set up general, unified management thinking towards physical machine physical equipment management in enormous quantities that a plurality of physical machine participate in.
 
Accompanying drawing explanation
Fig. 1 shows the block diagram of the device of physical equipment management in cloud computing system according to an embodiment of the invention;
Fig. 2 shows the flow chart of the method for physical equipment management in cloud computing system according to an embodiment of the invention;
Fig. 3 shows the flow chart of physical machine Long-range Automatic Deployment according to an embodiment of the invention;
Fig. 4 shows the flow chart of physical machine life cycle management according to an embodiment of the invention;
Fig. 5 shows the flow chart of physical machine monitoring according to an embodiment of the invention and alarm;
Fig. 6 shows the illustraton of model of the device of physical equipment management in cloud computing system according to an embodiment of the invention;
Fig. 7 shows the operation principle schematic diagram of IPMI according to an embodiment of the invention.
 
Embodiment
In order more clearly to understand above-mentioned purpose of the present invention, feature and advantage, below in conjunction with the drawings and specific embodiments, the present invention is further described in detail.It should be noted that, in the situation that not conflicting, the application's embodiment and the feature in embodiment can combine mutually.
A lot of details have been set forth in the following description so that fully understand the present invention; but; the present invention can also adopt other to be different from other modes described here and implement, and therefore, protection scope of the present invention is not subject to the restriction of following public specific embodiment.
Fig. 1 shows the block diagram of the device of physical equipment management in cloud computing system according to an embodiment of the invention.
As shown in Figure 1, the device 100 that in cloud computing system, physical equipment is managed according to an embodiment of the invention, comprise: physical machine Long-range Automatic Deployment unit 102, be used for using PXE, the virtualizing operating systems of automatic deployment physics bare machine, completes the ready or dilatation in virtual resources pond fast in batches; Physical machine life cycle management unit 104, for managing the life cycle of every physical machine, and the remote switch machine operation to physical machine; Physical machine monitoring and Alarm Unit 106, for physical machine place resource is monitored in real time, obtain monitor message; And physical machine place resource is carried out to alarm management.In this technical scheme, in cloud computing system, physical machine equipment is managed and safeguarded and monitoring, can realize management, maintenance and monitoring to physical machine, reduce the cost of O&M.
In technique scheme, preferably, physical machine Long-range Automatic Deployment unit 102, specifically comprises: client computer starts module 1022, for client computer, starts, and the program in network interface card PXE ROM is transferred internal memory and carries out; Client is obtained IP address module 1024, finds after Dynamic Host Configuration Protocol server acquisition request IP address for client at network; Dynamic Host Configuration Protocol server provides corresponding IP address and network parameter for client; TFTP file delivery module 1026, relates to tftp server for this client transmission boot for Dynamic Host Configuration Protocol server; Client is carried out after receiving boot, and boot request TFTP transmits the configuration file of boot; Boot is read the configuration file that TFTP transmits, and according to this configuration file content and client's situation, client-requested TFTP transmits kernel image file and root file system file; Kernel starts and file configuration module 1028, for starting kernel; Kernel, according to the configuration file of bootstrap, by Network Capture operating system Auto-mounting script, and obtains the required installation file of system by network service, according to the configuration of Auto-mounting script, installs.In this technical scheme, use PXE, by network startup, allow the client on network to download startup file from remote start server, the operating system of physics bare machine is carried out to batch automatic deployment, complete fast the ready or dilatation in virtual resources pond, can greatly reduce manual complexity and the repeated work of disposing.
In technique scheme, preferably, physical machine life cycle management unit 104, specifically comprises: physical machine life cycle logging modle 1042, for the facility information that records every physical machine to database; Physical machine remote operation module 1044, realizes the remote switch machine operation to physical machine for the mode by IPMI.In this technical scheme, by the management to physical machine life cycle, can so that the service condition of every physical machine made rational planning for; By using IPMI, can guarantee remote-operated reliability.
In technique scheme, preferably, physical machine monitoring and Alarm Unit 106, specifically comprise: server system real-time monitoring module 1062, for by the monitor message of IPMI and OS aspect Real-time Obtaining server system; Server system alarm module 1064, contact method, query warning list of thing while occurring for the alarm regulation of configures physical machine, configuration alarm, the alarm threshold of physical machine different monitoring amount is set, when monitored amount surmounts the anomalous events such as thresholding, alarm by various ways is recorded into daily record by alarm event simultaneously.In this technical scheme, by real-time monitoring and alarm in time, can be to a large amount of distributing server centralized management.
In technique scheme, preferably, the facility information of every physical machine of physical machine life cycle logging modle 1042 record, comprises that hardware buys physical location, device numbering and the network address of information, added time, the time of scrapping, physical machine; Hardware purchase information comprises device name, numbering and model; And/or physical machine remote operation module 1044 realizes the remote switch machine operation to physical machine by the mode of IPMI, comprise main frame power on electricity operation and reboot operation under operation, main frame; And/or server system real-time monitoring module 1062, by the monitor message of the server system of IPMI and OS aspect Real-time Obtaining, comprises the static information of monitored server system and the multidate information of monitored server system; The static information of monitored server system comprises the information of CPU, internal memory, hard disk, CD-ROM drive, network interface card, video card, operating system, RAID card, PCI additional card; The multidate information of monitored server system comprises temperature, voltage, rotation speed of the fan information and the system resource information of mainboard, CPU, SCSI module, fan board; System resource information comprises cpu busy percentage, memory usage, hard disk I/O flowing of access; And/or the various ways that server system alarm module 1064 carries out alarm, comprises message box, mail, alerting tone and note.
Fig. 2 shows the flow chart of the method for physical equipment management in cloud computing system according to an embodiment of the invention.
As shown in Figure 2, the method for physical equipment management in cloud computing system, comprising: step 202: use PXE, the virtualizing operating systems of automatic deployment physics bare machine, completes the ready or dilatation in virtual resources pond fast in batches according to an embodiment of the invention; Step 204: manage the life cycle of every physical machine, and the remote switch machine operation to physical machine; Step 206: physical machine place resource is monitored in real time, obtained monitor message; And physical machine place resource is carried out to alarm management.In this technical scheme, in cloud computing system, physical machine equipment is managed and safeguarded and monitoring, can realize management, maintenance and monitoring to physical machine, reduce the cost of O&M.
In technique scheme, preferably, as shown in Figure 3, step 202, specifically comprises: step 302: client computer starts, and the program in network interface card PXE ROM is transferred internal memory and carries out; Step 304: client finds after Dynamic Host Configuration Protocol server at network, acquisition request IP address; Dynamic Host Configuration Protocol server provides corresponding IP address and network parameter for client; Step 306:DHCP server contact is this client transmission boot to tftp server; Client is carried out after receiving boot, and boot request TFTP transmits the configuration file of boot; Boot is read the configuration file that TFTP transmits, and according to this configuration file content and client's situation, client-requested TFTP transmits kernel image file and root file system file; Step 308: start kernel; Kernel, according to the configuration file of bootstrap, by Network Capture operating system Auto-mounting script, and obtains the required installation file of system by network service, according to the configuration of Auto-mounting script, installs.In this technical scheme, use PXE, by network startup, allow the client on network to download startup file from remote start server, the operating system of physics bare machine is carried out to batch automatic deployment, complete fast the ready or dilatation in virtual resources pond, can greatly reduce manual complexity and the repeated work of disposing.
In technique scheme, preferably, as shown in Figure 4, step 204, specifically comprises: step 402: record the facility information of every physical machine in database; Step 404: the mode by IPMI realizes the remote switch machine operation to physical machine.In this technical scheme, by the management to physical machine life cycle, can so that the service condition of every physical machine made rational planning for; By using IPMI, can guarantee remote-operated reliability.
In technique scheme, preferably, as shown in Figure 5, step 206, specifically comprises: step 502: by the monitor message of IPMI and OS aspect Real-time Obtaining server system; Step 504: contact method, query warning list of thing when the alarm regulation of configures physical machine, configuration alarm occur, the alarm threshold of physical machine different monitoring amount is set, when monitored amount surmounts the anomalous events such as thresholding, alarm by various ways is recorded into daily record by alarm event simultaneously.In this technical scheme, by real-time monitoring and alarm in time, can be to a large amount of distributing server centralized management.
In technique scheme, preferably, the facility information of every physical machine of step 402 record, comprises that hardware buys physical location, device numbering and the network address of information, added time, the time of scrapping, physical machine; Hardware purchase information comprises device name, numbering and model; And/or step 404 realizes the remote switch machine operation to physical machine by the mode of IPMI, comprise main frame power on electricity operation and reboot operation under operation, main frame; And/or step 502, by the monitor message of the server system of IPMI and OS aspect Real-time Obtaining, comprises the static information of monitored server system and the multidate information of monitored server system; The static information of monitored server system comprises the information of CPU, internal memory, hard disk, CD-ROM drive, network interface card, video card, operating system, RAID card, PCI additional card; The multidate information of monitored server system comprises temperature, voltage, rotation speed of the fan information and the system resource information of mainboard, CPU, SCSI module, fan board; System resource information comprises cpu busy percentage, memory usage, hard disk I/O flowing of access; And/or step 504 is carried out the various ways of alarm, comprise message box, mail, alerting tone and note.
Technical scheme of the present invention, thereby the solution problems of the prior art of take reduce O&M cost effectively as foothold, increase, to the management maintenance of physical machine equipment and monitoring, reduces the O&M cost of cloud computing system, is applicable to the requirement to physical equipment management in each cloud computing system.
Thereby in order to solve problems of the prior art, effectively reduce O&M cost, technical scheme of the present invention manages and safeguards and monitoring physical machine equipment in cloud computing system.Technical scheme of the present invention, mainly solves cloud computing system weak to physical equipment management function, realizes the management of physical machine, maintenance and monitoring, reduces the cost of O&M.
For example, technical scheme of the present invention, the model of the technical solution of the present invention showing referring to Fig. 6, mainly can comprise following module:
(1) physical machine Long-range Automatic Deployment unit, the operating system that this physical machine Long-range Automatic Deployment unit carries out physics bare machine is automatic deployment in batches, completes fast the ready or dilatation in virtual resources pond, has greatly reduced complexity and the repeated work of manual deployment.
(2) physical machine life cycle management unit, this physical machine life cycle management Single Component Management various kinds of equipment relevant information, as device name, numbering, model, residing concrete physical location information etc., and can complete and handle the long-range upper and lower electricity of main frame and the operation such as restart.
(3) physical machine monitoring and Alarm Unit, this physical machine monitoring realizes physical machine resource is monitored with Alarm Unit, and monitor message comprises by IPMI and the monitor message obtained by OS aspect.Alarm management, contact method, query warning list of thing when the alarm regulation of configurable physical machine, configuration alarm occur.
And for example, the specific implementation of technical solution of the present invention is as follows:
(1) physical machine Long-range Automatic Deployment unit, we will use PXE, by network startup, allow the client on network to download startup file from remote start server.So just provide network manager's management for the startup file of client and the ability of operating system.PXE has extensive application in operating system automatic deployment and non-disk workstation environment.
PXE automatic deployment os starting process is as follows:
A) client computer starts, and because BIOS is provided with network interface card, starts, so the program in network interface card PXE ROM is transferred internal memory, carries out.
B) client is found Dynamic Host Configuration Protocol server in network, then asks an IP address;
C) Dynamic Host Configuration Protocol server provides IP address and other network parameters for client.
D) Dynamic Host Configuration Protocol server is related to a tftp server boot of client transmission for this reason.
E) client is carried out after receiving boot, and boot can ask TFTP to transmit the configuration file of boot; After receiving, read configuration file, according to this configuration file content and client's situation, client-requested TFTP transmits kernel image file and root file system file.
F) start kernel.
G) kernel, according to the configuration file of bootstrap, by Network Capture operating system Auto-mounting script, and obtains the required installation file of system by network service, according to the configuration of Auto-mounting script, installs.
This physical machine Long-range Automatic Deployment unit can carry out the batch automatic deployment of virtualization system, completes fast the ready or dilatation in virtual resources pond, has greatly reduced complexity and the repeated work of manual deployment.
(2) physical machine life cycle management unit, we will record the life cycle of each physical machine, all information will be recorded in database, comprise the relevant informations such as physical location, device numbering and the network address that hardware is bought information, added time, the time of scrapping, physical machine.
We also realize the mode by IPMI the remote switch machine operation to physical machine.IPMI (IPMI) is a kind of hardware management interface specification of open standard, has defined the ad hoc approach that embedded management subsystem communicates.Even if the running of server itself is undesired, or due to any former thereby service cannot be provided, IPMI still can normal operation.IPMI fundamental diagram is as Fig. 7.
(3) physical machine is monitored and Alarm Unit, traditional system monitoring management method is generally that system manager regularly arrives that machine room is maked an inspection tour or adopt monitoring class software supervision, and said method exists poor in timeliness, server to delay after machine cannot trace reason, the more shortcoming of occupying system resources; IPMI can realize the real-time monitoring to server system, monitoring server static system information (information such as CPU, internal memory, hard disk, CD-ROM drive, network interface card, video card, operating system, RAID card, PCI additional card) and multidate information (the system resource information such as the temperature of the equipment such as mainboard, CPU, SCSI module, fan board, voltage, rotation speed of the fan information and cpu busy percentage, memory usage, hard disk I/O flowing of access).
In physical machine monitoring and Alarm Unit, we can arrange the alarm threshold of different monitoring amount, when above-mentioned monitored amount surmounts the anomalous events such as thresholding, can, by various ways (message box, mail, alerting tone, note) alarm, alarm event be recorded into daily record simultaneously.Environmental applications advantage to a large amount of distributing server centralized management is particularly evident.
Technical scheme of the present invention, in cloud computing system, introduce physical machine management, realized the operating system batch automatic deployment of physics bare machine, complete fast the ready or dilatation in virtual resources pond, to reduce manual complexity and the repeated work of disposing of keeper, and the life cycle of whole physical machine is managed, improved the efficiency of resource management, and physical host has been realized to real-time monitoring, reduced the cost of O&M.
Compare traditional cloud computing system (due to the effective management, the monitoring that lack physical host, the O&M cost causing is high), empirical tests, introduces after technical scheme of the present invention, can effectively reduce O&M cost.
More than be described with reference to the accompanying drawings technical scheme of the present invention, considered and in correlation technique, there is no easy, the unified solution for physical machine management in enormous quantities.In existing cloud computing system, physical equipment is managed physical equipment management process in the cloud computing system that cannot complete physical machine participation in enormous quantities.Therefore, the present invention proposes the method for physical equipment management in the device of the management of physical equipment in a kind of cloud computing system and a kind of cloud computing system, can be on the basis of existing virtual resources management, complete the physical equipment management of a plurality of physical machine, set up general, unified management thinking towards physical machine physical equipment management in enormous quantities that a plurality of physical machine participate in.
The foregoing is only the preferred embodiments of the present invention, be not limited to the present invention, for a person skilled in the art, the present invention can have various modifications and variations.Within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.

Claims (10)

1. a device for physical equipment management in cloud computing system, is characterized in that, comprising:
Physical machine Long-range Automatic Deployment unit, for using PXE, the virtualizing operating systems of automatic deployment physics bare machine, completes the ready or dilatation in virtual resources pond fast in batches;
Physical machine life cycle management unit, for managing the life cycle of every physical machine, and the remote switch machine operation to physical machine;
Physical machine monitoring and Alarm Unit, for physical machine place resource is monitored in real time, obtain monitor message; And physical machine place resource is carried out to alarm management.
2. the device of physical equipment management in cloud computing system according to claim 1, is characterized in that, described physical machine Long-range Automatic Deployment unit, specifically comprises:
Client computer starts module, for client computer, starts, and the program in network interface card PXE ROM is transferred internal memory and carries out;
Client is obtained IP address module, finds after Dynamic Host Configuration Protocol server acquisition request IP address for client at network; Dynamic Host Configuration Protocol server provides corresponding IP address and network parameter for client;
TFTP file delivery module, relates to tftp server for this client transmission boot for Dynamic Host Configuration Protocol server; Client is carried out after receiving boot, and boot request TFTP transmits the configuration file of boot; Boot is read the configuration file that TFTP transmits, and according to this configuration file content and client's situation, client-requested TFTP transmits kernel image file and root file system file;
Kernel starts and file configuration module, for starting kernel; Kernel, according to the configuration file of bootstrap, by Network Capture operating system Auto-mounting script, and obtains the required installation file of system by network service, according to the configuration of Auto-mounting script, installs.
3. the device of physical equipment management in cloud computing system according to claim 1 and 2, is characterized in that, described physical machine life cycle management unit, specifically comprises:
Physical machine life cycle logging modle, for the facility information that records every physical machine to database;
Physical machine remote operation module, realizes the remote switch machine operation to physical machine for the mode by IPMI.
4. the device of physical equipment management in cloud computing system according to claim 3, is characterized in that, described physical machine monitoring and Alarm Unit, specifically comprise:
Server system real-time monitoring module, for by the monitor message of IPMI and OS aspect Real-time Obtaining server system;
Server system alarm module, contact method, query warning list of thing while occurring for the alarm regulation of configures physical machine, configuration alarm, the alarm threshold of physical machine different monitoring amount is set, when monitored amount surmounts the anomalous events such as thresholding, alarm by various ways is recorded into daily record by alarm event simultaneously.
5. the device that in cloud computing system according to claim 4, physical equipment is managed, it is characterized in that, the facility information of every physical machine of described physical machine life cycle logging modle record, comprises that hardware buys physical location, device numbering and the network address of information, added time, the time of scrapping, physical machine; Described hardware purchase information comprises device name, numbering and model;
And/or,
Described physical machine remote operation module realizes the remote switch machine operation to physical machine by the mode of IPMI, comprises main frame power on electricity operation and reboot operation under operation, main frame;
And/or,
Described server system real-time monitoring module, by the monitor message of the server system of IPMI and OS aspect Real-time Obtaining, comprises the static information of monitored server system and the multidate information of monitored server system; The static information of described monitored server system comprises the information of CPU, internal memory, hard disk, CD-ROM drive, network interface card, video card, operating system, RAID card, PCI additional card; The multidate information of described monitored server system comprises temperature, voltage, rotation speed of the fan information and the system resource information of mainboard, CPU, SCSI module, fan board; Described system resource information comprises cpu busy percentage, memory usage, hard disk I/O flowing of access;
And/or,
Described server system alarm module carries out the various ways of alarm, comprises message box, mail, alerting tone and note.
6. a method for physical equipment management in cloud computing system, is characterized in that, comprising:
Step 202: use PXE, the virtualizing operating systems of automatic deployment physics bare machine, completes the ready or dilatation in virtual resources pond fast in batches;
Step 204: manage the life cycle of every physical machine, and the remote switch machine operation to physical machine;
Step 206: physical machine place resource is monitored in real time, obtained monitor message; And physical machine place resource is carried out to alarm management.
7. the method for physical equipment management in cloud computing system according to claim 6, is characterized in that, described step 202, specifically comprises:
Step 302: client computer starts, the program in network interface card PXE ROM is transferred internal memory and carries out;
Step 304: client finds after Dynamic Host Configuration Protocol server at network, acquisition request IP address; Dynamic Host Configuration Protocol server provides corresponding IP address and network parameter for client;
Step 306:DHCP server contact is this client transmission boot to tftp server; Client is carried out after receiving boot, and boot request TFTP transmits the configuration file of boot; Boot is read the configuration file that TFTP transmits, and according to this configuration file content and client's situation, client-requested TFTP transmits kernel image file and root file system file;
Step 308: start kernel; Kernel, according to the configuration file of bootstrap, by Network Capture operating system Auto-mounting script, and obtains the required installation file of system by network service, according to the configuration of Auto-mounting script, installs.
8. according to the method for physical equipment management in the cloud computing system described in claim 6 or 7, it is characterized in that, described step 204, specifically comprises:
Step 402: record the facility information of every physical machine in database;
Step 404: the mode by IPMI realizes the remote switch machine operation to physical machine.
9. the method for physical equipment management in cloud computing system according to claim 8, is characterized in that, described step 206, specifically comprises:
Step 502: by the monitor message of IPMI and OS aspect Real-time Obtaining server system;
Step 504: contact method, query warning list of thing when the alarm regulation of configures physical machine, configuration alarm occur, the alarm threshold of physical machine different monitoring amount is set, when monitored amount surmounts the anomalous events such as thresholding, alarm by various ways is recorded into daily record by alarm event simultaneously.
10. the method that in cloud computing system according to claim 9, physical equipment is managed, it is characterized in that, the facility information of every physical machine of described step 402 record, comprises that hardware buys physical location, device numbering and the network address of information, added time, the time of scrapping, physical machine; Described hardware purchase information comprises device name, numbering and model;
And/or,
Described step 404 realizes the remote switch machine operation to physical machine by the mode of IPMI, comprises main frame power on electricity operation and reboot operation under operation, main frame;
And/or,
Described step 502, by the monitor message of the server system of IPMI and OS aspect Real-time Obtaining, comprises the static information of monitored server system and the multidate information of monitored server system; The static information of described monitored server system comprises the information of CPU, internal memory, hard disk, CD-ROM drive, network interface card, video card, operating system, RAID card, PCI additional card; The multidate information of described monitored server system comprises temperature, voltage, rotation speed of the fan information and the system resource information of mainboard, CPU, SCSI module, fan board; Described system resource information comprises cpu busy percentage, memory usage, hard disk I/O flowing of access;
And/or,
Described step 504 is carried out the various ways of alarm, comprises message box, mail, alerting tone and note.
CN201410321156.XA 2014-07-07 2014-07-07 Device and method for managing physical devices in cloud computing system Pending CN104079434A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410321156.XA CN104079434A (en) 2014-07-07 2014-07-07 Device and method for managing physical devices in cloud computing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410321156.XA CN104079434A (en) 2014-07-07 2014-07-07 Device and method for managing physical devices in cloud computing system

Publications (1)

Publication Number Publication Date
CN104079434A true CN104079434A (en) 2014-10-01

Family

ID=51600490

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410321156.XA Pending CN104079434A (en) 2014-07-07 2014-07-07 Device and method for managing physical devices in cloud computing system

Country Status (1)

Country Link
CN (1) CN104079434A (en)

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104360894A (en) * 2014-11-18 2015-02-18 浪潮(北京)电子信息产业有限公司 Method and device for simulating physical equipment
CN104486148A (en) * 2014-12-04 2015-04-01 北京百度网讯科技有限公司 Server recovery control method and device
CN104639378A (en) * 2015-03-10 2015-05-20 浪潮集团有限公司 Automatic server deployment method based on PXE (pre-boot execution environment)
CN105306225A (en) * 2015-11-03 2016-02-03 国云科技股份有限公司 Openstack-based physical machine remote shutdown method
CN105353713A (en) * 2015-12-15 2016-02-24 国网北京市电力公司 Computer room monitoring system
CN105446657A (en) * 2015-11-11 2016-03-30 浪潮电子信息产业股份有限公司 Method for monitoring RAID card
CN105681081A (en) * 2016-01-12 2016-06-15 华为技术有限公司 Physical machine management method and device
CN106682198A (en) * 2016-12-29 2017-05-17 北京奇虎科技有限公司 Method and device for achieving automatic database deploying
CN106775798A (en) * 2016-01-28 2017-05-31 新华三技术有限公司 A kind of installation method of operating system and device
CN107566165A (en) * 2017-08-18 2018-01-09 国网山东省电力公司信息通信公司 A kind of method and system for finding and disposing electric power cloud data center available resources
CN107562518A (en) * 2017-08-26 2018-01-09 杭州云哟科技有限责任公司 Video card ROM extraction collection systems and method based on KVM virtualization technology
CN107566174A (en) * 2017-09-05 2018-01-09 郑州云海信息技术有限公司 A kind of network interface card identification and the realization method and system of bulk filling system
CN107995287A (en) * 2017-11-30 2018-05-04 郑州云海信息技术有限公司 A method for remotely monitoring the health status of data center nodes through IPMI
CN108011880A (en) * 2017-12-04 2018-05-08 郑州云海信息技术有限公司 The management method and computer-readable recording medium monitored in cloud data system
CN108900656A (en) * 2018-08-23 2018-11-27 郑州云海信息技术有限公司 A kind of method and device of batch deployment
CN109245917A (en) * 2018-08-20 2019-01-18 郑州云海信息技术有限公司 A kind of method and device of the bare machine management based on cloud platform
CN109818768A (en) * 2017-11-21 2019-05-28 中国移动通信有限公司研究院 A physical facility management system, PNF network management system and method
CN109962941A (en) * 2017-12-14 2019-07-02 华为技术有限公司 Communication method, device and server
CN110688130A (en) * 2019-10-14 2020-01-14 天津卓朗科技发展有限公司 Physical machine deployment method, physical machine deployment device, readable storage medium and electronic equipment
CN110750464A (en) * 2019-09-05 2020-02-04 北京浪潮数据技术有限公司 Computer node storage pooling method, device and system
CN111742317A (en) * 2018-02-14 2020-10-02 微软技术许可有限责任公司 Clears bare metal resources to a trusted state usable in cloud computing
CN112350855A (en) * 2020-10-26 2021-02-09 浪潮云信息技术股份公司 Configuration-based cloud center management method
CN113381881A (en) * 2021-05-25 2021-09-10 山东浪潮爱购云链信息科技有限公司 Method and device for monitoring alarm processing of host
CN114363295A (en) * 2020-09-28 2022-04-15 华为云计算技术有限公司 Tenant server management method and device
CN115442264A (en) * 2022-08-24 2022-12-06 浪潮云信息技术股份公司 Method and system for monitoring physical host ecology in cloud environment
CN117040999A (en) * 2023-08-18 2023-11-10 中科芯集成电路有限公司 Remote control method for physical server

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1929410A (en) * 2006-09-04 2007-03-14 曙光信息产业(北京)有限公司 Intelligent computers group monitoring method
CN101577698A (en) * 2008-05-09 2009-11-11 中兴通讯股份有限公司 System with external intelligent management server and method for monitoring server and processing commands
CN101719089A (en) * 2009-10-30 2010-06-02 曙光信息产业(北京)有限公司 Remote management method and system of distributed type assembly
WO2012054023A1 (en) * 2010-10-20 2012-04-26 Hewlett-Packard Development Company, L.P. Computer system with computers that perform network boots
CN102710788A (en) * 2012-06-18 2012-10-03 苏州超集信息科技有限公司 Rapid and unattended operation system
CN103297504A (en) * 2013-05-09 2013-09-11 浙江大学 Method for quickly deploying operating systems in physical bare computers in cloud data center
CN103401699A (en) * 2013-07-18 2013-11-20 深圳先进技术研究院 Cloud data center security monitoring early warning system and method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1929410A (en) * 2006-09-04 2007-03-14 曙光信息产业(北京)有限公司 Intelligent computers group monitoring method
CN101577698A (en) * 2008-05-09 2009-11-11 中兴通讯股份有限公司 System with external intelligent management server and method for monitoring server and processing commands
CN101719089A (en) * 2009-10-30 2010-06-02 曙光信息产业(北京)有限公司 Remote management method and system of distributed type assembly
WO2012054023A1 (en) * 2010-10-20 2012-04-26 Hewlett-Packard Development Company, L.P. Computer system with computers that perform network boots
CN102710788A (en) * 2012-06-18 2012-10-03 苏州超集信息科技有限公司 Rapid and unattended operation system
CN103297504A (en) * 2013-05-09 2013-09-11 浙江大学 Method for quickly deploying operating systems in physical bare computers in cloud data center
CN103401699A (en) * 2013-07-18 2013-11-20 深圳先进技术研究院 Cloud data center security monitoring early warning system and method

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104360894A (en) * 2014-11-18 2015-02-18 浪潮(北京)电子信息产业有限公司 Method and device for simulating physical equipment
CN104486148A (en) * 2014-12-04 2015-04-01 北京百度网讯科技有限公司 Server recovery control method and device
CN104639378A (en) * 2015-03-10 2015-05-20 浪潮集团有限公司 Automatic server deployment method based on PXE (pre-boot execution environment)
CN105306225B (en) * 2015-11-03 2018-09-07 国云科技股份有限公司 Openstack-based physical machine remote shutdown method
CN105306225A (en) * 2015-11-03 2016-02-03 国云科技股份有限公司 Openstack-based physical machine remote shutdown method
CN105446657A (en) * 2015-11-11 2016-03-30 浪潮电子信息产业股份有限公司 Method for monitoring RAID card
CN105446657B (en) * 2015-11-11 2018-06-19 浪潮电子信息产业股份有限公司 Method for monitoring RAID card
CN105353713A (en) * 2015-12-15 2016-02-24 国网北京市电力公司 Computer room monitoring system
CN105681081A (en) * 2016-01-12 2016-06-15 华为技术有限公司 Physical machine management method and device
CN105681081B (en) * 2016-01-12 2019-06-21 华为技术有限公司 Physical machine management method and device
CN106775798A (en) * 2016-01-28 2017-05-31 新华三技术有限公司 A kind of installation method of operating system and device
CN106682198A (en) * 2016-12-29 2017-05-17 北京奇虎科技有限公司 Method and device for achieving automatic database deploying
CN106682198B (en) * 2016-12-29 2020-09-04 北京奇虎科技有限公司 Method and device for realizing automatic database deployment
CN107566165A (en) * 2017-08-18 2018-01-09 国网山东省电力公司信息通信公司 A kind of method and system for finding and disposing electric power cloud data center available resources
CN107562518A (en) * 2017-08-26 2018-01-09 杭州云哟科技有限责任公司 Video card ROM extraction collection systems and method based on KVM virtualization technology
CN107562518B (en) * 2017-08-26 2020-12-18 杭州云哟科技有限责任公司 Graphics card ROM extraction and collection system and method based on KVM virtualization technology
CN107566174A (en) * 2017-09-05 2018-01-09 郑州云海信息技术有限公司 A kind of network interface card identification and the realization method and system of bulk filling system
CN109818768B (en) * 2017-11-21 2022-02-25 中国移动通信有限公司研究院 A physical facility management system, PNF network management system and method
CN109818768A (en) * 2017-11-21 2019-05-28 中国移动通信有限公司研究院 A physical facility management system, PNF network management system and method
CN107995287A (en) * 2017-11-30 2018-05-04 郑州云海信息技术有限公司 A method for remotely monitoring the health status of data center nodes through IPMI
CN108011880A (en) * 2017-12-04 2018-05-08 郑州云海信息技术有限公司 The management method and computer-readable recording medium monitored in cloud data system
CN109962941A (en) * 2017-12-14 2019-07-02 华为技术有限公司 Communication method, device and server
CN111742317A (en) * 2018-02-14 2020-10-02 微软技术许可有限责任公司 Clears bare metal resources to a trusted state usable in cloud computing
CN109245917A (en) * 2018-08-20 2019-01-18 郑州云海信息技术有限公司 A kind of method and device of the bare machine management based on cloud platform
CN108900656A (en) * 2018-08-23 2018-11-27 郑州云海信息技术有限公司 A kind of method and device of batch deployment
CN110750464A (en) * 2019-09-05 2020-02-04 北京浪潮数据技术有限公司 Computer node storage pooling method, device and system
CN110688130A (en) * 2019-10-14 2020-01-14 天津卓朗科技发展有限公司 Physical machine deployment method, physical machine deployment device, readable storage medium and electronic equipment
CN114363295A (en) * 2020-09-28 2022-04-15 华为云计算技术有限公司 Tenant server management method and device
CN114363295B (en) * 2020-09-28 2024-09-24 华为云计算技术有限公司 Management method and device of tenant server
CN112350855A (en) * 2020-10-26 2021-02-09 浪潮云信息技术股份公司 Configuration-based cloud center management method
CN113381881A (en) * 2021-05-25 2021-09-10 山东浪潮爱购云链信息科技有限公司 Method and device for monitoring alarm processing of host
CN113381881B (en) * 2021-05-25 2022-12-09 山东浪潮爱购云链信息科技有限公司 Method and device for monitoring alarm processing of host
CN115442264A (en) * 2022-08-24 2022-12-06 浪潮云信息技术股份公司 Method and system for monitoring physical host ecology in cloud environment
CN117040999A (en) * 2023-08-18 2023-11-10 中科芯集成电路有限公司 Remote control method for physical server

Similar Documents

Publication Publication Date Title
CN104079434A (en) Device and method for managing physical devices in cloud computing system
US10198284B2 (en) Ensuring operational integrity and performance of deployed converged infrastructure information handling systems
CN104360878B (en) A kind of method and device of application software deployment
US8578337B2 (en) Method and system for quality assurance subscription service
CN103167034B (en) Based on the construction method of the monitoring Agent of CloudStack dummy node
CN102929769B (en) Virtual machine internal-data acquisition method based on agency service
CN109684038B (en) Docker service container log processing method and device and electronic equipment
CN112230847B (en) A method, system, terminal and storage medium for monitoring K8s storage volume
CN117555760B (en) Server monitoring method and device, substrate controller and embedded system
CN111061741A (en) A power test data management method, system, terminal and storage medium
CN116304233A (en) Telemetry target query injection for enhanced debugging in a micro-service architecture
CN105141478A (en) Method for monitoring state of sas card hard disk of linux server
US20210263718A1 (en) Generating predictive metrics for virtualized deployments
CN107943637A (en) A kind of mains cycle test device and method based on IPMI platforms
CN109144821A (en) Physical server automatic management method in a kind of cloud computation data center
US12277433B2 (en) Desired state configuration for virtual machines
CN103248696A (en) Dynamic configuration method for virtual resource in cloud computing environment
CN104283970A (en) A cloud computing service device, system and cloud computing method
US9218205B2 (en) Resource management in ephemeral environments
CN104516744A (en) Software updating method and system
CN105354127A (en) Cloud management platform based monitoring method
CN116401109A (en) A control method, device, and medium for a chassis management system
CN103902310B (en) Scheduling system and method for starting of virtual machines
US12020039B2 (en) Compute instance warmup operations
CN209881824U (en) Data center and cloud computing system based on private cloud platform

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100094 Beijing city Haidian District North Road No. 68, UFIDA Software Park

Applicant after: Yonyou Network Technology Co., Ltd.

Address before: 100094 Beijing city Haidian District North Road No. 68, UFIDA Software Park

Applicant before: UFIDA Software Co., Ltd.

COR Change of bibliographic data
RJ01 Rejection of invention patent application after publication

Application publication date: 20141001

RJ01 Rejection of invention patent application after publication
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载