WO2007068650A2 - Simultaneous download to multiple targets - Google Patents

Simultaneous download to multiple targets Download PDF

Info

Publication number
WO2007068650A2
WO2007068650A2 PCT/EP2006/069438 EP2006069438W WO2007068650A2 WO 2007068650 A2 WO2007068650 A2 WO 2007068650A2 EP 2006069438 W EP2006069438 W EP 2006069438W WO 2007068650 A2 WO2007068650 A2 WO 2007068650A2
Authority
WO
WIPO (PCT)
Prior art keywords
target device
communication medium
bmc
firmware
update
Prior art date
Application number
PCT/EP2006/069438
Other languages
French (fr)
Other versions
WO2007068650A3 (en
Inventor
Sudhir Gurunandan Rao
Darren Christopher Douglas
Jason James Graves
Lei Liu
Original Assignee
International Business Machines Corporation
Ibm United Kingdom Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corporation, Ibm United Kingdom Limited filed Critical International Business Machines Corporation
Publication of WO2007068650A2 publication Critical patent/WO2007068650A2/en
Publication of WO2007068650A3 publication Critical patent/WO2007068650A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/34Network arrangements or protocols for supporting network services or applications involving the movement of software or configuration parameters 
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • G06F8/61Installation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant

Definitions

  • Embodiments of the invention relate to simultaneous download (e.g., of firmware or software) from a source to multiple targets.
  • An enterprise may be described as a large entity (e.g., a corporation, government entity, etc.) that uses computers.
  • An enterprise class storage system may be described as a storage system that is designed for high performance and high availability and that is primarily used by larger corporations to store data.
  • Non-disruptive maintenance for enterprise class storage systems enables achievement of availability higher than 99.99%. In particular, it is desirable for enterprise class storage systems to provide availability at least at 99.995%.
  • Non- disruptive maintenance for many enterprise class storage systems also includes firmware upgrades for devices on the enterprise class storage systems. Non-disruptive operation requires redundant devices to survive failures and non-disruptive maintenance requires increasing the reliability of the upgrade and minimizing the time to upgrade firmware on redundant devices.
  • some devices mirror other devices in the enterprise class storage system, and these mirrored devices may share responsibility and control over certain resources.
  • Each of the mirrored devices should have a same level of firmware.
  • FIG. 1 illustrates a prior art system for downloading firmware to multiple target devices.
  • an initiating device 100 downloads firmware over device communication mediums 140, 150, 160 to Target Device A 110, Target Device B 120, and Target Device C 130, respectively.
  • the initiating device 100 downloads the firmware over a device communication medium connecting the initiating device 100 to a target device that is to be updated with firmware.
  • the target device firmware is updated with the firmware.
  • firmware is downloaded to one target device at a time on the device communication medium 140, 150, 160 being used. Even if the initiating device 100 is downloading firmware to multiple target devices, a separate device communication medium 140, 150, 160 is used for each target device.
  • another device may initiate and control the update.
  • the traditional technique is to perform firmware upgrades serially (i.e., by downloading the firmware to one device at a time) .
  • This approach has the potential to reduce overall reliability and availability of the enterprise class storage system owing to potential for errors leading to mismatching firmware on redundant devices and total upgrade time being proportional to the number of redundant devices, with any enterprise class storage system fault in this time having the potential for increasing enterprise class storage system downtime (e.g., in an enterprise class storage system with a redundant pair of devices, firmware upgrade takes two times (2X) the time to upgrade one device) .
  • FIG. 1 illustrates a prior art system for downloading firmware to multiple target devices
  • FIG. 2 illustrates details of a computing system in accordance with certain embodiments .
  • FIGS. 3A and 3B illustrate logic for a simultaneous firmware download in accordance with certain embodiment.
  • FIGS. 4A and 4B illustrate firmware download protocol control flow and data flow between a management module and an active BMC and between an active BMC and a standby BMC in accordance with certain embodiments.
  • FIG. 5 illustrates a system architecture that may be implemented by a server in accordance with certain embodiments .
  • FIG. 6 illustrates a system architecture that may be implemented with functionality of a management module in accordance with certain embodiments .
  • Embodiments perform a simultaneous download to multiple targets that share a common device communication medium (i.e., a transmission medium) .
  • a common device communication medium i.e., a transmission medium
  • embodiments provide a way to simultaneously update both devices (also referred to as "target” devices) from the device initiating the update (also referred to as an "initiating" device) .
  • Embodiments improve efficiency for the initiating device initiating the firmware update by performing a redundant update that does not require any changes on the initiating device.
  • FIG. 2 illustrates details of a computing system in accordance with certain embodiments of the present invention.
  • a Management Module (MM) 200 is an initiating device and is connected via an external device communication medium 250 to a server 210.
  • the server 210 includes active Baseboard Management Controller (BMC) 220 and standby Baseboard Management Controller (BMC) 230.
  • the active and standby BMCs 220, 230 provide management capabilities to local resources and redundant management capabilities to shared resources.
  • the standby BMC 230 includes an indicator 232 that is set to maintain synchronization between the active BMC 220 and the standby BMC 230 during downloads.
  • the active BMC 220 and standby BMC 230 may be referred to as first and second target devices, respectively.
  • the active BMC 220 and the standby BMC 230 may communicate over a mirroring link 242.
  • the mirroring link 242 may be described as a private device communication medium that enables private communication between the active and standby BMCs 220, 230.
  • the management module 200 is not able to send communications directly on the mirroring link 242.
  • the management module 200 downloads firmware to both the active and standby BMCs 220, 230 by issuing the firmware over a single external device communication medium 250 to both the active and standby BMCs 220, 230.
  • the internal device communication medium 240 is coupled to the external device communication medium 250.
  • the external device communication medium 250 may be a bus (e.g., a RS485 serial bus inter - integrated circuit (I2C) bus, a Dual Port RAM (DPRAM), or other bus-based media)
  • the internal device communication medium 240 may be a bus (e.g., an internal RS485 serial bus, inter - integrated circuit (I2C) bus, a Dual Port RAM (DPRAM), or other bus-based media) connected to the external device communication medium 250.
  • the firmware download solution is used for a blade server in an IBM® BladeCenter® chassis (available from International Business Machines Corporation) , where the blade server has dual baseboard management controllers (BMCs) .
  • the blade server may be described as an enterprise class storage system.
  • any set of target devices e.g., redundant devices, such as the active and standby BMCs 220, 230
  • may use any shared communication medium e.g., internal device communication medium 240
  • any private device communication medium e.g., mirroring link 242
  • Promiscuous mode may be described as a mode in which the target devices intercept and read each communication (e.g., network packet) that arrives in its entirety.
  • Embodiments may be used in networks that are serial or non-serial. Although examples herein may refer to firmware download, embodiments are applicable to software downloads. Also, there may be any number of devices that receive the download over a same device communication medium.
  • the management module 200 is only aware of the active BMC 220.
  • the active BMC 220 is the target device that receives the download from the management module, which is the initiator of the firmware download.
  • Both the active BMC 220 and the standby BMC 230 use the same communication medium to communicate with the management module 200.
  • the active and standby BMCs 220, 230 provide management capabilities to local resources and redundant management capabilities to shared resources.
  • the active and standby BMCs 220, 230 maintain consistency with each other through the mirroring link 220 (e.g., an inter - integrated circuit (I2C) bus) .
  • the standby BMC 230 may take over communication with the management module 200. Therefore, when a firmware download occurs through the management module 200, both the active and standby BMCs 220, 230 are updated so that they maintain firmware consistency.
  • I2C inter - integrated circuit
  • one BMC e.g., the active BMC 220
  • one BMC is said to "own" the external device communication medium 250, and so one BMC is capable of communicating with the management module 200 at a time.
  • the management module 200 is aware of sending commands to the active BMC 220, the management module 200 does not speak directly to the active BMC 220. Instead, the management module 200 sends messages to an address on the external device communication medium 250 that is associated with the server 210 slot, and the active and standby BMCs 220, 230 are capable of responding to and/or listening on this address. Therefore, from the management module 200 perspective, the management module is speaking to one BMC at any moment in time.
  • the dual BMCs 220, 230 in a server 210 there is no hardware inhibitor that prevents both BMCs from actively using the external device communication medium 250 (where the standby BMC 230 may access the external device communication medium 250 via the internal device communication medium 240) .
  • the external device communication medium 250 is actively used by a default BMC that is defined as the active BMC 220, and the other BMC 230 remains in an inactive state with the internal device communication medium 240 until the active BMC 220 that is actively using the external device communication medium 250 fails.
  • embodiments enable the active and standby BMCs 220, 230 to communicate on the mirroring link 242 and enable the standby BMC 230 to use the internal device communication medium 240 while firmware is being downloaded to the active BMC 220.
  • the active BMC 220 communicates with the standby BMC 230 over the mirroring link 242 to indicate to the standby BMC 230 that firmware is about to be downloaded.
  • the standby BMC acknowledges receipt of this indication to the active BMC 220 and starts to listen to communications on the internal device communication medium 240.
  • the standby BMC 230 does not listen to communications on the internal device communication medium 240.
  • embodiments avoid the use of separate, external device communication mediums for each target device (see prior art FIG. 1) .
  • the active and standby BMCs 220, 230 are designed for redundancy, it is important that when firmware on the active BMC 220 is updated, the firmware on the standby BMC 230 is also updated with the same firmware .
  • FIGS. 3A and 3B illustrate logic for a simultaneous download in accordance with certain embodiment.
  • Control begins at block 300 of FIG. 3A with the management module 200 initiating download via the external device communication medium 250 by issuing a download command that indicates that an update is to be downloaded.
  • the download command may be, for example, one that indicates that the active BMC 220 is to move into a state to receive an update (i.e., an "update state") .
  • the update may occur at a BMC when that BMC is in a boot mode.
  • Boot mode may be described as a mode in which an operating system runs in a restricted manner, such that the operating system processes a subset of commands (rather than all commands the operating system would otherwise process) , and where the subset of commands pertains to updating firmware and to verifying that an instruction set in memory has not been corrupted.
  • the active BMC 220 receives the download command.
  • the active BMC issues the same download command to the standby BMC 230 via the mirroring link 242.
  • the management module 200 establishes an update session with the active BMC 220, the active BMC 220 contacts the standby BMC 230 and establishes a standby BMC 230 update proxy session.
  • block 306 and blocks 308-312 may occur in parallel or serial form.
  • the active BMC 220 enters an update phase.
  • the standby BMC 230 receives the download command.
  • the standby BMC 230 enters an update phase.
  • the standby BMC 230 activates the internal device communication medium 240. Activating the internal communication medium 240 results in the standby BMC 230 listening on the internal device communication medium 240.
  • the standby BMC 230 Prior to receipt of the download command, the standby BMC 230 was not listening on the internal device communication medium 240.
  • the standby BMC 230 may be said to be a passive listener that is snooping on packets on the external device communication medium 250 via the internal communication medium 240.
  • the active and standby BMCs 220, 230 receive at least a portion of an update via the device communication mediums 240, 250.
  • the update include a portion of firmware or software that is being downloaded.
  • the active BMC 220 listens on the external device communication medium 250, while the standby BMC 230 listens on the internal device communication medium 240.
  • the management module 200 determines whether the download is complete (i.e., whether the update has downloaded completely) . In certain embodiments, the management module 200 maintains an indication of how many blocks make up a download and makes the determination of whether the download is complete based on the number of blocks that have been sent. If so, processing continues to block 318, otherwise, processing continues to block 322. In block 322, the management module 200 continues sending at least a portion of the update to the active and standby BMCs 220, 230. In this manner, the active and standby BMCs 220, 230 may receive the update as a series of downloads or as a single download (e.g., depending on the size of the download) .
  • the active BMC 220 that is actively using the external device communication medium 250 receives the download command from the management module 200, the active BMC 220 sends a corresponding download command over the mirroring link 242 to the standby BMC 230.
  • the standby BMC 230 then activates the internal device communication medium. Therefore, for the update state, both the active and standby BMCs 220, 230 actively use the communication mediums 240, 250 until the download is complete.
  • the management module 200 is executing the download as if downloading to one BMC. Therefore, with embodiments, no changes are needed to the management module 200 to operate with more than one BMC in a server 210 using this simultaneous download solution.
  • the management module 200 issues a reset download command, which indicates to the active and standby BMCs 220, 230 that they are to come out of the update state (e.g., boot mode) and start running with the updated firmware.
  • the active and standby BMCs 220, 230 run with the updated firmware. That is, when the management module 200 issues the reset download command, both the active and standby BMCs 220, 230 receive the reset download command and reset.
  • the active and standby BMCs 220, 230 return to their default operating state and the BMC that is predefined as the active BMC 220 on the deice communication medium 250 reclaims that activity, while the standby BMC 230 returns to an inactive state with the internal device communication medium 240.
  • both the active and standby BMCs 220, 230 update the appropriate data (e.g., firmware or software), and the active BMC 220 waits for an affirmative acknowledgment from the standby BMC 230 on the mirroring link 242.
  • the active BMC 220 acknowledges the affirmative acknowledgment of the standby BMC 230.
  • the standby BMC 230 uses this affirm-acknowledge protocol to maintain an indicator (e.g., a bit) in set/reset states to keep the active and standby BMCs 220, 230 synched in a lock-step manner.
  • the active BMC 220 may be interrupted as a result of which timing delays may break the synchronization without use of the indicator.
  • the active BMC 220 responds to the management module 200 for every packet upon receiving the affirmative acknowledgment from the standby BMC 230 or upon time out. Upon time out, the management module 200 may resend the packet.
  • thresholds may be added in the management module 200 to recover from active BMC 220 or standby BMC 230 faults, where the thresholds relate to timeouts by the active BMC 220 or standby BMC 230.
  • FIGS. 4A and 4B illustrate a firmware download protocol control flow and data flow between a management module 200 and an active BMC 220 and between the active BMC 220 and a standby BMC 230 in accordance with certain embodiments.
  • the solid lines illustrate control flow
  • the dashed lines illustrate data flow (i.e., message flow) .
  • the active BMC 220 syncs up with the standby BMC 230 with a command-send and acknowledgment (e.g., an "OK" response).
  • communication between the BMCs is on the mirroring link 242 (see Fig. 2) .
  • the standby BMC 230 listens on the internal device communication medium 240 and updates the firmware at the same time as the active BMC 230, which is listening on the external device communication medium 250.
  • the standby BMC 230 sends an acknowledgment (e.g., another "OK" response) to the active BMC 220.
  • the active BMC 220 responds back with an acknowledgment that the standby BMC 230 uses to clear an internal indicator (e.g., a bit) .
  • This indicator and the acknowledgment/response protocol for each firmware update is used to keep the active and standby BMC pair 220, 230 in lock- step so that one BMC does not go ahead of the other.
  • the management module 200 may reinitiate the firmware download in the event of errors or timeouts.
  • the active BMC 220 and standby BMC 230 would remain in the update state (e.g., boot mode) .
  • Control begins at block 400 of FIG. 4A with the management module 200 issuing a firmware download command to the active BMC 220 via the external device communication medium 250 and waits to receive an acknowledgment.
  • the management module 200 receives the acknowledgment from the active BMC 220.
  • the management module 200 sends a next firmware refresh code section update and waits to receive an acknowledgment.
  • block 406 FIG. 4B
  • the management module 200 receives the acknowledgment from the active BMC 420.
  • the management module 200 determines whether the firmware download is complete. If so, processing continues to block 410, otherwise, processing loops back to block 404 (FIG. 4A) .
  • the management module 200 sends a download complete message over the external device communication medium 250.
  • the active and standby BMCs 220, 230 receive the firmware update as a series of downloads.
  • the active BMC 220 receives the firmware download command from the management module 200 and sends the firmware download command to the standby BMC via the mirroring link 242 and waits for an acknowledgment.
  • the active BMC 220 receives an acknowledgment from the standby BMC 230 that the firmware download command was received at the standby BMC 230 and responds with an acknowledgment to the management module 200 that indicates that the firmware download command was received at the active BMC 220.
  • the active BMC 220 receives the next firmware refresh code section update.
  • the active BMC 220 waits for an acknowledgment from the standby BMC 230.
  • the standby BMC 230 is directly receiving the firmware refresh code section update via the internal device communication medium 240, and, once the update is received, the standby BMC 230 sends an acknowledgment that indicates receipt of the firmware refresh code section update to the active BMC 220. From block 426 (FIG. 4A), processing continues to block 428 (FIG. 4B) .
  • the active BMC 220 determines whether an acknowledgment has been received from the standby BMC 230. If so, processing continues to block 430, otherwise, processing continues to block 436. In block 430, the active BMC 220 sends an acknowledgment to the standby BMC 230 and to the management module 200.
  • the acknowledgment to the standby BMC 230 indicates that an indicator may be cleared (i.e., reset) .
  • the acknowledgment to the management module 200 indicates that the last received firmware refresh code section update has been processed.
  • the active BMC 220 determines whether the firmware download is complete. If so, processing continues to block 434, otherwise, processing loops back to block 424 (FIG. 4A) to receive the next update. In block 434, the active BMC 220 returns to normal operation.
  • processing continues to block 436, and the active BMC 220 determines whether processing by the standby BMC 230 has timed out (i.e., has taken more than a predetermined amount of time) . If so, processing continues to block 438, otherwise, processing loops back to block 426 (FIG. 4A) and waits for the acknowledgment.
  • the active BMC 220 processes the error. In certain embodiments, the active BMC 220 sends an error message to the management module 200 and waits to hear from the management module 200. In certain embodiments, the active BMC 220 lets the management module 200 timeout and waits to hear from the management module 200.
  • the management module 200 may resend the firmware download command in the event that the firmware download takes longer than an amount of time allotted by the management module 200. Also, the management module 200 may timeout if the firmware download takes too long. In this event, the firmware download may be reported as failed to the end user, and the user may reinitiate the firmware download. However, if the active BMC 220 reports an error, the active BMC is reporting that firmware download failed, which is why the firmware download may begin again. In certain alternative embodiments, the active BMC 220 may return an error on the last packet received, and the management module 200 (or an initiating device) may resend just the last packet. From block 438, processing continues to block 420, and the active BMC 220 waits to receive the firmware download command from the management module 200.
  • the standby BMC 230 receives the firmware download command from the active BMC 230 and activates the internal device communication medium for reception. In block 452, the standby BMC 230 responds with an acknowledgment that the firmware download command has been received to the active BMC 220. In block 454, the standby BMC 230 receives the next firmware refresh code section update via the internal device communication medium 240, sends an acknowledgment of receipt to the active BMC 220, and sets an internal indicator. From block 454, processing continues to block 456 (FIG. 4B) .
  • the standby BMC 230 receives an acknowledgment to clear the indicator from the active BMC 230 and clears the indicator.
  • the standby BMC 230 determines whether the firmware download is complete. If so, processing continues to block 460, otherwise, processing loops back to block 454 (FIG. 4A) to receive the next update. In block 460, the standby BMC 230 returns to normal operation.
  • one standby BMC has been shown. However, there may be more than one standby BMC or target device.
  • the firmware download protocol described herein may be performed via arbitration over the shared equivalent of the mirroring link.
  • multiple standby BMCs may passively listen for the download, in the event that there is more than one standby BMC listening in to the firmware download, certain embodiments may require arbitration with regard to the responses the standby BMCs provide back to the active BMC.
  • the active BMC may only obtain one response at a time from the standby BMCs, so, if one of the standby BMCs fails to get the update correctly, that standby BMC sends a message to the active BMC.
  • the arbitration is in relation to the responses provided by the standby BMCs.
  • the simultaneous (e.g., "one-shot") firmware download minimizes the time for a non-disruptive firmware upgrade (i.e., to IX the time to download firmware to one device) and increases the overall availability and reliability by avoiding firmware disparity across target devices, which could happen otherwise as a result of errors, and also by reducing the time for discovery of a faulty firmware upgrade.
  • embodiments exploit the existence of an additional device communication medium (i.e., the mirroring link) between redundant devices that may normally exist for detection for failure of the active device.
  • This additional device communication medium is distinct from the device communication medium that is used for firmware upgrades and may be used to correctly and reliably accomplish a simultaneous firmware upgrade.
  • IBM and BladeCenter are registered trademarks or common law marks of International Business Machines Corporation in the United States and/or other countries.
  • the described operations may be implemented as a method, computer program product or apparatus using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof.
  • Each of the embodiments may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment containing both hardware and software elements.
  • the embodiments may be implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.
  • the embodiments may take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system.
  • a computer-usable or computer readable medium may be any apparatus that may contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • the described operations may be implemented as code maintained in a computer-usable or computer readable medium, where a processor may read and execute the code from the computer readable medium.
  • the medium may be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium.
  • Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a rigid magnetic disk, an optical disk, magnetic storage medium (e.g., hard disk drives, floppy disks, tape, etc.), volatile and non-volatile memory devices (e.g., a random access memory (RAM), DRAMs, SRAMs, a read-only memory (ROM) , PROMs, EEPROMs, Flash Memory, firmware, programmable logic, etc.) .
  • Current examples of optical disks include compact disk - read only memory (CD-ROM), compact disk - read/write (CD-R/W) and DVD.
  • the code implementing the described operations may further be implemented in hardware logic (e.g., an integrated circuit chip, Programmable Gate Array (PGA) , Application Specific Integrated Circuit (ASIC), etc.). Still further, the code implementing the described operations may be implemented in "transmission signals", where transmission signals may propagate through space or through a transmission media, such as an optical fiber, copper wire, etc.
  • the transmission signals in which the code or logic is encoded may further comprise a wireless signal, satellite transmission, radio waves, infrared signals, Bluetooth, etc.
  • the transmission signals in which the code or logic is encoded is capable of being transmitted by a transmitting station and received by a receiving station, where the code or logic encoded in the transmission signal may be decoded and stored in hardware or a computer readable medium at the receiving and transmitting stations or devices.
  • a computer program product may comprise computer useable or computer readable media, hardware logic, and/or transmission signals in which code may be implemented.
  • code may be implemented.
  • the computer program product may comprise any suitable information bearing medium known in the art.
  • logic may include, by way of example, software, hardware, firmware, and/or combinations of software and hardware.
  • Certain implementations may be directed to a method for deploying computing infrastructure by a person or automated processing integrating computer-readable code into a computing system, wherein the code in combination with the computing system is enabled to perform the operations of the described implementations.
  • FIGs. 3A, 3B, 4A, and 4B describes specific operations occurring in a particular order. In alternative embodiments, certain of the logic operations may be performed in a different order, modified or removed. Moreover, operations may be added to the above described logic and still conform to the described embodiments. Further, operations described herein may occur sequentially or certain operations may be processed in parallel, or operations described as performed by a single process may be performed by distributed processes.
  • FIG. 5 illustrates a system architecture 500 that may be implemented by the server 210 in accordance with certain embodiments.
  • the system architecture 500 is suitable for storing and/or executing program code and includes at least one processor 502 coupled directly or indirectly to memory elements 504 through a system bus 520.
  • the memory elements 504 may include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
  • the memory elements 504 include an operating system 505 and one or more computer programs 506.
  • the memory elements 504 may also include code 530 that implements some or all of the described operations taught by embodiments of the invention. Although code 530 is shown, the described operations taught by embodiments of the invention may alternatively be implemented in hardware or in a combination of hardware and software.
  • I/O devices 512, 514 may be coupled to the system either directly or through intervening I/O controllers 510.
  • Network adapters 508 may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters 508.
  • the system architecture 500 may be coupled to storage 516 (e.g., a non-volatile storage area, such as magnetic disk drives, optical disk drives, a tape drive, etc.) .
  • the storage 516 may comprise an internal storage device or an attached or network accessible storage.
  • Computer programs 506 in storage 516 may be loaded into the memory elements 504 and executed by a processor 502 in a manner known in the art.
  • the system architecture 500 may include fewer components than illustrated, additional components not illustrated herein, or some combination of the components illustrated and additional components.
  • the system architecture 500 may comprise any computing device known in the art, such as a mainframe, server, personal computer, workstation, laptop, handheld computer, telephony device, network appliance, virtualization device, storage controller, etc.
  • FIG. 6 illustrates a system architecture 600 that may be implemented with functionality of a management module 200 in accordance with certain embodiments.
  • the system architecture 600 is suitable for storing and/or executing program code and includes at least one processor 602 coupled directly or indirectly to memory elements 604 through a system bus 620.
  • the memory elements 604 may include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
  • the memory elements 604 include an operating system 605 and one or more computer programs 606.
  • the memory elements 604 may also include code 630 that implements some or all of the described operations taught by embodiments of the invention. Although code 630 is shown, the described operations taught by embodiments of the invention may alternatively be implemented in hardware or in a combination of hardware and software.
  • I/O devices 612, 614 may be coupled to the system either directly or through intervening I/O controllers 610.
  • Network adapters 608 may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters 608.
  • the system architecture 600 may be coupled to storage 616 (e.g., a non-volatile storage area, such as magnetic disk drives, optical disk drives, a tape drive, etc.) .
  • the storage 616 may comprise an internal storage device or an attached or network accessible storage.
  • Computer programs 606 in storage 616 may be loaded into the memory elements 604 and executed by a processor 602 in a manner known in the art.
  • the system architecture 600 may include fewer components than illustrated, additional components not illustrated herein, or some combination of the components illustrated and additional components.
  • the system architecture 600 may comprise any computing device known in the art, such as a mainframe, server, personal computer, workstation, laptop, handheld computer, telephony device, network appliance, virtualization device, storage controller, etc.

Abstract

Provided are techniques for processing a download. Under control of a first target device (220), a download command is received via a first communication medium (250) that indicates that a firmware update is to be downloaded. The download command is issued to a second target device (230) via a second communication medium (242), wherein the second communication medium is a private device communication medium that enables private communication between the first target device and the second target device. The update is received via the first communication medium and a third communication medium (240). In response to determining that the update has been downloaded, the first and second target device run with the update.

Description

SIMULTANEOUS DOWNLOAD TO MULTIPLE TARGETS
BACKGROUND
Field of the Invention
Embodiments of the invention relate to simultaneous download (e.g., of firmware or software) from a source to multiple targets.
Description of the Related Art
An enterprise may be described as a large entity (e.g., a corporation, government entity, etc.) that uses computers. An enterprise class storage system may be described as a storage system that is designed for high performance and high availability and that is primarily used by larger corporations to store data. Non-disruptive maintenance for enterprise class storage systems enables achievement of availability higher than 99.99%. In particular, it is desirable for enterprise class storage systems to provide availability at least at 99.995%. Non- disruptive maintenance for many enterprise class storage systems also includes firmware upgrades for devices on the enterprise class storage systems. Non-disruptive operation requires redundant devices to survive failures and non-disruptive maintenance requires increasing the reliability of the upgrade and minimizing the time to upgrade firmware on redundant devices.
To support redundancy, some devices mirror other devices in the enterprise class storage system, and these mirrored devices may share responsibility and control over certain resources. Each of the mirrored devices should have a same level of firmware. In conventional enterprise class storage systems, there are separate device communication mediums (or "paths") to each device such that each device can receive a separate update of the same firmware.
FIG. 1 illustrates a prior art system for downloading firmware to multiple target devices. In FIG. 1, an initiating device 100 downloads firmware over device communication mediums 140, 150, 160 to Target Device A 110, Target Device B 120, and Target Device C 130, respectively. To download firmware, the initiating device 100 downloads the firmware over a device communication medium connecting the initiating device 100 to a target device that is to be updated with firmware. Then, the target device firmware is updated with the firmware. When there are multiple target devices 110, 120, 130, firmware is downloaded to one target device at a time on the device communication medium 140, 150, 160 being used. Even if the initiating device 100 is downloading firmware to multiple target devices, a separate device communication medium 140, 150, 160 is used for each target device. In some cases of downloading firmware to target devices, another device (other than the initiating device 100) may initiate and control the update.
Thus, the traditional technique is to perform firmware upgrades serially (i.e., by downloading the firmware to one device at a time) . This approach has the potential to reduce overall reliability and availability of the enterprise class storage system owing to potential for errors leading to mismatching firmware on redundant devices and total upgrade time being proportional to the number of redundant devices, with any enterprise class storage system fault in this time having the potential for increasing enterprise class storage system downtime (e.g., in an enterprise class storage system with a redundant pair of devices, firmware upgrade takes two times (2X) the time to upgrade one device) .
Thus, there is a need in the art for improved simultaneous download of firmware to multiple target devices.
SUMMARY OF EMBODIMENTS OF THE INVENTION
Provided are a method as claimed in claim 1 and corresponding computer program and system.
BRIEF DESCRIPTION OF THE DRAWINGS
Referring now to the drawings in which like reference numbers represent corresponding parts throughout:
FIG. 1 illustrates a prior art system for downloading firmware to multiple target devices FIG. 2 illustrates details of a computing system in accordance with certain embodiments .
FIGS. 3A and 3B illustrate logic for a simultaneous firmware download in accordance with certain embodiment.
FIGS. 4A and 4B illustrate firmware download protocol control flow and data flow between a management module and an active BMC and between an active BMC and a standby BMC in accordance with certain embodiments.
FIG. 5 illustrates a system architecture that may be implemented by a server in accordance with certain embodiments .
FIG. 6 illustrates a system architecture that may be implemented with functionality of a management module in accordance with certain embodiments .
DETAILED DESCRIPTION
In the following description, reference is made to the accompanying drawings which form a part hereof and which illustrate several embodiments of the invention. It is understood that other embodiments may be utilized and structural and operational changes may be made without departing from the scope of the invention.
Embodiments perform a simultaneous download to multiple targets that share a common device communication medium (i.e., a transmission medium) . When the devices share the same device communication medium, embodiments provide a way to simultaneously update both devices (also referred to as "target" devices) from the device initiating the update (also referred to as an "initiating" device) . Embodiments improve efficiency for the initiating device initiating the firmware update by performing a redundant update that does not require any changes on the initiating device.
FIG. 2 illustrates details of a computing system in accordance with certain embodiments of the present invention. In FIG. 2, a Management Module (MM) 200 is an initiating device and is connected via an external device communication medium 250 to a server 210. The server 210 includes active Baseboard Management Controller (BMC) 220 and standby Baseboard Management Controller (BMC) 230. The active and standby BMCs 220, 230 provide management capabilities to local resources and redundant management capabilities to shared resources. The standby BMC 230 includes an indicator 232 that is set to maintain synchronization between the active BMC 220 and the standby BMC 230 during downloads. The active BMC 220 and standby BMC 230 may be referred to as first and second target devices, respectively. The active BMC 220 and the standby BMC 230 may communicate over a mirroring link 242. The mirroring link 242 may be described as a private device communication medium that enables private communication between the active and standby BMCs 220, 230. The management module 200 is not able to send communications directly on the mirroring link 242. The management module 200 downloads firmware to both the active and standby BMCs 220, 230 by issuing the firmware over a single external device communication medium 250 to both the active and standby BMCs 220, 230. The internal device communication medium 240 is coupled to the external device communication medium 250. In certain embodiments, the external device communication medium 250 may be a bus (e.g., a RS485 serial bus inter - integrated circuit (I2C) bus, a Dual Port RAM (DPRAM), or other bus-based media) , and the internal device communication medium 240 may be a bus (e.g., an internal RS485 serial bus, inter - integrated circuit (I2C) bus, a Dual Port RAM (DPRAM), or other bus-based media) connected to the external device communication medium 250.
In certain embodiments, the firmware download solution is used for a blade server in an IBM® BladeCenter® chassis (available from International Business Machines Corporation) , where the blade server has dual baseboard management controllers (BMCs) . The blade server may be described as an enterprise class storage system. However, embodiments are applicable to any set of target devices (e.g., redundant devices, such as the active and standby BMCs 220, 230), may use any shared communication medium (e.g., internal device communication medium 240) between the target devices that permits "snooping" in promiscuous mode, and may use any private device communication medium (e.g., mirroring link 242) between the redundant target devices. Promiscuous mode may be described as a mode in which the target devices intercept and read each communication (e.g., network packet) that arrives in its entirety. Embodiments may be used in networks that are serial or non-serial. Although examples herein may refer to firmware download, embodiments are applicable to software downloads. Also, there may be any number of devices that receive the download over a same device communication medium.
In FIG. 2, the management module 200 is only aware of the active BMC 220. Thus, the active BMC 220 is the target device that receives the download from the management module, which is the initiator of the firmware download. Both the active BMC 220 and the standby BMC 230 use the same communication medium to communicate with the management module 200. As mentioned above, the active and standby BMCs 220, 230 provide management capabilities to local resources and redundant management capabilities to shared resources. The active and standby BMCs 220, 230 maintain consistency with each other through the mirroring link 220 (e.g., an inter - integrated circuit (I2C) bus) . Thus, in case of failure of the active BMC 220, the standby BMC 230 may take over communication with the management module 200. Therefore, when a firmware download occurs through the management module 200, both the active and standby BMCs 220, 230 are updated so that they maintain firmware consistency.
During normal operation (e.g., when a firmware download is not occurring), one BMC (e.g., the active BMC 220) is said to "own" the external device communication medium 250, and so one BMC is capable of communicating with the management module 200 at a time. Although the management module 200 is aware of sending commands to the active BMC 220, the management module 200 does not speak directly to the active BMC 220. Instead, the management module 200 sends messages to an address on the external device communication medium 250 that is associated with the server 210 slot, and the active and standby BMCs 220, 230 are capable of responding to and/or listening on this address. Therefore, from the management module 200 perspective, the management module is speaking to one BMC at any moment in time. In certain embodiments of the dual BMCs 220, 230 in a server 210, there is no hardware inhibitor that prevents both BMCs from actively using the external device communication medium 250 (where the standby BMC 230 may access the external device communication medium 250 via the internal device communication medium 240) . In certain embodiments, however, the external device communication medium 250 is actively used by a default BMC that is defined as the active BMC 220, and the other BMC 230 remains in an inactive state with the internal device communication medium 240 until the active BMC 220 that is actively using the external device communication medium 250 fails. Thus, to address the issue of simultaneous firmware downloads, embodiments enable the active and standby BMCs 220, 230 to communicate on the mirroring link 242 and enable the standby BMC 230 to use the internal device communication medium 240 while firmware is being downloaded to the active BMC 220. In particular, the active BMC 220 communicates with the standby BMC 230 over the mirroring link 242 to indicate to the standby BMC 230 that firmware is about to be downloaded. In response, the standby BMC acknowledges receipt of this indication to the active BMC 220 and starts to listen to communications on the internal device communication medium 240. Prior to receipt of an indication from the active BMC 220 that firmware is about to be downloaded, the standby BMC 230 does not listen to communications on the internal device communication medium 240. Also, unlike conventional systems, embodiments avoid the use of separate, external device communication mediums for each target device (see prior art FIG. 1) .
Since the active and standby BMCs 220, 230 are designed for redundancy, it is important that when firmware on the active BMC 220 is updated, the firmware on the standby BMC 230 is also updated with the same firmware .
FIGS. 3A and 3B illustrate logic for a simultaneous download in accordance with certain embodiment. Control begins at block 300 of FIG. 3A with the management module 200 initiating download via the external device communication medium 250 by issuing a download command that indicates that an update is to be downloaded. The download command may be, for example, one that indicates that the active BMC 220 is to move into a state to receive an update (i.e., an "update state") . In certain embodiments, the update may occur at a BMC when that BMC is in a boot mode. Boot mode may be described as a mode in which an operating system runs in a restricted manner, such that the operating system processes a subset of commands (rather than all commands the operating system would otherwise process) , and where the subset of commands pertains to updating firmware and to verifying that an instruction set in memory has not been corrupted. In block 302, the active BMC 220 receives the download command. In block 304, the active BMC issues the same download command to the standby BMC 230 via the mirroring link 242. Thus, when the management module 200 establishes an update session with the active BMC 220, the active BMC 220 contacts the standby BMC 230 and establishes a standby BMC 230 update proxy session.
The processing of block 306 and blocks 308-312 may occur in parallel or serial form. In block 306, the active BMC 220 enters an update phase. In block 308, the standby BMC 230 receives the download command. In block 310, the standby BMC 230 enters an update phase. In block 312, the standby BMC 230 activates the internal device communication medium 240. Activating the internal communication medium 240 results in the standby BMC 230 listening on the internal device communication medium 240. Prior to receipt of the download command, the standby BMC 230 was not listening on the internal device communication medium 240. In particular, the standby BMC 230 may be said to be a passive listener that is snooping on packets on the external device communication medium 250 via the internal communication medium 240.
From blocks 306, 312 processing continues to block 314 in FIG. 3B. In block 314, the active and standby BMCs 220, 230 receive at least a portion of an update via the device communication mediums 240, 250. Examples of the update include a portion of firmware or software that is being downloaded. In particular, the active BMC 220 listens on the external device communication medium 250, while the standby BMC 230 listens on the internal device communication medium 240.
In block 316, the management module 200 determines whether the download is complete (i.e., whether the update has downloaded completely) . In certain embodiments, the management module 200 maintains an indication of how many blocks make up a download and makes the determination of whether the download is complete based on the number of blocks that have been sent. If so, processing continues to block 318, otherwise, processing continues to block 322. In block 322, the management module 200 continues sending at least a portion of the update to the active and standby BMCs 220, 230. In this manner, the active and standby BMCs 220, 230 may receive the update as a series of downloads or as a single download (e.g., depending on the size of the download) .
Thus, when the active BMC 220 that is actively using the external device communication medium 250 receives the download command from the management module 200, the active BMC 220 sends a corresponding download command over the mirroring link 242 to the standby BMC 230. When this download command is received, the standby BMC 230 then activates the internal device communication medium. Therefore, for the update state, both the active and standby BMCs 220, 230 actively use the communication mediums 240, 250 until the download is complete. From the management module 200 perspective, the management module 200 is executing the download as if downloading to one BMC. Therefore, with embodiments, no changes are needed to the management module 200 to operate with more than one BMC in a server 210 using this simultaneous download solution.
Continuing with FIG. 3B, in block 318, the management module 200 issues a reset download command, which indicates to the active and standby BMCs 220, 230 that they are to come out of the update state (e.g., boot mode) and start running with the updated firmware. In block 320, the active and standby BMCs 220, 230 run with the updated firmware. That is, when the management module 200 issues the reset download command, both the active and standby BMCs 220, 230 receive the reset download command and reset. On reset, the active and standby BMCs 220, 230 return to their default operating state and the BMC that is predefined as the active BMC 220 on the deice communication medium 250 reclaims that activity, while the standby BMC 230 returns to an inactive state with the internal device communication medium 240.
Thus, for each packet that the management module 200 sends on the external device communication medium 250, both the active and standby BMCs 220, 230 update the appropriate data (e.g., firmware or software), and the active BMC 220 waits for an affirmative acknowledgment from the standby BMC 230 on the mirroring link 242. The active BMC 220 acknowledges the affirmative acknowledgment of the standby BMC 230. The standby BMC 230 uses this affirm-acknowledge protocol to maintain an indicator (e.g., a bit) in set/reset states to keep the active and standby BMCs 220, 230 synched in a lock-step manner. Note that the active BMC 220 may be interrupted as a result of which timing delays may break the synchronization without use of the indicator. The active BMC 220 responds to the management module 200 for every packet upon receiving the affirmative acknowledgment from the standby BMC 230 or upon time out. Upon time out, the management module 200 may resend the packet. In certain embodiments thresholds may be added in the management module 200 to recover from active BMC 220 or standby BMC 230 faults, where the thresholds relate to timeouts by the active BMC 220 or standby BMC 230.
FIGS. 4A and 4B illustrate a firmware download protocol control flow and data flow between a management module 200 and an active BMC 220 and between the active BMC 220 and a standby BMC 230 in accordance with certain embodiments. In FIGS. 4A and 4B, the solid lines illustrate control flow, while the dashed lines illustrate data flow (i.e., message flow) .
In certain embodiments, at the beginning of the firmware download session, the active BMC 220 syncs up with the standby BMC 230 with a command-send and acknowledgment (e.g., an "OK" response). In certain embodiments, communication between the BMCs is on the mirroring link 242 (see Fig. 2) . After the active BMC 220 sends the same download command to the standby BMC 230 that the active BMC 220 receives from the management module 200, the standby BMC 230 listens on the internal device communication medium 240 and updates the firmware at the same time as the active BMC 230, which is listening on the external device communication medium 250. At the end of such a firmware update, the standby BMC 230 sends an acknowledgment (e.g., another "OK" response) to the active BMC 220. The active BMC 220 responds back with an acknowledgment that the standby BMC 230 uses to clear an internal indicator (e.g., a bit) . This indicator and the acknowledgment/response protocol for each firmware update is used to keep the active and standby BMC pair 220, 230 in lock- step so that one BMC does not go ahead of the other.
In certain embodiments, the management module 200 may reinitiate the firmware download in the event of errors or timeouts. In the event of errors or timeouts, the active BMC 220 and standby BMC 230 would remain in the update state (e.g., boot mode) .
Control begins at block 400 of FIG. 4A with the management module 200 issuing a firmware download command to the active BMC 220 via the external device communication medium 250 and waits to receive an acknowledgment. In block 402, the management module 200 receives the acknowledgment from the active BMC 220. In block 404, the management module 200 sends a next firmware refresh code section update and waits to receive an acknowledgment. From block 404 (FIG. 4A), processing continues to block 406 (FIG. 4B) . In block 406, the management module 200 receives the acknowledgment from the active BMC 420. In block 408, the management module 200 determines whether the firmware download is complete. If so, processing continues to block 410, otherwise, processing loops back to block 404 (FIG. 4A) . In block 410, the management module 200 sends a download complete message over the external device communication medium 250. Thus, the active and standby BMCs 220, 230 receive the firmware update as a series of downloads.
In FIG. 4A, in block 420, the active BMC 220 receives the firmware download command from the management module 200 and sends the firmware download command to the standby BMC via the mirroring link 242 and waits for an acknowledgment. In block 422, the active BMC 220 receives an acknowledgment from the standby BMC 230 that the firmware download command was received at the standby BMC 230 and responds with an acknowledgment to the management module 200 that indicates that the firmware download command was received at the active BMC 220. In block 424, the active BMC 220 receives the next firmware refresh code section update. In block 426, the active BMC 220 waits for an acknowledgment from the standby BMC 230. At this point in processing, the standby BMC 230 is directly receiving the firmware refresh code section update via the internal device communication medium 240, and, once the update is received, the standby BMC 230 sends an acknowledgment that indicates receipt of the firmware refresh code section update to the active BMC 220. From block 426 (FIG. 4A), processing continues to block 428 (FIG. 4B) .
In block 428, the active BMC 220 determines whether an acknowledgment has been received from the standby BMC 230. If so, processing continues to block 430, otherwise, processing continues to block 436. In block 430, the active BMC 220 sends an acknowledgment to the standby BMC 230 and to the management module 200. The acknowledgment to the standby BMC 230 indicates that an indicator may be cleared (i.e., reset) . The acknowledgment to the management module 200 indicates that the last received firmware refresh code section update has been processed.
In block 432, the active BMC 220 determines whether the firmware download is complete. If so, processing continues to block 434, otherwise, processing loops back to block 424 (FIG. 4A) to receive the next update. In block 434, the active BMC 220 returns to normal operation.
From block 428, if an acknowledgment has not been received from the standby BMC 230, processing continues to block 436, and the active BMC 220 determines whether processing by the standby BMC 230 has timed out (i.e., has taken more than a predetermined amount of time) . If so, processing continues to block 438, otherwise, processing loops back to block 426 (FIG. 4A) and waits for the acknowledgment. In block 438, the active BMC 220 processes the error. In certain embodiments, the active BMC 220 sends an error message to the management module 200 and waits to hear from the management module 200. In certain embodiments, the active BMC 220 lets the management module 200 timeout and waits to hear from the management module 200. The management module 200 may resend the firmware download command in the event that the firmware download takes longer than an amount of time allotted by the management module 200. Also, the management module 200 may timeout if the firmware download takes too long. In this event, the firmware download may be reported as failed to the end user, and the user may reinitiate the firmware download. However, if the active BMC 220 reports an error, the active BMC is reporting that firmware download failed, which is why the firmware download may begin again. In certain alternative embodiments, the active BMC 220 may return an error on the last packet received, and the management module 200 (or an initiating device) may resend just the last packet. From block 438, processing continues to block 420, and the active BMC 220 waits to receive the firmware download command from the management module 200.
In FIG. 4A, in block 450, the standby BMC 230 receives the firmware download command from the active BMC 230 and activates the internal device communication medium for reception. In block 452, the standby BMC 230 responds with an acknowledgment that the firmware download command has been received to the active BMC 220. In block 454, the standby BMC 230 receives the next firmware refresh code section update via the internal device communication medium 240, sends an acknowledgment of receipt to the active BMC 220, and sets an internal indicator. From block 454, processing continues to block 456 (FIG. 4B) .
In block 456, the standby BMC 230 receives an acknowledgment to clear the indicator from the active BMC 230 and clears the indicator. In block 458, the standby BMC 230 determines whether the firmware download is complete. If so, processing continues to block 460, otherwise, processing loops back to block 454 (FIG. 4A) to receive the next update. In block 460, the standby BMC 230 returns to normal operation.
For ease of illustration, one standby BMC has been shown. However, there may be more than one standby BMC or target device. The firmware download protocol described herein may be performed via arbitration over the shared equivalent of the mirroring link. In particular, because multiple standby BMCs may passively listen for the download, in the event that there is more than one standby BMC listening in to the firmware download, certain embodiments may require arbitration with regard to the responses the standby BMCs provide back to the active BMC. The active BMC may only obtain one response at a time from the standby BMCs, so, if one of the standby BMCs fails to get the update correctly, that standby BMC sends a message to the active BMC. Thus, the arbitration is in relation to the responses provided by the standby BMCs.
The simultaneous (e.g., "one-shot") firmware download minimizes the time for a non-disruptive firmware upgrade (i.e., to IX the time to download firmware to one device) and increases the overall availability and reliability by avoiding firmware disparity across target devices, which could happen otherwise as a result of errors, and also by reducing the time for discovery of a faulty firmware upgrade.
Thus, embodiments exploit the existence of an additional device communication medium (i.e., the mirroring link) between redundant devices that may normally exist for detection for failure of the active device. This additional device communication medium is distinct from the device communication medium that is used for firmware upgrades and may be used to correctly and reliably accomplish a simultaneous firmware upgrade.
IBM and BladeCenter are registered trademarks or common law marks of International Business Machines Corporation in the United States and/or other countries.
Additional Embodiment Details
The described operations may be implemented as a method, computer program product or apparatus using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof.
Each of the embodiments may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment containing both hardware and software elements. The embodiments may be implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.
Furthermore, the embodiments may take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. For the purposes of this description, a computer-usable or computer readable medium may be any apparatus that may contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
The described operations may be implemented as code maintained in a computer-usable or computer readable medium, where a processor may read and execute the code from the computer readable medium. The medium may be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium. Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a rigid magnetic disk, an optical disk, magnetic storage medium (e.g., hard disk drives, floppy disks, tape, etc.), volatile and non-volatile memory devices (e.g., a random access memory (RAM), DRAMs, SRAMs, a read-only memory (ROM) , PROMs, EEPROMs, Flash Memory, firmware, programmable logic, etc.) . Current examples of optical disks include compact disk - read only memory (CD-ROM), compact disk - read/write (CD-R/W) and DVD.
The code implementing the described operations may further be implemented in hardware logic (e.g., an integrated circuit chip, Programmable Gate Array (PGA) , Application Specific Integrated Circuit (ASIC), etc.). Still further, the code implementing the described operations may be implemented in "transmission signals", where transmission signals may propagate through space or through a transmission media, such as an optical fiber, copper wire, etc. The transmission signals in which the code or logic is encoded may further comprise a wireless signal, satellite transmission, radio waves, infrared signals, Bluetooth, etc. The transmission signals in which the code or logic is encoded is capable of being transmitted by a transmitting station and received by a receiving station, where the code or logic encoded in the transmission signal may be decoded and stored in hardware or a computer readable medium at the receiving and transmitting stations or devices.
A computer program product may comprise computer useable or computer readable media, hardware logic, and/or transmission signals in which code may be implemented. Of course, those skilled in the art will recognize that many modifications may be made to this configuration without departing from the scope of the embodiments, and that the computer program product may comprise any suitable information bearing medium known in the art.
The term logic may include, by way of example, software, hardware, firmware, and/or combinations of software and hardware.
Certain implementations may be directed to a method for deploying computing infrastructure by a person or automated processing integrating computer-readable code into a computing system, wherein the code in combination with the computing system is enabled to perform the operations of the described implementations.
The logic of FIGs. 3A, 3B, 4A, and 4B describes specific operations occurring in a particular order. In alternative embodiments, certain of the logic operations may be performed in a different order, modified or removed. Moreover, operations may be added to the above described logic and still conform to the described embodiments. Further, operations described herein may occur sequentially or certain operations may be processed in parallel, or operations described as performed by a single process may be performed by distributed processes.
The illustrated logic of FIGs. 3A, 3B, 4A, and 4B may be implemented in software, hardware, programmable and non-programmable gate array logic or in some combination of hardware, software, or gate array logic. FIG. 5 illustrates a system architecture 500 that may be implemented by the server 210 in accordance with certain embodiments. The system architecture 500 is suitable for storing and/or executing program code and includes at least one processor 502 coupled directly or indirectly to memory elements 504 through a system bus 520. The memory elements 504 may include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution. The memory elements 504 include an operating system 505 and one or more computer programs 506. The memory elements 504 may also include code 530 that implements some or all of the described operations taught by embodiments of the invention. Although code 530 is shown, the described operations taught by embodiments of the invention may alternatively be implemented in hardware or in a combination of hardware and software.
Input/Output (I/O) devices 512, 514 (including but not limited to keyboards, displays, pointing devices, etc.) may be coupled to the system either directly or through intervening I/O controllers 510.
Network adapters 508 may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters 508.
The system architecture 500 may be coupled to storage 516 (e.g., a non-volatile storage area, such as magnetic disk drives, optical disk drives, a tape drive, etc.) . The storage 516 may comprise an internal storage device or an attached or network accessible storage. Computer programs 506 in storage 516 may be loaded into the memory elements 504 and executed by a processor 502 in a manner known in the art.
The system architecture 500 may include fewer components than illustrated, additional components not illustrated herein, or some combination of the components illustrated and additional components. The system architecture 500 may comprise any computing device known in the art, such as a mainframe, server, personal computer, workstation, laptop, handheld computer, telephony device, network appliance, virtualization device, storage controller, etc.
FIG. 6 illustrates a system architecture 600 that may be implemented with functionality of a management module 200 in accordance with certain embodiments. The system architecture 600 is suitable for storing and/or executing program code and includes at least one processor 602 coupled directly or indirectly to memory elements 604 through a system bus 620. The memory elements 604 may include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution. The memory elements 604 include an operating system 605 and one or more computer programs 606. The memory elements 604 may also include code 630 that implements some or all of the described operations taught by embodiments of the invention. Although code 630 is shown, the described operations taught by embodiments of the invention may alternatively be implemented in hardware or in a combination of hardware and software.
Input/Output (I/O) devices 612, 614 (including but not limited to keyboards, displays, pointing devices, etc.) may be coupled to the system either directly or through intervening I/O controllers 610.
Network adapters 608 may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters 608.
The system architecture 600 may be coupled to storage 616 (e.g., a non-volatile storage area, such as magnetic disk drives, optical disk drives, a tape drive, etc.) . The storage 616 may comprise an internal storage device or an attached or network accessible storage. Computer programs 606 in storage 616 may be loaded into the memory elements 604 and executed by a processor 602 in a manner known in the art.
The system architecture 600 may include fewer components than illustrated, additional components not illustrated herein, or some combination of the components illustrated and additional components. The system architecture 600 may comprise any computing device known in the art, such as a mainframe, server, personal computer, workstation, laptop, handheld computer, telephony device, network appliance, virtualization device, storage controller, etc.
The foregoing description of embodiments of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the embodiments to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. It is intended that the scope of the embodiments be limited not by this detailed description, but rather by the claims appended hereto. The above specification, examples and data provide a complete description of the manufacture and use of the composition of the embodiments. Since many embodiments may be made without departing from the scope of the embodiments, the embodiments reside in the claims hereinafter appended or any subsequently-filed claims, and their equivalents .

Claims

1 . A method compri s ing steps of :
receiving a download command from an initiating device via a first communication medium (250) that indicates that a firmware update is to be downloaded to a first target device upon which the firmware is to be installed, wherein the first target device is included within a server, and the server also includes a second target device which acts as a mirror device to the first target device, and wherein the initiating device sends the download command by sending a message to an address on the first communication medium that is associated with a slot of the server;
issuing the download command to the second target device via a second communication medium (242), wherein the second communication medium is a private device communication medium that enables private communication between the first target device and the second target device, private in the sense that the initiating device is not able to send communications on the second communication medium;
the first target device receiving the update via the first communication medium (250) ; and
the second target device receiving the update from the initiating device via a third communication medium (240) which is internal to the server and which branches off from the first communication medium (250), wherein the second target device begins listening on the third communication medium in response to receiving the download command from the first target device over the second communication medium (242) .
2. The method of claim 1, further comprising:
under control of the first target device,
waiting to receive an acknowledgment from the second target device indicating that the download command has been received.
3. The method of claim 1, wherein the update is received as a series of downloads and further comprising:
under control of the first target device,
waiting to receive an acknowledgment from the second target device for each one of the series of downloads; and
in response to determining that the acknowledgment has not been received from the second target device within a predetermined amount of time, processing an error.
4. The method of claim 1, wherein the update is received as a series of downloads and further comprising:
under control of the second target device,
sending an acknowledgment to the first target device upon receipt of each one of the series of downloads.
5. The method of claim 1, wherein the second target device maintains an indicator to maintain synchronization between the first target device and the second target device while receiving the update.
6. A system comprising means adapted for carrying out all the steps of the method according to any preceding method claim.
7. A computer program comprising instructions for carrying out all the steps of the method according to any preceding method claim, when said computer program is executed on a computer system.
PCT/EP2006/069438 2005-12-14 2006-12-07 Simultaneous download to multiple targets WO2007068650A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/304,407 US7814479B2 (en) 2005-12-14 2005-12-14 Simultaneous download to multiple targets
US11/304,407 2005-12-14

Publications (2)

Publication Number Publication Date
WO2007068650A2 true WO2007068650A2 (en) 2007-06-21
WO2007068650A3 WO2007068650A3 (en) 2007-08-23

Family

ID=37700759

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2006/069438 WO2007068650A2 (en) 2005-12-14 2006-12-07 Simultaneous download to multiple targets

Country Status (2)

Country Link
US (1) US7814479B2 (en)
WO (1) WO2007068650A2 (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7383461B2 (en) * 2004-02-12 2008-06-03 International Business Machines Corporation Method and system to recover a failed flash of a blade service processor in a server chassis
US8214827B2 (en) * 2005-12-05 2012-07-03 Flash Networks, Ltd Method and system for improving user confidence and experience in content purchasing via a service provider premises
US7908470B1 (en) * 2006-10-31 2011-03-15 Hewlett-Packard Development Company, L.P. Multi-processor computer with plural boot memories
US8645938B2 (en) * 2006-11-17 2014-02-04 International Business Machines Corporation System and method for replacing code
US8495292B2 (en) 2006-12-06 2013-07-23 Fusion-Io, Inc. Apparatus, system, and method for an in-server storage area network
US8776037B2 (en) * 2007-01-04 2014-07-08 International Business Machines Corporation Apparatus and method to update multiple devices disposed in a computing system
US8185886B2 (en) * 2007-06-26 2012-05-22 Intel Corporation Method and apparatus to enable dynamically activated firmware updates
US8082439B2 (en) * 2007-12-06 2011-12-20 Hewlett-Packard Development Company, L.P. Firmware modification in a computer system environment supporting operational state changes
US8306652B2 (en) * 2008-03-14 2012-11-06 International Business Machines Corporation Dual-band communication of management traffic in a blade server system
US8402455B2 (en) * 2008-03-17 2013-03-19 Landis+Gyr Innovations, Inc. Methods and systems for distributing firmware through an over-the-air network
EP2129041A1 (en) * 2008-05-27 2009-12-02 Siemens Milltronics Process Instruments Inc. A network and method for updating a plurality of devices in such a network
US8392904B2 (en) * 2009-03-12 2013-03-05 International Business Machines Corporation Apparatus, system, and method for efficient code update
US8495618B1 (en) * 2010-03-31 2013-07-23 American Megatrends, Inc. Updating firmware in a high availability enabled computer system
JP5655557B2 (en) 2010-07-12 2015-01-21 ソニー株式会社 Microscope control device, image display device, image management server, focus position information generation method, image display method, image management method, and microscope image management system
US8972966B2 (en) * 2012-01-05 2015-03-03 Lenovo (Singapore) Pte. Ltd. Updating firmware in a hybrid computing environment
CN105739656A (en) * 2014-12-08 2016-07-06 营邦企业股份有限公司 Cabinet with automatic reset function and automatic reset method thereof
KR101705424B1 (en) * 2015-01-22 2017-02-09 인천국제공항공사 System for remote updating firmwave of individual lighting controller in airfield light system
US20160239370A1 (en) * 2015-02-12 2016-08-18 Aic Inc. Rack having automatic recovery function and automatic recovery method for the same
US9921915B2 (en) * 2015-10-16 2018-03-20 Quanta Computer Inc. Baseboard management controller recovery
CN105335265A (en) * 2015-12-10 2016-02-17 浪潮电子信息产业股份有限公司 Automatic BMC upgrading and downgrading testing method based on LINUX platform
CN105550090A (en) * 2015-12-10 2016-05-04 浪潮电子信息产业股份有限公司 LINUX platform-based automatic BMC pressure test method
US10051455B2 (en) * 2016-03-18 2018-08-14 Plume Design, Inc. Systems and methods for changing topology and firmware in distributed wi-fi networks
US10114633B2 (en) 2016-12-08 2018-10-30 International Business Machines Corporation Concurrent I/O enclosure firmware/field-programmable gate array (FPGA) update in a multi-node environment
CN107193594A (en) * 2017-05-17 2017-09-22 郑州云海信息技术有限公司 A kind of USB interface-based principal and subordinate BMC firmware upgrade methods
CN107153558A (en) * 2017-05-17 2017-09-12 郑州云海信息技术有限公司 A kind of principal and subordinate's BMC firmware upgrade methods based on SPI interface
CN107748667A (en) * 2017-10-23 2018-03-02 郑州云海信息技术有限公司 A kind of method for lifting server power supply online updating firmware reliability
FR3078799B1 (en) * 2018-03-12 2021-06-04 Bull Sas MANAGEMENT OF CONFIGURATION DATA FOR A MULTIMODULE SERVER
CN109445822B (en) * 2018-11-02 2021-10-22 郑州云海信息技术有限公司 Method and system for refreshing double-mirror-image BMC (baseboard management controller) in batches by CMC (controller card)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5974312A (en) * 1997-07-10 1999-10-26 Ericsson Inc. System and method for updating a memory in an electronic device via wireless data transfer
US6113652A (en) * 1995-04-27 2000-09-05 General Data Comm, Inc. Communications network equipment capable of non-disruptive software upgrade
US20030237018A1 (en) * 2002-06-25 2003-12-25 Hitachi, Ltd. Server takeover system and method

Family Cites Families (64)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5826075A (en) 1991-10-16 1998-10-20 International Business Machines Corporation Automated programmable fireware store for a personal computer system
US5925140A (en) 1993-04-30 1999-07-20 Intel Corporation Apparatus and method for error free loading of a programmable non-volatile memory over a datalink
US5467286A (en) 1993-11-03 1995-11-14 Square D Company Metering unit with downloadable firmware
US6009500A (en) 1995-06-07 1999-12-28 Compaq Computer Corporation Replacement of erroneous firmware in a redundant non-volatile memory system
US6018806A (en) 1995-10-16 2000-01-25 Packard Bell Nec Method and system for rebooting a computer having corrupted memory using an external jumper
JPH09325904A (en) 1996-06-04 1997-12-16 Nec Corp Firmware information backup method
US5793943A (en) 1996-07-29 1998-08-11 Micron Electronics, Inc. System for a primary BIOS ROM recovery in a dual BIOS ROM computer system
US6247168B1 (en) * 1997-04-29 2001-06-12 Rockwell Technologies, Llc Embedded non-volatile programming tool
US6055632A (en) 1997-09-25 2000-04-25 Allen-Bradley Company, Llc Method and apparatus for transferring firmware to a non-volatile memory of a programmable controller system
JP3437083B2 (en) 1998-02-23 2003-08-18 富士通株式会社 I / O control device and firmware updating method thereof
US6199194B1 (en) 1998-09-25 2001-03-06 Adaptec, Inc. Method and system for programming firmware over a computer network
JP2000307591A (en) 1999-04-19 2000-11-02 Hitachi Telecom Technol Ltd Automatic fault restoration device for information processing system
US20040030768A1 (en) 1999-05-25 2004-02-12 Suban Krishnamoorthy Unified system and method for downloading code to heterogeneous devices in distributed storage area networks
US6332198B1 (en) 2000-05-20 2001-12-18 Equipe Communications Corporation Network device for supporting multiple redundancy schemes
US6442067B1 (en) 2000-05-23 2002-08-27 Compaq Information Technologies Group, L.P. Recovery ROM for array controllers
US6675258B1 (en) * 2000-06-30 2004-01-06 Lsi Logic Corporation Methods and apparatus for seamless firmware update and propagation in a dual raid controller system
JP2002091719A (en) 2000-07-11 2002-03-29 Ricoh Co Ltd System and method for managing image forming device, central managing device to be used therefor and image forming device
US6665813B1 (en) 2000-08-03 2003-12-16 International Business Machines Corporation Method and apparatus for updateable flash memory design and recovery with minimal redundancy
US6907602B2 (en) 2000-08-10 2005-06-14 Mustek Systems Inc. Method for updating firmware of computer device
US20020087668A1 (en) 2000-12-29 2002-07-04 San Martin Raul S. Automatic upgrade of live network devices
US6813571B2 (en) 2001-02-23 2004-11-02 Power Measurement, Ltd. Apparatus and method for seamlessly upgrading the firmware of an intelligent electronic device
US20040015940A1 (en) 2001-05-29 2004-01-22 3Com Corporation Intelligent device upgrade engine
US7076576B2 (en) 2001-06-19 2006-07-11 Fujitsu Limited Data transfer in multi-node computer system
US6836859B2 (en) * 2001-08-15 2004-12-28 Sun Microsystems, Inc. Method and system for version control in a fault tolerant system
GB0123861D0 (en) 2001-10-04 2001-11-28 Marconi Comm Ltd Comunications networks
US6983397B2 (en) 2001-11-29 2006-01-03 International Business Machines Corporation Method, system, and program for error handling in a dual adaptor system where one adaptor is a master
US6957361B2 (en) 2001-11-29 2005-10-18 International Business Machines Corporation Method, system, and program for error handling in a dual adaptor system
US7069349B2 (en) 2002-01-10 2006-06-27 Intel Corporation IPMI dual-domain controller
US6988182B2 (en) 2002-02-13 2006-01-17 Power Measurement Ltd. Method for upgrading firmware in an electronic device
US7143275B2 (en) 2002-08-01 2006-11-28 Hewlett-Packard Development Company, L.P. System firmware back-up using a BIOS-accessible pre-boot partition
US7036007B2 (en) 2002-09-09 2006-04-25 Intel Corporation Firmware architecture supporting safe updates and multiple processor types
US20040076043A1 (en) 2002-10-21 2004-04-22 Phoenix Technologies Ltd. Reliable and secure updating and recovery of firmware from a mass storage device
US7246266B2 (en) 2002-11-21 2007-07-17 Chris Sneed Method and apparatus for firmware restoration in modems
US7085957B2 (en) 2002-11-21 2006-08-01 Texas Instruments Incorporated Upgrading of firmware with tolerance to failures
JP2004199277A (en) 2002-12-17 2004-07-15 Fujitsu Ltd Bios redundancy management method, data processor, and storage system
US7145682B2 (en) 2002-12-20 2006-12-05 Hewlett-Packard Development Company, L.P. Temporary printer firmware upgrade
JP2004302991A (en) 2003-03-31 2004-10-28 Giga-Byte Technology Co Ltd Backup method of interface bios for computer system
US6978452B2 (en) 2003-04-02 2005-12-20 Beach Unlimited Llc Upgrading digital media servers
US6996673B2 (en) 2003-08-21 2006-02-07 International Business Machines Corporation Method and apparatus for managing inventory and door status during firmware update of an automated data storage library
US7219201B2 (en) * 2003-09-17 2007-05-15 Hitachi, Ltd. Remote storage disk control device and method for controlling the same
JP4598387B2 (en) * 2003-09-17 2010-12-15 株式会社日立製作所 Storage system
US20050081090A1 (en) 2003-09-29 2005-04-14 Giga-Byte Technology Co., Ltd. Method for automatically and safely recovering BIOS memory circuit in memory device including double BIOS memory circuits
JP4420275B2 (en) 2003-11-12 2010-02-24 株式会社日立製作所 Failover cluster system and program installation method using failover cluster system
US7080279B2 (en) 2003-12-03 2006-07-18 International Business Machines Corporation Autonomic rollback
US7376943B2 (en) 2003-12-18 2008-05-20 Lsi Corporation Safe method for upgrading firmware of optical disk product
JP2005242555A (en) 2004-02-25 2005-09-08 Hitachi Ltd Storage control system and method for loading firmware on disk type storage device owned by storage control system
US20050223372A1 (en) 2004-04-01 2005-10-06 Borchers Gregory E Methods and systems for firmware download configuration
JP2007534095A (en) 2004-04-20 2007-11-22 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Restoring all programmable content and firmware of optical drives
US7797525B2 (en) 2004-07-01 2010-09-14 Hewlett-Packard Development Company, L.P. Operating system installation
US20060069825A1 (en) 2004-09-24 2006-03-30 Hodder Leonard B Method and system of transferring firmware from a host device to a printing device
US20060085564A1 (en) 2004-10-14 2006-04-20 Bomhoff Matthew D Flash mirroring
JP2006127252A (en) 2004-10-29 2006-05-18 Fujitsu Component Ltd Switch, switching method and program
JP4929726B2 (en) 2005-03-07 2012-05-09 富士ゼロックス株式会社 Image processing system
US7673174B2 (en) 2005-04-06 2010-03-02 University Of Washington Recovering device drivers
TWI345175B (en) 2005-06-08 2011-07-11 Winbond Electronics Corp Method for updating firmware of memory card
US7907531B2 (en) 2005-06-13 2011-03-15 Qualcomm Incorporated Apparatus and methods for managing firmware verification on a wireless device
TW200713037A (en) 2005-09-06 2007-04-01 Benq Corp System and method for updating firmware
US7600055B2 (en) 2006-01-03 2009-10-06 International Business Machines Corporation Apparatus, system, and method for firmware update of redundant controllers
JP4486609B2 (en) 2006-03-31 2010-06-23 富士通株式会社 Storage device, storage control firmware active program replacement method for storage device, and storage control firmware active program replacement program
US20080027602A1 (en) 2006-05-30 2008-01-31 Yeap Tet H System and method for deterring theft of vehicles and other products having integral computer means
US20080109647A1 (en) 2006-11-07 2008-05-08 Lee Merrill Gavens Memory controllers for performing resilient firmware upgrades to a functioning memory
US8286156B2 (en) 2006-11-07 2012-10-09 Sandisk Technologies Inc. Methods and apparatus for performing resilient firmware upgrades to a functioning memory
US9112891B2 (en) 2007-02-02 2015-08-18 Sharp Laboratories Of America, Inc. Remote firmware management for electronic devices
US20080235501A1 (en) 2007-03-19 2008-09-25 James Ray Bailey Method For Detecting and Correcting Firmware Corruption

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6113652A (en) * 1995-04-27 2000-09-05 General Data Comm, Inc. Communications network equipment capable of non-disruptive software upgrade
US5974312A (en) * 1997-07-10 1999-10-26 Ericsson Inc. System and method for updating a memory in an electronic device via wireless data transfer
US20030237018A1 (en) * 2002-06-25 2003-12-25 Hitachi, Ltd. Server takeover system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
QIAN A: "UPDATING BIOS AND FIRMWARE ON A GROUP OF SERVERS" INTERNET CITATION, [Online] May 2002 (2002-05), XP002295550 Retrieved from the Internet: URL:http://www1.us.dell.com/content/topics /global.aspx/power/en/ps2q02_qi an?c=us&cs=RC968571&1=en&s=hea> [retrieved on 2004-09-07] *

Also Published As

Publication number Publication date
US7814479B2 (en) 2010-10-12
WO2007068650A3 (en) 2007-08-23
US20070169106A1 (en) 2007-07-19

Similar Documents

Publication Publication Date Title
US7814479B2 (en) Simultaneous download to multiple targets
US11194679B2 (en) Method and apparatus for redundancy in active-active cluster system
US7761734B2 (en) Automated firmware restoration to a peer programmable hardware device
US7661022B2 (en) System for error handling in a dual adaptor system where one adaptor is a master
US7761735B2 (en) Automated firmware restoration to a peer programmable hardware device
US7721155B2 (en) I2C failure detection, correction, and masking
CN110580235B (en) SAS expander communication method and device
US8676751B2 (en) High availability database systems and methods
JP5186551B2 (en) Method and program for restoring automatic firmware of peer programmable hardware device
US8055934B1 (en) Error routing in a multi-root communication fabric
WO2022155919A1 (en) Fault handling method and apparatus, and system
US20030154288A1 (en) Server-client system and data transfer method used in the same system
CN109445984B (en) Service recovery method, device, arbitration server and storage system
US7584271B2 (en) Method, system, and computer readable medium for delaying the configuration of a shared resource
JP2021111137A (en) Cluster system, server device, takeover method and program
US20140173120A1 (en) Method for establishing communication, computer system, and computer

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 06830443

Country of ref document: EP

Kind code of ref document: A2

122 Ep: pct application non-entry in european phase

Ref document number: 06830443

Country of ref document: EP

Kind code of ref document: A2