WO1998019239A1

WO1998019239A1 - Distributed virtual software interface or machine

Info

Publication number: WO1998019239A1
Application number: PCT/US1997/019304
Authority: WO
Inventors: Christopher J. Thompson; Phillippe Kruchten
Original assignee: Raytheon Systems Canada Ltd.
Priority date: 1996-10-25
Filing date: 1997-10-24
Publication date: 1998-05-07

Abstract

A distributed virtual software interface (20) that provides a reusable framework for building large, distributed fault tolerant Ada applications. The software architecture (20) is implemented in a distributed computer system (30) having a plurality of computers (31) that are interconnected by way of a network (13), and wherein each computer (31) has an operating system. The architecture (20) includes a distributed intermediate software layer (21) that is distributed among the plurality of computers (31) and interfaces with the operating system (11) of the computer (31) on which it is disposed. The distributed intermediate software layer (21) generates intermediate instructions that cause the operating system (11) to implement primitive operating system instructions. The distributed virtual software interface (20) has a distributed object-oriented software layer (22) distributed among the plurality of computers (31) that provides communication between computers (31) using objects. The distributed object-oriented software layer (22) includes instructions that distribute objects of the same class to computers (31) that are linked by attributes, operations and associations between objects within a class in response to the creation of a new object on one of the computers (31). Communication between computers (31) is provided using a number of predefined communication protocols. The distributed object-oriented software layer (22) interfaces with the distributed middle software layer (21) disposed on each respective computer (31) and generates the intermediate instructions. At least one software application is provided on each computer (31) that interfaces with the distributed object-oriented software layer (22) and processes objects routed to it.

Description

DISTRIBUTED VIRTUAL SOFTWARE INTERFACE OR MACHINE

BACKGROUND

The present invention relates generally to distributed, fault tolerant computer systems, and more particularly, to a distributed virtual software interface or machine that interfaces between computers, operating systems and applications that run on the computers of a distributed, multi-computer fault tolerant computer system. Computers that are used in distributed, fault tolerant computer systems, for example, have an operaung system that embodies primitive instructional codes that implement various services. Such primitive instructional codes route data to and from storage devices, write data to display devices, send data to a printer or modem, and transmit data across a network to other computers, for example. Unfortunately, conventional computers and computer operating systems inherently limit the development of large, distributed, fault tolerant computer systems, such as air traffic control systems, and the like.

Conventional computer architectures and operating systems do not directly support distributed computing environments where there is direct cooperation between several processes required to perform specified system functions, including those with client/server relationships. In addition, conventional computer architectures and operating systems are not easily reconfigured on demand or in response to detected hardware or operating system faults so that continuous service is provided in the event of such faults. There are two primary motivations for building a distributed virtual machine as contemplated by the present invention. The first is to permit large scale software reuse within a product line (including all of its related parts: process, design, source code, and documentation. The second is separating the functional concerns of the application domain from the requirements of building a reliable distributed computer system. Accordingly, it is an objective of the present invenuon to provide for a distπbuted virtual software interface that interfaces between computers, operating systems and applications that run on the computers of a distπbuted. multi-computer fault tolerant computer system

SUMMARY OF THE INVENTION To meet the above and other oDjectives, the present invention provides for a distnbuted virtual software interface that provides a reusable framework for building large, distπbuted fault tolerant Ada applications Ada is a registered trademark of the U.S. government. Ada Joint Program Office The framework, referred to as a distπbuted virtual macmne or αistnbuted virtual software interface, provides a portable suite of integrated services for building a wide vaπety of distπbuted. fault tolerant computer systems The distπbuted virtual software interface specifically provides a foundation for a family of automated air traffic control systems currently under development by the assignee of the present invenuon.

The distπbuted virtual software interface provides for an object-oπented software architecture that is implemented in a distπbuted computer system having a plurality of computers that are interconnected by way of a network Each computer has an operating system that implements pnmitive operating system instructions such as networ communications, data storage, data display, and the like

The arcnitecture compπses a distπbuted intermediate software layer that is distπbuted among the plurality of computers and interfaces with the operating system of the computer on which it is disposed and that generates intermediate instructions that cause the operating system to implement pnmitive operaung system instructions in response thereto. A distπbuted object-oπented software layer is distributed among the plurality of computers providing communication between computers using objects that are instances of object classes that are defined by attπbutes of objects, operations on objects and associations between objects The distπbuted object-oπented software layer compπses instructions that distribute objects of the same class to computers that are linked by the attπbutes, operauons and associations between the objects within the ciass in response to the creation of a new object on one of the computers. Communicauon between computers is provided using predefined communication protocols. At least one software application is disposed on each computer that interfaces with the distributed object-oπented software layer disposed on the respective computer and processes objects distributed to it.

The distributed virtual software interface provides an "abstract instruction set" for developing distributed computer systems, facilitating application development without concern for the actual physical distribution of the computers forming the system. This level of abstraction is essential for managing complex air traffic control systems, for example, which typically require the development of up to one million lines of Ada source code to effectively operate the distπbuted system.

The architecture of the distπbuted virtual software interface supports a distπbuted computing system where there is direct cooperation between processes required to perform specified system functions, including client/server relationships. In addition, distπbuted computer systems employing the distπbuted virtual software interface may also be reconfigured on demand or in resDonse to detected faults so that the system provides continuous uninterrupted service. The architecture of the distributed virtual software interface provides a robust architectural framework for building large distπbuted Ada-based computer systems. The architecture of the distπbuted virtual software interface provides a framework for building similar fault tolerant applications through large scale reuse of previously developed blocks of code. Large blocks of the distributed virtual software interface may be reused within an array of computer systems, thus minimizing development costs of additional systems.

BRIEF DESCRIPTION OF THE DRAWINGS

The various features and advantages of the present invention may be more readily understood with reference to the following detailed descπption taken in conjunction with the accompanying drawings, wherein like reference numerals designate like structural elements, and in which:

Fig. 1 illustrates a conventional software architecture known in the prior art; Fig. 2 is a diagram showing an object-oriented software architecture implemented in a distributed computer system in accordance with the pπnciples of the present invention;

Fig. 3 shows a static view of a typical software architecture built using the object-oriented software architecture of Fig. 2;

Fig. 4 shows a logical view of a set of classes making use of distributed object services of the object-oriented software architecture;

Fig. 5 shows an illustration of an example object scenario based on the class diagram of Fig. 4, using additional operation annotations; 239

Fig. 6 shows the same sequence of operations depicted in the scenario view of Fig. 5. but showing an actual distribution of the resulting software;

Fig. 7 illustrates independent processing elements that compπse a class implemented using distributed object services provided by the object-oπented software architecture of Fig. 2, and

Fig. 8 illustrates elements of a distπbuted class.

DETAILED DESCRIPTION

Referring to the drawing figures. Fig. 1 illustrates a conventional software architecture 10 that may be used with a plurality of computers 14 connected to a network 13 Each of the computers 14 has an operating system 11 that permits communicauon between the computers 14 and that implements pnmitive instructions that operates each computer 14 Typicailv. one or more software applications 12 run on each computer 14 In an example of an air traffic control system, such software applications 12 may include flight management software 12a, conflict prediction software 12b, sector management software 12c. flow management software 12d. radar data processing software 12, and display data processing software 12f, for example. In the conventional software architecture 10 running on each of the networked computers 14, each software application 12 directly communicates with every other application using low level operating system services. This results in tight coupling between the applications, wherein each application knows of the existence of the other applications. In addition, a large amount of replicated funcuonaiity is implemented within each apphcauon. Furthermore, eacn application is dependent on the particular computer operating system and related hardware To improve upon the conventional software architecture 10 of Fig. 1, Fig. 2 shows an improved distπbuted object-oπented architecture 20 that forms a distπbuted virtual software interface or machine in accordance with the pπnciples of the present invention. The distributed object-oπented framework realized by the distπbuted virtual software interface allows each of the applications to be less dependent on each other due to its implicit invocation capabilities, allows common functions to be implemented once within the framework rather than implemented repeatedly within the applications, and allows applications to be developed without direct dependency on the operating system or hardware. The distπbuted object-oπented architecture 20 is implemented in a distπbuted computer system 30 having a plurality of computers 31 that each have an operating system 1 1 and that are interconnected by way of a network 13.

The architecture 20 includes a distπbuted intermediate software layer 21 that is distπbuted among the plurality of computers 31. At least a portion of the distributed 9239

intermediate software layer 21 is disposed on each computer 31, and interfaces with the operating system 1 1 of the computer 31 on which it resides The distributed intermediate sortware layer 21 causes the operating system 11 to implement pnmitive operating system instructions in response to intermediate instructions that are generated thereby A distπbuted object-oπented software layer 22 is distπbuted among the plurality of computers 31 At least a portion of the distπbuted object-oπented software layer 22 is disposed on each computer 31. and interfaces with the respective portion of the distributed middle software layer 21 residing on the computer 31 At least one software application is disposed on each computer 31 that interfaces with the distributed object-oπented software layer 22 disposed on the respective computer 31

The distπbuted object-onented software layer 22 provides communication between computers 31 using objects that are instances of object classes that are defined by attributes of objects operations on obiects and associations between objects The distπbuted object-oπented software layer 22 comprises instructions that distribute objects of the same class to computers 31 that are linked by the attπbutes, operauons and associations between the objects within the class in response to the creation of a new object on one of the computers 31 Communication between the computers 31 is provided using predefined, well-known, communication protocols, including asynchronous remote procedure call (ARPC), remote procedure call (RPC), asynchronous message communication (AMC), and broadcast protocols The distπbuted ob_ject- oπented software layer 22 interfaces with the distπbuted middle software layer 21 disposed on each computer 31 and causes generation of the intermediate instructions

Fig 3 shows that large scale reuse is achieved by systematically supporting vaπability in an underlying commercial off-the-shelf (COTS) and hardware environment that includes the distπbuted middleware layer 21 (such as the Universal Network Architecture Services (UNAS) software) available from TRW Data Technologies Division as well as in the application specific functionality, which provides the ability to add new functionality or tailor existing functionality The separauon of the functional concerns from those of building a reliable distπbuted system is achieved by embodying resources that route data (distπbute objects) in the distπbuted virtual software interface 22 and providing a the simple, powerful interface 22 that permits apphcauon development The distπbuted virtual software interface 22 has been designed with a number of architectural objectives including portability, support of an object-oriented design, support for the distribution of objects, support for event driven solutions, support for late binding of application processes to the actual physical architecture (virtual node - physical node mapping), and support for fault tolerance 239

The distributed virtual software interface 22 is portable because it does not depend on non-portable COTS software products, it uses a well-known International Standards Organization ( ISO) POSIX interface and Simple Network Management Protocol (SNMP) standards, uses Ada type/byte stream conversions for communication and storage of data in a heterogeneous environment, and isolates hardware or operating system dependent features to software packages with portable specifications. The distπbuted virtual machine 22 uses the UNAS COTS software layer 21 which is portable to a wide range of hardware and operating system platforms.

A key aspect of the distributed virtual software interface 22 is its distπbuted object service which provides access to objects of a given class in a transparent manner with respect to their physical locauons The distπbuted object service allows distπbuted application classes to be designed with inteπaces where the details of the distπbuted environment are encapsulated berund the class interface The distπouted object service provides access to objects in a distributed environment, supports physical distribution of object state (through rep cauon ) within tne distπbuted environment while ensuπng data consistency, and provides class operation delegation which allows a class operation to be invoked in one process and executed in another, such as on a different computer 31 on the network 13

The distπbuted virtual machine 22 also supports large scale reuse of the software through its support for event-dπven invocation (implicit invocation). Event- dπven invocation reduces coupling between cooperating objects and helps build software architectures that are more resilient to changes The distπbuted virtual machine 22 supports the addition of new classes and customer specific functionality on top of core application classes via implicit invocation based on cnanges in the distπbuted state of the core application classes This is accomplished through registration, using a notification (NOTI) protocol, that allow classes to register to be noufied ' of changes to the distπbuted application state Once registered, these classes have their operations implicitly invoked when a specified change in the distπbuted state occurs This allows new application classes to be added on top of a core application without modification (or changes in compilauon dependencies) to the core application The distπbuted virtual machine 22 permits design decisions as to how distπbuted applications are ultimately partitioned into independent processes (execuung Ada programs) and how those processes are allocated to the available distπbuted hardware resources to be deferred until late m an application development process and easily changed dunng the development process This late binding to the physical architecture and repartitioning into processes (with no or little code modifications) makes systems built on top of the distπbuted virtual machine 22 easily managed dunng 239

7

development (when small, simplified configurations are often desirable). This late binding aspect of the present invention also makes the systems easily scaleable to widely different hardware configurations in support of large scale software reuse. The distπbuted virtual machine 22 provides direct support for building fault-tolerant distributed applications. Fault-tolerance is supported through a well-defined fault detection and recovery taxonomy, and througn automauc and manual reconfiguration of the processes within a local area based distnbuted network 13 (LAN) of computers 31.

The architecture of the distnbuted virtual machine 22 will be further descnbed with reference to Figs. 4-8. The architecture of the distπbuted virtual machine 22 may be "viewed" from different perspectives including, a static view (Fig. 4) that describes the organization of the software in a development environment (and its compilation dependencies), a logical view (Fig. 5 ) that is an object oriented model of the design, a dynamic view (Fig 6) that captures the concurrency and synchronization aspects of the design, a physical view (Fig. 7) that describes the mapping of the software onto the hardware, and a scenano view (Fig. 7j that describes its use. Fig. 8 illustrates elements of a distnbuted class employed with the distπbuted virtual machine 22.

Refernng to Fig. 4, it shows a static view of a typical software architecture 20 built using the distnbuted virtual machine 22. The architecture 20 is divided into layers 51-54 where each layer only depends on the layers 51-54 below it, in that all compilation dependencies are downward only). In this view, the distnbuted virtual machine 22 sits at the bottom of the developed software hierarchy in terms of dependencies. The distπbuted virtual machine 22 depends on a POSLX interface to the operating system 1 1. a small numoer of operating system services outside of the POSLX interface, and the UNAS sortware layer 21 (Fig. 2) The distributed virtual machine 22 provides a simple, but complete interface to classes of applications 24 that directly make use of it. Exported services provided by the distπbuted virtual macrune 22 (object distπbution, time distribution, and synchronization of objects, event and error recording) allow application developers to focus on developing application funcuonality, with minimal concern regarding how the application operates in a distnbuted environment. Because the distπbuted virtual machine 22 supports event dπven invocation, a top layer 51 of functionality is supported that makes use of the core apphcauon layer 52 and is able to initiate operations on the core application 24 and have its own operations implicitly invoked when changes m the distπbuted state of the core application 24 occur. Fig. 5 shows a logical view of classes 61-64 that use distnbuted object services

60 of the distributed virtual machine 22. The major class 61 in this example is a flight class 61 that provides services to other higher level clients 71-73, which in this example 9239

8

are an traffic client 71. an airline regulatory client 72. and an airline flight stnp client 73. The flight class 61 uses distributed object services 60 to enable its operations and state to be accessed and manipulated by clients 71-73 that are physically distributed. The external interface (to the air traffic and airline regulatory clients 71, 72) and user interface classes 62-64 have a "use" relationship with the flight class 60, allowing the flight class 60 to be independent of its clients 71-73. Even though the class relationship is in this direction, the distributed object services 60 allow the higher level classes 61- 64 to have their operations implicitly invoked when client specified changes occur to the state of the flight class 61. A scenano view is used to show how elements in the static and logical views work together to provide end-to-end (externally visible stimulus and response) operational capabilities. Scenaπos are selected to demonstrate the certain "slices" through the architecture 20. showing how vaπous elements interact to achieve a common objective. The design of a scenario is expressed using object scenano diagrams that capture the dynamic semantics and operational relauonships between objects.

Because of the architectural framework, and unique interclass relationships that the distributed virtual machine 22 provides to applications built on top of it. a Booch object scenano diagram notation has is used to explicitly make these interacuons visible. Classes 61-64 implemented using the distributed object services 60 support the following kinds of interactions with their clients 71-73: operation invocation using normal Ada subprogram call semantics (referred to as "CALL"), operation invocation using synchronous RPC semantics ( referred to as "RPC"). operation invocation using asynchronous RPC semantics (referred to as "ARPC"), and operation invocation using implicit notification semantics (referred to as "NOTI").

The annotation of the associated object scenano diagrams with an appropπate operation prefix (CALL. RPC. ARPC, NOTI) has been found to be an effective technique for documenting the intended inter-class design decisions while providing insight into the dynamic architectural view. Fig. 6 illustrates an example object scenano based on the class diagram of Fig. 5, using additional operation annotations.

More specifically, Fig. 6 shows a typical pattern of an object 74 representing the interests of an external system 71 (the air traffic client 71 ) using ARPC semantics when invoking a file flight plan operation 75 exported by the flight class 61 (Fig. 5).

This allows the file flight plan operation 75 to be delegated and performed elsewhere within the distributed system 20. The air traffic object 74 asynchronously receives results from the invoked file flight plan operation 75 (e.g., a semantic response 75a indicating the outcome of the requested operation). The delegauon and subsequent impiementauon of the file flight plan operauon 75 is encapsulated within the flight class 61. As part of the implementation of the file flight plan operation 75, a distributed object 76 is used to commit the newly created object state. This allows the distnbuted object services 60 to subsequently redistnbute (replicate) the object 76 to other processes and computers 31 (or nodes 31), such as an airline regulatory object 77 (the airline regulatory chent 72) and electronic flight stnp 78 (the electronic flight stnp client 73) within the distnbuted system 20 that require such objects. Finally, any other client classes that have expressed interest in newly created flights flight plans 75 will have their operations implicitly invoked using the notification services (NOTI) provided by the distnbuted object services 60.

The dynamic and physical views of a system having the distπbuted virtual machine architecture 20 are shown in Fig. 7 These views show the effect of building distnbuted application classes using the services of the distnbuted virtual macrune 22. In Fig. 7, the same sequence of operations as shown in the scenario view of Fig. 6 is depicted, but an actual distπbution of the resulting software is shown. In this case, the file flight plan operation 75 invoked in a process 81 (Process 1) on one computer 31 (Node A), is implemented in another process 82 (Process 2) on another computer 31 (Node B), and reactions to the newly created object 73 are tnggered in different processes 83-85 on various computers 31 (Nodes B-D). An important property of the distnbuted object services 60 of the distnbuted virtual machine 22 is that inter-process communication within the architecture 20 is between instances of the same class. This is because the architecture of the distnbuted virtual machine 22 allows distπbuted classes to encapsulate, behind their interfaces, details of any physical distπbution of objects that occurs. A client 71-73 (such as the air traffic client 71 , the airline regulatory client 72, or the electronic flight stnp client 73) or client class simply invokes a particular class operation using either synchronous or asynchronous semantics (this is a client decision), while the details of how and where the operation is actually executed is not visible to the client 71-73.

Fig. 7 also shows the client/server relationships that exist between processes 81-85 and classes 61-64. The relationship between two processes 81-85 is always client/server, and a single process 81-85 can be both a client and a server. In Fig. 7, the first process 81 (Process 1) is a client of the second process 82 (Process 2), and the second process 82 (Process 2) is a client of the third process 83 (Process 3). The client/server model also extends to the class level. In addition, in order to descπbe a distnbuted class, the a service interface is used (in addition to client and server). The air traffic class 62 and the flight class 61 have a client/server relationship because the air traffic class 62 invokes operations of the flight class 61. However, the flight class 61 9239

10

operates as a service interface and a server depending on which process 82. 83 (Process 2 or 3) it executes. The flight class 61 in the first process 81 (Process 1) operates as a service interface because it is pnmaπly responsible for delegating operations to the instance of the class operating as a server in the second process 82 (Process 2). This client/service-interface/server relauonship is applied uniformly in all distπbuted class interactions.

Fig. 7 shows the role of the distπbuted object service 60 (Fig. 5) in physically distπbuting copies of objects 76 to nodes 31 (computers 31) and processes 82-85 where they are required. The distributed object 76 in the third process 83 (Process 3) is a reference copy of the object 73. while distributed objects 76 in the second, fourth and fifth processes 82. 84, 85 (Processes 2, 4, 5) are surrogate copies of the object 76. It is the responsibility of the distπbuted object service 60 to ensure all objects 76 remain consistent and provide uniform access to ooject state independent of the existence of a local suπogate copy The ability to provide simple exported intertaces for distπbuted classes 61-64 that are immune to changes in tne dynamic and physical architectures, and the operations provided by the distnbuted virtual machine 22, allow details of the distnbution to change without any impact on the client or server classes 61-64. Each independent process 81-85 that is based on services provided by the distributed virtual machine 22 conceptually has a single thread of control Processing is always initiated by receipt of a message (ARPC, RPC. NOTI) and may occur asynchronously or penodically. This simplified concunency model within processes 81-85 greatly reduces debugging complexity that is often found in multi-tasking applications and eliminates any concern for mutual exclusion within the context of a process 81-85. Where additional asynchronous behavior is required, the distπbuted virtual machine 22 provides a light weight thread scheduler (referred to as the Pivot thread scheduler) whose services are detailed below.

A more detailed descπption of the services provided by the distπbuted virtual machine 22 are descπbed below in a bottom-up fashion, starting with low-level services and then higher level distributed services. The distributed virtual machine 22 provides a number of low level services for its own use as well as for applications. The distπbuted virtual machine 22 uses subprogram variables. In order to provide a functionality similar to types 'access to subprograms" in Ada 95, a subprogram variable service allows the creation of objects that are "pointers" to procedures with specific parameter profiles. Subprogram vaπables are "type safe", and are modeled after Ada 95 access to subprograms The use of subprogram variables in the distπbuted virtual machine 22 provides support for event-driven (implicit) invocauon. 19239

An off-line tool. ATIG (Ada Type Interchange Generator, developed by Little Tree Consulting), is used to automatically generate Ada code that converts Ada objects into common "byte strea ' data types and vice-versa. This mechanism is similar to the stream I/O attributes of Ada 95. but is implemented in Ada 83/87. This tool is widely used in the implementation of the distnbuted virtual machine. It allows heterogeneous communication between processes with no need for representation clauses. Two formats are supported, including a binary format and a human-legible (text) format.

The Pivot thread scheduler is a very light weight asynchronous thread scheduler that does not use Ada tasks. Services provided by the Pivot thread scheduler have been developed to provide a high-performance scheduler where a potentially large number of dynamically created asynchronous threads are required. High performance is obtained because there are no task or process context switches, nor any need for additional mutual exclusion mechanisms A Pivot tnread is associated with an Ada procedure. Each Pivot thread is a member of a "group ' where all threads of the same group share a common context. Each thread can also have a pnvate context, persistent between invocations of the thread.

There are different kinds of threads and ways to invoke them. A thread may be time sensitive (periodic), event sensitive (reacting to the occuπence of a global stimulus), command sensitive (reacting to a stimulus directed to its group), response sensitive (reacting to a stimulus specifically directed to the thread) or synchronous (directly invoked). The Pivot thread scheduler also supports implicit invocation. It provides services of a bookkeeper, a dispatcher, and a watchdog. It manages service requests and delivers subsequent responses. Such schedulers are generally well understood by Ada programmers The Pivot thread scheduler is a software component that provides threads of control. The Pivot scheduler provides services of a bookkeeper, maintaining knowledge of which threads have expressed an interest in particular events. The Pivot thread scheduler provides services of a dispatcher, invoking threads in response to the occuπence of events that have been designated as "interesting '. The Pivot thread scheduler provides services of a broker, managing a context that may be shared by a number of threads that contnbute to a collective goal. The Pivot thread scheduler provides services of a watchdog, alerting those threads that have requested to be invoked when a specified penod of time has elapsed. The Pivot thread scheduler provides a mechanism that orchestrates execution of an application whose behavior is highly dependent upon a number of dynamic state vaπables.

Threads are references to subprograms whose invocations by the thread scheduler are based on three different sensitivities: events, responses, and times. There 19239

12

are three categoπes of Pivot threads that exploit these sensitivities, namely, event sensitive, response sensitive, and time sensitive threads, respectively.

An application creates an event sensitive thread that is invoked as the result of specific events, which, for example, may be produced in response to distributed object notifications. When an application "produces^" an event, all threads having expressed an interest in the event are "dispatched" by the Pivot thread scheduler. Whereas the creation of an event results in the dispatch of any number of event sensitive threads that have previously expressed interest, a response sensitive thread is created to direct data to a single thread. The application creates a time sensitive thread that is invoked as the result of an expiry condition specified in an argument list. In general, the creation of threads using the Pivot thread scheduler is well understood by Ada programmers.

The distπbuted virtual machine 22 provides several parameteπzation services to s clients 61-64 The distπbuted virtual machine 22 uses these parametenzation services to provide portability and

. These services allow each executable program instance to have its behavior tailored by a set of resource data. Thus, a program instance is a pair: [executable binary, resources]. A resource is an initial value that was not wntten explicitly in the source code but must be provided for the program to run. A resource can be as simple as a single integer value or as complex as a table of arbitrary values. Two kinds of resources include constant and vanable system parameters which allow the parameteπzation of applications and of the distributed virtual machine 22 before start of execution (constant system parameters) and during runtime ( vanable system parameters). Constant system parameters may be used dunng elaboration (for example, for the bounds of subtypes ). Services to define and manage collections of system parameters are provided by the distπbuted virtual machine 22. The system parameter mechanism allows generation of binary data files that are read by the software when it is initially loaded. The advantage of this mechanism is that it allows modification of these parameters, thereby modifying the software behavior in some desired way without the requirement to modify source code and recompile and relink all of the software. This significantly shortens the time it takes to make certain changes to the system.

The distributed virtual machine 22 permits the use of Ada exceptions. However, an application running under normal operating conditions should not raise an Ada exception. The explicit raising of an excepuon under abnormal conditions is always made through an eπor reporting mechanism, by reporting an eπor with an appropnate seventy. The different severities include informative, which is used to report interesting, but infrequent events; warning, which is used to report an unexpected condition within the software, but local recovery was possible and the 239

13

software is able to continue to provide service: serious but not fatal, which is used to report that the current operation is unable to be completed, but there is no reason that subsequent operations won^'t be successful; controlled and fatal, which is used to report the occuπence of a fatal event, but the software knows what's wrong and is confident that a controlled shutdown of the process will be successful; and wild and fatal, which is used to report conditions that should never happen, such as if there is a strong indication of a programming eπor. or the loss of an essential system resource (memory). Only the last three reports result in an exception, and only the exception raised for severity "serious but not fatal" is handled by the general application code. The primary motivation for this strategy is to avoid exception handling code from creeping all over the place and dwarfing the useful code, a phenomenon often observed in large Ada systems. The eπor reporting mechanism allows applications to define and repoπ their own eπors and to parameterize the eπor text through resource data. Reported eπors are centralized for recording, for display and for subsequent query. The distributed virtual machine 22 provides services that allow an application to be instrumented for the collection of performance data. The distributed virtual machine uses this capability for some of its operations in implementing the distributed services. These services support the grouping of measurements for input to statistical analysis and threshold evaluation. The services support the development of higher level responses (i.e., application specific) to sustained poor performance trends. (NOTE: if this topic is not pertinent to the invention, then it should be deleted. Otherwise, this topic needs to be explained in more detail.)

The distributed virtual machine 22 provides services that allow application classes to achieve intra-class message-based communication independent of their physical location in the architecture. Class communication services are the foundation for all inter-process communication. A class 61-64 may have different roles with regard to inter-process communication: service interface, server or both.

There are four types of communication protocols used in the distributed virtual machine 22. An ARPC protocol is used by the vast majority of classes. This is the standard protocol used for all delegated operations. The caller's thread of control is released immediately after sending the message, and the caller will be notified (asynchronously) when a return message is received. The ARPC model is based on the metaphor of taking one's car in to be serviced at an auto repair shop. The general sequence of events is illustrated as follows. A client takes a car to the repair shop requesting work to be performed (invokes a class operation). At the same time, the client also provides a phone number (in addition to other normal input parameters) so that the repair shop can call back when the service is complete (implicitly invoke the client to inform of operation completion). As a result of the request for service, the client is provided a work order that uniquely identifies this service request (an output parameter of the operation request). At some time later, the client (asynchronously) receives a phone call from the repair shop indicating that the requested service (operation) has been completed. Once again, the work order is used to uniquely identify the requested operation. At some time convenient to the client, the client returns to the repair shop (invokes a class operation to retrieve the results of the requested operation). Based on the work order the results of the service (operation) are returned to the client. An RPC protocol is used by classes that need to implement synchronous communication. The caller's thread of control is blocked until a return message is received or the RPC times out. An AMC protocol is used by classes implementing some ad-hoc communication protocol. A broadcast protocol is used to distribute messages to multiple receivers. The broadcast protocol enables a class to broadcast a message to multiple destinations by sending only one physical message over the network. Class instances can act as transmitters or can subscribe to the broadcast service, in which case they will receive all subsequent broadcast messages of the class. The distributed object service 60 allows applications to access objects of a class without concern for where the objects are actually located within the distributed system 20. It provide its users with features including uniform access, an application-oriented cache, data consistency, notification, failure recovery, time management, notices, recording services, data extraction services, and tactical configuration. Uniform access refers to the ability of a distributed class (and consequently clients of that class) to access any of its objects without knowledge of the actual location of the object. With regard to the application-oriented cache (illustrated in Fig. 8), local caching of objects (i.e., the physical distribution of objects values throughout the system) allows distributed classes (and their clients) to make the best possible use of limited memory resources (and network bandwidth) by indicating, on the basis of local knowledge of an object's criteria of interest, that certain objects shouid be immediately available for read access at any time. This service is known as subscription and the local copies of object values are refeπed to as suπogates.

A subscription to objects can be expressed either through a filter or by explicit specification of an object's handle. A filter provides a means of specifying a subset of the objects of a class by applying a Boolean predicate to each object of the class when it is updated for those objects that are of interest. The subscription capability directly supports the software design principle of bringing objects to where they are needed rather than always deferring to some central processing resource. Not only does this provide for improved response times for read access to objects, but it also supports continued access to objects even dunng failure of other processing elements, thus contπbuting to the fault tolerance characteπstics of the distπbuted virtual machine.

Data consistency is ensured between multiple copies of a given object within a class (and across different classes) Transactions allow several objects (possibly of different classes) to be accessed automatically, in the sense that modifications performed in the course of the transaction are committed on an all-or-nothing basis The notification capability allows clients to register to be notified (via a callback) of changes to objects of a distributed class This capability allows for the reversal of dependency between two classes, and allo s the distributed class to only be concerned with maintaining its own state, rather than also concerning itself with the other classes that are interested in those state changes

The notification capaoilitv allows tne client class to register interest in objects based on their state or based on changes to their state As with the suoscπption capability, a filter may be specified to indicate which objects are of interest When an object is committed at its central object store, its state is evaluated against the notification filter and if it passes, then the client is notified. In addition to expressing interest in objects via a filter, clients may also express interest based on change to the state of an object A mutation is a descπption of the difference between two objects When two objects are successive versions of the same object, a mutation describes the set of changes that occuπed from one version to the next Mutations allow clients to be notified upon arbitrary change to some set or attributes of an object The mutation is always used in conjunction witn a notification filter and acts as a second level criteria that must be passed before a notification w ill occur The mutation of an object is computed in a central object store (at transaction commit time) and distnbuted as an additional attπbute of the object This allows the mutation to be accessed as any other attnbute of the object, allowing clients of the distπbuted class to base their processing on knowing exactly which attπbutes changed in the cuπent version of the object Automatic failure recovery is supported by keeping one or several backup copies of each object on various processing nodes 31. and ensunng that these copies remain synchronized and consistent

With regards to time management, the distnbuted virtual machine 22 simultaneously supports concepts of real and logical time Logical time is a time that has a linear reiauon with real time The parameters of this linear relauon may be modified on line, within limits For each type of time (real and logical) there is a corresponding type of duration (real and logical) In an operational system, logical time 239

16

is equivalent to real time. For training or simulation, logical time may be different from real time: it may flow 'faster" or "slower ', or be "frozen ^'.

The real time clock is provided by the host processor system time. The distπbuted virtual machine 22 assumes that the different processor system clocks are synchronized by an external mechanism, such as the well-known Network Time

Protocol (NTP). The services provided on times and durations are reading the cuπent (real or logical) time, anthmetic on times and durations, and control of the distπbuted logical time (setting the initial time, modifying the time flow, that is the "speed" of the logical time versus the real time) for all the processes of an apphcauon. real and logical timers, "one shot" or cyclic, that invoke a specified operation when they expire.

The distributed virtual machine 22 provides a notice mechanism that is used to disseminate operational events. This is another mechanism that supports the concept of implicit invocation: when a notice is created, it can be received by any interested client in the system without the creator^'s knowledge of the particular recipients. The notice mechamsm is based on a metaphor of pinning notes on a bulletin board and allowing unknown interested parties to read and react to them.

Notices are divided into notice categones. and each notice category is further subdivided into keyword subsets. A notice belongs to a single notice category and is a member of one or more keyword subsets. Each notice contains some infoπnation, the type of which depends on the notice category.

A bulletin is a set of notice categones. Each notice category is a member of one or more bulletins. A notice is "posted" on all of the bulletins associated with its category A notice has one creator and zero or more suoscπbers. Subscπbers may subscπbe to individual notice categories, as well as to bulletins. Each notice has a state during us lifetime: created, completed or timed-out.

Notices have several characteπstics. in that their lifetimes may be limited, and upon expiration, a notice will be automatically deleted. Subscnbers may "respond" to a notice, giving feed-back to the creator. A number of responses to a notice may be required, and when this number of responses is given, the state of the notice is changed to "completed". For notices that need one or more responses, a "response" penod may be limited. If at the end of this penod, the proper number of responses have not been received, the state of the notice becomes "timed-out". Notices, when they are "timed- out", may be escalated: another notice of a different category, displayed on different bulletins, is created and distπbuted to potentially different subscribers. A notice may be "updated" by its creator after us creation. Subscnbers to a specific notice category may specify the states and keywords for which they want to be notified. The distπbuted virtual machine 22 provides a recording mechamsm that allows applications to record on-line distπbuted operauonal data in a central locauon. The metaphor of a sound track studio is used to implement this mechanism, with classes such as signal, mixer, recorder, tape and tape library Similar recorded data of all processes in an application are centralized on one physical location. Fault tolerance mechanisms are used to ensure continuous operation of the recording services. Data extraction services are also provided to support application level data reduction and report generation

Tactical configuration services of the distnbuted virtual machine 22 provide capabilities to manage and control mapping of the dynamic architecture to the physical architecture. This is accomplished through real-time momtonng of the cuπent system configuration, manually controlling the cuπent system configuration (startup, shutdown, reconfiguration), and automatic reconfiguration in response to system faults The distπbuted virtual machine concept for management of a network starts with the notion of a logical network, where a logical network is a set of communicating processes, operating within a common (logical) time base, that provide a service. The distπbuted virtual machine 22 supports managing muluple simultaneous logical networks that share the same set of physical resources (nodes 31 and physical network 13). Each logical network includes many settings where each setting is a named collection of processes that may execute on a node 31 of the physical network 13.

The momtonng capabilities provide insight into the cuπent logical networks, settings and processes as well as insight into the cuπent allocation of nodes 31 to the physical network 13 The manual reconfiguration services allow for controlled system level startup and shutdo n as well as the execution of scπpted reconfigurations without service interruption. Common reconfiguration actions can be saved as a scπpt and recalled at any time.

A descnption of how the distπbuted virtual macnine 22 may be used to build fault tolerant distπbuted applications is descnbed below Fault tolerance is defined and a general taxonomy of faults that the software can detect and respond to is discussed. Each of the software services provided by distnbuted virtual machine 22 that play a significant role in the fault tolerance approach is discussed.

A distπbuted application provides services. A service is coπect if, in response to inputs, it behaves in a manner that is consistent with its specification. When a service fails to behave in its prescribed manner, then a fault has occuπed. If a fault is unhandled or exists for a prolonged time, then a failure has occurred. The fault tolerant mechanisms of the distπbuted virtual machine 22 prevents failures from occurnng. From the very specific to the very general, software faults can be classified into the following hierarchy whicn includes crash fault, omission fault, timing fault and arbitrary fault. A crash fault is the failure of a service to respond to an input and all subsequent inputs An omission fault is a super-set of crash faults, and includes the case where a service fails to respond to a particular input, but may respond to subsequent inputs. A timing fault occurs when a service fails to respond or responds too early or too late. Performance faults are a subset of timing faults that are predominantly due to responses that are too late. An arbitrary fault is either a timing fault, or the service produces a response different from the one specified. The distπbuted virtual machine 22 provides a vaπety of integrated services to detect and recover from the vaπous classes of software faults. Tacucal configuration services are responsible for detection and process level recovery from crash faults. Class communication services are responsible for the masking (from the application classes) of crash faults and the detection ol omission faults The distπbuted object services are responsible for ensunng data consistency and continued access after the recovery from crash and omission faults. Performance measurement services are responsible for the detection of performance faults.

The tactical configuration services that support the detection and recovery from crash faults are described below The crash faults handled by the tactical configuration services can be further divided into process faults and node faults. Process faults can, in turn, be divided into three sub-categoπes depending on the kind of failures they may lead to: blindfolded, wild, and controlled failures.

Blindfolded failures occur when a process fails without even being aware of it (for example, a class enters an infinite loop) The watchdog capabilities of the UNAS software layer 21 are responsible for detecting these failures Wild failures are detected from within the process, but their origin or impact is unknown. These failures manifest themselves as unhandled exceptions and are detected by an exception handler at the bottom stack frame of the main task in each process. Controlled failures are the result of an explicit action by the software that has detected a fatal condition. Controlled failures manifest themselves as a unique Ada exception raised in response to the reporting of a fatal condition A controlled failure is different from a wild failure in the sense that with a controlled failure, the process can be shutdown in an organized fashion, giving the applications an opportunity to save data or do any other appropπate task. To enable tailoring of recovery processing in response crash faults, each active process operates in a process mode, which helps define the process's fault tolerance strategy. The distπbuted virtual machine 22 supports the following predefined process 39

19

modes- solitary, shanng, primary, and secondary A process is in the solitary mode if it is actively providing services to client processes, and there are no associated shanng or secondary processes active A process is in the shanng mode if it is actively providing services while operating in a network configuration where there are multiple instances of this same process actively shanng the service load. A process is in the pnmary mode if it is actively providing services and is operating m a network configuration where it has been paired with another process in secondary mode

A process in the secondary mode is not actively providing services (to client processes), but is potentially interacting with another process instance operating in pnmary mode The secondary mode process is responsible for being prepared to take over the services provided by the primary mode process in the event of a fault.

Resource data, along with the mode of the process expeπencmg the failure, determine the particular recovery action to be taken in response to a crash fault Recovery actions can be specified for individual processes, for individual nodes 31, or for the entire network, providing a natural hierarchy of possible recovery actions dependent on the type of failure (process or node 31) and the cuπent system configuration In the case of an entire node failure, a special mode (node failure) is used to direct the particular recovery actions

Tactical configuration services support recovery strategies in response to crash faults including starting a new process on the same node 31 or on a different node 31 , changing the mode of an existing process on the same node 31 or a different node 31 , shutting down an existing process on the same node 31 or a different node 31 , and performing a soft shutdown and warm restart of the failed process (applicable for controlled failures) The class communication services play two roles in the fault tolerance design aiding in the masking of crash faults by supporting automatic re-establishment of client/server communication paths as part of node 31 or process recovery actions taken by the tactical configuration services, and providing automatic detection of omission faults The masking of crash faults from the majoπty of the application code occurs as the result of the automatic recovery acuons descnbed in the previous section as well as the automatic re-establishment of client/server commumcauon paths performed by the class commumcauon services After the recovery of a failed server process (on the same or different node 31), the class communication services automaucally determine the new address for the server and re-establish the commumcauon connections between that server and all of its clients. In addition, the class communication services export operauons that allow distπbuted classes to be built such that they take explicit advantage of operaung in the shanng or primary/secondary configuration These operations support the design of higher level fault recovery strategies within distπbuted classes by allowing them to naturally react to changes in the network topology

Omission faults are a super-set of crash faults and include the case where a server fails to respond to a request but has not crashed. Omission fault detection has been built into the asynchronous and synchronous remote procedure call (ARPC and RPC) protocols provided by the class communication services through the automatic detection of time-out conditions. These protocols detect the failure of a server to respond to a client request within a specified duration. If a client does not receive a response within the specified duration, an eπor response is generated and returned to the client indicating that the request has not completed due to a time-out. The RPC and ARPC protocols guarantee that once a client receives a time-out mdicauon, the chent will not receive the real response later, even if it actually arπves.

The ume-outs built into the ARPC and RPC protocols are only intended to detect conditions when something has really gone wrong, rather than detecting the condition where responses are just late (timing fault) Consequently, the time-out durauon associated with an ARPC or RPC is several times larger than the actual time the request is expected to take.

There is a tension between the goals of data lntegπty, fast response time and fault-tolerance. Providing data lntegπty requires that only a single copy of eacn object be maintained, and all accesses to an object be seπalized Fast response time (less than a second or even half a second) to air traffic controllers, and high fault-tolerance and availability of the system all require to bnnging objects close to where they are being used. Th s, in turn, leads to a high degree of redundant information. The distnbuted object services resolve this tension by providing fast read access to objects through rep cauon, while still ensunng data lntegπty, even in the presence of faults.

Fig. 8 illustrates independent processing elements or components that compπse a class implemented using distπbuted object services of the distπbuted virtual machine 22. The components include class instances providing service interface behavior, class instances providing server role behavior, and central object storage/data consistency/availability assurance.

Referπng to Fig. 8, a plurality of individual computers 31 are shown that each run processes 81-85 (i.e., Ada programs). The processes 81, 82 shown in the top portion of Fig. 8 each contain a client application 92a, 92b (Client) making use of the services of a distπbuted class 91 The distnbuted class 91 illustrated on the left side of Fig. 8 is in a service interface role and communicates witn one or more instances in a server role ( shown oy the processes on the πgnt side of Fig. 8) The bi-directional aπow connecting the computers 31 represents internal communication that occurs dunng operation delegation Each process 81. 82 that contains a distnbuted class 1 also contains a single transaction manager 94 Both the distnbuted class 91 and the transaction manager 94 communicate with the primary central object store 95 associated with the distributed class 91 The central object store 95 is responsible for committing transactions, distributing suπogate copies of objects to proper remote caches 96 . and communicating with a secondary central object store 97 Of these core capabilities of the distnbuted object services, the data consistency, notification, and automatic failure recovery capabilities play key roles in the fault tolerance system 20 Ensuring data consistency in the presence of crash faults is the combined responsibility of the transaction manager 94 and the central object store 95 provided by the distπbuted object services. The transaction manager 94 provides operations to initiate, roll back, and commit transactions Integnty is ensured by keeping in a transaction log all modificauons to objects requested as part of that transaction, and making their effect permanent only upon the successful completion of the commit operation If the transaction is rolled back, or is never committed due to the occuπence of a fault, the transaction log is discarded and no change to the object state occurs. The design of the transaction manager 94 was heavily influenced by the desire for it to be as "light-weight" as possible. This is why the transaction manager 94 is not based on the use of locks or two-phase-commit protocols, but an approach of detecting object version mismatches at transaction commit time (rather than trying to secure locks on objects at each step of the transaction) From the above descπpuon of the transaction capabilities, the transaction manager 94 ensures data integnty in the case of a crash fault occurπng in the committing process (prior to the transaction being committed). The transaction manager 94 only partially accounts for ensunng data integnty in the case of a crash fault occurπng in the pnmary central object store 95. The automatic failure recovery capability provided by the distnbuted object services provides the balance of the capabilities. The automatic failure recovery capabilities provided by the distπbuted object services provide tolerance in the presence of crash faults occurπng in the pnmary central object store 95 or in the case of complete node failures. The automatic failure recovery funcuona ty is an example of the pnmary/secondary process mode capabilities descnbed above. For each central object store operating in the pnmary process mode, there may exist another central object store operaung in the secondary process mode on a different computer 31 It is the responsibility of the pnmarv central object store 95 to keep the secondary central object store 97 synchronized regarding the state of all objects. The communication that occurs when committing a transaction, descnbed above, as well as the communication that occurs when initializing the secondary central object store, accomplishes this synchronization In the case where the tactical configuration mechanism detects a crash fault in the pnmary central object store process, the automatically miuated recovery action causes the secondary central object store process to undergo a mode change, from secondary to pnmary In addition, it is the responsibility of the central object store to provide persistence for all objects. This persistency is relied upon when the system is lmually started or re-started from a cold state.

The notification capability provided by the distributed object services provides a powerful form of communication between classes by allowing an operation of one class to be implicitly invoked as the result of a change to the state of an object m another class. Since notifications trigger second-order processing on other classes, it is lmpor- tant that these events are not lost in the presence of faults. Performing the evaluation of notificauon cπtena (and subsequent notifications), not only at transacuon commit ume, but also when the notification cπtena are initially registered or re-registered, ensures notification events occur, even in the presence of faults. By evaluating notification cπtena at the time of registration, the distπbuted virtual machine 22 can recover from both crash faults in the pnmary central object store 95, and in any second-order processing (i.e.. the second-order processing will occur).

The distπbuted virtual machine 22 has been reduced to practice and is cuπently undergoing testing The entire framework cuπently represents over 50.000 source lines of Ada code (not including the UNAS software layer 21) The distπbuted virtual machine 22 has been tested in distπbuted systems that have physical networks containing over fifty computers 31 or nodes 31

Thus, a distnbuted virtual software interface that interfaces between computers, operating systems and applications that run on the computers of a distπbuted, multicomputer fault tolerant computer system has been disclosed. It is to be understood that the descnbed embodiment is merely illustrative of some of the many specific embodiments which represent applications of the pπnciples of the present invention. Clearly, numerous and other aπangements can be readily devised by those skilled in the art without departing from the scope of the invention.

Claims

CLAIMSWhat is claimed is:

1. An object-oriented software architecture (20) implemented in a distributed computer system (30) having a plurality of computers (31) that are interconnected by way of a network (13), and wherein each computer (31) comprises an operating system, said architecture (20) characterized by: a distributed intermediate software layer (21) that is distributed among the plurality of computers (31) that interfaces with the operating system (11) of the computer (31) on which it is disposed and that generates intermediate instructions that cause the operating system (11) to implement primitive operating system instructions in response thereto; a distributed object-oriented software layer (22) distributed among the plurality of computers (31) that provides communication between computers (31) using objects that are instances of object classes that are defined by attributes of objects, operations on objects and associations between objects, and wherein the distributed object-oriented software layer (22) comprises instructions that distribute objects of the same class to computers (31) that are linked by the attributes, operations and associations between the objects within the class in response to the creation of a new object on one of the computers (31), and wherein communication between computers (31) is provided using predefined communication protocols, and wherein the distributed object-oriented software layer (22) interfaces with the distributed middle software layer (21) disposed on each respective computer (31 ) and causes generation of the intermediate instructions; and at least one software application disposed on each computer (31) that interfaces with the distributed object-oriented software layer (22) disposed on the respective computer (31) and processes objects distributed to it.

2. The architecture (20) of Claim 1 wherein the operating system (11) is characterized by a UND operating system (11).

3. The architecture (20) of Claim 3 wherein the distributed intermediate software layer (21) is characterized by Universal Network Architecture Services software.

4. The architecture (20) of Claim 3 wherein the distributed object-oriented software layer (22) supports multiple communications protocols.

5. The architecture (20) of Claim 1 wherein the distributed object-oriented software layer (22) supports an asynchronous remote procedure call communications protocol.

6. The architecture (20) of Claim 5 wherein the distributed object-oriented software layer (22) supports a remote procedure call communications protocol.

7. The architecture (20) of Claim 5 wherein the distributed object-oriented software layer (22) supports an asynchronous message call communications protocol.

8. The architecture (20) of Claim 5 wherein the distributed object-oriented software layer (22) supports a broadcast communications protocol.

9. The architecture (20) of Claim 5 wherein the distributed object-oriented software layer (22) provides for class-based communications between respective ones of the computers (31).