Because of its immense promise in dealing with complex tasks in heterogeneous environments, workflow technology has drawn continuing interest from the research and commercial communities. Although much attention has (rightly) been focused on techniques for scheduling workflows and for interfacing them to legacy databases and applications, corresponding attention has not been directed toward their conceptual modeling. We believe that the success of conceptual modeling will determine the eventual success of workflow technology.Every workflow system implicitly incorporates a metamodel. Metamodels are the objects of our study. We identify major categories of workflow metamodels, and a list of criteria to use as evaluation dimensions. We model fragments of a workflow in the manufacturing domain in the different metamodels, and compare their effectiveness in capturing the desired properties in a modular, reusable fashion.
This evaluation is useful not only in helping one decide which metamodel to employ, but also highlights some topics for future research.
*This research was supported by the National Science Foundation under grant IRI-9529179, IBM corporation, and the NCSU College of Engineering. We are indebted to Manny Aparicio for useful discussions.
Briefly, workflows can be defined as composite tasks that comprise coordinated human and computational subtasks. At the very least, workflows separate the control and coordination aspects of the composite tasks from the details of the component tasks. They are frequently heterogeneous and distributed over multiple systems. This facilitates the update and reuse of workflows, and has led to much commercial and research interest in tools for constructing and managing them.
Much research attention has concentrated on techniques for workflow management, and not enough on techniques for workflow modeling. Every approach implicitly carries in it some representational language or metamodel, but the metamodels themselves have not been the objects of study and evaluation in recent research. Recognizing that modeling issues will become increasingly important as workflow technology attempts to expand its applications, we compare some of the leading approaches to workflow modeling. We do so in the context of a simple application from the manufacturing domain. This is a variation of the example scenario from the SHIIP (SHipbuilding Information Infrastructure Protocol) project being conducted by the NIIIP (National Industrial Information Infrastructure Protocol) consortium [11]. We are collaborating with one of the leading industrial members of SHIIP, so our interest is not exclusively academic.
The potential benefits of this exercise are twofold. From an engineering standpoint, an evaluation, by presenting the strengths and shortcomings of different metamodels, helps us choose a metamodel for our specific application. From a scientific standpoint, it highlights the opportunities for further research.
The rest of this paper is organized as follows. Section 2 lays out our key terminology, identifies four categories of metamodels and proposes a list of evaluation dimensions. Section 3 describes the SHIIP example with a detailed scenario. Section 4 introduces several representative metamodels and renders part of SHIIP representations in those metamodels. Section 5 carries out an overall evaluation for those metamodels. Section 6 offers concluding remarks.
Some basic terms are used extensively in the literature. We present a short description for some important concepts. Although they are not formal definitions, these descriptions help clarify the meanings we adhere to in this paper. Figure 1 illustrates the relationship of those concepts.

Since the process model is central to the workflow model, we identify four categories of workflow metamodel based primarily on their process metamodels.
Task is the fundamental element in the process metamodel of task-flow based workflow metamodel. It emphasizes the analysis of all the potential sequences of tasks in the specification phase. During model execution, an instance of the workflow decides which sequence to follow on the basis of static specification and dynamic running context. In the graphical representation of process model, typically a directed acyclic graph (DAG), tasks show up as the nodes, state information as conditions attached to the edges.
A state abstracts the context information at a particular time when a system is running. A state transition occurs when a particular event or activity happens. The behaviors of a workflow are represented as a sequence of state transitions. In the process model DAG, states show up as a node, and tasks or events as conditions attached to an edge.
Relationship-capturing based metamodel is founded at the view that the workflow brings all the tasks together on the basis of certain relationship. The process model is built by means of capturing this particular relationship. Example relationships can be triggers [8] or enabling or disabling conditions. The elements are typically tasks, so in certain respects these resemble the task flow metamodels.
These approaches identify the communications among the various actors. They must also involve tasks, of course, but they present the tasks as occurring in the context of the relationships and communications of the actors, not independently.





Different metamodels are constructed from different perspectives. It would not be accurate to assert that one approach is superior to another in all circumstances. However, there are always some trade-offs. A metamodel might miss some aspects when it concentrates on others. The purpose of our evaluation is to highlight those trade-offs.
We first make an evaluation at the level of major types of metamodels. Among the four types of metamodels, task-flow based metamodel is the most straightforward in the sense that people make a schedule in about the same way. Since they focus on how to bring tasks into order with little concern for their location, almost all of the task-flow based metamodels have centralized schedules.
In contrast, state-transition based metamodels lead to a distributed control logic. Although we can consider the state transitions from the perspective of the whole system, most state-transition based metamodels have individual state-transition diagrams for each role, which can communicate with each other as necessary. In a sense, such metamodels examine the system with a distributed viewpoint.
Relationship-capturing metamodels propose a novel perspective to represent a workflow system. However, such metamodels are built on the basic assumption that the system behaviors can be fully captured by certain relationship. The assumption is somewhat restrictive, This might explain why only few of such metamodels are available.
Communication-based metamodels offer an interesting approach for workflows. These are especially suited to groupware settings, and not so much to data-intensive applications. They have the advantage of identifying the actors for each task, but this can also be a disadvantage when multiactor tasks are considered,
| Approach | Granularity | Control | Data | Org | Rol | Exc | Trn | Com |
|---|---|---|---|---|---|---|---|---|
| FlowMark | task | basic, synchronization | together | yes | no | yes | no | no |
| Action | task | basic, synchronization | implicit | no | ¾ | yes | no | yes |
| WIDE | task | basic, synchronization, concurrency | separate | yes | yes | yes | yes | no |
| Statecharts | event, task | basic | implicit | no | ¾ | no | no | no |
| Triggers | event, task | basic | separate | no | ¾ | no | no | no |
Now we turn to the metamodels used in our examples. Table 1 summarizes the features of these metamodels in terms of the dimensions introduced in Section 2.3. In this table, together indicates that data flow is represented in the same model as control flow, but with different markings for distinction; separate means that data flow is modeled separately; implicit means that there is no explicit data flow representation. Note that implicit data flow can be reasonable, since data exchange may be accomplished by communications carried out for control flow. However, explicit data flow is clearer. The granularity of Statechart and Triggers is event, because transitions and triggers can be thought as forms of events. To support synchronization, ordinary branching operators are often extended by, for instance, introducing exclusive OR/AND join operators in FlowMark and WIDE. WIDE further introduces a multitask operator for concurrency support [3]. FlowMark and WIDE both support an organizational model. WIDE has achieved greater flexibility with the dynamic task assignment supported by role-binding. ActionWorkflow does not support organizational exception handling because it does not have an organizational model at all. However, ActionWorkflow provides some rudimentary commitment support. Transaction and commitment support receive little attention in current approaches, with the exception of WIDE, which provides transaction support through the cooperation between a global transaction manager and local transaction managers.
Let us consider the approaches in more detail. First, the task-flow based metamodels, FlowMark and WIDE, bear much similarity from the viewpoint of conceptual modeling. Both models take tasks as atomic. They schedule tasks by capturing their temporal interdependence and then build an organizational model to assign those tasks to actors for organizational constraints. In contrast, ActionWorkflow considers task as divisible. It captures the static task hierarchy by the relationship that one task loop could be part of another one. Also, a task is always associated with a requester and performer.
Second, the state-transition based metamodel, Statecharts face a central issue about how to define the states. Too fine a definition leads to a large number of states, which might be intractable in some cases; too coarse to a composite activation part, where Statecharts do not provide an efficient mechanism to describe the internal structure of the activation. Also, although Statecharts allow the communication between two state diagrams, little concern has been paid to mechanisms to support such communication.
Third, the Trigger metamodel is a perfect example of relationship-capturing metamodel. The correspondence between the Trigger metamodel and Petri-nets contributes the most significant feature, which enables workflow automation and distributed control logic. However, the Trigger metamodel requires identifying a responsible role for any task. This requirement is not suitable for some cooperative tasks because such cooperative information is lost in the representation of the Trigger metamodel. For our example, the check procedure looks like a task of Manager in the graphical representation of Trigger metamodel, even though it is performed by Manager and Foreman together in our scenario. Also, the Trigger metamodel does not provide any conditional branch, which does not seem reasonable for the practical application.
Fourth, the construction of models in ActionWorkflow is more complicated than in FlowMark or WIDE. However, more information is captured in ActionWorkflow. On the one hand, the coordination behavior between the requester and performer represented in ActionWorkflow model facilitates the support of commitment. On the other hand, the emphasis on the coordination of tasks causes some problems for ActionWorkflow. It does not appear appropriate to name a requester and performer for local tasks. Another problem is that the sequential control relationships are not well-defined among the components of a composite task.
Although we considered only a few products and projects, the ideas cover many of the cases of interest. A large number of workflow products exist, but they are not based on radically differing metamodels. Some of the metamodels already have associated enactment tools; such tools are under development for some of the others. This is encouraging, since it is only by enacting the workflow models can we be sure that the desired behavior is being obtained.
The various models have strong similarities in how they handle tasks and control flow among them. However, they differ the most in advanced features such as concurrency, role-binding, and transaction and commitment support. In fact, their coverage of these features still leaves much to be desired. This is probably because the workflow research community still has not come to a good understanding of what these issues entail. However, these limitations have hindered the deployment of workflow technology in managing tasks in large-scale, open, heterogeneous enterprises.
We expect that the near future will witness the following developments. First, more formal methodologies will be developed. We believe that metamodels ought always to be accompanied by methodologies for how to use them in practice. However, presently, the methodologies for workflow metamodels have not been very well worked out. Methodologies will seek closer integration with external database schemas [2]. Databases provides excellent persistent storage and strong local transaction support. The open issues for such integration include how to efficiently access external, typically heterogeneous, databases and how to make the extension of global transaction and commitment support.
Second, metamodels will facilitate naturally decentralized models and environments. This contrasts with extant metamodels, which typically presuppose a centralized control structure. In general, there are two alternatives to achieve distributed solutions. One is to use distributed programming technology, like COBRA, to make multiple servers transparent from users. Another way is to remove client/server architecture by means of distributing the control logic itself. The second, more radical, approach requires more expressive metamodels in which to capture the distributed control aspects.
Third, we envisage the widespread application of the agent metaphor. Agents provide a major step beyond distributed control. This is the use of higher-level abstraction. One such abstraction is commitments¾more advanced than ActionWorkflow's¾which we have begun for use in a potentially more powerful approach to workflows [12, 7].
[2] F. Casati, P. Grefen, B. Pernici, G. Pozzi, and G. Sánchez. WIDE workflow model and architecture, 1997. http://www.sema.es/projects/WIDE/Documents/.
[3] Daniel K. C. Chan and Jochem Vonk. A specification language for the WIDE workflow model, 1997. http://www.sema.es/projects/WIDE/Documents/.
[4] Peter J. Denning. Work is a closed-loop process. American Scientist, 80:314-317, July-August 1992.
[5] Jim Gray and Andreas Reuter. Transaction Processing: Concepts and Techniques. Morgan Kaufmann, San Mateo, 1993.
[6] David Harel and Eran Gery. Executable object modeling with statecharts. IEEE Computer, pages 31-42, July 1997.
[7] Anuj K. Jain and Munindar P. Singh. Using spheres of commitment to support virtual enterprises. In Proceedings of the 4th ISPE International Conference on Concurrent Engineering: Research and Applications (CE). International Society for Productivity Enhancements (ISPE), 1997.
[8] Stef Joosten. Trigger modelling for workflow analysis. In Proc. CON'94 Workflow Management, pages 236-247, 1994.
[9] Frank Leymann and Dieter Roller. Business process management with flowmark. In Proc. of COMPCON Spring 1994. IEEE, 1994.
[10] Raul Medina-Mora, Terry Winograd, Rodrigo Flores, and Fernando Flores. The Action Workflow approach to workflow management technology. In ACM, Proceedings of the Conference On Computer-Supported Coooperative Work, November 1992.
[11] About SHIIP, 1997. http://shiip.npo.org/about-SHIIP.html.
[12] Munindar P. Singh. Commitments among autonomous agents in information-rich environments. In Proceedings of the 8th European Workshop on Modelling Autonomous Agents in a Multi-Agent World (MAAMAW), pages 141-155, 1997.
[13] Munindar P. Singh and Michael N. Huhns. Automating workflows for service provisioning: Integrating ai and database technologies. IEEE Expert, 9(5):19-23, 1994.
[14] Terry Winograd. A language/action perspective on the design of cooperative work. Human-Computer Interaction, 3(3-30), 1988.