Requirements for an expert system explanation facility

JOHANNA D. MOORE
Department of Computer Science and Learning Research and Development Center, University of Pittsburgh, Pittsburgh, PA 15260, U.S.A.

AND

CÉCILE L. PARIS
Information Sciences Institute, University of Southern California, 4676 Admiralty Way, Marina del Rey, CA 90292-6695, U.S.A.

Received August 9, 1991
Revision accepted September 24, 1991

Comput. Intell. 7, 367-370 (1991)

For the past several years, we have worked on building an explanation component for an expert system building framework (or "shell"), the Explainable Expert Systems (EES) framework. In this short paper, we describe the characteristics that we believe to be essential for an explanation component of an expert system. We then identify important features of the EES architecture that support the desired capabilities. Finally, we discuss some areas where fruitful work remains to be done.

1. Characteristics of a good explanation facility

Every type of natural language application has its own requirements. Based on studies of human explanation giving and on our system-building experience, we outline here the requirements we have identified for an expert system explanation facility.

Fidelity. In order to be trusted, an explanation must accurately reflect the system's knowledge and reasoning (Swartout 1983, 1990). To ensure fidelity, an explanation facility must synthesize text directly from the same knowledge sources used for problem solving. It cannot rely on templates or canned text written a priori by a programmer because, as the system evolves, it is impossible to ensure that the corresponding text is an accurate reflection of the program's behavior.
Knowledge from multiple sources. To support the range of questions users wish to ask, an expert system must provide several different knowledge sources, including terminological knowledge, factual domain knowledge, problem-solving knowledge, and an execution history (see Swartout et al. 1991). An explanation facility must have strategies that enable it to decide on the type(s) of information needed and extract it from these knowledge sources (a minimal sketch of such a collection of knowledge sources appears at the end of this section).

Naturalness. Expert systems are often required to produce multisentential explanations, for example, to provide justifications of the system's actions or recommendations, descriptions of its problem-solving strategies, or definitions of the terms it uses. The explainer must thus be able to organize information into a coherent presentation. To generate coherent natural language explanations, the system's explanations should follow standard patterns of discourse employed by humans. To do so, the explanation facility must have knowledge about discourse structure and strategies for employing that knowledge. The naturalness criterion applies not only to individual explanations, but to entire explanation dialogues as well. Taken as a whole, the explanations produced by the system during a dialogue must form a coherent set.

Responsiveness. Analyses of naturally occurring advisory dialogues show that advice-seekers often do not understand the advisor's explanations and frequently ask follow-up questions (Moore 1989b). The system must thus be able to answer follow-up questions or offer an alternative explanation if a user is not satisfied with a given explanation.

Flexibility. An explanation facility must have a variety of strategies for answering a given question. This is important for at least two reasons. First, if a given explanation is not understood, the system must be able to offer an alternative explanation. Second, the system must be able to present the same information from different perspectives, depending on contextual factors such as the user's knowledge or goals.

Sensitivity. Explanations should be influenced by information about the user's knowledge and goals, the problem-solving situation, and the previous dialogue. Question and answer pairs cannot be treated independently; later explanations must take prior explanations into account.

Extensibility. An explanation facility must be able to handle many different types of questions, some of which may not be foreseen at design time. It must be possible to easily extend the explanation facility to respond to additional question types or to add new strategies for answering existing question types.

Portability. Because our generation facility is a component of an expert system shell, it can include only domain-independent strategies. The architecture must thus aid users in customizing the explainer by adding domain-dependent strategies to suit their requirements.

Adaptive capabilities. An explanation facility should have the ability to automatically modify its existing strategies and learn new strategies based on experience and interactions with users (Paris 1990).
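To make the "knowledge from multiple sources" requirement more concrete, the following is a minimal Common Lisp sketch, not the EES implementation, of the kinds of knowledge an explanation facility must be able to query; the structure, slot, and function names are illustrative assumptions of ours.

;;; Minimal illustrative sketch of the knowledge sources listed above.
;;; This is not EES code; all names are our own.
(defstruct knowledge-sources
  terminology         ; terminological knowledge: term -> definition
  domain-facts        ; factual domain knowledge
  problem-solving     ; the system's goals and methods
  execution-history)  ; trace of what the system did on the current case

(defun answer-what-is (sources term)
  "Answer a definitional question from the terminological knowledge."
  (or (cdr (assoc term (knowledge-sources-terminology sources)))
      (format nil "I have no definition for ~A." term)))

;; Example use:
;; (answer-what-is
;;  (make-knowledge-sources
;;   :terminology '((generalized-variable
;;                   . "a storage location that can be named by any access function")))
;;  'generalized-variable)

An explanation strategy would first decide which of these sources a given question requires and then extract the needed information from them, as the requirement above states.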
2. Brief system overview

We have devised an explanation architecture that explicitly plans explanations to achieve discourse goals denoting what information should be communicated to the user (e.g., make the user know a certain concept, persuade the user to perform an action). When a goal is posted, the planner searches its library of explanation strategies looking for candidates that could achieve it. In general, there will be several candidate strategies for achieving a goal. The planner employs a set of selection heuristics to determine which strategy is most appropriate in the current situation. These selection heuristics take into account information about the user's knowledge and goals (as recorded in the user model), the conversation that has occurred so far (as recorded in the dialogue history), and information about whether or not a strategy requires assumptions to be made. Once a strategy is selected, it may in turn post subgoals for the planner to refine. Planning continues in a top-down fashion until all goals are refined into speech acts, such as INFORM and RECOMMEND.

As the system plans explanations, it records the goal structure of the response being produced. In addition, it keeps track of any assumptions it makes about what the user knows, as well as alternative strategies that could have been chosen at any point in the planning process. The result is a text plan for achieving the original discourse goal. Text plans are recorded in the dialogue history before being passed to the Penman text generation system (Mann and Matthiessen 1983), which performs the process of realization into English text.
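The refinement process just described can be made concrete with a small sketch. The following Common Lisp code is a simplified illustration under our own assumptions (the operator representation, goal notation, and function names are ours, and the selection heuristics are reduced to taking the first applicable operator); it is not the EES planner itself.

;;; Minimal illustrative sketch of top-down explanation planning.
;;; This is not the EES planner; all names are our own.
(defstruct plan-operator
  name          ; identifier for the strategy
  effect        ; the discourse goal this operator can achieve
  constraints   ; predicate over the current context (user model, KB, ...)
  subgoals)     ; function from context to a list of subgoals / speech acts

(defparameter *speech-acts* '(inform recommend)
  "Goal types that are realized directly and need no further refinement.")

(defun speech-act-p (goal)
  (member (first goal) *speech-acts*))

(defun applicable-operators (goal operators context)
  "Operators whose effect matches GOAL and whose constraints hold in CONTEXT."
  (remove-if-not (lambda (op)
                   (and (equal (plan-operator-effect op) goal)
                        (funcall (plan-operator-constraints op) context)))
                 operators))

(defun plan-goal (goal operators context)
  "Refine GOAL top-down until only speech acts remain, recording the goal
structure of the resulting text plan.  Here we simply take the first
applicable operator; the selection heuristics are applied at this point."
  (if (speech-act-p goal)
      (list :speech-act goal)
      (let ((op (first (applicable-operators goal operators context))))
        (if (null op)
            (list :unsatisfied goal)
            (list :goal goal
                  :strategy (plan-operator-name op)
                  :subplans (mapcar (lambda (g)
                                      (plan-goal g operators context))
                                    (funcall (plan-operator-subgoals op)
                                             context)))))))

;; Example: one operator that achieves a PERSUADED goal (compare the text
;; plan in Fig. 1 below) by recommending the act and informing the user of
;; its motivation.
;; (plan-goal '(persuaded user (do user replace-1))
;;            (list (make-plan-operator
;;                   :name 'persuade-by-motivation
;;                   :effect '(persuaded user (do user replace-1))
;;                   :constraints (lambda (ctx) (declare (ignore ctx)) t)
;;                   :subgoals (lambda (ctx) (declare (ignore ctx))
;;                               '((recommend system user replace-1)
;;                                 (inform system user
;;                                  (motivation replace-1 enhance-maintainability))))))
;;            nil)

In the actual system, the choice among candidate operators is made by the selection heuristics described above, and the recorded plan also notes the assumptions made and the alternatives rejected at each point; the sketch omits both.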
In our system, a completed text plan is more than simply a specification for the realization process. It is an explicit representation of the planning or "design" process that produces an explanation. To briefly illustrate this point, we show a sample text plan in Fig. 1 accompanied by the text it produces. This text plan comes from an interaction with the program enhancement advisor (PEA) (Neches et al. 1985), an advice-giving system constructed using the EES framework that is intended to aid users in improving their Common Lisp programs. (PEA recommends transformations that improve the "style" of the user's code; it does not attempt to understand the content of the user's program.)

[Fig. 1. A completed text plan and the explanation produced. The plan refines the discourse goal (GOAL USER (DO USER REPLACE-1)) into a RECOMMEND speech act and a PERSUADED subgoal, achieved by a MOTIVATION in terms of ENHANCE-MAINTAINABILITY, a CONTRAST between the uses of SETQ and SETF, and an ELABORATION introducing the concept GENERALIZED-VARIABLE. The explanation generated from this text plan reads: "You should replace (SETQ X 1) with (SETF X 1). SETQ can only be used to assign a value to a simple variable. SETF can be used to assign a value to any generalized variable. A generalized variable is a storage location that can be named by any access function."]

The system produces the text plan shown in Fig. 1 to satisfy the discourse goal of achieving the state in which the user has adopted the domain goal of replacing (SETQ X 1) with (SETF X 1), denoted by (GOAL USER (DO USER REPLACE-1)) in the figure. Basically this text plan does the following. It recommends that the user perform the replacement and then attempts to persuade the user to do this act. To persuade the user to replace SETQ with SETF, it motivates this act in terms of the shared domain goal of enhancing the maintainability of the program. To motivate this act, the system then chooses a strategy that contrasts the object being replaced (SETQ) with the object replacing it (SETF) in terms of the shared domain goal ENHANCE-MAINTAINABILITY. Although there are many differences between SETQ and SETF in the expert system's knowledge base, the difference relevant to the current domain goal of enhancing maintainability is in the generality of their usage. Thus, the system informs the user that SETQ can be used to assign a value to a simple variable and contrasts this with the use of SETF, namely to assign a value to any generalized variable. Finally, as the planner constructs the speech act to inform the user that SETF may be used to assign a value to any generalized variable, it determines from the user model that it must introduce a concept, GENERALIZED-VARIABLE, that is not known to the current user. This causes the planner to post the additional subgoal (BEL USER (CONCEPT GENERALIZED-VARIABLE)) to introduce this new term. This goal is achieved by providing the elaborating information defining the concept in terms of its class membership and defining attributes.

As shown in Fig. 1 and described more fully in Moore and Paris (1989), a text plan represents the roles individual clauses in the text play in achieving discourse goals, as well as how the clauses relate to one another rhetorically. For example, contrasting the usage of SETQ with the usage of SETF is intended to achieve the discourse goal of persuading the user to do the replacement act. Moreover, the portion of text that contains the contrast is rhetorically related to the recommendation by a MOTIVATION relation. Similarly, defining the term GENERALIZED-VARIABLE is intended to make the user know this concept, and this text serves as an ELABORATION for the INFORM act that first introduces the term. In addition, information about what entities are salient at each point in the explanation (attentional information) can be derived from a text plan.

3. Important architectural features

In our system, knowledge about explanation is represented explicitly in a set of plan operators that were derived by studying naturally occurring explanations. These operators integrate multiple sources of knowledge. First, they encode standard ways that discourse (i.e., intentional) goals are achieved by rhetorical means, thus achieving our goal of naturalness. For example, as shown in the figure above, one strategy persuades the user to perform a replacement act using the rhetorical strategy of motivating the act by contrasting the replacee with the replacer in terms of achieving shared goal(s). Second, operators contain applicability constraints that specify the knowledge that must be available if the operator is to be used. These criteria can refer to the expert system's knowledge bases, the user model, or the dialogue history. For example, the operator for explaining a concept by analogy requires that there be an analogous concept that is familiar to the user or has been mentioned previously in the dialogue. Planning using these operators allows the system to produce coherent explanations directly from the expert system's domain knowledge (fidelity). Because operator constraints also reference the user model, the system can tailor the content and organization of its explanations to the individual user (sensitivity) (Paris 1990; Moore and Paris 1992). (In addition, Bateman and Paris (1989) have begun to examine how to tailor the phrasing of the generated texts to the users.)
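To show what such an operator might look like, here is a small, self-contained Common Lisp sketch of an "explain a concept by analogy" operator of the kind just mentioned. The representation (a property list holding closures), the slot names, and the context layout are our own illustrative assumptions, not the EES operator language.

;;; Illustrative sketch of a single explanation plan operator: explain a
;;; concept by analogy.  This is not the EES operator language; the
;;; representation and names are assumptions made for the example.
(defparameter *explain-concept-by-analogy*
  (list
   ;; Effect: the discourse goal the operator can achieve, here making the
   ;; user know the concept in question.
   :effect '(bel user (concept ?concept))
   ;; Applicability constraint: there must be an analogous concept that is
   ;; familiar to the user or has already been mentioned in the dialogue.
   ;; Note that it refers to the knowledge base (analogies), the user
   ;; model, and the dialogue history.
   :constraints
   (lambda (concept context)
     (let ((analog (cdr (assoc concept (getf context :analogies)))))
       (and analog
            (or (member analog (getf context :user-knows))
                (member analog (getf context :dialogue-history))))))
   ;; Rhetorical means: state the analogy, then elaborate on the
   ;; differences between the new concept and the familiar one.
   :subgoals
   (lambda (concept context)
     (let ((analog (cdr (assoc concept (getf context :analogies)))))
       (list `(inform system user (analogous ,concept ,analog))
             `(bel user (someref (differences ,concept ,analog))))))))

;; With a context such as
;;   '(:analogies ((generalized-variable . simple-variable))
;;     :user-knows (simple-variable)
;;     :dialogue-history ())
;; the constraint holds, so the operator could be selected to introduce
;; GENERALIZED-VARIABLE to a user who already knows SIMPLE-VARIABLE.

A fuller operator would also name the rhetorical relation it establishes (e.g., ELABORATION), so that the completed text plan records how the resulting clauses relate to one another, as described above.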
A second important feature of our architecture is that explanations themselves are structured objects that the system can reason about. We have demonstrated that the text plans recorded by our system provide the context necessary for handling a range of dialogue phenomena. More specifically, by reasoning about the prior explanations it has produced, our system is able to interpret users' follow-up questions and answer them in the context of the ongoing dialogue (Moore and Swartout 1989; Moore 1989b), select a perspective when describing or comparing objects (Moore 1989a), and avoid repeating information that has already been communicated or that the user already knows (Moore and Paris 1989). Thus our system exhibits responsiveness and some types of sensitivity to dialogue context.
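As a rough sketch of how recorded text plans support this kind of reasoning, the following Common Lisp code, our own simplification rather than the mechanism of Moore and Swartout (1989), walks the most recently recorded plan to recover what has already been communicated and to find an alternative strategy to try when the user signals that an explanation was not understood.

;;; Simplified sketch (not the actual EES mechanism) of using a recorded
;;; text plan to respond to a vague follow-up question such as "huh?".
;;; A plan node records the goal it was meant to achieve, the strategy
;;; used, the alternatives not chosen, and its subplans.
(defstruct plan-node
  goal          ; discourse goal this node was intended to achieve
  strategy      ; strategy actually used
  alternatives  ; strategies that could have been chosen instead
  subplans)     ; child plan nodes (empty for speech acts)

(defun leaf-goals (node)
  "The speech-act level goals of a recorded plan, in order of presentation.
Useful, e.g., for avoiding repetition of information already communicated."
  (if (null (plan-node-subplans node))
      (list (plan-node-goal node))
      (mapcan #'leaf-goals (plan-node-subplans node))))

(defun replan-candidate (node)
  "Depth-first search of the recorded plan, preferring subplans to their
parent; returns a (goal . alternative-strategy) pair for the first node
found with an untried alternative, or NIL.  A real system would also
consult the user model to decide which part of the explanation failed."
  (or (some #'replan-candidate (plan-node-subplans node))
      (let ((alt (first (plan-node-alternatives node))))
        (when alt (cons (plan-node-goal node) alt)))))

Because the plan records the discourse goal behind each piece of text, the system can re-achieve that goal with a different strategy rather than simply repeating the original explanation.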
4. Future directions

While we have made considerable progress towards achieving our goals, there are several areas in which much work remains. First, there is considerable discussion about the type of planning architecture that is best suited for explanation generation. Some argue for an approach in which an explanation is completely planned and revised by critics before it is generated (Suthers 1991; Lester and Porter 1991). Others advocate a simple goal refinement strategy which facilitates the interleaving of planning with realization (Cawsey 1989; Moore 1989a). There is a clear tradeoff here. The former approach allows explanations to be optimized along various dimensions, but must expend considerable effort planning and revising without feedback from the user. The latter approach allows incremental explanation generation, and thus the developing explanation can adapt to the changing context quickly as user feedback is received. This tradeoff should be further explored through empirical tests. Yet others argue that planning text requires several inherently different types of reasoning, and therefore that a single top-down planning mechanism is insufficient (Hovy 1988; Mooney et al. 1991; Suthers 1991). We believe that it is crucial to clearly identify the range of explanation tasks and the control strategies best suited to achieving each of these tasks. Suthers (1991) has begun this process.

Another issue that concerns us is the feasibility of providing domain-independent explanation strategies, which we must do if we are to provide a shell for constructing explainable expert systems. EES has already been used to construct expert systems in several different domains. In addition to the program enhancement advisor, EES has been used to construct expert systems for diagnosing faults in a space station, diagnosing faults in local area networks, and aiding physicians in prescribing treatment for patients in the cardiac intensive care unit. We have been able to reuse a considerable number of explanation operators across these various domains. These include operators for justifying the system's actions, defining terms, describing objects, and describing the system's general problem-solving strategies. While it is likely that additional domain-specific strategies will be needed in some domains (Kittredge et al. 1991), our initial experience is promising.

Finally, we believe that more work must be done in understanding the nature of explanation dialogues. Explanation dialogues are largely user-controlled, and thus the structure of the dialogue emerges only as the interaction proceeds. We have only begun to understand this structure and the processes that access it. Many interesting questions remain. How should the dialogue history be managed and used? Can things be forgotten (i.e., removed from the dialogue history)? Which things and when? How should later explanations be affected by previous utterances? What information must be recorded in order to guarantee that, taken as a whole, the explanations produced by the system form a coherent set? What is the relationship between the dialogue history and the user model? Are these separate entities? If so, should information migrate from the dialogue history to the user model? What information? How can we handle other dialogue phenomena such as interruptions and subdialogues? Do we require meta-level strategies for managing the dialogue, such as the discourse plans of Litman and Allen (1987) or those of Cawsey (1989)? We hope to be able to study some of these questions in the future.

Acknowledgment

The research described in this paper was supported by the Defense Advanced Research Projects Agency (DARPA) under NASA Ames cooperative agreement number NCC 2-520.

References

BATEMAN, J.A., and PARIS, C.L. 1989. Phrasing a text in terms the user can understand. Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, Detroit, MI, pp. 1511-1517.
CAWSEY, A. 1989. Generating explanatory discourse: a plan-based interactive approach. Ph.D. thesis, University of Edinburgh, Edinburgh, United Kingdom.
HOVY, E.H. 1988. Two types of planning in language generation. Proceedings of the Twenty-Sixth Annual Meeting of the Association for Computational Linguistics, State University of New York, Buffalo, NY, pp. 179-186.
KITTREDGE, R., KORELSKY, T., and RAMBOW, O. 1991. On the need for domain communication knowledge. Computational Intelligence, 7(4): this issue.
LESTER, J., and PORTER, B. 1991. An architecture for planning multi-paragraph pedagogic explanations. Proceedings of the AAAI-91 Workshop on Comparative Analysis of Explanation Planning Architectures, Anaheim, CA, pp. 27-41.
LITMAN, D.J., and ALLEN, J.F. 1987. A plan recognition model for subdialogues in conversations. Cognitive Science, 11: 163-200.
MANN, W.C., and MATTHIESSEN, C. 1983. Nigel: a systemic grammar for text generation. Technical Report RR-83-105, Information Sciences Institute, University of Southern California, Marina del Rey, CA.
MOONEY, D.J., CARBERRY, M., and MCCOY, K.F. 1991. Capturing high-level structure of naturally occurring extended explanations using bottom-up strategies. Computational Intelligence, 7(4): this issue.
MOORE, J.D. 1989a. A reactive approach to explanation in expert and advice-giving systems. Ph.D. thesis, University of California, Los Angeles, CA.
MOORE, J.D. 1989b. Responding to "huh?": answering vaguely articulated follow-up questions. Proceedings of the Conference on Human Factors in Computing Systems, Austin, TX, pp. 91-96.
MOORE, J.D., and PARIS, C.L. 1989. Planning text for advisory dialogues. Proceedings of the Twenty-Seventh Annual Meeting of the Association for Computational Linguistics, Vancouver, B.C., pp. 203-211.
MOORE, J.D., and PARIS, C.L. 1992. User models and dialogue: an integrated approach to producing effective explanations. In User modeling and user-adapted interaction. Kluwer Academic Publishers, Dordrecht, The Netherlands. In press.
MOORE, J.D., and SWARTOUT, W.R. 1989. A reactive approach to explanation. Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, Detroit, MI, pp. 1504-1510.
NECHES, R., SWARTOUT, W.R., and MOORE, J.D. 1985. Enhanced maintenance and explanation of expert systems through explicit models of their development. IEEE Transactions on Software Engineering, SE-11(11): 1337-1351.
PARIS, C.L. 1990. Generation and explanation: building an explanation facility for the explainable expert systems framework. In Natural language generation in artificial intelligence and computational linguistics. Kluwer Academic Publishers, Boston/Dordrecht/London, pp. 49-82.
SUTHERS, D.D. 1991. Task-appropriate hybrid architectures for explanation. Computational Intelligence, 7(4): this issue.
SWARTOUT, W.R. 1983. XPLAIN: a system for creating and explaining expert consulting systems. Artificial Intelligence, 21(3): 285-325. Also available as Report ISI/RS-83-4, Information Sciences Institute, University of Southern California, Marina del Rey, CA.
SWARTOUT, W.R. 1990. Evaluation criteria for expert system explanation. Proceedings of the AAAI-90 Workshop on Evaluation of Natural Language Generation Systems, Boston, MA.
SWARTOUT, W.R., PARIS, C.L., and MOORE, J.D. 1991. Design for explainable expert systems. IEEE Expert, 6(3): 58-64.