M4M 2007
Extension of Description Logics for Reasoning
About Typicality
Laura Giordano1 ,2 ,3
Dipartimento di Informatica
Università del Piemonte Orientale “A. Avogadro”
Alessandria, Italy
Valentina Gliozzi1 ,2 ,4
Dipartimento di Informatica
Università degli Studi di Torino
Torino, Italy
Nicola Olivetti1 ,2 ,5
LSIS-UMR CNRS 6168
Université “Paul Cézanne”
Aix en Provence, France
Gian Luca Pozzato1 ,2 ,6
Dipartimento di Informatica
Università degli Studi di Torino
Torino, Italy
Abstract
We extend the Description Logic ALC with a “typicality” operator T that allows us to reason about
the prototypical properties and inheritance with exceptions. The resulting logic is called ALC + T. The
typicality operator is intended to select the “most normal” or “most typical” instances of a concept.
In our framework, knowledge bases may then contain, in addition to ordinary ABoxes and TBoxes, subsumption relations of the form “T(C) is subsumed by P ”, expressing that typical C-members have the
property P . The semantics of a typicality operator is defined by a set of postulates that are strongly related
to Kraus-Lehmann-Magidor axioms of preferential logic P.
We first show that T enjoys a simple semantics provided by ordinary structures equipped by a preference
relation. This allows us to obtain a modal interpretation of the typicality operator. Using such a modal
interpretation, we present a tableau calculus for deciding satisfiability of ALC + T knowledge bases. Our
calculus gives a nondeterministic-exponential time decision procedure for satisfiability of ALC + T. We then
extend ALC + T knowledge bases by a nonmonotonic completion that allows inferring defeasible properties
of specific concept instances.
Keywords: Description Logics, Nonmonotonic Reasoning, Preferential Logics, Tableaux Calculi.
This paper is electronically published in
Electronic Notes in Theoretical Computer Science
URL: www.elsevier.nl/locate/entcs
L. Giordano et al
1
Introduction
The family of description logics (DLs) is one of the most important formalisms of
knowledge representation. DLs are reminiscent of the early semantic networks and
of frame-based systems. They offer two key advantages: a well-defined semantics
based on first-order logic and a good trade-off between expressivity and complexity.
DLs have been successfully implemented by a range of systems and they are at
the base of languages for the semantic web such as OWL. A DL knowledge base
(KB) comprises two components: (i) the TBox, containing the definition of concepts
(and possibly roles), and a specification of inclusions relations among them, and (ii)
the ABox containing instances of concepts and roles, in other words, properties and
relations of individuals. Since the very objective of the TBox is to build a taxonomy
of concepts, the need of representing prototypical properties and of reasoning about
defeasible inheritance of such properties easily arises. The traditional approach is to
handle defeasible inheritance by integrating some kind of nonmonotonic reasoning
mechanism. This has led to study nonmonotonic extensions of DLs [1,2,3,5,6,7,13].
However, finding a suitable nonmonotonic extension for inheritance reasoning with
exceptions is far from obvious. Let us put forward some desiderata for such an
extension:
(i) The (nonmonotonic) extension must have a clear semantics and should be based
on the same semantics as the underlying monotonic DL.
(ii) The extension should allow to specify prototypical properties in a natural and
direct way.
(iii) The extension must be decidable, if so is the underlying monotonic DL and,
possibly, computationally effective.
The nonmonotonic extensions proposed in the literature do not seem to fully satisfy
all the above desiderata.
[1] proposes the extension of DL with Reiter’s default logic. However, the same
authors have pointed out that this integration may lead to both semantical and
computational difficulties. Indeed, the unsatisfactory treatment of open defaults via
Skolemization may lead to an undecidable default consequence relation. For this
reason, [1] proposes a restricted semantics for open default theories, in which default
rules are only applied to individuals explicitly mentioned in the ABox. Furthermore,
Reiter’s default logic does not provide a direct way of modeling inheritance with
exceptions. This has motivated the study of extensions of DLs with prioritized
defaults [13,2].
A more general approach is undertaken in [6], where it is proposed an extension
of DL with two epistemic operators. This extension allows one to encode Reiter’s
default logic as well as to express epistemic concepts and procedural rules. However,
1 This paper has been also presented at the 13th Conference on Logic for Programming, Artificial Intelligence, and Reasoning (LPAR 2007) [10].
2 This research has been partially supported by “MIUR PRIN05: Specification and verification of agent
interaction protocols”.
3 Email: laura@mfn.unipmn.it
4 Email: gliozzi@di.unito.it
5 Email: nicola.olivetti@univ-cezanne.fr
6 Email: pozzato@di.unito.it
2
L. Giordano et al
this extension has a rather complicated modal semantics, so that the integration
with the existing systems requires significant changes to the standard semantics of
DLs.
In [3] the authors propose an extension of DL with circumscription. One of motivating applications of circumscription is indeed to express prototypical properties
with exceptions, and this is done by introducing “abnormality” predicates, whose
extension is minimized. The authors provide decidability and complexity results
based on theoretical analysis. However, they do not provide a calculus for their
logic. Moreover, the use of circumscription to model inheritance with exceptions is
not that straightforward, as we remark below.
In this work, we propose a novel approach to defeasible inheritance reasoning
based on the typicality operator T. The intended meaning is that, for any concept
C, T(C) singles out the instances of C that are considered as “typical” or “normal”.
Thus assertions as “normally students do not pay taxes”, or “typically users do not
have access to confidential files” [3] are represented by T(Student) ⊑ ¬TaxPayer
and T(User ) ⊑ ¬∃hasAccess.ConfidentialFile .
Before entering in the technical details, let us sketch how we intend to use the
typicality operator and what kind of inferential services we expect to profit. We
assume that a KB comprises, in addition to the standard TBox and ABox, a set
of assertions of the type T(C) ⊑ D where D is a concept not mentioning T. The
reasoning system should be able to infer or propagate prototypical properties of the
concepts specified in the TBox, then to ascribe defeasible properties to individuals.
For instance, let the KB contain:
T(Student ) ⊑ ¬TaxPayer
T(Student ⊓ Worker ) ⊑ TaxPayer
T(Student ⊓ W orker ⊓ ∃HasChild .⊤) ⊑ ¬TaxPayer
T(Unemployed ) ⊑ ¬TaxPayer
corresponding to the assertions: normally a student does not pay taxes, normally a
working student pays taxes, but normally a working student having children does
not pay taxes (because he is discharged by the government) etc...Observe that,
if the same properties were expressed by ordinary inclusions, such as Student ⊑
¬TaxPayer etc...we would simply get that there are not working students and so
on, thus the KB would collapse. This collapse is avoided as we do not assume that T
is monotonic, that is to say C ⊑ D does not imply T(C) ⊑ T(D). To continue with
the example, if the TBox contains PersonWithNoIncome ≡ Student ⊔ Unemployed ,
then the system should be able to infer T(PersonWithNoIncome ) ⊑ ¬TaxPayer .
Suppose next that the ABox contains alternatively the following facts about john:
1. student (john)
2. student (john), worker (john)
3. student (john), worker (john), ∃HasChild .⊤(john)
Then the reasoning system should be able to infer the expected (defeasible) conclusion about john in each case:
1. ¬TaxPayer (john)
2. TaxPayer (john)
3
L. Giordano et al
3. ¬TaxPayer (john)
Observe that setting up a similar specification of the KB by using default logic
or circumscription is not that simple: with default logic [1,2,5,6,7,13], one has
to specify a priority on default application (or one has to find a smart encoding of defaults giving priority to more specific information); with circumscription [3], one has to introduce abnormality predicates, and then establish which
predicates are minimized, fixed, or variable, and finally for the minimized ones
what is their priority with respect to minimization (a total or a partial order).
As a further step, the system should be able to infer (defeasible) properties also
of individuals implicitly introduced by existential restrictions, for instance, if the
ABox further contains ∃HasChild.Student (jack), it should conclude (defeasibly)
∃HasChild .¬TaxPayer (jack).
Given the nonmonotonic character of the T operator, there is a difficulty with
handling irrelevant information, for instance, given the KB as above, one should be
able to infer as well:
T(Student ⊓ SportLover ) ⊑ ¬TaxPayer
T(Student ⊓ Worker ⊓ SportLover ) ⊑ TaxPayer
as SportLover is irrelevant with respect to being a TaxPayer or not. For the same
reason, the conclusion about john being a TaxPayer or not should not be influenced
by the addition of SportLover (john) to the ABox. We refer to this problem as the
problem of Irrelevance.
In this paper we lay down the base of an extension of DL with a typicality operator. Our starting point is a monotonic extension of the basic ALC with the T
operator. The operator is supposed to satisfy a set of postulates that are essentially
a reformulation of Kraus, Lehmann, and Magidor (KLM) axioms of preferential
logic, namely the assertion T(C) ⊑ P is equivalent to the conditional assertion
C |∼ P of KLM preferential logic P. It turns out that the semantics of the typicality
operator can be equivalently specified by considering a preference relation (a strict
partial order) on individuals: the typical members of a concept C are just the most
preferred individuals, or “most normal”, of C according to the preference relation.
The preference relation is the only additional ingredient that we need in our semantics. We assume that “most normal” members of a concept C always exist, whenever
the concept C is non-empty. This assumption corresponds to the Smoothness Condition of KLM logics, or the well-known Limit Assumption in conditional logics.
Taking advantage of this semantic setting, we can give a modal interpretation to
the typicality operator: the modal operator ✷ has intuitively the same properties
as in Gödel-Löb modal logic G of arithmetic provability. We then define a tableau
system for this extension based on this modal interpretation, thereby obtaining a
decision procedure and an upper complexity bound (NEXPTIME).
These are the main results of the paper, however the monotonic extension is not
enough to perform inheritance reasoning of the kind described above: (i) we need a
way of inferring defeasible properties of individuals, (ii) we need a way of handling
Irrelevance.
In the paper we deal with (i) by defining a completion of an ABox: the idea is
that each individual is assumed to be a typical member of the most specific concept
4
L. Giordano et al
to which it belongs. Such a completion allows to perform inferences as the ones
1.,2.,3. above. The paper outlines how we intend to cope with typicality of all
instances and with Irrelevance. In particular, dealing with Irrelevance (ii) requires
a nonmonotonic mechanism. The idea is to complete a KB with a set of default
rules. The default rules are not used to express defeasible properties of concepts (as
in default extension of DLs), but to propagate defeasible properties of a concept to
its subsumed concepts, e.g. to infer T(Student ⊓ SportLover ) ⊑ ¬TaxPayer from
T(Student ) ⊑ ¬TaxPayer . Thus in our approach the nonmonotonic mechanism is
only needed to handle Irrelevance, whose treatment by means of default rules will
be the object of future work.
2
The logic ALC + T: the typicality operator T
We consider an alphabet of concept names C, of role names R, and of individuals
O. The language L of the logic ALC + T is defined by distinguishing concepts
and extended concepts as follows: (Concepts) A ∈ C and ⊤ are concepts of L;
if C, D ∈ L and R ∈ R, then C ⊓ D, C ⊔ D, ¬C, ∀R.C, ∃R.C are concepts of L.
(Extended concepts) if C is a concept, then C and T(C) are extended concepts, and
all the boolean combinations of extended concepts are extended concepts of L. A
knowledge base is a pair (TBox,ABox). TBox contains subsumptions C ⊑ D, where
C ∈ L is either a concept or an extended concept T(C ′ ), and D ∈ L is a concept.
ABox contains expressions of the form C(a) and aRb where C ∈ L is an extended
concept, R ∈ R, and a, b ∈ O.
In order to provide a semantics to the operator T, we extend the definition of a
model used in “standard” terminological logic ALC:
Definition 2.1 [Semantics of T with selection function] A model is any structure
h∆, I, fT i, where: ∆ is the domain; I is the extension function that maps each
extended concept C to C I ⊆ ∆, and each role R to a RI ⊆ ∆I × ∆I . I is defined in
the usual way (as for ALC) and, in addition, (T(C))I = fT (C I ). fT : P ow(∆) →
P ow(∆) is a function satisfying the following properties:
(fT − 2) if S "= ∅, then also fT (S) "= ∅
!
!
(fT − 4) fT ( Si ) ⊆
fT (Si )
(fT − 3) if fT (S) ⊆ R, then fT (S) = fT (S ∩ R)
!
"
(fT − 5)
fT (Si ) ⊆ fT ( Si )
(fT − 1) fT (S) ⊆ S
Intuitively, given the extension of some concept C, fT selects the typical instances
of C. (fT − 1) requests that typical elements of S belong to S. (fT − 2) requests
that if there are elements in S, then there are also typical such elements. The next
properties constraint the behavior of fT with respect to ∩ and ∪ in such a way that
they do not entail monotonicity. According to (fT − 3), if the typical elements of S
are in R, then they coincide with the typical elements of S ∩ R, thus expressing a
weak form of monotonicity (namely cautious monotonicity). (fT − 4) corresponds
S
S
to one direction of the equivalence fT ( Si ) = fT (Si ), so that it does not entail
T
T
monotonicity. Similar considerations apply to the equation fT ( Si ) = fT (Si ),
T
T
of which only the inclusion fT (Si ) ⊆ fT ( Si ) is derivable. (fT − 5) is a further
constraint on the behavior of fT with respect to arbitrary unions and intersections;
5
L. Giordano et al
it would be derivable if fT were monotonic. We can prove the following proposition:
Proposition 2.2 fT (S ∪ R) ∩ S ⊆ fT (S)
We can give an alternative semantics for T based on a preference relation. The idea
is that there is a global preference relation among individuals and that the typical
members of a concept C, i.e. selected by fT (C I ), are the minimal elements of C
with respect to this preference relation. Observe that this notion is global, that is to
say, it does not compare individuals with respect to a specific concept (something
like y is more typical than x with respect to concept C). In this framework, an
object x ∈ ∆ is a typical instance of some concept C, if x ∈ C I and there is no Celement in ∆ more typical than x. The typicality preference relation is partial since
it is not always possible to establish which object is more typical than which other.
The following definition is needed before we provide the Representation Theorem.
Definition 2.3 Given a relation <, which is a strict partial order (i.e. an irreflexive
and transitive relation) over a domain ∆, for all S ⊆ ∆, we define M in< (S) = {x :
x ∈ S and 6 ∃y ∈ S s.t. y < x}. We say that < satisfies the Smoothness Condition
iff for all S ⊆ ∆, for all x ∈ S, either x ∈ M in< (S) or ∃y ∈ M in< (S) such that
y < x.
We are now ready to prove the Representation Theorem below, showing that given
a model with a selection function, we can define on the same domain a preference
relation < such that, for all S ⊆ ∆, fT (S) = M in< (S). Notice that, as a difference
with respect to related results (Theorem 3 in [12]), the relation is defined on the
same domain ∆ of fT . On the other hand, if < is a preference relation satisfying
the Smoothness Condition, then the operator defined as fT (S) = M in< (S) satisfies
the postulates of Definition 2.1.
Theorem 2.4 (Representation Theorem) Given any model h∆, I, fT i, fT satisfies postulates (fT − 1) to (fT − 5) above iff it is possible to define on ∆ a strict
partial order <, satisfying the Smoothness Condition, such that for all S ⊆ ∆,
fT (S) = M in< (S).
Proof. (“Only if ” direction) Given fT satisfying postulates (fT − 1) to (fT − 5),
we define < as follows: for all x, y ∈ ∆, we let x < y if ∀S ⊆ ∆, if y ∈ fT (S) then
x 6∈ S, and ∃R ⊆ ∆ such that S ⊂ R and x ∈ fT (R). We prove that:
(i) < is irreflexive. Easily follows by the definition of <.
(ii) < is transitive. Let (a) x < y and (b) y < z. Let z ∈ fT (S) for some S, then
by definition of <, y 6∈ S, and ∃R s.t. S ⊂ R and y ∈ fT (R). Furthermore,
x 6∈ R and ∃Q : R ⊂ Q and x ∈ fT (Q). From this we can conclude that x 6∈ S
(otherwise x ∈ R), and S ⊂ Q, hence x < z.
(iii) fT (S) ⊆ M in< (S). Let x ∈ fT (S). Suppose x 6∈ M in< (S), i.e. for some
y ∈ S, y < x. By definition of <, y 6∈ S, contradiction, hence x ∈ M in< (S).
(iv) M in< (S) ⊆ fT (S). Let x ∈ M in< (S). Then x ∈ S, i.e. S 6= ∅. By (fT − 2),
S
fT (S) 6= ∅. Suppose x 6∈ fT (S). Consider Ri for all Ri ⊆ ∆ s.t. x ∈ fT (Ri ).
S
By (fT − 5), we have x ∈ fT ( Ri ).
S
S
S
Consider now fT ( Ri ∪ S). We can easily show that fT ( Ri ∪ S) 6⊆ Ri
6
L. Giordano et al
S
S
(otherwise, by (fT − 3) fT ( Ri ∪ S) = fT ( Ri ), and by Proposition 2.2,
S
S
fT ( Ri ) ∩ S ⊆ fT (S), which contradicts the fact that x ∈ fT ( Ri ) but
S
S
x 6∈ fT (S)). Consider hence y ∈ fT ( Ri ∪ S) s.t. y 6∈ Ri . By definition of
S
S
<, y < x. Furthermore, by (fT − 1) y ∈ S (since y ∈ Ri ∪ S and y 6∈ Ri ).
It follows that x 6∈ M in< (S), contradiction, hence M in< (S) ⊆ fT (S).
(v) < satisfies the Smoothness Condition. Let S 6= ∅ and x ∈ S. If x ∈ fT (S) then
by point 3 we have x ∈ M in< (S). If x 6∈ fT (S) we can reason as for point 4
S
S
to conclude that there is y ∈ fT ( Ri ∪ S) s.t. y 6∈ Ri (hence y ∈ S), and
y < x. By Proposition 2.2, we have y ∈ fT (S), hence by point 3 we conclude
y ∈ M in< (S).
The points above allow us to conclude.
(“If ” direction) Given a strict partial order < satisfying the Smoothness Condition,
we can define fT : P ow(∆) → P ow(∆) by letting fT (S) = M in< (S). It can be
easily shown that fT satisfies postulates (fT − 1) to (fT − 5). The proof is left to
the reader.
✷
Having the above Representation System, from now on, we will refer to the following
semantics for ALC + T:
Definition 2.5 [Semantics of ALC + T] A model M is any structure h∆, <, Ii,
where ∆ and I are defined as in Definition 2.1, and < is a strict partial order over ∆
satisfying the Smoothness Condition (see Definition 2.3 above). As a difference with
respect to Definition 2.1, the semantics of the T operator is: (T(C))I = M in< (C I ).
For concepts (built from operators of ALC), C I is defined in the usual way.
Definition 2.6 [Model satisfying a Knowledge Base] Consider a model M, as defined in Definition 2.5. We extend I so that it assigns to each individual a of O an
element aI of the domain ∆. Given a KB (TBox,ABox), we say that:
•
M satisfies TBox if for all inclusions C ⊑ D in TBox, and all elements x ∈ ∆, if
x ∈ C I then x ∈ D I .
•
M satisfies ABox if: (i) for all C(a) in ABox, we have that aI ∈ C I , (ii) for all
aRb in ABox, we have that (aI , bI ) ∈ RI .
M satisfies a knowledge base if it satisfies both its TBox and its ABox.
Notice that the meaning of T can be split into two parts: for any object x of the
domain ∆, x ∈ (T(C))I just in case (i) x ∈ C I , and (ii) there is no y ∈ C I such
that y < x. In order to isolate the second part of the meaning of T (for the purpose
of the calculus that we will present in section 3) we introduce a new modality
whose interpretation in M is defined as follows.
Definition 2.7 (C)I = {x ∈ ∆ | for every y ∈ ∆, if y < x then y ∈ C I }
The basic idea is simply to interpret the preference relation < as an accessibility
relation. By the Smoothness Condition, it turns out that the modality has the
properties of Gödel-Löb modal logic of provability G. The Smoothness Condition
ensures that typical elements of C I exist whenever C I 6= ∅, by preventing infinitely
7
L. Giordano et al
descending chains of elements. This condition therefore corresponds to the finitechain condition on the accessibility relation (as in G). A similar correspondence has
been presented in [9,8] to interpret the preference relation in KLM logics.
The following relation between T and holds:
Proposition 2.8 For all x ∈ ∆, we have x ∈ (T(C))I iff x ∈ C I and x ∈ (¬C)I
Since we only use to capture the meaning of T, in the following we will always
use followed by a negated concept, as in ¬C.
We can establish the following equation between our typicality operator and the
nonmonotonic (conditional) inference operator |∼ (describing what can be typically
derived from a given premise) by letting C |∼ D iff T(C) ⊑ D. It can be easily shown
that there is a correspondence between the properties of T and the properties of |∼
in the system P described in [12].
3
Reasoning
In this section we present a tableau calculus for deciding the satisfiability of a knowledge base. Given a KB (TBox,ABox), any concrete reasoning system should provide
the usual reasoning services, namely satisfiability of the KB, concept satisfiability,
subsumption, and instance checking. It is well known that the latter three services
are reducible to the satisfiability of a KB.
We introduce a labelled tableau calculus for our logic ALC + T, which enriches
the labelled tableau calculus for ALC presented in [4]. The calculus is called T ALC+T
and it is based on the notion of constraint system. We consider a set of variables
drawn from a denumerable set V. T ALC+T makes use of labels, which are denoted
with x, y, z, . . .. Labels represent objects. An object is either a variable or an
individual of the ABox, that is to say an element of O ∪ V.
R
A constraint is a syntactic entity of the form x −→ y or x : C, where R is a role
and C is either an extended concept or has the form ¬D or ¬¬D, where D is a
concept. As we will define in Definition 3.1, the ABox of an ALC+T-knowledge base
can be translated into a set of constraints by replacing every membership assertion
R
C(a) with the constraint a : C and every role aRb with the constraint a −→ b. A
tableau is a tree whose nodes are pairs hS | U i, where:
R
•
S contains constraints (or labelled formulas) of the form x : C or x −→ y;
•
U contains formulas of the form C ⊑ DL , representing subsumption relations
C ⊑ D of the TBox. L is a list of labels. As we will discuss later, this list is used
in order to ensure the termination of the tableau calculus.
A node hS | U i is also called a constraint system. A branch is a sequence of
nodes hS1 | U1 i, hS2 | U2 i, . . . , hSn | Un i . . ., where each node hSi | Ui i is obtained
by its immediate predecessor hSi−1 | Ui−1 i by applying a rule of T ALC+T , having
hSi−1 | Ui−1 i as the premise and hSi | Ui i as one of its conclusions. A branch is
closed if one of its nodes is an instance of (Clash), otherwise it is open. We say that
a tableau is closed if all its branches are closed.
Given a KB, we define its corresponding constraint system as follows:
8
L. Giordano et al
Definition 3.1 [Corresponding constraint system] Given an ALC + T-knowledge
base (TBox,ABox), we define its corresponding constraint system hS | U i as follows:
R
S = {a : C | C(a) ∈ ABox} ∪ {a −→ b | aRb ∈ ABox} and U = {C ⊑ D ∅ | C ⊑
D ∈ T Box}.
Definition 3.2 [Model satisfying a constraint system] Let M be a model as defined
in Definition 2.6. We define a function α which assigns to each variable of V an
element of ∆, and assigns every individual a ∈ O to aI ∈ ∆. M satisfies x : C
R
under α if α(x) ∈ C I and x −→ y under α if (α(x), α(y)) ∈ RI . A constraint system
hS | U i is satisfiable if there is a model M and a function α such that M satisfies
under α every constraint in S and that, for all C ⊑ D ∈ U and for all x occurring
in S, we have that if α(x) ∈ C I then α(x) ∈ D I .
It can be easily shown that:
Proposition 3.3 Given an ALC + T-knowledge base, it is satisfiable if and only if
its corresponding constraint system is satisfiable.
Therefore, in order to check the satisfiability of (TBox,ABox), we build its corresponding constraint system hS | U i, and then we use T ALC+T to check the satisfiability of hS | U i. In order to check a constraint system hS | U i for satisfiability,
our calculus T ALC+T adopts the usual technique of applying the rules until either
a contradiction is generated (Clash) or a model satisfying hS | U i can be obtained
from the resulting constraint system.
In order to take into account the TBox, we use a technique of unfolding, similar
to the one described in [4]. Given a node hS | U i, for each subsumption C ⊑ D L ∈
U and for each label x that appears in the tableau, we add to S the constraint
x : ¬C ⊔ D. As mentioned above, each formula C ⊑ D is equipped by the list L
of labels in which it has been unfolded in the current branch. This is needed in
order to avoid multiple unfolding of the same subsumption by using the same label,
generating non-termination in a proof search.
Before introducing the rules of T ALC+T we need some more definitions. First,
as in [4], we assume that labels are introduced in a tableau according to an ordering
≺, that is to say if y is introduced in the tableau, then x ≺ y for all labels x that
are already in the tableau.
Given a tableau node hS | U i and an object x, we define σ(hS | U i, x) = {C |
x : C ∈ S}. Furthermore, we say that two labels x and y are S-equivalent, written
x ≡S y, if they label the same set of concepts, i.e. σ(hS | U i, x) = σ(hS | U i, y).
Intuitively, S-equivalent labels can represent the same element in the model built
M
= {y : C, y : ¬C | x : ¬C ∈ S}.
by the rules of T ALC+T . Last, we define Sx→y
ALC+T
The rules of T
are presented in Figure 1. Rules (∃+ ), (∀− ), and (− ) are
called dynamic since they introduce a new variable in their conclusions. The other
rules are called static. We do not need any extra rule for the positive occurrences of
M . The
the operator, since these are taken into account by the computation of Sx→y
side conditions on the rules (∃+ ) and (∀− ) are introduced in order to ensure a terminating proof search, by implementing the standard blocking technique described
below. The rules of T ALC+T are applied with the following standard strategy: 1.
apply a rule to a variable x ∈ V only if no rule is applicable to a variable y ∈ V such
9
L. Giordano et al
!S, x : ¬¬C | U "
!S, x : C, x : ¬C | U " (Clash)
!S, x : T(C) | U "
!S, x : C, x : !¬C | U "
!S, x : C | U "
!S, x : ¬T(C) | U "
(T+ )
!S, x : ¬C | U "
R
!S, x : ∀R.C, x −→ y | U %
!S, x : ∀R.C, x −→ y, y : C | U %
!S, x : ¬∀R.C | U #
(T− )
!S, x : ∃R.C | U #
(∃+ )
y new
R
!S, x : ∃R.C, x −→ y, y : C | U % if ! ∃z ≺ x s.t. z ≡S,x:∃R.C x and
R
(∀+ )
R
!S, x : ¬!¬C | U "
(¬)
if y : C !∈ S
! ∃u s.t. x −→ u ∈ S and u : C ∈ S
R
!S, x : ¬∃R.C, x −→ y | U %
(∀− )
y new
!S, x : ¬∀R.C, x −→ y, y : ¬C | U % if ! ∃z ≺ x s.t. z ≡S,x:¬∀R.C x and
R
! ∃u s.t. x −→ u ∈ S and u : ¬C ∈ S
R
R
(∃− )
!S, x : ¬∃R.C, x −→ y, y : ¬C | U %
if y : ¬C !∈ S
L
!S, x : ¬!¬C | U "
M
!S, y : C, y : !¬C, Sx→y
| U"
!S | U, C ⊑ D #
(!− )
!S, x : ¬C ⊔ D | U, C ⊑ DL,x $
y new
(Unfold)
if x occurs in S and x !∈ L
Fig. 1. The calculus T ALC+T . To save space, we omit the standard rules for ⊔ and ⊓.
that y ≺ x; 2. apply dynamic rules ((− ) first) only if no static rule is applicable.
This strategy ensures that the variables are considered one at a time according to
the ordering ≺. Consider an application of a dynamic rule to a variable x of a constraint system hS | U i. For all hS ′ | U ′ i obtained from hS | U i by a sequence of rule
applications, it can be easily shown that (i) no rule can be applied in hS ′ | U ′ i to
a variable y s.t. y ≺ x and (ii) σ(hS | U i, x) = σ(hS ′ | U ′ i, x). The calculus so obtained is sound and complete with respect to the semantics described in Definition
3.2 (we omit the proof to save space):
Theorem 3.4 (Soundness and Completeness of T ALC+T ) Given a constraint
system hS | U i, it is unsatisfiable iff it has a closed tableau.
An example of derivation in T ALC+T in presented in Figure 2.
!x : T(A), x : S | T(A) ⊑ T ∅ , T(S) ⊑ A∅ , T(S) ⊑ ¬T ∅ #
!x : A, x : !¬A, x : S | . . ."
!x : ¬T(A) ⊔ T, x : A, x : !¬A, x : S | T(A) ⊑ T
!x : ¬T(A), x : A, x : !¬A, x : S | . . ."
!x : ¬A, x : A, . . . | . . ."
(Clash)
!x : ¬!¬A, x : !¬A, . . . | . . ."
(Clash)
{x}
, . . .$
(T+ )
(Unfold)
(⊔+ )
!x : T, x : A, x : !¬A, x : S | . . ."
(Unfold)
!x : ¬T(S) ⊔ ¬T, x : T, x : !¬A, x : S, . . . | T(S) ⊑ ¬T {x} , . . .$
(T− )
!x : ¬T(S), x : !¬A, x : S, . . . | . . ."
(T− )
!x : ¬!¬S, x : !¬A, . . . | . . ."
(!− )
!y : S, y : !¬S, y : ¬A, . . . | . . ."
!x : ¬S, x : S, . . . | . . ."
(Clash)
!x : ¬T, x : T, . . . | . . ."
(Clash)
(⊔+ )
(Unfold)
!y : ¬T(S) ⊔ A, y : S, y : !¬S, y : ¬A, . . . | T(S) ⊑ A{y} , . . .$
(⊔+ )
!y : ¬A, y : A, . . . | . . ."
−
(T )
(Clash)
!y : ¬!¬S, y : !¬S, . . . | . . ."
(Clash)
!y : ¬T(S), y : S, y : !¬S, . . . | . . ."
!y : ¬S, y : S, . . . | . . ."
(Clash)
Fig. 2.
A closed tableau showing that T(Adult ) ⊑ ¬Student can be inferred from
{T(Adult ) ⊑ TaxPayer , T(Student) ⊑ Adult , T(Student) ⊑ ¬TaxPayer }. It is shown that the KB whose
TBox contains the above subsumptions, and whose ABox contains an x which is both a typical Adult and
a Student, is unsatisfiable. To save space, we use A for Adult , T for TaxPayer , and S for Student.
Let us conclude this section by analyzing termination and complexity of T ALC+T .
In general, non-termination in labelled tableau calculi can be caused by two different
reasons: 1. some rules copy their principal formula in the conclusion(s), and can
thus be reapplied over the same formula without any control; 2. dynamic rules may
generate infinitely-many labels, creating infinite branches.
10
L. Giordano et al
Concerning the first source of non-termination (point 1), the only rules copying
their principal formulas in their conclusions are (∀+ ), (∃− ), (Unfold), (∀− ), and
(∃+ ). However, the side conditions on these rules avoid multiple applications on the
same formula. Indeed, (Unfold) can be applied to a constraint system hS | U, C ⊑
DL i by using the label x only if it has not yet been applied to x in the current
branch (i.e. x does not belong to L). Concerning (∀+ ), the rule can be applied
R
to hS, x : ∀R.C, x −→ y | U i only if y : C does not belong to S. When y : C is
introduced in the branch, the rule will not further apply to x : ∀R.C. The same for
(∃− ), (∃+ ), and (∀− ).
Concerning the second source of non-termination (point 2), we can prove that we
only need to adopt the standard loop-checking machinery known as blocking, which
ensures that the rules (∃+ ) and (∀− ) do not introduce infinitely-many labels on a
branch. Thanks to the properties of , no other additional machinery is required to
ensure termination. Indeed, we can show that the interplay between rules (T− ) and
(− ) does not generate branches containing infinitely-many labels. Let us discuss
the termination in more detail.
Without the side conditions on the rules (∃+ ) and (∀− ), the calculus T ALC+T
does not ensure a terminating proof search. Indeed, given a constraint system
hS | U i, it could be the case that (∃+ ) is applied to a constraint x : ∃R.C ∈ S,
R
introducing a new label y and the constraints x −→ y and y : C. If an inclusion T(∃R.C) ⊑ D belongs to U , then (Unfold) can be applied by using y, thus
generating a branch containing y : ¬T(∃R.C), to which (T− ) can be applied introducing y : ¬¬(∃R.C). An application of (− ) introduces a new variable z and
the constraint z : ∃R.C, to which (∃+ ) can be applied generating a new label u.
(Unfold) can then be re-applied on T(∃R.C) ⊑ D by using u, incurring a loop. In
order to prevent this source of non termination, we adopt the standard technique
of blocking: the side condition of the (∃+ ) rule says that this rule can be applied to
a node hS, x : ∃R.C | U i only if there is no z occurring in S such that z ≺ x and
z ≡S,x:∃R.C x. In other words, if there is an “older” label z which is equivalent to
x with respect to S, x : ∃R.C, then (∃+ ) is not applicable, since the condition and
the strategy imply that the (∃+ ) rule has already been applied to z. In this case,
we say that x is blocked by z. The same for (∀− ).
As mentioned, another possible source of infinite branches could be determined
by the interplay between rules (T− ) and (− ). This cannot occur, i.e. the interplay between these two rules does not generate branches containing infinitely-many
labels. Intuitively, the application of (− ) to x : ¬¬C adds y : ¬C to the
conclusion, so that (T− ) can no longer consistently introduce y : ¬¬C. This is
due to the properties of (no infinite descending chains of < are allowed). More
in detail, if (Unfold) is applied to T(C) ⊑ D by using x, an application of (T− )
introduces a branch containing x : ¬¬C; when a new label y is generated by an
application of (− ) on x : ¬¬C, we have that y : ¬C is added to the current
constraint system. If (Unfold) and (T− ) are also applied to T(C) ⊑ D on the new
label y, then the conclusion where y : ¬¬C is introduced is closed, by the presence
of y : ¬C. By this fact, we do not need to introduce any loop-checking machinery
on the application of (− ).
The following Theorem holds:
11
L. Giordano et al
Theorem 3.5 (Termination of T ALC+T ) Let hS | U i be a constraint system,
then any tableau generated by T ALC+T is finite.
Since the calculus T ALC+T is sound and complete (Theorem 3.4), and since an
ALC + T-knowledge base is satisfiable iff its corresponding constraint system is
satisfiable (Proposition 3.3), from Theorem 3.5 above it immediately follows that:
Theorem 3.6 (Decidability) Checking whether a given ALC + T-knowledge base
(TBox,ABox) is satisfiable is a decidable problem.
Let us conclude this section with a complexity analysis of the calculus T ALC+T :
Theorem 3.7 (Complexity) Given an ALC + T-knowledge base (TBox,ABox),
the problem of checking whether it is satisfiable can be solved in nondeterministic
exponential time.
Proof. We first show that the number of labels generated on a branch is at most
exponential in the size of KB. Let n be the size of a KB. Given a constraint system
hS | U i, the number of extended concepts appearing in hS | U i, including also all
the ones appearing as a subformula of other concepts, is O(n). As there are at most
O(n) concepts, there are at most O(2n ) variables labelling distinct sets of concepts.
Hence, there are O(2n ) non-blocked variables in S.
Let m be the maximum number of direct successors of each variable x ∈ S,
obtained by applying dynamic rules. m is bound by the number of ∃R.C concepts (O(n)) plus the number of ¬∀R.C concepts (O(n)) plus the number of ¬¬C
concepts (O(n)). Therefore, there are at most O(2n × m) variables in S, where
m ≤ 3n. The number of individuals in the ABox is bound by n too, and each individual has at most m direct successors. The number of labels in S is then bound
by O((2n + n) × m), and hence by O(22n ).
For a given label x, the concepts labelled by x introduced in the branch (namely,
all the possible subconcepts of the initial constraint system, as well as all boxed
subconcepts) are O(n). According to the standard strategy, after all static rules
have been applied to a label x in phase 1, no other concepts labelled by x can
be introduced later on a branch. Hence, the labelled concepts introduced on the
branch is O(n) for each label, and the number of all labelled concepts on the branch
is O(n × 22n ). Therefore, a branch can contain at most an exponential number of
applications of tableau rules.
The satisfiability of a KB can thus be solved by defining a procedure which
nondeterministically generates an open branch of exponential size (in the size of
KB). The problem is in NEXPTIME.
✷
Although we have shown that the satisfiability is in NEXPTIME, in future research
we will further consider whether this bound is optimal or not.
4
Reasoning about Typicality
Logic ALC + T allows one to reason monotonically about typicality. In ALC + T
we can consistently express, for instance, the fact that three different concepts, as
12
L. Giordano et al
student, working student and working student with children, have a different status
as taxpayers.
What about the typical properties of an individual john that we know being a
working student, and having children? Of course, if we know that john is a typical
instance of the concept Student ⊓ Worker ⊓ ∃HasChild .⊤, i.e. if the ABox contains
the assertion (∗) T(Student ⊓ Working ⊓ ∃HasChild .⊤)(john), then, in ALC + T, we
can conclude that ¬TaxPayer (john). However, in absence of (*), we cannot derive
¬TaxPayer (john).
In general, we would like to infer that individuals have the properties which are
typical of the most specific concept to which they belong. To this purpose, we define
a completion of the knowledge base which adds to the ABox, for each individual a
occurring in the ABox, the assertion that a is a typical instance of the most specific
concept C to which it belongs. Although in general ABoxes can contain typicality
assertions about individuals, in practice we assume that typicality assertions are
automatically generated by the system by means of the completion, and are not
inserted by the user. From now on, we therefore assume that the initial ABox of a
KB does not contain any typicality assertion.
Definition 4.1 [Completion of a Knowledge Base] The KB (TBox,ABox’) is the
completion of the KB (TBox,ABox), if ABox’ is obtained from ABox by adding to
it, for all individual names a in the ABox, the assertion T(C1 ⊓ . . . ⊓ Cj )(a), where
C1 , . . . , Cj are all the concepts Ci such that: (1) Ci is a subconcept of any concept
occurring in (TBox,ABox); (2) Ci does not contain T; (3) a is an instance of Ci ,
i.e. Ci (a) is derivable in ALC from (TBox,ABox).
For instance, assuming that Student(john), Worker (john) and ∃HasChild .⊤(john)
are the only assertions concerning john derivable from the KB, the completion above
would add T(Student ⊓ Worker ⊓ ∃HasChild .⊤)(john) to the ABox, as Student ⊓
Worker ⊓ ∃HasChild .⊤ is the most specific concept of which john is an instance.
From this, we can conclude in ALC + T that john does not pay taxes.
The completion adds T(C1 ⊓ . . . ⊓ Cj )(a) by considering each Ci (a) derivable
in ALC from the KB, rather than considering only Ci (a) in the ABox. This is
needed, for instance, to infer that john does not pay taxes from the KB containing
Professor ⊑ ∀HasChild .Student , Professor (paul), and HasChild(paul, john).
As a matter of fact, if we had in the ABox the information that john is a
TaxPayer , this would not cause an inconsistent completion of the KB. Indeed,
in such a case, Student ⊓ Worker ⊓ ∃HasChild .⊤ ⊓ TaxPayer would be the most
specific concept of which john is an instance, so that the assertion T(Student ⊓
Worker ⊓ ∃HasChild .⊤ ⊓ TaxPayer )(john) would be added in the completion of the
KB. This does not allow to infer that ¬TaxPayer (john). Hence, no inconsistency
arises. However, it could be the case that the KB obtained by the completion is
inconsistent, even if the initial KB is consistent. For instance, the KB containing
T(C) ⊑ ∀R.E, T(D) ⊑ ∀R.¬E, C(a), D(b), R(a, c), and R(b, c) is consistent,
whereas its completion, including also T(C)(a) and T(D)(b), is not. In this case,
we keep the initial KB unaltered, instead of the one obtained by the completion.
Notice that the completion of the ABox only introduces O(n) new formulas
a : T(C1 ⊓ . . . ⊓ Cj ), one for each named individual a in the ABox. Furthermore,
13
L. Giordano et al
the size of each formula T(C1 ⊓ . . . ⊓ Cj ) is O(n2 ) as C1 , . . . , Cj are all distinct
subformulas of the initial formula (O(n)), and each Ci has size O(n). Hence, after
the completion construction, the size of the KB is polynomial in n. Moreover, for
each individual a (O(n)) and for each concept C (O(n)), we have to check whether
C(a) is derivable in ALC from the KB, which is a problem in EXPTIME. Hence,
the completion construction requires exponential time and produces a KB of size
polynomial in the size of the original one:
Theorem 4.2 The problem of deciding satisfiability of the knowledge base after
completion is in NEXPTIME in the size of the original KB.
As mentioned, given a consistent KB, its completion could be inconsistent. In
this case, we choose to keep the original KB. As an alternative, we could consider
all maximal consistent KBs (extensions) that can be generated by adding, for all
individuals, the relative most-specific concept assumptions. We could then perform
either a skeptical or a credulous reasoning with respect to such extensions.
Preferential logic allows to deal with some forms of inheritance among concepts,
by the property of cautious monotonicity (which comes from the semantic property (fT − 3)): if T(C) ⊑ D and T(C) ⊑ E, then T(C ⊓ D) ⊑ E. Coming
back to the example above, if we knew that all students typically have a teacher,
i.e. T(Student) ⊑ ∃HasTeacher .⊤, and that john is a student and has a teacher
(Student (john) and ∃HasTeacher .⊤(john) are in the ABox) then, by the completion construction above, we would get T(Student ⊓ ∃HasTeacher .⊤)(john), and, by
cautious monotonicity, we would conclude that john does not pay taxes.
5
Extensions
We consider this work as a first step. We plan to extend our approach in the
following directions.
5.1
Inheritance with exceptions
Once the completion of a KB has been defined as above, the problem of inferring
the typical properties of an individual is reduced to the problem of inferring the
properties of the most specific concept to which it belongs. This can be done by
reasoning on the typical properties of concepts in the TBox. Although Preferential
Description Logic allows to capture - through cautious monotonicity - some form of
inheritance of typical properties among concepts, there are cases in which cautious
monotonicity is not strong enough to derive the intended conclusions. For instance,
if we know that jack is a student who is a sport lover, we cannot conclude that
jack is not a tax payer, as we do not have the property that typical students (or
all students) are sport lovers, and hence cautious monotonicity is not applicable.
Here we are faced with the problem of Irrelevance. Since the property of being a
sport lover is irrelevant with respect to the property of paying taxes, we would like
to infer that also T(Student ⊓ SportLover ) ⊑ ¬TaxPayer , and therefore that jack
is not a tax payer. In order to allow this form of inheritance among concepts, we
can introduce a default rule of the following type:
14
L. Giordano et al
T(Student) ⊑ ¬TaxPayer
: T(Student ⊓ SportLover ) 6⊑ TaxPayer
T(Student ⊓ SportLover ) ⊑ ¬TaxPayer
(IRR)
By the default rule above, if typical students are not tax payers, and it is consistent
to assume that typical students who are sport lovers are not tax payers, then we
could conclude that typical sport lover students are not tax payers. With this rule,
the typical properties of a more general concept C are considered one by one, and
are inherited by a more specific concept (C ⊓ D) if it is consistent to do so. In order
to deal with default rules like this one, we need to integrate our calculus with a
standard mechanism to reason about defaults.
5.2
Reasoning on the typicality of all instances
The completion of a knowledge base, as defined above, only applies to individuals
explicitly named in the ABox. However, we would like to reason on the typical
properties of all individuals. Assume, for instance, that the ABox contains the
assertions: ∃HasChild .Worker (bill) and ∀HasChild .Student (bill). Thus, bill has a
child who is a student and is working. We want to be able to infer that bill has a
child who is a tax payer. To this purpose, we need to assume that the bill’s child is
a typical working student.
To reason about the typicality of all individuals, we would need to assume that
all individuals generated during the tableau construction are typical instances of
the most specific concept to which they belong. To this purpose, we could think of
applying the completion construction of Definition 4.1 to all generated individuals.
The completion construction would be applied only when all relevant formulas y :
C1 , . . . , y : Cj with label y have already been introduced in the branch. According
to the strategy described in section 3, the completion should be performed only
after the application of the rules to all x s.t. x ≺ y.
5.3
Extension to other DLs
We want to study the extension of our approach to more expressive description
logics. For instance, we plan to extend ALCN R considered in [4] with our typicality
operator T, and consider which is the complexity corresponding to this extension.
Finally, we want to study the extension of the language of concepts by allowing
arbitrary occurrences of the operator T.
6
Conclusions
We have proposed an extension of ALC for reasoning about typicality in Description
Logic framework. The resulting logic is called ALC+T. We have proposed a calculus
for deciding the satisfiability of a general knowledge base in ALC + T. The calculus,
called T ALC+T , is analytic, terminating, and allows us to decide the satisfiability of
a knowledge base in ALC + T in nondeterministic exponential time. The calculus is
reminiscent of the tableaux calculi for KLM logics presented in [9,8]. We have then
shown how to complete the ABox by means of typicality assumptions, in order to
infer prototypical properties of the individuals explicitly mentioned in the ABox.
15
L. Giordano et al
We have argued how to apply a similar completion also to individuals implicitly
mentioned in the ABox, in order to infer their properties. Finally, we have sketched
how to reason about the inheritance of typical properties from more general to more
specific concepts handling with irrelevant information, by using appropriate default
rules.
KLM logics are related to probabilistic reasoning. A probabilistic extension of
DLs has been proposed in [11]. In particular, the notion of conditional constraint in
[11] allows typicality assertions to be expressed (with a specified probability). We
plan to compare in details this probabilistic approach to ours elsewhere.
References
[1] Baader, F. and B. Hollunder, Embedding defaults into terminological knowledge representation
formalisms, Journal of Automated Reasoning 14 (1995), pp. 149–180.
[2] Baader, F. and B. Hollunder, Priorities on defaults with prerequisites, and their application in treating
specificity in terminological default logic, Journal of Automated Reasoning 15 (1995), pp. 41–68.
[3] Bonatti, P. A., C. Lutz and F. Wolter, Description logics with circumscription., in: Proceedings of KR,
2006, pp. 400–410.
[4] Buchheit, M., F. M. Donini and A. Schaerf, Decidable reasoning in terminological knowledge
representation systems, Journal of Artificial Intelligence Research (JAIR) 1 (1993), pp. 109–138.
[5] Donini, F. M., M. Lenzerini, D. Nardi, W. Nutt and A. Schaerf, An epistemic operator for description
logics, Artificial Intelligence 100 (1998), pp. 225–274.
[6] Donini, F. M., D. Nardi and R. Rosati, Description logics of minimal knowledge and negation as failure,
ACM Transactions on Computational Logic (ToCL) 3 (2002), pp. 177–225.
[7] Eiter, T., T. Lukasiewicz, R. Schindlauer and H. Tompits, Combining answer set programming with
description logics for the semantic web, in: Proceedings of KR, 141-151, 2004.
[8] Giordano, L., V. Gliozzi, N. Olivetti and G. L. Pozzato, Analytic Tableaux Calculi for KLM Rational
Logic R, in: M. Fisher, W. van der Hoek, B. Konev and A. Lisitsa, editors, Proceedings of JELIA 2006,
LNAI 4160, pp. 190–202.
[9] Giordano, L., V. Gliozzi, N. Olivetti and G. L. Pozzato, Analytic Tableaux for KLM Preferential and
Cumulative Logics, in: G. Sutcliffe and A. Voronkov, editors, Proceedings of LPAR 2005, LNAI 3835,
pp. 666–681.
[10] Giordano, L., V. Gliozzi, N. Olivetti and G. L. Pozzato, Preferential Description Logics, in:
N. Dershowitz and A. Voronkov, editors, Proceedings of LPAR 2007, LNAI 4790, pp. 257–272.
[11] Giugno, R. and T. Lukasiewicz, P-SHOQ(D): A Probabilistic Extension of SHOQ(D) for Probabilistic
Ontologies in the Semantic Web, in: S. Flesca, S. Greco, N. Leone and G. Ianni, editors, Proceedings
of JELIA 2002, LNAI 2424, pp. 86–97.
[12] Kraus, S., D. Lehmann and M. Magidor, Nonmonotonic reasoning, preferential models and cumulative
logics, Artificial Intelligence 44 (1990), pp. 167–207.
[13] Straccia, U., Default inheritance reasoning in hybrid kl-one-style logics, in: Proceedings of IJCAI, 1993,
pp. 676–681.
16