Extension of Description Logics for Reasoning About Typicality

Nicola Olivetti

Extension of Description Logics for Reasoning About Typicality

Nicola Olivetti

2007

visibility

…

description

16 pages

link

1 file

We extend the Description Logic ALC with a “typicality” operator T that allows us to reason about the prototypical properties and inheritance with exceptions. The resulting logic is called ALC + T. The typicality operator is intended to select the “most normal” or “most typical” instances of a concept. In our framework, knowledge bases may then contain, in addition to ordinary ABoxes and TBoxes, subsumption relations of the form “T(C) is subsumed by P”, expressing that typical C-members have the property P. The semantics of a typicality operator is defined by a set of postulates that are strongly related to Kraus-Lehmann-Magidor axioms of preferential logic P. We first show that T enjoys a simple semantics provided by ordinary structures equipped by a preference relation. This allows us to obtain a modal interpretation of the typicality operator. Using such a modal interpretation, we present a tableau calculus for deciding satisfiability of ALC + T knowledge bases. Our calculus give...

M4M 2007 Extension of Description Logics for Reasoning About Typicality Laura Giordano1 ,2 ,3 Dipartimento di Informatica Università del Piemonte Orientale “A. Avogadro” Alessandria, Italy Valentina Gliozzi1 ,2 ,4 Dipartimento di Informatica Università degli Studi di Torino Torino, Italy Nicola Olivetti1 ,2 ,5 LSIS-UMR CNRS 6168 Université “Paul Cézanne” Aix en Provence, France Gian Luca Pozzato1 ,2 ,6 Dipartimento di Informatica Università degli Studi di Torino Torino, Italy Abstract We extend the Description Logic ALC with a “typicality” operator T that allows us to reason about the prototypical properties and inheritance with exceptions. The resulting logic is called ALC + T. The typicality operator is intended to select the “most normal” or “most typical” instances of a concept. In our framework, knowledge bases may then contain, in addition to ordinary ABoxes and TBoxes, subsumption relations of the form “T(C) is subsumed by P ”, expressing that typical C-members have the property P . The semantics of a typicality operator is defined by a set of postulates that are strongly related to Kraus-Lehmann-Magidor axioms of preferential logic P. We first show that T enjoys a simple semantics provided by ordinary structures equipped by a preference relation. This allows us to obtain a modal interpretation of the typicality operator. Using such a modal interpretation, we present a tableau calculus for deciding satisfiability of ALC + T knowledge bases. Our calculus gives a nondeterministic-exponential time decision procedure for satisfiability of ALC + T. We then extend ALC + T knowledge bases by a nonmonotonic completion that allows inferring defeasible properties of specific concept instances. Keywords: Description Logics, Nonmonotonic Reasoning, Preferential Logics, Tableaux Calculi. This paper is electronically published in Electronic Notes in Theoretical Computer Science URL: www.elsevier.nl/locate/entcs L. Giordano et al 1 Introduction The family of description logics (DLs) is one of the most important formalisms of knowledge representation. DLs are reminiscent of the early semantic networks and of frame-based systems. They offer two key advantages: a well-defined semantics based on first-order logic and a good trade-off between expressivity and complexity. DLs have been successfully implemented by a range of systems and they are at the base of languages for the semantic web such as OWL. A DL knowledge base (KB) comprises two components: (i) the TBox, containing the definition of concepts (and possibly roles), and a specification of inclusions relations among them, and (ii) the ABox containing instances of concepts and roles, in other words, properties and relations of individuals. Since the very objective of the TBox is to build a taxonomy of concepts, the need of representing prototypical properties and of reasoning about defeasible inheritance of such properties easily arises. The traditional approach is to handle defeasible inheritance by integrating some kind of nonmonotonic reasoning mechanism. This has led to study nonmonotonic extensions of DLs [1,2,3,5,6,7,13]. However, finding a suitable nonmonotonic extension for inheritance reasoning with exceptions is far from obvious. Let us put forward some desiderata for such an extension: (i) The (nonmonotonic) extension must have a clear semantics and should be based on the same semantics as the underlying monotonic DL. (ii) The extension should allow to specify prototypical properties in a natural and direct way. (iii) The extension must be decidable, if so is the underlying monotonic DL and, possibly, computationally effective. The nonmonotonic extensions proposed in the literature do not seem to fully satisfy all the above desiderata. [1] proposes the extension of DL with Reiter’s default logic. However, the same authors have pointed out that this integration may lead to both semantical and computational difficulties. Indeed, the unsatisfactory treatment of open defaults via Skolemization may lead to an undecidable default consequence relation. For this reason, [1] proposes a restricted semantics for open default theories, in which default rules are only applied to individuals explicitly mentioned in the ABox. Furthermore, Reiter’s default logic does not provide a direct way of modeling inheritance with exceptions. This has motivated the study of extensions of DLs with prioritized defaults [13,2]. A more general approach is undertaken in [6], where it is proposed an extension of DL with two epistemic operators. This extension allows one to encode Reiter’s default logic as well as to express epistemic concepts and procedural rules. However, 1 This paper has been also presented at the 13th Conference on Logic for Programming, Artificial Intelligence, and Reasoning (LPAR 2007) [10]. 2 This research has been partially supported by “MIUR PRIN05: Specification and verification of agent interaction protocols”. 3 Email: laura@mfn.unipmn.it 4 Email: gliozzi@di.unito.it 5 Email: nicola.olivetti@univ-cezanne.fr 6 Email: pozzato@di.unito.it 2 L. Giordano et al this extension has a rather complicated modal semantics, so that the integration with the existing systems requires significant changes to the standard semantics of DLs. In [3] the authors propose an extension of DL with circumscription. One of motivating applications of circumscription is indeed to express prototypical properties with exceptions, and this is done by introducing “abnormality” predicates, whose extension is minimized. The authors provide decidability and complexity results based on theoretical analysis. However, they do not provide a calculus for their logic. Moreover, the use of circumscription to model inheritance with exceptions is not that straightforward, as we remark below. In this work, we propose a novel approach to defeasible inheritance reasoning based on the typicality operator T. The intended meaning is that, for any concept C, T(C) singles out the instances of C that are considered as “typical” or “normal”. Thus assertions as “normally students do not pay taxes”, or “typically users do not have access to confidential files” [3] are represented by T(Student) ⊑ ¬TaxPayer and T(User ) ⊑ ¬∃hasAccess.ConfidentialFile . Before entering in the technical details, let us sketch how we intend to use the typicality operator and what kind of inferential services we expect to profit. We assume that a KB comprises, in addition to the standard TBox and ABox, a set of assertions of the type T(C) ⊑ D where D is a concept not mentioning T. The reasoning system should be able to infer or propagate prototypical properties of the concepts specified in the TBox, then to ascribe defeasible properties to individuals. For instance, let the KB contain: T(Student ) ⊑ ¬TaxPayer T(Student ⊓ Worker ) ⊑ TaxPayer T(Student ⊓ W orker ⊓ ∃HasChild .⊤) ⊑ ¬TaxPayer T(Unemployed ) ⊑ ¬TaxPayer corresponding to the assertions: normally a student does not pay taxes, normally a working student pays taxes, but normally a working student having children does not pay taxes (because he is discharged by the government) etc...Observe that, if the same properties were expressed by ordinary inclusions, such as Student ⊑ ¬TaxPayer etc...we would simply get that there are not working students and so on, thus the KB would collapse. This collapse is avoided as we do not assume that T is monotonic, that is to say C ⊑ D does not imply T(C) ⊑ T(D). To continue with the example, if the TBox contains PersonWithNoIncome ≡ Student ⊔ Unemployed , then the system should be able to infer T(PersonWithNoIncome ) ⊑ ¬TaxPayer . Suppose next that the ABox contains alternatively the following facts about john: 1. student (john) 2. student (john), worker (john) 3. student (john), worker (john), ∃HasChild .⊤(john) Then the reasoning system should be able to infer the expected (defeasible) conclusion about john in each case: 1. ¬TaxPayer (john) 2. TaxPayer (john) 3 L. Giordano et al 3. ¬TaxPayer (john) Observe that setting up a similar specification of the KB by using default logic or circumscription is not that simple: with default logic [1,2,5,6,7,13], one has to specify a priority on default application (or one has to find a smart encoding of defaults giving priority to more specific information); with circumscription [3], one has to introduce abnormality predicates, and then establish which predicates are minimized, fixed, or variable, and finally for the minimized ones what is their priority with respect to minimization (a total or a partial order). As a further step, the system should be able to infer (defeasible) properties also of individuals implicitly introduced by existential restrictions, for instance, if the ABox further contains ∃HasChild.Student (jack), it should conclude (defeasibly) ∃HasChild .¬TaxPayer (jack). Given the nonmonotonic character of the T operator, there is a difficulty with handling irrelevant information, for instance, given the KB as above, one should be able to infer as well: T(Student ⊓ SportLover ) ⊑ ¬TaxPayer T(Student ⊓ Worker ⊓ SportLover ) ⊑ TaxPayer as SportLover is irrelevant with respect to being a TaxPayer or not. For the same reason, the conclusion about john being a TaxPayer or not should not be influenced by the addition of SportLover (john) to the ABox. We refer to this problem as the problem of Irrelevance. In this paper we lay down the base of an extension of DL with a typicality operator. Our starting point is a monotonic extension of the basic ALC with the T operator. The operator is supposed to satisfy a set of postulates that are essentially a reformulation of Kraus, Lehmann, and Magidor (KLM) axioms of preferential logic, namely the assertion T(C) ⊑ P is equivalent to the conditional assertion C |∼ P of KLM preferential logic P. It turns out that the semantics of the typicality operator can be equivalently specified by considering a preference relation (a strict partial order) on individuals: the typical members of a concept C are just the most preferred individuals, or “most normal”, of C according to the preference relation. The preference relation is the only additional ingredient that we need in our semantics. We assume that “most normal” members of a concept C always exist, whenever the concept C is non-empty. This assumption corresponds to the Smoothness Condition of KLM logics, or the well-known Limit Assumption in conditional logics. Taking advantage of this semantic setting, we can give a modal interpretation to the typicality operator: the modal operator ✷ has intuitively the same properties as in Gödel-Löb modal logic G of arithmetic provability. We then define a tableau system for this extension based on this modal interpretation, thereby obtaining a decision procedure and an upper complexity bound (NEXPTIME). These are the main results of the paper, however the monotonic extension is not enough to perform inheritance reasoning of the kind described above: (i) we need a way of inferring defeasible properties of individuals, (ii) we need a way of handling Irrelevance. In the paper we deal with (i) by defining a completion of an ABox: the idea is that each individual is assumed to be a typical member of the most specific concept 4 L. Giordano et al to which it belongs. Such a completion allows to perform inferences as the ones 1.,2.,3. above. The paper outlines how we intend to cope with typicality of all instances and with Irrelevance. In particular, dealing with Irrelevance (ii) requires a nonmonotonic mechanism. The idea is to complete a KB with a set of default rules. The default rules are not used to express defeasible properties of concepts (as in default extension of DLs), but to propagate defeasible properties of a concept to its subsumed concepts, e.g. to infer T(Student ⊓ SportLover ) ⊑ ¬TaxPayer from T(Student ) ⊑ ¬TaxPayer . Thus in our approach the nonmonotonic mechanism is only needed to handle Irrelevance, whose treatment by means of default rules will be the object of future work. 2 The logic ALC + T: the typicality operator T We consider an alphabet of concept names C, of role names R, and of individuals O. The language L of the logic ALC + T is defined by distinguishing concepts and extended concepts as follows: (Concepts) A ∈ C and ⊤ are concepts of L; if C, D ∈ L and R ∈ R, then C ⊓ D, C ⊔ D, ¬C, ∀R.C, ∃R.C are concepts of L. (Extended concepts) if C is a concept, then C and T(C) are extended concepts, and all the boolean combinations of extended concepts are extended concepts of L. A knowledge base is a pair (TBox,ABox). TBox contains subsumptions C ⊑ D, where C ∈ L is either a concept or an extended concept T(C ′ ), and D ∈ L is a concept. ABox contains expressions of the form C(a) and aRb where C ∈ L is an extended concept, R ∈ R, and a, b ∈ O. In order to provide a semantics to the operator T, we extend the definition of a model used in “standard” terminological logic ALC: Definition 2.1 [Semantics of T with selection function] A model is any structure h∆, I, fT i, where: ∆ is the domain; I is the extension function that maps each extended concept C to C I ⊆ ∆, and each role R to a RI ⊆ ∆I × ∆I . I is defined in the usual way (as for ALC) and, in addition, (T(C))I = fT (C I ). fT : P ow(∆) → P ow(∆) is a function satisfying the following properties: (fT − 2) if S "= ∅, then also fT (S) "= ∅ ! ! (fT − 4) fT ( Si ) ⊆ fT (Si ) (fT − 3) if fT (S) ⊆ R, then fT (S) = fT (S ∩ R) ! " (fT − 5) fT (Si ) ⊆ fT ( Si ) (fT − 1) fT (S) ⊆ S Intuitively, given the extension of some concept C, fT selects the typical instances of C. (fT − 1) requests that typical elements of S belong to S. (fT − 2) requests that if there are elements in S, then there are also typical such elements. The next properties constraint the behavior of fT with respect to ∩ and ∪ in such a way that they do not entail monotonicity. According to (fT − 3), if the typical elements of S are in R, then they coincide with the typical elements of S ∩ R, thus expressing a weak form of monotonicity (namely cautious monotonicity). (fT − 4) corresponds S S to one direction of the equivalence fT ( Si ) = fT (Si ), so that it does not entail T T monotonicity. Similar considerations apply to the equation fT ( Si ) = fT (Si ), T T of which only the inclusion fT (Si ) ⊆ fT ( Si ) is derivable. (fT − 5) is a further constraint on the behavior of fT with respect to arbitrary unions and intersections; 5 L. Giordano et al it would be derivable if fT were monotonic. We can prove the following proposition: Proposition 2.2 fT (S ∪ R) ∩ S ⊆ fT (S) We can give an alternative semantics for T based on a preference relation. The idea is that there is a global preference relation among individuals and that the typical members of a concept C, i.e. selected by fT (C I ), are the minimal elements of C with respect to this preference relation. Observe that this notion is global, that is to say, it does not compare individuals with respect to a specific concept (something like y is more typical than x with respect to concept C). In this framework, an object x ∈ ∆ is a typical instance of some concept C, if x ∈ C I and there is no Celement in ∆ more typical than x. The typicality preference relation is partial since it is not always possible to establish which object is more typical than which other. The following definition is needed before we provide the Representation Theorem. Definition 2.3 Given a relation <, which is a strict partial order (i.e. an irreflexive and transitive relation) over a domain ∆, for all S ⊆ ∆, we define M in< (S) = {x : x ∈ S and 6 ∃y ∈ S s.t. y < x}. We say that < satisfies the Smoothness Condition iff for all S ⊆ ∆, for all x ∈ S, either x ∈ M in< (S) or ∃y ∈ M in< (S) such that y < x. We are now ready to prove the Representation Theorem below, showing that given a model with a selection function, we can define on the same domain a preference relation < such that, for all S ⊆ ∆, fT (S) = M in< (S). Notice that, as a difference with respect to related results (Theorem 3 in [12]), the relation is defined on the same domain ∆ of fT . On the other hand, if < is a preference relation satisfying the Smoothness Condition, then the operator defined as fT (S) = M in< (S) satisfies the postulates of Definition 2.1. Theorem 2.4 (Representation Theorem) Given any model h∆, I, fT i, fT satisfies postulates (fT − 1) to (fT − 5) above iff it is possible to define on ∆ a strict partial order <, satisfying the Smoothness Condition, such that for all S ⊆ ∆, fT (S) = M in< (S). Proof. (“Only if ” direction) Given fT satisfying postulates (fT − 1) to (fT − 5), we define < as follows: for all x, y ∈ ∆, we let x < y if ∀S ⊆ ∆, if y ∈ fT (S) then x 6∈ S, and ∃R ⊆ ∆ such that S ⊂ R and x ∈ fT (R). We prove that: (i) < is irreflexive. Easily follows by the definition of <. (ii) < is transitive. Let (a) x < y and (b) y < z. Let z ∈ fT (S) for some S, then by definition of <, y 6∈ S, and ∃R s.t. S ⊂ R and y ∈ fT (R). Furthermore, x 6∈ R and ∃Q : R ⊂ Q and x ∈ fT (Q). From this we can conclude that x 6∈ S (otherwise x ∈ R), and S ⊂ Q, hence x < z. (iii) fT (S) ⊆ M in< (S). Let x ∈ fT (S). Suppose x 6∈ M in< (S), i.e. for some y ∈ S, y < x. By definition of <, y 6∈ S, contradiction, hence x ∈ M in< (S). (iv) M in< (S) ⊆ fT (S). Let x ∈ M in< (S). Then x ∈ S, i.e. S 6= ∅. By (fT − 2), S fT (S) 6= ∅. Suppose x 6∈ fT (S). Consider Ri for all Ri ⊆ ∆ s.t. x ∈ fT (Ri ). S By (fT − 5), we have x ∈ fT ( Ri ). S S S Consider now fT ( Ri ∪ S). We can easily show that fT ( Ri ∪ S) 6⊆ Ri 6 L. Giordano et al S S (otherwise, by (fT − 3) fT ( Ri ∪ S) = fT ( Ri ), and by Proposition 2.2, S S fT ( Ri ) ∩ S ⊆ fT (S), which contradicts the fact that x ∈ fT ( Ri ) but S S x 6∈ fT (S)). Consider hence y ∈ fT ( Ri ∪ S) s.t. y 6∈ Ri . By definition of S S <, y < x. Furthermore, by (fT − 1) y ∈ S (since y ∈ Ri ∪ S and y 6∈ Ri ). It follows that x 6∈ M in< (S), contradiction, hence M in< (S) ⊆ fT (S). (v) < satisfies the Smoothness Condition. Let S 6= ∅ and x ∈ S. If x ∈ fT (S) then by point 3 we have x ∈ M in< (S). If x 6∈ fT (S) we can reason as for point 4 S S to conclude that there is y ∈ fT ( Ri ∪ S) s.t. y 6∈ Ri (hence y ∈ S), and y < x. By Proposition 2.2, we have y ∈ fT (S), hence by point 3 we conclude y ∈ M in< (S). The points above allow us to conclude. (“If ” direction) Given a strict partial order < satisfying the Smoothness Condition, we can define fT : P ow(∆) → P ow(∆) by letting fT (S) = M in< (S). It can be easily shown that fT satisfies postulates (fT − 1) to (fT − 5). The proof is left to the reader. ✷ Having the above Representation System, from now on, we will refer to the following semantics for ALC + T: Definition 2.5 [Semantics of ALC + T] A model M is any structure h∆, <, Ii, where ∆ and I are defined as in Definition 2.1, and < is a strict partial order over ∆ satisfying the Smoothness Condition (see Definition 2.3 above). As a difference with respect to Definition 2.1, the semantics of the T operator is: (T(C))I = M in< (C I ). For concepts (built from operators of ALC), C I is defined in the usual way. Definition 2.6 [Model satisfying a Knowledge Base] Consider a model M, as defined in Definition 2.5. We extend I so that it assigns to each individual a of O an element aI of the domain ∆. Given a KB (TBox,ABox), we say that: • M satisfies TBox if for all inclusions C ⊑ D in TBox, and all elements x ∈ ∆, if x ∈ C I then x ∈ D I . • M satisfies ABox if: (i) for all C(a) in ABox, we have that aI ∈ C I , (ii) for all aRb in ABox, we have that (aI , bI ) ∈ RI . M satisfies a knowledge base if it satisfies both its TBox and its ABox. Notice that the meaning of T can be split into two parts: for any object x of the domain ∆, x ∈ (T(C))I just in case (i) x ∈ C I , and (ii) there is no y ∈ C I such that y < x. In order to isolate the second part of the meaning of T (for the purpose of the calculus that we will present in section 3) we introduce a new modality whose interpretation in M is defined as follows. Definition 2.7 (C)I = {x ∈ ∆ | for every y ∈ ∆, if y < x then y ∈ C I } The basic idea is simply to interpret the preference relation < as an accessibility relation. By the Smoothness Condition, it turns out that the modality has the properties of Gödel-Löb modal logic of provability G. The Smoothness Condition ensures that typical elements of C I exist whenever C I 6= ∅, by preventing infinitely 7 L. Giordano et al descending chains of elements. This condition therefore corresponds to the finitechain condition on the accessibility relation (as in G). A similar correspondence has been presented in [9,8] to interpret the preference relation in KLM logics. The following relation between T and holds: Proposition 2.8 For all x ∈ ∆, we have x ∈ (T(C))I iff x ∈ C I and x ∈ (¬C)I Since we only use to capture the meaning of T, in the following we will always use followed by a negated concept, as in ¬C. We can establish the following equation between our typicality operator and the nonmonotonic (conditional) inference operator |∼ (describing what can be typically derived from a given premise) by letting C |∼ D iff T(C) ⊑ D. It can be easily shown that there is a correspondence between the properties of T and the properties of |∼ in the system P described in [12]. 3 Reasoning In this section we present a tableau calculus for deciding the satisfiability of a knowledge base. Given a KB (TBox,ABox), any concrete reasoning system should provide the usual reasoning services, namely satisfiability of the KB, concept satisfiability, subsumption, and instance checking. It is well known that the latter three services are reducible to the satisfiability of a KB. We introduce a labelled tableau calculus for our logic ALC + T, which enriches the labelled tableau calculus for ALC presented in [4]. The calculus is called T ALC+T and it is based on the notion of constraint system. We consider a set of variables drawn from a denumerable set V. T ALC+T makes use of labels, which are denoted with x, y, z, . . .. Labels represent objects. An object is either a variable or an individual of the ABox, that is to say an element of O ∪ V. R A constraint is a syntactic entity of the form x −→ y or x : C, where R is a role and C is either an extended concept or has the form ¬D or ¬¬D, where D is a concept. As we will define in Definition 3.1, the ABox of an ALC+T-knowledge base can be translated into a set of constraints by replacing every membership assertion R C(a) with the constraint a : C and every role aRb with the constraint a −→ b. A tableau is a tree whose nodes are pairs hS | U i, where: R • S contains constraints (or labelled formulas) of the form x : C or x −→ y; • U contains formulas of the form C ⊑ DL , representing subsumption relations C ⊑ D of the TBox. L is a list of labels. As we will discuss later, this list is used in order to ensure the termination of the tableau calculus. A node hS | U i is also called a constraint system. A branch is a sequence of nodes hS1 | U1 i, hS2 | U2 i, . . . , hSn | Un i . . ., where each node hSi | Ui i is obtained by its immediate predecessor hSi−1 | Ui−1 i by applying a rule of T ALC+T , having hSi−1 | Ui−1 i as the premise and hSi | Ui i as one of its conclusions. A branch is closed if one of its nodes is an instance of (Clash), otherwise it is open. We say that a tableau is closed if all its branches are closed. Given a KB, we define its corresponding constraint system as follows: 8 L. Giordano et al Definition 3.1 [Corresponding constraint system] Given an ALC + T-knowledge base (TBox,ABox), we define its corresponding constraint system hS | U i as follows: R S = {a : C | C(a) ∈ ABox} ∪ {a −→ b | aRb ∈ ABox} and U = {C ⊑ D ∅ | C ⊑ D ∈ T Box}. Definition 3.2 [Model satisfying a constraint system] Let M be a model as defined in Definition 2.6. We define a function α which assigns to each variable of V an element of ∆, and assigns every individual a ∈ O to aI ∈ ∆. M satisfies x : C R under α if α(x) ∈ C I and x −→ y under α if (α(x), α(y)) ∈ RI . A constraint system hS | U i is satisfiable if there is a model M and a function α such that M satisfies under α every constraint in S and that, for all C ⊑ D ∈ U and for all x occurring in S, we have that if α(x) ∈ C I then α(x) ∈ D I . It can be easily shown that: Proposition 3.3 Given an ALC + T-knowledge base, it is satisfiable if and only if its corresponding constraint system is satisfiable. Therefore, in order to check the satisfiability of (TBox,ABox), we build its corresponding constraint system hS | U i, and then we use T ALC+T to check the satisfiability of hS | U i. In order to check a constraint system hS | U i for satisfiability, our calculus T ALC+T adopts the usual technique of applying the rules until either a contradiction is generated (Clash) or a model satisfying hS | U i can be obtained from the resulting constraint system. In order to take into account the TBox, we use a technique of unfolding, similar to the one described in [4]. Given a node hS | U i, for each subsumption C ⊑ D L ∈ U and for each label x that appears in the tableau, we add to S the constraint x : ¬C ⊔ D. As mentioned above, each formula C ⊑ D is equipped by the list L of labels in which it has been unfolded in the current branch. This is needed in order to avoid multiple unfolding of the same subsumption by using the same label, generating non-termination in a proof search. Before introducing the rules of T ALC+T we need some more definitions. First, as in [4], we assume that labels are introduced in a tableau according to an ordering ≺, that is to say if y is introduced in the tableau, then x ≺ y for all labels x that are already in the tableau. Given a tableau node hS | U i and an object x, we define σ(hS | U i, x) = {C | x : C ∈ S}. Furthermore, we say that two labels x and y are S-equivalent, written x ≡S y, if they label the same set of concepts, i.e. σ(hS | U i, x) = σ(hS | U i, y). Intuitively, S-equivalent labels can represent the same element in the model built M = {y : C, y : ¬C | x : ¬C ∈ S}. by the rules of T ALC+T . Last, we define Sx→y ALC+T The rules of T are presented in Figure 1. Rules (∃+ ), (∀− ), and (− ) are called dynamic since they introduce a new variable in their conclusions. The other rules are called static. We do not need any extra rule for the positive occurrences of M . The the operator, since these are taken into account by the computation of Sx→y side conditions on the rules (∃+ ) and (∀− ) are introduced in order to ensure a terminating proof search, by implementing the standard blocking technique described below. The rules of T ALC+T are applied with the following standard strategy: 1. apply a rule to a variable x ∈ V only if no rule is applicable to a variable y ∈ V such 9 L. Giordano et al !S, x : ¬¬C | U " !S, x : C, x : ¬C | U " (Clash) !S, x : T(C) | U " !S, x : C, x : !¬C | U " !S, x : C | U " !S, x : ¬T(C) | U " (T+ ) !S, x : ¬C | U " R !S, x : ∀R.C, x −→ y | U % !S, x : ∀R.C, x −→ y, y : C | U % !S, x : ¬∀R.C | U # (T− ) !S, x : ∃R.C | U # (∃+ ) y new R !S, x : ∃R.C, x −→ y, y : C | U % if ! ∃z ≺ x s.t. z ≡S,x:∃R.C x and R (∀+ ) R !S, x : ¬!¬C | U " (¬) if y : C !∈ S ! ∃u s.t. x −→ u ∈ S and u : C ∈ S R !S, x : ¬∃R.C, x −→ y | U % (∀− ) y new !S, x : ¬∀R.C, x −→ y, y : ¬C | U % if ! ∃z ≺ x s.t. z ≡S,x:¬∀R.C x and R ! ∃u s.t. x −→ u ∈ S and u : ¬C ∈ S R R (∃− ) !S, x : ¬∃R.C, x −→ y, y : ¬C | U % if y : ¬C !∈ S L !S, x : ¬!¬C | U " M !S, y : C, y : !¬C, Sx→y | U" !S | U, C ⊑ D # (!− ) !S, x : ¬C ⊔ D | U, C ⊑ DL,x $ y new (Unfold) if x occurs in S and x !∈ L Fig. 1. The calculus T ALC+T . To save space, we omit the standard rules for ⊔ and ⊓. that y ≺ x; 2. apply dynamic rules ((− ) first) only if no static rule is applicable. This strategy ensures that the variables are considered one at a time according to the ordering ≺. Consider an application of a dynamic rule to a variable x of a constraint system hS | U i. For all hS ′ | U ′ i obtained from hS | U i by a sequence of rule applications, it can be easily shown that (i) no rule can be applied in hS ′ | U ′ i to a variable y s.t. y ≺ x and (ii) σ(hS | U i, x) = σ(hS ′ | U ′ i, x). The calculus so obtained is sound and complete with respect to the semantics described in Definition 3.2 (we omit the proof to save space): Theorem 3.4 (Soundness and Completeness of T ALC+T ) Given a constraint system hS | U i, it is unsatisfiable iff it has a closed tableau. An example of derivation in T ALC+T in presented in Figure 2. !x : T(A), x : S | T(A) ⊑ T ∅ , T(S) ⊑ A∅ , T(S) ⊑ ¬T ∅ # !x : A, x : !¬A, x : S | . . ." !x : ¬T(A) ⊔ T, x : A, x : !¬A, x : S | T(A) ⊑ T !x : ¬T(A), x : A, x : !¬A, x : S | . . ." !x : ¬A, x : A, . . . | . . ." (Clash) !x : ¬!¬A, x : !¬A, . . . | . . ." (Clash) {x} , . . .$ (T+ ) (Unfold) (⊔+ ) !x : T, x : A, x : !¬A, x : S | . . ." (Unfold) !x : ¬T(S) ⊔ ¬T, x : T, x : !¬A, x : S, . . . | T(S) ⊑ ¬T {x} , . . .$ (T− ) !x : ¬T(S), x : !¬A, x : S, . . . | . . ." (T− ) !x : ¬!¬S, x : !¬A, . . . | . . ." (!− ) !y : S, y : !¬S, y : ¬A, . . . | . . ." !x : ¬S, x : S, . . . | . . ." (Clash) !x : ¬T, x : T, . . . | . . ." (Clash) (⊔+ ) (Unfold) !y : ¬T(S) ⊔ A, y : S, y : !¬S, y : ¬A, . . . | T(S) ⊑ A{y} , . . .$ (⊔+ ) !y : ¬A, y : A, . . . | . . ." − (T ) (Clash) !y : ¬!¬S, y : !¬S, . . . | . . ." (Clash) !y : ¬T(S), y : S, y : !¬S, . . . | . . ." !y : ¬S, y : S, . . . | . . ." (Clash) Fig. 2. A closed tableau showing that T(Adult ) ⊑ ¬Student can be inferred from {T(Adult ) ⊑ TaxPayer , T(Student) ⊑ Adult , T(Student) ⊑ ¬TaxPayer }. It is shown that the KB whose TBox contains the above subsumptions, and whose ABox contains an x which is both a typical Adult and a Student, is unsatisfiable. To save space, we use A for Adult , T for TaxPayer , and S for Student. Let us conclude this section by analyzing termination and complexity of T ALC+T . In general, non-termination in labelled tableau calculi can be caused by two different reasons: 1. some rules copy their principal formula in the conclusion(s), and can thus be reapplied over the same formula without any control; 2. dynamic rules may generate infinitely-many labels, creating infinite branches. 10 L. Giordano et al Concerning the first source of non-termination (point 1), the only rules copying their principal formulas in their conclusions are (∀+ ), (∃− ), (Unfold), (∀− ), and (∃+ ). However, the side conditions on these rules avoid multiple applications on the same formula. Indeed, (Unfold) can be applied to a constraint system hS | U, C ⊑ DL i by using the label x only if it has not yet been applied to x in the current branch (i.e. x does not belong to L). Concerning (∀+ ), the rule can be applied R to hS, x : ∀R.C, x −→ y | U i only if y : C does not belong to S. When y : C is introduced in the branch, the rule will not further apply to x : ∀R.C. The same for (∃− ), (∃+ ), and (∀− ). Concerning the second source of non-termination (point 2), we can prove that we only need to adopt the standard loop-checking machinery known as blocking, which ensures that the rules (∃+ ) and (∀− ) do not introduce infinitely-many labels on a branch. Thanks to the properties of , no other additional machinery is required to ensure termination. Indeed, we can show that the interplay between rules (T− ) and (− ) does not generate branches containing infinitely-many labels. Let us discuss the termination in more detail. Without the side conditions on the rules (∃+ ) and (∀− ), the calculus T ALC+T does not ensure a terminating proof search. Indeed, given a constraint system hS | U i, it could be the case that (∃+ ) is applied to a constraint x : ∃R.C ∈ S, R introducing a new label y and the constraints x −→ y and y : C. If an inclusion T(∃R.C) ⊑ D belongs to U , then (Unfold) can be applied by using y, thus generating a branch containing y : ¬T(∃R.C), to which (T− ) can be applied introducing y : ¬¬(∃R.C). An application of (− ) introduces a new variable z and the constraint z : ∃R.C, to which (∃+ ) can be applied generating a new label u. (Unfold) can then be re-applied on T(∃R.C) ⊑ D by using u, incurring a loop. In order to prevent this source of non termination, we adopt the standard technique of blocking: the side condition of the (∃+ ) rule says that this rule can be applied to a node hS, x : ∃R.C | U i only if there is no z occurring in S such that z ≺ x and z ≡S,x:∃R.C x. In other words, if there is an “older” label z which is equivalent to x with respect to S, x : ∃R.C, then (∃+ ) is not applicable, since the condition and the strategy imply that the (∃+ ) rule has already been applied to z. In this case, we say that x is blocked by z. The same for (∀− ). As mentioned, another possible source of infinite branches could be determined by the interplay between rules (T− ) and (− ). This cannot occur, i.e. the interplay between these two rules does not generate branches containing infinitely-many labels. Intuitively, the application of (− ) to x : ¬¬C adds y : ¬C to the conclusion, so that (T− ) can no longer consistently introduce y : ¬¬C. This is due to the properties of (no infinite descending chains of < are allowed). More in detail, if (Unfold) is applied to T(C) ⊑ D by using x, an application of (T− ) introduces a branch containing x : ¬¬C; when a new label y is generated by an application of (− ) on x : ¬¬C, we have that y : ¬C is added to the current constraint system. If (Unfold) and (T− ) are also applied to T(C) ⊑ D on the new label y, then the conclusion where y : ¬¬C is introduced is closed, by the presence of y : ¬C. By this fact, we do not need to introduce any loop-checking machinery on the application of (− ). The following Theorem holds: 11 L. Giordano et al Theorem 3.5 (Termination of T ALC+T ) Let hS | U i be a constraint system, then any tableau generated by T ALC+T is finite. Since the calculus T ALC+T is sound and complete (Theorem 3.4), and since an ALC + T-knowledge base is satisfiable iff its corresponding constraint system is satisfiable (Proposition 3.3), from Theorem 3.5 above it immediately follows that: Theorem 3.6 (Decidability) Checking whether a given ALC + T-knowledge base (TBox,ABox) is satisfiable is a decidable problem. Let us conclude this section with a complexity analysis of the calculus T ALC+T : Theorem 3.7 (Complexity) Given an ALC + T-knowledge base (TBox,ABox), the problem of checking whether it is satisfiable can be solved in nondeterministic exponential time. Proof. We first show that the number of labels generated on a branch is at most exponential in the size of KB. Let n be the size of a KB. Given a constraint system hS | U i, the number of extended concepts appearing in hS | U i, including also all the ones appearing as a subformula of other concepts, is O(n). As there are at most O(n) concepts, there are at most O(2n ) variables labelling distinct sets of concepts. Hence, there are O(2n ) non-blocked variables in S. Let m be the maximum number of direct successors of each variable x ∈ S, obtained by applying dynamic rules. m is bound by the number of ∃R.C concepts (O(n)) plus the number of ¬∀R.C concepts (O(n)) plus the number of ¬¬C concepts (O(n)). Therefore, there are at most O(2n × m) variables in S, where m ≤ 3n. The number of individuals in the ABox is bound by n too, and each individual has at most m direct successors. The number of labels in S is then bound by O((2n + n) × m), and hence by O(22n ). For a given label x, the concepts labelled by x introduced in the branch (namely, all the possible subconcepts of the initial constraint system, as well as all boxed subconcepts) are O(n). According to the standard strategy, after all static rules have been applied to a label x in phase 1, no other concepts labelled by x can be introduced later on a branch. Hence, the labelled concepts introduced on the branch is O(n) for each label, and the number of all labelled concepts on the branch is O(n × 22n ). Therefore, a branch can contain at most an exponential number of applications of tableau rules. The satisfiability of a KB can thus be solved by defining a procedure which nondeterministically generates an open branch of exponential size (in the size of KB). The problem is in NEXPTIME. ✷ Although we have shown that the satisfiability is in NEXPTIME, in future research we will further consider whether this bound is optimal or not. 4 Reasoning about Typicality Logic ALC + T allows one to reason monotonically about typicality. In ALC + T we can consistently express, for instance, the fact that three different concepts, as 12 L. Giordano et al student, working student and working student with children, have a different status as taxpayers. What about the typical properties of an individual john that we know being a working student, and having children? Of course, if we know that john is a typical instance of the concept Student ⊓ Worker ⊓ ∃HasChild .⊤, i.e. if the ABox contains the assertion (∗) T(Student ⊓ Working ⊓ ∃HasChild .⊤)(john), then, in ALC + T, we can conclude that ¬TaxPayer (john). However, in absence of (*), we cannot derive ¬TaxPayer (john). In general, we would like to infer that individuals have the properties which are typical of the most specific concept to which they belong. To this purpose, we define a completion of the knowledge base which adds to the ABox, for each individual a occurring in the ABox, the assertion that a is a typical instance of the most specific concept C to which it belongs. Although in general ABoxes can contain typicality assertions about individuals, in practice we assume that typicality assertions are automatically generated by the system by means of the completion, and are not inserted by the user. From now on, we therefore assume that the initial ABox of a KB does not contain any typicality assertion. Definition 4.1 [Completion of a Knowledge Base] The KB (TBox,ABox’) is the completion of the KB (TBox,ABox), if ABox’ is obtained from ABox by adding to it, for all individual names a in the ABox, the assertion T(C1 ⊓ . . . ⊓ Cj )(a), where C1 , . . . , Cj are all the concepts Ci such that: (1) Ci is a subconcept of any concept occurring in (TBox,ABox); (2) Ci does not contain T; (3) a is an instance of Ci , i.e. Ci (a) is derivable in ALC from (TBox,ABox). For instance, assuming that Student(john), Worker (john) and ∃HasChild .⊤(john) are the only assertions concerning john derivable from the KB, the completion above would add T(Student ⊓ Worker ⊓ ∃HasChild .⊤)(john) to the ABox, as Student ⊓ Worker ⊓ ∃HasChild .⊤ is the most specific concept of which john is an instance. From this, we can conclude in ALC + T that john does not pay taxes. The completion adds T(C1 ⊓ . . . ⊓ Cj )(a) by considering each Ci (a) derivable in ALC from the KB, rather than considering only Ci (a) in the ABox. This is needed, for instance, to infer that john does not pay taxes from the KB containing Professor ⊑ ∀HasChild .Student , Professor (paul), and HasChild(paul, john). As a matter of fact, if we had in the ABox the information that john is a TaxPayer , this would not cause an inconsistent completion of the KB. Indeed, in such a case, Student ⊓ Worker ⊓ ∃HasChild .⊤ ⊓ TaxPayer would be the most specific concept of which john is an instance, so that the assertion T(Student ⊓ Worker ⊓ ∃HasChild .⊤ ⊓ TaxPayer )(john) would be added in the completion of the KB. This does not allow to infer that ¬TaxPayer (john). Hence, no inconsistency arises. However, it could be the case that the KB obtained by the completion is inconsistent, even if the initial KB is consistent. For instance, the KB containing T(C) ⊑ ∀R.E, T(D) ⊑ ∀R.¬E, C(a), D(b), R(a, c), and R(b, c) is consistent, whereas its completion, including also T(C)(a) and T(D)(b), is not. In this case, we keep the initial KB unaltered, instead of the one obtained by the completion. Notice that the completion of the ABox only introduces O(n) new formulas a : T(C1 ⊓ . . . ⊓ Cj ), one for each named individual a in the ABox. Furthermore, 13 L. Giordano et al the size of each formula T(C1 ⊓ . . . ⊓ Cj ) is O(n2 ) as C1 , . . . , Cj are all distinct subformulas of the initial formula (O(n)), and each Ci has size O(n). Hence, after the completion construction, the size of the KB is polynomial in n. Moreover, for each individual a (O(n)) and for each concept C (O(n)), we have to check whether C(a) is derivable in ALC from the KB, which is a problem in EXPTIME. Hence, the completion construction requires exponential time and produces a KB of size polynomial in the size of the original one: Theorem 4.2 The problem of deciding satisfiability of the knowledge base after completion is in NEXPTIME in the size of the original KB. As mentioned, given a consistent KB, its completion could be inconsistent. In this case, we choose to keep the original KB. As an alternative, we could consider all maximal consistent KBs (extensions) that can be generated by adding, for all individuals, the relative most-specific concept assumptions. We could then perform either a skeptical or a credulous reasoning with respect to such extensions. Preferential logic allows to deal with some forms of inheritance among concepts, by the property of cautious monotonicity (which comes from the semantic property (fT − 3)): if T(C) ⊑ D and T(C) ⊑ E, then T(C ⊓ D) ⊑ E. Coming back to the example above, if we knew that all students typically have a teacher, i.e. T(Student) ⊑ ∃HasTeacher .⊤, and that john is a student and has a teacher (Student (john) and ∃HasTeacher .⊤(john) are in the ABox) then, by the completion construction above, we would get T(Student ⊓ ∃HasTeacher .⊤)(john), and, by cautious monotonicity, we would conclude that john does not pay taxes. 5 Extensions We consider this work as a first step. We plan to extend our approach in the following directions. 5.1 Inheritance with exceptions Once the completion of a KB has been defined as above, the problem of inferring the typical properties of an individual is reduced to the problem of inferring the properties of the most specific concept to which it belongs. This can be done by reasoning on the typical properties of concepts in the TBox. Although Preferential Description Logic allows to capture - through cautious monotonicity - some form of inheritance of typical properties among concepts, there are cases in which cautious monotonicity is not strong enough to derive the intended conclusions. For instance, if we know that jack is a student who is a sport lover, we cannot conclude that jack is not a tax payer, as we do not have the property that typical students (or all students) are sport lovers, and hence cautious monotonicity is not applicable. Here we are faced with the problem of Irrelevance. Since the property of being a sport lover is irrelevant with respect to the property of paying taxes, we would like to infer that also T(Student ⊓ SportLover ) ⊑ ¬TaxPayer , and therefore that jack is not a tax payer. In order to allow this form of inheritance among concepts, we can introduce a default rule of the following type: 14 L. Giordano et al T(Student) ⊑ ¬TaxPayer : T(Student ⊓ SportLover ) 6⊑ TaxPayer T(Student ⊓ SportLover ) ⊑ ¬TaxPayer (IRR) By the default rule above, if typical students are not tax payers, and it is consistent to assume that typical students who are sport lovers are not tax payers, then we could conclude that typical sport lover students are not tax payers. With this rule, the typical properties of a more general concept C are considered one by one, and are inherited by a more specific concept (C ⊓ D) if it is consistent to do so. In order to deal with default rules like this one, we need to integrate our calculus with a standard mechanism to reason about defaults. 5.2 Reasoning on the typicality of all instances The completion of a knowledge base, as defined above, only applies to individuals explicitly named in the ABox. However, we would like to reason on the typical properties of all individuals. Assume, for instance, that the ABox contains the assertions: ∃HasChild .Worker (bill) and ∀HasChild .Student (bill). Thus, bill has a child who is a student and is working. We want to be able to infer that bill has a child who is a tax payer. To this purpose, we need to assume that the bill’s child is a typical working student. To reason about the typicality of all individuals, we would need to assume that all individuals generated during the tableau construction are typical instances of the most specific concept to which they belong. To this purpose, we could think of applying the completion construction of Definition 4.1 to all generated individuals. The completion construction would be applied only when all relevant formulas y : C1 , . . . , y : Cj with label y have already been introduced in the branch. According to the strategy described in section 3, the completion should be performed only after the application of the rules to all x s.t. x ≺ y. 5.3 Extension to other DLs We want to study the extension of our approach to more expressive description logics. For instance, we plan to extend ALCN R considered in [4] with our typicality operator T, and consider which is the complexity corresponding to this extension. Finally, we want to study the extension of the language of concepts by allowing arbitrary occurrences of the operator T. 6 Conclusions We have proposed an extension of ALC for reasoning about typicality in Description Logic framework. The resulting logic is called ALC+T. We have proposed a calculus for deciding the satisfiability of a general knowledge base in ALC + T. The calculus, called T ALC+T , is analytic, terminating, and allows us to decide the satisfiability of a knowledge base in ALC + T in nondeterministic exponential time. The calculus is reminiscent of the tableaux calculi for KLM logics presented in [9,8]. We have then shown how to complete the ABox by means of typicality assumptions, in order to infer prototypical properties of the individuals explicitly mentioned in the ABox. 15 L. Giordano et al We have argued how to apply a similar completion also to individuals implicitly mentioned in the ABox, in order to infer their properties. Finally, we have sketched how to reason about the inheritance of typical properties from more general to more specific concepts handling with irrelevant information, by using appropriate default rules. KLM logics are related to probabilistic reasoning. A probabilistic extension of DLs has been proposed in [11]. In particular, the notion of conditional constraint in [11] allows typicality assertions to be expressed (with a specified probability). We plan to compare in details this probabilistic approach to ours elsewhere. References [1] Baader, F. and B. Hollunder, Embedding defaults into terminological knowledge representation formalisms, Journal of Automated Reasoning 14 (1995), pp. 149–180. [2] Baader, F. and B. Hollunder, Priorities on defaults with prerequisites, and their application in treating specificity in terminological default logic, Journal of Automated Reasoning 15 (1995), pp. 41–68. [3] Bonatti, P. A., C. Lutz and F. Wolter, Description logics with circumscription., in: Proceedings of KR, 2006, pp. 400–410. [4] Buchheit, M., F. M. Donini and A. Schaerf, Decidable reasoning in terminological knowledge representation systems, Journal of Artificial Intelligence Research (JAIR) 1 (1993), pp. 109–138. [5] Donini, F. M., M. Lenzerini, D. Nardi, W. Nutt and A. Schaerf, An epistemic operator for description logics, Artificial Intelligence 100 (1998), pp. 225–274. [6] Donini, F. M., D. Nardi and R. Rosati, Description logics of minimal knowledge and negation as failure, ACM Transactions on Computational Logic (ToCL) 3 (2002), pp. 177–225. [7] Eiter, T., T. Lukasiewicz, R. Schindlauer and H. Tompits, Combining answer set programming with description logics for the semantic web, in: Proceedings of KR, 141-151, 2004. [8] Giordano, L., V. Gliozzi, N. Olivetti and G. L. Pozzato, Analytic Tableaux Calculi for KLM Rational Logic R, in: M. Fisher, W. van der Hoek, B. Konev and A. Lisitsa, editors, Proceedings of JELIA 2006, LNAI 4160, pp. 190–202. [9] Giordano, L., V. Gliozzi, N. Olivetti and G. L. Pozzato, Analytic Tableaux for KLM Preferential and Cumulative Logics, in: G. Sutcliffe and A. Voronkov, editors, Proceedings of LPAR 2005, LNAI 3835, pp. 666–681. [10] Giordano, L., V. Gliozzi, N. Olivetti and G. L. Pozzato, Preferential Description Logics, in: N. Dershowitz and A. Voronkov, editors, Proceedings of LPAR 2007, LNAI 4790, pp. 257–272. [11] Giugno, R. and T. Lukasiewicz, P-SHOQ(D): A Probabilistic Extension of SHOQ(D) for Probabilistic Ontologies in the Semantic Web, in: S. Flesca, S. Greco, N. Leone and G. Ianni, editors, Proceedings of JELIA 2002, LNAI 2424, pp. 86–97. [12] Kraus, S., D. Lehmann and M. Magidor, Nonmonotonic reasoning, preferential models and cumulative logics, Artificial Intelligence 44 (1990), pp. 167–207. [13] Straccia, U., Default inheritance reasoning in hybrid kl-one-style logics, in: Proceedings of IJCAI, 1993, pp. 676–681. 16

Log In

Extension of Description Logics for Reasoning About Typicality

Related papers

Related topics