Academia.eduAcademia.edu

Finite State Automatons in Information States

The aim of this paper is to introduce a certain kind of information state representation in a dynamic system of propositional logic, using finite state automatons and highlight its advantages, including relation to inquisitive semantics and belief revision.

3 Finite state automatons in information states* Emil Gergely Dyekiss Eötvös Loránd University The aim of this paper is to introduce a certain kind of information state representation in a dynamic system of propositional logic, using finite state automatons and highlight its advantages, including relation to inquisitive semantics and belief revision. 1 Preface Dynamic semantics offers a very straightforward and plausible theory for modelling dialogues. In dynamic semantics the meaning of a sentence is a function which assigns information states to information states. Sentences change the information in the mind of the hearer. (Veltman 1996: 1) The aim of modelling a dialogue is at least twofold. One is better understanding the process of interactions in dialogues, the other is simulation (or computational implementation) of a dialogue system. In the latter case it is very important what kind of tools and structures the model relies on. In a theory some structures are plausible and easy to use, but they are unusable in an implemented system. Such structures are the infinite sets. It is better to get rid of them already in the theory. The simplest dynamic semantic theories are modelling information states as model sets. See for instance (Kálmán and Rádai 2001: 88). An information state consists of models assumed possible by the hearer. Depending on the theory, these sets can be infinite. Of course, we cannot handle them in an implemented dialogue system. We cannot store them, just perhaps give a method to enumerate them. It is unlikely that people have such structures in their mind. If we switch from model sets to structured information states, because of the poor abilities of the former, the latter can be based upon (parts of) formulae to be able to handle revision. But if the dialogue changes without growing information, the information states (based on formulae) change, what contradicts our intuition that without new information, no information state change should be performed. If ⇤ I would like to thank the valuable help of László Kálmán, Márton Makrai and everyone who made comments on my thoughts or asked me about details and finally driving me on a path to arrive to this article — including Márta Maleczki, Márton Muntág, Péter Rebrus, Anna Szabolcsi, Dániel Vásárhelyi. I hope that I did not forget to mention anybody and I admit that all mistakes in this article are my own. 46 E MIL G ERGELY DYEKISS we model information states by automatons then we can perhaps define information states such a way that old information is accepted by the automaton and causes no change in the information state. So perhaps automatons offer a better approach. In this article I suggest modelling information states by tables and automatons instead of model sets and I investigate the advantages of this turn. I try to apply as simple automatons as possible: finite state automatons, covering as wide variety of phenomena as possible. 2 The language we are modelling The definitions in this section are based on (Kálmán and Rádai 2001: 207–209, 88-89), but slightly changed. 2.1 Syntax Definition 1 The Language of Propositional Logic L0 =de f hLC, NLC, Fi Members: Logical and non-logical constants, formulae respectively. Definition 2 Logical Constants of L0 LC =de f {(, ), ¬, ^, _} Members: Symbols of opening and closing parentheses, negation, conjunction, disjunction, respectively. Definition 3 Non-logical constants of L0 NLC =de f {p, q, ...} Members: Symbols of atomic propositions. Definition 4 Formulae of L0 F is the smallest set satisfying the following conditions: 1. If p 2 NLC then p 2 F. A propositional constant is a formula. 2. If A 2 F then ¬A 2 F. A negated formula is a formula. 3. If A, B 2 F then (A ^ B) 2 F. Conjunctive formulae. 4. If A, B 2 F then (A _ B) 2 F. Disjunctive formulae. Finite state automatons in information states 47 2.2 Classical Semantics For later reference, I define the classical semantics for this language. Definition 5 Discourse Universe U =de f {T [ F} where T is the set of true, F is the set of false statements and they satisfy that T \ F = 0/ and T [ F 6= 0. / Definition 6 Interpretation Function For an interpretation function ρ it is true that ρ(p) 2 U in case of p 2 NLC. Definition 7 Models of Propositional Logic M =de f hT, F, ρi where T is the set of true, F is the set of false statements and ρ is an interpretation function. Definition 8 Classical Semantic Values for Propositional Logic 1. [p]M =de f 1, if ρ(p) 2 T , 0 otherwise. 2. [¬p]M =de f 1, if ρ(p) 2 F, 0 otherwise. 3. [(A ^ B)]M =de f 1, if [A] = 1 and [B] = 1, 0 otherwise. 4. [(A _ B)]M =de f 1, if [A] = 1 or [B] = 1, 0 otherwise. assuming that the semantic value of the formula A in the model M (of language L0 ) is noted by [A]M and p 2 NLC; A, B 2 Form, furthermore 1 denotes the true, and 0 the false truth value. 2.3 Simple Update Semantics Let’s take a look at a simple update semantics for this language. Definition 9 Information States in Simple Update Semantics The set of information states is Σ0 and if M is the class of models of language L0 , then Σ0 =de f P(M), the powerset of M. An information state contains the models assumed possible according to the hearer’s knowledge. 48 E MIL G ERGELY DYEKISS Definition 10 Semantic Values in Simple Update Semantics If A is a formula, then its semantic value is JAK : Σ0 ! Σ0 1. JpK(σ ) =de f {M 2 σ : [p]M = 1} 2. J¬AK(σ ) =de f σ \ JAK(σ ) 3. J(A ^ B)K(σ ) =de f JBK(JAK(σ )) = JAK ◦ JBK(σ ) 4. J(A _ B)K(σ ) =de f JAK(σ ) [ JBK(σ ) Assuming that p 2 NLC; A, B 2 Form and [A] is the classical semantic value of A. Reverting from Contradictory States is Impossible Note that according to this simple semantics, all formulae eliminate models from the former information state, keeping no models in the information state after a contradiction. Since there is no method to add models to an information state by any formulae, it is not possible to revert from a contradictory information state.1 Even if there would be a formula with the ability of adding models to an information state, in the case of such a simple structure as a model set, we have no information about which models to add after revision. So we could not get an adequate result after revision. 3 Automatons In Information States Let’s create automatons accepting model sets! For this purpose we have to slightly change the definition of L0 . The only necessary change affects the set of non-logical constants. Non-logical constants should be ordered. I will use a notation of a letter and a number instead of different letters. Definition 11 Nonlogical Constants Revisited NLC =de f {pi : i 2 N, p1 2 NLC, and there is no such 1 < j 2 N, that if p j 2 / NLC, that is: the elements of NLC are such pi -s, that i starts from NLC, then p j−1 2 1 and continuously increases by 1 (maybe until infinity). N denotes the set of natural numbers. Now we can create a code for each model, and use this code as input for the automatons. We use the classical semantics of the models. 1 In the system proposed in my paper we can have several contradictory states. In the original update semantics in (Veltman 1996: 8) there was only one, and it was called the ‘absurd state’ Finite state automatons in information states 49 Definition 12 Coding the Models The code of a model M 2 M is a string on the alphabet Σ = {0, 1}. The nth letter in the code is the value of [pn ]M . Automatons will read the codes of the models and accept or reject them as necessary. The intention is that such an automaton should accept the models assumed possible for the hearer in the current information state. The ordering of atomic propositions seems implausible if we think of human behaviour. People have access to propositions so fast, that a random access seems more probable than a sequential one. This is a shadow on my proposal, but helps defining a simple but powerful representation. About The Finiteness of the Codes Depending on the size of NLC, codes of the models can be of even infinite length. This will not cause problems in practice, because we will construct automatons such a way, that if the automaton enters an accepting (terminal) state, reading the ‘rest’ of the code will not be necessary, it will be accepted anyway. This terminal state will be reached after finite steps and no edge will lead to other state from it, it will be a ‘tale’ of the automaton with loopback edges of all letters of the alphabet. 3.1 Examples Before the exact definition of building automatons for the information states, I will give examples to show how they work. The automatons on figure 1 contain exactly one initial state (marked by a short arrow pointing towards them2 ) and one terminal state (marked by a short arrow pointing away from them) in the mentioned ‘tale’ style. The automaton on the left accepts codes starting with ‘1’ what stands for evaluating p1 to true. No path to the terminal state for codes starting with ‘0’. The one on the right will reject codes containing ‘1’ as the second letter, i.e. the codes of the models evaluating p2 to true. Figure 1: Two simple automatons: one for p1 and another for ¬p2 2 Using the notation of (Eilenberg 1974: 13) 50 E MIL G ERGELY DYEKISS The automaton on figure 2 demonstrates conjunction. There is no edge from the state labelled p2 to the state labelled p3 with letter ‘1’, similarly, no edge from p5 with letter ‘0’. So this automaton accepts only models evaluating p2 to false and p5 to true. Figure 2: Automaton for ¬p2 ^ p5 The single(!) automaton on figure 3 has two initial states, and two terminal states. It can be split to two separate branches — each containing one initial and one terminal state. The branches can be treated as separate automatons accepting formulae standing on the sides of the disjunction. First branch accepts models evaluating (¬p2 ^ p5 ) to true, the other accepts models evaluating (p1 ^ p3 ) to true. Finally, the automaton with the two branches accepts models acceptable by the first or the second branch. Figure 3: Automaton including disjunction (¬p2 ^ p5 ) _ (p1 ^ p3 ) 3.2 Towards Technical Specification Now it is time to define how we can build these automatons. We have to give exact specification of adding states, edges and how to label them. The last example showed that labelling the states is not so easy. The automaton contained states with similar labels. If we assume several branches, we can get unreadable or confusing, perhaps ambiguous labels. Somehow we have to specify which state we are talking about. This can be achieved by labelling or by some other technique. I choose the latter and introduce a rich description of automatons by tables or spreadsheets. This is defined in the next chapter. I will offer a way for deriving the automatons from these tables. Finite state automatons in information states 4 51 Spreadsheet Semantics for Dialogues The tables used for representing dialogues are designed for containing a bit more data than necessary for the construction of automatons. They store the whole history of the semantics of the dialogue. I assume that the table has a header which is not treated as a data row. A table will have some ‘administrative’ columns additional to the ones containing data for edges. Because of the small alphabet used, tables will be completely different from the usual transition matrix representation. See (Eilenberg 1974: 14). The administrative columns are the following: Definition 13 Administrative Columns of the Tables Number: A sequence number for identifying the rows. Topmost row has the number 1 and each row has the number we get by adding one to the number of the row right above it. Parent: The number of the row which was the immediate ancestor of the row. Alive: A value indicating if this row is ‘alive’ or ‘dead’ (is participating in the creation of the automaton or not). This value can be 1 (true: alive) or 0 (false). The rest of the columns will belong to constants of NLC. They will contain nothing or ‘0’ or ‘1’ or both, depending on the edges starting from the state. See definition later. Now I define the empty table which corresponds to the original, ignorant information state of the hearer.3 Definition 14 Table for the Empty Information State It contains only the three administrative column headers, no data rows, no other columns. Definition 15 The Effect of an Atomic Formula pi on the Table 1. Changing the Columns We have to achieve to have data columns with header containing p1 ... pi from left to right. If there is no column with header pi , we have to add columns(s) and insert pn in the header of the header of the nth data column. 2. Changing the Rows If the table contains only the header row, then we have to add a new row under the header. Its number will be 1, parent 0, alive: 1. In all non-administrative 3 This state is called ‘minimal’ in (Veltman 1996: 2) 52 E MIL G ERGELY DYEKISS columns we should write ‘0, 1’. Finally we have to remove ‘0’ from the column of pi , keeping only the ‘1’ there. If the table contained data rows, then we have to copy all of them which are alive, and paste under the last row of the table. We have to change the number of the newly inserted rows to the number which is the result of adding 1 to the number of the row one above the new row. Their parents will be the rows which were copied, and the parent rows should be marked as not alive. Now we have to modify the living rows of the table. If they contain no data in the column of pi , then we have to fill all the new empty cells of the row by ‘0, 1’. Finally, we remove ‘0’ from the column of pi (if it contains ‘0’ at all). Note that it might happen that a cell in a living row of the table is empty. This means that the branch of the automaton created by using this row is cut into two disconnected parts. The rightmost part will be unreachable, and left hand part will not lead to a terminal state. This case shows that the branch is contradictory. Definition 16 The Effect of a Negated Atomic Formula ¬pi on the Table We do the same as in case of an atomic formula, but we remove‘1’ instead of ‘0’ where appropriate. Definition 17 The Effect of a Conjunctive Formula (A ^ B) on the Table First we apply A, then B on the result of the previous operation. Definition 18 The Effect of a Disjunctive Formula (A _ B) on the Table If the table does not contain data rows, then we add rows by processing A, then we fill another empty table by processing B. If they contain different number of columns, we add columns (and fill their headers) to the table which contains fewer columns, to get tables with the same number of columns. During this addition we fill the new cells in the living rows of the table by ‘0, 1’. Finally we merge the two tables by inserting the rows of the table of B under the rows of the table of A. We also have to increase the numbers in the inserted rows by the count of the original rows of the table of A in the columns ‘Number’ and ‘Parent’ (except if column ‘Parent’ contains 0). If the table already contained data, we have to do the same, but we have to keep the currently living original rows of the table as rows not alive, above the rows generated by A, which will be also above the rows generated by B. Definition 19 The Effect of a Negated Conjunctive Formula ¬(A ^ B) on the Table We use De Morgan’s law and proceed with processing (¬A _ ¬B). Finite state automatons in information states 53 Definition 20 The Effect of a Negated Disjunctive Formula ¬(A _ B) on the Table We use another law of De Morgan and proceed with processing (¬A ^ ¬B). Definition 21 The Effect of a Double Negated Formula ¬¬A on the Table We drop the double negation and process A. # Parent Alive p1 p2 1 2 0 1 0 1 0, 1 0, 1 0 0 0, 1 3 (1) 4 (2) 0 3 (1) 0 1 1 1 0, 1 1 p3 p4 p5 Comment: Header 0, 1 1 ¬p2 (¬p2 ^ p5 ) 0, 1 0, 1 p1 (p1 ^ p3 ) + 2 cells! Table 1: The table representation of (¬p2 ^ p5 ) _ (p1 ^ p3 ) with additional comments We defined the effect of all kind of formulae on the tables — but our goal is to have automatons. Now I define how to generate automatons from the tables. Definition 22 Generating Automatons Based on Tables 1. We care only for the ‘living’ rows. 2. Let’s add an automaton state for all cells of the table in the non-administrative columns of data rows. 3. Let’s mark the states belonging to the first data column as initial states. 4. Let’s add one more state on the right of the state belonging to the last data column of each data row, marked as terminal state. 5. Let’s add edges to the automaton. The numbers ‘0’ and ‘1’ in the cells are to be labels for the edges between the cell’s automaton state and its right neighbour. If the cell contains both of them, two edges are necessary, but if the cell is empty, we do not have any edges between the two states. 6. Let’s add edges beginning at the terminal states of the automaton. We need two for each. One with the label ‘0’, and one with ‘1’, both ending also at this state. To see the relation between the table representation and the automaton generated from it, compare figure 3 and the table above. 54 E MIL G ERGELY DYEKISS Notes Note that we use finite state automatons. If the automaton has only one living row, then the automaton is deterministic, but if it has more than one living rows, then it is non-deterministic. Non-determinism is due to disjunction. The difference between the table representation and the automatons is the lack of dialogue history in the latter. The algorithm of automaton generation from the tables is simple. Almost reversible — except the data in the administrative columns and in not living rows — so we cannot reconstruct the history of the dialogue based on the automaton of the information state. 5 Central Semantic Concepts We have to define a few semantic concepts to be able to say something about the logical properties of the system. We need at least the following: Definition 23 Compatibility An information state σ is compatible with the formula A if by applying the formula to σ we get an information state with an automaton in which there is at least one path from at least one of the initial states to at least one of the terminal states. Definition 24 Incompatibility An information state σ is incompatible with a formula A if by applying the formula to the information state we get an information state with an automaton in which there is no path from any of the initial states to any of the terminal states. Definition 25 Support An information state σ supports a formula A if in the table representation of the information state we get by applying A to σ , every living row of the original table will be a parent of at least one of the new rows of the new table. Definition 26 Consequence A formula B is consequence of the formulae A1 , A2 , ..., An iff for every information state σ it is true that by applying A1 to σ , and then A2 on its result etc. until An , the resulting information state supports B. Finite state automatons in information states 6 55 Outlook 6.1 Inquisitive Semantics Inquisitive semantics (Groenendijk and Roelofsen 2009) makes difference between the informative and the inquisitive content of sentences. Informative content can eliminate some of the still possible models, while inquisitive content creates (perhaps overlapping) groups of them, called alternatives. The traditional semantics of questions (Groenendijk and Stokhof 1997) uses disjunct partitions of the models — overlapping of the alternatives is the real new light in inquisitive semantics. Figure 4: (¬p1 _ p2 ) by an automaton and in inquisitive semantics Besides questions, disjunction also bears inquisitive content. In a simple p1 _ p2 case inquisitive semantics assumes two alternatives: one in which p1 is true and another, where p2 is true. They overlap: p1 and p2 can be true at the same time. We can see in figure 4, that the automaton has two branches — they are logically equivalent to the alternatives of inquisitive semantics. This example is very simple. If we consider more complex alternatives by more complex formulae, then we will have more branches then alternatives. But we can make groups of the branches. Parents of the branches (more precisely: the rows of the table from which we derived the branch) can create the real alternatives from branches. If all the branches have the same parent, a branch alone represents an alternative. If there are more then one branches, and they have (at least some) different parents, then the parents determine the alternatives. 6.2 Belief Revision The simple update semantics for propositional logic, which represents information states as model sets, is not able to get out of a contradictory information state, because it is represented by an empty set and we do not know which models to add, moreover we do not have operators which can add models to the set (just eliminating rules). 56 E MIL G ERGELY DYEKISS We can define contraction and revision by the table and the automaton representation of information states. The most important application of revision is perhaps to get out of a contradictory information state. A contradictory information state is represented by an automaton in which there is no path from any initial state to any terminal state — or represented by a table which has at least one empty cell in each of its living rows. In my approach, revision (Alchourrón, Gärdenfors, and Makinson 1985) of an atomic formula or its negation is quite straightforward, because we have to add a ‘0’ or a ‘1’ into each cell in the column which has this atomic formula in its header. (Or in a more complicated way: adding new rows, and making changes there.) This approach enables to get out of the contradictory state by a sequence of some (can be more than one) steps. Contraction or revision of compound formulae is a bit more difficult. If we want to enable contraction of only those formulae which were part of the dialogue, we need the table representation (opposed to the automaton representation), because it preserves the whole history of the dialogue. I cannot define the exact steps of contraction here, but it is necessary to note, that the structured information state described in this article make it possible to define contraction at all, which is important. I also note that questions can make it possible to determine which propositions to contract, and the information state representations described here make it possible to handle questions and answers in general (similarly to disjunction, using the concept of alternatives of inquisitive semantics) and to propose appropriate question for contraction. 7 Summary In this article I suggested tables for the representation of information states in a dynamic propositional logic system. These tables can store historical data about the dialogue and serve as base for the derivation of finite state automatons which can accept model sets by reading their codes. I also stated relations to other theories (inquisitive semantics and belief revision) emphasizing that this kind of representation is powerful enough for defining the semantics of questions in line with inquisitive semantics, and for defining contraction and revision, two important operations of belief revision. Finite state automatons in information states 57 Future stories The system I suggested was sketched briefly and could be defined in a more exact formal way, including the semantics of questions and answers, handling contradiction, contraction and revision. The most useful improvement of this approach would be to extend the theory to predicate logic in which dynamic semantics brings its very natural and impressive power of handling existential quantification and binding of discourse referents. Bibliography Alchourrón, C. E., P. Gärdenfors, and D. Makinson (1985). On the logic of theory change: Partial meet contraction and revision functions. Journal of Symbolic Logic 50, 510–530. Eilenberg, S. (1974). Automata, Languages, and Machines. New York and London: Academic Press. Groenendijk, J. and F. Roelofsen (2009, May 6-8). Inquisitive semantics and pragmatics. In Proceedings of the International Workshop on Semantics, Pragmatics and Rhetorics, Donostia, Spain. Groenendijk, J. and M. Stokhof (1997). Questions. Handbook of Logic and Language 25, 1055–1124. Kálmán, L. and G. Rádai (2001). Dinamikus szemantika. Budapest: Osiris Kiadó. Veltman, F. (1996). Defaults in update semantics. Journal of Philosophical Logic 25, 221–261.