Random bipartite posets and extremal problems

William Trotter

Random bipartite posets and extremal problems

William Trotter

2020, Acta Mathematica Hungarica

visibility

…

description

24 pages

link

1 file

Previously, Erdős, Kierstead and Trotter [5] investigated the dimension of random height 2 partially ordered sets. Their research was motivated primarily by two goals: (1) analyzing the relative tightness of the Füredi-Kahn upper bounds on dimension in terms of maximum degree; and (2) developing machinery for estimating the expected dimension of a random labeled poset on n points. For these reasons, most of their effort was focused on the case 0 < p ≤ 1/2. While bounds were given for the range 1/2 ≤ p < 1, the relative accuracy of the results in the original paper deteriorated as p approaches 1. Motivated by two extremal problems involving conditions that force a poset to contain a large standard example, we were compelled to revisit this subject, but now with primary emphasis on the range 1/2 ≤ p < 1. Our sharpened analysis shows that as p approaches 1, the expected value of dimension increases and then decreases, answering in the negative a question posed in the original paper. Along the way, we apply inequalities of Talagrand and Janson, establish connections with latin rectangles and the Euler product function, and make progress on both extremal problems.

arXiv:2003.07935v1 [math.CO] 17 Mar 2020 RANDOM BIPARTITE POSETS AND EXTREMAL PROBLEMS CSABA BIRÓ, PETER HAMBURGER, H. A. KIERSTEAD, ATTILA PÓR, WILLIAM T. TROTTER, AND RUIDONG WANG Abstract. Previously, Erdős, Kierstead and Trotter [5] investigated the dimension of random height 2 partially ordered sets. Their research was motivated primarily by two goals: (1) analyzing the relative tightness of the Füredi-Kahn upper bounds on dimension in terms of maximum degree; and (2) developing machinery for estimating the expected dimension of a random labeled poset on n points. For these reasons, most of their eﬀort was focused on the case 0 < p ≤ 1/2. While bounds were given for the range 1/2 ≤ p < 1, the relative accuracy of the results in the original paper deteriorated as p approaches 1. Motivated by two extremal problems involving conditions that force a poset to contain a large standard example, we were compelled to revisit this subject, but now with primary emphasis on the range 1/2 ≤ p < 1. Our sharpened analysis shows that as p approaches 1, the expected value of dimension increases and then decreases, answering in the negative a question posed in the original paper. Along the way, we apply inequalities of Talagrand and Janson, establish connections with latin rectangles and the Euler product function, and make progress on both extremal problems. 1. Introduction This paper is concerned primarily with the combinatorics of ﬁnite partially ordered sets, also called posets, but to motivate our line of research, we start with a brief discussion of analogous questions for graphs. For a graph G, let ω(G) denote the clique number of G, the maximum number of vertices in a complete subgraph of G. Also, let χ(G) denote the chromatic number of G, the least number of colors required for a proper coloring of G. Let N denote the set of positive integers, and when n ∈ N , we write [n] for {1, ..., n}. Let n ∈ N , and let G be a graph on n vertices. Then we have the trivial inequality ω(G) ≤ χ(G) ≤ n. Furthermore, if χ(G) = n, then ω(G) = n. We then ask whether these statements are “stable,” i.e., if G is a graph on n vertices and χ(G) is close to n, must ω(G) also be close to n? More formally: Question 1.1 (Question 1 for Graphs). Does there exist a function f : N → N such that for every c ∈ N , if n > f (c), G is a graph on n vertices, and χ(G) ≥ n−c, then ω(G) ≥ n − f (c)? The answer to Question 1 for graphs is easily seen to be yes, since we may take f (c) = 2c. To see that this function satisﬁes the desired property, we simply carry out the following iterative process: While G is not a complete graph, choose two Date: February 21, 2020. 2010 Mathematics Subject Classification. 06A07, 05C35. Key words and phrases. Poset, bipartite poset, dimension, standard example. 1 2 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG non-adjacent vertices and remove them. Each such operation decreases the size of the graph by 2 but lowers the chromatic number by at most 1. So the operation must halt in at most c steps. Here is a second question of a related nature. Question 1.2 (Question 2 for Graphs). For integers k and n with k ≥ 2 fixed and n → ∞, what is the maximum value g(k, n) of χ(G) among all graphs G on n vertices with ω(G) < k? For example, when k = 3, a graph G with ω(G) < 3 is said to be triangle-free, and it is well known that g(3, p n), the maximum chromatic number of a triangle-free graph on n vertices, is Θ( n/ log n). The primary goal of this paper is to investigate analogous questions for posets. We assume that readers are familiar with basic notation and terminology for posets, including comparable and incomparable pairs of points; chains and antichains; minimal and maximal elements; and linear extensions. Beyond these basics, we will include all essential notation and terminology for the results presented in this paper. To develop the poset analogue of chromatic number, we have the following deﬁnitions. Let P be a poset. A non-empty family F = {L1 , . . . , Ld } of linear extensions of P is called a realizer of P when x ≤ y in P if and only if x ≤ y in Lj for each j ∈ [d]. Dushnik and Miller [4] deﬁned the dimension of a poset P , denoted dim(P ), as the least positive integer d for which there is a realizer F = {L1 , . . . , Ld } of P . Analogies between dimension for posets and chromatic number for graphs have been widely studied, and indeed the book chapter [14] is devoted entirely to this topic. For the poset analogue of a clique, we have the following construction. For an integer d ≥ 2, let Sd be the height 2 poset with Min(Sd ) = {a1 , . . . , ad }, Max(Sd ) = {a′1 , . . . , a′d } and ai < a′j in Sd if and only if i 6= j. Clearly, dim(Sd ) = d, and posets in the family {Sd : d ≥ 2} are called standard examples. For a poset P , we deﬁne the standard example number of P , denoted se(P ), as follows. Set se(P ) = 1 if P does not contain a subposet isomorphic to the standard example S2 ; otherwise se(P ) is the largest d ≥ 2 for which P contains a subposet isomorphic to the standard example Sd . We then have the trivial inequality dim(P ) ≥ se(P ). As is well known, for every d ≥ 2, there is a poset P with se(P ) = 1 and dim(P ) = d. Nevertheless, it is of interest to study classes of posets where large dimension requires large standard example number. When n ∈ N and G is a graph on n vertices, the inequality χ(G) ≤ n is trivial, as is the assertion that the inequality is tight only when G is a complete graph on n vertices. The analogous results for posets are more substantive. Hiraguchi [9] proved that if n ≥ 2 and P is a poset on 2n+1 points, then dim(P ) ≤ n. Kimble [11] proved1 that if n ≥ 4, this inequality is tight only when P contains the standard example Sn . For the poset analogue of Question 1 for graphs, we then have: Question 1.3 (Question 1 for Posets). Does there exist a function sa : N → N such that for every c ∈ N , if n > sa(c), P is a poset on 2n+1 points, and dim(P ) ≥ n−c, then se(P ) ≥ n − sa(c)? In stating Question 1 for posets, we use the notation sa(c) to remind readers that we are discussing “stability analysis.” Unlike the situation with graphs, we 1We refer the reader to the discussion in [14] about subtleties of this proof, and we note that it does not hold when n = 2 or when n = 3. RANDOM POSETS 3 know of no elementary argument to show that the function sa(c) is well deﬁned, ignoring any issue of how fast sa(c) would have to grow in terms of c. However, in [3], Biró, Hamburger, Pór and Trotter showed the function sa(c) is well deﬁned and satisﬁes sa(c) = O(c2 ). Also, they gave a construction using ﬁnite projective planes to show that sa(c) = Ω(c4/3 ). The research for this paper began with the challenge of ﬁnding the correct exponent on c in the function sa(c), knowing that the answer is in the interval [4/3, 2]. In this paper, we will raise the lower bound on this interval to 3/2. For the poset analogue of Question 2 for graphs, we have: Question 1.4 (Question 2 for Posets). For integers d and n with d ≥ 2 fixed and n → ∞, what is the maximum value f (d, n) of dim(P ) among all posets P on n points with se(P ) < d? Question 2 for posets was ﬁrst posed in [13], and then referenced again in [5]. Here we obtain a better result, and we remove the requirement that d be large. For historical reasons, the value of f (2, n) has been studied—albeit with diﬀerent notation and terminology—for many years. No doubt this results from the fact that the class of posets with standard example number 1 is the class of interval orders. Combining results of several authors (see the discussion in [3]), the value of f (2, n) can be determined to within an additive error of at most 5. However, as a crude estimate, we have f (2, n) = lg lg n + (1/2 + o(1)) lg lg lg n. For a ﬁxed value of d ≥ 3, Biró, Hamburger and Pór [2] proved that f (d, n) = o(n), but this leaves open the possibility that f (d, n) behaves in the same slowgrowing manner as f (2, n). However, we will show in Section 6 that there is a positive constant αd so that f (d, n) = Ω(nαd ). 1.1. Links with Random Bipartite Posets. Working on Question 1 for posets led us to revisit the following model for a random bipartite poset, introduced and studied by Erdős, Kierstead and Trotter [5]. Let n ∈ N and ﬁx disjoint sets A and A′ , each of size n. Then Ω(n, p) denotes the probability space consisting of posets P such that (1) the ground set of P is A ∪ A′ with A ⊆ Min(P ) and A′ ⊆ Max(P ); and (2) for a pair (a, a′ ) ∈ A × A′ , set P(a < a′ in P ) = p, (in general, p is a function of n) with events corresponding to distinct pairs independent. To place this work in historical perspective, we give here a brief overview of key results, beginning with a discussion of upper bounds on dimension. For a poset P (of arbitrary height), let ∆U (P ) denote the maximum size of sets of the form UP (x) = {y ∈ P : x ≤ y in P } taken over all elements x ∈ P . Analogously, ∆D (P ) is the maximum size of sets of the form DP (x) = {z ∈ P : z ≤ x in P }. Then set ∆(P ) = max{∆U (P ), ∆D (P )}. In [8], Füredi and Kahn proved that if ∆(P ) = k, then dim(P ) < 50k log2 k. The ﬁrst inequality in the following theorem is due to Füredi and Kahn [8]. The second is a quite recent result of Scott and Wood [12] improving the bound from [8] cited immediately above. Readers may note that the Lovász local lemma [6] was used in both [8] and [12]. 4 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG Theorem 1.5. If u = ∆U (P ), k = ∆(P ) and |P | = n, then dim(P ) < 1 + 2(u + 1) log n dim(P ) < k log 1+o(1) and k Continuing with upper bounds, as is well known, almost all labeled posets on n points have the following structure: P is the union Min(P ) ∪ A ∪ Max(P ) of three disjoint antichains; the size of A is (1/2 ± o(1))n; both Min(P ) and Max(P ) have size (1/4 ± o(1))n; and x < y in P whenever x ∈ Min(P ) and y ∈ Max(P ). It is then straightforward to show that there is a constant c1 > 0 such that almost all labeled posets on n points have dimension at most n/4 − c1 n/ log n. Turning to lower bounds, it is more challenging to ﬁnd good lower bounds in either of these two settings. For example, no explicit construction is known for a poset P with ∆(P ) = k and dim(P ) > k + 1 for any value of k. Also, simple counting only shows that almost all labeled posets on n elements have dimension Ω(n/ log n). The following lower bounds are proved in [5]. In stating these bounds, we use the standard abbreviation a.a.s. for asymptotically almost surely. Theorem 1.6. For every ǫ > 0, there exists positive constants δ1 , δ2 , δ3 so that a.a.s., ( −1+ǫ δ1 pnnlog pn < p ≤ 1/ log n, and o if n dim(P ) > max δ2 n, n − δ3 n/(p log n) if 1/ log n ≤ p < 1 − n−1+ǫ . The ﬁrst inequality in Theorem 1.6 shows that the two upper bounds in Theorem 1.5 are essentially best possible—although there remains an o(log k) multiplicative gap for the second. The second inequality in Theorem 1.6 was used in [5] to show that is a positive constant c2 > 0 so that almost all labeled posets on n elements have dimension greater than n/4 − c2 n/ log n. When n−1 log2 n < p ≤ 1/ log n, good upper bounds on the expected value of the dimension of a poset P ∈ Ω(n, p) are provided by Theorem 1.5, since ∆(P ) and ∆U (P ) are sharply concentrated around pn. For the range 1/ log n ≤ p < 1, the following upper bound is given in [5]. Theorem 1.7. If ǫ > 0 and 1/ log n ≤ p < 1, then a.a.s., dim(P ) ≤ n − n log(1/p) . (2 + ǫ) log n The improvements we make here will all be for the range p ≥ 1/2. Accordingly, we extract the following upper and lower bounds from Theorems 1.7 and 1.6 for this range. Note that the modest improvement in the upper bound results from the narrowing of the range on p. Note also that p is bounded away from 1 for the lower bound. Corollary 1.8 (Old Upper and Lower Bounds). Suppose 1/2 ≤ p < 1, then a.a.s., dim(P ) < n − n log(1/p) . 2 log n Furthermore, for every ǫ > 0, there exists δ > 0 so that p < 1 − n−1+ǫ , then a.a.s., δn . dim(P ) > n − log n RANDOM POSETS 5 In discussing the merits of our new results, the old upper and lower bounds given in Corollary 1.8 will be the basis of comparison. −1/2 To make the connection with Question . If √ 1 for posets concrete, let p = 1−n the old upper bound P ∈ Ω(n, p), then a.a.s., se(P ) = O( n log n). Furthermore, √ in Corollary 1.8 implies that a.a.s., dim(P ) ≤ n− n/(2 log n). It is easy to see that if this upper bound is tight, up to a poly-log multiplicative factor on the diﬀerence n − dim(P ), then the exponent 2 on c in the function sa(c) is correct. However, the old lower bound in Corollary 1.8 only asserts that there is a constant δ such that a.a.s., dim(P ) ≥ n − δn/ log n. This inequality is enough to prove that sa(c) = Ω(c log c), but we already had a constructive proof of an even better lower bound. This shortcoming was the launching point for revisiting the subject of dimension for random bipartite posets, but now with the speciﬁc goal of obtaining better bounds when p ≥ 1/2. The bridges to Question 2 for posets were not clear at the outset of our research but came into view as better bounds and connections to Question 1 unfolded. These details will become clear later in the paper. 1.2. Statement of Improved Bounds for Random Bipartite Posets. To avoid sporadic eﬀects when p is very close to 1, we assume p ≤ 1 − n−1 log2 n. Consistent with modern research in combinatorics, we will typically treat a quantity like (1 − p)n/ log n as if it is an integer when it tends to inﬁnity with n. The minor errors this approach introduces can be easily repaired. For the range 1/2 ≤ p ≤ 1 − n−1 log2 n, many of the results and arguments are more naturally phrased in terms of the complementary parameter q = 1 − p. Accordingly, for the balance of the paper, the symbol p will be used exclusively as a quantity (usually a function of n) from the interval [0, 1], while q will always be 1 − p. Some of the proofs of our new results are extensions and reﬁnements of arguments appearing in [5] and [3], but most of our results require entirely new approaches. In particular, we will apply some second moment methods, Talagrand’s inequality and Janson’s inequality. None of these tools were used in [5] or in [3]. Our improved upper bounds, stated below Q∞in comprehensive form, involve the well-studied Euler product function φ(q) = i=1 (1 − q i ). Theorem 1.9. Suppose n−1 log2 n ≤ q ≤ 1/2, 0 < ǫ < 1 and z = n2 q log(1/φ(q)). Then a.a.s.,   if n−1 log2 n ≤ q ≤ n−1/2 log n. n − (2 − ǫ) log(qn)/q dim(P ) < n − qn/(2 log(qn)) if n−1/2 log n < q ≤ n−1/3 .   n − n log(1/φ(q))/ log z if n−1/3 < q ≤ 1/2. In the discussion to follow, we will refer to these three inequalities as New Upper Bounds (1), (2), and (3). New Upper Bounds (2) and (3) are minor improvements relative to the old upper bound. For example, when q = 1/2, the old upper bound asserts that a.a.s., dim(P ) < n − 0.346n/ log n. On the other hand, New Upper Bound (3) improves this to a.a.s., dim(P ) < n − 0.621n/ log n. When √ q = n−1/2 log n, the old upper bound asserts that a.a.s., dim(P√) < n − n/2, while New Upper Bound (2) improves this to a.a.s., dim(P ) < n − n. New Upper Bound (1) is substantially better than the old bound. For example, when q = n−α and 0 < α < 1/2, the old upper bound asserts that a.a.s., dim(P ) < 6 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG n − n1−α /2 log n, while New Upper Bound (1) improves this to a.a.s., dim(P ) < n − (1 − α)nα log n. Here is a statement, again in comprehensive form, of our improved lower bounds. Theorem 1.10. Suppose n−1 log2 n ≤ q ≤ 1/2 and 0 < ǫ < 1. If q ≥ n−1/4 log3 n, set z = log n + 4 log q − 8 log log n. Then a.a.s.,  n − (2 + ǫ) log(qn)/q if n−1 log2 n ≤ q ≤ n−4/5 .    1/2  n − 32 n log n/q if n−4/5 ≤ q ≤ (32)1/3 n−1/3 log1/3 n. dim(P ) >  n − 8qn if (32)1/3 n−1/3 log1/3 n ≤ q ≤ 18 n−1/4 log3 n.    n − 24qn/z if 18 n−1/4 log3 n ≤ q ≤ 1/2. In the discussion to follow, we will refer to these inequalities as New Lower Bounds (1) through (4). In the range n−1 log2 n ≤ q ≤ 1/2, our bounds show that a.a.s., dim(P ) ≥ (1−o(1))n. So the accuracy of our upper and lower bounds should be judged on the quantity n−dim(P ). For this quantity, our upper and lower bounds diﬀer by a multiplicative factor that is 1+o(1) when n−1 log2 n ≤ q ≤ n−4/5 , and by O(log n) when n−1/3 ≤ q ≤ 1/2. In [5], it was asked whether the expected value of dim(P ) behaves monotonically as a function of p. Our results answer this question negatively. In particular, when q = n−1/4 , we have a.a.s., dim(P ) ≥ n − 8n3/4 and when q = n−4/5 , we have a.a.s., dim(P ) ≤ n − n4/5 . In the range n−4/5 ≤ q ≤ n−1/3 , the ratio of our two bounds on n − dim(P ) deteriorates. In particular, for the special value of q = n−1/2 , we are only able to show that (roughly speaking) n1/2 ≤ n − dim(P ) ≤ n3/4 . The remainder of the paper is organized as follows. In the next section, we provide essential background material. In the following three sections, we give proofs of our new bounds, grouping arguments according to the underlying scheme. The setup for the application of the inequalities of Talagrand and Janson will be given just before the results are needed. We return to the motivating extremal problems in Section 6, and we close with some brief comments on remaining problems in Section 7. 2. Essential Background Material For a poset P , we use the compact notation a <P a′ when a < a′ in P . Similarly, we write a kP a′ when a is incomparable to a′ in P . However, to avoid double subscripts, when Lj is a linear extension of P , we will use the long form a < a′ in Lj . We will be concerned (almost exclusively) with the class B of bipartite posets whose ground set is the union of two disjoint antichains A and A′ with A ⊆ Min(P ) and A′ ⊆ Max(P ). Bipartite posets have been studied extensively in the literature, and we will follow here the conventions that have emerged in this research. We will write B(n) for the class of bipartite posets with |A| = |A′ | = n. For a poset P ∈ B, we let IP consist of all pairs (a, a′ ) ∈ A × A′ with a kP a′ . Let d ∈ N , and let F = {L1 , . . . , Ld } be a family of linear extensions of P . We abuse notation slightly and call F a realizer of P if for every (a, a′ ) ∈ IP , there is some j ∈ [d] with a > a′ in Lj . We then deﬁne the dimension of P , denoted dim(P ), as the least positive integer d such that P has a realizer of size d. It is easy to see that this altered notion of dimension never exceeds the original Dushnik-Miller RANDOM POSETS 7 deﬁnition, and the diﬀerence is at most 1. In our work, an additive error of this magnitude can be safely ignored. 2.1. Matchings, Independence Number, and Clique Number. The following proposition, which holds for posets in general, is nearly self-evident. It is stated for emphasis. Proposition 2.1. Suppose that (x, y) is an ordered pair of distinct points in a poset P with x kP y. Then there exists a linear extension L = L(x, y) of P such that: (1) If w ∈ P and w kP x, then x >L w. (2) If z ∈ P and z kP y, then z >L y. When P ∈ B and (a, a′ ) ∈ A × A′ with a kP a′ , we let L(a, a′ ) denote the set of linear extensions of P satisfying the requirements of Proposition 2.1 for the pair (a, a′ ). Let P ∈ B and let d ∈ [n]. A matching (of size d) in P consists of a pair (T, T ′ ) of d-element subsets of A and A′ , respectively, and labelings T = {a1 , . . . , ad } and T ′ = {a′1 , . . . , a′d } such that aj kP a′j for every j ∈ [d]. There are obvious notions of maximal and maximum matchings. Also, when T ⊆ A, T ′ ⊆ A′ , and d = |T | = |T ′ |, we say that T and T ′ can be matched if such labelings of T and T ′ exist. Lemma 2.2. Let P ∈ B. Then dim(P ) is at most the minimum size of a maximal matching in P . Proof. Let T = {a1 , . . . , ad } and T ′ = {a′1 , . . . , a′d } be labelings determining a matching (T, T ′) of size d in P . If this matching is maximal, we show that dim(P ) ≤ d. For each j ∈ [d], let Lj be any linear extension in L(aj , a′j ). Clearly, F = {L1 , . . . , Ld } is a realizer of P . The following elementary lemma is implicit in [5] and explicit in [3]. Lemma 2.3. Let P ∈ B. If IP 6= ∅, and dim(P ) = d, then there is a realizer F = {L1 , . . . , Ld } of P for which there is a matching (T, T ′ ) with T = {a1 , . . . , ad } and T ′ = {a′1 , . . . , a′d } in P such that Lj ∈ L(aj , a′j ) for each j ∈ [d]. Proof. Given a realizer F = {L1 , . . . , Ld }, we note that for every j ∈ [d], the highest element of A is over the lowest element of A′ . If this assertion failed for some j ∈ [d], then F −{Lj } would be a realizer for P . Now carry out the following modiﬁcations, in an iterative manner, to the linear extensions in F . For each j =∈ [d], let aj be the highest element of A. Since d = dim(P ), loss of generality, we may assume that aj > a′ in Lj for every a′ ∈ A′ with aj kP a′ . If not, simply move all such elements from above aj to the gap immediately below it. An analogous remark holds for the lowest element a′j of A′ in Lj . Then for all k with j < k ≤ d, move aj to the bottom of Lk and move a′j to the top of Lk . After these steps have been taken, the resulting family is a realizer satisfying the requirements of the lemma. Throughout this paper, we will exploit connections between posets and graphs, and we have already discussed the clique number of a graph G, denoted ω(G). Here is the analogous concept for bipartite posets. Let P ∈ B. We call a pair (V, V ′ ) a clique pair when V ⊆ A, V ′ ⊆ A′ , and v <P v ′ for all (v, v ′ ) ∈ V × V ′ . A clique pair (V, V ′ ) is balanced if |V | = |V ′ ). In turn, we deﬁne the balanced clique number of P , denoted bcn(P ), as the largest integer r such that P contains a clique pair (V, V ′ ) 8 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG with |V | = |V ′ | = r. By convention, bcn(P ) = 0 if there is no pair (a, a′ ) ∈ A × A with a <P a′ . Let G be a graph. A set I of vertices in G is said to be independent if there are no edges in G with both endpoints in I. In turn, the independence number of G, denoted α(G), is the maximum size of an independent set of vertices in G. Analogously, when P ∈ B, we will refer to a pair (U, U ′ ) as an independent pair, when U ⊆ A, U ′ ⊆ A′ , and u kP u′ for all (u, u′ ) ∈ U × U ′ . An independent pair (U, U ′ ) is balanced if |U | = |U ′ |, and the balanced independence number of P , denoted bin(P ), is the largest integer s such that P contains an independent pair (U, U ′ ) with |U | = |U ′ | = s. Now bin(P ) = 0 if there is no pair (a, a′ ) ∈ A × A′ with a kP a′ . The following lemma is implicit in [3]. Lemma 2.4. Let P ∈ B and suppose that bin(P ) < 2. If IP 6= ∅, then dim(P ) is the minimum size of a maximal matching in P . Proof. Let d = dim(P ). We know from Lemma 2.3 that d is at most the minimum size of a maximal matching. We now show that this inequality is tight. Let F = {L1 , . . . , Ld } be a realizer of P satisfying the requirements of Lemma 2.3. Then let T = {a1 , . . . , ad } and T ′ = {a′1 , . . . , a′d } be the matching associated with F . Set M = A − T and M ′ = A′ − T ′ . We claim that the matching evidenced by T and T ′ is maximal. Suppose this assertion fails and there is a pair (x, x′ ) ∈ M × M ′ with x kP x′ . Since F is a realizer, there is some j ∈ [d] with x > x′ in Lj . This implies that both elements of U = {aj , x} are incomparable with both elements of U ′ = {a′j , x′ }. In turn, this implies that (U, U ′ ) is a balanced independent pair in P , so that bin(P ) ≥ 2. The contradiction completes the proof. 3. Matchings, Clique Number, Independence Number and Talagrand’s Inequality In this section, we prove New Upper Bound (1) and New Lower Bound (1). The proofs have the same ﬂavor, and where their ranges overlap, we are able to determine a.a.s., the expected value of n − dim(P ) to within a multiplicative ratio that is at most 1 + o(1). The arguments for these bounds require preliminary lemmas, some of which may be of independent interest. Let s ∈ [n]. Also let S and S ′ be s-element subsets of A and A′ , respectively, with s = |S| = |S ′ |. We say the defect of the pair (S, S ′ ) is s if (S, S ′ ) is a clique; otherwise, the defect of (S, S ′ ) is the least non-negative integer δ such that there are subsets T ⊆ S and T ′ ⊆ S ′ with s − δ = |T | = |T ′ | such that T and T ′ can be matched. Lemma 3.1. Suppose that n−1 log2 n ≤ q ≤ n−1/2 log n. Then a.a.s., the following statement holds: If n/2 ≤ s ≤ n, S ⊆ A, S ′ ⊆ A′ , and s = |S| = |S ′ |, then the defect of the pair (S, S ′ ) is at most 24/q. Proof. Set δ = 24/q. There are at most 22n pairs of the form (S, S ′ ) where S ⊆ A, S ′ ⊆ A′ and |S| = |S ′ | ≥ n/2. The lemma follows if we can show that for any such pair, the probability that there is no matching of size |S| − δ between S and S ′ is o(2−2n ). Fix such a pair and let s = |S| = |S ′ |. For each non-empty subset W ⊆ S, let N (W ) consist of all elements of S ′ that are incomparable with at least one element of W . If |N (W )| ≥ |W | − δ, for all RANDOM POSETS 9 subsets W ⊆ S, then a matching of the desired size exists. So we consider the event F that holds if there is some subset W ⊆ S with |N (W )| < |W | − δ. Trivially, the inequality |N (W )| ≥ |W | − δ holds when |W | ≤ δ. Also, if F fails for all sets W ⊂ A with |W | = s − δ, then it fails for all sets W ⊂ A with s−δ |W | > s − δ. It follows that F ⊆ ∪i=δ Fi , where event Fi holds when there is an i-element subset W ⊆ S such that |N (W )| < |W | − δ. Now suppose that δ ≤ i ≤ s/2. Then there are at most si choices for the set W . For each choice of W , there are at most si choices for an i-element subset W ′ of S ′ such that W ′ ∩ N (W ) = ∅. It follows that 2 s P(Fi ) ≤ (1 − q)i(s−i) i < exp(2i log n)(1 − q)in/4 since s − i ≥ n/4. < exp(−iqn/8) since qn/4 > 4 log n. < exp(−δqn/8) since i ≥ δ. < exp(2i log n) exp(−iqn/4) = exp(−3n) = o(2 −2n substituting for δ. /n). A symmetric calculation shows P(Fi ) = o(2−2n /n) when s/2 ≤ i ≤ s − δ. It follows that s−δ X P(F ) ≤ = o(2−2n ). i=δ This completes the proof of the lemma. For the proof of the next lemma, we follow (essentially) the notation and terminology of Corollaries 4.3.3, 4.3.4 and 4.3.5 in Alon and Spencer [1]. For a random variable X, we denote the expected value of X as E[X]. Lemma 3.2. Suppose n−1 log2 n ≤ q ≤ n−1/2 log n. Let X be the random variable counting the number of balanced independent pairs of size 2. Then E[X] → ∞ and a.a.s., X ∼ E[X]. 2 ′ Proof. Set m = n2 . Then let {(U1 , U1′ ), . . . , (Um , Um )} be a listing of pairs such ′ that for each i ∈ [m], Ui and Ui are 2-element subsets of A and A′ , respectively. For each i ∈ [m], we have an event Ei that holds if (Ui , Ui′ ) is a balanced independent pair. Also, we let Xi be the associated indicator random variable. Then X = X1 + · · · + Xm , and we note that the random variables X1 , . . . , Xm are symmetric. We note that since q ≥ n−1 log2 n, 2 n E[X] = q 4 ≥ n4 q 4 /5 ≥ log8 n/5 → ∞. 2 We write Ei ∼ Ej when i, j are distinct elements of [m] and the events Ei and Ej are dependent. Clearly, Ei ∼ Ej when Ui × Ui′ and Uj × Uj′ intersect. When they intersect, the number of common pairs is either 1 or 2. We ﬁx an index i and then calculate the quantity ∆∗ deﬁned by X P[Ej |Ei ]. (1) ∆∗ = j∼i 10 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG There are 4(n − 2)2 choices for the index j so that Ui × Ui′ and Uj × Uj have exactly one common pair. For each such j, the value of P[Ej |Ei ] is q 3 . Similarly, there are 4(n − 2) choices for the index j so that Ui × Ui′ and Uj × Uj′ have exactly two common pairs. For each such j, the value of P[Ej |Ei ] is q 2 . Using ﬁrst that q ≥ n−1 log2 n and then that q ≤ n−1/2 log n, we have ∆∗ = 4q 3 (n − 2)2 + 4q 2 (n − 2) < 4q 5 n4 / log4 n + 4q 5 n4 / log6 n = o(E[X]). Now the conditions of Corollary 4.3.5 from [1] are satisﬁed and we conclude that almost always, X ∼ E[X]. The next lemma is a straightforward application of Markov’s inequality. Lemma 3.3. Suppose n−1 log2 n ≤ q ≤ n−1/2 log n and 0 < ǫ < 1. Then a.a.s., bcn(P ) < (2 + ǫ) log(qn)/q. Proof. Set r = (2 + ǫ) log(qn)/q and let X count the number of balanced clique pairs of size r in P . Then 2 2 n E[X] = (1 − q)r < n2r exp(−qr2 ) = exp(−2ǫ log(qn)) = o(1). r Since E[X] = o(1), it follows that a.a.s., bcn(P ) < r. The elementary inequality in Lemma 3.3 is essentially best possible. However, this assertion is considerably more challenging to prove2. There are several diﬀerent forms of Talagrand’s inequalities in the literature. We will use the version given in [10, Theorem 2.29]. Let R and R+ denote, respectively, the set of real numbers and the set of positive real numbers. When (Λ1 , . . . , Λn ) is a sequence of subsets of R, we denote by Λ the product Λ1 × · · · × Λn . When i ∈ [n], z ∈ Λ, we denote by z(i) the value of coordinate i of z. Theorem 3.4 (Talagrand Inequality). Let X = f (Z1 , . . . , Zn ) be a random variable determined by n independent trials Z1 , . . . , Zn , where f : Λ → R and each Zi takes on values in a finite set Λi . Suppose c1 , . . . , cn ∈ R+ and ψ : R → R. If for all z, w ∈ Λ, both (1) for all i ∈ [n], if z(j) = w(j) for all j ∈ [n] − {i}, then |f (z) − f (w)| ≤ ci , and (2) for all if f (z) ≥ α, then there is J ⊆ [n] such that both Pα ∈ R, 2 c ≤ ψ(α), and (a) j j∈J (b) if z(j) = w(j) for all j ∈ J, then f (w) ≥ α, then for every γ ∈ R and every β ≥ 0, β2 P(X ≤ γ − β)P(X ≥ γ) ≤ e− 4ψ(γ) . Lemma 3.5. If n−1 log2 n ≤ q ≤ n−1/2 log n and 0 < ǫ < 1, then a.a.s., bcn(P ) ≥ (2 − ǫ) log(qn)/q. Proof. Before we begin calculations, we explain how Theorem 3.4 will be applied. Label the elements of A and A′ arbitrarily (no assumptions about matchings) as A = {a1 , . . . , an } and A′ = {a′1 , . . . , a′n }. For each i ∈ [n], let Λi be the family of 2 We are grateful to Tomasz Luczak who greatly assisted us in this eﬀort, especially the use of Talagrand’s Inequality, and the setup using a vertex martingale. RANDOM POSETS 11 all subsets of A′ . Then it is natural to view Λ as just a coding of the posets in Bn , i.e., a poset P ∈ B(n) uniquely determines for each i ∈ [n] the set of all a′ ∈ A′ with ai <P a′ . In turn, the random variables Zi with i ∈ [n] then capture the space Ω(n, p). We deﬁne a function f : Λ → N by setting f (P ) = bcn(P ). Then we have a random variable X = f (P ). We note that if P and Q are posets in B(n), and there is some ai ∈ A such that the only diﬀerences between P and Q involve pairs from {ai } × A′ , then |f (P ) − f (Q)| ≤ 1, i.e., we take ci = 1 for all i ∈ [n]. Furthermore, whenever we have f (P ) ≥ α, this can be certiﬁed by a set J of size α. So we simply take ψ(α) = α. With ǫ ﬁxed, we want to show that P X > (2 − ǫ) log(qn)/q tends to 1. Set δ = ǫ/2. It is enough to prove that P X > (2 − ǫ) log(qn)/q ≥ 1 − δ. Set k = γ = (2 − δ) log(qn)/q and β = δ log(qn)/q. Note that γ − β = (2 − ǫ) log(qn)/q. Substituting these values into Talagrand’s inequality, we obtain: 2 2 (qn) − δ log 4q2 k . P X ≤ (2 − ǫ) log(qn)/q P X ≥ k ≤ e Substituting for k in the right hand side of the last inequality, we obtain δ2 . P X ≤ (2 − ǫ) log(qn)/q P X ≥ k ≤ e−ck where c = 4(2 − δ)2 This implies that either (1) P X ≤ (2 − ǫ) log(qn)/q ≤ δ or (2) P(X ≥ k) ≤ e−ck /δ. To complete the proof, we need only show that statement (2) cannot hold. This will be accomplished by showing that P(X ≥ k) ≥ e−o(k) . (2) Now let Y be the random variable counting the number of balanced clique pairs (V, V ′ ) of size k. Then X ≥ k if and only if Y > 0. Then from [10, Remark 3.1], we have (E[Y ])2 (3) P(Y > 0) ≥ . E[Y 2 ] Accordingly, we want to show that P(Y > 0) ≥ e−o(k) . Working with the reciprocal, which simpliﬁes the analysis, we then want to show that: E[Y 2 ] ≤ eo(k) . (E[Y ])2 (4) We have: E[Y ] E[Y 2 ] = 2 (E[Y ]) = Pk k k X X i=0 j=0 2 i=0 k i Pk j=0 n−k k k−i j n 2 k k i n−k k−i k (E[Y ])2 n−k k−j j n−k k−j 2 (1 − q)k −ij (1 − q)−ij . There are (k + 1) terms in the sum, so it suﬃces to show that every term has size at most exp(o(k)). Clearly, this holds whenever i = 0 or j= 0. So we are n concerned only with terms where i, j ≥ 1. Trivially, we have ki n−k k−i ≤ k , so that k n−k (5) i k−i n k ≤ 1. 12 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG A symmetric inequality holds for j. Using only the elementary bound in (5), we observe that if i ≤ k/ log2 (qn), then the term for the pair (i, j) is at most (1 − q)−ij ≤ exp(qk 2 / log2 (qn)) 2 log(qn) / log2 (qn)) q = exp(2k/ log(qn)) ≤ exp(qk = exp(o(k)). A symmetric statement holds when j ≤ k/ log2 (qn). Now we focus on the terms when i, j ≥ k/ log2 (qn). For such terms, we have the following improved bound. !i !i !i n k n−k k 3k 2 3k k k−i i k−i < = . < n n i n in i k k A symmetric inequality holds for j. Accordingly, when k/ log2 (qn) ≤ i, j ≤ k, we have the following upper bound on the term for (i, j): !i+j 3k log2 (qn) (6) exp(qij). n We assume without loss of generality that i ≤ j. We then take the logarithm of the expression in (6) to obtain: (7) (i + j) log 3 + log k + 2 log log(qn) − log n + qij Considering j ﬁxed, this is a linear function of i, deﬁned on the interval [k/ log2 (qn), j]. So it achieves its maximum value either at i = k/ log2 (qn) or at i = j. The choice depends on the sign of the coeﬃcient of i, which is (8) log 3 + log k + 2 log log(qn) − log n + qj = log 3 + log(2 − δ) + 3 log log(qn) − log(qn) + qj. Case 1. j ≤ log(qn) − 3 log log(qn) − log 3 − log(2 − δ) /q. In this case, the coeﬃcient of i is negative, so the maximum value is achieved when i = k/ log2 (qn). The term associated with (i, j) is less than exp(qij), and for j, we use the generous upper bound bound j ≤ log(qn)/q. It follows that the term for (i, j) is at most: exp(q log(qn) k = exp(k/ log(qn) = exp(o(k)). q log (qn) 2 Case 2. j > log(qn) − 3 log log(qn) − log 3 − log(2 − δ) /2. In this case, the maximum value is achieved when i = j, and the term associated with (i, j) is at most: !j 9k 2 log4 (qn) 2 exp(qj) (9) n2 RANDOM POSETS 13 In (9), we note that exp(qj) ≤ exp(qk) = (qn)2−δ . Using this inequality and substituting for k, the expression in (9) becomes: ! 9 log6 (qn) = o(1). (qn)δ With this observation, the proof of the lemma is complete. Lemma 3.6 (New Upper Bound (1)). Suppose n−1 log2 n ≤ q ≤ n−1/2 log n and 0 < ǫ < 1. Then a.a.s., dim(P ) ≤ n − (2 − ǫ) log(qn)/q. Proof. Let ǫ1 = ǫ/2. Then set r = (2 − ǫ1 ) log(qn)/q. Using Lemma 3.5, it follows that a.a.s., the following two statements hold: (1) the balanced clique number of P is at least r, and (2) for every pair (S, S ′ ), with S ⊆ A, S ′ ⊆ A′ and |S| = |S ′ | ≥ n/2, the defect of (S, S ′ ) is less than 24/q. Set δ = 24/q, and let (V, V ′ ) be a balanced clique pair of size r in P . Set S = A − V and S ′ = A′ − V ′ . Note that |S| = |S| > n/2. Let (T, T ′ ) be a maximum matching in S∪S ′ . Then |T | = |T ′ | ≥ |S|−δ. Let Q be the subposet of P determined by the points in the matching together with the points in V ∪ V ′ . Then (T, T ′ ) is a maximal matching in Q. It follows from Lemma 2.2 that dim(Q) ≤ |T | ≤ n − r. We note that 2δ < 48/q < ǫ1 log(qn)/q. The removal of a point from a bipartite poset decreases dimension by at most 1, and Q is obtained from P by removing at most 2δ points. It follows that a.a.s., dim(P ) ≤ dim(Q) + 2δ ≤ (n − r) + ǫ1 log(qn)/q = n − (2 − ǫ) log(qn)/q. Readers will note that the proof of the next result uses the “alteration” method (see Chapter 3 in Alon and Spencer [1]) ﬁrst used by Erdős in his probabilistic proof of the existence of graphs with large girth and large chromatic number. Lemma 3.7 (New Lower Bound (1)). Suppose 0 < ǫ < 1. If n−1 log2 n ≤ q ≤ n−4/5 , then a.a.s., dim(P ) > n − (2 + ǫ) log(qn)/q. Proof. Set ǫ1 = ǫ/2 and r = (2 + ǫ1 ) log(qn)/q. Using Lemma 3.3, we know that a.a.s., the balanced clique number of P is less than r. Let X be the random variable counting the number of balanced independent pairs (U, U ′ ) with |U | = |U ′ | = 2. Using Lemma 3.2, we know that a.a.s., X ∼ E[X]. 2 Since E[X] = n2 q 4 ∼ q 4 n4 /4, we will settle for the weaker inequality a.a.s., X ≤ n4 q 4 /2. When q ≤ n−4/5 , this implies that a.a.s., X < ǫ1 log(qn)/q. It follows that there are subsets S ⊂ A and S ′ ⊂ A′ with |S| = |S ′ | = n − ǫ1 q −1 log(qn) such that a.a.s., the subposet Q of P with ground set S ∪ S ′ has balanced clique number less than r and balanced independence number less than 2 Then a.a.s., dim(P ) ≥ dim(Q) > n − ǫ1 log(qn)/q − (2 + ǫ1 ) log(qn)/q = n − (2 + ǫ) log(qn)/q. 14 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG 4. Lower Bounds and an Application of Janson’s Inequality In this section, we prove New Lower Bounds (2), (3) and (4). All three proofs require concepts developed in [5], starting with a good bound on the expected value of the balanced independence number. The following elementary lemma uses only Markov’s inequality. Lemma 4.1. Suppose n−1 log2 n/n ≤ q ≤ 1/2 and 0 < ǫ < 1. Then a.a.s., bin(P ) < t := ⌈2[log n + log log(1/q)]/ log(1/q)⌉. Proof. Let Y count the number of balanced independent pairs of size t. Then 2 2 n E[Y ] = q t < exp(t(2 log n − t log(1/q))) = o(1). t With this lemma in mind, for the balance of this section, whenever the value of q is speciﬁed, we set: (10) t = ⌈(2 log n + log log n)/ log(1/q)⌉. Fix a value of q, with t then determined by (10). A short pair is a pair (σ, σ ′ ) where σ is a linear order on a (t − 1)-element subset of A, and σ ′ is a linear order on a (t − 1)-element subset of A′ . Let d ∈ [n] and let Σ = {(σj , σj′ ) : 1 ≤ j ≤ d} be a family of short pairs. With the family Σ ﬁxed, we make the following deﬁnitions. For a pair (a, a′ ) ∈ A × A′ and an integer j ∈ [d], (1) Event Rj (a, a′ ) holds if a ∈ σj and b kP a′ for all b ∈ σj with hj (b) < hj (a). (2) Event Rj′ (a, a′ ) holds if a′ ∈ σj′ and b′ kP a for all b′ ∈ σj′ with h′j (b′ ) < h′j (a′ ). We note that Rj (a, a′ ) holds whenever a is the highest element of σj . Also, Rj′ (a, a′ ) holds whenever a′ is the lowest element of σj′ . For a pair (a, a′ ) ∈ A × A′ , let (a < a′ ) be the event that holds when a <P a′ . Also, let (a k a′ ) be the event that holds when a kP a′ . Now set R(a, a′ ) = (a < a′ ) ∨ ∨j∈[d] Rj (a, a′ ) ∨ ∨j∈[d] Rj′ (a, a′ ) . We say that Σ realizes the pair (a, a′ ) when R(a, a′ ) holds In turn set R(P ) = ∧(a,a′ )∈A×A′ R(a, a′ ). We say Σ is a short realizer for P when R(P ) holds. Then we deﬁne the short dimension of P , denoted sdim(P ), as the least positive integer d such that there is a family Σ = {(σj , σj′ ) : j ∈ [d]} of short pairs such that Σ is a short realizer of P . We observe that a.a.s., sdim(P ) ≤ dim(P ). To see this, let F = {L1 , . . . , Ld } be a realizer of P . Then for each j ∈ [d], let σj be the linear order consisting of the highest t − 1 elements of A in Lj . Also, let σj′ be the linear order consisting of the lowest t − 1 elements of A′ in Lj . Since a.a.s., the balanced independence number of P is less than t, it follows that a.a.s., Σ = {(σj , σj′ ) : j ∈ [d]} is a short realizer for P . Accordingly, a lower bound on sdim(P ) is also a lower bound on dim(P ). The next step in the argument for all three lower bounds is to ﬁx a short family Σ = {(σj , σj′ ) : 1 ≤ j ≤ d} and consider the event R(P ) that holds when Σ is a short realizer for P . We will determine a reasonably accurate upper bound p0 on P R(P ) . The number of short families is less than n2(t−1)d and d ≤ n, so we can say that a.a.s., sdim(P ) > d if n2tn p0 = e2tn log n p0 = o(1). RANDOM POSETS 15 With Σ ﬁxed, let T consist of those elements a ∈ A such that there is at least one j ∈ [d] with a the highest element of A in σj . Then set M = A − T . Analogously, let T ′ consist of those elements a′ ∈ A′ such that there is at least one j ∈ [d] with a′ the lowest element of σj′ . Then set M ′ = A′ − T ′ . Also, set s = t − 2. We note that R(a, a′ ) holds whenever a ∈ T or a′ ∈ T ′ . Accordingly, R(P ) = ∧(x,x′ )∈M×M ′ R(x, x′ ). Our next goal will be to determine a bound on d that forces P(R(P )) to be exponentially small. Some additional notation and terminology is required. We describe this notation in full detail for M . The notation for M ′ is dual. When j ∈ [d], x ∈ M , and x ∈ σj , we let hj (x) count the number of elements y ∈ σj with y higher than x in σj . By convention, we set hj (x) = ∞ if x 6∈ σj . It is natural to view the quantity hj (x) as the height of x in σj . For an integer i ∈ [s], we then let µi (x) count the number of j ∈ [d] with hj (x/W ) = i. We view µi (x) as the multiplicity of x for height i. Then deﬁne the quantity w(x) by setting w(x) = s X µi (x)21−i . i=1 We view the quantity w(x) as the weight of x. Note that X w(x) < 2d. x∈M Since d ≤ n, there is a subset M0 ⊂ M with |M0 | = m/2 such that w(x) < 4d/m for every x ∈ M0 . The preceding discussion is followed in a dual manner to determine a subset M0′ ⊂ M ′ with |M0′ | = m/2 so that w(x′ ) < 4d/m for every x′ ∈ M0′ . Set I = M0 × M0′ . In the analysis to follow, we will need the following elementary fact. It is stated formally, as we will need it again in the following section. Proposition 4.2 (Weight-Shift). If 0 ≤ q ≤ 1/2 and i ∈ N , then 1 − q i < (1 − q i+1 )2 . For an event E in a probability space Ω, we use the notation E to denote the event that holds when E fails. Let (x, x′ ) ∈ I. We consider the events in the family {Rj (x, x′ ) : j ∈ [d]}. As explained in [5], these events are positively correlated, i.e., when j and k are distinct integers in [d], P(Rj (x, x′ )|Rk (x, x′ )) ≥ P(Rj (x, x′ ))). With the convention that 1 − q ∞ = 1, it follows that: Y (1 − q hj (x) ) Using correlation. P(∧j∈[d] Rj (x, x′ ) ≥ j∈[d] ≥ Y i∈[s] (1 − q i )µi (x) ≥ (1 − q)w(x) 4n/m ≥ (1 − q) Deﬁnition of multiplicity. Using Proposition 4.2. . Analogously, we have: ′ P ∧j∈[d] Rj (x, x′ ) ≥ (1 − q)4n/m . 16 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG It follows that (11) P R(x, x′ )) ≤ 1 − q(1 − q)8n/m . 4.1. Applying the Janson Inequality. We will use the Janson inequality in the proofs of New Lower Bounds (2) and (3). Here is the set up for this result, following (essentially) the presentation in Chapter 8 of Alon and Spencer [1]. The text [10] by Janson, Luczak and Rucinski is cited for the proof. Let I be a ﬁnite set and let {Fi : i ∈ I} be a ﬁnite family of events in a probability space Ω. When i and j are distinct elements of I, we write Fi ∼ Fj when Fi and Fj are dependent. Also, we set X ∆= {P(Fi ∧ Fj ) : (i, j) ∈ I × I, Fi ∼ Fj }, and µ= X P(Fi ). i∈I Here is the statement of the Janson inequality we will apply. Theorem 4.3 (Janson Inequality). Let {Fi : i ∈ I} be a finite family of events with P(Fi ) ≤ 1/2 for all i ∈ I. If ∆ ≤ µ, then P ∧i∈I Fi ≤ exp(−µ/2). Lemma 4.4 (New Lower Bounds (2) and (3)). If n−4/5 ≤ q ≤ 18 n−1/4 log3 n, then a.a.s., ( n − 32(n log n/q)1/2 if n−4/5 ≤ q ≤ (16)1/3 n−1/3 log1/3 n. dim(P ) > n − 8qn if (16)1/3 n−1/3 log1/3 n ≤ q ≤ 18 n−1/4 log3 n. Proof. For this range, we note that t ≤ 9. For every (x, x′ ) ∈ I, we have an event F (x, x′ ) that holds when Σ fails to realize the pair (x, x′ ). Note that for F (x, x′ ) to hold, we need x kP x′ , so P(F (x, x′ )) ≤ q ≤ 1/2. We also observe that P(F (x, x′ )) ≥ q(1 − q)8n/m . Set m = qnz where z ≥ 8. In general, we will have z → ∞, but the restriction z ≥ 8 is enough to imply that (1 − q)8n/m ≥ 1/4. It follows that µ, the exected number of pairs that fail, is at least m2 q/16. When x, y ∈ M0 and x′ , y ′ ∈ M0′ , we observe that F (x, x′ ) ∼ F (y, y ′ ) if and only if |{x, x′ } ∩ {y, y ′ }| = 1. Furthermore, when F (x, x′ ) ∼ F (y, y ′ ), and event F (x, x′ ) ∧ F (y, y ′ ) holds, we must have x kP x′ and y kP y ′ . These two events are independent and each has probability q. It follows that P F (x, x′ ) ∧ F (y, y ′ ) ≤ q 2 . There are are 2(m/2)(m/2) such pairs so ∆ ≤ m2 q 2 /2. To apply the Janson inequality, we need ∆ ≤ µ, but this simply requires q ≤ 1/8. We conclude that P R(Σ) ≤ P ∧(x,x′ )∈I F (x, x′ ) ≤ exp(−µ/2) ≤ exp(−m2 q/32). Recall that the number of short families is less than n2tn ≤ exp(18n log n), since t ≤ 9. Noting that 18 · 32 = 576 < 210 , can conclude that a.a.s., dim(P ) > d if 210 n log n ≤ m2 q. Since m = qnz, this becomes 210 log n ≤ q 3 nz 2 . This requires 1/2 (12) z ≥ 32 log n/(q 3 n) . RANDOM POSETS 17 When z = 8, this inequality holds when q ≥ (16)1/3 n−1/3 log1/3 n, and this completes the proof of New Lower Bound (3). Now we assume that q ≤ (16)1/3 n−1/3 log1/3 n. Now we treat inequality (12) 1/2 as an equation, i.e., we set z = 32 log n/(q 3 n) . The equation m = qnz is equivalent to m = 32(n log n/q)1/2 , and with this observation, the proof of New Upper Bound (2) is complete. 4.2. A Family of Independent Events. To obtain a proof of New Lower Bound (4), we simply update the original argument in [5] as given on pages 262–268. We have elected not to repeat the details of this argument. Instead, we will provide only an outline of the steps to be taken, with notational changes made to agree with our treatment here (1) We identify a subset J of M × M ′ with |J | = m3 /(72ns2 ). (2) For each (x, x′ ) ∈ I, we determine an event E(x, x′ ) such that R(x, x′ ) ⊆ E(x, x′ ) and Pr(E(x, x′ )) ≥ q(1 − q)24n/m . (3) Events in the family {E(x, x′ ) : (x, x′ ) ∈ J } are independent. Since the events in J are independent, it follows that (13) i m3 h 24n/m 72ns2 P(R(P )) ≤ 1 − q(1 − q) . Inequality (13) provides an upper bound on p0 , the maximum value of P(R(P )). To show that a.a.s., dim(P ) > d = n − m, it suﬃces to require that: m3 /72ns2 = o(1) e2tn log n 1 − q(1 − q)24n/m In the range we consider, it will always be the case that q(1 − q)24n/m = o(1). With this restriction, the preceding inequality holds if: (14) n2 t3 log n = o q(1 − q)24n/m m3 . In the proof of the next lemma, we will refer to (14) as the “master inequality.” Lemma 4.5 (New Lower Bound (4)). If 81 n−1/4 log3 n ≤ q ≤ 1/2 and z = log n + 4 log q − 8 log log n, then a.a.s., dim(P ) ≥ n − 24qn/z. Proof. Set m = 24qn/z. The lower bound on q implies qn4 ≥ log11 n. It follows that z ≥ 3 log log n so that m = o(qn). With this value of m, we can safely approximate (1 − q)24n/m as e−z . Accordingly, the master inequality becomes n2 t3 log n = o(qe−z q 3 n3 /z 3 ), which is equivalent to t3 z 3 ez log n = o(q 4 n). We note that ez = qn4 / log8 n. So the master inequality holds if t3 z 3 log n = o(log8 n). However, t ≤ 3 log n. Furthermore, z < log n. It follows that t3 z 3 log n = O(log7 n), so that the master inequality holds. We observe that there is a threshold occurring when q ∼ n−1/4 . When q is below this threshold, the Janson inequality approach gives a better result, and when q is above this threshold, the original approach using a family of independent events is better. 18 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG 5. Generalized Latin Rectangles and the Euler Product Function In this section, we prove New Upper Bounds (2) and (3). The arguments require a “one-sided” reformulation of dimension, using the same approach (at least one half of it) taken in the last section. Let P ∈ B and let F = {σ1 , . . . , σd } be a nonempty family of linear orders such that for each j ∈ [d], σj is a linear order on a non-empty subset of A. Now there is no restriction on the size of these linear orders. For a pair (a, a′ ) ∈ A × A′ , we say that F realizes (a, a′ ) if either (1) a <P a′ or (2) there is some j ∈ [d] with a ∈ σj and b kP a′ for all b′ ∈ σj with hj (b) < hj (a). In turn, we say F is a one-sided realizer for P when F realizes (a, a′ ) for all pairs (a, a′ ) ∈ A × A′ . Clearly, dim(P ) is the least d ≥ 1 for which P has a one-sided realizer of size d. Our strategy for proving New Upper Bounds (1) and (2) will be to design a single candidate family F = {σ1 , . . . , σd } and show that a.a.s., this family is a one-sided realizer of a poset P . To implement this strategy, we must pause to establish a connection with a classic concept in combinatorics. 5.1. Generalized Latin Rectangles. Recall that when m and s are integers with 1 ≤ s ≤ m, an s × m array (matrix) R is called a latin rectangle when (1) each row of R is a permutation of the integers in [m], and (2) the entries in each column of R are distinct. As is well known, if 2 ≤ s ≤ m, an (s − 1) × m latin rectangle R can always be extended to an s × m latin rectangle by adding a new row. Now let (m, r, s) be a triple of positive integers. An s × (rm) array R of integers from [m] will be called an (m, r, s) − GLR (where GLR is an abbreviation for generalized latin rectangle) when the following conditions are met: (1) In each row of R, each integer in [m] occurs exactly r times. (2) In each column C of R, the s integers occuring in column C are distinct. (3) For each distinct pair i, j ∈ {1, 2, . . . , m}, there is at most one column C in R for which i is below j in column C. Note that when r = 1, the third requirement is not part of the traditional deﬁnition for a latin rectangle. However, it will be soon be clear why we want this additional restriction in place. Here is an example of a (9, 2, 3)-GLR.  1 1 8 9 3 6 2 2 9 1 4 7 3 3 1 2 5 8 4 4 2 3 6 9 5 3 7 5 6 4 4 1 8 6 7 5 5 2 9 7 8 6 6 3 1 8 9 7 7 4 2  9 8 5 The reader may note that it is impossible to extend this array to a (9, 2, 4) − GLR. More generally, we have the following natural extremal problem: For a pair (m, r) of positive integers, ﬁnd the the largest integer s = f (m, r) for which there is an (m, r, s) − GLR. Trivially, f (m, r) ≥ 1. Lemma 5.1. Let m, r, s be positive integers with s ≥ 2. If f (m, r) ≥ s, then rs(s − 1) ≤ 2(m − 1). Proof. Suppose that R is an (m, r, s) − GLR. There are rm columns in R and for each column C in R, there are s(s − 1)/2 ordered pairs (i, j) where i is below j in column C. The last two conditions in the deﬁnition of an (m, r, s)-GLR force rms(s − 1)/2 ≤ m(m − 1), so that rs(s − 1) ≤ 2(m − 1). RANDOM POSETS 19 Lower bounds on this extremal problem are more challenging, but we will give an explicit construction which is suﬃcient for our purposes. If R is an (m, r, s) − GLR, we say R is resolvable if it consists of r latin rectangles placed side by side, i.e., each row partitions into r blocks of consecutive elements and each block is a permutation of [m]. The following lemma is an elementary extension of the classical result for latin rectangles, and we only outline the proof, leaving the details as an exercise for students. Lemma 5.2. Let m, r and s be positive integers with s ≥ 2. If m > 2rs3 , then there is an (m, r, s) − GLR. Proof. Consider the problem of adding a last row to a resolvable (m, r, s−1)−GLR. Proceeding block by block, we have a balanced (m, m) bipartite graph G with positions 1 through m on one side of G and sets of allowable choices for each of the m positions on the other side. Clearly, the most challenging case in completing the last row is the last block. Consider one of the m columns in the last block, and let x be one of the s − 1 integers that already occurs in this column. Then x is over (r − 1)s(s − 1)/2 other integers in the ﬁrst r − 1 blocks, and x is over (s − 1)(s − 2)/2 other integers in the last block. When x is the lowest element in the column, then x itself is not allowable. It follows that the number of allowable choices is: (15) m − (r − 1)s(s − 1)2 /2 − (s − 1)2 (s − 2)/2 − 1 Note that the inequality 2rs3 < m implies that the quantity in inequality (15) is at least m/2. A parallel argument shows that each of the integers in [m] belongs to at least m/2 of the sets of allowable choices. It is an immediate consequence of Hall’s theorem that a balanced bipartite graph G with 2m vertices and minimum degree δ(G) ≥ m/2 has a complete matching. Futhermore, a matching in G provides a legal way to complete the last row. Let m, r, s be integers, and let R be an (m, r, s) − GLR. We set d = mr and n = d + m. We ﬁx an arbitrary d-element subset T = {a1 , . . . , ad } of A and set M = A−T = {x1 , . . . , xm }. We construct a family F = {σ1 , . . . , σd } of linear orders on (s + 1)-element subsets of A as follows. For each j ∈ [d], we set hj (aj ) = 0. Then for each pair (i, j) ∈ [s] × [d], we set hj (xα ) = i when the integer in row i and column j of R is α. When (a, a′ ) ∈ A × A′ , it is obvious that F realizes (a, a′ ) if a ∈ T . Let X be the random variable counting the number of pairs (a, a′ ) ∈ A × A′ for which F fails to realize (a, a′ ). Since a must belong to M , the expected value of X is given by: (16) E[X] = nmq s Y i=1 Qs i r (1 − q i ) . Note that the Q expression i=1 (1 − q ) is a partial product of the Euler product ∞ function φ(q) = i=1 (1 − q i ). Using the weight-shift propososition 4.2 from the preceding section, it follows that (1 − q s ) s Y i=1 (1 − q i ) < φ(q) < s Y (1 − q i ). i=1 20 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG Lemma 5.3 (New Upper Bound (3)). If n−1/3 ≤ q ≤ 1/2 and z = n2 q log(1/φ(q)), then a.a.s., dim(P ) < n − n log(1/φ(q))/ log z. Proof. We will only consider values of m = n − d with m ≥ qn/ log10 n. With this restriction r = d/m = n/m − 1 ≤ log11 n/q. The requirement for the existence of a (m, r, s) − GLR is m > 2rs3 . With the restrictions on q and m, the requirement is met when s = n1/10 . For such a large value of s, we are safe if we estimate Q s i i=1 (1 − q ) by φ(q). Accordingly, we can conclude that a.a.s., dim(P ) ≤ d if r (17) E[X] = nmq φ(q) → 0. We note that φ(q) < 1, r = d/n = n/m − 1 and p = 1 − q ≤ 1/2. It follows that inequality (17) is equivalent to: (18) nmq = o en log(1/φ(q))/m . With z set at n2 q log(1/φ(q), we note that z and log z tend to inﬁnity with n. Now set m = n log(1/φ(q))/ log z. Since elog z = z = n2 q log(1/φ(q)), inequality (18) is equivalent to: (19) n2 q log(1/φ(q)) = o n2 q log(1/φ(q)) log z . Clearly, this last inequality is satisﬁed. We are reasonably conﬁdent that New Upper Bound (3) is asymptotically correct. Lemma 5.4 (New Upper Bound (2)). If n−1/2 log n < q ≤ n−1/3 , then a.a.s., dim(P ) < n − qn/(2 log(qn)). Proof. We note that for all pairs (m, r), there is a (m, r, 1) − GLR. Of course, this simply means that we put every element of M in second position d/m = r times. Setting s = 1 in equation (16), we can conclude that a.a.s., dim(P ) ≤ d if (20) nmq(1 − q)d/m → 0. Again, we note that (1 − q)d/m = (1 − q)n/m /(1 − q) ≥ (1 − q)n−m /2. Now set m = qn/(2 log(qn)). Noting that 2 log(qn) → ∞, we have (1 − q)n/m = e−qn/m = e2 log(qn) = n2 q 2 . Therefore, inequality (20) is equivalent to: (21) n2 q 2 = o log(qn)n2 q 2 . This last equation holds since qn ≥ log2 n so that log(qn) → ∞. 6. Applications to the Extremal Problems We now return to Question 1 and the problem of ﬁnding the correct exponent on the function sa(c). Previously, we reported that we had been able to use the asymmetric form of the Lovász local lemma to raise the lower bound on the exponent of c in sa(c) from 4/3 to 3/2. This unpublished result was presented at several conferences and seminars and was proved by using the local lemma to ﬁnd a rare poset with independence number less than 2 and only moderately large standard example number. Lemma 2.3 was then used to determine the dimension of such a poset. However, our new bounds allow us to obtain a simple proof of this same improvement. Consider the value q = n−1/3 . If a bipartite poset P ∈ B(n) contains RANDOM POSETS 21 the standard example Sd , then its clique size is at least d/2. Since the clique size of P ∈ Ω(n, p) is a.a.s., less than 3q −1 log(qn) = 2n1/3 log n, it follows that a.a.s., se(P ) < 4n1/3 log n. On the other hand, with q = n−1/3 , we know that a.a.s., dim(P ) ≥ n − 32 n log n/q)1/2 = n − 32n2/3 log1/2 n. Setting c = 32n2/3 log1/2 n, the upper bound on se(P ) forces f (c) = Ω c3/2 / log3/4 c . 6.1. Progress on Question 2 for Posets. We begin with the following elementary result, for which we only outline the proof. Ironically, when applied it will be for p < 1/2. Lemma 6.1. Suppose n−1 log2 n ≤ q ≤ 1/2. Then a.a.s., A and A′ can be matched. Proof. Clearly, it is enough to prove the lemma when q = n−1 log2 n. The basic idea is to show that a.a.s., Hall’s matching condition is satisﬁed. For a subset W ⊂ A, let N (W ) consist of all elements of A′ that are incomparable with at least one element of W . We want to show that a.a.s., |N (W )| ≥ |W | for every subset W ⊆ A. First, we take care of the case when W is very small or very large. Set r = log n/5. Consider the events E1 that holds if there is some a ∈ A incomparable with fewer than r elements of A′ . Dually, event E2 holds if there is some element a′ ∈ A′ incomparable with fewer than r elements of A. Simple counting shows that P(Ei ) = o(1) for i = 1, 2. Now consider the event F that holds if there is some set W with r ≤ |W | ≤ n − r such that |(N (W )| < |W |. Then we show that P(F ) = o(1). Readers will note that this part of the proof is very similar to the proof (which is provided) of Lemma 3.1. Once we have shown that P(E1 ) = P(E2 ) = o(1) and P(F ) = o(1), the proof is complete. Recall that f (d, n) is the maximum value of dim(P ) among all posets on n points with standard example size less than d. Theorem 6.2. For all d ≥ 3, 2d−1 n1− d(d−1) . f (d, n) ≥ 8 log n Proof. Fix a value of d ≥ 3. We work in the space Ω(n, p) with 2d−1 p = n− d(d−1) . (22) We note that p < 1/2. Now set ǫ= 2d(d − 1) −2 2d − 1 and t = 2 log n/p. We note that ǫ > 0. Using Lemma 6.1, we know that a.a.s., the sets A and A′ can be matched. Furthermore, the following claim is just Lemma 3.3 stated in complementary form. Claim 1. For every ǫ with 0 < ǫ < 1, a.a.s., bin(P ) < (2 + ǫ) log(pn)/p. Therefore, a.a.s., bin(P ) < (2 + ǫ) log(pn)/p = (2 + ǫ) (2d − 1) log n = 2 log n/p = t. (d − 1)p Next, we need the following technical claim. 22 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG Claim 2. If P is a poset in B(n) and bin(P ) < t, then P does not contain a bipartite subposet Q = V ∪ V ′ with |V | = |V ′ | = 2t satisfying the following condition: The elements of V and V ′ can be labeled as V = {v1 , . . . , v2t } and ′ V ′ = {v1′ , . . . , v2t } such that there is a linear extension L of Q with vi > vi′ in L for each i ∈ [2t]. Proof. We argue by contradiction. After a relabeling, we may assume that v1 > · · · > v2t in L. However, this implies that vi > vj′ in L whenever 1 ≤ i ≤ t and t + 1 ≤ j ≤ 2t, which implies that bin(P ) ≥ t. Next, we let X be the random variable that counts the number of copies of the standard example Sd in P . Then the expected value of X is given by 2 n n2d d2 −d p = n/6. E[X] = d!(1 − p)d pd(d−1) < 6 d Let E be the event that occurs when X > n/4. Then P(E) < 2/3. It follows that there is a poset P ∈ Ω(n, p) such that (1) A and A′ can be matched; (2) bin(P ) < t; and (3) the number of copies of the standard example Sd in P is at most n/4. Let A = {a1 , a2 , . . . , an } and A′ = {a′1 , a′2 , . . . , a′n } be labelings that evidence a matching between A and A′ . Without loss of generality, we may assume that any copy of Sd contained in P (there are at most n/4 of them) contains some point in {ai : n/2 < i ≤ n} ∪ {a′i : n/2 < i ≤ n}. Hence there are no copies of Sd in the bipartite subposet Q = B ∪ B ′ where B = {ai : 1 ≤ i ≤ n/2} and B ′ = {a′i : 1 ≤ i ≤ n/2}. Note that we have |Q| = n. Now let F be any family of linear extensions of Q which is a realizer of P . Then F must reverse the pairs in {(ai , a′i ) : 1 ≤ i ≤ n/2}. However, in view of the claim, no linear extension can reverse 2t of these pairs. we conclude that 2d−1 pn n1− d(d−1) n = = . dim(P ) ≥ dim(Q) ≥ 4t 8 log n 8 log n 7. Some Comments on Open Problems We view the problem of determining the expected value of dim(P ) for posets in Ω(n, p) when n−4/5 < q ≤ n−1/3 to be a real challenge. We suspect that our New Upper Bounds are near the truth, but we cannot rule out the possibility that for almost all P , if the short dimension of P is d as evidenced by a short realizer Σ of size d, then P Σ(R) is very small. Second, although we believe we know the expected value of n − dim(P ) to within a 1+o(1) multiplicative factor when 1/2 ≤ p < 1−n−1/3 as speciﬁed by New Upper Bound (1), the challenge is that there are other constructions besides generalized latin rectangles that achieve the same bound. For the ﬁrst extremal problem, stability analysis, we continue to think it likely that the correct exponent for c in the function sa(c) is 2. As we have noted, this would be veriﬁed if the upper bound on dim(P ) when q = n−1/2 is correct to within a poly-log multiplicative factor on n − dim(P ). Alternatively, one could revisit the proof given in [3] and try to lower the exponent in the inequality sa(c) = O(c2 ). Success in this eﬀort would of course imply that our upper bounds on dim(P ) are not as good as we think. RANDOM POSETS 23 For the second extremal problem, it would be very interesting to show that for each d ≥ 3, there is a constant cd , with 0 < cd < 1, such that f (n, d) < ncd , although it is not clear that such a constant exists, even when d = 3. 8. Acknowledgment The authors would like to thank Noga Alon, Alan Frieze, and Tomasz Luczak for very helpful communications concerning second moment methods, the concept of defect, and Talagrand’s inequality. As noted previously, the heart of the proof of Lemma 3.5 was provided by Luczak in a personal communication. References [1] N. Alon and J. Spencer, The Probabilistic Method, 4th Edition, Wiley Series in Discrete Mathematics and Optimization, 2016. [2] C. Biró, P. Hamburger and A. Pór, Standard examples as subposets of posets, Order 32 (2015) 293–299. doi:10.1007/s11083-014-9331-y. [3] C. Biró, P. Hamburger, A. Pór and W. T. Trotter, Forcing posets with large dimension to contain large standard examples, Graphs and Combinatorics 32 (2016) 861–880. doi:10.1007/s00373-015-1624-4. [4] B. Dushnik and E. W. Miller, Partially ordered sets, Amer. J. Math. 63 (1941), 600–610. doi:10.2307/2371374. [5] P. Erdös, H. Kierstead and W. T. Trotter, The dimension of random ordered sets, Random Structures and Algorithms 2 (1991), 253–275. doi:10.1002/rsa.3240020302. [6] P. Erdős and L. Lovász, Problems and results on 3-chromatic hypergraphs and some related problems, in Finite and Infinte Sets, A. Hajnal, R. Rado and V. T. Sós, eds., North Holland, 1975, 609–625. web.cs.elte.hu/ lovasz/scans/LocalLem.pdf. [7] A. M. Frieze, On the independence number of random graphs, Discrete Math. 81 (1990), 171–175. doi:10.1016/0012-365X(90)90149-C. [8] Z. Füredi and J. Kahn, On the dimension of ordered sets of bounded degree, Order 3 (1986), 15–20. doi:10.1007/BF00403406. [9] T. Hiraguchi, On the dimension of orders, Science Reports of the Kanazawa University 4 (1955), 1–20. scirep.w3.kanazawa-u.ac.jp/articles/04-01-001.pdf. [10] S. Janson, T. Luczak and A. Rucinski, Random Graphs, Wiley, New York, N.Y., 2000. [11] R. J. Kimble, Extremal Problems in Dimension Theory for Partially Ordered Sets, Ph.D. thesis, Massachusetts Institute of Technology, 1973. dspace.mit.edu/bitstream/handle/1721.1/82903/30083917-MIT.pdf. [12] A. Scott and D. Wood, Better bounds for poset dimension and boxicity, Transactions Amer. Math. Soc., to appear. Available on the arXiv at 1804.03271. [13] W. T. Trotter, Problems and conjectures in the combinatorial theory of ordered sets, Annals Discrete Math. 41 (1989), 401–416. doi:10.1016/S0167-5060(08)70478-3 [14] W. T. Trotter, Dimension for posets and chromatic number for graphs, in 50 Years of Combinatorics, Graph Theory and Computing, Chung et al., eds., to appear. 24 BIRÓ, HAMBURGER, KIERSTEAD, PÓR, TROTTER, AND WANG Department of Mathematics, University of Louisville, Louisville, Kentucky 40292 E-mail address: csaba.biro@louisville.edu Purdue University, IN, E-mail address: hamburge@pfw.edu School of Mathematical and Statistical Sciences, Arizona State University, Tempe, Arizona 85287 E-mail address: kierstead@asu.edu Department of Mathematics, Western Kentucky University, Bowling Green, Kentucky 42101 E-mail address: attila.por@wku.edu School of Mathematics, Georgia Institute of Technology, Atlanta, Georgia 30332 E-mail address: trotter@math.gatech.edu School of Mathematics, Georgia Institute of Technology, Atlanta, Georgia 30332 E-mail address: rwang49@math.gatech.edu

Log In

Random bipartite posets and extremal problems

Related papers

Related topics