backgroundcolor=, basicstyle=

A General Upper Bound for the Runtime of a Coevolutionary Algorithm on Impartial Combinatorial Games

Alistair Benford  and Per Kristian Lehre School of Computer Science, University of Birmingham, Birmingham, B15 2TT, UK. [email protected] of Computer Science, University of Birmingham, Birmingham, B15 2TT, UK. [email protected]
This research was supported by a Turing AI Fellowship (EPSRC grant ref EP/V025562/1).
Abstract

Due to their complex dynamics, combinatorial games are a key test case and application for algorithms that train game playing agents. Among those algorithms that train using self-play are coevolutionary algorithms (CoEAs). However, the successful application of CoEAs for game playing is difficult due to pathological behaviours such as cycling, an issue especially critical for games with intransitive payoff landscapes.

Insight into how to design CoEAs to avoid such behaviours can be provided by runtime analysis. In this paper, we push the scope of runtime analysis for CoEAs to combinatorial games, proving a general upper bound for the number of simulated games needed for UMDA to discover (with high probability) an optimal strategy. This result applies to any impartial combinatorial game, and for many games the implied bound is polynomial or quasipolynomial as a function of the number of game positions. After proving the main result, we provide several applications to simple well-known games: Nim, Chomp, Silver Dollar, and Turning Turtles. As the first runtime analysis for CoEAs on combinatorial games, this result is a critical step towards a comprehensive theoretical framework for coevolution.

1 Introduction

Many of the most well-known games in the world are combinatorial games. Combinatorial games are typically perfect-information games played by two players without chance moves. The game has a finite number of possible positions, and players alternately take turns moving the game from one position to another, according to a set of rules describing which moves are legal. Combinatorial games are an exceptionally broad class of games, including famous games enjoyed the world over such as Chess or Go. Even those with simple rules can engender deep and complex strategic interactions between players. While this strategic depth is a key part of the appeal for human players, it can also render the task of computing a winning strategy to be extremely difficult. Indeed, games for which this task is known to be EXPTIME-complete (in terms of board size) include Chess [20], Go (with the ko rule) [53], and Checkers [54]. It is also known that determining an optimal strategy for a poset game (a class of combinatorial games which we will encounter in Section 6.4) is PSPACE-complete in terms of the size of the underlying poset [24]. (For further results, see [10].)

While classical methods are impractical for such cases, strong strategies can still be developed by using heuristic approaches, such as neural networks, Monte Carlo tree search, or genetic programming. Indeed, combinatorial games are a long-standing focus in the development of artificial intelligence, from Donald Michie’s seminal use of reinforcement learning on Tic-Tac-Toe [45], to Deep Blue’s famous matches against then-world Chess champion Garry Kasparov [9], to the recent groundbreaking results of DeepMind [60]. Many recent successes in this area train game playing agents using self-play, and among those self-play heuristics are coevolutionary algorithms (CoEAs) [51]. For a CoEA, self-play is realised through one or more evolving populations of individuals who compete against their contemporaries. In each iteration, the strongest individuals are selected based on their competitive interactions. Through genetic mutation and crossover, these strongest individuals are then used as parents for the individuals in the next iteration.

The successful application of CoEAs is deeply challenging, often due to the potential for games with intransitive payoff landscapes to induce cyclic behaviour [17]. For standard evolutionary algorithms, which apply similar methods to traditional optimisation problems, insight into how to avoid pathological behaviours can be provided by runtime analysis, which exists in great breadth and depth in literature and continues to be actively developed [12]. However, despite clear demand (see [51]), runtime analysis that addresses the challenges unique to CoEAs is far more limited. Indeed, while existing coevolutionary runtime analysis concerns a range of algorithms and design features, there are only three problem settings to which it so far applies: Bilinear, a game played on bitstrings whose outcome depends only on the number of 1111-bits selected by each player [40, 31, 32]; Diagonal, a benchmark problem inspired by binary test-based optimisation [42]; and a class of symmetric zero-sum games with a payoff landscape that is globally very simple, but possibly locally intransitive [2]. Accordingly, our core research aim is to push the scope of runtime analysis for CoEAs towards games which feature more complex strategic interaction between players, and more closely reflect real-world games.

Motivated by the numerous empirical investigations into the topic (see Section 1.1), we focus our analysis on the use of CoEAs for combinatorial games, and in particular impartial combinatorial games. A combinatorial game is said to be impartial if both players share the same set of available moves at each game position [26]. It is common to also adopt the normal play convention, which assumes that a player loses if they have no legal moves available. For instance, consider SubtractionNim72superscriptsubscriptSubtractionNim72\textsc{SubtractionNim}_{7}^{2}SubtractionNim start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (formally introduced in Section 6.1), in which the game positions are {0,1,2,3,4,5,6}0123456\{0,1,2,3,4,5,6\}{ 0 , 1 , 2 , 3 , 4 , 5 , 6 } and a player must subtract either 1111 or 2222 from the position on their turn. A strategy may be encoded as a string of length 6666, with entry i𝑖iitalic_i indicating whether a 1111 or a 2222 is to be subtracted when the game position is i𝑖iitalic_i. One way the game may play out is then: Player 1 :122111:absent122111\displaystyle:\texttt{122111}: 122111 Player 2 :122122:absent122122\displaystyle:\texttt{122122}: 122122 Turn 1: P1 subtracts 1. The new position is 5. Turn 2: P2 subtracts 2. The new position is 3. Turn 3: P1 subtracts 2. The new position is 1. Turn 4: P2 subtracts 1. The new position is 0. Turn 5: P1 has no legal moves. P2 is the winner.

(In fact, any strategy of the form 12*12* will always win this game, provided the corresponding player does not move first.)

The main result of this paper (Theorem 5.2 and Corollary 5.5) is the first runtime analysis for a coevolutionary algorithm on impartial combinatorial games. In broad terms, it says the following.

Theorem 1.1 (Corollary 5.5, informal version).

Let 𝒜𝒜\mathcal{A}caligraphic_A be the coevolutionary algorithm specified in Section 3, and let G𝐺Gitalic_G be an impartial combinatorial game with n𝑛nitalic_n possible positions. Then, with high probability, 𝒜𝒜\mathcal{A}caligraphic_A discovers an optimal strategy for G𝐺Gitalic_G within nO(s¯)superscript𝑛𝑂¯𝑠n^{O(\overline{s})}italic_n start_POSTSUPERSCRIPT italic_O ( over¯ start_ARG italic_s end_ARG ) end_POSTSUPERSCRIPT game evaluations, where s¯¯𝑠\overline{s}over¯ start_ARG italic_s end_ARG is a precisely defined invariant of the corresponding game graph.

We note that the notion of a game graph is defined in Section 2 and the invariant s¯¯𝑠\overline{s}over¯ start_ARG italic_s end_ARG is defined in Section 4. For many games we find s¯=O(1)¯𝑠𝑂1\overline{s}=O(1)over¯ start_ARG italic_s end_ARG = italic_O ( 1 ) or s¯=O(logn)¯𝑠𝑂𝑛\overline{s}=O(\log{n})over¯ start_ARG italic_s end_ARG = italic_O ( roman_log italic_n ), and so this result implies a range of polynomial and quasipolynomial runtimes. While it appears likely that the upper bound provided is higher than the true runtime for specific games, a major strength is that it is immediately applicable to any impartial combinatorial game. As we also provide an easy method for bounding s¯¯𝑠\overline{s}over¯ start_ARG italic_s end_ARG above when its exact value is not obvious (see Proposition 4.4), deriving runtimes for well-known games is straightforward. Indeed, after distilling into a more concise form (Corollary 5.5), we will see applications to games including Nim, Silver Dollar, Turning Turtles, and Chomp.

To understand what is the significance of our result, it is helpful to first clarify what it is not. In no uncertain terms, this paper is not an account of a superior ready-to-use method for efficiently finding optimal strategies for combinatorial games. Strategies will here be encoded by exhaustively listing a preferred action for every possible game position, and thus the methods presented are necessarily at least linear in the number of game states, both in terms of memory and of time. With this naive representation, classical algorithms can already establish optimal strategies in time O(n)𝑂𝑛O(n)italic_O ( italic_n ) using Sprague-Grundy theory (see Section 2.1), which is best possible. However, many games are parameterised in such a way that the number of possible game positions grows exponentially (accordingly, we emphasise that Theorem 1.1 is not in contradiction with the aforementioned EXPTIME and PSPACE results). While the classical approach breaks down in such cases, a CoEA can still find success by replacing the exhaustive listing of actions with a model that maps features of the game position onto an action.

However, even when using the naive representation, our understanding of how to successfully apply CoEAs is very limited. If we wish to consistently apply CoEAs to advanced problems, whether they are rooted in game-playing or not, we must attain a comprehensive understanding of their behaviour in these simpler settings. Indeed, seemingly simple instances still produce payoff landscapes with features that make them difficult to optimise heuristically, such as intransitivity (as an example, in the already-introduced representation for SubtractionNim72superscriptsubscriptSubtractionNim72\textsc{SubtractionNim}_{7}^{2}SubtractionNim start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, 111121 defeats 122112, which in turn defeats 12122, which in turn defeats 111121, regardless of who plays first).

Thus, the main contribution of this paper is precisely this: a first step towards a theoretical understanding of CoEAs on combinatorial games. This greatly expands the scope of rigorous runtime analysis available for coevolution (which so far does not apply to any turn-based game, let alone combinatorial ones), and additionally complements the abundance of existing empirical analysis, which we review in Section 1.1. While it remains a long term goal to push analysis towards more sophisticated representations, insights into algorithm design gained here still hold great relevance to coevolution in general. Furthermore, we believe our addition to the range of techniques available in this critical domain will in turn further the development of future runtime analysis of CoEAs.

Finally, we note here that the algorithm we analyse is a type of coevolutionary algorithm called an estimation of distribution algorithm (EDA), and moreover that this EDA applies to multi-valued decision variables (these notions are covered in Section 3). While not the main focus of this paper, there is only a small amount of preexisting analysis for EDAs operating over non-binary search domains, despite the clear utility of such algorithms. Our proof includes a detailed treatment of this setting, and may also provide methods useful in future analysis in this area.

In the remainder of this section, we review existing related work before stating notation. In Section 2 we give a more comprehensive discussion of impartial combinatorial games and review some Sprague-Grundy theory that will be relevant to our proof. In Section 3 we state the algorithm to which our result applies (UMDA), with an emphasis on its extension to multi-valued decision variables. In Section 4 we motivate and define the graph property s¯¯𝑠\overline{s}over¯ start_ARG italic_s end_ARG appearing in Theorem 1.1, before then presenting the main result in Section 5. Following this, we apply the main result to a menagerie of selected impartial combinatorial games in Section 6.

1.1 Related work

Empirical analysis of coevolutionary algorithms for game playing. As game playing is a natural application for CoEAs, there have been a large number of empirical investigations into this topic, of which we can only list a small fraction here. In terms of impartial combinatorial games, Rosin and Belew [55] investigated the effect of using features such as fitness sharing and archives in CoEAs optimising a 4-pile instance of Nim, noting that Nim was a difficult coevolutionary problem despite lending itself to simple crossover-friendly representations. Additionally, Jákowski, Krawiec, and Wieloch [35] observed in relation to experiments on SubtractionNim2003superscriptsubscriptSubtractionNim2003\textsc{SubtractionNim}_{200}^{3}SubtractionNim start_POSTSUBSCRIPT 200 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT that intransitivity presents a strong challenge for CoEAs. Non-impartial (yet still almost symmetric) combinatorial games studied in the context of coevolution include Tic-Tac-Toe [35, 55], Backgammon [50], Othello [36, 63, 64], Senet [16], Checkers [5], Chess [18, 30], and Go [43]. More general game-playing applications include Pong [46], Bomberman [23], Poker [48], Resistance [39], as well as games invented to emulate real-world applications such as cyber security and defense [28, 41]. For a general survey, see [38].

Runtime analysis of coevolutionary algorithms. Until recently, the only existing coevolutionary runtime analysis result, due to Jansen and Wiegand [34], applied to a cooperative coevolutionary algorithm, which uses multiple populations to collectively solve traditional optimisation problems. The first runtime analysis applicable to competitive coevolution was established by Lehre [40], who showed that a population-based CoEA which selects using a pairwise dominance relation is able to approximate the Nash equilibrium of instances of a game called Bilinear in expected polynomial time. A key theoretical insight into algorithm design from the same paper was the identification of an error threshold for mutation rate, above which no CoEA can efficiently optimise Bilinear. Further runtime analysis for CoEAs on Bilinear has concerned the roles played by fitness aggregation methods [31] and archives [32] in algorithm behaviour. Inspired by promising applications of CoEAs for optimising binary test-based problems, Lin and Lehre [42] provided runtime analysis establishing the benefit of using a CoEA over a traditional EA for optimising a benchmark problem called Diagonal. In [2], Benford and Lehre considered the importance of maintaining a diverse set of opponents when coevolving game strategies, showing that any CoEA able to retain only one individual between generations cannot efficiently find optimal strategies on a certain class of symmetric zero-sum games, even though with high probability a coevolutionary EDA finds an optimal strategy in polynomial time.

1.2 Notation

Given a finite set S𝑆Sitalic_S, a probability distribution over S𝑆Sitalic_S is a function p:S[0,1]:𝑝𝑆01p:S\to[0,1]italic_p : italic_S → [ 0 , 1 ] satisfying sSp(s)=1subscript𝑠𝑆𝑝𝑠1\sum_{s\in S}p(s)=1∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_p ( italic_s ) = 1. We say that an S𝑆Sitalic_S-valued random variable x𝑥xitalic_x is distributed according to p𝑝pitalic_p, written xpsimilar-to𝑥𝑝x\sim pitalic_x ∼ italic_p, if (x=s)=p(s)𝑥𝑠𝑝𝑠\mathbb{P}(x=s)=p(s)blackboard_P ( italic_x = italic_s ) = italic_p ( italic_s ) holds for every sS𝑠𝑆s\in Sitalic_s ∈ italic_S. Given also a subset AS𝐴𝑆A\subseteq Sitalic_A ⊆ italic_S, we write p(A)=sAp(s)𝑝𝐴subscript𝑠𝐴𝑝𝑠p(A)=\sum_{s\in A}p(s)italic_p ( italic_A ) = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_A end_POSTSUBSCRIPT italic_p ( italic_s ). Given a number γ[0,1]𝛾01\gamma\in[0,1]italic_γ ∈ [ 0 , 1 ] we use 𝒫γ(S)subscript𝒫𝛾𝑆\mathcal{P}_{\gamma}(S)caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_S ) to denote the set of probability distributions p𝑝pitalic_p over S𝑆Sitalic_S satisfying p(s)γ𝑝𝑠𝛾p(s)\geqslant\gammaitalic_p ( italic_s ) ⩾ italic_γ for every sS𝑠𝑆s\in Sitalic_s ∈ italic_S, and we also write 𝒫(S)=𝒫0(S)𝒫𝑆subscript𝒫0𝑆\mathcal{P}(S)=\mathcal{P}_{0}(S)caligraphic_P ( italic_S ) = caligraphic_P start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_S ).

A rooted directed graph is a triple G=(V,F,v0)𝐺𝑉𝐹subscript𝑣0G=(V,F,v_{0})italic_G = ( italic_V , italic_F , italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), were V𝑉Vitalic_V is a vertex set, F𝐹Fitalic_F is a function mapping each vertex onto its out-neighbourhood, and v0Vsubscript𝑣0𝑉v_{0}\in Vitalic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_V is a distinguished root vertex. Throughout we will assume all directed graphs are acyclic. We write E(G)={(u,v)V2:vF(u)}𝐸𝐺conditional-set𝑢𝑣superscript𝑉2𝑣𝐹𝑢E(G)=\{(u,v)\in V^{2}:v\in F(u)\}italic_E ( italic_G ) = { ( italic_u , italic_v ) ∈ italic_V start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT : italic_v ∈ italic_F ( italic_u ) } for the set of edges of G𝐺Gitalic_G and Δ=maxvV|F(v)|Δsubscript𝑣𝑉𝐹𝑣\Delta=\max_{v\in V}|F(v)|roman_Δ = roman_max start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT | italic_F ( italic_v ) | for the maximum degree of G𝐺Gitalic_G. A directed path in G𝐺Gitalic_G is a sequence of vertices u0u1usubscript𝑢0subscript𝑢1subscript𝑢u_{0}u_{1}\ldots u_{\ell}italic_u start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT … italic_u start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT such that uiF(ui1)subscript𝑢𝑖𝐹subscript𝑢𝑖1u_{i}\in F(u_{i-1})italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT ) for each i[]𝑖delimited-[]i\in[\ell]italic_i ∈ [ roman_ℓ ]. For a path P=u0u1u𝑃subscript𝑢0subscript𝑢1subscript𝑢P=u_{0}u_{1}\ldots u_{\ell}italic_P = italic_u start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT … italic_u start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT we have |P|=+1𝑃1|P|=\ell+1| italic_P | = roman_ℓ + 1. If vV𝑣𝑉v\in Vitalic_v ∈ italic_V has no out-neighbours, then we say v𝑣vitalic_v is a sink. We use Int(G)={vV:F(v)}Int𝐺conditional-set𝑣𝑉𝐹𝑣\text{Int}(G)=\{v\in V:F(v)\neq\emptyset\}Int ( italic_G ) = { italic_v ∈ italic_V : italic_F ( italic_v ) ≠ ∅ } to denote the set of non-sink vertices of G𝐺Gitalic_G (the interior vertices).

All logarithms are the natural logarithm unless stated otherwise, and given k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N we write logkn=(logn)ksuperscript𝑘𝑛superscript𝑛𝑘\log^{k}{n}=(\log{n})^{k}roman_log start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_n = ( roman_log italic_n ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT.

2 Impartial combinatorial games

Let us briefly review the representation of impartial games via directed graphs and some Sprague-Grundy theory (see, for example, [27, 47]). An impartial combinatorial game is a finite acyclic rooted directed graph G=(V,F,v0)𝐺𝑉𝐹subscript𝑣0G=(V,F,v_{0})italic_G = ( italic_V , italic_F , italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) (see Section 1.2), where V𝑉Vitalic_V is a vertex set of size n𝑛nitalic_n, and v0Vsubscript𝑣0𝑉v_{0}\in Vitalic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_V is the initial game position. Players take it in turns to move the current position to one of its out-neighbours. We adopt the convention that if a player is unable to make a move because the current position has no out-neighbours (i.e., it is a sink), then that player loses. This is usually referred as the normal play convention. We will also always assume that for each vV𝑣𝑉v\in Vitalic_v ∈ italic_V, there is a directed path from v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT to v𝑣vitalic_v, so that every game position is reachable.

We will encode strategies for impartial combinatorial games as an assignment of each non-sink game position v𝑣vitalic_v to an element of F(v)𝐹𝑣F(v)italic_F ( italic_v ) (that is, an out-neighbour of v𝑣vitalic_v), with this assignment indicating the preferred move at each game position. Formally, recalling that Int(G)Int𝐺\text{Int}(G)Int ( italic_G ) denotes the set of vV𝑣𝑉v\in Vitalic_v ∈ italic_V with F(v)𝐹𝑣F(v)\neq\emptysetitalic_F ( italic_v ) ≠ ∅, then

𝒳G=vInt(G)F(v)subscript𝒳𝐺subscriptproduct𝑣Int𝐺𝐹𝑣\mathcal{X}_{G}=\prod_{v\in\text{Int}(G)}F(v)caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = ∏ start_POSTSUBSCRIPT italic_v ∈ Int ( italic_G ) end_POSTSUBSCRIPT italic_F ( italic_v )

will be the set of strategies for G𝐺Gitalic_G. Note that an element x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT may be regarded as a mapping Int(G)VInt𝐺𝑉\text{{Int}}(G)\to VInt ( italic_G ) → italic_V, and so we will write x(v)𝑥𝑣x(v)italic_x ( italic_v ) for the image of a position vV𝑣𝑉v\in Vitalic_v ∈ italic_V under this mapping. This formulation coincides closely with that featured in the aforementioned work of Richie on reinforcement learning for optimal Tic-Tac-Toe play [45], and has similarities to subsequent ‘move selector’ representations which identify a preferred action based on the current game position using, for example, genetic programming [23], neural networks [43, 46], or a game-specific mapping [48, 39]. However, it stands distinct from ‘state evaluator’ representations which play by evaluating board positions, whether by recording evaluations for all possible positions [35, 55], genetic programming [29, 16, 30], neural networks [50, 64, 5, 18], or otherwise.

As is typical for the uses of coevolution for gameplaying discussed in Section 1.1, players receive a payoff depending only on whether the final outcome of the game was win or lose. Accordingly, let fG:𝒳G×𝒳G{1,1}:subscript𝑓𝐺subscript𝒳𝐺subscript𝒳𝐺11f_{G}:\mathcal{X}_{G}\times\mathcal{X}_{G}\to\{-1,1\}italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT : caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT × caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT → { - 1 , 1 } be the payoff function for G𝐺Gitalic_G, where fG(x,y)=1subscript𝑓𝐺𝑥𝑦1f_{G}(x,y)=1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = 1 indicates that x𝑥xitalic_x wins against y𝑦yitalic_y and fG(x,y)=1subscript𝑓𝐺𝑥𝑦1f_{G}(x,y)=-1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = - 1 indicates that x𝑥xitalic_x loses against y𝑦yitalic_y (where x𝑥xitalic_x makes the first move). Precisely, if we recursively define for vV𝑣𝑉v\in Vitalic_v ∈ italic_V,

fGv(x,y)={fGx(v)(y,x)if vInt(G),1otherwise,superscriptsubscript𝑓𝐺𝑣𝑥𝑦casessuperscriptsubscript𝑓𝐺𝑥𝑣𝑦𝑥if vInt(G),1otherwise,f_{G}^{v}(x,y)=\begin{cases}-f_{G}^{x(v)}(y,x)&\qquad\text{if $v\in\text{Int}(% G)$,}\\ -1&\qquad\text{otherwise,}\end{cases}italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = { start_ROW start_CELL - italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x ( italic_v ) end_POSTSUPERSCRIPT ( italic_y , italic_x ) end_CELL start_CELL if italic_v ∈ Int ( italic_G ) , end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL otherwise, end_CELL end_ROW

then fG(x,y)=fGv0(x,y)subscript𝑓𝐺𝑥𝑦superscriptsubscript𝑓𝐺subscript𝑣0𝑥𝑦f_{G}(x,y)=f_{G}^{v_{0}}(x,y)italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ). It will also be convenient to define for x,y𝒳G𝑥𝑦subscript𝒳𝐺x,y\in\mathcal{X}_{G}italic_x , italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT,

PathG(x,y)={v0,x(v0),y(x(v0)),x(y(x(v0))),}.subscriptPath𝐺𝑥𝑦subscript𝑣0𝑥subscript𝑣0𝑦𝑥subscript𝑣0𝑥𝑦𝑥subscript𝑣0\text{Path}_{G}(x,y)=\{v_{0},x(v_{0}),y(x(v_{0})),x(y(x(v_{0}))),\ldots\}.Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = { italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_x ( italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , italic_y ( italic_x ( italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) , italic_x ( italic_y ( italic_x ( italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ) , … } .

We will always assume that there is some x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT such that fG(x,y)=1subscript𝑓𝐺𝑥𝑦1f_{G}(x,y)=1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = 1 for every y𝒳G𝑦subscript𝒳𝐺y\in\mathcal{X}_{G}italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT (i.e., that the first player has a winning strategy for G𝐺Gitalic_G). Indeed, if this is not the case, then the second player has a winning strategy, and so we can add a fictitious initial position vsuperscript𝑣v^{\ast}italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT to G𝐺Gitalic_G with F(v)={v0}𝐹superscript𝑣subscript𝑣0F(v^{\ast})=\{v_{0}\}italic_F ( italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = { italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } to obtain a game equally challenging as G𝐺Gitalic_G but with a winning strategy for the first player. We thus define the set of optimal strategies for G𝐺Gitalic_G to be

Opt(G)={x𝒳G:fG(x,y)=1 for every y𝒳},Opt𝐺conditional-set𝑥subscript𝒳𝐺fG(x,y)=1 for every y𝒳\text{Opt}(G)=\{x\in\mathcal{X}_{G}:\text{$f_{G}(x,y)=1$ for every $y\in% \mathcal{X}$}\},Opt ( italic_G ) = { italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT : italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = 1 for every italic_y ∈ caligraphic_X } ,

and remark that the above assumption implies that Opt(G)Opt𝐺\text{Opt}(G)Opt ( italic_G ) will always be non-empty.

2.1 The Sprague-Grundy function

First introduced independently by Sprague [61, 62] and Grundy [25], the Sprague-Grundy function of an impartial combinatorial game is a function mapping game positions onto non-negative integers, which contains information about the game’s strategic landscape and how optimal play is affected when building new games out of smaller ones [19]. Formally, given G=(V,F,v0)𝐺𝑉𝐹subscript𝑣0G=(V,F,v_{0})italic_G = ( italic_V , italic_F , italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), the Sprague-Grundy function h:V0:𝑉subscript0h:V\to\mathbb{N}_{0}italic_h : italic_V → blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is defined recursively. First, all sink vertices are given the value 00. Then, once all out-neighbours of v𝑣vitalic_v have a value assigned, we define

h(v)=mex{h(w):wF(v)},𝑣mexconditional-set𝑤𝑤𝐹𝑣h(v)=\text{{mex}}\,{\{h(w):w\in F(v)\}},italic_h ( italic_v ) = mex { italic_h ( italic_w ) : italic_w ∈ italic_F ( italic_v ) } ,

where mexS=min(0S)mex𝑆subscript0𝑆\text{{mex}}\,{S}=\min{(\mathbb{N}_{0}\setminus S)}mex italic_S = roman_min ( blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∖ italic_S ) denotes the smallest non-negative number not in a finite set S𝑆Sitalic_S (the ‘minimum excluded integer’).

Given vV𝑣𝑉v\in Vitalic_v ∈ italic_V, if the current position is v𝑣vitalic_v then the player making the next move has a winning strategy if and only if h(v)0𝑣0h(v)\neq 0italic_h ( italic_v ) ≠ 0. Accordingly, if h(v)=0𝑣0h(v)=0italic_h ( italic_v ) = 0 then the player making the next move will always lose against an opponent who plays optimally. Thus, victory can be assured for the player making the first move by always choosing to move to vertices in h1({0})superscript10h^{-1}(\{0\})italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( { 0 } ). Because this happens automatically whenever F(v)h1({0})=𝐹𝑣superscript10F(v)\setminus h^{-1}(\{0\})=\emptysetitalic_F ( italic_v ) ∖ italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( { 0 } ) = ∅, an optimal strategy can be guaranteed by learning optimal moves at a set WGsubscript𝑊𝐺W_{G}italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT (which we refer to as critical positions) defined in the following way.

Definition 2.1.

Given an impartial combinatorial game G=(V,F,v0)𝐺𝑉𝐹subscript𝑣0G=(V,F,v_{0})italic_G = ( italic_V , italic_F , italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), let

WG={vInt(G):h(v)0 and F(v)h1({0})},subscript𝑊𝐺conditional-set𝑣Int𝐺h(v)0 and F(v)h1({0})W_{G}=\{v\in\text{\emph{Int}}(G):\text{$h(v)\neq 0$ and $F(v)\setminus h^{-1}(% \{0\})\neq\emptyset$}\},italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = { italic_v ∈ Int ( italic_G ) : italic_h ( italic_v ) ≠ 0 and italic_F ( italic_v ) ∖ italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( { 0 } ) ≠ ∅ } ,

where h:V0:𝑉subscript0h:V\to\mathbb{N}_{0}italic_h : italic_V → blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the Sprague-Grundy function for G𝐺Gitalic_G.

The following lemma formalises this notion in a general form that will be useful to quote later.

Lemma 2.2.

Let h:V0:𝑉subscript0h:V\to\mathbb{N}_{0}italic_h : italic_V → blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denote the Sprague-Grundy function of a combinatorial game G𝐺Gitalic_G. Let u1,,unsubscript𝑢1subscript𝑢𝑛u_{1},\ldots,u_{n}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT be an ordering of V𝑉Vitalic_V such that F(ui){u1,,ui1}𝐹subscript𝑢𝑖subscript𝑢1subscript𝑢𝑖1F(u_{i})\subseteq\{u_{1},\ldots,u_{i-1}\}italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⊆ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } for every i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ]. Then, the following holds for every i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ].

  1. A1

    If h(ui)0subscript𝑢𝑖0h(u_{i})\neq 0italic_h ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≠ 0 and x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT satisfies h(x(v))=0𝑥𝑣0h(x(v))=0italic_h ( italic_x ( italic_v ) ) = 0 for every vWG{u1,,ui}𝑣subscript𝑊𝐺subscript𝑢1subscript𝑢𝑖v\in W_{G}\cap\{u_{1},\ldots,u_{i}\}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }, then fGui(x,y)=1superscriptsubscript𝑓𝐺subscript𝑢𝑖𝑥𝑦1f_{G}^{u_{i}}(x,y)=1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 holds for every y𝒳G𝑦subscript𝒳𝐺y\in\mathcal{X}_{G}italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT.

  2. A2

    If h(ui)=0subscript𝑢𝑖0h(u_{i})=0italic_h ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = 0 and y𝒳G𝑦subscript𝒳𝐺y\in\mathcal{X}_{G}italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT satisfies h(y(v))=0𝑦𝑣0h(y(v))=0italic_h ( italic_y ( italic_v ) ) = 0 for every vWG{u1,,ui}𝑣subscript𝑊𝐺subscript𝑢1subscript𝑢𝑖v\in W_{G}\cap\{u_{1},\ldots,u_{i}\}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }, then fGui(x,y)=1superscriptsubscript𝑓𝐺subscript𝑢𝑖𝑥𝑦1f_{G}^{u_{i}}(x,y)=-1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 holds for every x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT.

In particular, with our assumption that the first player always has a winning strategy for G𝐺Gitalic_G, if x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT satisfies h(x(v))=0𝑥𝑣0h(x(v))=0italic_h ( italic_x ( italic_v ) ) = 0 for every vWG𝑣subscript𝑊𝐺v\in W_{G}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, then xOpt(G)𝑥Opt𝐺x\in\text{\emph{Opt}}(G)italic_x ∈ Opt ( italic_G ).

Proof.

We prove that the conditions A1 andA2 always hold by induction on i𝑖iitalic_i. For the case i=1𝑖1i=1italic_i = 1, note that we must have h(u1)=0subscript𝑢10h(u_{1})=0italic_h ( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = 0 (as uiInt(G)subscript𝑢𝑖Int𝐺u_{i}\notin\text{Int}(G)italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∉ Int ( italic_G )) and fGu1(x,y)=1superscriptsubscript𝑓𝐺subscript𝑢1𝑥𝑦1f_{G}^{u_{1}}(x,y)=-1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 for any x,y𝒳G𝑥𝑦subscript𝒳𝐺x,y\in\mathcal{X}_{G}italic_x , italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT. For the inductive stage, there are two cases to consider. First, if h(ui)=0subscript𝑢𝑖0h(u_{i})=0italic_h ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = 0 and y𝒳G𝑦subscript𝒳𝐺y\in\mathcal{X}_{G}italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT satisfies h(y(v))=0𝑦𝑣0h(y(v))=0italic_h ( italic_y ( italic_v ) ) = 0 for every vWG{u1,,ui}𝑣subscript𝑊𝐺subscript𝑢1subscript𝑢𝑖v\in W_{G}\cap\{u_{1},\ldots,u_{i}\}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }, then because mex{h(w):wF(ui)}=0mexconditional-set𝑤𝑤𝐹subscript𝑢𝑖0\text{{mex}}\,{\{h(w):w\in F(u_{i})\}}=0mex { italic_h ( italic_w ) : italic_w ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) } = 0 we must have h(x(ui))0𝑥subscript𝑢𝑖0h(x(u_{i}))\neq 0italic_h ( italic_x ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ≠ 0 for any x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, and hence

fGui(x,y)=fGx(ui)(y,x)=A11.superscriptsubscript𝑓𝐺subscript𝑢𝑖𝑥𝑦superscriptsubscript𝑓𝐺𝑥subscript𝑢𝑖𝑦𝑥A11f_{G}^{u_{i}}(x,y)=-f_{G}^{x(u_{i})}(y,x)\overset{\emph{\ref{oc1}}}{=}-1.italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y , italic_x ) overOVERACCENT start_ARG = end_ARG - 1 .

On the other hand, if h(ui)0subscript𝑢𝑖0h(u_{i})\neq 0italic_h ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≠ 0 and x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT satisfies h(x(v))=0𝑥𝑣0h(x(v))=0italic_h ( italic_x ( italic_v ) ) = 0 for every vWG{u1,,ui}𝑣subscript𝑊𝐺subscript𝑢1subscript𝑢𝑖v\in W_{G}\cap\{u_{1},\ldots,u_{i}\}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }, then in fact h(x(ui))=0𝑥subscript𝑢𝑖0h(x(u_{i}))=0italic_h ( italic_x ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) = 0 (for this holds by default if uiWGsubscript𝑢𝑖subscript𝑊𝐺u_{i}\notin W_{G}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∉ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT), and so for any y𝒳G𝑦subscript𝒳𝐺y\in\mathcal{X}_{G}italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT,

fGui(x,y)=fGx(ui)(y,x)=A21,superscriptsubscript𝑓𝐺subscript𝑢𝑖𝑥𝑦superscriptsubscript𝑓𝐺𝑥subscript𝑢𝑖𝑦𝑥A21f_{G}^{u_{i}}(x,y)=-f_{G}^{x(u_{i})}(y,x)\overset{\emph{\ref{oc2}}}{=}1,italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y , italic_x ) overOVERACCENT start_ARG = end_ARG 1 ,

as required. ∎

Note that the final conclusion of Lemma 2.2 is a sufficient condition, but not a necessary condition, as demonstrated by Figure 1.

v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTa𝑎aitalic_ab𝑏bitalic_bc𝑐citalic_cd𝑑ditalic_d10210
Figure 1: In the combinatorial game illustrated above, Sprague-Grundy values at each game position are shown in red. In this game, WG={v0,b}subscript𝑊𝐺subscript𝑣0𝑏W_{G}=\{v_{0},b\}italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = { italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b }. However, any strategy x𝑥xitalic_x with x(v0)=d𝑥subscript𝑣0𝑑x(v_{0})=ditalic_x ( italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = italic_d is automatically optimal (the first player wins on their first turn), and so the condition of Lemma 2.2 is not a necessary one.

3 UMDA

Rather than storing a population as a set of points in the search space, as is the case for most EAs, an estimation of distribution algorithm (EDA) represents its population as a probability distribution over the search space [49]. Whereas most algorithms sample candidates for selection from their current population uniformly at random, an EDA instead samples from its probability distribution. After selection has been completed, the selected individuals are then used to update the probability distribution for the next generation. Much of the existing runtime analysis for EDAs (see [8, 11, 13, 65, 66]) has emphasised the benefit provided by a high level of diversity among generated search points. This is also the case in the recent first runtime analysis of a coevolutionary EDA [2], wherein the difficulty presented by locally intransitive payoff landscapes could be provably averted by evaluating strategies against a diverse set of opponents. As intransitivity is also apparent in impartial combinatorial games, a coevolutionary EDA is a good candidate for a first runtime analysis on this topic too.

Most existing theoretical analysis of EDAs concerns those operating over bitstrings – that is, {0,1}nsuperscript01𝑛\{0,1\}^{n}{ 0 , 1 } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is the search domain. However, as outlined in Section 2, our formulation of strategies gives rise to a more complicated search domain. For a parent set S𝑆Sitalic_S, we are considering search domains of the form 𝒳=iISi𝒳subscriptproduct𝑖𝐼subscript𝑆𝑖\mathcal{X}=\prod_{i\in I}S_{i}caligraphic_X = ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where I𝐼Iitalic_I is an indexing set and SiSsubscript𝑆𝑖𝑆S_{i}\subseteq Sitalic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ italic_S for each iI𝑖𝐼i\in Iitalic_i ∈ italic_I. Given a tuple piI𝒫(Si)𝑝subscriptproduct𝑖𝐼𝒫subscript𝑆𝑖p\in\prod_{i\in I}\mathcal{P}(S_{i})italic_p ∈ ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT caligraphic_P ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), let Univ(𝒳,p)Univ𝒳𝑝\text{Univ}(\mathcal{X},p)Univ ( caligraphic_X , italic_p ) denote the probability distribution over 𝒳𝒳\mathcal{X}caligraphic_X such that if xUniv(𝒳,p)similar-to𝑥Univ𝒳𝑝x\sim\text{Univ}(\mathcal{X},p)italic_x ∼ Univ ( caligraphic_X , italic_p ) then for any y𝒳𝑦𝒳y\in\mathcal{X}italic_y ∈ caligraphic_X,

(x=y)=iIp(i)(yi),𝑥𝑦subscriptproduct𝑖𝐼𝑝𝑖subscript𝑦𝑖\mathbb{P}(x=y)=\prod_{i\in I}p(i)(y_{i}),blackboard_P ( italic_x = italic_y ) = ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT italic_p ( italic_i ) ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ,

so that the distribution of x𝑥xitalic_x is that of an independent univariate sampling for each iI𝑖𝐼i\in Iitalic_i ∈ italic_I. For notational convenience, given a tuple piI𝒫(Si)𝑝subscriptproduct𝑖𝐼𝒫subscript𝑆𝑖p\in\prod_{i\in I}\mathcal{P}(S_{i})italic_p ∈ ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT caligraphic_P ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) we will often write for iI𝑖𝐼i\in Iitalic_i ∈ italic_I and sS𝑠𝑆s\in Sitalic_s ∈ italic_S,

p(i,s)={p(i)(s)if sSi,0otherwise.𝑝𝑖𝑠cases𝑝𝑖𝑠if sSi,0otherwise.p(i,s)=\begin{cases}p(i)(s)&\qquad\text{if $s\in S_{i}$,}\\ 0&\qquad\text{otherwise.}\end{cases}italic_p ( italic_i , italic_s ) = { start_ROW start_CELL italic_p ( italic_i ) ( italic_s ) end_CELL start_CELL if italic_s ∈ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL otherwise. end_CELL end_ROW

The coevolutionary EDA we consider will represent its current population as an element piI𝒫(Si)𝑝subscriptproduct𝑖𝐼𝒫subscript𝑆𝑖p\in\prod_{i\in I}\mathcal{P}(S_{i})italic_p ∈ ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT caligraphic_P ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), with individuals being generated according to Univ(𝒳,p)Univ𝒳𝑝\text{Univ}(\mathcal{X},p)Univ ( caligraphic_X , italic_p ). In the case where I=[n]𝐼delimited-[]𝑛I=[n]italic_I = [ italic_n ] and Si={0,1}subscript𝑆𝑖01S_{i}=\{0,1\}italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { 0 , 1 } for each i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], we recover the standard framework for univariate EDAs operating over bitstrings. For these EDAs, the tuple pi[n]𝒫({0,1})𝑝subscriptproduct𝑖delimited-[]𝑛𝒫01p\in\prod_{i\in[n]}\mathcal{P}(\{0,1\})italic_p ∈ ∏ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT caligraphic_P ( { 0 , 1 } ) is often represented as a frequency vector (p(1),,p(n))[0,1]n𝑝1𝑝𝑛superscript01𝑛(p(1),\ldots,p(n))\in[0,1]^{n}( italic_p ( 1 ) , … , italic_p ( italic_n ) ) ∈ [ 0 , 1 ] start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, where p(i)𝑝𝑖p(i)italic_p ( italic_i ) is the probability that xUniv({0,1}n,p)similar-to𝑥Univsuperscript01𝑛𝑝x\sim\text{Univ}(\{0,1\}^{n},p)italic_x ∼ Univ ( { 0 , 1 } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_p ) has a 1111-bit in position i𝑖iitalic_i. A common feature for EDAs operating over bitstrings is to constrain these frequencies to the interval [γ,1γ]𝛾1𝛾[\gamma,1-\gamma][ italic_γ , 1 - italic_γ ] for some small γ𝛾\gammaitalic_γ at the end of each generation. For the general case, where we track a tuple piI𝒫(Si)𝑝subscriptproduct𝑖𝐼𝒫subscript𝑆𝑖p\in\prod_{i\in I}\mathcal{P}(S_{i})italic_p ∈ ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT caligraphic_P ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), we need to constrain each p(i)𝒫(Si)𝑝𝑖𝒫subscript𝑆𝑖p(i)\in\mathcal{P}(S_{i})italic_p ( italic_i ) ∈ caligraphic_P ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) to the set 𝒫γ(Si)subscript𝒫𝛾subscript𝑆𝑖\mathcal{P}_{\gamma}(S_{i})caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). To achieve this, we adopt the following minor variation of the multi-valued EDA framework proposed by Ben Jedidia, Doerr, and Krejca [1]. Given γ[0,1|S|)𝛾01𝑆\gamma\in[0,\textstyle{\frac{1}{|S|}})italic_γ ∈ [ 0 , divide start_ARG 1 end_ARG start_ARG | italic_S | end_ARG ) and p𝒫(S)𝑝𝒫𝑆p\in\mathcal{P}(S)italic_p ∈ caligraphic_P ( italic_S ), let

βγ+(p)=sSmax{p(s)γ,0},βγ(p)=sSmax{γp(s),0},formulae-sequencesubscriptsuperscript𝛽𝛾𝑝subscript𝑠𝑆𝑝𝑠𝛾0subscriptsuperscript𝛽𝛾𝑝subscript𝑠𝑆𝛾𝑝𝑠0\beta^{+}_{\gamma}(p)=\sum_{s\in S}\max{\{p(s)-\gamma,0\}},\qquad\beta^{-}_{% \gamma}(p)=\sum_{s\in S}\max{\{\gamma-p(s),0\}},italic_β start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_p ) = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT roman_max { italic_p ( italic_s ) - italic_γ , 0 } , italic_β start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_p ) = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT roman_max { italic_γ - italic_p ( italic_s ) , 0 } ,

Let πγS:𝒫(S)𝒫γ(S):superscriptsubscript𝜋𝛾𝑆𝒫𝑆subscript𝒫𝛾𝑆\pi_{\gamma}^{S}:\mathcal{P}(S)\to\mathcal{P}_{\gamma}(S)italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT : caligraphic_P ( italic_S ) → caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_S ) then be the function given by

πγS(p)(s)={γif p(s)γ,γ+(1βγ(p)βγ+(p))(p(s)γ)if p(s)γ.superscriptsubscript𝜋𝛾𝑆𝑝𝑠cases𝛾if p(s)γ,𝛾1superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝𝑝𝑠𝛾if p(s)γ.\pi_{\gamma}^{S}(p)(s)=\begin{cases}\gamma&\qquad\text{if $p(s)\leqslant\gamma% $,}\\ \gamma+\left(1-\textstyle{\frac{\beta_{\gamma}^{-}(p)}{\beta_{\gamma}^{+}(p)}}% \right)(p(s)-\gamma)&\qquad\text{if $p(s)\geqslant\gamma$.}\end{cases}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) = { start_ROW start_CELL italic_γ end_CELL start_CELL if italic_p ( italic_s ) ⩽ italic_γ , end_CELL end_ROW start_ROW start_CELL italic_γ + ( 1 - divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) end_ARG ) ( italic_p ( italic_s ) - italic_γ ) end_CELL start_CELL if italic_p ( italic_s ) ⩾ italic_γ . end_CELL end_ROW

For the case |S|=2𝑆2|S|=2| italic_S | = 2 the definition reduces to πγS(p)(s)=min{max{p(s),γ},1γ}superscriptsubscript𝜋𝛾𝑆𝑝𝑠𝑝𝑠𝛾1𝛾\pi_{\gamma}^{S}(p)(s)=\min{\{\max{\{p(s),\gamma\}},1-\gamma\}}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) = roman_min { roman_max { italic_p ( italic_s ) , italic_γ } , 1 - italic_γ }, and so this model fits the usual method for constraining univariate EDAs over bitstrings.

Despite some differences in notation, the function πγSsuperscriptsubscript𝜋𝛾𝑆\pi_{\gamma}^{S}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT is nearly identical the restriction described in [1]. In the context of [1, Section 4.2], our only modification is to forego an initial clamping of probabilities to the interval [γ,1(|S|1)γ]𝛾1𝑆1𝛾[\gamma,1-(|S|-1)\gamma][ italic_γ , 1 - ( | italic_S | - 1 ) italic_γ ], as the upper border of 1(|S|1)γ1𝑆1𝛾1-(|S|-1)\gamma1 - ( | italic_S | - 1 ) italic_γ is already implied by the fact that the remaining steps produce an element of 𝒫γ(S)subscript𝒫𝛾𝑆\mathcal{P}_{\gamma}(S)caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_S ). Indeed, an actual difference between the two methods only arises for inputs p𝑝pitalic_p satisfying maxsSp(s)>1(|S|1)γsubscript𝑠𝑆𝑝𝑠1𝑆1𝛾\max_{s\in S}p(s)>1-(|S|-1)\gammaroman_max start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_p ( italic_s ) > 1 - ( | italic_S | - 1 ) italic_γ, and even in such cases the difference is not significant.

The fact that πγSsuperscriptsubscript𝜋𝛾𝑆\pi_{\gamma}^{S}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT always outputs an element of 𝒫γ(S)subscript𝒫𝛾𝑆\mathcal{P}_{\gamma}(S)caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_S ) is verified by B1 in the following lemma, which also establishes several further properties of πγsubscript𝜋𝛾\pi_{\gamma}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT which will be useful for our later proofs.

Lemma 3.1.

Let βγ+superscriptsubscript𝛽𝛾\beta_{\gamma}^{+}italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT, βγsuperscriptsubscript𝛽𝛾\beta_{\gamma}^{-}italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT, and πγSsuperscriptsubscript𝜋𝛾𝑆\pi_{\gamma}^{S}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT be as defined in Section 3. Then, the following properties hold.

  1. B1

    For any p𝒫(S)𝑝𝒫𝑆p\in\mathcal{P}(S)italic_p ∈ caligraphic_P ( italic_S ), sSπγS(p)(s)=1subscript𝑠𝑆superscriptsubscript𝜋𝛾𝑆𝑝𝑠1\sum_{s\in S}\pi_{\gamma}^{S}(p)(s)=1∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) = 1.

  2. B2

    If p(s)γ𝑝𝑠𝛾p(s)\geqslant\gammaitalic_p ( italic_s ) ⩾ italic_γ, then (1βγ(p)1γ|S|)p(s)πγS(p)(s)p(s)1superscriptsubscript𝛽𝛾𝑝1𝛾𝑆𝑝𝑠superscriptsubscript𝜋𝛾𝑆𝑝𝑠𝑝𝑠\left(1-\textstyle{\frac{\beta_{\gamma}^{-}(p)}{1-\gamma|S|}}\right)p(s)% \leqslant\pi_{\gamma}^{S}(p)(s)\leqslant p(s)( 1 - divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) end_ARG start_ARG 1 - italic_γ | italic_S | end_ARG ) italic_p ( italic_s ) ⩽ italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) ⩽ italic_p ( italic_s ).

  3. B3

    For any SiSsubscript𝑆𝑖𝑆S_{i}\subseteq Sitalic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ italic_S, p𝒫(Si)𝑝𝒫subscript𝑆𝑖p\in\mathcal{P}(S_{i})italic_p ∈ caligraphic_P ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and sS𝑠𝑆s\in Sitalic_s ∈ italic_S, πγSi(p)(s)max{γ,p(s)}superscriptsubscript𝜋𝛾subscript𝑆𝑖𝑝𝑠𝛾𝑝𝑠\pi_{\gamma}^{S_{i}}(p)(s)\leqslant\max{\{\gamma,p(s)\}}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) ⩽ roman_max { italic_γ , italic_p ( italic_s ) }.

  4. B4

    For any Si,ASsubscript𝑆𝑖𝐴𝑆S_{i},A\subseteq Sitalic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_A ⊆ italic_S and p𝒫(Si)𝑝𝒫subscript𝑆𝑖p\in\mathcal{P}(S_{i})italic_p ∈ caligraphic_P ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), πγSi(p)(A)p(A)+γ|Si|superscriptsubscript𝜋𝛾subscript𝑆𝑖𝑝𝐴𝑝𝐴𝛾subscript𝑆𝑖\pi_{\gamma}^{S_{i}}(p)(A)\leqslant p(A)+\gamma|S_{i}|italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_p ) ( italic_A ) ⩽ italic_p ( italic_A ) + italic_γ | italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT |.

Proof.

We first note that the definitions of βγ+superscriptsubscript𝛽𝛾\beta_{\gamma}^{+}italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT and βγsuperscriptsubscript𝛽𝛾\beta_{\gamma}^{-}italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT imply that for any γ[0,1/|S|)𝛾01𝑆\gamma\in[0,1/|S|)italic_γ ∈ [ 0 , 1 / | italic_S | ) and p𝒫(S)𝑝𝒫𝑆p\in\mathcal{P}(S)italic_p ∈ caligraphic_P ( italic_S ),

βγ+(p)βγ(p)superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝\displaystyle\beta_{\gamma}^{+}(p)-\beta_{\gamma}^{-}(p)italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) - italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) =sS(max{p(s)γ,0}max{γp(s),0})absentsubscript𝑠𝑆𝑝𝑠𝛾0𝛾𝑝𝑠0\displaystyle=\sum_{s\in S}(\max{\{p(s)-\gamma,0\}}-\max{\{\gamma-p(s),0\}})= ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT ( roman_max { italic_p ( italic_s ) - italic_γ , 0 } - roman_max { italic_γ - italic_p ( italic_s ) , 0 } )
=sS(max{p(s)γ,0}+min{p(s)γ,0})=sS(p(s)γ)=1γ|S|.absentsubscript𝑠𝑆𝑝𝑠𝛾0𝑝𝑠𝛾0subscript𝑠𝑆𝑝𝑠𝛾1𝛾𝑆\displaystyle=\sum_{s\in S}(\max{\{p(s)-\gamma,0\}}+\min{\{p(s)-\gamma,0\}})=% \sum_{s\in S}(p(s)-\gamma)=1-\gamma|S|.= ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT ( roman_max { italic_p ( italic_s ) - italic_γ , 0 } + roman_min { italic_p ( italic_s ) - italic_γ , 0 } ) = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT ( italic_p ( italic_s ) - italic_γ ) = 1 - italic_γ | italic_S | . (1)

Because 1γ|S|>01𝛾𝑆01-\gamma|S|>01 - italic_γ | italic_S | > 0 it immediately follows from (1) that

βγ(p)<βγ+(p).superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝\beta_{\gamma}^{-}(p)<\beta_{\gamma}^{+}(p).italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) < italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) . (2)

With these observations, we are now ready to prove the desired properties.

B1: If p𝒫(S)𝑝𝒫𝑆p\in\mathcal{P}(S)italic_p ∈ caligraphic_P ( italic_S ), then setting S+={sS:p(s)γ}superscript𝑆conditional-set𝑠𝑆𝑝𝑠𝛾S^{+}=\{s\in S:p(s)\geqslant\gamma\}italic_S start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT = { italic_s ∈ italic_S : italic_p ( italic_s ) ⩾ italic_γ } and S=SS+superscript𝑆𝑆superscript𝑆S^{-}=S\setminus S^{+}italic_S start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT = italic_S ∖ italic_S start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT we have

sSπγS(p)(s)=γ|S|+sS+(1βγ(p)βγ+(p))(p(s)γ)=γ|S|+(βγ+(p)βγ(p)βγ+(p))βγ+(p)=(1)1.subscript𝑠𝑆superscriptsubscript𝜋𝛾𝑆𝑝𝑠𝛾𝑆subscript𝑠superscript𝑆1superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝𝑝𝑠𝛾𝛾𝑆superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝italic-(1italic-)1\displaystyle\sum_{s\in S}\pi_{\gamma}^{S}(p)(s)=\gamma|S|+\sum_{s\in S^{+}}% \left(1-\textstyle{\frac{\beta_{\gamma}^{-}(p)}{\beta_{\gamma}^{+}(p)}}\right)% (p(s)-\gamma)=\gamma|S|+\left(\textstyle{\frac{\beta_{\gamma}^{+}(p)-\beta_{% \gamma}^{-}(p)}{\beta_{\gamma}^{+}(p)}}\right)\beta_{\gamma}^{+}(p)\overset{% \eqref{eq:beta3}}{=}1.∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) = italic_γ | italic_S | + ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( 1 - divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) end_ARG ) ( italic_p ( italic_s ) - italic_γ ) = italic_γ | italic_S | + ( divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) - italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) end_ARG ) italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG 1 .

B2: If p(s)γ𝑝𝑠𝛾p(s)\geqslant\gammaitalic_p ( italic_s ) ⩾ italic_γ, then by setting α=βγ(p)/βγ+(p)𝛼superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝\alpha=\beta_{\gamma}^{-}(p)/\beta_{\gamma}^{+}(p)italic_α = italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) / italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ),

(1βγ(p)1γ|S|)p(s)1superscriptsubscript𝛽𝛾𝑝1𝛾𝑆𝑝𝑠\displaystyle\left(1-\textstyle{\frac{\beta_{\gamma}^{-}(p)}{1-\gamma|S|}}% \right)p(s)( 1 - divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) end_ARG start_ARG 1 - italic_γ | italic_S | end_ARG ) italic_p ( italic_s ) (1)(1βγ(p)βγ+(p))p(s)=(1α)p(s)(1α)p(s)+αγ=γ+(1α)(p(s)γ)italic-(1italic-)1superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝𝑝𝑠1𝛼𝑝𝑠1𝛼𝑝𝑠𝛼𝛾𝛾1𝛼𝑝𝑠𝛾\displaystyle\overset{\eqref{eq:beta3}}{\leqslant}\left(1-\textstyle{\frac{% \beta_{\gamma}^{-}(p)}{\beta_{\gamma}^{+}(p)}}\right)p(s)=(1-\alpha)p(s)% \leqslant(1-\alpha)p(s)+\alpha\gamma=\gamma+(1-\alpha)(p(s)-\gamma)start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG ( 1 - divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) end_ARG ) italic_p ( italic_s ) = ( 1 - italic_α ) italic_p ( italic_s ) ⩽ ( 1 - italic_α ) italic_p ( italic_s ) + italic_α italic_γ = italic_γ + ( 1 - italic_α ) ( italic_p ( italic_s ) - italic_γ )
=πγS(p)(s)=γ+(1βγ(p)βγ+(p))(p(s)γ)(2)γ+(p(s)γ)=p(s),absentsuperscriptsubscript𝜋𝛾𝑆𝑝𝑠𝛾1superscriptsubscript𝛽𝛾𝑝superscriptsubscript𝛽𝛾𝑝𝑝𝑠𝛾italic-(2italic-)𝛾𝑝𝑠𝛾𝑝𝑠\displaystyle=\pi_{\gamma}^{S}(p)(s)=\gamma+\left(1-\textstyle{\frac{\beta_{% \gamma}^{-}(p)}{\beta_{\gamma}^{+}(p)}}\right)(p(s)-\gamma)\overset{\eqref{eq:% beta4}}{\leqslant}\gamma+(p(s)-\gamma)=p(s),= italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) = italic_γ + ( 1 - divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_p ) end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_p ) end_ARG ) ( italic_p ( italic_s ) - italic_γ ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG italic_γ + ( italic_p ( italic_s ) - italic_γ ) = italic_p ( italic_s ) ,

and so B2 holds.

B3: If p(s)γ𝑝𝑠𝛾p(s)\leqslant\gammaitalic_p ( italic_s ) ⩽ italic_γ then πγSi(p)(s)γ=max{γ,p(s)}superscriptsubscript𝜋𝛾subscript𝑆𝑖𝑝𝑠𝛾𝛾𝑝𝑠\pi_{\gamma}^{S_{i}}(p)(s)\leqslant\gamma=\max{\{\gamma,p(s)\}}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) ⩽ italic_γ = roman_max { italic_γ , italic_p ( italic_s ) }. On the hand, if p(s)γ𝑝𝑠𝛾p(s)\geqslant\gammaitalic_p ( italic_s ) ⩾ italic_γ, then B2 implies that πγSi(p)p(s)=max{γ,p(s)}superscriptsubscript𝜋𝛾subscript𝑆𝑖𝑝𝑝𝑠𝛾𝑝𝑠\pi_{\gamma}^{S_{i}}(p)\leqslant p(s)=\max{\{\gamma,p(s)\}}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_p ) ⩽ italic_p ( italic_s ) = roman_max { italic_γ , italic_p ( italic_s ) }. In either case, B3 holds.

B4: We can compute

πγSi(p)(A)superscriptsubscript𝜋𝛾subscript𝑆𝑖𝑝𝐴\displaystyle\pi_{\gamma}^{S_{i}}(p)(A)italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_p ) ( italic_A ) =πγSi(p)(ASi)=sASiπγSi(p)(s)B3sASimax{γ,p(s)}sASi(p(s)+γ)absentsuperscriptsubscript𝜋𝛾subscript𝑆𝑖𝑝𝐴subscript𝑆𝑖subscript𝑠𝐴subscript𝑆𝑖superscriptsubscript𝜋𝛾subscript𝑆𝑖𝑝𝑠B3subscript𝑠𝐴subscript𝑆𝑖𝛾𝑝𝑠subscript𝑠𝐴subscript𝑆𝑖𝑝𝑠𝛾\displaystyle=\pi_{\gamma}^{S_{i}}(p)(A\cap S_{i})=\sum_{s\in A\cap S_{i}}\pi_% {\gamma}^{S_{i}}(p)(s)\overset{\emph{\ref{pi-2}}}{\leqslant}\sum_{s\in A\cap S% _{i}}\max{\{\gamma,p(s)\}}\leqslant\sum_{s\in A\cap S_{i}}(p(s)+\gamma)= italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_p ) ( italic_A ∩ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_A ∩ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_p ) ( italic_s ) overOVERACCENT start_ARG ⩽ end_ARG ∑ start_POSTSUBSCRIPT italic_s ∈ italic_A ∩ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_max { italic_γ , italic_p ( italic_s ) } ⩽ ∑ start_POSTSUBSCRIPT italic_s ∈ italic_A ∩ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_p ( italic_s ) + italic_γ )
=p(A)+γ|ASi|p(A)+γ|Si|,absent𝑝𝐴𝛾𝐴subscript𝑆𝑖𝑝𝐴𝛾subscript𝑆𝑖\displaystyle=p(A)+\gamma|A\cap S_{i}|\leqslant p(A)+\gamma|S_{i}|,= italic_p ( italic_A ) + italic_γ | italic_A ∩ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ⩽ italic_p ( italic_A ) + italic_γ | italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ,

as required. ∎

Algorithm 1 UMDA with binary tournament selection
1:Search domain 𝒳=iISi𝒳subscriptproduct𝑖𝐼subscript𝑆𝑖\mathcal{X}=\prod_{i\in I}S_{i}caligraphic_X = ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.
2:Function f:𝒳×𝒳{1,1}:𝑓𝒳𝒳11f:\mathcal{X}\times\mathcal{X}\to\{-1,1\}italic_f : caligraphic_X × caligraphic_X → { - 1 , 1 }.
3:Algorithm parameters μ𝜇\mu\in\mathbb{N}italic_μ ∈ blackboard_N and γ>0𝛾0\gamma>0italic_γ > 0.
4:for iI𝑖𝐼i\in Iitalic_i ∈ italic_I do
5:     for sSi𝑠subscript𝑆𝑖s\in S_{i}italic_s ∈ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
6:         Set p0(i)(s)=1|Si|subscript𝑝0𝑖𝑠1subscript𝑆𝑖p_{0}(i)(s)=\frac{1}{|S_{i}|}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_i ) ( italic_s ) = divide start_ARG 1 end_ARG start_ARG | italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | end_ARG.
7:     end for
8:end for
9:for t𝑡t\in\mathbb{N}italic_t ∈ blackboard_N until termination criterion met do
10:     for j[μ]𝑗delimited-[]𝜇j\in[\mu]italic_j ∈ [ italic_μ ] do
11:         Sample xUniv(𝒳,pt)similar-to𝑥Univ𝒳subscript𝑝𝑡x\sim\text{Univ}(\mathcal{X},p_{t})italic_x ∼ Univ ( caligraphic_X , italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT )
12:         Sample yUniv(𝒳,pt)similar-to𝑦Univ𝒳subscript𝑝𝑡y\sim\text{Univ}(\mathcal{X},p_{t})italic_y ∼ Univ ( caligraphic_X , italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT )
13:         if f(x,y)=1𝑓𝑥𝑦1f(x,y)=1italic_f ( italic_x , italic_y ) = 1 then
14:              Set Pt+1(j)=xsubscript𝑃𝑡1𝑗𝑥P_{t+1}(j)=xitalic_P start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_j ) = italic_x
15:         else if f(x,y)=1𝑓𝑥𝑦1f(x,y)=-1italic_f ( italic_x , italic_y ) = - 1 then
16:              Set Pt+1(j)=ysubscript𝑃𝑡1𝑗𝑦P_{t+1}(j)=yitalic_P start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_j ) = italic_y
17:         end if
18:     end for
19:     for iI𝑖𝐼i\in Iitalic_i ∈ italic_I do
20:         for sSi𝑠subscript𝑆𝑖s\in S_{i}italic_s ∈ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
21:              Set qt+1(i)(s)=1μ|{j:Pt+1(j) has an s in position i}|subscript𝑞𝑡1𝑖𝑠1𝜇conditional-set𝑗Pt+1(j) has an s in position iq_{t+1}(i)(s)=\frac{1}{\mu}|\{j:\text{$P_{t+1}(j)$ has an $s$ in position $i$}\}|italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) ( italic_s ) = divide start_ARG 1 end_ARG start_ARG italic_μ end_ARG | { italic_j : italic_P start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_j ) has an italic_s in position italic_i } |
22:         end for
23:          Set pt+1(i)=πγSi(qt+1(i))subscript𝑝𝑡1𝑖superscriptsubscript𝜋𝛾subscript𝑆𝑖subscript𝑞𝑡1𝑖p_{t+1}(i)=\pi_{\gamma}^{S_{i}}(q_{t+1}(i))italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) = italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) )
24:     end for
25:end for

A description of the algorithm we analyse is now provided by Algorithm 1, which effectively generalises the version appearing in [2] (which applied only to bitstrings and omitted the step involving πγSisuperscriptsubscript𝜋𝛾subscript𝑆𝑖\pi_{\gamma}^{S_{i}}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT). Note that due to the use of πγSisuperscriptsubscript𝜋𝛾subscript𝑆𝑖\pi_{\gamma}^{S_{i}}italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT in line 23, we always have ptiI𝒫γ(Si)subscript𝑝𝑡subscriptproduct𝑖𝐼subscript𝒫𝛾subscript𝑆𝑖p_{t}\in\prod_{i\in I}\mathcal{P}_{\gamma}(S_{i})italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ).

A key step towards analysing the performance of Algorithm 1 on impartial combinatorial games is understanding the distribution of a selected individual Pt+1(j)subscript𝑃𝑡1𝑗P_{t+1}(j)italic_P start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_j ). This will be handled by the following lemma. Its conclusion gives an exact expression for how the probability a selected individual would choose to move from u𝑢uitalic_u to v𝑣vitalic_v compares to the probability a sampled individual would choose to move from u𝑢uitalic_u to v𝑣vitalic_v (where a selected individual is simply the winner of a game played between two independent sampled individuals).

Lemma 3.2.

Let G𝐺Gitalic_G be an impartial combinatorial game, and let pvInt(G)𝒫(F(v))𝑝subscriptproduct𝑣Int𝐺𝒫𝐹𝑣p\in\prod_{v\in\text{\emph{Int}}(G)}\mathcal{P}(F(v))italic_p ∈ ∏ start_POSTSUBSCRIPT italic_v ∈ Int ( italic_G ) end_POSTSUBSCRIPT caligraphic_P ( italic_F ( italic_v ) ). Suppose that x,yUniv(𝒳G,p)similar-to𝑥𝑦Univsubscript𝒳𝐺𝑝x,y\sim\text{\emph{Univ}}(\mathcal{X}_{G},p)italic_x , italic_y ∼ Univ ( caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_p ) are independent, and

z={xif fG(x,y)=1,yif fG(x,y)=1.𝑧cases𝑥if fG(x,y)=1,𝑦if fG(x,y)=1.z=\begin{cases}x&\qquad\text{if $f_{G}(x,y)=1$,}\\ y&\qquad\text{if $f_{G}(x,y)=-1$.}\end{cases}italic_z = { start_ROW start_CELL italic_x end_CELL start_CELL if italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = 1 , end_CELL end_ROW start_ROW start_CELL italic_y end_CELL start_CELL if italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = - 1 . end_CELL end_ROW

Then, for any uV𝑢𝑉u\in Vitalic_u ∈ italic_V and vF(u)𝑣𝐹𝑢v\in F(u)italic_v ∈ italic_F ( italic_u ),

(z(u)=v)=p(u,v)[1+(uPathG(x,y))(1(fGv(x,y)=1)(fGu(x,y)=1))].𝑧𝑢𝑣𝑝𝑢𝑣delimited-[]1𝑢subscriptPath𝐺𝑥𝑦1superscriptsubscript𝑓𝐺𝑣𝑥𝑦1superscriptsubscript𝑓𝐺𝑢𝑥𝑦1\mathbb{P}(z(u)=v)=p(u,v)\cdot[1+\mathbb{P}(u\in\text{\emph{Path}}_{G}(x,y))% \cdot(1-\mathbb{P}(f_{G}^{v}(x,y)=1)-\mathbb{P}(f_{G}^{u}(x,y)=1))].blackboard_P ( italic_z ( italic_u ) = italic_v ) = italic_p ( italic_u , italic_v ) ⋅ [ 1 + blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⋅ ( 1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) ) ] . (3)

For an intuition behind (3), the comparative factor has effectively three terms (here interpreted in the context of Algorithm 1):

  • (uPathG(x,y))𝑢subscriptPath𝐺𝑥𝑦\mathbb{P}(u\in\text{{Path}}_{G}(x,y))blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ), the probability the algorithm encounters position u𝑢uitalic_u;

  • (fGu(x,y)=1)superscriptsubscript𝑓𝐺𝑢𝑥𝑦1\mathbb{P}(f_{G}^{u}(x,y)=1)blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ), the probability that u𝑢uitalic_u is observed as a winning position; and

  • 1(fGv(x,y)=1)1superscriptsubscript𝑓𝐺𝑣𝑥𝑦11-\mathbb{P}(f_{G}^{v}(x,y)=1)1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ), the probability that v𝑣vitalic_v is observed as a losing position.

If it is likely for v𝑣vitalic_v to be observed as a losing position, but unlikely for u𝑢uitalic_u to be observed as a winning position, then it is beneficial to deliberately move from u𝑢uitalic_u to v𝑣vitalic_v (placing your opponent in a likely losing position) rather than play out with whatever the current strategy is from u𝑢uitalic_u (where you are unlikely to win), thus incurring an increase in the prevalence of z(u)=v𝑧𝑢𝑣z(u)=vitalic_z ( italic_u ) = italic_v among selected individuals. On the other hand, if the reverse is true, then it is beneficial to deliberately avoid moving from u𝑢uitalic_u to v𝑣vitalic_v and instead play out normally, thus incurring a decrease in prevalence of z(u)=v𝑧𝑢𝑣z(u)=vitalic_z ( italic_u ) = italic_v. This helps motivate the effect of the term 1(fGv(x,y)=1)(fGu(x,y)=1)1superscriptsubscript𝑓𝐺𝑣𝑥𝑦1superscriptsubscript𝑓𝐺𝑢𝑥𝑦11-\mathbb{P}(f_{G}^{v}(x,y)=1)-\mathbb{P}(f_{G}^{u}(x,y)=1)1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ). The magnitude of this effect scales with the relative frequency with which u𝑢uitalic_u is encountered as a game position, which corresponds to (uPathG(x,y))𝑢subscriptPath𝐺𝑥𝑦\mathbb{P}(u\in\text{{Path}}_{G}(x,y))blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ).

Proof of Lemma 3.2.

First, we will introduce some notation to assist with this proof. Let us write r=(uPathG(x,y))𝑟𝑢subscriptPath𝐺𝑥𝑦r=\mathbb{P}(u\in\text{Path}_{G}(x,y))italic_r = blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ), su=(fGu(x,y)=1)subscript𝑠𝑢superscriptsubscript𝑓𝐺𝑢𝑥𝑦1s_{u}=\mathbb{P}(f_{G}^{u}(x,y)=1)italic_s start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ), and sv=(fGv(x,y)=1)subscript𝑠𝑣superscriptsubscript𝑓𝐺𝑣𝑥𝑦1s_{v}=\mathbb{P}(f_{G}^{v}(x,y)=1)italic_s start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT = blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ). Let us also write

A𝐴\displaystyle Aitalic_A ={wV{u}:there is a directed path from w to u},absentconditional-set𝑤𝑉𝑢there is a directed path from w to u\displaystyle=\{w\in V\setminus\{u\}:\text{there is a directed path from $w$ % to $u$}\},= { italic_w ∈ italic_V ∖ { italic_u } : there is a directed path from italic_w to italic_u } ,
B𝐵\displaystyle Bitalic_B ={wV{u}:there is a directed path from u to w},absentconditional-set𝑤𝑉𝑢there is a directed path from u to w\displaystyle=\{w\in V\setminus\{u\}:\text{there is a directed path from $u$ % to $w$}\},= { italic_w ∈ italic_V ∖ { italic_u } : there is a directed path from italic_u to italic_w } ,

and note that A𝐴Aitalic_A, B𝐵Bitalic_B, and {u}𝑢\{u\}{ italic_u } are pairwise disjoint sets. Finally, if we have Path(x,y)=v0v1vPath𝑥𝑦subscript𝑣0subscript𝑣1subscript𝑣\text{Path}(x,y)=v_{0}v_{1}\ldots v_{\ell}Path ( italic_x , italic_y ) = italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT … italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT when regarded as a directed path (where here and throughout we drop the subscript from PathGsubscriptPath𝐺\text{Path}_{G}Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT to simplify notation), then we will define

Path1(x,y)superscriptPath1𝑥𝑦\displaystyle\text{Path}^{1}(x,y)Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ={vi:i is even},absentconditional-setsubscript𝑣𝑖i is even\displaystyle=\{v_{i}:\text{$i$ is even}\},= { italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_i is even } ,
Path2(x,y)superscriptPath2𝑥𝑦\displaystyle\text{Path}^{2}(x,y)Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ={vi:i is odd}.absentconditional-setsubscript𝑣𝑖i is odd\displaystyle=\{v_{i}:\text{$i$ is odd}\}.= { italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_i is odd } .

Note that because Path(x,y)Path𝑥𝑦\text{Path}(x,y)Path ( italic_x , italic_y ) is the disjoint union of Path1(x,y)superscriptPath1𝑥𝑦\text{Path}^{1}(x,y)Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) and Path2(x,y)superscriptPath2𝑥𝑦\text{Path}^{2}(x,y)Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ), we have

r=(uPath1(x,y))+(uPath2(x,y)).𝑟𝑢superscriptPath1𝑥𝑦𝑢superscriptPath2𝑥𝑦r=\mathbb{P}(u\in\text{Path}^{1}(x,y))+\mathbb{P}(u\in\text{Path}^{2}(x,y)).italic_r = blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) + blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) . (4)

The event z(u)=v𝑧𝑢𝑣z(u)=vitalic_z ( italic_u ) = italic_v can be written as the disjoint union of the following six events.

E1subscript𝐸1\displaystyle E_{1}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =uPath(x,y)fG(x,y)=1x(u)=vabsent𝑢Path𝑥𝑦subscript𝑓𝐺𝑥𝑦1𝑥𝑢𝑣\displaystyle=u\notin\text{Path}(x,y)\wedge f_{G}(x,y)=1\wedge x(u)=v= italic_u ∉ Path ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = 1 ∧ italic_x ( italic_u ) = italic_v
E2subscript𝐸2\displaystyle E_{2}italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =uPath(x,y)fG(x,y)=1y(u)=vabsent𝑢Path𝑥𝑦subscript𝑓𝐺𝑥𝑦1𝑦𝑢𝑣\displaystyle=u\notin\text{Path}(x,y)\wedge f_{G}(x,y)=-1\wedge y(u)=v= italic_u ∉ Path ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = - 1 ∧ italic_y ( italic_u ) = italic_v
E3subscript𝐸3\displaystyle E_{3}italic_E start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT =uPath1(x,y)x(u)=vfGv(y,x)=1absent𝑢superscriptPath1𝑥𝑦𝑥𝑢𝑣superscriptsubscript𝑓𝐺𝑣𝑦𝑥1\displaystyle=u\in\text{Path}^{1}(x,y)\wedge x(u)=v\wedge f_{G}^{v}(y,x)=-1= italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ∧ italic_x ( italic_u ) = italic_v ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_y , italic_x ) = - 1
E4subscript𝐸4\displaystyle E_{4}italic_E start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT =uPath1(x,y)fGu(x,y)=1y(u)=vabsent𝑢superscriptPath1𝑥𝑦superscriptsubscript𝑓𝐺𝑢𝑥𝑦1𝑦𝑢𝑣\displaystyle=u\in\text{Path}^{1}(x,y)\wedge f_{G}^{u}(x,y)=-1\wedge y(u)=v= italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 ∧ italic_y ( italic_u ) = italic_v
E5subscript𝐸5\displaystyle E_{5}italic_E start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT =uPath2(x,y)y(u)=vfGv(x,y)=1absent𝑢superscriptPath2𝑥𝑦𝑦𝑢𝑣superscriptsubscript𝑓𝐺𝑣𝑥𝑦1\displaystyle=u\in\text{Path}^{2}(x,y)\wedge y(u)=v\wedge f_{G}^{v}(x,y)=-1= italic_u ∈ Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ∧ italic_y ( italic_u ) = italic_v ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1
E6subscript𝐸6\displaystyle E_{6}italic_E start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT =uPath2(x,y)fGu(y,x)=1x(u)=vabsent𝑢superscriptPath2𝑥𝑦superscriptsubscript𝑓𝐺𝑢𝑦𝑥1𝑥𝑢𝑣\displaystyle=u\in\text{Path}^{2}(x,y)\wedge f_{G}^{u}(y,x)=-1\wedge x(u)=v= italic_u ∈ Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_y , italic_x ) = - 1 ∧ italic_x ( italic_u ) = italic_v

Let us examine the probability of each of these events occurring. For E1subscript𝐸1E_{1}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, the event uPath(x,y)fG(x,y)=1𝑢Path𝑥𝑦subscript𝑓𝐺𝑥𝑦1u\notin\text{Path}(x,y)\wedge f_{G}(x,y)=1italic_u ∉ Path ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = 1 can be determined using only (x(w))wusubscript𝑥𝑤𝑤𝑢(x(w))_{w\neq u}( italic_x ( italic_w ) ) start_POSTSUBSCRIPT italic_w ≠ italic_u end_POSTSUBSCRIPT and (y(w))wusubscript𝑦𝑤𝑤𝑢(y(w))_{w\neq u}( italic_y ( italic_w ) ) start_POSTSUBSCRIPT italic_w ≠ italic_u end_POSTSUBSCRIPT, and so is independent of the event x(u)=v𝑥𝑢𝑣x(u)=vitalic_x ( italic_u ) = italic_v. Similarly, in E2subscript𝐸2E_{2}italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT the event uPath(x,y)fG(x,y)=1𝑢Path𝑥𝑦subscript𝑓𝐺𝑥𝑦1u\notin\text{Path}(x,y)\wedge f_{G}(x,y)=-1italic_u ∉ Path ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = - 1 is independent of the event y(u)=v𝑦𝑢𝑣y(u)=vitalic_y ( italic_u ) = italic_v. Therefore,

(E1)+(E2)=subscript𝐸1subscript𝐸2absent\displaystyle\mathbb{P}(E_{1})+\mathbb{P}(E_{2})=blackboard_P ( italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + blackboard_P ( italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = (uPath(x,y)fG(x,y)=1)p(u,v)𝑢Path𝑥𝑦subscript𝑓𝐺𝑥𝑦1𝑝𝑢𝑣\displaystyle\,\mathbb{P}(u\notin\text{Path}(x,y)\wedge f_{G}(x,y)=1)\cdot p(u% ,v)blackboard_P ( italic_u ∉ Path ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = 1 ) ⋅ italic_p ( italic_u , italic_v )
+(uPath(x,y)fG(x,y)=1)p(u,v)𝑢Path𝑥𝑦subscript𝑓𝐺𝑥𝑦1𝑝𝑢𝑣\displaystyle+\mathbb{P}(u\notin\text{Path}(x,y)\wedge f_{G}(x,y)=-1)\cdot p(u% ,v)+ blackboard_P ( italic_u ∉ Path ( italic_x , italic_y ) ∧ italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) = - 1 ) ⋅ italic_p ( italic_u , italic_v )
=\displaystyle== (uPath(x,y))p(u,v)𝑢Path𝑥𝑦𝑝𝑢𝑣\displaystyle\,\mathbb{P}(u\notin\text{Path}(x,y))\cdot p(u,v)blackboard_P ( italic_u ∉ Path ( italic_x , italic_y ) ) ⋅ italic_p ( italic_u , italic_v )
=\displaystyle== (1r)p(u,v).1𝑟𝑝𝑢𝑣\displaystyle\,(1-r)\cdot p(u,v).( 1 - italic_r ) ⋅ italic_p ( italic_u , italic_v ) . (5)

For E3subscript𝐸3E_{3}italic_E start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, the event uPath1(x,y)𝑢superscriptPath1𝑥𝑦u\in\text{Path}^{1}(x,y)italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) can be determined using only (x(w))wAsubscript𝑥𝑤𝑤𝐴(x(w))_{w\in A}( italic_x ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ italic_A end_POSTSUBSCRIPT and (y(w))wAsubscript𝑦𝑤𝑤𝐴(y(w))_{w\in A}( italic_y ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ italic_A end_POSTSUBSCRIPT, and the event fGv(x,y)=1superscriptsubscript𝑓𝐺𝑣𝑥𝑦1f_{G}^{v}(x,y)=1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 can be determined using only (x(w))wBsubscript𝑥𝑤𝑤𝐵(x(w))_{w\in B}( italic_x ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ italic_B end_POSTSUBSCRIPT and (y(w))wBsubscript𝑦𝑤𝑤𝐵(y(w))_{w\in B}( italic_y ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ italic_B end_POSTSUBSCRIPT. Therefore, all three component events in E3subscript𝐸3E_{3}italic_E start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are independent of each other. The same is also true of E5subscript𝐸5E_{5}italic_E start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT. Therefore, noting that (fGv(x,y)=1)=(fGv(y,x)=1)superscriptsubscript𝑓𝐺𝑣𝑥𝑦1superscriptsubscript𝑓𝐺𝑣𝑦𝑥1\mathbb{P}(f_{G}^{v}(x,y)=-1)=\mathbb{P}(f_{G}^{v}(y,x)=-1)blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 ) = blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_y , italic_x ) = - 1 ), we can write

(E3)+(E5)=subscript𝐸3subscript𝐸5absent\displaystyle\mathbb{P}(E_{3})+\mathbb{P}(E_{5})=blackboard_P ( italic_E start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) + blackboard_P ( italic_E start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ) = (uPath1(x,y))p(u,v)(fGv(y,x)=1)𝑢superscriptPath1𝑥𝑦𝑝𝑢𝑣superscriptsubscript𝑓𝐺𝑣𝑦𝑥1\displaystyle\,\mathbb{P}(u\in\text{Path}^{1}(x,y))\cdot p(u,v)\cdot\mathbb{P}% (f_{G}^{v}(y,x)=-1)blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) ⋅ italic_p ( italic_u , italic_v ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_y , italic_x ) = - 1 )
+(uPath2(x,y))p(u,v)(fGv(x,y)=1)𝑢superscriptPath2𝑥𝑦𝑝𝑢𝑣superscriptsubscript𝑓𝐺𝑣𝑥𝑦1\displaystyle+\mathbb{P}(u\in\text{Path}^{2}(x,y))\cdot p(u,v)\cdot\mathbb{P}(% f_{G}^{v}(x,y)=-1)+ blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) ⋅ italic_p ( italic_u , italic_v ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 )
=\displaystyle== ((uPath1(x,y))+(uPath2(x,y)))p(u,v)(fGv(x,y)=1)𝑢superscriptPath1𝑥𝑦𝑢superscriptPath2𝑥𝑦𝑝𝑢𝑣superscriptsubscript𝑓𝐺𝑣𝑥𝑦1\displaystyle\,(\mathbb{P}(u\in\text{Path}^{1}(x,y))+\mathbb{P}(u\in\text{Path% }^{2}(x,y)))\cdot p(u,v)\cdot\mathbb{P}(f_{G}^{v}(x,y)=-1)( blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) + blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) ) ⋅ italic_p ( italic_u , italic_v ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 )
=(4)italic-(4italic-)\displaystyle\overset{\eqref{eq:r-1-2}}{=}start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG rp(u,v)(1sv).𝑟𝑝𝑢𝑣1subscript𝑠𝑣\displaystyle\,r\cdot p(u,v)\cdot(1-s_{v}).italic_r ⋅ italic_p ( italic_u , italic_v ) ⋅ ( 1 - italic_s start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) . (6)

For E4subscript𝐸4E_{4}italic_E start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, the event uPath1(x,y)𝑢superscriptPath1𝑥𝑦u\in\text{Path}^{1}(x,y)italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) can be determined using only (x(w))wAsubscript𝑥𝑤𝑤𝐴(x(w))_{w\in A}( italic_x ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ italic_A end_POSTSUBSCRIPT and (y(w))wAsubscript𝑦𝑤𝑤𝐴(y(w))_{w\in A}( italic_y ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ italic_A end_POSTSUBSCRIPT, and the event fGu(x,y)=1superscriptsubscript𝑓𝐺𝑢𝑥𝑦1f_{G}^{u}(x,y)=-1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 can be determined using only (x(w))w{u}Bsubscript𝑥𝑤𝑤𝑢𝐵(x(w))_{w\in\{u\}\cup B}( italic_x ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ { italic_u } ∪ italic_B end_POSTSUBSCRIPT and (y(w))wBsubscript𝑦𝑤𝑤𝐵(y(w))_{w\in B}( italic_y ( italic_w ) ) start_POSTSUBSCRIPT italic_w ∈ italic_B end_POSTSUBSCRIPT. Therefore, all three component events in E4subscript𝐸4E_{4}italic_E start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT are independent of each other. The same is also true of E6subscript𝐸6E_{6}italic_E start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT. Therefore, noting that (fGu(x,y)=1)=(fGu(y,x)=1)superscriptsubscript𝑓𝐺𝑢𝑥𝑦1superscriptsubscript𝑓𝐺𝑢𝑦𝑥1\mathbb{P}(f_{G}^{u}(x,y)=-1)=\mathbb{P}(f_{G}^{u}(y,x)=-1)blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 ) = blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_y , italic_x ) = - 1 ), we can write

(E4)+(E6)=subscript𝐸4subscript𝐸6absent\displaystyle\mathbb{P}(E_{4})+\mathbb{P}(E_{6})=blackboard_P ( italic_E start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) + blackboard_P ( italic_E start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT ) = (uPath1(x,y))(fGu(x,y)=1)p(u,v)𝑢superscriptPath1𝑥𝑦superscriptsubscript𝑓𝐺𝑢𝑥𝑦1𝑝𝑢𝑣\displaystyle\,\mathbb{P}(u\in\text{Path}^{1}(x,y))\cdot\mathbb{P}(f_{G}^{u}(x% ,y)=-1)\cdot p(u,v)blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 ) ⋅ italic_p ( italic_u , italic_v )
+(uPath1(x,y))(fGu(y,x)=1)p(u,v)𝑢superscriptPath1𝑥𝑦superscriptsubscript𝑓𝐺𝑢𝑦𝑥1𝑝𝑢𝑣\displaystyle+\mathbb{P}(u\in\text{Path}^{1}(x,y))\cdot\mathbb{P}(f_{G}^{u}(y,% x)=-1)\cdot p(u,v)+ blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_y , italic_x ) = - 1 ) ⋅ italic_p ( italic_u , italic_v )
=\displaystyle== ((uPath1(x,y))+(uPath2(x,y)))(fGu(x,y)=1)p(u,v)𝑢superscriptPath1𝑥𝑦𝑢superscriptPath2𝑥𝑦superscriptsubscript𝑓𝐺𝑢𝑥𝑦1𝑝𝑢𝑣\displaystyle\,(\mathbb{P}(u\in\text{Path}^{1}(x,y))+\mathbb{P}(u\in\text{Path% }^{2}(x,y)))\cdot\mathbb{P}(f_{G}^{u}(x,y)=-1)\cdot p(u,v)( blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) + blackboard_P ( italic_u ∈ Path start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_x , italic_y ) ) ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 ) ⋅ italic_p ( italic_u , italic_v )
=(4)italic-(4italic-)\displaystyle\overset{\eqref{eq:r-1-2}}{=}start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG r(1su)p(u,v).𝑟1subscript𝑠𝑢𝑝𝑢𝑣\displaystyle\,r\cdot(1-s_{u})\cdot p(u,v).italic_r ⋅ ( 1 - italic_s start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ⋅ italic_p ( italic_u , italic_v ) . (7)

We can now combine these observations to obtain

(z(u)=v)𝑧𝑢𝑣\displaystyle\mathbb{P}(z(u)=v)blackboard_P ( italic_z ( italic_u ) = italic_v ) =i[6](Ei)=(5),(6),(7)(1r)p(u,v)+rp(u,v)(1sv)+r(1su)p(u,v)absentsubscript𝑖delimited-[]6subscript𝐸𝑖italic-(5italic-)italic-(6italic-)italic-(7italic-)1𝑟𝑝𝑢𝑣𝑟𝑝𝑢𝑣1subscript𝑠𝑣𝑟1subscript𝑠𝑢𝑝𝑢𝑣\displaystyle=\sum_{i\in[6]}\mathbb{P}(E_{i})\overset{\eqref{eq:E12},\eqref{eq% :E35},\eqref{eq:E46}}{=}(1-r)\cdot p(u,v)+r\cdot p(u,v)(1-s_{v})+r\cdot(1-s_{u% })p(u,v)= ∑ start_POSTSUBSCRIPT italic_i ∈ [ 6 ] end_POSTSUBSCRIPT blackboard_P ( italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_OVERACCENT italic_( italic_) , italic_( italic_) , italic_( italic_) end_OVERACCENT start_ARG = end_ARG ( 1 - italic_r ) ⋅ italic_p ( italic_u , italic_v ) + italic_r ⋅ italic_p ( italic_u , italic_v ) ( 1 - italic_s start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) + italic_r ⋅ ( 1 - italic_s start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) italic_p ( italic_u , italic_v )
=p(u,v)[1+r(1svsu)],absent𝑝𝑢𝑣delimited-[]1𝑟1subscript𝑠𝑣subscript𝑠𝑢\displaystyle=p(u,v)\cdot[1+r\cdot(1-s_{v}-s_{u})],= italic_p ( italic_u , italic_v ) ⋅ [ 1 + italic_r ⋅ ( 1 - italic_s start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT - italic_s start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ] ,

as required. ∎

As an aside, we note here a parallel with evolutionary game theory. Consider the discrete time replicator equation with nonlinear payoff functions (see (2.1) of [57]; also [33] for the more standard continuous and linear versions),

qi=qi(1+aijqjaj),superscriptsubscript𝑞𝑖subscript𝑞𝑖1subscript𝑎𝑖subscript𝑗subscript𝑞𝑗subscript𝑎𝑗q_{i}^{\prime}=q_{i}(1+a_{i}-\textstyle\sum_{j}q_{j}a_{j}),italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 + italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , (8)

where we interpret qisubscript𝑞𝑖q_{i}italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as the proportion of type i𝑖iitalic_i in a population and aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as the fitness of a type i𝑖iitalic_i individual. The following proposition demonstrates that by identifying qisubscript𝑞𝑖q_{i}italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT appropriately, (3) can be seen to be of the form provided by (8).

Proposition 3.3.

In the setting of Lemma 3.2, let uV𝑢𝑉u\in Vitalic_u ∈ italic_V be fixed and enumerate F(u)={v1,,vk}𝐹𝑢subscript𝑣1subscript𝑣𝑘F(u)=\{v_{1},\ldots,v_{k}\}italic_F ( italic_u ) = { italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT }. Let us identify

qisubscript𝑞𝑖\displaystyle q_{i}italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =p(u,vi)absent𝑝𝑢subscript𝑣𝑖\displaystyle=p(u,v_{i})= italic_p ( italic_u , italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )
aisubscript𝑎𝑖\displaystyle a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =(uPathG(x,y))(1(fGvi(x,y)=1)).absent𝑢subscriptPath𝐺𝑥𝑦1superscriptsubscript𝑓𝐺subscript𝑣𝑖𝑥𝑦1\displaystyle=\mathbb{P}(u\in\text{{Path}}_{G}(x,y))\cdot(1-\mathbb{P}(f_{G}^{% v_{i}}(x,y)=1)).= blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⋅ ( 1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) ) .

Then (3) can be rewritten as qi=qi(1+aij[k]qjaj)superscriptsubscript𝑞𝑖subscript𝑞𝑖1subscript𝑎𝑖subscript𝑗delimited-[]𝑘subscript𝑞𝑗subscript𝑎𝑗q_{i}^{\prime}=q_{i}(1+a_{i}-\sum_{j\in[k]}q_{j}a_{j})italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 + italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_k ] end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ).

Proof.

First, note the identity

(fGu(x,y)=1)=j[k]p(u,vj)(fGvj(y,x)=1)=j[k]qj(1(fGvj(x,y)=1)).superscriptsubscript𝑓𝐺𝑢𝑥𝑦1subscript𝑗delimited-[]𝑘𝑝𝑢subscript𝑣𝑗superscriptsubscript𝑓𝐺subscript𝑣𝑗𝑦𝑥1subscript𝑗delimited-[]𝑘subscript𝑞𝑗1superscriptsubscript𝑓𝐺subscript𝑣𝑗𝑥𝑦1\mathbb{P}(f_{G}^{u}(x,y)=1)=\sum_{j\in[k]}p(u,v_{j})\cdot\mathbb{P}(f_{G}^{v_% {j}}(y,x)=-1)=\sum_{j\in[k]}q_{j}\cdot(1-\mathbb{P}(f_{G}^{v_{j}}(x,y)=1)).blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_k ] end_POSTSUBSCRIPT italic_p ( italic_u , italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_y , italic_x ) = - 1 ) = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_k ] end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ ( 1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) ) . (9)

Therefore,

(z(u)=vi)𝑧𝑢subscript𝑣𝑖\displaystyle\mathbb{P}(z(u)=v_{i})blackboard_P ( italic_z ( italic_u ) = italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) =(3)p(u,vi)[1+(uPathG(x,y))(1(fGvi(x,y)=1)(fGu(x,y)=1))]italic-(3italic-)𝑝𝑢subscript𝑣𝑖delimited-[]1𝑢subscriptPath𝐺𝑥𝑦1superscriptsubscript𝑓𝐺subscript𝑣𝑖𝑥𝑦1superscriptsubscript𝑓𝐺𝑢𝑥𝑦1\displaystyle\overset{\eqref{eq:sampled-individual}}{=}p(u,v_{i})\cdot[1+% \mathbb{P}(u\in\text{{Path}}_{G}(x,y))\cdot(1-\mathbb{P}(f_{G}^{v_{i}}(x,y)=1)% -\mathbb{P}(f_{G}^{u}(x,y)=1))]start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG italic_p ( italic_u , italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⋅ [ 1 + blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⋅ ( 1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) ) ]
=qi[1+ai(uPathG(x,y))(fGu(x,y)=1)]absentsubscript𝑞𝑖delimited-[]1subscript𝑎𝑖𝑢subscriptPath𝐺𝑥𝑦superscriptsubscript𝑓𝐺𝑢𝑥𝑦1\displaystyle=q_{i}\cdot[1+a_{i}-\mathbb{P}(u\in\text{{Path}}_{G}(x,y))\cdot% \mathbb{P}(f_{G}^{u}(x,y)=1)]= italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ [ 1 + italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⋅ blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) ]
=(9)qi[1+ai(uPathG(x,y))j[k]qj(1(fGvj(x,y)=1))]italic-(9italic-)subscript𝑞𝑖delimited-[]1subscript𝑎𝑖𝑢subscriptPath𝐺𝑥𝑦subscript𝑗delimited-[]𝑘subscript𝑞𝑗1superscriptsubscript𝑓𝐺subscript𝑣𝑗𝑥𝑦1\displaystyle\overset{\eqref{eq:rep-step}}{=}q_{i}\cdot[1+a_{i}-\mathbb{P}(u% \in\text{{Path}}_{G}(x,y))\cdot\textstyle\sum_{j\in[k]}q_{j}\cdot(1-\mathbb{P}% (f_{G}^{v_{j}}(x,y)=1))]start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ [ 1 + italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⋅ ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_k ] end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ ( 1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) ) ]
=qi(1+aij[k]qjaj),absentsubscript𝑞𝑖1subscript𝑎𝑖subscript𝑗delimited-[]𝑘subscript𝑞𝑗subscript𝑎𝑗\displaystyle=q_{i}(1+a_{i}-\textstyle\sum_{j\in[k]}q_{j}a_{j}),= italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 + italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_k ] end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ,

as required. ∎

In this sense, when executing Algorithm 1 on a game G𝐺Gitalic_G, the evolution of the distribution p(u,)𝑝𝑢p(u,\,\cdot\,)italic_p ( italic_u , ⋅ ) at each vertex u𝑢uitalic_u of G𝐺Gitalic_G stochastically emulates these replicator dynamics. However, a key difference is that in standard evolutionary game theory, each fitness function ai:=ai(q1,,qk)assignsubscript𝑎𝑖subscript𝑎𝑖subscript𝑞1subscript𝑞𝑘a_{i}:=a_{i}(q_{1},\ldots,q_{k})italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT := italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) typically depends only on the distribution of types in the population; whereas the expression ai=(uPathG(u,vi))(1(fGvi(x,y)=1))subscript𝑎𝑖𝑢subscriptPath𝐺𝑢subscript𝑣𝑖1superscriptsubscript𝑓𝐺subscript𝑣𝑖𝑥𝑦1a_{i}=\mathbb{P}(u\in\text{Path}_{G}(u,v_{i}))\cdot(1-\mathbb{P}(f_{G}^{v_{i}}% (x,y)=1))italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u , italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ⋅ ( 1 - blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) ) depends on the distribution of ‘types’ not just at the node u𝑢uitalic_u, but also at possibly all other nodes as well, and so the dynamics of each node cannot be considered in isolation.

4 Switchability

In Section 1 we noted that our main result implies a probabilistic upper bound of nO(s¯)superscript𝑛𝑂¯𝑠n^{O(\overline{s})}italic_n start_POSTSUPERSCRIPT italic_O ( over¯ start_ARG italic_s end_ARG ) end_POSTSUPERSCRIPT on an impartial combinatorial game, where s¯¯𝑠\overline{s}over¯ start_ARG italic_s end_ARG is an (often small) invariant of the corresponding game graph. In this section, we define this invariant and prove a key lemma.

Rather than defining this property, which we call switchability, for a game as a whole, we will actually define switchability as a property s(u)𝑠𝑢s(u)italic_s ( italic_u ) of each vertex u𝑢uitalic_u in the game’s vertex set V𝑉Vitalic_V. Then later we will take s¯=maxuVs(u)¯𝑠subscript𝑢𝑉𝑠𝑢\overline{s}=\max_{u\in V}s(u)over¯ start_ARG italic_s end_ARG = roman_max start_POSTSUBSCRIPT italic_u ∈ italic_V end_POSTSUBSCRIPT italic_s ( italic_u ) (see Corollary 5.5). Intuitively, s(u)𝑠𝑢s(u)italic_s ( italic_u ) measures the ‘smallest’ possible set of edges AE(G)𝐴𝐸𝐺A\subseteq E(G)italic_A ⊆ italic_E ( italic_G ) such that any pair of strategies x,y𝒳G𝑥𝑦subscript𝒳𝐺x,y\in\mathcal{X}_{G}italic_x , italic_y ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT satisfying A{(v,x(v)):vV}𝐴conditional-set𝑣𝑥𝑣𝑣𝑉A\subseteq\{(v,x(v)):v\in V\}italic_A ⊆ { ( italic_v , italic_x ( italic_v ) ) : italic_v ∈ italic_V } and A{(v,y(v)):vV}𝐴conditional-set𝑣𝑦𝑣𝑣𝑉A\subseteq\{(v,y(v)):v\in V\}italic_A ⊆ { ( italic_v , italic_y ( italic_v ) ) : italic_v ∈ italic_V } must also satisfy uPathG(x,y)𝑢subscriptPath𝐺𝑥𝑦u\in\text{Path}_{G}(x,y)italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ). The motivation is that if x,yUniv(𝒳G,p)similar-to𝑥𝑦Univsubscript𝒳𝐺𝑝x,y\sim\text{Univ}(\mathcal{X}_{G},p)italic_x , italic_y ∼ Univ ( caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_p ) for some pvInt(G)𝒫γ(F(v))𝑝subscriptproduct𝑣Int𝐺subscript𝒫𝛾𝐹𝑣p\in\prod_{v\in\text{Int}(G)}\mathcal{P}_{\gamma}(F(v))italic_p ∈ ∏ start_POSTSUBSCRIPT italic_v ∈ Int ( italic_G ) end_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_F ( italic_v ) ), then uPathG(x,y)𝑢subscriptPath𝐺𝑥𝑦u\in\text{Path}_{G}(x,y)italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) is assured by having x𝑥xitalic_x and y𝑦yitalic_y take certain values at the vertices appearing at the tail of some edge in A𝐴Aitalic_A, which occurs with probability at least γ2s(u)superscript𝛾2𝑠𝑢\gamma^{2s(u)}italic_γ start_POSTSUPERSCRIPT 2 italic_s ( italic_u ) end_POSTSUPERSCRIPT. A property that places a lower bound on (uPathG(x,y))𝑢subscriptPath𝐺𝑥𝑦\mathbb{P}(u\in\text{Path}_{G}(x,y))blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) in such a way will be very useful as we seek to apply Lemma 3.2 later.

B𝐵Bitalic_Bv0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTu𝑢uitalic_uA𝐴Aitalic_A
Figure 2: An example of switchability.

In the description above, a naive approach would be to take ‘smallest’ to simply mean having fewest edges. However, while this gives a working definition, when then bounding (uPathG(x,y))𝑢subscriptPath𝐺𝑥𝑦\mathbb{P}(u\in\text{Path}_{G}(x,y))blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) below, it is clear that significant improvements can be made in many cases. Consider the example shown in Figure 2. Our naive approach suggests that if x,yUniv(𝒳G,p)similar-to𝑥𝑦Univsubscript𝒳𝐺𝑝x,y\sim\text{Univ}(\mathcal{X}_{G},p)italic_x , italic_y ∼ Univ ( caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_p ) for some pvInt(G)𝒫γ(F(v))𝑝subscriptproduct𝑣Int𝐺subscript𝒫𝛾𝐹𝑣p\in\prod_{v\in\text{Int}(G)}\mathcal{P}_{\gamma}(F(v))italic_p ∈ ∏ start_POSTSUBSCRIPT italic_v ∈ Int ( italic_G ) end_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_F ( italic_v ) ), then (uPathG(x,y))γ10𝑢subscriptPath𝐺𝑥𝑦superscript𝛾10\mathbb{P}(u\in\text{Path}_{G}(x,y))\geqslant\gamma^{10}blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⩾ italic_γ start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT. However, it would be better to observe that (uPathG(x,y))γ𝑢subscriptPath𝐺𝑥𝑦𝛾\mathbb{P}(u\in\text{Path}_{G}(x,y))\geqslant\gammablackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⩾ italic_γ, as visiting u𝑢uitalic_u can be assured a single choice to move to u𝑢uitalic_u made by the player who makes the first move after reaching the layer B𝐵Bitalic_B (in this case, always the player y𝑦yitalic_y).

To better capture this notion, we will not take ‘smallest’ to mean fewest edges, but rather smallest depth, defined in the following way (we recall here that all graphs are assumed to be acyclic).

Definition 4.1.

Given a set of edges AE(G)𝐴𝐸𝐺A\subseteq E(G)italic_A ⊆ italic_E ( italic_G ), we define the depth of A𝐴Aitalic_A to be

Depth(A)=max{|AE(P)|:P is a directed path in G}.Depth𝐴:𝐴𝐸𝑃P is a directed path in G\text{\emph{Depth}}(A)=\max{\{|A\cap E(P)|:\text{$P$ is a directed path in $G$% }\}}.Depth ( italic_A ) = roman_max { | italic_A ∩ italic_E ( italic_P ) | : italic_P is a directed path in italic_G } .

With this, the full description of switchability is provided by the following two definitions.

Definition 4.2.

Given a set AE(G)𝐴𝐸𝐺A\subseteq E(G)italic_A ⊆ italic_E ( italic_G ), we (inductively) say that a directed path P=v0v𝑃subscript𝑣0subscript𝑣P=v_{0}\ldots v_{\ell}italic_P = italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is A𝐴Aitalic_A-compatible if any of the following conditions hold.

  1. C1

    P=v0𝑃subscript𝑣0P=v_{0}italic_P = italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

  2. C2

    v0v1subscript𝑣0subscript𝑣1v_{0}\ldots v_{\ell-1}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_v start_POSTSUBSCRIPT roman_ℓ - 1 end_POSTSUBSCRIPT is A𝐴Aitalic_A-compatible and v1vAsubscript𝑣1subscript𝑣𝐴v_{\ell-1}v_{\ell}\in Aitalic_v start_POSTSUBSCRIPT roman_ℓ - 1 end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∈ italic_A.

  3. C3

    v0v1subscript𝑣0subscript𝑣1v_{0}\ldots v_{\ell-1}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_v start_POSTSUBSCRIPT roman_ℓ - 1 end_POSTSUBSCRIPT is A𝐴Aitalic_A-compatible and there is no wV𝑤𝑉w\in Vitalic_w ∈ italic_V such that v1wAsubscript𝑣1𝑤𝐴v_{\ell-1}w\in Aitalic_v start_POSTSUBSCRIPT roman_ℓ - 1 end_POSTSUBSCRIPT italic_w ∈ italic_A.

Then, given a vertex v𝑣vitalic_v, we say that A𝐴Aitalic_A is a v𝑣vitalic_v-switcher if v𝑣vitalic_v is contained in every A𝐴Aitalic_A-compatible directed path v0vsubscript𝑣0subscript𝑣v_{0}\ldots v_{\ell}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT with vInt(G)subscript𝑣Int𝐺v_{\ell}\notin\text{\emph{Int}}(G)italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∉ Int ( italic_G ).

Definition 4.3.

The switchability s(v)𝑠𝑣s(v)italic_s ( italic_v ) of a vertex v𝑣vitalic_v is the smallest possible depth of a v𝑣vitalic_v-switcher. We will also write s¯=maxvVs(v)¯𝑠subscript𝑣𝑉𝑠𝑣\overline{s}=\max_{v\in V}s(v)over¯ start_ARG italic_s end_ARG = roman_max start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT italic_s ( italic_v ).

v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTv𝑣vitalic_vA𝐴Aitalic_A
v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTv𝑣vitalic_vA𝐴Aitalic_A
Figure 3: Two illustrations of switchability. In the first, s(v)=1𝑠𝑣1s(v)=1italic_s ( italic_v ) = 1, and a v𝑣vitalic_v-switcher of depth 1111 is shown in blue. In the second, s(v)=2𝑠𝑣2s(v)=2italic_s ( italic_v ) = 2, a v𝑣vitalic_v-switcher of depth 2222 is shown in blue, and one example of an A𝐴Aitalic_A-compatible path is shown in red.

Thus, while the set A𝐴Aitalic_A shown in Figure 2 has 5 edges, it has Depth(A)=1Depth𝐴1\text{Depth}(A)=1Depth ( italic_A ) = 1, and so in that case we have s(u)=1𝑠𝑢1s(u)=1italic_s ( italic_u ) = 1. Figure 3 shows two further illustrations of switchability. For certain games, constructing a small v𝑣vitalic_v-switcher is quite straightforward (see proof of Proposition 6.1 later); in other cases where determining switchability is not obvious, the following upper bound may be used instead.

Proposition 4.4.

If there is a directed path of length \ellroman_ℓ from v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT to v𝑣vitalic_v, then s(v)𝑠𝑣s(v)\leqslant\ellitalic_s ( italic_v ) ⩽ roman_ℓ.

Proof.

If P𝑃Pitalic_P is a directed path from v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT to v𝑣vitalic_v, then every E(P)𝐸𝑃E(P)italic_E ( italic_P )-compatible path v0vsubscript𝑣0subscript𝑣v_{0}\ldots v_{\ell}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT with vInt(G)subscript𝑣Int𝐺v_{\ell}\notin\text{Int}(G)italic_v start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∉ Int ( italic_G ) includes P𝑃Pitalic_P as a prefix, and hence also includes v𝑣vitalic_v. Thus, E(P)𝐸𝑃E(P)italic_E ( italic_P ) is a v𝑣vitalic_v-switcher of depth \ellroman_ℓ. ∎

To complete this section, the required lower bound on (uPathG(x,y))𝑢subscriptPath𝐺𝑥𝑦\mathbb{P}(u\in\text{Path}_{G}(x,y))blackboard_P ( italic_u ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) is provided by the following lemma. Note that as well as improving the naive approach by using Depth(A)Depth𝐴\text{Depth}(A)Depth ( italic_A ) instead of |A|𝐴|A|| italic_A |, we also deduce a result of γs(v)superscript𝛾𝑠𝑣\gamma^{s(v)}italic_γ start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT instead of γ2s(v)superscript𝛾2𝑠𝑣\gamma^{2s(v)}italic_γ start_POSTSUPERSCRIPT 2 italic_s ( italic_v ) end_POSTSUPERSCRIPT by carefully accounting for the fact that at most one player can visit each possible game position (due to the previous assumption that the impartial combinatorial games considered are acyclic).

Lemma 4.5.

Suppose that pvInt(G)𝒫γ(F(v))𝑝subscriptproduct𝑣Int𝐺subscript𝒫𝛾𝐹𝑣p\in\prod_{v\in\text{\emph{Int}}(G)}\mathcal{P}_{\gamma}(F(v))italic_p ∈ ∏ start_POSTSUBSCRIPT italic_v ∈ Int ( italic_G ) end_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_F ( italic_v ) ) and x,yUniv(𝒳G,p)similar-to𝑥𝑦Univsubscript𝒳𝐺𝑝x,y\sim\text{\emph{Univ}}(\mathcal{X}_{G},p)italic_x , italic_y ∼ Univ ( caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_p ). Then for every vV𝑣𝑉v\in Vitalic_v ∈ italic_V, (vPathG(x,y))γs(v)𝑣subscriptPath𝐺𝑥𝑦superscript𝛾𝑠𝑣\mathbb{P}(v\in\text{\emph{Path}}_{G}(x,y))\geqslant\gamma^{s(v)}blackboard_P ( italic_v ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⩾ italic_γ start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT.

Proof.

The distribution of PathG(x,y)subscriptPath𝐺𝑥𝑦\text{Path}_{G}(x,y)Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) is the same as the random set P𝑃Pitalic_P produced by the following process.

  1. 1.

    Initially, set z0=v0subscript𝑧0subscript𝑣0z_{0}=v_{0}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

  2. 2.

    For i0𝑖0i\geqslant 0italic_i ⩾ 0, do the following.

    1. (a)

      If ziInt(G)subscript𝑧𝑖Int𝐺z_{i}\notin\text{Int}(G)italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∉ Int ( italic_G ), then set P={z0,,zi}𝑃subscript𝑧0subscript𝑧𝑖P=\{z_{0},\ldots,z_{i}\}italic_P = { italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }.

    2. (b)

      Otherwise if ziInt(G)subscript𝑧𝑖Int𝐺z_{i}\in\text{Int}(G)italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ Int ( italic_G ), sample zi+1p(zi)similar-tosubscript𝑧𝑖1𝑝subscript𝑧𝑖z_{i+1}\sim p(z_{i})italic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ∼ italic_p ( italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ).

We will generate an instance of the above process in a very specific way using a collection of independent Unif([0,1])Unif01\text{Unif}([0,1])Unif ( [ 0 , 1 ] ) random variables. First, let A𝐴Aitalic_A be a v𝑣vitalic_v-switcher of depth s(v)𝑠𝑣s(v)italic_s ( italic_v ), and let B={uV:(u,w)A for some wF(u)}𝐵conditional-set𝑢𝑉(u,w)A for some wF(u)B=\{u\in V:\text{$(u,w)\in A$ for some $w\in F(u)$}\}italic_B = { italic_u ∈ italic_V : ( italic_u , italic_w ) ∈ italic_A for some italic_w ∈ italic_F ( italic_u ) }. Next, for each uV𝑢𝑉u\in Vitalic_u ∈ italic_V, let ϕu:[0,1]F(u):subscriptitalic-ϕ𝑢01𝐹𝑢\phi_{u}:[0,1]\to F(u)italic_ϕ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT : [ 0 , 1 ] → italic_F ( italic_u ) be any function satisfying the following properties.

  1. D1

    If XUnif([0,1])similar-to𝑋Unif01X\sim\text{Unif}([0,1])italic_X ∼ Unif ( [ 0 , 1 ] ), then (ϕu(X)=w)=p(u,w)subscriptitalic-ϕ𝑢𝑋𝑤𝑝𝑢𝑤\mathbb{P}(\phi_{u}(X)=w)=p(u,w)blackboard_P ( italic_ϕ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_X ) = italic_w ) = italic_p ( italic_u , italic_w ) for every uV𝑢𝑉u\in Vitalic_u ∈ italic_V.

  2. D2

    If s[0,γ]𝑠0𝛾s\in[0,\gamma]italic_s ∈ [ 0 , italic_γ ] and uB𝑢𝐵u\in Bitalic_u ∈ italic_B, then (u,ϕu(s))A𝑢subscriptitalic-ϕ𝑢𝑠𝐴(u,\phi_{u}(s))\in A( italic_u , italic_ϕ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_s ) ) ∈ italic_A.

Note that this is possible because p(u,w)γ𝑝𝑢𝑤𝛾p(u,w)\geqslant\gammaitalic_p ( italic_u , italic_w ) ⩾ italic_γ holds for every uV𝑢𝑉u\in Vitalic_u ∈ italic_V and wF(u)𝑤𝐹𝑢w\in F(u)italic_w ∈ italic_F ( italic_u ). The modified process is then as follows.

  1. 0.

    Let X1,,Xn,Y1,,Ynsubscript𝑋1subscript𝑋𝑛subscript𝑌1subscript𝑌𝑛X_{1},\ldots,X_{n},Y_{1},\ldots,Y_{n}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT be independent Unif([0,1])Unif01\text{Unif}([0,1])Unif ( [ 0 , 1 ] ) random variables.

  2. 1.

    Initially, set z0=v0subscript𝑧0subscript𝑣0z_{0}=v_{0}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

  3. 2.

    For i0𝑖0i\geqslant 0italic_i ⩾ 0, do the following

    1. (a)

      If ziInt(G)subscript𝑧𝑖Int𝐺z_{i}\notin\text{Int}(G)italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∉ Int ( italic_G ), then set Q={z0,,zi}𝑄subscript𝑧0subscript𝑧𝑖Q=\{z_{0},\ldots,z_{i}\}italic_Q = { italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }.

    2. (b)

      If ziBsubscript𝑧𝑖𝐵z_{i}\in Bitalic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_B, then set ri=|{z0,,zi}B|subscript𝑟𝑖subscript𝑧0subscript𝑧𝑖𝐵r_{i}=|\{z_{0},\ldots,z_{i}\}\cap B|italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = | { italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } ∩ italic_B | and zi+1=ϕzi(Xri)subscript𝑧𝑖1subscriptitalic-ϕsubscript𝑧𝑖subscript𝑋subscript𝑟𝑖z_{i+1}=\phi_{z_{i}}(X_{r_{i}})italic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_ϕ start_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ).

    3. (c)

      If ziInt(G)Bsubscript𝑧𝑖Int𝐺𝐵z_{i}\in\text{Int}(G)\setminus Bitalic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ Int ( italic_G ) ∖ italic_B, then set ri=|{z0,,zi}B|subscript𝑟𝑖subscript𝑧0subscript𝑧𝑖𝐵r_{i}=|\{z_{0},\ldots,z_{i}\}\setminus B|italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = | { italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } ∖ italic_B | and zi+1=ϕzi(Yri)subscript𝑧𝑖1subscriptitalic-ϕsubscript𝑧𝑖subscript𝑌subscript𝑟𝑖z_{i+1}=\phi_{z_{i}}(Y_{r_{i}})italic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_ϕ start_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_Y start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ).

From D1 it follows that Q𝑄Qitalic_Q has the same distribution as P𝑃Pitalic_P, and hence also as PathG(x,y)subscriptPath𝐺𝑥𝑦\text{Path}_{G}(x,y)Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ).

We now claim that vQ𝑣𝑄v\in Qitalic_v ∈ italic_Q whenever X1,,Xs(v)[0,γ]subscript𝑋1subscript𝑋𝑠𝑣0𝛾X_{1},\ldots,X_{s(v)}\in[0,\gamma]italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_s ( italic_v ) end_POSTSUBSCRIPT ∈ [ 0 , italic_γ ]. The key observation is that under this regime, the first s(v)𝑠𝑣s(v)italic_s ( italic_v ) visits that Q𝑄Qitalic_Q makes to B𝐵Bitalic_B must be followed by an edge in A𝐴Aitalic_A. Let us label Q=z0z1z𝑄subscript𝑧0subscript𝑧1subscript𝑧Q=z_{0}z_{1}\ldots z_{\ell}italic_Q = italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. We will show by induction on i𝑖iitalic_i that z0zisubscript𝑧0subscript𝑧𝑖z_{0}\ldots z_{i}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is A𝐴Aitalic_A-compatible for every i[]𝑖delimited-[]i\in[\ell]italic_i ∈ [ roman_ℓ ], noting that the case i=0𝑖0i=0italic_i = 0 holds because z0=v0subscript𝑧0subscript𝑣0z_{0}=v_{0}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. For the inductive step, if z0zisubscript𝑧0subscript𝑧𝑖z_{0}\ldots z_{i}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is A𝐴Aitalic_A-compatible, then the only way for z0zi+1subscript𝑧0subscript𝑧𝑖1z_{0}\ldots z_{i+1}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT to not be A𝐴Aitalic_A-compatible is to have ziBsubscript𝑧𝑖𝐵z_{i}\in Bitalic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_B and (zi,zi+1)Asubscript𝑧𝑖subscript𝑧𝑖1𝐴(z_{i},z_{i+1})\notin A( italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) ∉ italic_A. But then, because the first s(v)𝑠𝑣s(v)italic_s ( italic_v ) visits that Q𝑄Qitalic_Q makes to B𝐵Bitalic_B are followed by an edge in A𝐴Aitalic_A, we can infer that z0zisubscript𝑧0subscript𝑧𝑖z_{0}\ldots z_{i}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT already includes at least s(v)𝑠𝑣s(v)italic_s ( italic_v ) edges in A𝐴Aitalic_A. Letting wV𝑤𝑉w\in Vitalic_w ∈ italic_V be such that (zi,w)Asubscript𝑧𝑖𝑤𝐴(z_{i},w)\in A( italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_w ) ∈ italic_A, we then have that R:=z0ziwassign𝑅subscript𝑧0subscript𝑧𝑖𝑤R:=z_{0}\ldots z_{i}witalic_R := italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w is a directed path with |E(R)A|s(v)+1𝐸𝑅𝐴𝑠𝑣1|E(R)\cap A|\geqslant s(v)+1| italic_E ( italic_R ) ∩ italic_A | ⩾ italic_s ( italic_v ) + 1, a contradiction to the depth of A𝐴Aitalic_A. So in fact the inductive step holds, and z0zisubscript𝑧0subscript𝑧𝑖z_{0}\ldots z_{i}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is A𝐴Aitalic_A-compatible for every i𝑖iitalic_i. In particular, Q𝑄Qitalic_Q is A𝐴Aitalic_A-compatible, and hence vQ𝑣𝑄v\in Qitalic_v ∈ italic_Q.

Thus, vQ𝑣𝑄v\in Qitalic_v ∈ italic_Q whenever X1,,Xs(v)[0,γ]subscript𝑋1subscript𝑋𝑠𝑣0𝛾X_{1},\ldots,X_{s(v)}\in[0,\gamma]italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_s ( italic_v ) end_POSTSUBSCRIPT ∈ [ 0 , italic_γ ], and hence

(vPathG(x,y))=(vQ)(X1,,Xs(v)γ)=γs(v),𝑣subscriptPath𝐺𝑥𝑦𝑣𝑄subscript𝑋1subscript𝑋𝑠𝑣𝛾superscript𝛾𝑠𝑣\displaystyle\mathbb{P}(v\in\text{Path}_{G}(x,y))=\mathbb{P}(v\in Q)\geqslant% \mathbb{P}(X_{1},\ldots,X_{s(v)}\leqslant\gamma)=\gamma^{s(v)},blackboard_P ( italic_v ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) = blackboard_P ( italic_v ∈ italic_Q ) ⩾ blackboard_P ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_s ( italic_v ) end_POSTSUBSCRIPT ⩽ italic_γ ) = italic_γ start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT ,

as required. ∎

5 Main result

In order to state runtime results, we adopt the standard black box convention where runtime is defined as the number of times a function is queried until the algorithm reaches the desired search objective (see [14]), as follows.

Definition 5.1.

Suppose that G𝐺Gitalic_G is an impartial combinatorial game, and that 𝒜𝒜\mathcal{A}caligraphic_A is an algorithm which makes τ𝜏\tauitalic_τ queries of fGsubscript𝑓𝐺f_{G}italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT during each generation. Then, given a set B𝒳G𝐵subscript𝒳𝐺B\subseteq\mathcal{X}_{G}italic_B ⊆ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, the runtime of 𝒜𝒜\mathcal{A}caligraphic_A on fGsubscript𝑓𝐺f_{G}italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT is defined to be the random variable

T𝒜G(B)=τmin{t:PtB}superscriptsubscript𝑇𝒜𝐺𝐵𝜏:𝑡subscript𝑃𝑡𝐵T_{\mathcal{A}}^{G}(B)=\tau\cdot\min{\{t:P_{t}\cap B\neq\emptyset\}}italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT ( italic_B ) = italic_τ ⋅ roman_min { italic_t : italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∩ italic_B ≠ ∅ }

where Pt𝒳Gsubscript𝑃𝑡subscript𝒳𝐺P_{t}\subseteq\mathcal{X}_{G}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊆ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT is the population of 𝒜𝒜\mathcal{A}caligraphic_A at the start of generation t𝑡titalic_t. (If the game G𝐺Gitalic_G is clear from context, we will write T𝒜subscript𝑇𝒜T_{\mathcal{A}}italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT instead of T𝒜Gsuperscriptsubscript𝑇𝒜𝐺T_{\mathcal{A}}^{G}italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT.)

Our main result is now provided by Theorem 5.2. In simple terms, it states that if Algorithm 1 is executed on an impartial combinatorial game G𝐺Gitalic_G using a sufficiently large population size μ𝜇\muitalic_μ, then with high probability its runtime is at most O(μr(G))𝑂𝜇𝑟𝐺O(\mu\cdot r(G))italic_O ( italic_μ ⋅ italic_r ( italic_G ) ), where r(G)𝑟𝐺r(G)italic_r ( italic_G ) is a formula of the game graph expressed in terms of its number of vertices n𝑛nitalic_n, maximum degree ΔΔ\Deltaroman_Δ, and a summation involving the switchability s(v)𝑠𝑣s(v)italic_s ( italic_v ) (Definition 4.3) at each critical position vWG𝑣subscript𝑊𝐺v\in W_{G}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT (Definition 2.1). Notably, r(G)𝑟𝐺r(G)italic_r ( italic_G ) is increasing in each of n𝑛nitalic_n, ΔΔ\Deltaroman_Δ, and s(v)𝑠𝑣s(v)italic_s ( italic_v ), indicating that games for which these quantities are high may be the most difficult to optimise. We remark that the exact parameter settings for Algorithm 1 appearing in the statement have not been chosen to guarantee an optimal runtime, but rather to make the proof more comprehensible.

Theorem 5.2.

There is a constant C>0𝐶0C>0italic_C > 0 such that the following holds. Let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex impartial combinatorial game with maximum degree ΔΔ\Deltaroman_Δ, and let s^=maxvWGs(v)^𝑠subscript𝑣subscript𝑊𝐺𝑠𝑣\hat{s}=\max_{v\in W_{G}}s(v)over^ start_ARG italic_s end_ARG = roman_max start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_s ( italic_v ). Let K>0𝐾0K>0italic_K > 0, and let 𝒜𝒜\mathcal{A}caligraphic_A be described by Algorithm 1, where γ=1/(20Δn)𝛾120Δ𝑛\gamma=1/(20\Delta n)italic_γ = 1 / ( 20 roman_Δ italic_n ) and

μC(K+s^+1)(20Δn)1+2s^logn.𝜇𝐶𝐾^𝑠1superscript20Δ𝑛12^𝑠𝑛\mu\geqslant C(K+\hat{s}+1)(20\Delta n)^{1+2\hat{s}}\log{n}.italic_μ ⩾ italic_C ( italic_K + over^ start_ARG italic_s end_ARG + 1 ) ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT 1 + 2 over^ start_ARG italic_s end_ARG end_POSTSUPERSCRIPT roman_log italic_n . (10)

Then,

[T𝒜G(Opt(G))CμvWG(20Δn)s(v)logn]nK.delimited-[]superscriptsubscript𝑇𝒜𝐺Opt𝐺𝐶𝜇subscript𝑣subscript𝑊𝐺superscript20Δ𝑛𝑠𝑣𝑛superscript𝑛𝐾\mathbb{P}\Biggl{[}T_{\mathcal{A}}^{G}(\text{\emph{Opt}}(G))\geqslant C\mu\sum% _{v\in W_{G}}(20\Delta n)^{s(v)}\log{n}\Biggr{]}\leqslant n^{-K}.blackboard_P [ italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT ( Opt ( italic_G ) ) ⩾ italic_C italic_μ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT roman_log italic_n ] ⩽ italic_n start_POSTSUPERSCRIPT - italic_K end_POSTSUPERSCRIPT .

The asymptotic behaviour of the runtime bound may not be immediately obvious from the form stated here. Accordingly, we will shortly provide an easier to digest corollary using the facts s(v)s^maxvVs(v)𝑠𝑣^𝑠subscript𝑣𝑉𝑠𝑣s(v)\leqslant\hat{s}\leqslant\max_{v\in V}s(v)italic_s ( italic_v ) ⩽ over^ start_ARG italic_s end_ARG ⩽ roman_max start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT italic_s ( italic_v ) and |WG|nsubscript𝑊𝐺𝑛|W_{G}|\leqslant n| italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT | ⩽ italic_n to remove the role of WGsubscript𝑊𝐺W_{G}italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT and the corresponding summation. For many games (including the applications considered later), this simplified bound has the same asymptotic behaviour as the one provided by Theorem 5.2. Nonetheless, as it is possible to construct games for which Theorem 5.2 offers significant improvement of the simplified bound, we opt to retain the more general form above.

We will now briefly provide some intuition for the proof of Theorem 5.2. As characterised by Lemma 2.2, we know that any strategy x𝒳G𝑥subscript𝒳𝐺x\in\mathcal{X}_{G}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT that makes the correct decision at every critical position vWG𝑣subscript𝑊𝐺v\in W_{G}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT is an element of Opt(G)Opt𝐺\text{Opt}(G)Opt ( italic_G ) (where here, making a correct decision means ensuring x(v)𝑥𝑣x(v)italic_x ( italic_v ) has a Sprague-Grundy value of 00). Thus, we consider the sequence p0,p1,subscript𝑝0subscript𝑝1p_{0},p_{1},\ldotsitalic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … appearing in Algorithm 1, and estimate the time until the algorithm arrives at some p𝑝pitalic_p such that, with high probability, an xUniv(𝒳G,pt)similar-to𝑥Univsubscript𝒳𝐺subscript𝑝𝑡x\sim\text{Univ}(\mathcal{X}_{G},p_{t})italic_x ∼ Univ ( caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) makes the correct decision at every critical position. The progress to arrive at such a p𝑝pitalic_p is effectively broken down into |WG|subscript𝑊𝐺|W_{G}|| italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT | steps: fixing an ordering u1,,unsubscript𝑢1subscript𝑢𝑛u_{1},\ldots,u_{n}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of V𝑉Vitalic_V such that F(ui){u1,,ui1}𝐹subscript𝑢𝑖subscript𝑢1subscript𝑢𝑖1F(u_{i})\subseteq\{u_{1},\ldots,u_{i-1}\}italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⊆ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } for every i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] (a reverse topological ordering), step k𝑘kitalic_k finishes when, with high probability, an xUniv(𝒳G,pt)similar-to𝑥Univsubscript𝒳𝐺subscript𝑝𝑡x\sim\text{Univ}(\mathcal{X}_{G},p_{t})italic_x ∼ Univ ( caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) makes the correct decision at the first k𝑘kitalic_k critical positions appearing in the ordering. Bounding the length of time to complete step k𝑘kitalic_k is accomplished by combining Lemmas 2.23.2, and 4.5 to show that if sampled individuals are usually making the correct decision at the first k𝑘kitalic_k critical positions in the ordering, then the algorithm has a bias towards retaining individuals who also make the correct decision at the next critical position in the ordering. Note that this step-by-step process does not appear explicitly in the proof, but is implicit from the definition of a function g^^𝑔\hat{g}over^ start_ARG italic_g end_ARG measuring progress towards the optimality condition (see (12)).

Proof of Theorem 5.2.

First, let us introduce some further notation. Given a set VVsuperscript𝑉𝑉V^{\prime}\subseteq Vitalic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊆ italic_V we will write pt(u,V)=vVp(u,v)subscript𝑝𝑡𝑢superscript𝑉subscript𝑣superscript𝑉𝑝𝑢𝑣p_{t}(u,V^{\prime})=\sum_{v\in V^{\prime}}p(u,v)italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u , italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_p ( italic_u , italic_v ). Recalling that Algorithm 1 ensures that ptvInt(G)𝒫γ(F(v))subscript𝑝𝑡subscriptproduct𝑣Int𝐺subscript𝒫𝛾𝐹𝑣p_{t}\in\prod_{v\in\text{Int}(G)}\mathcal{P}_{\gamma}(F(v))italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ ∏ start_POSTSUBSCRIPT italic_v ∈ Int ( italic_G ) end_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_F ( italic_v ) ) at every step, we will write 𝒬=vInt(G)𝒫γ(F(v))𝒬subscriptproduct𝑣Int𝐺subscript𝒫𝛾𝐹𝑣\mathcal{Q}=\prod_{v\in\text{Int}(G)}\mathcal{P}_{\gamma}(F(v))caligraphic_Q = ∏ start_POSTSUBSCRIPT italic_v ∈ Int ( italic_G ) end_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT ( italic_F ( italic_v ) ). Let h:V0:𝑉subscript0h:V\to\mathbb{N}_{0}italic_h : italic_V → blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denote the Sprague-Grundy function, and let V0=h1({0})subscript𝑉0superscript10V_{0}=h^{-1}(\{0\})italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( { 0 } ) and V1=VV0subscript𝑉1𝑉subscript𝑉0V_{1}=V\setminus V_{0}italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_V ∖ italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. We also will assume that n3𝑛3n\geqslant 3italic_n ⩾ 3, as any impartial combinatorial game G𝐺Gitalic_G with n<3𝑛3n<3italic_n < 3 satisfies Opt(G)=𝒳GOpt𝐺subscript𝒳𝐺\text{Opt}(G)=\mathcal{X}_{G}Opt ( italic_G ) = caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT (such games satisfy Δ1Δ1\Delta\leqslant 1roman_Δ ⩽ 1, and so in fact |𝒳G|=1subscript𝒳𝐺1|\mathcal{X}_{G}|=1| caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT | = 1 in these cases).

Let u1,,unsubscript𝑢1subscript𝑢𝑛u_{1},\ldots,u_{n}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT be an ordering of V𝑉Vitalic_V such that F(ui){u1,,ui1}𝐹subscript𝑢𝑖subscript𝑢1subscript𝑢𝑖1F(u_{i})\subseteq\{u_{1},\ldots,u_{i-1}\}italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⊆ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } for every i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] (note that such an ordering exists as G𝐺Gitalic_G is assumed to be acyclic). Let us write Aisubscript𝐴𝑖A_{i}italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for the set of p𝒬𝑝𝒬p\in\mathcal{Q}italic_p ∈ caligraphic_Q such that p(u,V1)110n𝑝𝑢subscript𝑉1110𝑛p(u,V_{1})\leqslant\textstyle{\frac{1}{10n}}italic_p ( italic_u , italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩽ divide start_ARG 1 end_ARG start_ARG 10 italic_n end_ARG for all uWG{u1,,ui}𝑢subscript𝑊𝐺subscript𝑢1subscript𝑢𝑖u\in W_{G}\cap\{u_{1},\ldots,u_{i}\}italic_u ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }. If for some generation t𝑡titalic_t we have ptAnsubscript𝑝𝑡subscript𝐴𝑛p_{t}\in A_{n}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, then for every vWG𝑣subscript𝑊𝐺v\in W_{G}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT we have

qt(v)(V0)B4πγF(v)(qt(v))(V0)γ|F(v)|=pt(v,V0)γ|F(v)|1110nγΔ>115n.subscript𝑞𝑡𝑣subscript𝑉0B4superscriptsubscript𝜋𝛾𝐹𝑣subscript𝑞𝑡𝑣subscript𝑉0𝛾𝐹𝑣subscript𝑝𝑡𝑣subscript𝑉0𝛾𝐹𝑣1110𝑛𝛾Δ115𝑛q_{t}(v)(V_{0})\overset{\emph{\ref{pi-3}}}{\geqslant}\pi_{\gamma}^{F(v)}(q_{t}% (v))(V_{0})-\gamma|F(v)|=p_{t}(v,V_{0})-\gamma|F(v)|\geqslant 1-\textstyle{% \frac{1}{10n}}-\gamma\Delta>1-\textstyle{\frac{1}{5n}}.italic_q start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v ) ( italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) overOVERACCENT start_ARG ⩾ end_ARG italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_F ( italic_v ) end_POSTSUPERSCRIPT ( italic_q start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v ) ) ( italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_γ | italic_F ( italic_v ) | = italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_γ | italic_F ( italic_v ) | ⩾ 1 - divide start_ARG 1 end_ARG start_ARG 10 italic_n end_ARG - italic_γ roman_Δ > 1 - divide start_ARG 1 end_ARG start_ARG 5 italic_n end_ARG . (11)

Recalling from Lemma 2.2 that if x𝒳GOpt(G)𝑥subscript𝒳𝐺Opt𝐺x\in\mathcal{X}_{G}\setminus\text{Opt}(G)italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∖ Opt ( italic_G ) then x(v)V1𝑥𝑣subscript𝑉1x(v)\in V_{1}italic_x ( italic_v ) ∈ italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT for some vWG𝑣subscript𝑊𝐺v\in W_{G}italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, we can deduce

|{j[μ]:Pt(j)Opt(G)}|conditional-set𝑗delimited-[]𝜇subscript𝑃𝑡𝑗Opt𝐺\displaystyle|\{j\in[\mu]:P_{t}(j)\notin\text{Opt}(G)\}|| { italic_j ∈ [ italic_μ ] : italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_j ) ∉ Opt ( italic_G ) } | vWG|{j[μ]:Pt(j)(v)V1}|=vWGμqt(v)(V1)absentsubscript𝑣subscript𝑊𝐺conditional-set𝑗delimited-[]𝜇subscript𝑃𝑡𝑗𝑣subscript𝑉1subscript𝑣subscript𝑊𝐺𝜇subscript𝑞𝑡𝑣subscript𝑉1\displaystyle\leqslant\sum_{v\in W_{G}}|\{j\in[\mu]:P_{t}(j)(v)\in V_{1}\}|=% \sum_{v\in W_{G}}\mu\cdot q_{t}(v)(V_{1})⩽ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT | { italic_j ∈ [ italic_μ ] : italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_j ) ( italic_v ) ∈ italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT } | = ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_μ ⋅ italic_q start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v ) ( italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT )
=vWGμ(1qt(v)(V0))<(11)|WG|μ5nμ5<μ,absentsubscript𝑣subscript𝑊𝐺𝜇1subscript𝑞𝑡𝑣subscript𝑉0italic-(11italic-)subscript𝑊𝐺𝜇5𝑛𝜇5𝜇\displaystyle=\sum_{v\in W_{G}}\mu\cdot(1-q_{t}(v)(V_{0}))\overset{\eqref{eq:q% -opt-bound}}{<}\frac{|W_{G}|\mu}{5n}\leqslant\frac{\mu}{5}<\mu,= ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_μ ⋅ ( 1 - italic_q start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v ) ( italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG < end_ARG divide start_ARG | italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT | italic_μ end_ARG start_ARG 5 italic_n end_ARG ⩽ divide start_ARG italic_μ end_ARG start_ARG 5 end_ARG < italic_μ ,

and hence PtOpt(G)subscript𝑃𝑡Opt𝐺P_{t}\cap\text{Opt}(G)\neq\emptysetitalic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∩ Opt ( italic_G ) ≠ ∅. In particular, if T=min{t:ptAn}superscript𝑇:𝑡subscript𝑝𝑡subscript𝐴𝑛T^{\ast}=\min{\{t:p_{t}\in A_{n}\}}italic_T start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_min { italic_t : italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } then T𝒜G(Opt(G))μTsuperscriptsubscript𝑇𝒜𝐺Opt𝐺𝜇superscript𝑇T_{\mathcal{A}}^{G}(\text{Opt}(G))\leqslant\mu\cdot T^{\ast}italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT ( Opt ( italic_G ) ) ⩽ italic_μ ⋅ italic_T start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.

We will define a map g^:𝒬0:^𝑔𝒬subscriptabsent0\hat{g}:\mathcal{Q}\to\mathbb{R}_{\geqslant 0}over^ start_ARG italic_g end_ARG : caligraphic_Q → blackboard_R start_POSTSUBSCRIPT ⩾ 0 end_POSTSUBSCRIPT that will measure progress towards Ansubscript𝐴𝑛A_{n}italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. To do this, first let g:[γ,1γ]0:𝑔𝛾1𝛾subscriptabsent0g:[\gamma,1-\gamma]\to\mathbb{R}_{\geqslant 0}italic_g : [ italic_γ , 1 - italic_γ ] → blackboard_R start_POSTSUBSCRIPT ⩾ 0 end_POSTSUBSCRIPT be given by

g(y)=log(y1y)log(γ1γ),𝑔𝑦𝑦1𝑦𝛾1𝛾g(y)=\log{\left(\frac{y}{1-y}\right)}-\log{\left(\frac{\gamma}{1-\gamma}\right% )},italic_g ( italic_y ) = roman_log ( divide start_ARG italic_y end_ARG start_ARG 1 - italic_y end_ARG ) - roman_log ( divide start_ARG italic_γ end_ARG start_ARG 1 - italic_γ end_ARG ) ,

so that g𝑔gitalic_g is a monotone increasing function. Then, given p𝒬𝑝𝒬p\in\mathcal{Q}italic_p ∈ caligraphic_Q, let (p)=max{i[n]:pAi}𝑝:𝑖delimited-[]𝑛𝑝subscript𝐴𝑖\ell(p)=\max{\{i\in[n]:p\in A_{i}\}}roman_ℓ ( italic_p ) = roman_max { italic_i ∈ [ italic_n ] : italic_p ∈ italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } and define

g^(p)={i[(p)]1(uiWG)(g(1γ)(32γs(ui))+1)+g(p(u(p)+1,V0))(32γs(u(p)+1))if (p)<n,i[(p)]1(uiWG)(g(1γ)(32γs(ui))+1)if (p)=n.^𝑔𝑝casessubscript𝑖delimited-[]𝑝1subscript𝑢𝑖subscript𝑊𝐺𝑔1𝛾32superscript𝛾𝑠subscript𝑢𝑖1𝑔𝑝subscript𝑢𝑝1subscript𝑉032superscript𝛾𝑠subscript𝑢𝑝1if (p)<n,subscript𝑖delimited-[]𝑝1subscript𝑢𝑖subscript𝑊𝐺𝑔1𝛾32superscript𝛾𝑠subscript𝑢𝑖1if (p)=n.\hat{g}(p)=\begin{cases}\scalebox{0.85}{$\displaystyle\sum_{i\in[\ell(p)]}% \text{1}(u_{i}\in W_{G})\cdot\left(g(1-\gamma)\cdot\left(\frac{32}{\gamma^{s(u% _{i})}}\right)+1\right)+g(p(u_{\ell(p)+1},V_{0}))\cdot\left(\frac{32}{\gamma^{% s(u_{\ell(p)+1})}}\right)$}&\text{if $\ell(p)<n$,}\\ \scalebox{0.85}{$\displaystyle\sum_{i\in[\ell(p)]}\text{1}(u_{i}\in W_{G})% \cdot\left(g(1-\gamma)\cdot\left(\frac{32}{\gamma^{s(u_{i})}}\right)+1\right)$% }&\text{if $\ell(p)=n$.}\end{cases}over^ start_ARG italic_g end_ARG ( italic_p ) = { start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_i ∈ [ roman_ℓ ( italic_p ) ] end_POSTSUBSCRIPT 1 ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ⋅ ( italic_g ( 1 - italic_γ ) ⋅ ( divide start_ARG 32 end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) + 1 ) + italic_g ( italic_p ( italic_u start_POSTSUBSCRIPT roman_ℓ ( italic_p ) + 1 end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ⋅ ( divide start_ARG 32 end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT roman_ℓ ( italic_p ) + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) end_CELL start_CELL if roman_ℓ ( italic_p ) < italic_n , end_CELL end_ROW start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_i ∈ [ roman_ℓ ( italic_p ) ] end_POSTSUBSCRIPT 1 ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ⋅ ( italic_g ( 1 - italic_γ ) ⋅ ( divide start_ARG 32 end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) + 1 ) end_CELL start_CELL if roman_ℓ ( italic_p ) = italic_n . end_CELL end_ROW (12)

Define also gmax=maxp𝒬g^(p)subscript𝑔maxsubscript𝑝𝒬^𝑔𝑝g_{\text{max}}=\max_{p\in\mathcal{Q}}\hat{g}(p)italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT italic_p ∈ caligraphic_Q end_POSTSUBSCRIPT over^ start_ARG italic_g end_ARG ( italic_p ), and note that pAn𝑝subscript𝐴𝑛p\in A_{n}italic_p ∈ italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT if and only if g^(p)=gmax^𝑔𝑝subscript𝑔max\hat{g}(p)=g_{\text{max}}over^ start_ARG italic_g end_ARG ( italic_p ) = italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT. The motivation for the function g^:𝒬0:^𝑔𝒬subscriptabsent0\hat{g}:\mathcal{Q}\to\mathbb{R}_{\geqslant 0}over^ start_ARG italic_g end_ARG : caligraphic_Q → blackboard_R start_POSTSUBSCRIPT ⩾ 0 end_POSTSUBSCRIPT is that the value of g^(pt)^𝑔subscript𝑝𝑡\hat{g}(p_{t})over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) increases as ptsubscript𝑝𝑡p_{t}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT moves through 𝒬𝒬\mathcal{Q}caligraphic_Q towards Ansubscript𝐴𝑛A_{n}italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Indeed, the first term of (12) is a summation depending on (p)𝑝\ell(p)roman_ℓ ( italic_p ) only; its role ensures that g(p)g(p)𝑔𝑝𝑔superscript𝑝g(p)\geqslant g(p^{\prime})italic_g ( italic_p ) ⩾ italic_g ( italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) whenever pAi𝑝subscript𝐴𝑖p\in A_{i}italic_p ∈ italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and pAisuperscript𝑝subscript𝐴𝑖p^{\prime}\notin A_{i}italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∉ italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and hence g^^𝑔\hat{g}over^ start_ARG italic_g end_ARG increases true to the sequence A0A1Ansuperset-of-or-equalssubscript𝐴0subscript𝐴1superset-of-or-equalssuperset-of-or-equalssubscript𝐴𝑛A_{0}\supseteq A_{1}\supseteq\ldots\supseteq A_{n}italic_A start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊇ italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⊇ … ⊇ italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. The second term measures progress within some Aisubscript𝐴𝑖A_{i}italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as we move towards Ai+1subscript𝐴𝑖1A_{i+1}italic_A start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT (it increases as the value of pt(ui,V1)subscript𝑝𝑡subscript𝑢𝑖subscript𝑉1p_{t}(u_{i},V_{1})italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) decreases toward 110n110𝑛\textstyle{\frac{1}{10n}}divide start_ARG 1 end_ARG start_ARG 10 italic_n end_ARG).

Denote Xt(i)=g(pt(ui,V0))subscript𝑋𝑡𝑖𝑔subscript𝑝𝑡subscript𝑢𝑖subscript𝑉0X_{t}(i)=g(p_{t}(u_{i},V_{0}))italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) = italic_g ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ). We will later show the following two claims, where the second is a direct consequence of the first.

Claim 5.3.

If ptAi1subscript𝑝𝑡subscript𝐴𝑖1p_{t}\in A_{i-1}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT and uiWGsubscript𝑢𝑖subscript𝑊𝐺u_{i}\in W_{G}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, then

(Xt+1(i)min{g(12Δγ),Xt(i)+γs(ui)/32})nK3s^5.subscript𝑋𝑡1𝑖𝑔12Δ𝛾subscript𝑋𝑡𝑖superscript𝛾𝑠subscript𝑢𝑖32superscript𝑛𝐾3^𝑠5\mathbb{P}(X_{t+1}(i)\leqslant\min{\{g(1-2\Delta\gamma),X_{t}(i)+\gamma^{s(u_{% i})}/32\}})\leqslant n^{-K-3\hat{s}-5}.blackboard_P ( italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) ⩽ roman_min { italic_g ( 1 - 2 roman_Δ italic_γ ) , italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) + italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 32 } ) ⩽ italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 5 end_POSTSUPERSCRIPT .
Claim 5.4.

If ptAnsubscript𝑝𝑡subscript𝐴𝑛p_{t}\notin A_{n}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∉ italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, then (g^(pt+1)g^(pt)+1)1nK3s^4^𝑔subscript𝑝𝑡1^𝑔subscript𝑝𝑡11superscript𝑛𝐾3^𝑠4\mathbb{P}(\hat{g}(p_{t+1})\geqslant\hat{g}(p_{t})+1)\geqslant 1-n^{-K-3\hat{s% }-4}blackboard_P ( over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) ⩾ over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + 1 ) ⩾ 1 - italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 4 end_POSTSUPERSCRIPT.

Claim 5.4 asserts that if ptsubscript𝑝𝑡p_{t}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT has not yet reached Ansubscript𝐴𝑛A_{n}italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, then we should expect the value of g^(pt)^𝑔subscript𝑝𝑡\hat{g}(p_{t})over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) to increase by at least 1111 in the next generation. However, g^(pt)^𝑔subscript𝑝𝑡\hat{g}(p_{t})over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) cannot increase by at least 1111 more than gmaxsubscript𝑔max\lfloor g_{\text{max}}\rfloor⌊ italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT ⌋ times. Precisely, if T>gmaxsuperscript𝑇subscript𝑔maxT^{\ast}>g_{\text{max}}italic_T start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT > italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT then we must have ptAnsubscript𝑝𝑡subscript𝐴𝑛p_{t}\notin A_{n}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∉ italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and g^(pt)<gmax^𝑔subscript𝑝𝑡subscript𝑔max\hat{g}(p_{t})<g_{\text{max}}over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) < italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT for every tgmax𝑡subscript𝑔maxt\leqslant g_{\text{max}}italic_t ⩽ italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT. In particular, it would then hold that g^(pt)<g^(pt1)+1^𝑔subscript𝑝𝑡^𝑔subscript𝑝𝑡11\hat{g}(p_{t})<\hat{g}(p_{t-1})+1over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) < over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) + 1 for some t[gmax]𝑡delimited-[]subscript𝑔maxt\in[\lfloor g_{\text{max}}\rfloor]italic_t ∈ [ ⌊ italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT ⌋ ]. Therefore, using a union bound with Claim 5.4, we have

[T>gmax]gmaxnK3s^4.delimited-[]superscript𝑇subscript𝑔maxsubscript𝑔maxsuperscript𝑛𝐾3^𝑠4\mathbb{P}[T^{\ast}>g_{\text{max}}]\leqslant g_{\text{max}}\cdot n^{-K-3\hat{s% }-4}.blackboard_P [ italic_T start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT > italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT ] ⩽ italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT ⋅ italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 4 end_POSTSUPERSCRIPT . (13)

Noting (using Lemma A.3) that

g(1γ)F32log(1/γ)=2log(20Δn)5logn1,𝑔1𝛾F321𝛾220Δ𝑛5𝑛1g(1-\gamma)\overset{\emph{\ref{g-simp-3}}}{\leqslant}2\log{(1/\gamma)}=2\log{(% 20\Delta n)}\leqslant 5\log{n}-1,italic_g ( 1 - italic_γ ) overOVERACCENT start_ARG ⩽ end_ARG 2 roman_log ( 1 / italic_γ ) = 2 roman_log ( 20 roman_Δ italic_n ) ⩽ 5 roman_log italic_n - 1 , (14)

we can bound

gmax=vWG(g(1γ)(32γs(v))+1)subscript𝑔maxsubscript𝑣subscript𝑊𝐺𝑔1𝛾32superscript𝛾𝑠𝑣1\displaystyle g_{\text{max}}=\sum_{v\in W_{G}}\left(g(1-\gamma)\cdot\left(% \frac{32}{\gamma^{s(v)}}\right)+1\right)italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_g ( 1 - italic_γ ) ⋅ ( divide start_ARG 32 end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT end_ARG ) + 1 ) (14)vWG(5logn)32(20Δn)s(v)italic-(14italic-)subscript𝑣subscript𝑊𝐺5𝑛32superscript20Δ𝑛𝑠𝑣\displaystyle\overset{\eqref{eq:g-gamma}}{\leqslant}\sum_{v\in W_{G}}(5\log{n}% )\cdot 32\cdot(20\Delta n)^{s(v)}start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 5 roman_log italic_n ) ⋅ 32 ⋅ ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT
<CvWG(20Δn)s(v)lognabsent𝐶subscript𝑣subscript𝑊𝐺superscript20Δ𝑛𝑠𝑣𝑛\displaystyle\hskip 2.84544pt<C\sum_{v\in W_{G}}(20\Delta n)^{s(v)}\log{n}< italic_C ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT roman_log italic_n (15)
C|WG|(20Δn)s^lognn3s^+4.absent𝐶subscript𝑊𝐺superscript20Δ𝑛^𝑠𝑛superscript𝑛3^𝑠4\displaystyle\hskip 2.84544pt\leqslant C|W_{G}|(20\Delta n)^{\hat{s}}\log{n}% \leqslant n^{3\hat{s}+4}.⩽ italic_C | italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT | ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT over^ start_ARG italic_s end_ARG end_POSTSUPERSCRIPT roman_log italic_n ⩽ italic_n start_POSTSUPERSCRIPT 3 over^ start_ARG italic_s end_ARG + 4 end_POSTSUPERSCRIPT . (16)

We now have

[T𝒜G(Opt(G))Cμ\displaystyle\mathbb{P}\Bigl{[}T_{\mathcal{A}}^{G}(\text{Opt}(G))\geqslant C\mublackboard_P [ italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT ( Opt ( italic_G ) ) ⩾ italic_C italic_μ vWG(20Δn)s(v)logn][TCvWG(20Δn)s(v)logn]\displaystyle\sum_{v\in W_{G}}(20\Delta n)^{s(v)}\log{n}\Bigr{]}\leqslant% \mathbb{P}\Bigl{[}T^{\ast}\geqslant C\sum_{v\in W_{G}}(20\Delta n)^{s(v)}\log{% n}\Bigr{]}∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT roman_log italic_n ] ⩽ blackboard_P [ italic_T start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ⩾ italic_C ∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT roman_log italic_n ]
(15)[T>gmax](13)gmaxnK3s^4(16)n3s^+4nK3s^4=nK,italic-(15italic-)delimited-[]superscript𝑇subscript𝑔maxitalic-(13italic-)subscript𝑔maxsuperscript𝑛𝐾3^𝑠4italic-(16italic-)superscript𝑛3^𝑠4superscript𝑛𝐾3^𝑠4superscript𝑛𝐾\displaystyle\overset{\eqref{eq:df1}}{\leqslant}\mathbb{P}\left[T^{\ast}>g_{% \text{max}}\right]\overset{\eqref{eq:T-ast-tau}}{\leqslant}g_{\text{max}}\cdot n% ^{-K-3\hat{s}-4}\overset{\eqref{eq:df2}}{\leqslant}n^{3\hat{s}+4}\cdot n^{-K-3% \hat{s}-4}=n^{-K},start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG blackboard_P [ italic_T start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT > italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT ] start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG italic_g start_POSTSUBSCRIPT max end_POSTSUBSCRIPT ⋅ italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 4 end_POSTSUPERSCRIPT start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG italic_n start_POSTSUPERSCRIPT 3 over^ start_ARG italic_s end_ARG + 4 end_POSTSUPERSCRIPT ⋅ italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 4 end_POSTSUPERSCRIPT = italic_n start_POSTSUPERSCRIPT - italic_K end_POSTSUPERSCRIPT ,

as required. Therefore, all that remains is to prove Claims 5.3 and 5.4.

Proof of Claim 5.3.

Assume x,yUniv(𝒳G,pt)similar-to𝑥𝑦Univsubscript𝒳𝐺subscript𝑝𝑡x,y\sim\text{Univ}(\mathcal{X}_{G},p_{t})italic_x , italic_y ∼ Univ ( caligraphic_X start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) are independent. To assist with this claim, we will introduce some further notation. Let r=(uiPathG(x,y))𝑟subscript𝑢𝑖subscriptPath𝐺𝑥𝑦r=\mathbb{P}(u_{i}\in\text{Path}_{G}(x,y))italic_r = blackboard_P ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ), and note that from Lemma 4.5 we have

rγs(ui).𝑟superscript𝛾𝑠subscript𝑢𝑖r\geqslant\gamma^{s(u_{i})}.italic_r ⩾ italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT . (17)

Given wV𝑤𝑉w\in Vitalic_w ∈ italic_V, let us write Nwsubscript𝑁𝑤N_{w}italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT as a shorthand for the event fGw(x,y)=1superscriptsubscript𝑓𝐺𝑤𝑥𝑦1f_{G}^{w}(x,y)=1italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1, and note that because x𝑥xitalic_x and y𝑦yitalic_y are independent and identically distributed,

(Nw)=(fGw(x,y)=1)=(fGw(y,x)=1).subscript𝑁𝑤superscriptsubscript𝑓𝐺𝑤𝑥𝑦1superscriptsubscript𝑓𝐺𝑤𝑦𝑥1\mathbb{P}(N_{w})=\mathbb{P}(f_{G}^{w}(x,y)=1)=\mathbb{P}(f_{G}^{w}(y,x)=1).blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) = blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) = blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ( italic_y , italic_x ) = 1 ) . (18)

Finally, let us also write F0=F(ui)V0subscript𝐹0𝐹subscript𝑢𝑖subscript𝑉0F_{0}=F(u_{i})\cap V_{0}italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∩ italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and F1=F(ui)V1subscript𝐹1𝐹subscript𝑢𝑖subscript𝑉1F_{1}=F(u_{i})\cap V_{1}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∩ italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

Given vV𝑣𝑉v\in Vitalic_v ∈ italic_V, we wish to consider (z(ui)=v)𝑧subscript𝑢𝑖𝑣\mathbb{P}(z(u_{i})=v)blackboard_P ( italic_z ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_v ), where z𝑧zitalic_z is the winner of the game G𝐺Gitalic_G played between x𝑥xitalic_x and y𝑦yitalic_y (as in Lemma 3.2). This will be useful, as the individuals Pt+1(1),,Pt+1(μ)subscript𝑃𝑡11subscript𝑃𝑡1𝜇P_{t+1}(1),\ldots,P_{t+1}(\mu)italic_P start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( 1 ) , … , italic_P start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_μ ) selected in lines 10-17 of Algorithm 1 are independent and with the same distribution as z𝑧zitalic_z, and hence for every vV𝑣𝑉v\in Vitalic_v ∈ italic_V,

μqt+1(ui,v)Bin(μ,(z(ui)=v)).similar-to𝜇subscript𝑞𝑡1subscript𝑢𝑖𝑣Bin𝜇𝑧subscript𝑢𝑖𝑣\mu\cdot q_{t+1}(u_{i},v)\sim\text{Bin}(\mu,\mathbb{P}(z(u_{i})=v)).italic_μ ⋅ italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ∼ Bin ( italic_μ , blackboard_P ( italic_z ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_v ) ) . (19)

To analyse (z(ui)=v)𝑧subscript𝑢𝑖𝑣\mathbb{P}(z(u_{i})=v)blackboard_P ( italic_z ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_v ), first note that we have

1(Nui)=(fGui(x,y)=1)=wF(ui)pt(ui,w)(fGw(y,x)=1)=(18)wF(ui)pt(ui,w)(Nw).1subscript𝑁subscript𝑢𝑖superscriptsubscript𝑓𝐺subscript𝑢𝑖𝑥𝑦1subscript𝑤𝐹subscript𝑢𝑖subscript𝑝𝑡subscript𝑢𝑖𝑤superscriptsubscript𝑓𝐺𝑤𝑦𝑥1italic-(18italic-)subscript𝑤𝐹subscript𝑢𝑖subscript𝑝𝑡subscript𝑢𝑖𝑤subscript𝑁𝑤1-\mathbb{P}(N_{u_{i}})=\mathbb{P}(f_{G}^{u_{i}}(x,y)=-1)=\sum_{w\in F(u_{i})}% p_{t}(u_{i},w)\mathbb{P}(f_{G}^{w}(y,x)=1)\overset{\eqref{eq:Nw-equiv}}{=}\sum% _{w\in F(u_{i})}p_{t}(u_{i},w)\mathbb{P}(N_{w}).1 - blackboard_P ( italic_N start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 ) = ∑ start_POSTSUBSCRIPT italic_w ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_w ) blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ( italic_y , italic_x ) = 1 ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG ∑ start_POSTSUBSCRIPT italic_w ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_w ) blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) . (20)

Therefore, applying Lemma 3.2 with u=ui𝑢subscript𝑢𝑖u=u_{i}italic_u = italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT,

(z(ui)=v)𝑧subscript𝑢𝑖𝑣\displaystyle\mathbb{P}(z(u_{i})=v)blackboard_P ( italic_z ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_v ) =(3)pt(ui,v)[1+r(1(Nv)(Nui))]italic-(3italic-)subscript𝑝𝑡subscript𝑢𝑖𝑣delimited-[]1𝑟1subscript𝑁𝑣subscript𝑁subscript𝑢𝑖\displaystyle\overset{\eqref{eq:sampled-individual}}{=}p_{t}(u_{i},v)\cdot[1+r% \cdot(1-\mathbb{P}(N_{v})-\mathbb{P}(N_{u_{i}}))]start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 + italic_r ⋅ ( 1 - blackboard_P ( italic_N start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) - blackboard_P ( italic_N start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ]
=(20)pt(ui,v)[1+r((Nv)+wF(ui)pt(ui,w)(Nw))]italic-(20italic-)subscript𝑝𝑡subscript𝑢𝑖𝑣delimited-[]1𝑟subscript𝑁𝑣subscript𝑤𝐹subscript𝑢𝑖subscript𝑝𝑡subscript𝑢𝑖𝑤subscript𝑁𝑤\displaystyle\overset{\eqref{eq:Nui-expansion}}{=}p_{t}(u_{i},v)\cdot\Bigl{[}1% +r\cdot\Bigl{(}-\mathbb{P}(N_{v})+\sum_{w\in F(u_{i})}p_{t}(u_{i},w)\mathbb{P}% (N_{w})\Bigr{)}\Bigr{]}start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG = end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 + italic_r ⋅ ( - blackboard_P ( italic_N start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_w ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_w ) blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) ) ]
=pt(ui,v)[1+r((Nv)+wF0pt(ui,w)(Nw)+wF1pt(ui,w)(Nw))].absentsubscript𝑝𝑡subscript𝑢𝑖𝑣delimited-[]1𝑟subscript𝑁𝑣subscript𝑤subscript𝐹0subscript𝑝𝑡subscript𝑢𝑖𝑤subscript𝑁𝑤subscript𝑤subscript𝐹1subscript𝑝𝑡subscript𝑢𝑖𝑤subscript𝑁𝑤\displaystyle=p_{t}(u_{i},v)\cdot\Bigl{[}1+r\cdot\Bigl{(}-\mathbb{P}(N_{v})+% \sum_{w\in F_{0}}p_{t}(u_{i},w)\mathbb{P}(N_{w})+\sum_{w\in F_{1}}p_{t}(u_{i},% w)\mathbb{P}(N_{w})\Bigr{)}\Bigr{]}.= italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 + italic_r ⋅ ( - blackboard_P ( italic_N start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_w ∈ italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_w ) blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_w ∈ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_w ) blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) ) ] . (21)

In particular, we also have

(z(ui)=v)(21)p(ui,v)[1r].𝑧subscript𝑢𝑖𝑣italic-(21italic-)𝑝subscript𝑢𝑖𝑣delimited-[]1𝑟\mathbb{P}(z(u_{i})=v)\overset{\eqref{eq:cg-sel-press}}{\geqslant}p(u_{i},v)% \cdot[1-r].blackboard_P ( italic_z ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_v ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩾ end_ARG italic_p ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 - italic_r ] . (22)

Next, we would like to place some simple bounds on (Nw)subscript𝑁𝑤\mathbb{P}(N_{w})blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) for wF0F1𝑤subscript𝐹0subscript𝐹1w\in F_{0}\cup F_{1}italic_w ∈ italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∪ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. If w{u1,,ui1}𝑤subscript𝑢1subscript𝑢𝑖1w\in\{u_{1},\ldots,u_{i-1}\}italic_w ∈ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } satisfies h(w)0𝑤0h(w)\neq 0italic_h ( italic_w ) ≠ 0, then by using the fact that ptAi1subscript𝑝𝑡subscript𝐴𝑖1p_{t}\in A_{i-1}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT,

(Nw)subscript𝑁𝑤\displaystyle\mathbb{P}(N_{w})blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) =(fGw(x,y)=1)Lemma 2.2(h(x(v))=0 for all vWG{u1,,ui1})absentsuperscriptsubscript𝑓𝐺𝑤𝑥𝑦1Lemma 2.2h(x(v))=0 for all vWG{u1,,ui1}\displaystyle=\mathbb{P}(f_{G}^{w}(x,y)=1)\overset{\text{Lemma~{}\ref{lm:% optimality-characterisation}}}{\geqslant}\mathbb{P}(\text{$h(x(v))=0$ for all % $v\in W_{G}\cap\{u_{1},\ldots,u_{i-1}\}$})= blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ( italic_x , italic_y ) = 1 ) overLemma start_ARG ⩾ end_ARG blackboard_P ( italic_h ( italic_x ( italic_v ) ) = 0 for all italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } )
=vWG{u1,,ui1}pt(v,V0)=vWG{u1,,ui1}(1pt(v,V1))absentsubscriptproduct𝑣subscript𝑊𝐺subscript𝑢1subscript𝑢𝑖1subscript𝑝𝑡𝑣subscript𝑉0subscriptproduct𝑣subscript𝑊𝐺subscript𝑢1subscript𝑢𝑖11subscript𝑝𝑡𝑣subscript𝑉1\displaystyle=\prod_{v\in W_{G}\cap\{u_{1},\ldots,u_{i-1}\}}p_{t}(v,V_{0})=% \prod_{v\in W_{G}\cap\{u_{1},\ldots,u_{i-1}\}}(1-p_{t}(v,V_{1}))= ∏ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = ∏ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ( 1 - italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v , italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) )
(1110n)n910.absentsuperscript1110𝑛𝑛910\displaystyle\geqslant(1-\textstyle{\frac{1}{10n}})^{n}\geqslant\textstyle{% \frac{9}{10}}.⩾ ( 1 - divide start_ARG 1 end_ARG start_ARG 10 italic_n end_ARG ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ⩾ divide start_ARG 9 end_ARG start_ARG 10 end_ARG .

On the other hand, if w{u1,,ui1}𝑤subscript𝑢1subscript𝑢𝑖1w\in\{u_{1},\ldots,u_{i-1}\}italic_w ∈ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } satisfies h(w)=0𝑤0h(w)=0italic_h ( italic_w ) = 0, then by using the fact that ptAi1subscript𝑝𝑡subscript𝐴𝑖1p_{t}\in A_{i-1}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT,

1(Nw)1subscript𝑁𝑤\displaystyle 1-\mathbb{P}(N_{w})1 - blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) =(fGw(x,y)=1)Lemma 2.2(h(y(v))=0 for all vWG{u1,,ui1})910.absentsuperscriptsubscript𝑓𝐺𝑤𝑥𝑦1Lemma 2.2h(y(v))=0 for all vWG{u1,,ui1}910\displaystyle=\mathbb{P}(f_{G}^{w}(x,y)=-1)\overset{\text{Lemma~{}\ref{lm:% optimality-characterisation}}}{\geqslant}\mathbb{P}(\text{$h(y(v))=0$ for all % $v\in W_{G}\cap\{u_{1},\ldots,u_{i-1}\}$})\geqslant\textstyle{\frac{9}{10}}.= blackboard_P ( italic_f start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ( italic_x , italic_y ) = - 1 ) overLemma start_ARG ⩾ end_ARG blackboard_P ( italic_h ( italic_y ( italic_v ) ) = 0 for all italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ∩ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT } ) ⩾ divide start_ARG 9 end_ARG start_ARG 10 end_ARG .

In summary, we have

(Nw)110subscript𝑁𝑤110\displaystyle\mathbb{P}(N_{w})\leqslant\textstyle{\frac{1}{10}}blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) ⩽ divide start_ARG 1 end_ARG start_ARG 10 end_ARG whenever wF0,whenever wF0\displaystyle\qquad\text{whenever $w\in F_{0}$},whenever italic_w ∈ italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , (23)
(Nw)910subscript𝑁𝑤910\displaystyle\mathbb{P}(N_{w})\geqslant\textstyle{\frac{9}{10}}blackboard_P ( italic_N start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) ⩾ divide start_ARG 9 end_ARG start_ARG 10 end_ARG whenever wF1.whenever wF1\displaystyle\qquad\text{whenever $w\in F_{1}$}.whenever italic_w ∈ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT . (24)

Finally, we will apply Corollary A.2 to establish that certain events occur with very low probability. A straightforward numerical manipulation we will use after each application is that for every vF(ui)𝑣𝐹subscript𝑢𝑖v\in F(u_{i})italic_v ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), because pt(ui,v)γsubscript𝑝𝑡subscript𝑢𝑖𝑣𝛾p_{t}(u_{i},v)\geqslant\gammaitalic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⩾ italic_γ,

exp(r2μpt(ui,v)/168(1+r/4))superscript𝑟2𝜇subscript𝑝𝑡subscript𝑢𝑖𝑣1681𝑟4\displaystyle\exp{\left(-\frac{r^{2}\mu p_{t}(u_{i},v)/16}{8(1+r/4)}\right)}roman_exp ( - divide start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) / 16 end_ARG start_ARG 8 ( 1 + italic_r / 4 ) end_ARG ) (17)exp(γ2s(ui)+1μ200)exp(γ2s^+1μ200)italic-(17italic-)superscript𝛾2𝑠subscript𝑢𝑖1𝜇200superscript𝛾2^𝑠1𝜇200\displaystyle\overset{\eqref{eq:r-bound}}{\leqslant}\exp{\left(-\frac{\gamma^{% 2s(u_{i})+1}\mu}{200}\right)}\leqslant\exp{\left(-\frac{\gamma^{2\hat{s}+1}\mu% }{200}\right)}start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG roman_exp ( - divide start_ARG italic_γ start_POSTSUPERSCRIPT 2 italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + 1 end_POSTSUPERSCRIPT italic_μ end_ARG start_ARG 200 end_ARG ) ⩽ roman_exp ( - divide start_ARG italic_γ start_POSTSUPERSCRIPT 2 over^ start_ARG italic_s end_ARG + 1 end_POSTSUPERSCRIPT italic_μ end_ARG start_ARG 200 end_ARG )
(10)exp(C(K+s^+1)logn200)12nK3s^6.italic-(10italic-)𝐶𝐾^𝑠1𝑛20012superscript𝑛𝐾3^𝑠6\displaystyle\overset{\eqref{eq:mu-lower}}{\leqslant}\exp{\left(-\frac{C(K+% \hat{s}+1)\log{n}}{200}\right)}\leqslant\frac{1}{2}n^{-K-3\hat{s}-6}.start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG roman_exp ( - divide start_ARG italic_C ( italic_K + over^ start_ARG italic_s end_ARG + 1 ) roman_log italic_n end_ARG start_ARG 200 end_ARG ) ⩽ divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 6 end_POSTSUPERSCRIPT . (25)

We now complete the proof of the claim by dividing into two cases. Note that the properties E1-E3 and F1-F2 quoted hereafter are from the results of Section A.

Case 1: pt(ui,F1)12subscript𝑝𝑡subscript𝑢𝑖subscript𝐹112p_{t}(u_{i},F_{1})\leqslant\textstyle{\frac{1}{2}}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩽ divide start_ARG 1 end_ARG start_ARG 2 end_ARG. If vF1𝑣subscript𝐹1v\in F_{1}italic_v ∈ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, then

(z(ui)=v)𝑧subscript𝑢𝑖𝑣\displaystyle\mathbb{P}(z(u_{i})=v)blackboard_P ( italic_z ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_v ) (21),(23),(24)pt(ui,v)[1+r(910+110pt(ui,F0)+pt(ui,F1))]italic-(21italic-)italic-(23italic-)italic-(24italic-)subscript𝑝𝑡subscript𝑢𝑖𝑣delimited-[]1𝑟910110subscript𝑝𝑡subscript𝑢𝑖subscript𝐹0subscript𝑝𝑡subscript𝑢𝑖subscript𝐹1\displaystyle\overset{\eqref{eq:cg-sel-press},\eqref{eq:Nw-F0},\eqref{eq:Nw-F1% }}{\leqslant}p_{t}(u_{i},v)\cdot[1+r\cdot(-\textstyle{\frac{9}{10}}+\textstyle% {\frac{1}{10}}p_{t}(u_{i},F_{0})+p_{t}(u_{i},F_{1}))]start_OVERACCENT italic_( italic_) , italic_( italic_) , italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 + italic_r ⋅ ( - divide start_ARG 9 end_ARG start_ARG 10 end_ARG + divide start_ARG 1 end_ARG start_ARG 10 end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) + italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) ]
pt(ui,v)[114r].absentsubscript𝑝𝑡subscript𝑢𝑖𝑣delimited-[]114𝑟\displaystyle\hskip 19.91684pt\leqslant p_{t}(u_{i},v)\cdot[1-\textstyle{\frac% {1}{4}}r].⩽ italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 - divide start_ARG 1 end_ARG start_ARG 4 end_ARG italic_r ] . (26)

By using (19) and (26) to apply E2 with α=r/4𝛼𝑟4\alpha=r/4italic_α = italic_r / 4, it holds for any fixed vF1𝑣subscript𝐹1v\in F_{1}italic_v ∈ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT that

(qt+1(ui,v)>(1r/8)pt(ui,v))exp(r2μpt(ui,v)/168(1+r/4))(5)12nK3s^6nK3s^6.subscript𝑞𝑡1subscript𝑢𝑖𝑣1𝑟8subscript𝑝𝑡subscript𝑢𝑖𝑣superscript𝑟2𝜇subscript𝑝𝑡subscript𝑢𝑖𝑣1681𝑟4italic-(5italic-)12superscript𝑛𝐾3^𝑠6superscript𝑛𝐾3^𝑠6\mathbb{P}(q_{t+1}(u_{i},v)>(1-r/8)p_{t}(u_{i},v))\leqslant\exp{\left(-\frac{r% ^{2}\mu p_{t}(u_{i},v)/16}{8(1+r/4)}\right)}\overset{\eqref{eq:bb-manip}}{% \leqslant}\frac{1}{2}n^{-K-3\hat{s}-6}\leqslant n^{-K-3\hat{s}-6}.blackboard_P ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) > ( 1 - italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ) ⩽ roman_exp ( - divide start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) / 16 end_ARG start_ARG 8 ( 1 + italic_r / 4 ) end_ARG ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 6 end_POSTSUPERSCRIPT ⩽ italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 6 end_POSTSUPERSCRIPT .

Therefore, by taking a union bound over F1subscript𝐹1F_{1}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, it occurs with probability at least 1nK3s^51superscript𝑛𝐾3^𝑠51-n^{-K-3\hat{s}-5}1 - italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 5 end_POSTSUPERSCRIPT that

qt+1(ui,v)(1r/8)pt(ui,v)subscript𝑞𝑡1subscript𝑢𝑖𝑣1𝑟8subscript𝑝𝑡subscript𝑢𝑖𝑣\displaystyle q_{t+1}(u_{i},v)\leqslant(1-r/8)p_{t}(u_{i},v)italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⩽ ( 1 - italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) for every vF1𝑣subscript𝐹1v\in F_{1}italic_v ∈ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, (27)

and so we proceed under the assumption that this occurs. Note that this automatically gives us for any vF1𝑣subscript𝐹1v\in F_{1}italic_v ∈ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT that

pt+1(ui,v)=πγF(ui)(qt+1(ui,))(v)B3max{γ,qt+1(ui,v)}(27)max{γ,pt(ui,v)}=pt(ui,v).subscript𝑝𝑡1subscript𝑢𝑖𝑣superscriptsubscript𝜋𝛾𝐹subscript𝑢𝑖subscript𝑞𝑡1subscript𝑢𝑖𝑣B3𝛾subscript𝑞𝑡1subscript𝑢𝑖𝑣italic-(27italic-)𝛾subscript𝑝𝑡subscript𝑢𝑖𝑣subscript𝑝𝑡subscript𝑢𝑖𝑣p_{t+1}(u_{i},v)=\pi_{\gamma}^{F(u_{i})}(q_{t+1}(u_{i},\,\cdot\,))(v)\overset{% \emph{\ref{pi-2}}}{\leqslant}\max{\{\gamma,q_{t+1}(u_{i},v)\}}\overset{\eqref{% eq:c1-hp1}}{\leqslant}\max{\{\gamma,p_{t}(u_{i},v)\}}=p_{t}(u_{i},v).italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) = italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ⋅ ) ) ( italic_v ) overOVERACCENT start_ARG ⩽ end_ARG roman_max { italic_γ , italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) } start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG roman_max { italic_γ , italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) } = italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) . (28)

So if pt(ui,F1)2Δγsubscript𝑝𝑡subscript𝑢𝑖subscript𝐹12Δ𝛾p_{t}(u_{i},F_{1})\leqslant 2\Delta\gammaitalic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩽ 2 roman_Δ italic_γ then pt+1(ui,F1)2Δγsubscript𝑝𝑡1subscript𝑢𝑖subscript𝐹12Δ𝛾p_{t+1}(u_{i},F_{1})\leqslant 2\Delta\gammaitalic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩽ 2 roman_Δ italic_γ and hence Xt+1(i)=g(pt+1(ui,V0))=g(pt+1(ui,F0))=g(1pt+1(ui,F1))g(12Δγ)subscript𝑋𝑡1𝑖𝑔subscript𝑝𝑡1subscript𝑢𝑖subscript𝑉0𝑔subscript𝑝𝑡1subscript𝑢𝑖subscript𝐹0𝑔1subscript𝑝𝑡1subscript𝑢𝑖subscript𝐹1𝑔12Δ𝛾X_{t+1}(i)=g(p_{t+1}(u_{i},V_{0}))=g(p_{t+1}(u_{i},F_{0}))=g(1-p_{t+1}(u_{i},F% _{1}))\geqslant g(1-2\Delta\gamma)italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) = italic_g ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) = italic_g ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) = italic_g ( 1 - italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) ⩾ italic_g ( 1 - 2 roman_Δ italic_γ ). On the other hand, if pt(ui,F1)2Δγsubscript𝑝𝑡subscript𝑢𝑖subscript𝐹12Δ𝛾p_{t}(u_{i},F_{1})\geqslant 2\Delta\gammaitalic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩾ 2 roman_Δ italic_γ then there is some vF1𝑣subscript𝐹1v\in F_{1}italic_v ∈ italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT such that p(ui,v)1Δpt(ui,F1)2γ𝑝subscript𝑢𝑖𝑣1Δsubscript𝑝𝑡subscript𝑢𝑖subscript𝐹12𝛾p(u_{i},v)\geqslant\textstyle{\frac{1}{\Delta}}p_{t}(u_{i},F_{1})\geqslant 2\gammaitalic_p ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⩾ divide start_ARG 1 end_ARG start_ARG roman_Δ end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩾ 2 italic_γ, and hence

pt+1(ui,F1)subscript𝑝𝑡1subscript𝑢𝑖subscript𝐹1\displaystyle p_{t+1}(u_{i},F_{1})italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) =pt+1(ui,v)+pt+1(ui,F1{v})B3max{γ,qt+1(ui,v)}+pt+1(ui,F1{v})absentsubscript𝑝𝑡1subscript𝑢𝑖𝑣subscript𝑝𝑡1subscript𝑢𝑖subscript𝐹1𝑣B3𝛾subscript𝑞𝑡1subscript𝑢𝑖𝑣subscript𝑝𝑡1subscript𝑢𝑖subscript𝐹1𝑣\displaystyle=p_{t+1}(u_{i},v)+p_{t+1}(u_{i},F_{1}\setminus\{v\})\overset{% \emph{\ref{pi-2}}}{\leqslant}\max{\{\gamma,q_{t+1}(u_{i},v)\}}+p_{t+1}(u_{i},F% _{1}\setminus\{v\})= italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) + italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∖ { italic_v } ) overOVERACCENT start_ARG ⩽ end_ARG roman_max { italic_γ , italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) } + italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∖ { italic_v } )
(27),(28)max{γ,(1r/8)pt(ui,v)}+pt(ui,F1{v})italic-(27italic-)italic-(28italic-)𝛾1𝑟8subscript𝑝𝑡subscript𝑢𝑖𝑣subscript𝑝𝑡subscript𝑢𝑖subscript𝐹1𝑣\displaystyle\overset{\eqref{eq:c1-hp1},\eqref{eq:pt-on-F1}}{\leqslant}\max{\{% \gamma,(1-r/8)p_{t}(u_{i},v)\}}+p_{t}(u_{i},F_{1}\setminus\{v\})start_OVERACCENT italic_( italic_) , italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG roman_max { italic_γ , ( 1 - italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) } + italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∖ { italic_v } )
=(1r/8)pt(ui,v)+pt(ui,F1{v})absent1𝑟8subscript𝑝𝑡subscript𝑢𝑖𝑣subscript𝑝𝑡subscript𝑢𝑖subscript𝐹1𝑣\displaystyle=(1-r/8)p_{t}(u_{i},v)+p_{t}(u_{i},F_{1}\setminus\{v\})= ( 1 - italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) + italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∖ { italic_v } )
=pt(ui,F1)(r/8)pt(ui,v)(17)(1γs(ui)/8)pt(ui,F1).absentsubscript𝑝𝑡subscript𝑢𝑖subscript𝐹1𝑟8subscript𝑝𝑡subscript𝑢𝑖𝑣italic-(17italic-)1superscript𝛾𝑠subscript𝑢𝑖8subscript𝑝𝑡subscript𝑢𝑖subscript𝐹1\displaystyle=p_{t}(u_{i},F_{1})-(r/8)p_{t}(u_{i},v)\overset{\eqref{eq:r-bound% }}{\leqslant}(1-\gamma^{s(u_{i})}/8)p_{t}(u_{i},F_{1}).= italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - ( italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG ( 1 - italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) . (29)

In particular, this would then imply that

Xt+1(i)subscript𝑋𝑡1𝑖\displaystyle X_{t+1}(i)italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) =g(pt+1(ui,V0))=g(1pt+1(ui,V1))(29)g(1(1(γs(ui)/8))pt(ui,V1))absent𝑔subscript𝑝𝑡1subscript𝑢𝑖subscript𝑉0𝑔1subscript𝑝𝑡1subscript𝑢𝑖subscript𝑉1italic-(29italic-)𝑔11superscript𝛾𝑠subscript𝑢𝑖8subscript𝑝𝑡subscript𝑢𝑖subscript𝑉1\displaystyle=g(p_{t+1}(u_{i},V_{0}))=g(1-p_{t+1}(u_{i},V_{1}))\overset{\eqref% {eq:p-adjust-1}}{\geqslant}g(1-(1-(\gamma^{s(u_{i})}/8))p_{t}(u_{i},V_{1}))= italic_g ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) = italic_g ( 1 - italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩾ end_ARG italic_g ( 1 - ( 1 - ( italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 8 ) ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) )
F2g(1pt(ui,V1))+γs(ui)16=Xt(i)+γs(ui)16.F2𝑔1subscript𝑝𝑡subscript𝑢𝑖subscript𝑉1superscript𝛾𝑠subscript𝑢𝑖16subscript𝑋𝑡𝑖superscript𝛾𝑠subscript𝑢𝑖16\displaystyle\overset{\emph{\ref{g-simp-2}}}{\geqslant}g(1-p_{t}(u_{i},V_{1}))% +\frac{\gamma^{s(u_{i})}}{16}=X_{t}(i)+\frac{\gamma^{s(u_{i})}}{16}.overOVERACCENT start_ARG ⩾ end_ARG italic_g ( 1 - italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) + divide start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG 16 end_ARG = italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) + divide start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG 16 end_ARG .

Combining the cases pt(ui,F1)2Δγsubscript𝑝𝑡subscript𝑢𝑖subscript𝐹12Δ𝛾p_{t}(u_{i},F_{1})\leqslant 2\Delta\gammaitalic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩽ 2 roman_Δ italic_γ and pt(ui,F1)2Δγsubscript𝑝𝑡subscript𝑢𝑖subscript𝐹12Δ𝛾p_{t}(u_{i},F_{1})\geqslant 2\Delta\gammaitalic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩾ 2 roman_Δ italic_γ shows that the event Xt+1(i)min{g(12Δγ),Xt(i)+γs(ui)/32}subscript𝑋𝑡1𝑖𝑔12Δ𝛾subscript𝑋𝑡𝑖superscript𝛾𝑠subscript𝑢𝑖32X_{t+1}(i)\geqslant\min{\{g(1-2\Delta\gamma),X_{t}(i)+\gamma^{s(u_{i})}/32\}}italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) ⩾ roman_min { italic_g ( 1 - 2 roman_Δ italic_γ ) , italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) + italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 32 } holds with probability at least 1nK3s^51superscript𝑛𝐾3^𝑠51-n^{-K-3\hat{s}-5}1 - italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 5 end_POSTSUPERSCRIPT.

Case 2: pt(ui,F1)12subscript𝑝𝑡subscript𝑢𝑖subscript𝐹112p_{t}(u_{i},F_{1})\geqslant\textstyle{\frac{1}{2}}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩾ divide start_ARG 1 end_ARG start_ARG 2 end_ARG. If vF0𝑣subscript𝐹0v\in F_{0}italic_v ∈ italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, then

(z(ui)=v)𝑧subscript𝑢𝑖𝑣\displaystyle\mathbb{P}(z(u_{i})=v)blackboard_P ( italic_z ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_v ) (21),(23),(24)pt(ui,v)[1+r(110+910pt(ui,F1))]italic-(21italic-)italic-(23italic-)italic-(24italic-)subscript𝑝𝑡subscript𝑢𝑖𝑣delimited-[]1𝑟110910subscript𝑝𝑡subscript𝑢𝑖subscript𝐹1\displaystyle\overset{\eqref{eq:cg-sel-press},\eqref{eq:Nw-F0},\eqref{eq:Nw-F1% }}{\geqslant}p_{t}(u_{i},v)\cdot[1+r\cdot(-\textstyle{\frac{1}{10}}+\textstyle% {\frac{9}{10}}p_{t}(u_{i},F_{1}))]start_OVERACCENT italic_( italic_) , italic_( italic_) , italic_( italic_) end_OVERACCENT start_ARG ⩾ end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 + italic_r ⋅ ( - divide start_ARG 1 end_ARG start_ARG 10 end_ARG + divide start_ARG 9 end_ARG start_ARG 10 end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) ]
pt(ui,v)[1+14r].absentsubscript𝑝𝑡subscript𝑢𝑖𝑣delimited-[]114𝑟\displaystyle\hskip 19.91684pt\geqslant p_{t}(u_{i},v)\cdot[1+\textstyle{\frac% {1}{4}}r].⩾ italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⋅ [ 1 + divide start_ARG 1 end_ARG start_ARG 4 end_ARG italic_r ] . (30)

By using (19) and (30) to apply E1 with α=r/4𝛼𝑟4\alpha=r/4italic_α = italic_r / 4, it holds for every vF0𝑣subscript𝐹0v\in F_{0}italic_v ∈ italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT that

(qt+1(ui,v)<(1+r/8)pt(ui,v))exp(r2μpt(ui,v)/168(1+r/4))(5)12nK3s^6.subscript𝑞𝑡1subscript𝑢𝑖𝑣1𝑟8subscript𝑝𝑡subscript𝑢𝑖𝑣superscript𝑟2𝜇subscript𝑝𝑡subscript𝑢𝑖𝑣1681𝑟4italic-(5italic-)12superscript𝑛𝐾3^𝑠6\mathbb{P}(q_{t+1}(u_{i},v)<(1+r/8)p_{t}(u_{i},v))\leqslant\exp{\left(-\frac{r% ^{2}\mu p_{t}(u_{i},v)/16}{8(1+r/4)}\right)}\overset{\eqref{eq:bb-manip}}{% \leqslant}\frac{1}{2}n^{-K-3\hat{s}-6}.blackboard_P ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) < ( 1 + italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ) ⩽ roman_exp ( - divide start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) / 16 end_ARG start_ARG 8 ( 1 + italic_r / 4 ) end_ARG ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 6 end_POSTSUPERSCRIPT .

By using (19) and (22) to apply E3 with α=r𝛼𝑟\alpha=ritalic_α = italic_r, it holds for every vF(ui)𝑣𝐹subscript𝑢𝑖v\in F(u_{i})italic_v ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) that

(qt+1(ui,v)<(12r)pt(ui,v))exp(r2μpt(ui,v)/168(1+r/4))(5)12nK3s^6.subscript𝑞𝑡1subscript𝑢𝑖𝑣12𝑟subscript𝑝𝑡subscript𝑢𝑖𝑣superscript𝑟2𝜇subscript𝑝𝑡subscript𝑢𝑖𝑣1681𝑟4italic-(5italic-)12superscript𝑛𝐾3^𝑠6\mathbb{P}(q_{t+1}(u_{i},v)<(1-2r)p_{t}(u_{i},v))\leqslant\exp{\left(-\frac{r^% {2}\mu p_{t}(u_{i},v)/16}{8(1+r/4)}\right)}\overset{\eqref{eq:bb-manip}}{% \leqslant}\frac{1}{2}n^{-K-3\hat{s}-6}.blackboard_P ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) < ( 1 - 2 italic_r ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ) ⩽ roman_exp ( - divide start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) / 16 end_ARG start_ARG 8 ( 1 + italic_r / 4 ) end_ARG ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 6 end_POSTSUPERSCRIPT .

Therefore, by taking a union bound over F0subscript𝐹0F_{0}italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and also F(ui)𝐹subscript𝑢𝑖F(u_{i})italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), it occurs with probability at least 1nK3s^51superscript𝑛𝐾3^𝑠51-n^{-K-3\hat{s}-5}1 - italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 5 end_POSTSUPERSCRIPT that

qt+1(ui,v)(1+r/8)pt(ui,v)subscript𝑞𝑡1subscript𝑢𝑖𝑣1𝑟8subscript𝑝𝑡subscript𝑢𝑖𝑣\displaystyle q_{t+1}(u_{i},v)\geqslant(1+r/8)p_{t}(u_{i},v)italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⩾ ( 1 + italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) for every vF0𝑣subscript𝐹0v\in F_{0}italic_v ∈ italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, (31)
qt+1(ui,v)(12r)pt(ui,v)subscript𝑞𝑡1subscript𝑢𝑖𝑣12𝑟subscript𝑝𝑡subscript𝑢𝑖𝑣\displaystyle q_{t+1}(u_{i},v)\geqslant(1-2r)p_{t}(u_{i},v)italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⩾ ( 1 - 2 italic_r ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) for every vF(ui)𝑣𝐹subscript𝑢𝑖v\in F(u_{i})italic_v ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), (32)

and so we proceed under the assumption that this occurs. Recalling that γ=1/(20Δn)𝛾120Δ𝑛\gamma=1/(20\Delta n)italic_γ = 1 / ( 20 roman_Δ italic_n ) and the assumption that n3𝑛3n\geqslant 3italic_n ⩾ 3, we can now bound βγ(qt+1(ui,))superscriptsubscript𝛽𝛾subscript𝑞𝑡1subscript𝑢𝑖\beta_{\gamma}^{-}(q_{t+1}(u_{i},\,\cdot\,))italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ⋅ ) ) above as

βγ(qt+1(ui,))superscriptsubscript𝛽𝛾subscript𝑞𝑡1subscript𝑢𝑖\displaystyle\beta_{\gamma}^{-}(q_{t+1}(u_{i},\,\cdot\,))italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ⋅ ) ) =vF(ui)max{γqt+1(ui,v),0}(32)vF(ui)max{γ(12r)γ,0}absentsubscript𝑣𝐹subscript𝑢𝑖𝛾subscript𝑞𝑡1subscript𝑢𝑖𝑣0italic-(32italic-)subscript𝑣𝐹subscript𝑢𝑖𝛾12𝑟𝛾0\displaystyle=\sum_{v\in F(u_{i})}\max{\{\gamma-q_{t+1}(u_{i},v),0\}}\overset{% \eqref{eq:c2-hp2}}{\leqslant}\sum_{v\in F(u_{i})}\max{\{\gamma-(1-2r)\gamma,0\}}= ∑ start_POSTSUBSCRIPT italic_v ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT roman_max { italic_γ - italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) , 0 } start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT roman_max { italic_γ - ( 1 - 2 italic_r ) italic_γ , 0 }
vF(ui)2rγ2Δrγ=r10nr30r24(1160)r24(1γΔ).absentsubscript𝑣𝐹subscript𝑢𝑖2𝑟𝛾2Δ𝑟𝛾𝑟10𝑛𝑟30𝑟241160𝑟241𝛾Δ\displaystyle\leqslant\sum_{v\in F(u_{i})}2r\gamma\leqslant 2\Delta r\gamma=% \frac{r}{10n}\leqslant\frac{r}{30}\leqslant\frac{r}{24}\cdot(1-\textstyle{% \frac{1}{60}})\leqslant\displaystyle\frac{r}{24}\cdot(1-\gamma\Delta).⩽ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT 2 italic_r italic_γ ⩽ 2 roman_Δ italic_r italic_γ = divide start_ARG italic_r end_ARG start_ARG 10 italic_n end_ARG ⩽ divide start_ARG italic_r end_ARG start_ARG 30 end_ARG ⩽ divide start_ARG italic_r end_ARG start_ARG 24 end_ARG ⋅ ( 1 - divide start_ARG 1 end_ARG start_ARG 60 end_ARG ) ⩽ divide start_ARG italic_r end_ARG start_ARG 24 end_ARG ⋅ ( 1 - italic_γ roman_Δ ) . (33)

Hence, using that qt+1(ui,v)γsubscript𝑞𝑡1subscript𝑢𝑖𝑣𝛾q_{t+1}(u_{i},v)\geqslant\gammaitalic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v ) ⩾ italic_γ for every vF0𝑣subscript𝐹0v\in F_{0}italic_v ∈ italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT,

pt+1(ui,F0)subscript𝑝𝑡1subscript𝑢𝑖subscript𝐹0\displaystyle p_{t+1}(u_{i},F_{0})italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) =πγF(ui)(qt+1(ui,))(F0)B2(1βγ(qt+1(ui,))1γΔ)qt+1(ui,F0)absentsuperscriptsubscript𝜋𝛾𝐹subscript𝑢𝑖subscript𝑞𝑡1subscript𝑢𝑖subscript𝐹0B21superscriptsubscript𝛽𝛾subscript𝑞𝑡1subscript𝑢𝑖1𝛾Δsubscript𝑞𝑡1subscript𝑢𝑖subscript𝐹0\displaystyle=\pi_{\gamma}^{F(u_{i})}(q_{t+1}(u_{i},\,\cdot\,))(F_{0})\overset% {\emph{\ref{pi-1}}}{\geqslant}\left(1-\frac{\beta_{\gamma}^{-}(q_{t+1}(u_{i},% \,\cdot\,))}{1-\gamma\Delta}\right)q_{t+1}(u_{i},F_{0})= italic_π start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_F ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ⋅ ) ) ( italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) overOVERACCENT start_ARG ⩾ end_ARG ( 1 - divide start_ARG italic_β start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ⋅ ) ) end_ARG start_ARG 1 - italic_γ roman_Δ end_ARG ) italic_q start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT )
(33),(31)(1r/24)(1+r/8)pt(ui,F0)(17)(1+γs(ui)/16)pt(ui,F0).italic-(33italic-)italic-(31italic-)1𝑟241𝑟8subscript𝑝𝑡subscript𝑢𝑖subscript𝐹0italic-(17italic-)1superscript𝛾𝑠subscript𝑢𝑖16subscript𝑝𝑡subscript𝑢𝑖subscript𝐹0\displaystyle\overset{\eqref{eq:b-minus},\eqref{eq:c2-hp1}}{\geqslant}(1-r/24)% (1+r/8)p_{t}(u_{i},F_{0})\overset{\eqref{eq:r-bound}}{\geqslant}(1+\gamma^{s(u% _{i})}/16)p_{t}(u_{i},F_{0}).start_OVERACCENT italic_( italic_) , italic_( italic_) end_OVERACCENT start_ARG ⩾ end_ARG ( 1 - italic_r / 24 ) ( 1 + italic_r / 8 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩾ end_ARG ( 1 + italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 16 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) . (34)

This would then imply that

Xt+1(i)subscript𝑋𝑡1𝑖\displaystyle X_{t+1}(i)italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) =g(pt+1(ui,F0))(34)g((1+γs(ui)/16)pt(ui,F0))absent𝑔subscript𝑝𝑡1subscript𝑢𝑖subscript𝐹0italic-(34italic-)𝑔1superscript𝛾𝑠subscript𝑢𝑖16subscript𝑝𝑡subscript𝑢𝑖subscript𝐹0\displaystyle=g(p_{t+1}(u_{i},F_{0}))\overset{\eqref{eq:p-adjust-2}}{\geqslant% }g((1+\gamma^{s(u_{i})}/16)p_{t}(u_{i},F_{0}))= italic_g ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩾ end_ARG italic_g ( ( 1 + italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 16 ) italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) )
F1g(pt(ui,F0))+γs(ui)32=Xt(i)+γs(ui)32.F1𝑔subscript𝑝𝑡subscript𝑢𝑖subscript𝐹0superscript𝛾𝑠subscript𝑢𝑖32subscript𝑋𝑡𝑖superscript𝛾𝑠subscript𝑢𝑖32\displaystyle\overset{\emph{\ref{g-simp-1}}}{\geqslant}g(p_{t}(u_{i},F_{0}))+% \frac{\gamma^{s(u_{i})}}{32}=X_{t}(i)+\frac{\gamma^{s(u_{i})}}{32}.overOVERACCENT start_ARG ⩾ end_ARG italic_g ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) + divide start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG 32 end_ARG = italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) + divide start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG 32 end_ARG .

Thus, the event Xt+1(i)min{g(12Δγ),Xt(i)+γs(ui)/32}subscript𝑋𝑡1𝑖𝑔12Δ𝛾subscript𝑋𝑡𝑖superscript𝛾𝑠subscript𝑢𝑖32X_{t+1}(i)\geqslant\min{\{g(1-2\Delta\gamma),X_{t}(i)+\gamma^{s(u_{i})}/32\}}italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) ⩾ roman_min { italic_g ( 1 - 2 roman_Δ italic_γ ) , italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) + italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 32 } holds with probability at least 1nK3s^51superscript𝑛𝐾3^𝑠51-n^{-K-3\hat{s}-5}1 - italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 5 end_POSTSUPERSCRIPT. ∎

Proof of Claim 5.4.

Suppose that ptAnsubscript𝑝𝑡subscript𝐴𝑛p_{t}\notin A_{n}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∉ italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, so that :=(pt)<nassignsubscript𝑝𝑡𝑛\ell:=\ell(p_{t})<nroman_ℓ := roman_ℓ ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) < italic_n. For every i[]𝑖delimited-[]i\in[\ell]italic_i ∈ [ roman_ℓ ] with uiWGsubscript𝑢𝑖subscript𝑊𝐺u_{i}\in W_{G}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, it follows from the fact that ptAisubscript𝑝𝑡subscript𝐴𝑖p_{t}\in A_{i}italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT that p(ui,V0)=1p(ui,V1)1110n𝑝subscript𝑢𝑖subscript𝑉01𝑝subscript𝑢𝑖subscript𝑉11110𝑛p(u_{i},V_{0})=1-p(u_{i},V_{1})\geqslant 1-\textstyle{\frac{1}{10n}}italic_p ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = 1 - italic_p ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⩾ 1 - divide start_ARG 1 end_ARG start_ARG 10 italic_n end_ARG and hence

Xt(i)=g(p(ui,V0))g(1110n)=g(12Δγ).subscript𝑋𝑡𝑖𝑔𝑝subscript𝑢𝑖subscript𝑉0𝑔1110𝑛𝑔12Δ𝛾X_{t}(i)=g(p(u_{i},V_{0}))\geqslant g(1-\textstyle{\frac{1}{10n}})=g(1-2\Delta% \gamma).italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) = italic_g ( italic_p ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ⩾ italic_g ( 1 - divide start_ARG 1 end_ARG start_ARG 10 italic_n end_ARG ) = italic_g ( 1 - 2 roman_Δ italic_γ ) . (35)

Let I={i[+1]:uiWG}𝐼conditional-set𝑖delimited-[]1subscript𝑢𝑖subscript𝑊𝐺I=\{i\in[\ell+1]:u_{i}\in W_{G}\}italic_I = { italic_i ∈ [ roman_ℓ + 1 ] : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT } so that +1I1𝐼\ell+1\in Iroman_ℓ + 1 ∈ italic_I. For iI𝑖𝐼i\in Iitalic_i ∈ italic_I, let Eisubscript𝐸𝑖E_{i}italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT be the event that

Xt+1(i)min{g(12Δγ),Xt(i)+γs(ui)/32}.subscript𝑋𝑡1𝑖𝑔12Δ𝛾subscript𝑋𝑡𝑖superscript𝛾𝑠subscript𝑢𝑖32X_{t+1}(i)\geqslant\min{\{g(1-2\Delta\gamma),X_{t}(i)+\gamma^{s(u_{i})}/32\}}.italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) ⩾ roman_min { italic_g ( 1 - 2 roman_Δ italic_γ ) , italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_i ) + italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 32 } .

We will now show that if Eisubscript𝐸𝑖E_{i}italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT holds for every iI𝑖𝐼i\in Iitalic_i ∈ italic_I, then g^(pt+1)g^(pt)+1^𝑔subscript𝑝𝑡1^𝑔subscript𝑝𝑡1\hat{g}(p_{t+1})\geqslant\hat{g}(p_{t})+1over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) ⩾ over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + 1. Indeed, if Eisubscript𝐸𝑖E_{i}italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT holds for every iI𝑖𝐼i\in Iitalic_i ∈ italic_I, then it follows from (35) that Xt+1(i)g(12Δγ)subscript𝑋𝑡1𝑖𝑔12Δ𝛾X_{t+1}(i)\geqslant g(1-2\Delta\gamma)italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( italic_i ) ⩾ italic_g ( 1 - 2 roman_Δ italic_γ ) for every i[]𝑖delimited-[]i\in[\ell]italic_i ∈ [ roman_ℓ ] with uiWGsubscript𝑢𝑖subscript𝑊𝐺u_{i}\in W_{G}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, and hence pt+1Asubscript𝑝𝑡1subscript𝐴p_{t+1}\in A_{\ell}italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. If additionally (pt+1)>subscript𝑝𝑡1\ell(p_{t+1})>\ellroman_ℓ ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) > roman_ℓ, then g^(pt+1)g^(pt)+1^𝑔subscript𝑝𝑡1^𝑔subscript𝑝𝑡1\hat{g}(p_{t+1})\geqslant\hat{g}(p_{t})+1over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) ⩾ over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + 1 is immediate from (12). On the other hand, if (pt+1)=subscript𝑝𝑡1\ell(p_{t+1})=\ellroman_ℓ ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) = roman_ℓ, then E+1subscript𝐸1E_{\ell+1}italic_E start_POSTSUBSCRIPT roman_ℓ + 1 end_POSTSUBSCRIPT implies Xt+1(+1)Xt(+1)+γs(u+1)/32subscript𝑋𝑡11subscript𝑋𝑡1superscript𝛾𝑠subscript𝑢132X_{t+1}(\ell+1)\geqslant X_{t}(\ell+1)+\gamma^{s(u_{\ell+1})}/32italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( roman_ℓ + 1 ) ⩾ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( roman_ℓ + 1 ) + italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT roman_ℓ + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT / 32 and hence,

g^(pt+1)g^(pt)=(Xt+1(+1)Xt(+1))(32γs(u+1))1.^𝑔subscript𝑝𝑡1^𝑔subscript𝑝𝑡subscript𝑋𝑡11subscript𝑋𝑡132superscript𝛾𝑠subscript𝑢11\hat{g}(p_{t+1})-\hat{g}(p_{t})=(X_{t+1}(\ell+1)-X_{t}(\ell+1))\cdot\left(% \frac{32}{\gamma^{s(u_{\ell+1})}}\right)\geqslant 1.over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) - over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = ( italic_X start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( roman_ℓ + 1 ) - italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( roman_ℓ + 1 ) ) ⋅ ( divide start_ARG 32 end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT italic_s ( italic_u start_POSTSUBSCRIPT roman_ℓ + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT end_ARG ) ⩾ 1 .

Therefore, using a union bound we have

(g^(pt+1)g^(pt)+1)(iIEi)1iI(Eic)Claim 5.31|I|nK3s^51nK3s^4,^𝑔subscript𝑝𝑡1^𝑔subscript𝑝𝑡1subscript𝑖𝐼subscript𝐸𝑖1subscript𝑖𝐼superscriptsubscript𝐸𝑖𝑐Claim 5.31𝐼superscript𝑛𝐾3^𝑠51superscript𝑛𝐾3^𝑠4\displaystyle\mathbb{P}(\hat{g}(p_{t+1})\geqslant\hat{g}(p_{t})+1)\geqslant% \mathbb{P}(\wedge_{i\in I}E_{i})\geqslant 1-\sum_{i\in I}\mathbb{P}(E_{i}^{c})% \overset{\text{Claim~{}\ref{clm:cg-logit-drift}}}{\geqslant}1-|I|\cdot n^{-K-3% \hat{s}-5}\geqslant 1-n^{-K-3\hat{s}-4},blackboard_P ( over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) ⩾ over^ start_ARG italic_g end_ARG ( italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + 1 ) ⩾ blackboard_P ( ∧ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⩾ 1 - ∑ start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT blackboard_P ( italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) overClaim start_ARG ⩾ end_ARG 1 - | italic_I | ⋅ italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 5 end_POSTSUPERSCRIPT ⩾ 1 - italic_n start_POSTSUPERSCRIPT - italic_K - 3 over^ start_ARG italic_s end_ARG - 4 end_POSTSUPERSCRIPT ,

as required. ∎ \square

For many applications, rather than applying Theorem 5.2 directly it will be convenient to use the following corollary.

Corollary 5.5.

There is a constant C>0𝐶0C>0italic_C > 0 such that the following holds. Let G𝐺Gitalic_G be an impartial combinatorial game with maximum degree ΔΔ\Deltaroman_Δ, and let s¯=maxvVs(v)¯𝑠subscript𝑣𝑉𝑠𝑣\overline{s}=\max_{v\in V}s(v)over¯ start_ARG italic_s end_ARG = roman_max start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT italic_s ( italic_v ). Let K>0𝐾0K>0italic_K > 0, and assume 𝒜𝒜\mathcal{A}caligraphic_A uses parameters γ=1/(20Δn)𝛾120Δ𝑛\gamma=1/(20\Delta n)italic_γ = 1 / ( 20 roman_Δ italic_n ) and μ=C(K+s¯+1)(20Δn)1+2s¯logn𝜇𝐶𝐾¯𝑠1superscript20Δ𝑛12¯𝑠𝑛\mu=C(K+\overline{s}+1)(20\Delta n)^{1+2\overline{s}}\log{n}italic_μ = italic_C ( italic_K + over¯ start_ARG italic_s end_ARG + 1 ) ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT 1 + 2 over¯ start_ARG italic_s end_ARG end_POSTSUPERSCRIPT roman_log italic_n. Then,

[T𝒜G(Opt(G))C2(K+s¯+1)(20Δn)2+3s¯log2n]nK.delimited-[]superscriptsubscript𝑇𝒜𝐺Opt𝐺superscript𝐶2𝐾¯𝑠1superscript20Δ𝑛23¯𝑠superscript2𝑛superscript𝑛𝐾\mathbb{P}[T_{\mathcal{A}}^{G}(\text{\emph{Opt}}(G))\geqslant C^{2}(K+% \overline{s}+1)(20\Delta n)^{2+3\overline{s}}\log^{2}{n}]\leqslant n^{-K}.blackboard_P [ italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT ( Opt ( italic_G ) ) ⩾ italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_K + over¯ start_ARG italic_s end_ARG + 1 ) ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT 2 + 3 over¯ start_ARG italic_s end_ARG end_POSTSUPERSCRIPT roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ] ⩽ italic_n start_POSTSUPERSCRIPT - italic_K end_POSTSUPERSCRIPT .
Proof.

By noting that

s^=maxvWGs(v)maxvVs(v)=s¯,^𝑠subscript𝑣subscript𝑊𝐺𝑠𝑣subscript𝑣𝑉𝑠𝑣¯𝑠\hat{s}=\max_{v\in W_{G}}s(v)\leqslant\max_{v\in V}s(v)=\overline{s},over^ start_ARG italic_s end_ARG = roman_max start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_s ( italic_v ) ⩽ roman_max start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT italic_s ( italic_v ) = over¯ start_ARG italic_s end_ARG ,

and

vWG(20Δn)s(v)vV(20Δn)s(v)vV(20Δn)s¯=n(20Δn)s¯(20Δn)s¯+1,subscript𝑣subscript𝑊𝐺superscript20Δ𝑛𝑠𝑣subscript𝑣𝑉superscript20Δ𝑛𝑠𝑣subscript𝑣𝑉superscript20Δ𝑛¯𝑠𝑛superscript20Δ𝑛¯𝑠superscript20Δ𝑛¯𝑠1\sum_{v\in W_{G}}(20\Delta n)^{s(v)}\leqslant\sum_{v\in V}(20\Delta n)^{s(v)}% \leqslant\sum_{v\in V}(20\Delta n)^{\overline{s}}=n\cdot(20\Delta n)^{% \overline{s}}\leqslant(20\Delta n)^{\overline{s}+1},∑ start_POSTSUBSCRIPT italic_v ∈ italic_W start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT ⩽ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT ⩽ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V end_POSTSUBSCRIPT ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT over¯ start_ARG italic_s end_ARG end_POSTSUPERSCRIPT = italic_n ⋅ ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT over¯ start_ARG italic_s end_ARG end_POSTSUPERSCRIPT ⩽ ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT over¯ start_ARG italic_s end_ARG + 1 end_POSTSUPERSCRIPT ,

this is an immediate consequence of Theorem 5.2. ∎

6 Applications

In this section we will apply Theorem 5.2 to obtain several runtimes for Algorithm 1 on a number of well-established combinatorial games. Throughout, we state runtimes in terms of n𝑛nitalic_n, the number of possible game positions, and always assume that 𝒜𝒜\mathcal{A}caligraphic_A is described by Algorithm 1. All described games are played under the normal play convention (that a player unable to move loses), as established in Section 2.

6.1 Subtraction Nim

Nim is a strategic game in which players take turns removing items from distinct heaps. Variants have been played across cultures since ancient history [56, 67], and it was also the game of choice for some of the earliest machines and computers dedicated to game playing [37, 44, 52]. Nim is also perhaps the most important impartial combinatorial game from a mathematical perspective, with the Sprague-Grundy theorem establishing that, for a particular formulation of equivalence which characterises strategic continuation, every position in any impartial combinatorial game is equivalent to some position of a one-heap game of Nim [7].

While the version central to combinatorial game theory typically allows players to remove any positive number of items on their turn, here we consider the well-known one-heap variant in which there is an upper limit on the number of items that can be taken at once (see, for example, [19]). Given parameters n𝑛nitalic_n and k𝑘kitalic_k, SubtractionNimnksuperscriptsubscriptSubtractionNim𝑛𝑘\textsc{SubtractionNim}_{n}^{k}SubtractionNim start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT begins with an initial heap of (n1)𝑛1(n-1)( italic_n - 1 ) items, and on each turn a player may remove between 1111 and k𝑘kitalic_k items from the heap. The game graph for SubtractionNimn2superscriptsubscriptSubtractionNim𝑛2\textsc{SubtractionNim}_{n}^{2}SubtractionNim start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is shown in Figure 3. This game constitutes the simplest example of a subtraction game [26] of also a take-away game [59], both of which are expansive and well-studied classes of impartial combinatorial games. We have the following polynomial runtime for SubtractionNimnksuperscriptsubscriptSubtractionNim𝑛𝑘\textsc{SubtractionNim}_{n}^{k}SubtractionNim start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT.

Proposition 6.1.

SubtractionNimnksuperscriptsubscriptSubtractionNim𝑛𝑘\emph{{SubtractionNim}}_{n}^{k}SubtractionNim start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT satisfies s¯1¯𝑠1\overline{s}\leqslant 1over¯ start_ARG italic_s end_ARG ⩽ 1 and ΔkΔ𝑘\Delta\leqslant kroman_Δ ⩽ italic_k. Thus, for each K>0𝐾0K>0italic_K > 0 there exists C>0𝐶0C>0italic_C > 0 such that for appropriately chosen parameters in Algorithm 1,

[T𝒜(Opt(SubtractionNimnk))C(kn)5log2n]nK.delimited-[]subscript𝑇𝒜OptsuperscriptsubscriptSubtractionNim𝑛𝑘𝐶superscript𝑘𝑛5superscript2𝑛superscript𝑛𝐾\mathbb{P}[T_{\mathcal{A}}(\text{\emph{Opt}}(\emph{{SubtractionNim}}_{n}^{k}))% \geqslant C(kn)^{5}\log^{2}{n}]\leqslant n^{-K}.blackboard_P [ italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ( Opt ( SubtractionNim start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) ) ⩾ italic_C ( italic_k italic_n ) start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ] ⩽ italic_n start_POSTSUPERSCRIPT - italic_K end_POSTSUPERSCRIPT .
Proof.

For SubtractionNimnksuperscriptsubscriptSubtractionNim𝑛𝑘\textsc{SubtractionNim}_{n}^{k}SubtractionNim start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT we have V={0,1,,n1}𝑉01𝑛1V=\{0,1,\ldots,n-1\}italic_V = { 0 , 1 , … , italic_n - 1 }, v0=n1subscript𝑣0𝑛1v_{0}=n-1italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_n - 1, and F(v)={v1,,vk}V𝐹𝑣𝑣1𝑣𝑘𝑉F(v)=\{v-1,\ldots,v-k\}\cap Vitalic_F ( italic_v ) = { italic_v - 1 , … , italic_v - italic_k } ∩ italic_V. Note that VInt(G)={0}𝑉Int𝐺0V\setminus\text{Int}(G)=\{0\}italic_V ∖ Int ( italic_G ) = { 0 }.

We need to verify that s(v)1𝑠𝑣1s(v)\leqslant 1italic_s ( italic_v ) ⩽ 1 for every vV𝑣𝑉v\in Vitalic_v ∈ italic_V. Given v𝑣vitalic_v, let Av={(v+i,v):i[k1]}E(G)subscript𝐴𝑣conditional-set𝑣𝑖𝑣𝑖delimited-[]𝑘1𝐸𝐺A_{v}=\{(v+i,v):i\in[k-1]\}\cap E(G)italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT = { ( italic_v + italic_i , italic_v ) : italic_i ∈ [ italic_k - 1 ] } ∩ italic_E ( italic_G ). We have Depth(Av)=1Depthsubscript𝐴𝑣1\text{Depth}(A_{v})=1Depth ( italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) = 1, as any directed path in G𝐺Gitalic_G can visit v𝑣vitalic_v at most once. To see that Avsubscript𝐴𝑣A_{v}italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT is a v𝑣vitalic_v-switcher, suppose that z0zsubscript𝑧0subscript𝑧z_{0}\ldots z_{\ell}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is an Avsubscript𝐴𝑣A_{v}italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT-compatible directed path from z0=n1subscript𝑧0𝑛1z_{0}=n-1italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_n - 1 to VInt(G)={0}𝑉Int𝐺0V\setminus\text{Int}(G)=\{0\}italic_V ∖ Int ( italic_G ) = { 0 }. Because at most k𝑘kitalic_k items are removed on each turn, there is some i𝑖iitalic_i such that zi{v,v+1,,v+(k1)}subscript𝑧𝑖𝑣𝑣1𝑣𝑘1z_{i}\in\{v,v+1,\ldots,v+(k-1)\}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ { italic_v , italic_v + 1 , … , italic_v + ( italic_k - 1 ) }. But then we either have zi=vsubscript𝑧𝑖𝑣z_{i}=vitalic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_v or, in order for z0zi+1subscript𝑧0subscript𝑧𝑖1z_{0}\ldots z_{i+1}italic_z start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT … italic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT to remain Avsubscript𝐴𝑣A_{v}italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT-compatible, zizi+1Avsubscript𝑧𝑖subscript𝑧𝑖1subscript𝐴𝑣z_{i}z_{i+1}\in A_{v}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ∈ italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT and hence zi+1=vsubscript𝑧𝑖1𝑣z_{i+1}=vitalic_z start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_v. In either case, we deduce that v𝑣vitalic_v lies on every Avsubscript𝐴𝑣A_{v}italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT-compatible directed path from n1𝑛1n-1italic_n - 1 to 00. Thus, Avsubscript𝐴𝑣A_{v}italic_A start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT is a v𝑣vitalic_v-switcher of depth 1111, and hence s(v)1𝑠𝑣1s(v)\leqslant 1italic_s ( italic_v ) ⩽ 1.

From this, we have that s¯1¯𝑠1\overline{s}\leqslant 1over¯ start_ARG italic_s end_ARG ⩽ 1. Combined with the observation that ΔkΔ𝑘\Delta\leqslant kroman_Δ ⩽ italic_k, the result then follows from Corollary 5.5. ∎

6.2 Silver Dollar

We consider the variant of Silver Dollar played without the eponymous silver dollar [7, 15, 26]; however, it should be noted that Theorem 5.2 also implies a similar polynomial runtime for the original version of Silver Dollar attributed to de Bruijn (see also [7]).

Given parameters m𝑚mitalic_m and k𝑘kitalic_k, SilverDollarmksuperscriptsubscriptSilverDollar𝑚𝑘\textsc{SilverDollar}_{m}^{k}SilverDollar start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT is played using k𝑘kitalic_k coins on a horizontal strip of m𝑚mitalic_m squares, with the coins initially placed on the rightmost k𝑘kitalic_k squares (most descriptions actually have the coins placed on arbitrary starting squares, however this does not significantly affect our analysis). A turn consists of moving one coin leftwards any number of spaces, provided the coin does not go past any other coins. In addition, coins may never occupy the same square. Assuming k𝑘kitalic_k is a fixed constant, the number of game positions is n=(mk)𝑛binomial𝑚𝑘n=\binom{m}{k}italic_n = ( FRACOP start_ARG italic_m end_ARG start_ARG italic_k end_ARG ). We have the following polynomial runtime for SilverDollarmksuperscriptsubscriptSilverDollar𝑚𝑘\textsc{SilverDollar}_{m}^{k}SilverDollar start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT.

Proposition 6.2.

Let k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N be fixed. SilverDollarmksuperscriptsubscriptSilverDollar𝑚𝑘\emph{{SilverDollar}}_{m}^{k}SilverDollar start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT satisfies s¯k¯𝑠𝑘\overline{s}\leqslant kover¯ start_ARG italic_s end_ARG ⩽ italic_k and Δm=O(n1/k)Δ𝑚𝑂superscript𝑛1𝑘\Delta\leqslant m=O(n^{1/k})roman_Δ ⩽ italic_m = italic_O ( italic_n start_POSTSUPERSCRIPT 1 / italic_k end_POSTSUPERSCRIPT ). Thus, for each K>0𝐾0K>0italic_K > 0 there exists C>0𝐶0C>0italic_C > 0 such that for appropriately chosen parameters in Algorithm 1,

[T𝒜(Opt(SilverDollarmk))Cn5+3k+(2/k)log2n]nK.delimited-[]subscript𝑇𝒜OptsuperscriptsubscriptSilverDollar𝑚𝑘𝐶superscript𝑛53𝑘2𝑘superscript2𝑛superscript𝑛𝐾\mathbb{P}[T_{\mathcal{A}}(\text{\emph{Opt}}(\emph{{SilverDollar}}_{m}^{k}))% \geqslant Cn^{5+3k+(2/k)}\log^{2}{n}]\leqslant n^{-K}.blackboard_P [ italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ( Opt ( SilverDollar start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) ) ⩾ italic_C italic_n start_POSTSUPERSCRIPT 5 + 3 italic_k + ( 2 / italic_k ) end_POSTSUPERSCRIPT roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ] ⩽ italic_n start_POSTSUPERSCRIPT - italic_K end_POSTSUPERSCRIPT .
Proof.

On each turn, for each empty square there is at most one possible move that places a coin onto that square. Therefore, Δmkm=O(n1/k)Δ𝑚𝑘𝑚𝑂superscript𝑛1𝑘\Delta\leqslant m-k\leqslant m=O(n^{1/k})roman_Δ ⩽ italic_m - italic_k ⩽ italic_m = italic_O ( italic_n start_POSTSUPERSCRIPT 1 / italic_k end_POSTSUPERSCRIPT ). Next, any possible game position can be reached from the starting position in at most k𝑘kitalic_k moves (simply move each coin in order from left to right onto the required square). Therefore, using Proposition 4.4, we have s¯k¯𝑠𝑘\overline{s}\leqslant kover¯ start_ARG italic_s end_ARG ⩽ italic_k. The required result then follows from Corollary 5.5 using these bounds on ΔΔ\Deltaroman_Δ and s¯¯𝑠\overline{s}over¯ start_ARG italic_s end_ARG. ∎

6.3 Turning Turtles

Here we consider one instance of a large class of coin turning games [3, 26]. Given a parameter m𝑚mitalic_m, TurningTurtlesmsubscriptTurningTurtles𝑚\textsc{TurningTurtles}_{m}TurningTurtles start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is played using a row of m𝑚mitalic_m coins, initially all showing heads. A turn consists of turning over one coin from heads to tails, and then optionally turning over one more coin anywhere to the left of that one (regardless of whether it is showing heads or tails). Play continues until all coins show tails. Noting that the total number of game positions is n=2m𝑛superscript2𝑚n=2^{m}italic_n = 2 start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, we have the following quasipolynomial runtime for TurningTurtlesmsubscriptTurningTurtles𝑚\textsc{TurningTurtles}_{m}TurningTurtles start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.

Proposition 6.3.

TurningTurtlesmsubscriptTurningTurtles𝑚\emph{{TurningTurtles}}_{m}TurningTurtles start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT satisfies s¯log2n¯𝑠subscript2𝑛\overline{s}\leqslant\log_{2}{n}over¯ start_ARG italic_s end_ARG ⩽ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n and Δ(log2n)2Δsuperscriptsubscript2𝑛2\Delta\leqslant(\log_{2}{n})^{2}roman_Δ ⩽ ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Thus, for each K>0𝐾0K>0italic_K > 0 there exists c>0𝑐0c>0italic_c > 0 such that for appropriately chosen parameters in Algorithm 1,

[T𝒜(Opt(TurningTurtlesm))nclogn]nK.delimited-[]subscript𝑇𝒜OptsubscriptTurningTurtles𝑚superscript𝑛𝑐𝑛superscript𝑛𝐾\mathbb{P}[T_{\mathcal{A}}(\text{\emph{Opt}}(\emph{{TurningTurtles}}_{m}))% \geqslant n^{c\log{n}}]\leqslant n^{-K}.blackboard_P [ italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ( Opt ( TurningTurtles start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) ⩾ italic_n start_POSTSUPERSCRIPT italic_c roman_log italic_n end_POSTSUPERSCRIPT ] ⩽ italic_n start_POSTSUPERSCRIPT - italic_K end_POSTSUPERSCRIPT .
Proof.

On each turn, there are at most m𝑚mitalic_m possible moves that turn over only one coin and at most (m2)binomial𝑚2\binom{m}{2}( FRACOP start_ARG italic_m end_ARG start_ARG 2 end_ARG ) possible moves that turn over two coins. Therefore, Δm+(m2)m2Δ𝑚binomial𝑚2superscript𝑚2\Delta\leqslant m+\binom{m}{2}\leqslant m^{2}roman_Δ ⩽ italic_m + ( FRACOP start_ARG italic_m end_ARG start_ARG 2 end_ARG ) ⩽ italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Next, any possible game position can be reached from the starting position in at most m𝑚mitalic_m moves (simply turn over the required coins from heads to tails one by one). Therefore, using Proposition 4.4 we have s¯m¯𝑠𝑚\overline{s}\leqslant mover¯ start_ARG italic_s end_ARG ⩽ italic_m. Noting that m=log2n𝑚subscript2𝑛m=\log_{2}{n}italic_m = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n, and hence

C2(K+s¯+1)(20Δn)2+3s¯log2nC2(K+log2n+1)(20(log2n)2n)2+3log2n(logn)2nclogn,superscript𝐶2𝐾¯𝑠1superscript20Δ𝑛23¯𝑠superscript2𝑛superscript𝐶2𝐾subscript2𝑛1superscript20superscriptsubscript2𝑛2𝑛23subscript2𝑛superscript𝑛2superscript𝑛𝑐𝑛C^{2}(K+\overline{s}+1)(20\Delta n)^{2+3\overline{s}}\log^{2}{n}\leqslant C^{2% }(K+\log_{2}{n}+1)(20(\log_{2}{n})^{2}n)^{2+3\log_{2}{n}}(\log{n})^{2}% \leqslant n^{c\log{n}},italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_K + over¯ start_ARG italic_s end_ARG + 1 ) ( 20 roman_Δ italic_n ) start_POSTSUPERSCRIPT 2 + 3 over¯ start_ARG italic_s end_ARG end_POSTSUPERSCRIPT roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ⩽ italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_K + roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n + 1 ) ( 20 ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ) start_POSTSUPERSCRIPT 2 + 3 roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⩽ italic_n start_POSTSUPERSCRIPT italic_c roman_log italic_n end_POSTSUPERSCRIPT ,

the required result then follows from Corollary 5.5. ∎

6.4 Chomp

Since its introduction by Schuh [58] and later by Gale [21], Chomp has inspired a great deal of theoretical and empirical analysis, as well as numerous variants incorporating, for example, graphs and simplicial complexes [22]. While typically played on any rectangular board, we focus on square instances for the sake of conciseness.

Given a parameter m𝑚mitalic_m, ChompmsubscriptChomp𝑚\textsc{Chomp}_{m}Chomp start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is played on an m×m𝑚𝑚m\times mitalic_m × italic_m board. A turn consists of removing one square, as well as all squares to the right and above. However, if a player removes the square in the lower-left corner (the ‘poison’ square) they immediately lose. Note that to instantiate this game under our normal play convention, we can make removing the lower-left corner fatal by simply removing the position that has no remaining squares. We can establish the following quasipolynomial runtime for ChompmsubscriptChomp𝑚\textsc{Chomp}_{m}Chomp start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.

Proposition 6.4.

ChompmsubscriptChomp𝑚\emph{{Chomp}}_{m}Chomp start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT satisfies s¯O(log2n)¯𝑠𝑂subscript2𝑛\overline{s}\leqslant O(\log_{2}{n})over¯ start_ARG italic_s end_ARG ⩽ italic_O ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n ) and ΔO((log2n)2)Δ𝑂superscriptsubscript2𝑛2\Delta\leqslant O((\log_{2}{n})^{2})roman_Δ ⩽ italic_O ( ( roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). Thus, for each K>0𝐾0K>0italic_K > 0 there exists c>0𝑐0c>0italic_c > 0 such that for appropriately chosen parameters in Algorithm 1,

[T𝒜(Opt(Chompm))nclogn]nK.delimited-[]subscript𝑇𝒜OptsubscriptChomp𝑚superscript𝑛𝑐𝑛superscript𝑛𝐾\mathbb{P}[T_{\mathcal{A}}(\text{\emph{Opt}}(\emph{{Chomp}}_{m}))\geqslant n^{% c\log{n}}]\leqslant n^{-K}.blackboard_P [ italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ( Opt ( Chomp start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) ⩾ italic_n start_POSTSUPERSCRIPT italic_c roman_log italic_n end_POSTSUPERSCRIPT ] ⩽ italic_n start_POSTSUPERSCRIPT - italic_K end_POSTSUPERSCRIPT .
Proof.

In each possible game position, every row must be at least as long as the row above it. In particular, there is a correspondence between game positions and lattice paths (i.e., paths that only move right and down along the squares’ edges) from the top-left corner to bottom-right corner, with the path marking out the boundary of the remaining squares. Using stars and bars counting (see [4, Theorem 8.5.1] for a full treatment) and removing the position that has no remaining squares, the total number of game positions is n=(2mm)1=Θ(4m/m)𝑛binomial2𝑚𝑚1Θsuperscript4𝑚𝑚n=\binom{2m}{m}-1=\Theta(4^{m}/\sqrt{m})italic_n = ( FRACOP start_ARG 2 italic_m end_ARG start_ARG italic_m end_ARG ) - 1 = roman_Θ ( 4 start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT / square-root start_ARG italic_m end_ARG ). On each turn, there are at most m2superscript𝑚2m^{2}italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT moves available, and so Δm2Δsuperscript𝑚2\Delta\leqslant m^{2}roman_Δ ⩽ italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Next, any possible game position can be reached from the starting position in at most m𝑚mitalic_m moves (simply make the appropriate chomp row by row working from top to bottom). Therefore, using Proposition 4.4, we have s¯m¯𝑠𝑚\overline{s}\leqslant mover¯ start_ARG italic_s end_ARG ⩽ italic_m. Thus, as with Proposition 6.3, we have Δm2Δsuperscript𝑚2\Delta\leqslant m^{2}roman_Δ ⩽ italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and s¯m¯𝑠𝑚\overline{s}\leqslant mover¯ start_ARG italic_s end_ARG ⩽ italic_m where m=O(logn)𝑚𝑂𝑛m=O(\log{n})italic_m = italic_O ( roman_log italic_n ), and so the result follows. ∎

7 Concluding remarks

v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT
Figure 4: A game that should be easy to optimise, but contains vertices with switchability Θ(n)Θ𝑛\Theta(n)roman_Θ ( italic_n ).

We conclude with some brief remarks about the main result and future work.

In order to accommodate the high degree of generality in Theorem 5.2, the proof makes a number assumptions about the route taken to the search objective. A notable one is that, if v𝑣vitalic_v is the next critical position to be optimised, or one that has already been learned, then the probability (vPathG(x,y))𝑣subscriptPath𝐺𝑥𝑦\mathbb{P}(v\in\text{Path}_{G}(x,y))blackboard_P ( italic_v ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) that v𝑣vitalic_v is encountered in a game played out by sampled individuals x,y𝑥𝑦x,yitalic_x , italic_y is bound below by γs(v)superscript𝛾𝑠𝑣\gamma^{s(v)}italic_γ start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT. Lemma 4.5 demonstrates that analysis of (vPathG(x,y))𝑣subscriptPath𝐺𝑥𝑦\mathbb{P}(v\in\text{Path}_{G}(x,y))blackboard_P ( italic_v ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) is a major contribution to the eventual runtime, serving a role akin to a dynamic learning rate for the algorithm at position v𝑣vitalic_v. A key insight is that encountering a large range of game positions by evaluating diverse sets of opponents is essential to an algorithm’s success. However, it is apparent that the general bound (vPathG(x,y))γs(v)𝑣subscriptPath𝐺𝑥𝑦superscript𝛾𝑠𝑣\mathbb{P}(v\in\text{Path}_{G}(x,y))\geqslant\gamma^{s(v)}blackboard_P ( italic_v ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) ⩾ italic_γ start_POSTSUPERSCRIPT italic_s ( italic_v ) end_POSTSUPERSCRIPT could be greatly improved through closer analysis of coevolutionary dynamics, especially for specific games. For example, if individuals often misplay at a winning position v𝑣vitalic_v, opponents should begin to exploit this by steering the game towards v𝑣vitalic_v; the resulting feedback mechanism between (vPathG(x,y))𝑣subscriptPath𝐺𝑥𝑦\mathbb{P}(v\in\text{Path}_{G}(x,y))blackboard_P ( italic_v ∈ Path start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_x , italic_y ) ) and pt(v,)subscript𝑝𝑡𝑣p_{t}(v,\,\cdot\,)italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_v , ⋅ ) can assist more efficient learning.

A related assumption is that game positions are optimised sequentially, moving from the end of the game and working backwards, not unlike a recursive computation of the Sprague-Grundy function. However, this is not the route to optimality we would expect CoEAs to adopt for all games (consider Figure 4, where UMDA would naturally optimise starting from v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and working forwards). Moreover, because Lemma 2.2 is not a necessary condition, there is potential for CoEAs to demonstrate bias towards learning simpler elements of Opt(G)Opt𝐺\text{Opt}(G)Opt ( italic_G ) without the need to implicitly deduce all zeros of the Sprague-Grundy function (for example, when played on a square board, there is an optimal strategy for Chomp that can be described by specifying an action at only Θ(m2)Θsuperscript𝑚2\Theta(m^{2})roman_Θ ( italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) of the Θ(4m/m)Θsuperscript4𝑚𝑚\Theta(4^{m}/\sqrt{m})roman_Θ ( 4 start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT / square-root start_ARG italic_m end_ARG ) game positions).

In future work, we aim to provide more detailed analysis related to both of the above assumptions in order to provide stronger runtime results on classes of impartial combinatorial games. A longer term goal is the development of runtime analysis applicable to game representations that are practical even for games with exponentially many positions, such as in situations encountered in genetic programming.

References

  • [1] F. Ben Jedidia, B. Doerr, and M. S. Krejca. Estimation-of-distribution algorithms for multi-valued decision variables. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’23, page 230–238, 2023.
  • [2] A. Benford and P. K. Lehre. Runtime analysis of coevolutionary algorithms on a class of symmetric zero-sum games. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’24, page 1542–1550, 2024.
  • [3] E. R. Berlekamp, J. H. Conway, and R. K. Guy. Winning Ways for Your Mathematical Plays, volume 3. A. K. Peters, 2003.
  • [4] R. A. Brualdi. Introductory Combinatorics. Pearson, 5th edition, 2009.
  • [5] K. Chellapilla and D. Fogel. Evolving neural networks to play checkers without relying on expert knowledge. IEEE Transactions on Neural Networks, 10(6):1382–1391, 1999.
  • [6] F. Chung and L. Lu. Concentration inequalities and martingale inequalities: a survey. Internet Mathematics, 3(1):79 – 127, 2006.
  • [7] J. H. Conway. On Numbers and Games. A.K. Peters, 2nd edition, 2001.
  • [8] D.-C. Dang and P. K. Lehre. Simplified runtime analysis of estimation of distribution algorithms. In Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation, GECCO ’15, page 513–518, 2015.
  • [9] D. DeCoste. The significance of Kasparov versus DEEP BLUE and the future of computer chess. ICGA Journal, 21(1):33–43, 1998.
  • [10] E. Demaine and R. Hearn. Playing games with algorithms: Algorithmic combinatorial game theory. In Games of No Chance 3, volume 56 of Mathematical Sciences Research Institute Publications, pages 3–56. Cambridge University Press, 2009.
  • [11] B. Doerr. The runtime of the compact genetic algorithm on jump functions. Algorithmica, 83(10):3059–3107, 2021.
  • [12] B. Doerr and F. Neumann. A survey on recent progress in the theory of evolutionary algorithms for discrete optimization. ACM Transactions on Evolutionary Learning and Optimization, 1(4), oct 2021.
  • [13] S. Droste. A rigorous analysis of the compact genetic algorithm for linear functions. Natural Computing, 5:257–283, 2006.
  • [14] S. Droste, T. Jansen, and I. Wegener. Upper and lower bounds for randomized search heuristics in black-box optimization. Theory of Computing Systems, 39(4):525–544, 2006.
  • [15] G. Farr and N. B. Ho. The Sprague–Grundy function for some nearly disjunctive sums of nim and silver dollar games. Theoretical Computer Science, 732:46–59, 2018.
  • [16] G. Ferrer and W. Martin. Using genetic programming to evolve board evaluation functions. In Proceedings of 1995 IEEE International Conference on Evolutionary Computation, volume 2, pages 747–752, 1995.
  • [17] S. G. Ficici. Solution concepts in coevolutionary algorithms. PhD thesis, Brandeis University, 2004.
  • [18] D. Fogel, T. Hays, S. Hahn, and J. Quon. A self-learning evolutionary chess program. Proceedings of the IEEE, 92(12):1947–1954, 2004.
  • [19] A. S. Fraenkel. Scenic trails ascending from sea-level nim to alpine chess and back. In Games of No Chance, volume 29 of Mathematical Sciences Research Institute Publications, pages 13–42. Cambridge University Press, 1996.
  • [20] A. S. Fraenkel and D. Lichtenstein. Computing a perfect strategy for n × n chess requires time exponential in n. Journal of Combinatorial Theory, Series A, 31(2):199–214, 1981.
  • [21] D. Gale. A curious nim-type game. American Mathematical Monthly, 81:876–879, 1974.
  • [22] I. García-Marco, K. Knauer, and L. P. Montejano. Chomp on generalized Kneser graphs and others. International Journal of Game Theory, 50(3):603–621, 2021.
  • [23] R. Gold, H. Branquinho, E. Hemberg, U.-M. O’Reilly, and P. García-Sánchez. Genetic programming and coevolution to play the Bomberman™ video game. In Applications of Evolutionary Computation, pages 765–779, 2023.
  • [24] D. Grier. Deciding the winner of an arbitrary finite poset game is PSPACE-complete. In Automata, Languages, and Programming, pages 497–503, 2013.
  • [25] P. M. Grundy. Mathematics and games. Eureka, 2:6–8, 1939.
  • [26] R. K. Guy. Impartial games. In Games of No Chance, volume 29 of Mathematical Sciences Research Institute Publications, pages 61–78. Cambridge University Press, 1996.
  • [27] R. K. Guy. What is a game? In Games of No Chance, volume 29 of Mathematical Sciences Research Institute Publications, pages 43–60. Cambridge University Press, 1996.
  • [28] S. N. Harris and D. R. Tauritz. Competitive coevolution for defense and security: Elo-based similar-strength opponent sampling. In Proceedings of the Genetic and Evolutionary Computation Conference Companion, GECCO ’21, page 1898–1906, 2021.
  • [29] A. Hauptman. Evolving search heuristics for combinatorial games with genetic programming. PhD thesis, Ben-Gurian University of the Negev, 2009.
  • [30] A. Hauptman and M. Sipper. GP-EndChess: using genetic programming to evolve chess endgame players. In Proceedings of the 8th European Conference on Genetic Programming, EuroGP ’05, page 120–131, 2005.
  • [31] M. A. Hevia Fajardo and P. K. Lehre. How fitness aggregation methods affect the performance of competitive CoEAs on bilinear problems. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’23, page 1593–1601, 2023.
  • [32] M. A. Hevia Fajardo, P. K. Lehre, and S. Lin. Runtime analysis of a co-evolutionary algorithm: Overcoming negative drift in maximin-optimisation. In Proceedings of the 17th Conference on Foundations of Genetic Algorithms, FOGA ’23, page 73–83, 2023.
  • [33] J. Hofbauer and K. Sigmund. Evolutionary game dynamics. Bulletin of the American Mathematical Society, 40(4):479–519, 2003.
  • [34] T. Jansen and R. P. Wiegand. The cooperative coevolutionary (1+1) EA. Evolutionary Computation, 12(4):405–434, 2004.
  • [35] W. Jaśkowski, K. Krawiec, and B. Wieloch. Fitnessless coevolution. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’08, page 355–362, 2008.
  • [36] W. Jaśkowski, M. Szubert, and P. Liskowski. Multi-criteria comparison of coevolution and temporal difference learning on othello. In Applications of Evolutionary Computation, pages 301–312, 2014.
  • [37] A. H. Jorgensen. Context and driving forces in the development of the early computer game Nimbi. IEEE Annals of the History of Computing, 31(3):44–53, 2009.
  • [38] K. Krawiec and M. Heywood. Solving complex problems with coevolutionary algorithms. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’20, page 832–858, 2020.
  • [39] J. Lange, M. Stanke, and M. Ebner. Co-evolution of spies and resistance fighters. In Applications of Evolutionary Computation, pages 487–502, 2022.
  • [40] P. K. Lehre. Runtime analysis of competitive co-evolutionary algorithms for maximin optimisation of a bilinear function. Algorithmica, 86(7):2352–2392, 2024.
  • [41] P. K. Lehre, M. A. Hevia Fajardo, J. Toutouh, E. Hemberg, and U.-M. O’Reilly. Analysis of a pairwise dominance coevolutionary algorithm and DefendIt. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’23, page 1027–1035, 2023.
  • [42] S. Lin and P. K. Lehre. Overcoming binary adversarial optimisation with competitive coevolution. In Parallel Problem Solving from Nature XVIII, 2024.
  • [43] A. Lubberts and R. Miikkulainen. Co-evolving a go-playing neural network. In Coevolution: Turning Adaptive Algorithms Upon Themselves, 2001.
  • [44] H. K. McCoy. The game of nim - the Nimatron. Carnegie Technical, page 14, February 1951.
  • [45] D. Michie. Experiments on the mechanization of game-learning part I: characterization of the model and its parameters. The Computer Journal, 6(3):232–236, 11 1963.
  • [46] G. A. Monroy, K. O. Stanley, and R. Miikkulainen. Coevolution of neural networks using a layered pareto archive. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’06, page 329–336, 2006.
  • [47] G. Nivasch. More on the Sprague-Grundy function for Whythoff’s game. In Games of No Chance 3, volume 56 of Mathematical Sciences Research Institute Publications, pages 377–410. Cambridge University Press, 2009.
  • [48] J. Noble and R. A. Watson. Pareto coevolution: using performance against coevolved opponents in a game as dimensions for pareto selection. In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’01, page 493–500, 2001.
  • [49] M. Pelikan, M. Hauschild, and F. G. Lobo. Estimation of distribution algorithms. In Springer Handbook of Computational Intelligence, pages 899–928. Springer, 2015.
  • [50] J. B. Pollack and A. D. Blair. Co-evolution in the successful learning of backgammon strategy. Machine Learning, 32(3):225–240, Sep 1998.
  • [51] E. Popovici, A. Bucci, R. P. Wiegand, and E. D. De Jong. Coevolutionary Principles, pages 987–1033. Springer, 2012.
  • [52] R. Redheffer. A machine for playing the game nim. The American Mathematical Monthly, 55(6):343–349, 1948.
  • [53] J. M. Robson. The complexity of go. In Proceedings of the IFIP 9th World Computer Congress on Information Processing, pages 413–417, 1983.
  • [54] J. M. Robson. N by N checkers is exptime complete. SIAM Journal on Computing, 13(2):252–267, 1984.
  • [55] C. D. Rosin and R. K. Belew. New methods for competitive coevolution. Evolutionary Computation, 5(1):1–29, 1997.
  • [56] L. Rougetet. A prehistory of nim. The College Mathematics Journal, 45(5):358–363, 2014.
  • [57] M. Saburov. On discrete-time replicator equations with nonlinear payoff functions. Dynamic Games and Applications, 12(2):643–661, 2022.
  • [58] F. Schuh. Spel van delers. Nieuw Tijdschrift voor Wiskunde, 39:299–304, 1952.
  • [59] A. J. Schwenk. Take-away games. The Fibonacci Quarterly, 8:225–234, 1970.
  • [60] D. Silver, T. Hubert, J. Schrittwieser, I. Antonoglou, M. Lai, A. Guez, M. Lanctot, L. Sifre, D. Kumaran, T. Graepel, T. Lillicrap, K. Simonyan, and D. Hassabis. A general reinforcement learning algorithm that masters chess, shogi, and go through self-play. Science, 362(6419):1140–1144, 2018.
  • [61] R. Sprague. Über mathematische kampfspiele. Tohoku Mathematical Journal, First Series, 41:438–444, 1935.
  • [62] R. Sprague. Über zwei abarten von nim. Tohoku Mathematical Journal, First Series, 43:351–354, 1937.
  • [63] M. Szubert, W. Jaśkowski, P. Liskowski, and K. Krawiec. The role of behavioral diversity and difficulty of opponents in coevolving game-playing agents. In Applications of Evolutionary Computation, pages 394–405, 2015.
  • [64] M. Szubert, W. Jaśkowski, and K. Krawiec. On scalability, generalization, and hybridization of coevolutionary learning: a case study for othello. IEEE Transactions on Computational Intelligence and AI in Games, 5(3):214–226, 2013.
  • [65] C. Witt. Upper bounds on the running time of the univariate marginal distribution algorithm on onemax. Algorithmica, 81:632–667, 2019.
  • [66] C. Witt. How majority-vote crossover and estimation-of-distribution algorithms cope with fitness valleys. Theoretical Computer Science, 940:18–42, 2023.
  • [67] I. M. Yaglom. Two games with matchsticks. In Kvant Selecta: Combinatorics, I, volume 17 of Mathematical World, pages 1–8. American Mathematical Society, 2001.

Appendix A Preliminary results

Here we provide two straightforward results that will be useful to quote throughout the proof of Theorem 5.2. The first is derived from the Chernoff bounds for binomial random variables given by Theorem A.1, which is in turn an immediate consequences of [6, Theorem 3.2]. We remark that the conclusions E1-E3 have been optimised for ease of integration with the proofs in this paper, rather than tightness of bound.

Theorem A.1.

If XBin(m,q)similar-to𝑋Bin𝑚𝑞X\sim\text{\emph{Bin}}(m,q)italic_X ∼ Bin ( italic_m , italic_q ), then for any t0𝑡0t\geqslant 0italic_t ⩾ 0 it holds that

(Xmqt)𝑋𝑚𝑞𝑡\displaystyle\mathbb{P}(X\leqslant mq-t)blackboard_P ( italic_X ⩽ italic_m italic_q - italic_t ) exp(t2/2mq),absentsuperscript𝑡22𝑚𝑞\displaystyle\leqslant\exp{\left(-\frac{t^{2}/2}{mq}\right)},⩽ roman_exp ( - divide start_ARG italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 end_ARG start_ARG italic_m italic_q end_ARG ) , (36)
(Xmq+t)𝑋𝑚𝑞𝑡\displaystyle\mathbb{P}(X\geqslant mq+t)blackboard_P ( italic_X ⩾ italic_m italic_q + italic_t ) exp(t2/2mq+t/3).absentsuperscript𝑡22𝑚𝑞𝑡3\displaystyle\leqslant\exp{\left(-\frac{t^{2}/2}{mq+t/3}\right)}.⩽ roman_exp ( - divide start_ARG italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 end_ARG start_ARG italic_m italic_q + italic_t / 3 end_ARG ) . (37)
Corollary A.2.

Suppose q[0,1/2]𝑞012q\in[0,1/2]italic_q ∈ [ 0 , 1 / 2 ] and XBin(μ,q)similar-to𝑋Bin𝜇𝑞X\sim\text{\emph{Bin}}(\mu,q)italic_X ∼ Bin ( italic_μ , italic_q ).

  1. E1

    For any α>0𝛼0\alpha>0italic_α > 0 and p[0,1/2]𝑝012p\in[0,1/2]italic_p ∈ [ 0 , 1 / 2 ] satisfying q(1+α)p𝑞1𝛼𝑝q\geqslant(1+\alpha)pitalic_q ⩾ ( 1 + italic_α ) italic_p,

    (X/μ(1+α/2)p)exp(α2μp8(1+α)).𝑋𝜇1𝛼2𝑝superscript𝛼2𝜇𝑝81𝛼\mathbb{P}(X/\mu\leqslant(1+\alpha/2)p)\leqslant\exp{\left(-\frac{\alpha^{2}% \mu p}{8(1+\alpha)}\right)}.blackboard_P ( italic_X / italic_μ ⩽ ( 1 + italic_α / 2 ) italic_p ) ⩽ roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ italic_p end_ARG start_ARG 8 ( 1 + italic_α ) end_ARG ) .
  2. E2

    For any α>0𝛼0\alpha>0italic_α > 0 and p[0,1/2]𝑝012p\in[0,1/2]italic_p ∈ [ 0 , 1 / 2 ] satisfying q(1α)p𝑞1𝛼𝑝q\leqslant(1-\alpha)pitalic_q ⩽ ( 1 - italic_α ) italic_p,

    (X/μ(1α/2)p)exp(α2μp8(1+α)).𝑋𝜇1𝛼2𝑝superscript𝛼2𝜇𝑝81𝛼\mathbb{P}(X/\mu\geqslant(1-\alpha/2)p)\leqslant\exp{\left(-\frac{\alpha^{2}% \mu p}{8(1+\alpha)}\right)}.blackboard_P ( italic_X / italic_μ ⩾ ( 1 - italic_α / 2 ) italic_p ) ⩽ roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ italic_p end_ARG start_ARG 8 ( 1 + italic_α ) end_ARG ) .
  3. E3

    For any α>0𝛼0\alpha>0italic_α > 0 and p[0,1/2]𝑝012p\in[0,1/2]italic_p ∈ [ 0 , 1 / 2 ] satisfying q(1α)p𝑞1𝛼𝑝q\geqslant(1-\alpha)pitalic_q ⩾ ( 1 - italic_α ) italic_p,

    (X/μ(12α)p)exp(α2μp/168(1+α/4)).𝑋𝜇12𝛼𝑝superscript𝛼2𝜇𝑝1681𝛼4\mathbb{P}(X/\mu\leqslant(1-2\alpha)p)\leqslant\exp{\left(-\frac{\alpha^{2}\mu p% /16}{8(1+\alpha/4)}\right)}.blackboard_P ( italic_X / italic_μ ⩽ ( 1 - 2 italic_α ) italic_p ) ⩽ roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ italic_p / 16 end_ARG start_ARG 8 ( 1 + italic_α / 4 ) end_ARG ) .
Proof.

For E1, let Y1Bin(μ,(1+α)p)similar-tosubscript𝑌1Bin𝜇1𝛼𝑝Y_{1}\sim\text{Bin}(\mu,(1+\alpha)p)italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ Bin ( italic_μ , ( 1 + italic_α ) italic_p ) so that XY1succeeds-or-equals𝑋subscript𝑌1X\succcurlyeq Y_{1}italic_X ≽ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. We then have

(X/μ(1+α/2)p)𝑋𝜇1𝛼2𝑝\displaystyle\mathbb{P}(X/\mu\leqslant(1+\alpha/2)p)blackboard_P ( italic_X / italic_μ ⩽ ( 1 + italic_α / 2 ) italic_p ) =(X(1+α/2)pμ)(Y1(1+α/2)pμ)absent𝑋1𝛼2𝑝𝜇subscript𝑌11𝛼2𝑝𝜇\displaystyle=\mathbb{P}(X\leqslant(1+\alpha/2)p\mu)\leqslant\mathbb{P}(Y_{1}% \leqslant(1+\alpha/2)p\mu)= blackboard_P ( italic_X ⩽ ( 1 + italic_α / 2 ) italic_p italic_μ ) ⩽ blackboard_P ( italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⩽ ( 1 + italic_α / 2 ) italic_p italic_μ )
=(Y1(1+α)pμαpμ/2)(36)exp(α2p2μ2/8μ(1+α)p)exp(α2pμ8(1+α)),absentsubscript𝑌11𝛼𝑝𝜇𝛼𝑝𝜇2italic-(36italic-)superscript𝛼2superscript𝑝2superscript𝜇28𝜇1𝛼𝑝superscript𝛼2𝑝𝜇81𝛼\displaystyle=\mathbb{P}(Y_{1}\leqslant(1+\alpha)p\mu-\alpha p\mu/2)\overset{% \eqref{eq:chernoff-1}}{\leqslant}\exp{\left(-\frac{\alpha^{2}p^{2}\mu^{2}/8}{% \mu(1+\alpha)p}\right)}\leqslant\exp{\left(-\frac{\alpha^{2}p\mu}{8(1+\alpha)}% \right)},= blackboard_P ( italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⩽ ( 1 + italic_α ) italic_p italic_μ - italic_α italic_p italic_μ / 2 ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 end_ARG start_ARG italic_μ ( 1 + italic_α ) italic_p end_ARG ) ⩽ roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p italic_μ end_ARG start_ARG 8 ( 1 + italic_α ) end_ARG ) ,

as required. For E2, let Y2Bin(μ,(1α)p)similar-tosubscript𝑌2Bin𝜇1𝛼𝑝Y_{2}\sim\text{Bin}(\mu,(1-\alpha)p)italic_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∼ Bin ( italic_μ , ( 1 - italic_α ) italic_p ) so that XY2precedes-or-equals𝑋subscript𝑌2X\preccurlyeq Y_{2}italic_X ≼ italic_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. We then have

(X/μ(1α/2)p)𝑋𝜇1𝛼2𝑝\displaystyle\mathbb{P}(X/\mu\geqslant(1-\alpha/2)p)blackboard_P ( italic_X / italic_μ ⩾ ( 1 - italic_α / 2 ) italic_p ) =(X(1α/2)pμ)(Y2(1α/2)pμ)absent𝑋1𝛼2𝑝𝜇subscript𝑌21𝛼2𝑝𝜇\displaystyle=\mathbb{P}(X\geqslant(1-\alpha/2)p\mu)\leqslant\mathbb{P}(Y_{2}% \geqslant(1-\alpha/2)p\mu)= blackboard_P ( italic_X ⩾ ( 1 - italic_α / 2 ) italic_p italic_μ ) ⩽ blackboard_P ( italic_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⩾ ( 1 - italic_α / 2 ) italic_p italic_μ )
=(Y1(1α)pμ+αpμ/2)(37)exp(α2p2μ2/8μ(1α)p+(αpμ/6))absentsubscript𝑌11𝛼𝑝𝜇𝛼𝑝𝜇2italic-(37italic-)superscript𝛼2superscript𝑝2superscript𝜇28𝜇1𝛼𝑝𝛼𝑝𝜇6\displaystyle=\mathbb{P}(Y_{1}\geqslant(1-\alpha)p\mu+\alpha p\mu/2)\overset{% \eqref{eq:chernoff-2}}{\leqslant}\exp{\left(-\frac{\alpha^{2}p^{2}\mu^{2}/8}{% \mu(1-\alpha)p+(\alpha p\mu/6)}\right)}= blackboard_P ( italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⩾ ( 1 - italic_α ) italic_p italic_μ + italic_α italic_p italic_μ / 2 ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 end_ARG start_ARG italic_μ ( 1 - italic_α ) italic_p + ( italic_α italic_p italic_μ / 6 ) end_ARG )
exp(α2pμ8(1+α)),absentsuperscript𝛼2𝑝𝜇81𝛼\displaystyle\leqslant\exp{\left(-\frac{\alpha^{2}p\mu}{8(1+\alpha)}\right)},⩽ roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p italic_μ end_ARG start_ARG 8 ( 1 + italic_α ) end_ARG ) ,

as required. For E3, let Y3Bin(μ,(1α)p)similar-tosubscript𝑌3Bin𝜇1𝛼𝑝Y_{3}\sim\text{Bin}(\mu,(1-\alpha)p)italic_Y start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ∼ Bin ( italic_μ , ( 1 - italic_α ) italic_p ) so that XY3succeeds-or-equals𝑋subscript𝑌3X\succcurlyeq Y_{3}italic_X ≽ italic_Y start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT. We then have

(X/μ(12α)p)𝑋𝜇12𝛼𝑝\displaystyle\mathbb{P}(X/\mu\leqslant(1-2\alpha)p)blackboard_P ( italic_X / italic_μ ⩽ ( 1 - 2 italic_α ) italic_p ) =(X(12α)pμ)(Y3(12α)pμ)absent𝑋12𝛼𝑝𝜇subscript𝑌312𝛼𝑝𝜇\displaystyle=\mathbb{P}(X\leqslant(1-2\alpha)p\mu)\leqslant\mathbb{P}(Y_{3}% \leqslant(1-2\alpha)p\mu)= blackboard_P ( italic_X ⩽ ( 1 - 2 italic_α ) italic_p italic_μ ) ⩽ blackboard_P ( italic_Y start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ⩽ ( 1 - 2 italic_α ) italic_p italic_μ )
=(Y3(1α)pμαpμ)(36)exp(α2p2μ2/2μ(1α)p)exp(α2pμ/168(1+α/4)),absentsubscript𝑌31𝛼𝑝𝜇𝛼𝑝𝜇italic-(36italic-)superscript𝛼2superscript𝑝2superscript𝜇22𝜇1𝛼𝑝superscript𝛼2𝑝𝜇1681𝛼4\displaystyle=\mathbb{P}(Y_{3}\leqslant(1-\alpha)p\mu-\alpha p\mu)\overset{% \eqref{eq:chernoff-1}}{\leqslant}\exp{\left(-\frac{\alpha^{2}p^{2}\mu^{2}/2}{% \mu(1-\alpha)p}\right)}\leqslant\exp{\left(-\frac{\alpha^{2}p\mu/16}{8(1+% \alpha/4)}\right)},= blackboard_P ( italic_Y start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ⩽ ( 1 - italic_α ) italic_p italic_μ - italic_α italic_p italic_μ ) start_OVERACCENT italic_( italic_) end_OVERACCENT start_ARG ⩽ end_ARG roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 end_ARG start_ARG italic_μ ( 1 - italic_α ) italic_p end_ARG ) ⩽ roman_exp ( - divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_p italic_μ / 16 end_ARG start_ARG 8 ( 1 + italic_α / 4 ) end_ARG ) ,

as required. ∎

Lemma A.3.

Given γ[0,12)𝛾012\gamma\in[0,\textstyle{\frac{1}{2}})italic_γ ∈ [ 0 , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ), let g:[γ,1γ]0:𝑔𝛾1𝛾subscriptabsent0g:[\gamma,1-\gamma]\to\mathbb{R}_{\geqslant 0}italic_g : [ italic_γ , 1 - italic_γ ] → blackboard_R start_POSTSUBSCRIPT ⩾ 0 end_POSTSUBSCRIPT be given by

g(y)=log(y1y)log(γ1γ).𝑔𝑦𝑦1𝑦𝛾1𝛾g(y)=\log{\left(\frac{y}{1-y}\right)}-\log{\left(\frac{\gamma}{1-\gamma}\right% )}.italic_g ( italic_y ) = roman_log ( divide start_ARG italic_y end_ARG start_ARG 1 - italic_y end_ARG ) - roman_log ( divide start_ARG italic_γ end_ARG start_ARG 1 - italic_γ end_ARG ) .

Then, the following properties hold.

  1. F1

    If y[γ,12]𝑦𝛾12y\in[\gamma,\textstyle{\frac{1}{2}}]italic_y ∈ [ italic_γ , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ] and a[0,1)𝑎01a\in[0,1)italic_a ∈ [ 0 , 1 ), then g((1+a)y)g(y)a/2𝑔1𝑎𝑦𝑔𝑦𝑎2g((1+a)y)-g(y)\geqslant a/2italic_g ( ( 1 + italic_a ) italic_y ) - italic_g ( italic_y ) ⩾ italic_a / 2.

  2. F2

    If y[12,1γ]𝑦121𝛾y\in[\textstyle{\frac{1}{2}},1-\gamma]italic_y ∈ [ divide start_ARG 1 end_ARG start_ARG 2 end_ARG , 1 - italic_γ ] and a[0,1)𝑎01a\in[0,1)italic_a ∈ [ 0 , 1 ), then g(1(1+a)y)g(1y)a/2𝑔11𝑎𝑦𝑔1𝑦𝑎2g(1-(1+a)y)-g(1-y)\geqslant a/2italic_g ( 1 - ( 1 + italic_a ) italic_y ) - italic_g ( 1 - italic_y ) ⩾ italic_a / 2.

  3. F3

    maxy[γ,1γ]g(y)2log(1/γ)subscript𝑦𝛾1𝛾𝑔𝑦21𝛾\max_{y\in[\gamma,1-\gamma]}g(y)\leqslant 2\log{(1/\gamma)}roman_max start_POSTSUBSCRIPT italic_y ∈ [ italic_γ , 1 - italic_γ ] end_POSTSUBSCRIPT italic_g ( italic_y ) ⩽ 2 roman_log ( 1 / italic_γ ).

Proof.

F1: If y[γ,12]𝑦𝛾12y\in[\gamma,\textstyle{\frac{1}{2}}]italic_y ∈ [ italic_γ , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ] and a[0,1)𝑎01a\in[0,1)italic_a ∈ [ 0 , 1 ), then

g((1+a)y)g(y)𝑔1𝑎𝑦𝑔𝑦\displaystyle g((1+a)y)-g(y)italic_g ( ( 1 + italic_a ) italic_y ) - italic_g ( italic_y ) =log((1+a)y1(1+a)y)log(y1y)=log((1+a)(1y)1(1+a)y)absent1𝑎𝑦11𝑎𝑦𝑦1𝑦1𝑎1𝑦11𝑎𝑦\displaystyle=\log{\left(\frac{(1+a)y}{1-(1+a)y}\right)}-\log{\left(\frac{y}{1% -y}\right)}=\log{\left(\frac{(1+a)(1-y)}{1-(1+a)y}\right)}= roman_log ( divide start_ARG ( 1 + italic_a ) italic_y end_ARG start_ARG 1 - ( 1 + italic_a ) italic_y end_ARG ) - roman_log ( divide start_ARG italic_y end_ARG start_ARG 1 - italic_y end_ARG ) = roman_log ( divide start_ARG ( 1 + italic_a ) ( 1 - italic_y ) end_ARG start_ARG 1 - ( 1 + italic_a ) italic_y end_ARG )
=log(1+a1(1+a)y)log(1+a)a/2.absent1𝑎11𝑎𝑦1𝑎𝑎2\displaystyle=\log{\left(1+\frac{a}{1-(1+a)y}\right)}\geqslant\log{(1+a)}% \geqslant a/2.= roman_log ( 1 + divide start_ARG italic_a end_ARG start_ARG 1 - ( 1 + italic_a ) italic_y end_ARG ) ⩾ roman_log ( 1 + italic_a ) ⩾ italic_a / 2 .

F2: If y[12,1γ]𝑦121𝛾y\in[\textstyle{\frac{1}{2}},1-\gamma]italic_y ∈ [ divide start_ARG 1 end_ARG start_ARG 2 end_ARG , 1 - italic_γ ] and a[0,1)𝑎01a\in[0,1)italic_a ∈ [ 0 , 1 ), then

g(1(1a)y)g(1y)𝑔11𝑎𝑦𝑔1𝑦\displaystyle g(1-(1-a)y)-g(1-y)italic_g ( 1 - ( 1 - italic_a ) italic_y ) - italic_g ( 1 - italic_y ) =log(1(1a)y(1a)y)log(1yy)=log(1(1a)y(1a)(1y))absent11𝑎𝑦1𝑎𝑦1𝑦𝑦11𝑎𝑦1𝑎1𝑦\displaystyle=\log{\left(\frac{1-(1-a)y}{(1-a)y}\right)}-\log{\left(\frac{1-y}% {y}\right)}=\log{\left(\frac{1-(1-a)y}{(1-a)(1-y)}\right)}= roman_log ( divide start_ARG 1 - ( 1 - italic_a ) italic_y end_ARG start_ARG ( 1 - italic_a ) italic_y end_ARG ) - roman_log ( divide start_ARG 1 - italic_y end_ARG start_ARG italic_y end_ARG ) = roman_log ( divide start_ARG 1 - ( 1 - italic_a ) italic_y end_ARG start_ARG ( 1 - italic_a ) ( 1 - italic_y ) end_ARG )
=log(1+a(1a)(1y))log(1+a)a/2.absent1𝑎1𝑎1𝑦1𝑎𝑎2\displaystyle=\log{\left(1+\frac{a}{(1-a)(1-y)}\right)}\geqslant\log{(1+a)}% \geqslant a/2.= roman_log ( 1 + divide start_ARG italic_a end_ARG start_ARG ( 1 - italic_a ) ( 1 - italic_y ) end_ARG ) ⩾ roman_log ( 1 + italic_a ) ⩾ italic_a / 2 .

F3: Because g𝑔gitalic_g is an increasing function,

maxy[γ,1γ]g(y)=g(1γ)=log(1γγ)log(γ1γ)=2log(1γγ)2log(1/γ),subscript𝑦𝛾1𝛾𝑔𝑦𝑔1𝛾1𝛾𝛾𝛾1𝛾21𝛾𝛾21𝛾\max_{y\in[\gamma,1-\gamma]}g(y)=g(1-\gamma)=\log{\left(\frac{1-\gamma}{\gamma% }\right)}-\log{\left(\frac{\gamma}{1-\gamma}\right)}=2\log{\left(\frac{1-% \gamma}{\gamma}\right)}\leqslant 2\log{(1/\gamma)},roman_max start_POSTSUBSCRIPT italic_y ∈ [ italic_γ , 1 - italic_γ ] end_POSTSUBSCRIPT italic_g ( italic_y ) = italic_g ( 1 - italic_γ ) = roman_log ( divide start_ARG 1 - italic_γ end_ARG start_ARG italic_γ end_ARG ) - roman_log ( divide start_ARG italic_γ end_ARG start_ARG 1 - italic_γ end_ARG ) = 2 roman_log ( divide start_ARG 1 - italic_γ end_ARG start_ARG italic_γ end_ARG ) ⩽ 2 roman_log ( 1 / italic_γ ) ,

as required. ∎