The Alpha-Beta algorithm (Alpha-Beta Pruning, Alpha-Beta Heuristic ^{[1]} ) is a significant enhancement to the minimax search algorithm that eliminates the need to search large portions of the game tree applying a branch-and-bound technique. Remarkably, it does this without any potential of overlooking a better move. If one already has found a quite good move and search for alternatives, onerefutation is enough to avoid it. No need to look for even stronger refutations. The algorithm maintains two values, alpha and beta. They represent the minimum score that the maximizing player is assured of and the maximum score that the minimizing player is assured of respectively. Consider the following example...

Say it is White's turn to move, and we are searching to a depth of 2 (that is, we are consider all of White's moves, and all of Black's responses to each of those moves.) First we pick one of White's possible moves - let's call this Possible Move #1. We consider this move and every possible response to this move by black. After this analysis, we determine that the result of making Possible Move #1 is an even position. Then, we move on and consider another of White's possible moves (Possible Move #2.) When we consider the first possible counter-move by black, we discover that playing this results in black winning a Rook! In this situation, we can safely ignore all of Black's other possible responses to Possible Move #2 because we already know that Possible Move #1 is better. We really don't care exactly how much worse Possible Move #2 is. Maybe another possible response wins a Queen, but it doesn't matter because we know that we can achieve at least an even game by playing Possible Move #1. The full analysis of Possible Move #1 gave us a lower bound. We know that we can achieve at least that, so anything that is clearly worse can be ignored.

The situation becomes even more complicated, however, when we go to a search depth of 3 or greater, because now both players can make choices affecting the game tree. Now we have to maintain both a lower bound and an upper bound (called Alpha and Beta.) We maintain a lower bound because if a move is too bad we don't consider it. But we also have to maintain an upper bound because if a move at depth 3 or higher leads to a continuation that is too good, the other player won't allow it, because there was a better move higher up on the game tree that he could have played to avoid this situation. One player's lower bound is the other player's upper bound.

Savings

The savings of alpha beta can be considerable. If a standard minimax search tree has xnodes, an alpha beta tree in a well-written program can have a node count close to the square-root of x. How many nodes you can actually cut, however, depends on how well ordered your game tree is. If you always search the best possible move first, you eliminate the most of the nodes. Of course, we don't always know what the best move is, or we wouldn't have to search in the first place. Conversely, if we always searched worse moves before the better moves, we wouldn't be able to cut any part of the tree at all! For this reason, good move ordering is very important, and is the focus of a lot of the effort of writing a good chess program. As pointed out by Levin in 1961, assuming constantly b moves for each node visited and search depth n, the maximal number of leaves in alpha-beta is equivalent to minimax, b ^ n. Considering always the best move first, it is b ^ ceil(n/2) plus b ^ floor(n/2) minus one. The minimal number of leaves is shown in following table which also demonstrates the odd-even effect:

number of leaves with depth n and b = 40

depth

worst case

best case

n

0

1

1

1

40

40

2

1,600

79

3

64,000

1,639

4

2,560,000

3,199

5

102,400,000

65,569

6

4,096,000,000

127,999

7

163,840,000,000

2,623,999

8

6,553,600,000,000

5,119,999

History

Alpha-Beta was invented independently by several researchers and pioneers from the 50s ^{[3]}, and further research until the 80s, most notable by

Chess programs catch some of the human chess playing abilities but rely on the limited effective branching of the chess move tree. The ideas that work for chess are inadequate for go. Alpha-beta pruning characterizes human play, but it wasn't noticed by early chess programmers - Turing, Shannon, Pasta and Ulam, and Bernstein. We humans are not very good at identifying the heuristics we ourselves use. Approximations to alpha-beta used by Samuel, Newell and Simon, McCarthy. Proved equivalent to minimax by Hart and Levin, independently by Brudno. Knuth gives details.

It is interesting to convert this recursive procedure to an iterative (non-recursive) form by a sequence of mechanical transformations, and to apply simple optimizations which preserve program correctness. The resulting procedure is surprisingly simple, but not as easy to prove correct as the recursive form.

Implementation

Max versus Min

A C-like pseudo code implementation of the alpha-beta algorithm with distinct indirect recursive routines for the max- and min-player, similar to the minimax routines. Making and unmakingmoves is omitted, and should be done before and after the recursive calls. So called beta-cutoffs occur for the max-play, alpha-cutoffs for the min-player.

int alphaBetaMax(int alpha, int beta, int depthleft ){if( depthleft ==0)return evaluate();for( all moves){
score = alphaBetaMin( alpha, beta, depthleft -1);if( score >= beta )return beta;// fail hard beta-cutoffif( score > alpha )
alpha = score;// alpha acts like max in MiniMax}return alpha;}int alphaBetaMin(int alpha, int beta, int depthleft ){if( depthleft ==0)return-evaluate();for( all moves){
score = alphaBetaMax( alpha, beta, depthleft -1);if( score <= alpha )return alpha;// fail hard alpha-cutoffif( score < beta )
beta = score;// beta acts like min in MiniMax}return beta;}

Alpha-beta search tree with two alpha-cuts at min nodes ^{[11]}

Negamax Framework

Inside a negamax framework the routine looks simpler, but is not necessarily simpler to understand. Despite negating the returned score of the direct recursion, alpha of the min-player becomes minus beta of the max-player and vice versa, and the term alpha-cutoff or alpha-pruning is somehow diminished.

int alphaBeta(int alpha, int beta, int depthleft ){if( depthleft ==0)return quiesce( alpha, beta );for( all moves){
score =-alphaBeta(-beta, -alpha, depthleft -1);if( score >= beta )return beta;// fail hard beta-cutoffif( score > alpha )
alpha = score;// alpha acts like max in MiniMax}return alpha;}

Note #1: Notice the call to quiesce(). This performs a quiescence search, which makes the alpha-beta search much more stable.

Note #2: This function only returns the score for the position, not the best move. Normally, a different (but very similar) function is used for searching the root node. The SearchRoot function calls the alphaBeta function and returns both a score and a best move. Also, most search functions collect the principal variation not only for display purposes, but for a good guess as the leftmost path of the next iteration inside an iterative deepening framework.

Fail hard

Since alpha and beta act as hard bounds of the return value if depth left is greater zero in the above code samples, this is referred to a fail-hard-framework.

Outside the Bounds

Fail-Soft Alpha-Beta ^{[12]} may return scores outside the bounds, that is either greater than beta or less than alpha. It has to keep track of the best score, which might be below alpha.

The alpha-beta algorithm can also be improved. These enhancements come from the fact that if you restrict the window of scores that are interesting, you can achieve more cutoffs. Since move ordering is so much important, a technique applied outside of the search for this is iterative deepening boosted by a transposition table, and possibly aspiration windows. Other enhancements, applied within the search function, are further discussed.

Alexander Brudno (1963). Bounds and valuations for shortening the search of estimates. Problemy Kibernetiki (10) 141–150 and Problems of Cybernetics (10) 225–241

James R. Slagle (1963). Game Trees, M & N Minimaxing, and the M & N alpha-beta procedure. Artificial Intelligence Group Report 3, UCRL-4671, University of California

Peter W. Frey (1983). The Alpha-Beta Algorithm: Incremental Updating, Well-Behaved Evaluation Functions, and Non-Speculative Forward Pruning. Computer Game-Playing (ed. Max Bramer), pp. 285-289. Ellis Horwood Limited

Ingo Althöfer, Bernhard Balkenhol (1992). A Game Tree with Distinct Leaf Values which is Easy for the Alpha-Beta Algorithm. Artif. Intell. 52(2): 183-190

John Man (2001). Alpha Beta: How 26 Letters Shaped the Western World, from amazon

References

^Arthur Samuel (1967). Some Studies in Machine Learning. Using the Game of Checkers. II-Recent Progress. pdf, IBM Journal - November 1967,
on the name Alpha-Beta Heuristic pp. 602: So named by Prof. John McCarthy. This procedure was extensively investigated by Prof. John McCarthy and his students at M.I.T. but it has been inadequately described in the literature. It is, of course, not a heuristic at all, being a simple algorithmic procedure and actually only a special case of the more general "branch and bound" technique which was been rediscovered many times and which is currently being exploited in integer programming research.

Home * Search * Alpha-BetaAlpha-Betaalgorithm (Alpha-Beta Pruning, Alpha-Beta Heuristic^{[1]}) is a significant enhancement to the minimax search algorithm that eliminates the need to search large portions of the game tree applying a branch-and-bound technique. Remarkably, it does this without any potential of overlooking a better move. If one already has found a quite good move and search for alternatives,onerefutation is enough to avoid it. No need to look for even stronger refutations. The algorithm maintains two values, alpha and beta. They represent the minimum score that the maximizing player is assured of and the maximum score that the minimizing player is assured of respectively. Consider the following example...^{[2]}## Table of Contents

## How it works

Say it is White's turn to move, and we are searching to a depth of 2 (that is, we are consider all of White's moves, and all of Black's responses to each of those moves.) First we pick one of White's possible moves - let's call this Possible Move #1. We consider this move and every possible response to this move by black. After this analysis, we determine that the result of making Possible Move #1 is an even position. Then, we move on and consider another of White's possible moves (Possible Move #2.) When we consider the first possible counter-move by black, we discover that playing this results in black winning a Rook! In this situation, we can safely ignore all of Black's other possible responses to Possible Move #2 because we already know that Possible Move #1 is better. We really don't careexactlyhow much worse Possible Move #2 is. Maybe another possible response wins a Queen, but it doesn't matter because we know that we can achieveat leastan even game by playing Possible Move #1. The full analysis of Possible Move #1 gave us a lower bound. We know that we can achieve at least that, so anything that is clearly worse can be ignored.The situation becomes even more complicated, however, when we go to a search depth of 3 or greater, because now both players can make choices affecting the game tree. Now we have to maintain both a lower bound and an upper bound (called Alpha and Beta.) We maintain a lower bound because if a move is too bad we don't consider it. But we also have to maintain an upper bound because if a move at depth 3 or higher leads to a continuation that is too good, the other player won't allow it, because there was a better move higher up on the game tree that he could have played to avoid this situation. One player's lower bound is the other player's upper bound.

## Savings

The savings of alpha beta can be considerable. If a standard minimax search tree hasxnodes, an alpha beta tree in a well-written program can have a node count close to the square-root ofx. How many nodes you can actually cut, however, depends on how well ordered your game tree is. If you always search the best possible move first, you eliminate the most of the nodes. Of course, we don't always know what the best move is, or we wouldn't have to search in the first place. Conversely, if we always searched worse moves before the better moves, we wouldn't be able to cut any part of the tree at all! For this reason, good move ordering is very important, and is the focus of a lot of the effort of writing a good chess program. As pointed out by Levin in 1961, assuming constantlybmoves for each node visited and search depthn, the maximal number of leaves in alpha-beta is equivalent to minimax,b^n. Considering always the best move first, it isb^ ceil(n/2) plusb^ floor(n/2) minus one. The minimal number of leaves is shown in following table which also demonstrates the odd-even effect:n## History

Alpha-Beta was invented independently by several researchers and pioneers from the 50s^{[3]}, and further research until the 80s, most notable by^{[4]}^{[5]}^{[6]}^{[7]}Knuth and Moore's famous Function F2, aka AlphaBeta

Knuth already introduced an iterative solution, see Iterative Search

Knuth's node types

## Quotes

## McCarthy

Quote by John McCarthy fromHuman-Level AI is harder than it seemed in 1955^{[8]}:Chess programs catch some of the human chess playing abilities but rely on the limited effective branching of the chess move tree. The ideas that work for chess are inadequate for go. Alpha-beta pruning characterizes human play, but it wasn't noticed by early chess programmers - Turing, Shannon, Pasta and Ulam, and Bernstein. We humans are not very good at identifying the heuristics we ourselves use. Approximations to alpha-beta used by Samuel, Newell and Simon, McCarthy. Proved equivalent to minimax by Hart and Levin, independently by Brudno. Knuth gives details.## Ershov and Shura-Bura

Quote fromThe Early Development of Programming in the USSRby Andrey Ershov, Mikhail R. Shura-Bura^{[9]}At the end of the 1950's a group of Moscow mathematicians began a study of computerized chess. Sixteen years later, the studies would lead to victory in the first world chess tournament for computer programs held in Stockholm during the 1974 IFIP Congress. An important component of this success was a deep study of the problems of information organization in computer memory and of various search heuristics. G. M. Adelson-Velsky and E. M. Landis invented the binary search tree ("dichotomic inquiry") and A. L. Brudno, independent of J. McCarthy, discovered the (α,β)-heuristic for reducing search times on a game tree.## Knuth

Quote by Knuth^{[10]}:It is interesting to convert this recursive procedure to an iterative (non-recursive) form by a sequence of mechanical transformations, and to apply simple optimizations which preserve program correctness. The resulting procedure is surprisingly simple, but not as easy to prove correct as the recursive form.## Implementation

## Max versus Min

A C-like pseudo code implementation of the alpha-beta algorithm with distinct indirect recursive routines for the max- and min-player, similar to the minimax routines. Making and unmaking moves is omitted, and should be done before and after the recursive calls. So called beta-cutoffs occur for the max-play, alpha-cutoffs for the min-player.With this call from the Root:

^{[11]}## Negamax Framework

Inside a negamax framework the routine looks simpler, but is not necessarily simpler to understand. Despite negating the returned score of the direct recursion, alpha of the min-player becomes minus beta of the max-player and vice versa, and the term alpha-cutoff or alpha-pruning is somehow diminished.Note #1: Notice the call to quiesce(). This performs a quiescence search, which makes the alpha-beta search much more stable.Note #2: This function only returns the score for the position, not the best move. Normally, a different (but very similar) function is used for searching the root node. The SearchRoot function calls the alphaBeta function and returns both a score and a best move. Also, most search functions collect the principal variation not only for display purposes, but for a good guess as the leftmost path of the next iteration inside an iterative deepening framework.## Fail hard

Since alpha and beta act as hard bounds of the return value if depth left is greater zero in the above code samples, this is referred to a fail-hard-framework.## Outside the Bounds

Fail-Soft Alpha-Beta^{[12]}may return scores outside the bounds, that is either greater than beta or less than alpha. It has to keep track of the best score, which might be below alpha.## Enhancements

The alpha-beta algorithm can also be improved. These enhancements come from the fact that if you restrict the window of scores that are interesting, you can achieve more cutoffs. Since move ordering is so much important, a technique applied outside of the search for this is iterative deepening boosted by a transposition table, and possibly aspiration windows. Other enhancements, applied within the search function, are further discussed.## Obligatory

## Selectivity

## Scout and Friends

## Alpha-Beta goes Best-First

## See also

## Selected Publications

## 1958 ..

1958).Chess Playing Programs and the Problem of Complexity. IBM Journal of Research and Development, Vol. 4, No. 2, pp. 320-335. Reprinted (1963) in Computers and Thought (eds. Edward A. Feigenbaum and Julian Feldman), pp. 39-70. McGraw-Hill, New York, N.Y., pdf1959).Some Studies in Machine Learning Using the Game of Checkers. IBM Journal July 1959## 1960 ...

1961).The Alpha-Beta Heuristic, AIM-030, reprint available from DSpace at MIT. Retrieved on 2006-12-21.1963).Bounds and valuations for shortening the search of estimates. Problemy Kibernetiki (10) 141–150 and Problems of Cybernetics (10) 225–2411963).Game Trees, M & N Minimaxing, and the M & N alpha-beta procedure.Artificial Intelligence Group Report 3, UCRL-4671, University of California1969).Experiments With Some Programs That Search Game Trees. Journal of the ACM, Vol. 16, No. 2: 189-207, pdf## 1970 ...

1973).An Analysis of the Alpha-Beta Pruning Algorithm.Technical Report, Carnegie Mellon University, pdf1975).An analysis of alpha-beta pruning.Artificial Intelligence, 6:293–326,1975. Reprinted in Selected Papers on Analysis of Algorithms, Published2000, by Addison-Wesley ISBN: 1-57585-211-51978).An Analysis of the Full Alpha-Beta Pruning Algorithm. STOC 1978: 296-3131978).On the branching factor of the alpha-beta pruning algorithm. Artificial Intelligence 10## 1980 ...

1980).Intelligent Games. Creative Computing, Vol. 6, No. 4, hosted by the Internet Archive1981).Heuristic search theory: A survey of recent results. IJCAI-81, pdf1981).On the Average Number of Terminal Nodes examined by Alpha-Beta. Technical Report UCLA-ENG-CSL-8108, University of California at Los Angeles, Cognitive Systems Laboratory1982).The Solution for the Branching Factor of the Alpha-Beta Pruning Algorithm and its Optimality. Communications of the ACM, Vol. 25, No. 81983).The Alpha-Beta Algorithm: Incremental Updating, Well-Behaved Evaluation Functions, and Non-Speculative Forward Pruning. Computer Game-Playing (ed. Max Bramer), pp. 285-289. Ellis Horwood Limited1983).Another optimization of alpha-beta search. SIGART Bulletin, Issue 841984).Why Functional Programming Matters. 5 An Example from Artificial Intelligence, Chalmers Tekniska Högskola, Göteborg, pdf,1986).Generalization of Alpha-Beta and SSS* Search Procedures.Artificial Intelligence, Vol. 29, pp. 73-117. ISSN 0004-3702.## 1990 ...

1992).A Game Tree with Distinct Leaf Values which is Easy for the Alpha-Beta Algorithm. Artif. Intell. 52(2): 183-1901999).Scalable Search in Computer Chess. Morgan Kaufmann, ISBN 3-528-05732-7## 2000 ...

2004).Alpha-Beta Search Enhancements with a Real-Value Game-State Evaluation Function. ICGA Journal, Vol. 27, No. 1, pdf## 2010 ...

2012).Alpha-Beta Pruning for Games with Simultaneous Moves. AAAI 20122013).Analysis of pruned minimax trees. pdf » Late Move Reductions, Null Move Pruning## Forum Posts

## 1993 ...

## 1995 ...

Re: computer chess "oracle" ideas... by Ronald de Man, rgcc, April 3, 1997

Re: computer chess "oracle" ideas... by Ronald de Man, rgcc, April 7, 1997

## 2000 ...

^{[13]}## 2005 ...

## 2010 ...

## External Links

^{[14]}1972).Games Playing with Computers. Allen & Unwin, ISBN-13: 978-00802122272001).Alpha Beta: How 26 Letters Shaped the Western World, from amazon## References

1967).Some Studies in Machine Learning. Using the Game of Checkers. II-Recent Progress. pdf, IBM Journal - November 1967,on the name Alpha-Beta Heuristic pp. 602:

So named by Prof. John McCarthy. This procedure was extensively investigated by Prof. John McCarthy and his students at M.I.T. but it has been inadequately described in the literature. It is, of course, not a heuristic at all, being a simple algorithmic procedure and actually only a special case of the more general "branch and bound" technique which was been rediscovered many times and which is currently being exploited in integer programming research.2001).Science, Competition and Business. ICGA Journal, Vol. 24, No. 4, pdf19551961).The Alpha-Beta Heuristic, AIM-030, reprint available from DSpace at MIT. Retrieved on 2006-12-21.1973).An Analysis of the Alpha-Beta Pruning Algorithm.Technical Report, Carnegie Mellon University1975).An analysis of alpha-beta pruning.Artificial Intelligence, 6:293–3261980).The Early Development of Programming in the USSR. in Nicholas C. Metropolis (ed.)A History of Computing in the Twentieth Century. Academic Press, preprint pp. 441975).An analysis of alpha-beta pruning.Artificial Intelligence, 6:293–3261983).Another optimization of alpha-beta search. SIGART Bulletin, Issue 841999).Scalable Search in Computer Chess. Morgan Kaufmann, ISBN 3-528-05732-72009).Fifty-Plus Years of Combinatorial Integer Programming. pdfUp one level