03 Adversarial Search
*************************
- GAMES -
*************************
[~] Covers competitive environments, in which the agents' goals are in conflict,
giving rise to adversarial search problems.
[~] Begins with a definition of the optimal move and an algorithm for finding it,
then looks at techniques for choosing a good move when time is limited.
[-] Pruning allows us to ignore portions of the search tree that make no
difference to the final choice.
[-] Heuristic evaluation functions allow us to approximate the true utility of
a state without doing a complete search.
[~] Consider two-player games between MAX and MIN. A game can be formally
defined by the following elements (see the sketch after this list):
[-] S0: the initial state
[-] PLAYER(S): defines which player has the move in state S
[-] ACTIONS(S): the set of legal moves in a state
[-] RESULT(S, a): the transition model, defines the result of a move
[-] TERMINAL-TEST(S): true when the game is over.
[-] UTILITY(S, p): the utility function defines the final numeric value for a game
that ends in terminal state S for player p (e.g., in chess: 1 for a win, 0 for a
loss, 1/2 for a draw).
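A minimal Python sketch of this interface, using a toy "subtraction game"
(players alternately remove 1 or 2 stones; whoever takes the last stone wins).
All names here, such as NimGame, are illustrative assumptions, not from the text:

    class NimGame:
        """Toy game implementing the formal elements S0, PLAYER, ACTIONS,
        RESULT, TERMINAL-TEST, and UTILITY described above."""

        def initial_state(self):            # S0
            return (7, 'MAX')               # (stones left, player to move)

        def player(self, s):                # PLAYER(S)
            return s[1]

        def actions(self, s):               # ACTIONS(S): legal moves
            return [a for a in (1, 2) if a <= s[0]]

        def result(self, s, a):             # RESULT(S, a): transition model
            return (s[0] - a, 'MIN' if s[1] == 'MAX' else 'MAX')

        def terminal_test(self, s):         # TERMINAL-TEST(S)
            return s[0] == 0

        def utility(self, s, p):            # UTILITY(S, p): 1 win, 0 loss
            # The player who just took the last stone wins, so the player
            # to move in the terminal state has lost.
            return 0 if s[1] == p else 1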
**********************************************
- OPTIMAL DECISIONS IN GAMES -
**********************************************
[~] In game parlance, one move consists of two half-moves, one from each
player; each half-move is called a ply. A tree that is one move deep is thus
two plies deep.
[~] The optimal strategy can be determined from the minimax value of each node,
MINIMAX(n): the utility of being in the corresponding state, assuming both
players play optimally.
[-] The minimax value of a terminal state is simply its utility.
[-] MINIMAX(S) =
        UTILITY(S)                          if TERMINAL-TEST(S)
        max_a MINIMAX(RESULT(S, a))         if PLAYER(S) = MAX
        min_a MINIMAX(RESULT(S, a))         if PLAYER(S) = MIN
[~] The minimax algorithm computes the minimax decision from the current state
(see the sketch after this list).
[-] Uses a simple recursive computation of the minimax values of each successor
state.
[-] Minimax values are backed up through the tree as the recursion unwinds.
[-] Performs a complete depth-first exploration of the game tree => time
complexity O(b^m), where b is the number of legal moves at each point and m is
the maximum depth of the tree.
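A straightforward recursive implementation of the minimax decision, written
against the hypothetical game interface sketched earlier (a sketch of the
idea, not the textbook's pseudocode verbatim):

    def minimax_decision(game, state):
        """Return the action for MAX that leads to the highest minimax value."""
        return max(game.actions(state),
                   key=lambda a: min_value(game, game.result(state, a)))

    def max_value(game, state):
        if game.terminal_test(state):
            return game.utility(state, 'MAX')
        return max(min_value(game, game.result(state, a))
                   for a in game.actions(state))

    def min_value(game, state):
        if game.terminal_test(state):
            return game.utility(state, 'MAX')
        return min(max_value(game, game.result(state, a))
                   for a in game.actions(state))

For the toy game above, minimax_decision(NimGame(), (7, 'MAX')) should pick a
move that leaves the opponent a multiple of 3 stones, the losing position in
the subtraction game.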
[~] For multiplayer games, replace the single value at each node with a vector
of values (sketched below).
[-] The backed-up value of a node n is always the utility vector of the
successor state with the highest value for the player choosing at n.
[-] Multiplayer games usually involve alliances: collaboration can emerge from
purely selfish behavior.
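A short sketch of this multiplayer backup rule, assuming a hypothetical helper
game.utility_vector(s) that returns a mapping from each player to that
player's payoff:

    def vector_value(game, state):
        """Back up the utility vector of the successor that is best for
        the player choosing at this node (purely selfish play)."""
        if game.terminal_test(state):
            return game.utility_vector(state)   # hypothetical helper
        p = game.player(state)
        return max((vector_value(game, game.result(state, a))
                    for a in game.actions(state)),
                   key=lambda v: v[p])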
**************************************
- ALPHA-BETA PRUNING -
**************************************
[~] Problem with minimax search: the number of game states is exponential in
the depth of the tree.
[-] We cannot eliminate the exponent, but we can effectively cut it in half.
[-] It is possible to compute the correct minimax decision without looking at
every node.
=> Alpha-beta pruning.
[~] Alpha-beta pruning can be applied to trees of any depth, and it is often
possible to prune entire subtrees.
[~] General principle: consider a node n somewhere in the tree, such that
Player has a choice of moving to it.
[-] If Player has a better choice m, either at the parent node of n or at any
choice point further up, then n will never be reached in actual play.
[~] alpha: the value of the best choice found so far at any choice point along
the path for MAX
beta: the value of the best choice found so far at any choice point along
the path for MIN
[-] Alpha-beta search updates the values of alpha and beta as it goes along,
and prunes the remaining branches at a node as soon as the value of the current
node is known to be worse than the current alpha or beta value for MAX or MIN,
respectively (see the sketch below).
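A sketch of alpha-beta search over the same hypothetical game interface;
alpha and beta are the bounds described above, and the early returns are the
prunes:

    def alpha_beta_decision(game, state):
        """Pick MAX's move using alpha-beta pruning."""
        return max(game.actions(state),
                   key=lambda a: ab_min(game, game.result(state, a),
                                        float('-inf'), float('inf')))

    def ab_max(game, state, alpha, beta):
        if game.terminal_test(state):
            return game.utility(state, 'MAX')
        v = float('-inf')
        for a in game.actions(state):
            v = max(v, ab_min(game, game.result(state, a), alpha, beta))
            if v >= beta:       # MIN above would never let play reach here
                return v        # prune the remaining successors
            alpha = max(alpha, v)
        return v

    def ab_min(game, state, alpha, beta):
        if game.terminal_test(state):
            return game.utility(state, 'MAX')
        v = float('inf')
        for a in game.actions(state):
            v = min(v, ab_max(game, game.result(state, a), alpha, beta))
            if v <= alpha:      # MAX above already has a better choice m
                return v        # prune the remaining successors
            beta = min(beta, v)
        return v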
[~] Move ordering: the effectiveness of alpha-beta pruning is highly dependent
on the order in which states are examined.
[-] It may be worthwhile to examine first the successors that are likely to
be best.
[-] If this can be done, time complexity reduces to O(b^(m/2)): the effective
branching factor becomes sqrt(b) instead of b.
[+] Alpha-beta can then solve a tree twice as deep as minimax in the same
amount of time.
[-] Can add dynamic move-ordering schemes: try first the moves that were
found to be best in the past.
[+] Can apply iterative deepening search: search 1 ply deep first, then
2 plies, and so on.
[-] Repeated states may occur frequently due to transpositions: different
permutations of the move sequence that end up in the same position.
[+] Worthwhile to store the evaluation of the resulting position in a hash
table the first time it is encountered => transposition table (sketched below).
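A minimal sketch of a transposition table: memoized minimax, where each
position is evaluated once even if reached by several move orders. Combining
a table with alpha-beta is subtler in practice, since a pruned search yields
only a bound; real engines therefore store the bound type and search depth as
well:

    def minimax_tt(game, state, table):
        """Minimax with a transposition table (assumes states are hashable)."""
        if state in table:
            return table[state]          # position already evaluated
        if game.terminal_test(state):
            v = game.utility(state, 'MAX')
        elif game.player(state) == 'MAX':
            v = max(minimax_tt(game, game.result(state, a), table)
                    for a in game.actions(state))
        else:
            v = min(minimax_tt(game, game.result(state, a), table)
                    for a in game.actions(state))
        table[state] = v                 # store for future transpositions
        return v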
*************************************************
- IMPERFECT REAL-TIME DECISIONS -
*************************************************
[~] Should cut off the search earlier and apply a heuristic evaluation function
to states.
[-] Replace the utility function by a heuristic evaluation function EVAL,
which estimates the position's utility.
[-] Replace the terminal test by a cutoff test that decides when to apply EVAL.
[~] Evaluation function: estimates the expected utility of the game from a
given position.
[-] First, the evaluation function should order terminal states the same way
the true utility function does: win states must be evaluated better than draws.
[-] Second, the computation must not take too long!
[-] Finally, for nonterminal states, the evaluation function should be strongly
correlated with the actual chances of winning.
[-] Most evaluation functions compute separate numerical contributions from
each feature and then combine them to find the total value (see the sketch
below).
[+] Mathematically, this is called a weighted linear function:
EVAL(s) = w1*f1(s) + w2*f2(s) + ... + wn*fn(s), where fi(s) is a feature
of the state and wi is the weight of the corresponding feature.
[+] Can lead to errors due to the approximate nature of the evaluation
function.
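A small sketch of a weighted linear evaluation function; the material features
and weights below are illustrative chess conventions, not values from the
text:

    def eval_weighted_linear(state, features, weights):
        """EVAL(s) = w1*f1(s) + w2*f2(s) + ... + wn*fn(s)."""
        return sum(w * f(state) for w, f in zip(weights, features))

    # Example: material balance from MAX's point of view, where state is
    # assumed to be a dict of piece counts such as {'P': 8, 'p': 7, ...}.
    features = [
        lambda s: s.get('P', 0) - s.get('p', 0),   # pawn difference
        lambda s: s.get('N', 0) - s.get('n', 0),   # knight difference
        lambda s: s.get('Q', 0) - s.get('q', 0),   # queen difference
    ]
    weights = [1, 3, 9]   # classic material values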
[~] Need a more sophisticated cutoff test (a depth-limited sketch follows this
list):
[-] The EVAL function should only be applied to quiescent positions: those
unlikely to exhibit wild swings in value in the near future.
[-] Horizon effect: arises when facing an opponent's move that causes serious
damage and is ultimately unavoidable.
[+] Can be postponed temporarily by delaying tactics.
[+] Can mitigate the horizon effect with the singular extension: a move
that is "clearly better" than all others in a given position.
[-] Forward pruning: prune some moves at a node without further consideration.
[+] Can use beam search.
[+] Dangerous => use ProbCut (a forward-pruning approach based on
statistics gained from prior experience).
[-] Table lookup: rather than search, use lookup tables for the openings and
endings of games.
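A depth-limited sketch of heuristic minimax, where a cutoff test replaces the
terminal test and EVAL replaces UTILITY (the fixed depth limit here stands in
for a real quiescence-aware cutoff):

    def h_minimax(game, state, depth, eval_fn, cutoff_depth):
        """Heuristic minimax with a simple depth-based cutoff test."""
        if game.terminal_test(state):
            return game.utility(state, 'MAX')
        if depth >= cutoff_depth:        # CUTOFF-TEST(state, depth)
            return eval_fn(state)        # apply EVAL instead of searching on
        values = [h_minimax(game, game.result(state, a), depth + 1,
                            eval_fn, cutoff_depth)
                  for a in game.actions(state)]
        return max(values) if game.player(state) == 'MAX' else min(values)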
************************************
- STOCHASTIC GAMES -
************************************
[~] Unpredictable external events can put us into unforeseen situations
=> stochastic games.
[~] The game tree must include chance nodes in addition to MAX and MIN nodes.
[-] Branches leading from each chance node denote the possible dice rolls, for
example.
[~] We still need to make correct decisions, but positions no longer have
definite minimax values.
[-] We can, however, calculate the expected value of a position.
[-] This leads us to generalize the minimax value for deterministic games to an
expectiminimax value for games with chance nodes.
[+] Terminal, MAX, and MIN nodes (for which the dice roll is known) work
exactly the same way as before.
[+] For chance nodes, we compute the expected value: the sum of the values
over all outcomes, weighted by the probability of each outcome.
EXPECTIMINIMAX(S) =
        UTILITY(S)                                 if TERMINAL-TEST(S)
        max_a EXPECTIMINIMAX(RESULT(S, a))         if PLAYER(S) = MAX
        min_a EXPECTIMINIMAX(RESULT(S, a))         if PLAYER(S) = MIN
        sum_r P(r) * EXPECTIMINIMAX(RESULT(S, r))  if PLAYER(S) = CHANCE
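A sketch of expectiminimax, assuming a hypothetical helper
game.chance_outcomes(s) that yields (outcome, probability) pairs such as dice
rolls:

    def expectiminimax(game, state):
        """MAX/MIN nodes back up max/min as before; chance nodes back up
        the probability-weighted average over all outcomes."""
        if game.terminal_test(state):
            return game.utility(state, 'MAX')
        player = game.player(state)
        if player == 'MAX':
            return max(expectiminimax(game, game.result(state, a))
                       for a in game.actions(state))
        if player == 'MIN':
            return min(expectiminimax(game, game.result(state, a))
                       for a in game.actions(state))
        # chance node: expected value, weighted by each outcome's probability
        return sum(p * expectiminimax(game, game.result(state, r))
                   for r, p in game.chance_outcomes(state))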
[~] The presence of chance nodes makes the evaluation function more sensitive:
[-] The program can behave totally differently if the scale of some evaluation
values is changed!
[-] To avoid this sensitivity, EVAL must be a positive linear transformation of
the probability of winning from a position.