
Optimization in DS
Katholische Universität Eichstätt-Ingolstadt
Winter semester 2023/2024
Thomas Jahn

Cheat sheet
Notation
∇f(x) gradient, ∇²f(x) Hessian
A ⪰ B means A − B is positive semidefinite
A ≻ B means A − B is positive definite
x ≥ 0 means that all entries of the vector x are ≥ 0.

Special functions and sets


linear function f(x) = Ax with a matrix A
affine function f(x) = Ax + b with a matrix A and a vector b
quadratic function f(x) = x⊤Ax + b⊤x + r with a matrix A, a vector b, and a number r
µ-strongly convex function (just “convex” if µ = 0):
f(λx + (1 − λ)y) + (µ/2)λ(1 − λ)∥x − y∥₂² ≤ λf(x) + (1 − λ)f(y) for all x, y ∈ ℝᵈ, λ ∈ (0, 1)
f cont. diff.: ⇐⇒ f(x) − f(y) ≥ ∇f(y)⊤(x − y) + (µ/2)∥x − y∥₂² for all x, y ∈ ℝᵈ
f twice cont. diff.: ⇐⇒ ∇²f(x) ⪰ µI for all x ∈ ℝᵈ

L-smooth function:
∥∇f(x) − ∇f(y)∥₂ ≤ L∥x − y∥₂ for all x, y ∈ ℝᵈ
f cont. diff.: ⇐⇒ f(y) + ∇f(y)⊤(x − y) − (L/2)∥x − y∥₂² ≤ f(x) ≤ f(y) + ∇f(y)⊤(x − y) + (L/2)∥x − y∥₂² for all x, y ∈ ℝᵈ
f twice cont. diff.: ⇐⇒ −LI ⪯ ∇²f(x) ⪯ LI for all x ∈ ℝᵈ

convex set S ⊂ ℝᵈ: λx + (1 − λ)y ∈ S for all x, y ∈ S and all λ ∈ [0, 1].
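For a quadratic f(x) = x⊤Ax + b⊤x + r with symmetric A, the Hessian is the constant matrix 2A, so the µ and L of the two characterizations above can be read off its eigenvalues. A minimal numpy sketch (the matrix is made-up illustration data):

    import numpy as np

    # Quadratic f(x) = x^T A x + b^T x + r with symmetric A; its Hessian is 2A.
    A = np.array([[2.0, 0.5],
                  [0.5, 1.0]])  # made-up symmetric matrix
    H = 2 * A                   # constant Hessian of f

    eigvals = np.linalg.eigvalsh(H)  # eigenvalues of the symmetric Hessian
    mu, L = eigvals.min(), eigvals.max()

    # mu > 0          =>  f is mu-strongly convex (∇²f ⪰ µI everywhere)
    # |eigenvalues| ≤ L  =>  f is L-smooth (−LI ⪯ ∇²f ⪯ LI)
    print(f"mu = {mu:.3f}, L = {L:.3f}")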

Algorithms, rates of convergence

• x(k) → x∗ q-linearly ⇐⇒ limk→∞ ∥x(k+1) − x∗∥₂ / ∥x(k) − x∗∥₂ ∈ (0, 1)
• x(k) → x∗ q-superlinearly ⇐⇒ limk→∞ ∥x(k+1) − x∗∥₂ / ∥x(k) − x∗∥₂ = 0
• The convergence is q-quadratic ⇐⇒ limk→∞ ∥x(k+1) − x∗∥₂ / ∥x(k) − x∗∥₂² < ∞
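These limits can be inspected numerically by printing successive error ratios. A small sketch with a made-up sequence x(k) = 2^(−k), which converges q-linearly to x∗ = 0 with ratio 1/2:

    import numpy as np

    # Made-up example: x(k) = 2**(-k) converges to x* = 0 q-linearly.
    x_star = 0.0
    xs = [2.0 ** (-k) for k in range(10)]

    errs = [abs(x - x_star) for x in xs]
    for k in range(len(errs) - 1):
        lin_ratio = errs[k + 1] / errs[k]        # tends to 0.5, in (0,1): q-linear
        quad_ratio = errs[k + 1] / errs[k] ** 2  # bounded would indicate q-quadratic
        print(f"k={k}: linear ratio {lin_ratio:.3f}, quadratic ratio {quad_ratio:.3f}")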

Unconstrained optimization
min f(x) over x ∈ ℝᵈ

• 1st-order necessary condition: x∗ ∈ ℝᵈ is a local minimizer of cont. diff. f : ℝᵈ → ℝ =⇒ ∇f(x∗) = 0
• 2nd-order necessary condition: x∗ ∈ ℝᵈ is a local minimizer of twice cont. diff. f : ℝᵈ → ℝ =⇒ ∇²f(x∗) ⪰ 0
• 2nd-order sufficient condition: f : ℝᵈ → ℝ twice cont. diff., ∇f(x∗) = 0, ∇²f(x∗) ≻ 0 =⇒ x∗ is a strict local minimizer of f
• convex objective function: locally optimal solutions are globally optimal; the set of optimal solutions is convex but may be empty
• strongly convex, cont. diff. objective function: there exists a unique optimal solution

Line search methods


Setup:
• general form: x(k+1) = x(k) + αk p(k)
• search directions: gradient method p(k) = −∇f(x(k)), Newton method p(k) = −∇²f(x(k))⁻¹∇f(x(k))
• step sizes: constant αk = α; Armijo αk = max{ᾱρʲ : j ∈ ℕ₀, f(x(k) + ᾱρʲp(k)) ≤ f(x(k)) + c₁ᾱρʲ∇f(x(k))⊤p(k)} (see the sketch below)
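A minimal numpy sketch of the gradient method with Armijo backtracking following the setup above (the test problem and the parameter choices ᾱ = 1, ρ = 0.5, c₁ = 10⁻⁴ are made-up illustrations, not prescribed here):

    import numpy as np

    def gradient_method_armijo(f, grad, x0, alpha_bar=1.0, rho=0.5, c1=1e-4,
                               tol=1e-8, max_iter=1000):
        """Gradient method x(k+1) = x(k) + alpha_k p(k) with p(k) = -grad f(x(k))
        and Armijo backtracking for alpha_k."""
        x = np.asarray(x0, dtype=float)
        for _ in range(max_iter):
            g = grad(x)
            if np.linalg.norm(g) <= tol:  # stationary point reached
                break
            p = -g                        # steepest-descent direction
            alpha = alpha_bar
            # Backtrack: shrink alpha by rho until the Armijo condition holds.
            while f(x + alpha * p) > f(x) + c1 * alpha * g @ p:
                alpha *= rho
            x = x + alpha * p
        return x

    # Made-up test problem: strongly convex quadratic with minimizer (1, -2).
    f = lambda x: (x[0] - 1) ** 2 + 2 * (x[1] + 2) ** 2
    grad = lambda x: np.array([2 * (x[0] - 1), 4 * (x[1] + 2)])
    print(gradient_method_armijo(f, grad, x0=[0.0, 0.0]))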

Theorems:
• f cont. diff., gradient method with Armijo step size =⇒ every accumulation point of (x(k))k∈ℕ₀ is a stationary point of f
• f cont. diff., µ-strongly convex, L-smooth, gradient method with αk = 1/L =⇒ (x(k))k∈ℕ₀ and (f(x(k)))k∈ℕ₀ converge q-linearly
• f twice cont. diff., ∇f(x∗) = 0, ∇²f(x∗) invertible =⇒ for x(0) ≈ x∗, αk = 1, and Newton directions p(k), the convergence x(k) → x∗ is q-superlinear (q-quadratic when ∇²f is Lipschitz)
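A minimal sketch of the local Newton method with αk = 1 (the test problem is made up, and a robust implementation would safeguard against a singular Hessian):

    import numpy as np

    def newton_method(grad, hess, x0, tol=1e-10, max_iter=50):
        """Local Newton method: x(k+1) = x(k) - hess f(x(k))^{-1} grad f(x(k))."""
        x = np.asarray(x0, dtype=float)
        for _ in range(max_iter):
            g = grad(x)
            if np.linalg.norm(g) <= tol:
                break
            p = np.linalg.solve(hess(x), -g)  # Newton direction, no explicit inverse
            x = x + p                          # step size alpha_k = 1
        return x

    # Made-up problem: f(x, y) = exp(x) - x + (y + 2)^2, minimizer (0, -2),
    # with invertible Hessian at the minimizer, as the theorem assumes.
    grad = lambda x: np.array([np.exp(x[0]) - 1, 2 * (x[1] + 2)])
    hess = lambda x: np.array([[np.exp(x[0]), 0.0], [0.0, 2.0]])
    print(newton_method(grad, hess, x0=[1.0, 0.0]))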
Linear optimization min c⊤x over x ∈ ℝᵈ s.t. constraints of the form a⊤x ≤ b, a⊤x ≥ b, or a⊤x = b for given a, c ∈ ℝᵈ and b ∈ ℝ.

• c is called the cost vector.


• standard form: min c⊤ x s.t. Ax = b and x ≥ 0. We require b ≥ 0.
• Locally optimal solutions are global ones. (Linearity implies convexity.)
• Set of optimal solutions = a face of the feasible set (a polyhedral set), i.e., a vertex, an edge, . . . , or the whole
feasible set.
• If there are optimal solutions, there is a solution among the vertices of the feasible set.
• simplex algorithm: cleverly traverses the vertices of the feasible set (a sketch of one iteration follows after this list)
• reduced costs: solve AB⊤y = cB, then zN = cN − AN⊤y

• Optimality test: zN ≥ 0 ⇒ STOP


• Choose k ∈ N with zk < 0 and solve AB w = Ak . Unboundedness test: w ≤ 0 ⇒ STOP.
• Ratio test: compute xj/wj for j ∈ B with wj > 0; call the smallest value t and the corresponding index r.
• Update xB ← xB − tw, xk ← t, k enters B, r leaves B.
• Phase 1 problem: min v1 + . . . + vm over (x, v) s.t. Ax + v = b, x ≥ 0, v ≥ 0, start simplex algorithm with
feasible basis solution (0, b)
• dual problem: max b⊤ y over (y, z) s.t. A⊤ y + z = c, z ≥ 0
• dual variables y, z are computed in the simplex algorithm
• weak duality: b⊤y ≤ c⊤x, i.e., dual values are less than or equal to primal values
• no duality gap: If b⊤ y = c⊤ x, then x is optimal for the primal problem, (y, z) optimal for the dual problem
• strong duality: If one problem is solvable, then both of them are, and the optimal values coincide. If one
problem is unbounded, then the other is infeasible.
• Lagrangian: L(x, y) = c⊤ x + (b − Ax)⊤ y
• KKT conditions: Ax = b, x ≥ 0, A⊤ y + z = c, z ≥ 0, x⊤ z = 0
• KKT conditions have a solution if and only if the primal problem has an optimal solution if and only if the
dual problem has an optimal solution.
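A minimal numpy sketch of one revised-simplex iteration following the steps above, including the dual variables y and reduced costs zN (dense linear algebra, no anti-cycling rule; the data layout is an assumption for illustration):

    import numpy as np

    def simplex_iteration(A, b, c, basis):
        """One revised-simplex step for min c^T x s.t. Ax = b, x >= 0.
        basis is a list of m column indices with A_B invertible.
        Returns (status, basis, x_B); the basic solution is recomputed each call."""
        m, n = A.shape
        nonbasis = [j for j in range(n) if j not in basis]
        A_B, A_N = A[:, basis], A[:, nonbasis]
        x_B = np.linalg.solve(A_B, b)             # current basic feasible solution
        y = np.linalg.solve(A_B.T, c[basis])      # dual variables: A_B^T y = c_B
        z_N = c[nonbasis] - A_N.T @ y             # reduced costs
        if np.all(z_N >= 0):                      # optimality test
            return "optimal", basis, x_B
        k = nonbasis[int(np.argmin(z_N))]         # entering index with z_k < 0
        w = np.linalg.solve(A_B, A[:, k])         # solve A_B w = A_k
        if np.all(w <= 0):                        # unboundedness test
            return "unbounded", basis, x_B
        # Ratio test: t = min x_j / w_j over w_j > 0; position i_r leaves.
        ratios = [(x_B[i] / w[i], i) for i in range(m) if w[i] > 0]
        t, i_r = min(ratios)
        basis = basis.copy()
        basis[i_r] = k                            # k enters B, r leaves B
        return "continue", basis, None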

Nonlinear constrained optimization min f(x) over x ∈ ℝᵈ s.t. gj(x) ≤ 0, hi(x) = 0 with f, gj, hi continuously
differentiable; let C be the feasible set.
• tangent cone TC(x) = { y ∈ ℝᵈ : ∃ (x(k))k∈ℕ ⊂ C, (αk)k∈ℕ ⊂ (0, ∞) with x(k) → x, αk → 0, (x(k) − x)/αk → y }
• first-order necessary optimality condition: x∗ is a locally optimal solution ⇒ ∇f(x∗)⊤y ≥ 0 for all y ∈ TC(x∗)
• polar cone: K◦ = { y : y⊤x ≤ 0 for all x ∈ K }

• first-order necessary optimality condition again: x∗ is a locally optimal solution ⇒ −∇f(x∗) ∈ TC(x∗)◦
• linearized tangent cone F(x) = { y ∈ ℝᵈ : ∇hi(x)⊤y = 0 ∀ i, ∇gj(x)⊤y ≤ 0 ∀ j with gj(x) = 0 }
• always TC (x) ⊆ F (x) and F (x)◦ ⊆ TC (x)◦
• F(x) depends on the constraint functions, TC(x) only depends on the feasible set
• F(x)◦ = { Σj:gj(x)=0 µj∇gj(x) + Σi λi∇hi(x) : µj ≥ 0, λi ∈ ℝ }
• F (x)◦ = TC (x)◦ is true when constraint qualifications are satisfied
• LICQ is satisfied at x: all the vectors ∇gj(x) and ∇hi(x) are linearly independent (all i, only those j for which gj(x) = 0); a numerical check is sketched below
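LICQ can be checked numerically by stacking the gradients of all equality constraints and the active inequality constraints and comparing the matrix rank with the number of rows. A small sketch (the constraint functions are made-up illustrations):

    import numpy as np

    def licq_holds(x, grad_g_list, g_list, grad_h_list, tol=1e-9):
        """Check LICQ at x: gradients of all equality constraints and of the
        active inequality constraints (g_j(x) = 0) must be linearly independent."""
        rows = [gh(x) for gh in grad_h_list]                   # all grad h_i(x)
        rows += [gg(x) for gg, g in zip(grad_g_list, g_list)
                 if abs(g(x)) <= tol]                          # active grad g_j(x)
        if not rows:
            return True                                        # nothing to check
        M = np.vstack(rows)
        return np.linalg.matrix_rank(M) == len(rows)

    # Made-up example: g1(x) = x1^2 + x2^2 - 1 <= 0, h1(x) = x1 - x2 = 0,
    # checked at the feasible boundary point x = (1/sqrt(2), 1/sqrt(2)).
    g1 = lambda x: x[0] ** 2 + x[1] ** 2 - 1
    grad_g1 = lambda x: np.array([2 * x[0], 2 * x[1]])
    h1 = lambda x: x[0] - x[1]
    grad_h1 = lambda x: np.array([1.0, -1.0])
    x = np.array([1, 1]) / np.sqrt(2)
    print(licq_holds(x, [grad_g1], [g1], [grad_h1]))  # True: gradients independent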
