Change Point Detection Seminar

The document discusses change point detection algorithms. It begins with background on change point problems and examples of common changes detected. It then classifies change point detection algorithms as online or offline, parametric or non-parametric. Key algorithms discussed include maximum likelihood estimation, binary segmentation, CUSUM, and change finder. Performance of algorithms is evaluated using metrics like accuracy, sensitivity, mean absolute error, and mean squared error. Exact and approximate methods for single and multiple change point detection are also covered.

Uploaded by

Salbani Chakrabortty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

209 views23 pages

Change Point Detection Seminar

Uploaded by

Salbani Chakrabortty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Change Point Detection

I E S 6 0 1 S E MI N AR
S A L BA N I CH A K RA BO RTTY (18I190011)
U N D E R T H E G U I D A NCE O F P RO F. N H E MA CH A N D RA
Contents
 Background
 Sample time series and change points
Classifications of change point detection algorithms and methods of detecting change points
 Maximum Likelihood Estimation Of Change Point Location
Hypothesis test for the Change Point Problem – Single and Multiple Change point detection
 Binary Segmentation Method in Multiple change point detection
CUSUM Procedure
 Change Finder Method
 Performance Measures of CPD Algorithms
 References
Background
Data in reality may not often retain the same statistical properties is The Change point Problem
over time.
Detection of change points is useful in modelling and prediction of time series , medical
condition monitoring, climate change detection, speech and image analysis, and human
activity analysis.
The most commonly investigated changes in behaviour are:

 Change in mean
 Change in Variance
 Change in Regression Model/parameter
Sample time series and change points
Classifications of Change Point
Detection
 Change point detection algorithms are classified as “online” or “offline”.
 The goal of off-line detection is generally to identify all of a sequence’s change points in
batch mode.
 The goal of on-line detection is to detect change as soon as possible after it occurs, ideally
before the next data point arrives.
 Both Parametric and Non-parametric methods are used in change point detection.
Basic Algorithms Used In CPD Problems
The techniques used in CPD include both supervised and unsupervised methods.
 Supervised learning algorithms learn a mapping from input data to a target attribute of
the data, usually a class label.
This presentation will mostly focus on un-supervised methods of CPD.
 Un-supervised learning algorithms are typically used to discover patterns in unlabeled
data.
 Un-supervised methods includes
 Likelihood Ratio Method
 Change Finder Method
 CUSUM Method
Maximum Likelihood Estimation Of The
Change Point Location
Given dataset 𝑥1 , 𝑥2 ,. . . . . . . . 𝑥𝑛 (Normally distributed) the estimated changepoint location 𝑘෠ is
෠
k=arg k max 2≤k≤n−1 Vk

Vk = σnt=1 xt − μො 2 − σkt=1 xt − μ
ෞ1 2 + σnt=k+1 xt − μෞn 2

1 n 1 1
μො = σk=1 xk μ1 = σkt=1 xt , μෞn =
,ෞ σnt=k+1 xt
n k n−k
 From this 𝐤መ value we get statistic 𝐔𝐤መ = 𝐕𝐤መ equivalent to the likelihood ratio test-statistic.
Hypothesis test for the Change Point
Problem
 A natural approach to detecting a single changepoint is to perform an hypothesis test.
The hypotheses where there is a change in mean of the distribution at the point k is defined as
𝑯𝟎 : 𝝁𝟏 = 𝝁𝟐 = ... = 𝝁𝒏 ; 𝑯𝟏 : 𝝁𝟏 = ... = 𝝁𝒌 ≠ 𝝁𝒌+𝟏 = ... = 𝝁𝒏
Here assumption is the data is normally distributed.
 H0 is rejected when Uk > cα
 𝑈𝑘 is the test statistic of k and 𝑐𝛼 is the critical value 𝛼 is a chosen significance level.
Likelihood Ratio Test For Single Change
 We can view detecting a single changepoint as a hypothesis test
𝑯𝟎 : No changepoint, m = 0
𝑯𝟏 : A single changepoint, m = 1
 One approach is to find τ, the position of change which maximises the log likelihood
෢𝟏 ) + 𝐥𝐨𝐠 𝐩(𝐲𝛕+𝟏:𝐧 |𝛉
L(τ) = 𝐥𝐨𝐠 𝐩(𝐲𝟏:𝛕 |𝛉 ෢𝟐 )
Then, calculate the test statistic
෡ ]
𝛌 = 2[𝐦𝐚𝐱 𝛕 L(τ) - 𝐥𝐨𝐠 𝐩(𝐲𝟏:𝐧 |𝛉)
We then choose a threshold, c, such that we reject the null hypothesis if λ > c.
Multiple Change Point Detection
 In practice the assumption of only one change may be unrealistic.
 The search method for multiple change point aim to minimise,
σ𝒎+𝟏
𝒊=𝟏 [𝑪(𝒚(𝝉𝒊−𝟏+𝟏):𝝉 )] + 𝜷𝒇(𝒎)
𝒊

∁ is a cost function for a segment and 𝛽𝑓(𝑚) is a penalty to guard against over ﬁtting.
∁ is negative log-likelihood and 𝛽𝑓(𝑚) may be 𝑐m.
 An approximate method for minimizing the above is Binary Segmentation.
Binary Segmentation
Input: A time series of the form {𝑦1 , 𝑦2 , . . . . . . . . . , 𝑦𝑛 }
A test statistic Λ(.) dependent on the time series
An estimator of change point position 𝜏(.)
Ƹ
A rejection threshold C
Initialise: Let L = 𝜑, and S = {[1,n]}
Iterate: While S ≠ 𝜑
1. Choose an element of S; denote this element as [s,t].
2. If Λ(𝑦𝑠:𝑡 ) < C, remove [s,t] from S.
3. If Λ(𝑦𝑠:𝑡 ) ≥ C then:
a. remove [s,t] from S;

𝑠:𝑡 + s−1, and add r to L;

b. calculate r = 𝜏ෞ
c. if r ≠ s add [s,r] to S;
d. if r ≠ t−1 add [r + 1,t] to S;
Output: The set of change points recorded L.
Binary Segmentation: An Approximate
Method
 The Binary Segmentation is computationally efficient.
It results in an 𝚶(𝐧 𝐥𝐨𝐠 𝐧) calculation.
 However computational efficiency comes at the cost of exactness.
 The location of a changepoint is conditional on the locations of previous changepoints.
The method does not search the entire solution space and is an approximation.
Exact Methods Of Detecting Change
Points
 Exact methods detects all changepoints simultaneously using a goodness of fit measure, may be
by minimising
- σ𝒎 ෡
𝒊=𝟎 𝒍𝒐𝒈 𝒑( 𝒚𝝉𝒊 :𝝉𝒊+𝟏 |𝜽)

 Can be ineﬃcient, Ο(𝑄𝑛2 ), but eﬃciency can be improved through pruning such that Ο 𝑛 .
 Exact methods include
 Segment Neighbourhood Search
 Pruned Exact Linear Time Method
CUSUM Procedure
 It is method for detecting change in distribution of sequentially observed data.
 At the 𝑘𝑡ℎ stage, the likelihood ratio test statistic is
𝐟𝟏 𝐗𝐢
𝐓𝐤 =max 𝐦𝐚𝐱 𝟏≤𝐣≤𝐤 σ𝐤𝐢=𝐣 𝐙𝐢 , 𝟎 where 𝐙𝐢 =𝐥𝐨𝐠
𝐟𝟎 𝐗𝐢

which is calculated by the recursive formula

+
𝐓𝐤 = 𝐓𝐤−𝟏 + 𝐙𝐤 , 𝐓𝟎 = 0
The Page-CUSUM procedure stops at 𝐍𝐩 =min 𝐤: 𝐓𝐤 ≥ 𝛄 where 𝛄 is prescribed boundary.
CUSUM Method In Statistical Quality
Control
 The earliest techniques in this area are the Shewhart control charts.
These charts use only the current sample
Hence, they fail to use accumulated evidence of change.
 In an attempt to remedy this CUSUM chart can be used.
 The CUSUM-chart typically signals an out-of-control process by an upward or downward drift
of the cumulative sum until it crosses the boundary.
Change Finder Method
 Change Finder is mainly used in time series analysis.
 It reduces the problem of change point detection into time series-based outlier detection.
 It fits an Auto Regression (AR) model onto the data and updates it’s parameter estimates
incrementally.
 we can model the time series {𝑥𝑡 } using an AR mode of the 𝑘𝑡ℎ order by
𝒙𝒕 =𝝎𝒙𝒕−𝟏
𝒕−𝒌 + 𝝐
𝑥𝑡−𝑘 = (𝑥𝑡−1 , 𝑥𝑡−2 , . . . . . . . . ., 𝑥𝑡−𝑘 ) are previous observations, 𝜔 = (𝜔1 ,. . . . . ., 𝜔𝑘 ) are
constants, 𝜖 is normal random variable.
Change Finder Method
 Updating model parameters, the probability density function 𝑝𝑡 at time t is calculated.
An auxiliary time-series {𝑦𝑡 } is generated by giving a score to each data point.
Score (𝑦𝑡 ) = 𝑑(𝑝𝑡−1 , 𝑝𝑡 )
𝑑 is any distance function.
In order to detect change points we need to know abrupt change in difference.
The change-point score is defined using score function.
 A higher score indicates a higher possibility of a change point.
Performance Evaluation Of Change
Point Algorithms
 It is very important to choose the appropriate algorithm for the change point detection.
Some of the useful performance metrics that we can employ to evaluate CPD algorithms are
 Accuracy
 Sensitivity

When difference in time between the detected CP and the actual CP represents the measure of
performance, then the measures used are
 MAE
 MSE
 MSD
 RMSE
Measures Of Comparison Between
Various Change Point Algorithms
 Accuracy = 𝑇𝑃+𝑇𝑁+𝐹𝑁+𝐹𝑃
𝑇𝑃+𝑇𝑁

 Sensitivity=𝑇𝑃+𝐹𝑁
𝑇𝑃

𝑇𝑃
 Precision=
𝑇𝑃+𝐹𝑃

TP= True Positive, TN=True Negative, FN=False Negative, FP=False Positive

Classified as Classified as Non-
Change Point change point
True Change Point TP FN
True Non-change FP TN
Point
Other Methods Of Evaluation
 Mean absolute error (MAE) = σ#𝐶𝑃
𝑖=1 |𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑 𝐶𝑃 −𝐴𝑐𝑡𝑢𝑎𝑙 𝐶𝑃 |
#𝐶𝑃

 Mean squared error (MSE) = σ#𝐶𝑃

𝑖=1 (𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑 𝐶𝑃 −𝐴𝑐𝑡𝑢𝑎𝑙 𝐶𝑃 )2
#𝐶𝑃
Mean signed difference (MSD) = σ#𝐶𝑃
𝑖=1 (𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑 𝐶𝑃 −𝐴𝑐𝑡𝑢𝑎𝑙 𝐶𝑃 )
#𝐶𝑃

 Root mean squared error(RMSE) = σ#𝐶𝑃 2

𝑖=1 (𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑 𝐶𝑃 −𝐴𝑐𝑡𝑢𝑎𝑙 𝐶𝑃 )
#𝐶𝑃
Conclusion
 Change point detection is a very important statistical technique now a days.
 It has vast application in statistical quality control, time series analysis and many more fields.
 In this presentation, I presented various change point detection methods, analysed their
advantages and disadvantages.
 Finding the change points as soon as possible is crucial, the detection delay for many existing
approaches is a problem.
 Evaluating the significance of the detected change point is another important open issue for
unsupervised methods.
 Although CPD algorithms have progressed significantly in the last decade, there are still many
open challenges.
References
 Samaneh Aminikhanghahi and Diane J Cook. “A survey of methods for time series change
point detection”. In: Knowledge and information systems 51.2 (2017), pp. 339–367
 PK Bhattacharya. “Some aspects of change-point analysis”. In: Lecture Notes-Monograph
Series (1994), pp. 28–56
 Damien Garreau. “Change-point detection and kernel methods”. PhD thesis. PSL Research
University, 2017
Rebecca Killick et al. “Efficient detection of multiple changepoints within an oceano-graphic
time series”. In: Proceedings of the 58th world science congress of ISI. 2011.
 Jie Chen and Arjun K Gupta. “On change point detection and estimation”. In: Communications
in statistics-simulation and computation30.3 (2001), pp. 665–697.
Thank You!

MSM Specification: Discrete Time
No ratings yet
MSM Specification: Discrete Time
5 pages
FX Smile Interpolation Methods
No ratings yet
FX Smile Interpolation Methods
5 pages
Bayesian Structural Time Series Models
No ratings yet
Bayesian Structural Time Series Models
100 pages
Statistical Analysis of Financial Data in S-PLUS
No ratings yet
Statistical Analysis of Financial Data in S-PLUS
2 pages
HiddenMarkovModels RobertFreyStonyBrook PDF
No ratings yet
HiddenMarkovModels RobertFreyStonyBrook PDF
34 pages
Introduction Mathematical Portfolio Theo
No ratings yet
Introduction Mathematical Portfolio Theo
159 pages
Zivot+Yollin R Forecasting
100% (1)
Zivot+Yollin R Forecasting
90 pages
The Econometric Modelling of Financial Time Series: Terence C. Mills
100% (1)
The Econometric Modelling of Financial Time Series: Terence C. Mills
11 pages
A New Approach To Markov-Switching Garch Models
100% (1)
A New Approach To Markov-Switching Garch Models
38 pages
Stochastic Calculus for Finance
No ratings yet
Stochastic Calculus for Finance
211 pages
SABR Stochastic Volatility
No ratings yet
SABR Stochastic Volatility
91 pages
American Option Pricing in A Tick-Calibration in A Click
No ratings yet
American Option Pricing in A Tick-Calibration in A Click
38 pages
BNP Paribas Dupire Arbitrage Pricing With Stochastic Volatility
100% (1)
BNP Paribas Dupire Arbitrage Pricing With Stochastic Volatility
18 pages
The Advantages of Least Squares Monte Carlo
0% (1)
The Advantages of Least Squares Monte Carlo
9 pages
Markovian Projection Method
No ratings yet
Markovian Projection Method
22 pages
Preview-9781000176766 A39526004
No ratings yet
Preview-9781000176766 A39526004
35 pages
Statistical Arbitrage Strategy Analysis
No ratings yet
Statistical Arbitrage Strategy Analysis
31 pages
Deep Learning in Hilbert Spaces - New Frontiers in Algorithmic Trading
No ratings yet
Deep Learning in Hilbert Spaces - New Frontiers in Algorithmic Trading
351 pages
Making Fat Tails Fatter
100% (1)
Making Fat Tails Fatter
7 pages
Modelling Financial Time Series by Stephen J. Taylor
No ratings yet
Modelling Financial Time Series by Stephen J. Taylor
297 pages
Analytical Pricing of Basket Default Swaps in A Dynamic Hull & White Framework
No ratings yet
Analytical Pricing of Basket Default Swaps in A Dynamic Hull & White Framework
18 pages
Deep Learning for Economic Forecasting
No ratings yet
Deep Learning for Economic Forecasting
40 pages
Master Thesis Excl. Appendix
No ratings yet
Master Thesis Excl. Appendix
118 pages
Bovier & Den Hollander - Metastability
100% (1)
Bovier & Den Hollander - Metastability
578 pages
Local Volatility Under Rough Volatility
No ratings yet
Local Volatility Under Rough Volatility
28 pages
Statistical Modelling of Financial Time Series - An Introduction
100% (1)
Statistical Modelling of Financial Time Series - An Introduction
41 pages
Better Approximations To Cumulative Normal Functions: Graeme West
No ratings yet
Better Approximations To Cumulative Normal Functions: Graeme West
7 pages
Diffusion Processes and Stochastic Calculus
100% (1)
Diffusion Processes and Stochastic Calculus
290 pages
Advanced Statistical Modelling Notes
No ratings yet
Advanced Statistical Modelling Notes
233 pages
Implied Volatility Surface Construction
No ratings yet
Implied Volatility Surface Construction
40 pages
Almgren, Li - Closed-Form Solutions For Option Hedging With Market Impact
No ratings yet
Almgren, Li - Closed-Form Solutions For Option Hedging With Market Impact
30 pages
Neural Network Calibration for Volatility Models
No ratings yet
Neural Network Calibration for Volatility Models
32 pages
Arbitrage-Free Local-Stochastic Volatility Model
No ratings yet
Arbitrage-Free Local-Stochastic Volatility Model
21 pages
Heath Jarrow Morton A Interest Rate Model For CVA Calculations
No ratings yet
Heath Jarrow Morton A Interest Rate Model For CVA Calculations
9 pages
Understanding Asset Distribution Risks
No ratings yet
Understanding Asset Distribution Risks
43 pages
Econ 138: Financial and Behavioral Economics
100% (1)
Econ 138: Financial and Behavioral Economics
21 pages
Quant Investing Comparing and Contrasting Part 1 of 3
No ratings yet
Quant Investing Comparing and Contrasting Part 1 of 3
11 pages
CQF
No ratings yet
CQF
29 pages
Regime-Switching with HMMs
No ratings yet
Regime-Switching with HMMs
15 pages
Credit Derivatives Lecture Series
No ratings yet
Credit Derivatives Lecture Series
21 pages
Dokumen - Pub Time Series Econometrics J 6726102
100% (2)
Dokumen - Pub Time Series Econometrics J 6726102
219 pages
Ito Calculus
No ratings yet
Ito Calculus
23 pages
Pricing Rainbow Options Guide
No ratings yet
Pricing Rainbow Options Guide
7 pages
Let's Be Rational: J Ac06 Vog07
No ratings yet
Let's Be Rational: J Ac06 Vog07
12 pages
Stochastic Volatility Jump Model
No ratings yet
Stochastic Volatility Jump Model
7 pages
Rough Heston
No ratings yet
Rough Heston
11 pages
Book An Introduction To The Math of Financial Derivatives Neftci S.N. SOLUTION
No ratings yet
Book An Introduction To The Math of Financial Derivatives Neftci S.N. SOLUTION
90 pages
Monte-Carlo Methods For Single-And Multi-Factor Models: 1 Simulating Stochastic Differential Equations
No ratings yet
Monte-Carlo Methods For Single-And Multi-Factor Models: 1 Simulating Stochastic Differential Equations
11 pages
Gappy's Guide to Quantitative Books
No ratings yet
Gappy's Guide to Quantitative Books
13 pages
Stochastic Calculus With Applications by Ovidiu Calin
100% (1)
Stochastic Calculus With Applications by Ovidiu Calin
372 pages
Preliminaries
No ratings yet
Preliminaries
5 pages
Статья на конференцию
No ratings yet
Статья на конференцию
27 pages
Optimal Nonparametric Change Point Detection and Localization
No ratings yet
Optimal Nonparametric Change Point Detection and Localization
44 pages
2017 A Survey of Methods For Time Series Change Point Detection
No ratings yet
2017 A Survey of Methods For Time Series Change Point Detection
35 pages
21 Ejs1809
No ratings yet
21 Ejs1809
48 pages
Selective Review Truong2019
No ratings yet
Selective Review Truong2019
52 pages
Online Change Points Detection For Linear Dynamical Systems With Finite Sample Guarantees
No ratings yet
Online Change Points Detection For Linear Dynamical Systems With Finite Sample Guarantees
11 pages
Change-Point Detection Using Wavelets
No ratings yet
Change-Point Detection Using Wavelets
12 pages
MBA 2 Year
No ratings yet
MBA 2 Year
120 pages
Statistical Hypothesis Testing Guide
No ratings yet
Statistical Hypothesis Testing Guide
4 pages
Chapter 6 - ANOVA and Kruskal-Wallis Test
No ratings yet
Chapter 6 - ANOVA and Kruskal-Wallis Test
33 pages
Psychological Statistics - 2nd Sem
No ratings yet
Psychological Statistics - 2nd Sem
5 pages
Hypothesis Testing in Statistical Analysis
100% (2)
Hypothesis Testing in Statistical Analysis
12 pages
Food Science Stats for Professionals
No ratings yet
Food Science Stats for Professionals
2 pages
Dunnett's Test for Treatment Comparison
No ratings yet
Dunnett's Test for Treatment Comparison
3 pages
Statistics 231 Course Notes
No ratings yet
Statistics 231 Course Notes
204 pages
Journal Announcement Guidelines 300712
No ratings yet
Journal Announcement Guidelines 300712
19 pages
Introduction To Clustering Procedures
No ratings yet
Introduction To Clustering Procedures
42 pages
Hypothesis Testing Q
No ratings yet
Hypothesis Testing Q
2 pages
Wesgardrules & Multirules
No ratings yet
Wesgardrules & Multirules
18 pages
Performance Appraisal
100% (16)
Performance Appraisal
56 pages
Gaodun - CFA2 Quantitative
No ratings yet
Gaodun - CFA2 Quantitative
35 pages
Educational Statistics Course Overview
No ratings yet
Educational Statistics Course Overview
5 pages
Understanding Type I and Type II Errors
No ratings yet
Understanding Type I and Type II Errors
13 pages
Dear MR Gosset
No ratings yet
Dear MR Gosset
8 pages
Applied Final$$$$$
No ratings yet
Applied Final$$$$$
11 pages
QM2 Stat Chap 14 Comparing Two Means
No ratings yet
QM2 Stat Chap 14 Comparing Two Means
2 pages
Audit Evidenc1
No ratings yet
Audit Evidenc1
5 pages
Homework 08: One Way ANOVA Analysis
No ratings yet
Homework 08: One Way ANOVA Analysis
17 pages
Hypothesis
No ratings yet
Hypothesis
37 pages
Action Research Midterm
No ratings yet
Action Research Midterm
197 pages
Quality Control Hypothesis Test
No ratings yet
Quality Control Hypothesis Test
2 pages
Social Work Research Methods Overview
No ratings yet
Social Work Research Methods Overview
68 pages
Asr 6
No ratings yet
Asr 6
53 pages
Sampling Theory
No ratings yet
Sampling Theory
19 pages
Bollerslev Engle Nelson 1994 Arch Model Handbook of Econometric
No ratings yet
Bollerslev Engle Nelson 1994 Arch Model Handbook of Econometric
80 pages
Likelihood-Ratio Test
No ratings yet
Likelihood-Ratio Test
5 pages
M.Ed Course Syllabus Overview
No ratings yet
M.Ed Course Syllabus Overview
16 pages

Change Point Detection Seminar

Uploaded by

Change Point Detection Seminar

Uploaded by

Change Point Detection

𝑠:𝑡 + s−1, and add r to L;

which is calculated by the recursive formula

TP= True Positive, TN=True Negative, FN=False Negative, FP=False Positive

 Mean squared error (MSE) = σ#𝐶𝑃

 Root mean squared error(RMSE) = σ#𝐶𝑃 2

You might also like