PowerPoint Presentation
PowerPoint Presentation
Dr. Wenyu Wang Dr. Farzana Kabir Dr. Yinglun Li Dr. Zuzhao Ye
Senior Engineer at EV Grid Infrastructure Senior ML Scientist Founder of
Quanta Technology Specialist at California at Alibaba AmpTrans
Energy Commission International Digital
Commercial Group 2
Research Projects and Sponsors
Over $16 Million of R&D Funding in the past 10 years
Federal government, state agencies, private companies
Research Tracks
Physics-informed Machine Learning for Power Systems
Scalable Optimization in Critical Infrastructure Systems
Transportation and Building Electrification
Decarbonization Planning
Energy Efficient Data Center
3
Outline
Volume, Variety, Velocity & Value of Big Data in Power Systems
Applications of Machine Learning in Power Systems
Transmission system, distribution system, end-use customers
Motivation for Physics-informed Methods in Power Systems
Leverage Unique Properties of Data from Power Systems
Low rank and sparsity, high and low entropy
Wireless
Network
Meter Data
Geographical Information System Cell Relay Management
System
Generator Tripping
State Estimation 7
Linear State Estimation
Applications of Machine Learning for
Power Distribution Systems and End Use Customers
Spatio-temporal Forecasting
Electric Load / DERs – Short-Term / Long-Term
Anomaly Detection
Electricity Theft, Unauthorized System Monitoring
Solar Interconnection State Estimation & Visualization
Rank of the matrix: the number of linearly independent rows or columns in the matrix
Low Rank and Sparsity: Voltage Event Detection Using
Optimization with Structured Sparsity Inducing Norms
Key Observations
Voltage related events trigged by system
faults are often regional events
The 𝑋 − 𝐿 during voltage event periods
have row-sparse structure
Rows of residual matrix correspond to
PMUs highly impacted by the event
Main Idea
Decompose the streaming PMU data matrix
𝑋 into
A low-rank matrix 𝐿, a row-sparse event-
pattern matrix 𝑆, and a noise matrix 𝐺
Extract anomaly features from 𝐿 & 𝑆
Use clustering algorithm to identify power X. Kong, B, Foggo, and N. Yu, “Online Voltage Event
system voltage events Detection Using Optimization with Structured Sparsity-
Inducing Norms,“ IEEE Transactions on Power
Systems, vol. 37, no. 5, Sep. 2022. 11
Decompose PMU Data Matrix with Proximal Bilateral
Random Projection (PBRP) to Detect Events
Residual PMU data matrices during voltage events have distinctive sparsity structure
Computationally efficient PBRP algorithm is proposed to decompose PMU data matrices
Online voltage event detection algo. shows better accuracy & scalability on PMU data (Eastern
Interconnection) 12
Low and High Entropy Power System
Measurements
Main Idea
Control the amount of information compression between the input layer and the last hidden
layer of a deep neural network
Balance memorization and generalization
Low entropy High entropy
input feature space input feature space
e.g. Vol. mag. in distribution e.g. PMU data during power
system (4–11 bits) system events ( > 60 bits)
Algorithm
Augment the typical cross-entropy loss function with estimated mutual information between
the input layer and the hidden representation
B. Foggo, N. Yu, J. Shi and Y. Gao, "Information Losses in Neural Classifiers from Sampling," IEEE Transactions on
Neural Networks and Learning Systems, vol. 31, no. 10, pp. 4073-4083, 2020. DOI: 10.1109/TNNLS.2019.2952029. 14
Phase Connectivity Identification
Very few electric utility companies have completely accurate phase connectivity
information in GIS!
Validated using real-world distribution circuits data from SCE and PG&E.
Learned Representations
on Circuit V
B. Foggo and N. Yu, "Improving Supervised Phase Identification Through the Theory of Information
Losses," IEEE Transactions on Smart Grid, vol. 11, no. 3, pp. 2337-2346, 2020. 15
System Event Classification with PMU Data
Formulated as a classification problem
Normal operation condition, line event, generator event, oscillation event
Learned Representation
J. Shi, B. Foggo, and N. Yu, "Power System Event Identification based on Deep Neural Network with Information Loading," 16
IEEE Transactions on Power Systems, vol. 36, no. 6, pp. 5622-5632, Nov. 2021.
Renewable Energy Resource Model
Solar PV System Performance Model: used to understand and predict energy or power
output from PV systems under a wide range of environmental, design, and site.
Physical Solar PV Performance Model
Estimation of Behind-the-Meter Solar Generation
Net Metering: 𝑛𝑒𝑡 𝑙𝑜𝑎𝑑 = 𝑙𝑜𝑎𝑑 − 𝑠𝑜𝑙𝑎𝑟 𝑔𝑒𝑛𝑒𝑟𝑎𝑡𝑖𝑜𝑛
Lemma 3. ∀𝛿 > 0, there exists a training data window length 𝑇 > 0 such that for each 𝑗: ℙ 𝛽𝑗𝑦 ≥ −𝛿 > 1 − 𝛿
Yuanqi Gao, Brandon Foggo, and Nanpeng Yu, "A Physically Inspired Data-driven Model for Electricity Theft
Detection with Smart Meter Data," IEEE Transactions on Industrial Informatics, vol. 15, no. 9, 2019. 21
Testing Results with Real-world Smart Meter Data
12 KV circuit from Southern California Edison, 6
months of smart meter data from 980 customers
and 190 transformers.
The average electric load consumed by the
customer is 1.6 kWh.
The mean of the estimation residual is -0.01 kWh
and its standard deviation is 0.1 kWh.
𝒗 𝑡 }𝑇𝑡=1 , given 𝒙, {
The likelihood of observing { 𝒑 𝑡 }𝑇𝑡=1 and {
𝒒 𝑡 }𝑇𝑡=1 is
𝑇
𝑃𝑟𝑜𝑏 𝑡
𝒗 𝑡=1 𝒑 𝑡 }𝑇𝑡=1 , {
{ 𝒒 𝑡 }𝑇𝑡=1 ; 𝒙)
𝑇
−
Σ𝑁 2 1 𝑇 −1
= 𝑀𝑇 × exp {− 2 [ (𝑡, 𝒙)]𝑇 Σ𝑁
𝒗 𝑡 −𝒗 [ (𝑡, 𝒙)]}
𝒗 𝑡 −𝒗
𝑡=1
(2𝜋) 2
Lemma 1. Let 𝒙∗ be the correct phase connection. If the following two conditions are satisfied, then
as 𝑇 → ∞, 𝒙∗ is a global optimizer of 𝑓 𝒙 .
𝑟𝑒𝑓 𝑡𝑙 , 𝒑
1. 𝒏 𝑡𝑘 is i.i.d. and independent of 𝒗 𝑡𝑙 , for ∀𝑡𝑘 , 𝑡𝑙 ∈ 𝑍 + .
𝑡𝑙 , and 𝒒
𝑟𝑒𝑓 𝑡𝑘 , 𝒑
2. 𝒗 𝑡𝑘 , and 𝒒 𝑟𝑒𝑓 𝑡𝑙 , 𝒑
𝑡𝑘 are independent of 𝒗 𝑡𝑙 , for ∀𝑡𝑘 , 𝑡𝑙 ∈ 𝑍 + , 𝑡𝑘 ≠ 𝑡𝑙 .
𝑡𝑙 , and 𝒒
Directly minimizing 𝑓(𝒙) is difficult.
Key Idea: phase identification problem → maximum marginal likelihood estimation (MMLE)
problem.
W. Wang and N. Yu, "Maximum Marginal Likelihood Estimation of Phase Connections in Power Distribution 23
Systems," IEEE Transactions on Power Systems, vol. 35, no. 5, pp.3906-3917, Sep. 2020.
Numerical Results with Smart Meter Data
Number of Loads per Phase in the IEEE Test Circuits
Feeder A B C AB BC CA ABC Total
37-bus 5 5 6 3 2 2 2 25
123-bus 18 17 17 9 9 10 5 85
342-bus 30 38 31 35 31 33 10 208
Smart Meter Data Phase Identification Accuracy of Different Methods with 90 days of Meter Data
Graph Model for Power Systems: Nodes → Vertices, Power Lines → Branches
Graph Model: Physics-informed Graphical Learning for
Distribution Line Parameter Estimation
Key Idea: Embed physical
equations of power flow in the
graphical learning model
Inspired by graphical neural
network (GNN)
Difference between physics-
informed GL and GNN
Leverage 3Φ power flow-based
physical transition fcn. to replace
the deep neural networks in GNN.
Key Step: Derive the gradient of
voltage magnitude loss function
w.r.t. line segment’s resistance
and reactance parameters with
an iterative method.
Estimate distribution line parameters with SGD considering prior estimates of line parameters
and physical constraints.
Improve computation efficiency with grid partition scheme and fast forward/backward function.
Wenyu Wang and Nanpeng Yu, "Estimate Three-phase Distribution Line Parameters with Physics-informed 26
Graphical Learning Method," IEEE Transactions on Power Systems, vol. 37, no. 5, pp. 3577-3591, Sep. 2022,
Fast GL and Numerical Study Results
MADR Improvement of Parameter Estimation Methods
in the Test Feeder (Avg. / Choose Optimal Value)
Outperforms commercial
MIP solver (e.g. Gurobi
and CPLEX) on large-
scale UCR problems
(6515-bus)
Better scalability,
optimality and
interpretability
29
J. Qin & N. Yu, "Solve Large-scale Unit Commitment Problems by Physics-informed Graph Learning" arXiv preprint arXiv:2311.15216
System Control Model
X. Kong, K. Yamashita, B. Foggo, and N. Yu, "Dynamic Parameter Estimation with Physics-based Neural Ordinary 33
Differential Equations" IEEE PES GM, 2022.
Numerical Study Results Generate noisy PMU measurements from dynamic
WECC 3-machine 9-bus system simulation data.
A single transmission line is disconnected at 5s. The
simulation time is 10s.
Two disturbance scenarios (between nodes 5 and 7,
between nodes 8 and 9)
When the PMU data length is 3s, our proposed algorithm
achieves the lowest relative estimation error.
The physics-based neural ODE algorithm outperforms the
baseline algorithm in terms of estimation accuracy for
most of the unknown parameters
When the data length is 3s, the running time of our model
is just 4.82 seconds, which is nearly 8 times faster than
the baseline model.
Mini-batch scheme of neural network training shortens
34
model running time.
Learning Dynamic System with Hamiltonian Neural Networks
Hamiltonian mechanics: can predict the motion of a energy-conserved system.
𝑇 𝑇
State variables: generalized position 𝐪 = 𝑞1 , 𝑞2 , ⋯ , 𝑞𝑛 momentum 𝐩 = 𝑝1 , 𝑝2 , ⋯ , 𝑝𝑛
𝐪 and 𝐩 correspond to voltage angle 𝛿 and angular speed 𝜔 in power system dynamic model
𝑑𝒒 𝑑𝒑
𝐻(𝒒, 𝒑) ,
𝑑𝑡 𝑑𝑡
Legendre transform
Learn Hamiltonian Learn ODEs
The Hamiltonian can be regarded
as the energy function
[1] Greydanus S, Dzamba M, Yosinski J. Hamiltonian neural networks. Advances in neural information processing systems, 2019.
[2] Cranmer M, Greydanus S, Hoyer S, et al. Lagrangian neural networks. arXiv preprint arXiv:2003.04630, 2020.
35
Learning Power System Dynamics: Nearly Hamiltonian NN
Can we formulate the SMIB system as a Hamiltonian function?
𝑚1 𝛿ሷ + 𝑑1 𝛿ሶ + 𝐵12 𝑉1 𝑉2 sin(𝛿) − 𝑃1 = 0
Unfortunately, the answer is No. If the damping coefficient 𝑑1 is positive, then the SMIB system
is a dissipative system instead of a conservative system.
Hamiltonian System Nearly Hamiltonian System
Hamiltonian differential equations Hamiltonian differential equations
36
Summary
Key: synergistic combine ML algorithms with physical domain knowledge
Steady state models: renewable resource model, topology, power flow model
Repository of synthetic PMU data generated from U.S. power grid data
pmuBAGE: https://github.com/NanpengYu/pmuBAGE
Recent Publications
Theoretical Machine Learning
B. Foggo, N. Yu, J. Shi and Y. Gao, "Information Losses in Neural Classifiers from Sampling," IEEE Transactions on Neural Networks
and Learning Systems, vol. 31, no. 10, pp. 4073-4083, 2020.
Brandon Foggo and Nanpeng Yu, "Analyzing Data Selection Techniques with Tools from the Theory of Information Losses," IEEE
International Conference on Big Data, pp. 1-10, 2021.
Brandon Foggo and Nanpeng Yu, "On the Maximum Mutual Information Capacity of Neural Architectures," ICML Workshop on Neural
Compression: From Information Theory to Applications, 2023.
Thank You
Questions?
Collaborating Companies: SCE, PG&E, National Grid, Fortis BC, Con Edison, NYPA,
ComEd, Exelon, EPRI, RPU, ISO-NE and CAISO.
Funding Support: DOE, NSF, CEC, NYSERDA, EPRI, DEED, UCOP, NREL & Utilities.