WASARD Igrass 2019
WASARD Igrass 2019
1
Analytical Mechanics Associates, Hampton, VA
2
NASA Langley Research Center, Hampton, VA
ABSTRACT labeling all pixels for which this band’s value is below a
certain threshold as containing water. The thresholding
The detection of inland water bodies from Synthetic method works since water tends to return a much lower
Aperture Radar (SAR) data provides a great advantage over backscatter value to the satellite than land [1]. However, this
water detection with optical data, since SAR imaging is not method can be flawed since estimating the proper threshold
impeded by cloud cover. Traditional methods of detecting is often imprecise, complicated, and labor intensive for the
water from SAR data involves using thresholding methods end user. Thresholding also tends to use data from only one
that can be labor intensive and imprecise. This paper SAR polarization, when a combination of polarizations can
describes Water Across Synthetic Aperture Radar Data provide insight into whether water is present. [2]
(WASARD): a method of water detection from SAR data In order to alleviate these problems, this paper presents
which automates and simplifies the thresholding process an application for the Open Data Cube to detect water from
using machine learning on training data created from SAR data using support vector machine (SVM)
Geoscience Australia’s WOFS algorithm. Of the machine classification.
learning models tested, the Linear Support Vector Machine
was determined to be optimal, with the option of training 2. PLATFORM
using solely the VH polarization or a combination of the VH
and VV polarizations. WASARD was able to identify water WASARD is an application for the Open Data Cube, a
in the target area with a correlation of 97% with WOFS. mechanism which provides a simple yet efficient means of
ingesting, storing, and retrieving remote sensing data. Data
Index Terms— Sentinel-1, Open Data Cube, can be ingested and made analysis ready according to
Earth Observations, Machine Learning, Water Detection whatever specifications the researcher chooses, and easily
resampled to artificially alter a scene’s resolution. Currently
1. INTRODUCTION WASARD supports water detection on scenes from ESA’s
Sentinel-1 and JAXA’s ALOS. When testing WASARD,
Water classification is an important function of Earth Sentinel-1 was most commonly used due to its relatively
imaging satellites, as accurate remote classification of land high spatial resolution and its rapid 6 day revisit cycle [5].
and water can assist in land use analysis, flood prediction, With minor alterations to the application's code, however, it
climate change research, as well as a variety of agricultural could support data from other satellites.
applications [2]. The ability to identify bodies of water
remotely via satellite is immensely cheaper than contracting 3. METHODOLOGY
surveys of the areas in question, meaning that an application
that can accurately use satellite data towards this function Using supervised classification, WASARD compares SAR
can make valuable information available to nations which data to a dataset pre-classified by WOFS in order to train an
would not be able to afford it otherwise. SVM classifier. This classifier is then used to detect water in
Highly reliable applications for the remote detection of other SAR scenes outside the training set. Accuracy was
water currently exist for use with optical satellite data such measured according to the following metrics:
as that provided by LANDSAT. One such application, Precision: a measure of what percentage of the
Geoscience Australia’s Water Observations from Space points WASARD labels as water are truly water
(WOFS) has already been ported for use with the Open Data Recall: a measure of what percentage of the total
Cube [6]. However, water detection using optical data from water cover WASARD was able to identify.
Landsat is constrained by its relatively long revisit cycle of F1 Score: a harmonic average of the precision and
16 days [5], and water detection using any optical data is recall scores
constrained in that it lacks the ability to make accurate Both precision and recall are calculated at the end of the
classifications through cloud cover [2]. The alternative training phase, when the trained classifier is compared to a
solution which solves these problems is water detection testing dataset. Because the WOFS algorithm’s
using SAR data, which images the Earth using cloud- classifications are used as the truth values when training a
penetrating microwaves. WASARD classifier, when precision and recall are
Because of its advantages over optical data, much mentioned in this paper, they are always with respect to the
research has been done into water detection using SAR data. values produced by WOFS on a similar scene of Landsat
Traditionally, this has been done using the thresholding data, which themselves have a classification accuracy of
method, which involves picking a polarization band and 97% [6].
438
testing WASARD, it was observed that a classifier trained
on one lake in Vietnam detected water accurately across the
entire nation.
5.2 Applications
Figure 3: Water Detection by WASARD (Raw SAR image Figure 5 shows a composite of WASARD’s classifications
on left, WASARD’s predictions denoted in white on the in Buon Tua Sarh in Dak Nong, Vietnam. This composite
right hand side). was built from 17 scenes spanning 22 months. It is clear that
there is a large amount of variation in the presence of water
The comparison in Figure 3 demonstrates WASARD’s in this region, as there are large portions colored blue where
water detection on a reservoir in Southern Vietnam. This water was found 80-100% of the time, as well as significant
classifier has a precision score of .963, and recall of .983. portions colored yellow or orange where water was found
The Support Vector Classifier is represented by the 20-60% of the time. Closer research reveals that this
following equation: variation is due to the body of water being a reservoir whose
water level is controlled by a dam at its northernmost point.
SVC = (VH Coefficient) (VV Band Value) + (VH Coefficient) While the example of identifying a body of water with a dam
(VH Band Value) + Bias Constant, where SVC < 0 is water is somewhat trivial, it is a valid demonstration of how time
and SVC > 0 is non-water. series data might be used to analyze flooding patterns in a
For Figure 3: SVC = -45.899 * VH - 1.271 * VV + 1.007 region. WASARD makes running time series analysis on
SAR scenes easy with the included function
wasard_time_plot().
439
from 100 classifiers trained back to back on the same area.
Standard deviations of .008 for precision and .004 for recall
were found. Therefore, trial and observation of multiple
classifiers is recommended to find the optimal one, which
can then be saved and reused. Included in WASARD is a
function get_best_classifier() which automates this process,
training a given number of classifiers and returning them to
the user in a list sortable by precision, accuracy, or f1 score.
6. CONCLUSION
440