Topic 2
Topic 2
Prepared by HAK
Topic outline
2.1 Spatial data sources for GIS 2.1.1 Describe the readily available data o Digitized data o Satellite Images o Air Photos o GPS Data o Old maps and plans 2.1.2 Describe the new data from fieldwork 2.2 Attribute data sources 2.3 Methods of GIS data input
2.2.1 Describe the readily available data o Digital data o Hardcopy data
2.2.2 Describe the new data from fieldwork
Topic outline
2.4 Elements of data quality 2.4.1 Prescribe the following elements
o o o
2.5 Error sources in a GIS 2.5.1 State the following error : o Error during data acquisition o Error during data entry o Error during data storage o Error during data Processing
o
o
Clarification elements
Current elements
Completeness in coverage
Completeness in classification Completeness in authentication
DIGITIZED DATA
Often, digital data for a GIS project is not available, so it must be created from other existing sources like paper map
Digitizing is the process where features on a map or image are converted into digital format for use by a GIS Process by which coordinates from map, image or other sources are converted digital format Digitizing converts the features on the map into three basic data types: i. Points zero dimensional objects ii. Lines one dimensional objects iii. Polygons two dimensional objects
..cont.
Points, lines and areas (polygon) on maps or images represent real-world entities or phenomena There are three major steps to digitize spatial data: i. Prepare map or other image for digitizing. ii. Enter (digitize) features from the map. iii. Edit the digitized map for accuracy and to ensure quality.
..cont.
Real World
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 G B G G B B BG G B G G B G BK B G B B B B
600 500 400 Y-AXIS 300 200 River 100 100 200 300 400 500 X-AXIS 600 Trees
Trees House
Raster Representation
Vector Representation
..cont.
..cont.
..cont.
1.
Missing line
..cont.
2.
Undershoot
..cont.
3.
Overshoot
..cont.
4.
SATELLITE IMAGES
Vector Data
SATELLITE WILL PRODUCE A RASTER IMAGE Raster images come in the form of individual pixels Each spatial location or resolution element has an associated pixel value, which indicates the coordinates, elevation, and any relevant attribute data, such as a color or ID number. VECTOR DATA Vector data comes in the form of points and lines that are geometrically and mathematically associated. The points are stored using their coordinates, and the lines are stored as a series of point pairs. Vector data structures produce smaller file sizes than raster images because only point coordinates are stored
A photograph could also be represented and displayed in a digital format by subdividing the image into small equal-sized and shaped areas, called picture elements or pixels.
Preprocessing
IMPROVE CONTRATST AND INFORMATION MAKE THEMES THEME CATEGORIES TRANSFER INTO GIS
Enhancement
Clean the data by removing electronic noise and correcting mistakes, such as missing scan lines. Often Radiometric and Geometric correction Data quality is improved and the image is then distributed to users.
ENHANCEMENT To enhance the data so that better information can be obtained This often includes improving visual contrast, such as changing subtle differences in gray tones into shades that are more distinctive, as an aid to interpretation. MAKE THEMES ( LAYER ) Enhanced data into selected themes For example, landscape images contain a variety of land covers ( such as various vegetation types ) that can be separated to provide
..cont.
CLASSIFICATION A vegetation theme, for example might include a sequence of classification, starting at overall land cover ( e.G forest, grassland, agricultural land and barren area. Classifying data according to a projects specefic purpose is an important data reduction step in keeping the amount of data manageable and focused. INTEGRATION INTO GIS Transferring imagery into GIS use as a background or complete integration as a full data set. That image used as background for on-screen digitizing are merely digital pictures, requires proper georeferencing, projection and database development.
Peta :
Satu lakaran yang dibuat atas kertas bagi mengambarkan sesuatu kawasan berdasarkan bentuk simbol piawai.
Pelan Satu lukisan yang diplot mengandungi butiran asal dan disediakan mengikut peraturan tertentu.Contoh Pelan kadaster.
Pelan
Kawasan kecil ( lebih khusus ) Lebih detail eg, nama pemilik, luas, jarak dll. Contohnya : Pelan Kadaster, Pelan enginering
DIGITIZING ( PENDIGITAN )
Data Spatial
Attribute
FEATURES TYPES There are three types of Feature which can be mapped: Points, Lines and Areas. A Point Feature is a single GPS coordinate position which is identified with a specific Object A Line Feature is a collection of GPS positions which are identified with the same Object and linked together to form a line An Area Feature is very similar to a Line Feature, except that the ends of the line are tied to each other to form a closed area
AERIAL PHOTO
Before digitizing, we need to georeference aerial image ( photo ). Georeferencing is the process of defining how raster data is situated in map coordinates Use intersections of the road, could be easily identified on the photo.
DIGITIZING
Process whereby an analog (or paper) map is converted into a digital format There are three primary methods for digitizing spatial information:
Two types of digitizing :
Tablet Digitizing On-Screen Digitizing
Tablet digitizing requires a person to enter coordinate information through the use of a digitizing tablet and digitizing puck.
A digitizing tablet is a hardened surface with a fine electrical wire grid under the surface. Digitizing tablets are either hardened, more stationary tables or rollup boards designed for portability
A digitizing puck is an electrical device with cross hairs and multiple buttons to
perform data entry operations An operator then enters the information using the puck.
Tablet digitizing
x,y
tedious, and operator fatigue Due to environmental conditions (such as humidity), the source materials may actually change (due to shrinking and expansion). If you notice the person below, the actual map is usually a foot or two away from the operator. So, small subtitles in the geographic objects may be missed when you are so far away
ON-SCREEN DIGITIZING
On-screen digitizing is an interactive process in which a map is created using previously digitized of scanned information. Commonly called "heads-up" digitizing because the attention of the user is focused up on the screen, and not on a digitizing tablet
Advantages : Ease of zooming for detailed work Full array of editing ( delete, copy and so-on ) and text features ( font and text editing ) Quick and inexpensive method for producing GIS
SCANNING
Pengimbasan ( Scanning )
device that converts maps into digital form Prinsip: Mengimbas garisan-garisan secara berturutan Merekodkan jumlah cahaya yang dipantulkan
Perbezaan cahaya diskalakan kepada bi-level hitam atau putih (1 bit per piksel) atau multiple-gray levels (8,16 atau 32 bit)
Resolusi spatial adalah dari sekecil 100 dpi hinggalah kepada 1800 dpi dan lebih Alat pengimbas GIS dalam lingkungan 400 1000 dpi
Dots per inch (DPI) is a measure of spatial printing or video dot density
PROCESS OF SCANNING
PREPARE MAP
PRE-PROCESSING
Prepare Map Although scanners will copy any data, map scans produce best results using bi-tonal data ( balck-andwhite lines and features, not gray or color ) Select the area to be scanned, reviewing procedures, and selecting the desired resolution
Pre-processing Clean the map of stains, specks and other useless marks that make data noise Clean map reduce the editing process Finding real-world coordinate control points for georeferencing Scan the Map Each scanner has particular controls and procedures, but it is usually a standard process of either positioning the map in the scanner or feeding it into the system. Scanning produce a raster image
SCANNING
PROSES PENGIMBASAN
Penyediaan Peta -Walaupun alat pengimbas boleh menyalin semua data, bagi menghasilkan keputusan imbasan peta yang baik menggunakan bitonal data ( black and white lines and features, not gray or color ) -Pilih kawasan yang hendak diimbas, penelitian ( review ) dan pilih resolusi yang sesuai. Resolusi Penyusunan struktur grid ( baris dan lajur ) atau saiz sel dunia sebenar ( contoh 30m diatas permukaan bumi. )
Penyediaan Peta
Pra-pemprosesan
-Membesihkan peta daripada tompok kotor, bintik2 atau tanda-tanda lain yang x berguna data noise .
Pra-pemprosesan
- Pembersihan peta mengurangkan proses pengeditan. - Dapatkan koordinat titik kawalan untuk georefering
Pengimbasan
Pengimbasan
-Langkah terakhir buat pengimbasan - Setiap alat pengimbas mepunyai kaedah yang tertentu, tetapi biasanya proses yang biasa ialah meletakkan peta di atas alat pengimbas dan disalurkan kepada sistem
Keyboard Entry
Papan Kekunci :
Digunakan untuk memasukan data atribut sebagai tambahan kepada proses pendigitan dan pengimbasan.
Cara ini amat berkesan untuk perlaksanaan yang memerlukan ketepatan tinggi seperti penggunaan kadaster, bearing dan jarak sempadan lot-lot tanah dimasukkan dan koordinat batu sempadan dihitung. Kemasukan data (nilai ukuran atau koordinat) dengan papan kekunci akan menghasilkan pengkalan data GIS yang berketepatan tinggi.
SPATIAL ACCURACY - deviance in the geographic location of an object from its true ground position.
TWO COMPONENTS TO SPATIAL ACCURACY
relative accuracy
The positioning of map features relative to one another.
absolute accuracy
The accuracy of data elements with respect to a coordinate scheme Eg : UTM
Kedudukan relatif
Kedudukan Mutlak
ATTRIBUTE ACCURACY Attribute accuracy is the closeness of attribute values to their true value Location does not change with time, but attributes often do Attributes may be discrete or continuous. Discrete attributes may have a finite number of values I.e., land use, vegetation type, etc. Continuous attributes may have an infinite number of values I.e., elevation, property value, isotherms, isohyets, etc.
Attribute Accuracy must be analyzed in different ways depending on the nature of the data
In testing attribute accuracy: An error of omission occurs when a points class on the ground incorrectly recorded in the database An error of commission occurs when the class recorded in the database does not exist on the ground
CURRENT ELEMENTS
A critical factor of any database is Time. Demographic information is time-sensitive, changing significantly even over the course of a year
dates
Date of acquisition is also a factor for geographic information that changes rapidly over time Forestry maps are generally updated on a 5-10 year basis Agricultural maps are updated as rapidly as weekly during growing seasons
LOGICAL CONSISTENCY
This component is concerned with determining the faithfulness of the data structure for a data set. This typically involves spatial data inconsistencies such as incorrect line
slivers
overshoots dangles
COMPLETENESS
What is completeness ? Completeness concerns the degree to which the data exhausts the universe of
possible items.
Are all possible objects included in the database? Is the database affected by rules of selection, generalization and scale? Completeness consists of : Completeness in coverage Completeness in classification Completeness in authentication
COMPLETENESS IN COVERAGE
What is Coverage? A coverage is a georelational data model that stores vector datait contains both the spatial (location) and attribute (descriptive) data for geographic features. Completeness of coverage is the proportion of data available for area of interest Progressively updated data sets, a patchwork of more recent data, may work for current status of a resource Older, more complete, data sets work best for comparative analysis where consistency is important
COMPLETENESS OF CLASSIFICATION
Completeness of Classification assesses how well the chosen classification is able to represent the data All data should be encoded at the selected level of detail Differences may occur owing to the individual or organizations that produced the maps. Different government agencies mapping adjacent areas may have boundary-matching problems, even though the maps are accurate in terms of position and classification
COMPLETENESS OF VERIFICATION
Completeness of Verification examines the amount and distribution of field
measurement or other independent sources of information used to develop the data In Geology, standard field data techniques use solid lines for mapping visible and verifiable rock types boundaries, while inferred boundaries are indicated with dashed lines (Air Geology) A data quality check for the geologists field data
Geometrical (positional) and semantic (classification) errors in the compilation of the source maps Interpretation and classification errors in remotely sensed data causing attribute error. Inaccuracy in source data i.e. from field sampling.
Digitizing errors due to operator mistakes and limited precision of digitizer. Errors during digitizing may be due to limitations of maps as given earlier or due to operational error. These operational errors may include psychological, physiological and digitizing method error.
ERROR DURING DATA STORAGE Errors due to the limited precision with which coordinates and other numerical data are stored. Errors arising from rasterization / vectorization.
ERROR DURING DATA PROCESSING a. Input: Digitizing: human error, the width of a line, spikes, knots, also entering attribute data. Dangling nodes (connected to only one arc): permissible in arc themes (river headwaters etc.). Pseudo-nodes (connected to one or two arcs) - permissible in island arcs, and where attributes change, e.g. road becomes paved from dirt or vice versa. Projection input error.
b. Manipulation Interpolation of point data into lines and surfaces. Overlay of layers, digitized separately, e.g. soils and vegetation.
Density of observations.
Inappropriate or inadequate inputs for models
c. Output Scale changes - detail and scale bars. Color palettes: intended colors don't match from screen to Printer
REFERENCE