Geographic Information Systems (GIS): Data Structure, Data Format,
and Database Design
1. DATA STRUCTURE IN GIS
Definition:
In GIS, a data structure refers to the way geographic data is stored, organized, and manipulated.
This includes how spatial and attribute data are digitally represented for analysis and visualization.
Types of GIS Data Structures:
a) Vector Data Structure:
- Represents geographic features as discrete objects using points, lines, and polygons.
- Examples: Points (trees, schools), Lines (roads, rivers), Polygons (lakes, land parcels).
- Characteristics: High precision, linked attribute tables, ideal for clear boundaries.
b) Raster Data Structure:
- Represents data as a grid of cells (pixels), each with a value.
- Examples: Satellite imagery, elevation models.
- Characteristics: Good for continuous data, easier processing for complex analysis,
storage-intensive.
Comparison:
| Feature | Vector | Raster |
|---------------|------------|------------------|
| Data Type | Discrete | Continuous |
| Precision | High | Lower |
| Storage | Efficient | Storage-intensive|
| Examples | Roads, parcels | Elevation, satellite images |
2. DATA FORMAT IN GIS
Definition:
Data format in GIS refers to the file types or structures used to store and exchange spatial data,
ensuring compatibility and proper interpretation by GIS software.
Common GIS Data Formats:
a) Vector Data Formats:
- Shapefile (.shp): Widely used; consists of .shp, .shx, .dbf files.
- GeoJSON (.geojson): JSON-based, web-friendly.
- KML (.kml): Used with Google Earth.
- GeoPackage (.gpkg): Stores both vector and raster data.
b) Raster Data Formats:
- GeoTIFF (.tif): Raster format with geographic metadata.
- JPEG2000 (.jp2): High-resolution imagery.
- IMG (.img): ERDAS IMAGINE format.
c) Database Formats:
- File Geodatabase (.gdb): Efficient storage for large datasets.
- Personal Geodatabase (.mdb): Based on MS Access.
- PostGIS: Spatial extension of PostgreSQL.
Considerations:
- Type of data
- Software compatibility
- Metadata needs
- Data size and sharing requirements
3. DATABASE DESIGN IN GIS
Definition:
Database design in GIS refers to the structured development of spatial databases for organizing and
managing geographic and attribute data.
Key Components:
a) Data Modeling:
- Conceptual Model: Abstract structure (ER diagram).
- Logical Model: Defines relationships and attributes.
- Physical Model: Physical data storage.
b) Data Schema:
- Structure of tables and fields.
- Field types, primary/foreign keys, relationships.
c) Normalization:
- Reduces redundancy, improves data integrity.
d) Topological Rules:
- Ensures spatial accuracy (e.g., polygons must close).
e) Security and Access Control:
- Role-based permissions for editing and access.
Example: Urban Planning GIS Database
- Feature Classes: Roads, buildings, parks.
- Attributes: Roads (name, width), Buildings (ID, height).
- Relationships: Buildings within parcels, parcels in zones.
- Spatial Indexing: Enhances query performance.