GIS Data Management_Presentation
GIS Data Management_Presentation
DEFINITION: Data management is the practice of collecting, processing, storing, securing and disseminating an
organization’s data, where it is then consumed for meeting business requirements and outcomes.
3. Data collection
Data sourcing, field collection, data creation.
Data type to be collected (spatial, non-spatial or a combination of both?)
Data collection techniques: How will the data be collected?
Manually as needed, extract scheduling, surveying, UAS, mobile, web editing?
4. Data Processing (Preparation)
Data cleaning and transformation (which tools/technology: ETL tools, ArcGIS, QGIS, Excel (Power Query), etc.?)
Datasets consolidation into single database objects (feature classes, tables, raster catalog, mosaic)
Data engineering (identify patterns, incomplete and disparate data).
Implement guidelines for naming data, populating lineage attributes, and adding metadata.
Data quality control and validation.
Setup automated geoprocessing, data reviewer and QC workflows for repetitive data preparation tasks.
THE MOST TIME CONSUMING ASPECT, especially in big data companies!
APPROACH TO GIS DATA MANAGEMENT
5. Data Storage
Where will you store your data? Which DB technology: RDBMS, SharePoint, MS Dataverse (warehouse) or data lake?
Consider the data formats
Professional database schema designs
Standardization (makes the data easy to govern)
Use international standardized schemas wherever possible: topographic (e.g. MGCP); operational (various ESRI-designed industry-specific schemas, etc.)
Implement database management best practices
Replicas, versioning, archiving, compressing, enable editor tracking, etc.
6. Data Security
How will you keep your data secure?
Implement best practices:
Create OS access groups, database admin users, and/or email ID (cloud active directory) users
Grant/revoke permissions for users PER database object (feature class, table, raster catalog, mosaic, etc.)
Setup client web app logins
APPROACH TO GIS DATA MANAGEMENT
7. Data Dissemination
For teams or departments requiring the ability to collaborate?
Data distribution via database replicas (online, offline, check-out/check-in)
Online collaborative editing and/or mobile applications
GIS
Operations
HQ
PRACTICAL USE CASE: A DATA MANAGEMENT EXAMPLE
1. Data strategy
From a centralized to decentralized data management approach
Formulated policies and procedures for data management governance.
2. Identify organization-wide needs
Mission’s primary business objectives are defined in the mandate: protection of civilians; disarmament, rehabilitation and re-integration; capacity building in security reform.
Discovered a great appetite for GIS analyses to support the Mission’s situation analytical capability but with no or disparate datasets:
Incidents analyses,
Hotspots identification,
Affected populations (including IDPs),
Area security gaps,
Armed groups monitoring,
Mission field activities analyses and statistics, in response to the security incidents or in support of the mandate.
Identified data gaps
Required data inventory list
PRACTICAL USE CASE: A DATA MANAGEMENT EXAMPLE
Shortage of data management skills: those who can implement, manage and monitor the complete aspects of data
management (not just an aspect), more so in organizations handling increasing amounts of data .
Increased data volumes
Constantly changing compliance requirements: make it a challenge to ensure people are using the right data.
Resistance to change
No or weak data governance
MODERN AND FUTURE TRENDS IN DATA MANAGEMENT
THANK YOU