jug_gis_validation

jug_gis_validation is a Sabu service that validates geospatial building datasets.

It checks input GeoJSON building datasets against reference census data to assess their consistency, completeness, and plausibility before downstream analysis.

Purpose

The service is designed to:

  • validate geospatial building datasets,
  • compare cleaned GeoJSON data with census-based reference values,
  • identify missing, zero, or inconsistent values,
  • generate validation outputs that support quality control.
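
The missing and zero checks listed above can be illustrated with a minimal sketch. It is not the service's actual implementation: the function name classify_value and the field name total_area are hypothetical, and treating non-numeric text as missing is an assumption made only for this example.

```python
import math


def classify_value(properties, field):
    """Return 'missing', 'zero', or 'ok' for one field of a feature's properties."""
    properties = properties or {}  # GeoJSON allows "properties": null
    value = properties.get(field)
    if value is None or value == "":
        return "missing"
    try:
        number = float(value)
    except (TypeError, ValueError):
        return "missing"  # assumption: non-numeric text counts as missing
    if math.isnan(number):
        return "missing"
    if number == 0:
        return "zero"
    return "ok"


# Hypothetical features; "total_area" is an illustrative field name.
features = [
    {"type": "Feature", "properties": {"total_area": 120.5}, "geometry": None},
    {"type": "Feature", "properties": {"total_area": 0}, "geometry": None},
    {"type": "Feature", "properties": {}, "geometry": None},
]
statuses = [classify_value(f["properties"], "total_area") for f in features]
```

Consistency against census values needs aggregated totals rather than single records, so it is not covered by this per-feature check.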

How validation works

The validation workflow compares two inputs:

  • district-level GeoJSON building data,
  • census CSV data.

The service can aggregate and compare values by Forward Sortation Area (FSA), the three-character prefix of a Canadian postal code. It is designed to support checks such as:

  • counts of buildings or units,
  • area-related fields,
  • floor-adjusted values where applicable,
  • optional filtering by building function.

This makes it possible to assess whether prepared GeoJSON building datasets are consistent with census-informed expectations before they are passed to later services.
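
The aggregation and comparison described in this section can be sketched as follows. This is an illustration under stated assumptions, not the service's code: the property names postal_code, function, and total_area, the census columns fsa and dwellings, the 10% tolerance, and all function names are hypothetical.

```python
import csv
import io
from collections import defaultdict


def fsa_of(postal_code):
    """Return the FSA (first three characters) of a postal code, or None."""
    if not postal_code:
        return None
    cleaned = str(postal_code).replace(" ", "").upper()
    return cleaned[:3] if len(cleaned) >= 3 else None


def aggregate_by_fsa(features, value_field, function_filter=None):
    """Count buildings and sum value_field per FSA, optionally for one function."""
    totals = defaultdict(lambda: {"count": 0, "total": 0.0})
    for feature in features:
        props = feature.get("properties") or {}
        if function_filter is not None and props.get("function") != function_filter:
            continue
        fsa = fsa_of(props.get("postal_code"))
        if fsa is None:
            continue  # features without a usable postal code are skipped
        totals[fsa]["count"] += 1
        totals[fsa]["total"] += float(props.get(value_field) or 0)
    return dict(totals)


def compare_with_census(aggregated, census, tolerance=0.10):
    """Label each census FSA as 'ok', 'missing', or 'inconsistent' by count."""
    report = {}
    for fsa, expected in census.items():
        observed = aggregated.get(fsa, {}).get("count", 0)
        if observed == 0:
            status = "missing"
        elif expected == 0 or abs(observed - expected) / expected > tolerance:
            status = "inconsistent"
        else:
            status = "ok"
        report[fsa] = {"observed": observed, "expected": expected, "status": status}
    return report


def feature(postal_code, function, area):
    """Build a hypothetical GeoJSON feature for the example."""
    props = {"postal_code": postal_code, "function": function, "total_area": area}
    return {"type": "Feature", "properties": props, "geometry": None}


features = [
    feature("H2X 1Y4", "residential", 100.0),
    feature("h2x1y4", "residential", 50.0),
    feature("H2X 3K2", "commercial", 300.0),
    feature("H3A 0G4", "residential", 80.0),
    feature(None, "residential", 40.0),
]

# Hypothetical census extract; real column names may differ.
CENSUS_CSV = "fsa,dwellings\nH2X,2\nH3A,5\nH4B,3\n"
census = {
    row["fsa"]: int(row["dwellings"])
    for row in csv.DictReader(io.StringIO(CENSUS_CSV))
}

aggregated = aggregate_by_fsa(features, "total_area", function_filter="residential")
report = compare_with_census(aggregated, census)
```

Here the comparison uses building counts only; the summed total field shows where an area-based check could attach in the same way.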

Role in Sabu

jug_gis_validation sits between data preparation and emissions calculation.

It is intended to operate after:

  • jug_gis_cities

and before:

  • jug_lca_buildings

Outputs

The validation workflow can produce:

  • Python dictionaries,
  • Pandas DataFrames,
  • CSV summary files,
  • plots for visual comparison.
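
As a rough sketch of how a dictionary result might become a CSV summary, the example below flattens a per-FSA report into rows and writes them with the standard csv module. The report layout, the column names, and both function names are assumptions made for illustration, not the service's defined output format.

```python
import csv
import io


def report_to_rows(report):
    """Flatten {fsa: {...}} into a list of row dictionaries sorted by FSA."""
    return [{"fsa": fsa, **values} for fsa, values in sorted(report.items())]


def write_summary_csv(rows, handle):
    """Write the rows to an open text handle as a CSV summary."""
    fieldnames = ["fsa", "observed", "expected", "status"]
    writer = csv.DictWriter(handle, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)


# Hypothetical validation result keyed by FSA.
report = {
    "H3A": {"observed": 1, "expected": 5, "status": "inconsistent"},
    "H2X": {"observed": 2, "expected": 2, "status": "ok"},
}

rows = report_to_rows(report)
buffer = io.StringIO()  # an open file handle would work the same way
write_summary_csv(rows, buffer)
summary_text = buffer.getvalue()
```

The same list of row dictionaries can be passed to pandas.DataFrame(rows) when a DataFrame is preferred.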

Current status

The validation logic exists, but the service is not yet fully aligned with the Sabu architecture.

Future development will focus on:

  • defining API boundaries,
  • standardizing validation outputs,
  • integrating with shared infrastructure.