Skip to content

InfDB-basedata-buildings

The tool InfDB-basedata-buildings processes building-related data as fundamental data basis for various applications and analyses.

Usage

Details on how to use the tool can be found in the Usage section.

Architecture

The architecture of the InfDB-basedata-buildings tool is based on the InfDB Dev Container template as described in the Tools Template section. The tool is implemented in Python and utilizes the pyinfdb package for interaction with InfDB. The database operations are executed using SQL queries. More details can be found in the Architecture section.

Data Pipeline

More details can be found in the Data Pipeline section.

Tool Overview

  • Loads and cleans LOD2 building data for the chosen AGS (removing too small buildings)
  • Fills building key attributes if missing such as height, area, and floor count
  • Distributes census population across buildings proportionally by volume, then converts occupant estimates to household counts using grid-cell-level average household sizes
  • Classifies residential buildings into types (Single-family, Terraced, Multi-family, Apartment) based on building-related data from the previous step including area, number of floors, and neighbourhood patterns information. Missing types are defaulted to Apartment Buildings.
  • Rebalances type distributions against census totals per 1km grid cell to meet the statistical cell-related distribution data. (1km has been chosen here, to avoid data gaps due to data privacy)
  • Assigns construction periods to buildings by sampling from census statistics per grid cell
  • Attaches geographic weather data from OpenMeteo to each building
  • Outputs a final enriched building dataset ready for downstream tools