Schema#
All DART pipeline processors output data in an uniform schema, described below. Data is usually made available as text CSV files or in the parquet format.
iso3: The ISO 3166-1 alpha-3 country code for the measure. The pycountry package is used to map ISO codes to country names.admin_level_0: The common name of the country referred to iniso3admin_level_1: Admin level 1 of the country (state or province)admin_level_2: Admin level 2 of the country (district)admin_level_3: Admin level 3 of the country (city)year: Year of measurementmonth: Month of measurement (1-12)day: Day of measurement (1-31)week: Week of measurement (1-52)metric: Name of the metric, possible values can be seen in the Metrics page.value: Value of the metricunit: Unit of the metric, specified as an UCUM coderesolution: Geographical resolution of the measure, one ofworld,admin0,admin1,admin2oradmin3.creation_date: Date measure was recorded in DART database, in YYYY-MM-DD format.
The schema is available as a JSON Schema for validation. JSON Schema can be used to validate data files, using packages such as {jsonvalidate} in R or fastjsonschema in Python.