World Meteorological Organization

Date: 2024-03-28

Version: 2024-02-18

Document location: https://community.wmo.int/wis2-guide

Standing Committee on Information Management and Technology (SC-IMT)[1]

Commission for Observation, Infrastructure and Information Systems (INFCOM)[2]

Copyright © 2024 World Meteorological Organization (WMO)

Table of Contents

Introduction

Purpose

In conjunction with the Manual on the WMO Information System volume II (WMO-No. 1060) (Manual on WIS volume II: WIS 2.0), the present Guide to the WMO Information System volume II (Guide to WIS volume II: WIS 2.0) is designed to ensure adequate uniformity and standardization in the data, information and communication practices, procedures and specifications employed by Members of the World Meteorological Organization (WMO) in the operation of the WMO Information System WIS 2.0 as it supports the mission of the Organization. The Manual on WIS contains standard and recommended practices, procedures and specifications. The Guide to WIS contains additional information concerning practices, procedures, and specifications that Members are invited to follow or implement in establishing and conducting their arrangements in compliance with the WMO Technical Regulations and in developing meteorological and hydrological services.

1. PART I

1.1. Introduction to WIS2

Since the Global Telecommunication System (GTS) entered operational life in 1971, it has been a reliable real-time exchange mechanism of essential data for WMO members.

In 2007, the WMO Information System (WIS) entered into operations to complement the GTS, providing a searchable catalogue and a global cache to enable additional discovery, access and retrieval. The success of WIS was limited as the system only partially met the requirement of providing simple access to WMO data. Today’s technology developed for the Internet of Things (IoT) opens the possibility of creating a WIS2 able to stand to its expectations of delivering an increasing number and volume of real-time data to WMO Centres in a reliable and cost effective way.

WIS2 has been designed to meet the shortfalls of the current WIS and GTS, support the WMO’s Unified Data Policy and the Global Basic Observing Network (GBON), and meet the demand for high data volume, variety, velocity and veracity.

WIS2 technical framework is based around three foundational pillars: leveraging open standards, simpler data exchange and cloud-ready solutions.

1.1.1. Leveraging open standards

WIS2 leverages open standards to take advantage of the ecosystem of technologies available on the market and avoid building bespoke solutions that can force NMHSs to procure costly systems and equipment. In today’s standards development ecosystem, standards bodies work closely together to minimise overlap and build on one another’s areas of expertise. For example, the World Wide Web Consortium provides the framework of Web standards, which the Open Geospatial Consortium and other standards bodies leverage. WIS2 leverages open standards with industry adoption and wider, stable, and robust implementations, thus extending the reach of WMO data sharing and lowering the barrier to access by Members.

1.1.2. Simpler data exchange

WIS2 prioritises public telecommunication networks, unlike private networks for GTS links. As a result, using the Internet will enable the best choice for a local connection, using commonly available and well-understood technology.

WIS2 aims to improve the discovery, access and utilisation of weather, climate and water data by adopting Web technologies proven to provide a truly collaborative platform for a more participatory approach. Data exchange using the Web also facilitates easy access mechanisms. Browsers and search engines allow Web users to discover data without specialised software. The Web also enables additional data access platforms, e.g. desktop GIS, mobile applications, forecaster workstations, etc. The Web provides access control and security mechanisms that can be utilised to freely share the core data per the WMO Unified Data Policy and protect the data with more restrictive licensing constraints. Web technologies also allow for authentication and authorisation for the provider to retain control of who can access published resources and to request users to accept a license specifying the terms and conditions for using the data as a condition for providing access to them.

WIS2 uses a "publish-subscribe" pattern where users subscribe to a topic to receive new data in real time. The mechanism is similar to WhatsApp and other messaging applications. It is a reliable and straightforward way to allow the user to choose her data of interest and to receive them reliably.

1.1.3. Cloud-ready solutions

The cloud provides reliable platforms for data sharing and processing. It reduces the need for expensive local IT infrastructure, which constitutes a barrier to developing effective and reliable data processing workflows for some WMO Members. WIS2 encourages WMO centres to adopt cloud technologies where appropriate to meet their users' needs. Whilst WMO technical regulations will not mandate cloud services, WIS2 will promote a gradual adoption of cloud technologies that provide the most effective solution.

The cloud-based infrastructure allows easy portability of technical solutions, ensuring that a system implemented by a specific country can be packaged and deployed easily in other countries with similar needs. In addition, using cloud technologies allows WIS2 to deploy infrastructure and systems efficiently with minimum effort for the NMHSs by shipping ready-made services and implementing consistent data processing and exchange techniques.

It should be clear that hosting data and services on the cloud does not affect data ownership. Even in a cloud environment, organisations retain ownership of their data, software, configuration, and change management as if they were hosting their infrastructure. As a result, data authority and provenance stay with the organisation, and the cloud is simply a technical means to publish the data.

1.1.4. Why are datasets so important?

WMO enables the international exchange of observations and model data for all Earth-system disciplines.

The WMO Unified Data Policy, Resolution 1 (Cg-Ext(2021)) describes the Earth system data that are necessary for efforts to monitor, understand and predict the weather and climate - including the hydrological cycle, the atmospheric environment and space weather.

WIS is the mechanism by which this Earth system data is exchanged.

Common practice when working with data is to group them into "Datasets". All the data in a Dataset share some common characteristics. The Data Catalog Vocabulary (DCAT) defines a Dataset as a "collection of data, published or curated by a single agent, and available for access of download in one or more representations" [3].

Why is this important? The "single agent" (i.e., a single organisation) responsible for managing the collection ensures consistency among the data. For example, in a Dataset:

  • All the data should be of the same type (e.g., observations from weather stations).

  • All the data should be have the same license and/or usage conditions.

  • All the data should be subject to the same quality management regime - which may mean that all the data is collected or created using the same processes.

  • All the data should be encoded in the same way (i.e., using the same data formats and vocabularies).

  • All the data should be accessible using the same protocols - ideally from a single location.

This consistency means that one can predict what data is in a Dataset, at least as far as the common characteristics, making it easier to write applications to process the data.

A Dataset might be published as an immutable resource (e.g., data collected from a research programme), or it might be routinely updated (e.g., every minute as new observations are collected from weather stations).

A Dataset may be represented as a single, structured file or object (e.g., a CSV file where each row represents a data record) or as thousands of consistent files (e.g., output from a reanalysis model encoded as many thousands of GRIB files). Determining the best way to represent a Dataset is beyond the scope of this guide - there are many factors to consider! The key point here is that we consider the Dataset to be a single, identifiable resource irrespective of how it’s represented.

Because we group data into a single, conceptual resource (i.e., the Dataset) we can:

  • Give this resource an identifier, and use this identifier to unambiguously refer to collections of data.

  • Make statements about the Dataset (i.e., metadata), and infer that these statements apply to the entire collection.

All this means that the Dataset concept is central WIS:

  • We publish discovery metadata about Datasets, as specified in the Manual on WIS (WMO-No. 1060), Volume II, Appendix F: WMO Core Metadata Profile 2.

  • We can search for Datasets that contain relevant data using the Global Discovery Catalogue (see Global Discovery Catalogue).

  • We can subscribe to notifications about updates about a Dataset via a Global Broker (see Global Broker).

  • We can access the data that comprises a Dataset from a single location using a well described mechanism.

It is up to the Data Publisher to decide how their data is grouped into Datasets - effectively, to decide what Datasets they publish to WIS. That said, we recommend that, subject to the consistency rules above, Data Publishers should organise their data into as few Datasets as possible.

For a Data Publisher, this means fewer discover metadata records to maintain. For a Data Consumer this means fewer topics to subscribe to and fewer places to access the data.

There are some things that are fixed requirements for Datasets:

  1. All data in the Dataset must be accessible from a single location.

  2. All data in the Dataset must be subject to the same license or usage conditions.

Here are some examples of Datasets:

  • The most recent 5-days of synoptic observations for an entire country or territory[4].

  • Long-term record of observed water quality for a managed set of hydrological stations.

  • Output from the most recent 24-hours of operational numerical weather prediction model runs.

  • Output from 6-months of experimental model runs. It’s important to note that output from the operational and experimental should not be merged into the same dataset because they use different algorithms - it’s very useful to be able to distinguish the provenance (or lineage) of data.

  • A multi-petabyte global reanalysis spanning 1950 to present day.

In summary, Datasets are important because they’re how data is managed in WIS.

1.2. Data consumer

As a Data Consumer wanting to use data published via WIS2 you should read the guidance presented here. In addition, a list of references to informative material in this Guide and elsewhere is provided at the end of this section.

1.2.1. How to search the Global Discovery Catalogue to find Datasets

The first step to using data published via WIS2 is to determine which Dataset or Datasets contains the data that is needed. To do this, a Data Consumer may browse discovery metadata provided by the Global Discovery Catalogue. Discovery metadata follows a standard scheme (see Manual on WIS (WMO-No. 1060), Volume II, Appendix F: WMO Core Metadata Profile 2). A Data Consumer may discover a Dataset using keywords, geographic area of interest, temporal information, or free text. Matching search results from the Global Discovery Catalogue provide high-level information (title, description, keywords, spatiotemporal extents, data policy, licensing, contact information), from which a Data Consumer can assess and evaluate their interest in accessing/downloading data associated with the Dataset record.

A key component of Dataset records in the Global Discovery Catalogue is that of "actionable" links. A Dataset record provides one to many links that clearly identify the nature and purpose of the link (informational, direct download, API, subscription) so that the Data Consumer can interact with the data accordingly. For example, a Dataset record may include a link to subscribe to notifications (see How to subscribe to notifications about availability of new data) about the data, or a API, or an offline archive retrieval service.

The Global Discovery Catalogue is accessible via an API and provides a low barrier mechanism (see Global Discovery Catalogue). Internet search engines are able to index the discovery metadata in the Global Discovery Catalogue, thereby providing Data Consumers with an alternative means to search for WIS2 data.

1.2.2. How to subscribe to notifications about availability of new data

WIS2 provides notifications about updates to Datasets; for example, when a new observation record from an automatic weather station is added to a Dataset of surface observations. Notifications are published on message brokers. Where Data Consumers need to use data rapidly once it has been published (e.g., for input to a weather prediction model), they should subscribe to one or more Global Broker to get notifications messages using MQTT protocol[5].

In WIS2, notifications are re-published by Global Brokers to ensure resilient distribution. Consequently, there will by multiple places where one can subscribe. Data Consumers requiring real-time notifications must subscribe to Global Brokers. A Data Consumer should subscribe to more than one Global Broker, thereby ensuring that notifications continue to be received in the event that a Global Broker instance fails.

A Dataset in WIS2 is associated with a unique topic. Notifications about updates to a Dataset are published to the associated topic. Topics are organized according to a standard scheme (see the Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy).

A Data Consumer can find the appropriate topic to subscribe to either by searching the Global Discovery Catalogue, using an Internet search engines[6], or by browsing the topic hierarchy on a message broker.

WIS2 uses Global Caches to distribute Core data, as defined in the WMO Unified Data Policy (Resolution 1 (Cg-Ext 2021)). Each Global Cache re-publishes Core data on its own highly-available data server and publishes a new notification message advertising the availability of that data from a the Global Cache location.

Notifications from WIS2 Nodes and Global Caches are published on different topics: The root topic used by WIS2 Nodes is origin, while the root topic used by Global Caches is cache. Other than the root, the topic hierarchy is identical. For example, for synoptic weather observations published by Environment Canada:

  • Environment and Climate Change Canada, Meteorological Service of Canada’s WIS2 Node publishes to: origin/a/wis2/ca-eccc-msc/data/core/weather/surface-based-observations/synop

  • Global Caches publish to: cache/a/wis2/ca-eccc-msc/data/core/weather/surface-based-observations/synop

As per clause 3.2.13 of the Manual on WIS (WMO-No. 1060), Volume II, Data Consumers should access Core data from the Global Caches. Consequently, they need to subscribe to the cache topic hierachy to receive the notifications from Global Caches, each of which provides a link (i.e., URL) to download from the respective Global Cache’s data server.

1.2.3. How to use a notification message to decide whether to download data

On receipt of a notification message, a Data Consumer needs to decide whether to download the newly available data. The content of the notification message provides the information needed to make this decision. For details of the specification, see the Manual on WIS (WMO-No. 1060), Volume II, Appendix E: WIS2 Notification Message.

In many cases, Data Consumers will use a software application to determine whether or not to download the data. This section provides insight about what happens.

When subscribing to multiple Global Brokers the Data Consumer will receive multiple copies of a notification message. Each notification message has a unique identifier, defined using the id property. Duplicate messages should be discarded.

Core data will be available from both a WIS2 Node and Global Caches, each of which publish a different notification message advertising alternative locations from where the data can be downloaded. Because these are different messages, they will have different identifiers. However, the each of these messages refers to the same data object, which is uniquely identified in the notification message using the data_id property. Notification messages from different sources can easily be compared to determine if they refer to the same data. By subscribing to the cache root topic, Data Consumers will only receive notifications about data available from the Global Caches. The origin root topic should be used when subscribing to notifications about Recommended data. Data Consumers should not subscribe to the origin root topic for notifications about Core data because notification messages provided on these topics will refer to data published on directly on the WIS2 Nodes (i.e., the "origin").

Data Consumers need to consider their strategy for managing these duplicate messages. From a data perspective, it does not matter which Global Cache instance is used – they will all provide an identical copy of the data object published by the originating WIS2 Node. The simplest strategy is to accept the first notification message and download from the Global Cache instance that the message refers to (i.e., using a URL for the data object at that Global Cache instance). Alternatively, a Data Consumer may have a preferred Global Cache instance, for example, that is located in their region. Whichever Global Cache instance is chosen, Data Consumers will need to implement logic to discard duplicate notification messages based on id and duplicate data objects based on data_id.

A notification message also provides a small amount of metadata about the data object it references: Location, time, etc. Data Consumers can use this metadata to decide if the data object referenced in the message should be downloaded. This is known as client-side filtering.

The notification message should also include the metadata identifier for the Dataset to which the data object belongs. A Data Consumer can use the metadata identifier to search the Global Discovery Catalogue and discover more about the data - in particular, whether there are any conditions on the use of this data.

1.2.4. How to download data

Links to where data can be accessed are made available through Dataset discovery metadata (via the Global Discovery Catalogue) and/or data notification messages (via Global Brokers). Links can be used to directly download the data (according to the network protocol and content description provided in the link) using a mechanism appropriate to the workflow of the Data Consumer. This could include Web and/or desktop applications, custom tooling, or other approaches.

A discovery metadata record or notification message may provide more than one download links. The preferred link will be identified as "canonical" (link relation: "rel": "canonical"[7]).

Where data is provided through an interactive Web service, a canonical link that provides a URL where one can directly download a data object may be complemented with an additional link providing the URL for the root of the Web service from where one can interact with or query the entire Dataset.

If a download link implements access control (i.e., where the Data Consumer needs to take some additional action(s) to download the data object), the download link will contain a security object that provides the pertinent information (e.g., the access control mechanism used, and where/how a Data Consumer would need to register to request access).

1.2.5. How to use data

Data is shared on WIS2 in accordance with the WMO Unified Data Policy (Resolution 1 (Cg-Ext 2021)). This data policy describes two categories of data: Core and Recommended:

  • Core data is considered essential for provision of services for the protection of life and property and for the well-being of all nations. Core data is provided on a free and unrestricted basis, without charge and with no conditions on use.

  • Recommended data is exchanged on WIS2 in support of Earth system monitoring and prediction efforts. Recommended data may by provided with conditions on use and/or subject to a license.

Furthermore, the WMO Unified Data Policy encourages attribution of the source of the data in all cases. In this way, credit is given to those who have expended effort and resources in collecting, curating, generating, or processing the data. Attribution provides visibility of who is using data which, for many organizations, provides necessary evidence to justify continued provision of and updates to the data.

Details of the applicable WMO data policy and any rights or licenses associated with data are provided in the discovery metadata that accompanies the data. Discovery metadata records are available from the Global Discovery Catalogue.

The Manual on WIS (WMO-No. 1060), Volume II, Appendix F: WMO Core Metadata Profile 2, section 7.1.17. Properties / WMO data policy provides details on how data policy, rights, and/or licenses are described in the discovery metadata.

When using data from WIS2, data consumers:

  • Shall respect the conditions of use applicable to the data as expressed in the WMO data policy, rights statements, or licenses.

  • Should attribute the source of the data.

1.2.6. Further reading for data consumers

As a Data Publisher planning to operate a WIS2 Node, as a minimum you should read the following sections:

The following specifications in the Manual on WIS (WMO-No. 1060), Volume II are useful for further reading:

  • Appendix D: WIS2 Topic Hierarchy

  • Appendix E: WIS2 Notification Message

  • Appendix F: WMO Core Metadata Profile 2

1.3. Data publisher

As a Data Publisher with authoritative Earth system data that you want to share with the WMO community you should read the guidance presented here. In addition, a list of references to informative material in this Guide and elsewhere is provided at the end of this section.

1.3.1. How to get started

The first thing you need to do is consider your data, how it can be conceptually grouped into one or more datasets (see [_why_are_datasets_so_important?]), and whether it is Core or Recommended data, as per the WMO Unified Data Policy (Resolution 1 (Cg-Ext 2021)).

Next, you need to consider where it is published. If your data relates to your country or territory, you need to publish it through a National Centre (NC). If your data relates to a region, programme, or other specialized function within WMO, you need to publish it through a Data Collection or Production Centre (DCPC). The functional requirements for NC and DCPC are described in the Manual on WIS (WMO-No. 1060), Volume II - Part III Functions of WIS.

All NCs and DCPCs are affiliated with a Global Information System Centre (GISC) that has a responsibility to help establish efficient and effective data sharing on the WIS. Your GISC will be able to help you in getting your data onto WIS2.

You may be able to identify an existing NC or DCPC that can publish your data. Alternative, you need to establish a new NC or DCPC. The main difference is that an NC is designated by a Member, whereas a DCPC is designated by a WMO or related international programme and/or a regional association.

Both NC and DCPC require operation of a WIS2 Node (see WIS2 Node). The procedure for registering a new WIS2 Node is provided in Registration and decommissioning of a WIS2 Node.

Once you have determined the scope of your datasets, the data policy which applies, and have a WIS2 Node ready for data publication, you are ready to progress to the next step: providing discovery metadata.

1.3.2. How to provide discovery metadata to WIS2

Discovery metadata is the mechanism by which you tell potential consumers about your data, how it may be accessed, and any conditions you may place on the use of the data.

Each dataset you want to publish must have an associated discovery metadata record. This record is encoded as GeoJSON (RFC 7946[8]) must conform to the specification given in the Manual on WIS (WMO-No. 1060), Volume II, Appendix F: WMO Core Metadata Profile 2.

Copies of all discovery metadata records from WIS2 are held at the Global Discovery Catalogues, where data consumers can search and browse to find data that is of interest to them.

Depending on local arrangements, your GISC may be able to help you transfer your discovery metadata record(s) to the Global Discovery Catalogues. If this is not the case, you will need to publish the discovery metadata record(s) yourself[9] using one of two ways:

  • The simplest method is to encode the discovery metadata record as a file and publish it to an HTTP server where it can be accessed with a URL.

  • Alternatively, you may operate a local metadata catalogue through which discovery metadata records can be shared using an API (e.g., OGC API - Records[10]). Each discovery metadata record can be accessed with a unique URL via the API (e.g., as an item which is part of the discovery metadata catalogue).

In both cases, a notification message needs to be published on a message broker that tells WIS2 there is new discovery metadata to upload and that it is accessed at the specified URL[11]. The notification messages shall conform to the specification given in the Manual on WIS (WMO-No. 1060), Volume II, Appendix E: WIS2 Notification Message. Furthermore, the notification message must be published on topic that conforms to the specification given in Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy. For example, metadata published by Deutscher Wetterdienst would use the following topic: origin/a/wis2/de-dwd/metadata/core

These discovery metadata records are then propagated through the Global Service components into to the Global Discovery Catalogue where Data Consumers can search and browse for datasets of interest.

Upon receipt of a new discovery metadata record, a Global Discovery Catalogue (see Global Discovery Catalogue) will validate, assess, ingest, and publish the record. Validation ensures that your discovery metadata record complies with the specification. Assessment examines your discovery metadata record against good practice. The Global Discovery Catalogue will notify you if your discovery metadata record fails validation and provide recommendations for improvements for you to consider.

Discovery metadata must be published in the Global Discovery Catalogues before you begin publishing data.

1.3.3. How to provide data to WIS2

WIS2 is based on the Web architecture[12]. As such it is resource oriented. Datasets are resources; the "granules" of data grouped in a dataset are resources; the discovery metadata records that describe datasets are resources. In Web architecture, every resource has a unique identifier (e.g., a URI[13]), and the unique identifier can be used to resolve the resource identified and interact with it (e.g., to download a representation of the resource over an open standard protocol such as HTTP).

Simply, you provide data (and metadata) to WIS2 by assigning it a unique identifier, in this case a URL[14], and making it available via a data server - most typically a Web server using the HTTP protocol[15]. It’s up to the data server to decide what to provide when resolving the identifier: the URL of a data granule may resolve as a representation encoded in a given data format, whereas the URL of a dataset may resolve as a description of the dataset (i.e., metadata) that includes links to access the data from which the set is comprised - either individual files (i.e., the data granules) or an interactive API that enables a user to request just the parts of the dataset they need by specifying query parameters.

The following sections cover specific considerations relating to publishing data to WIS2.

1.3.3.1. Data formats and encodings

Whether providing data as files or through interactive APIs you need to decide which encodings (aka. data formats) to use. WMO Technical Regulations may require that data is encoded in particular formats. For example, synoptic observations must be encoded in BUFR. The Manual on Codes (WMO-No. 306) provides details of data formats formally approved for use in WMO. However, Technical Regulations don’t cover all data sharing requirements. In such cases, you should select data formats that are open, non-proprietary, and widely adopted and understood in their target user community. In this context, ‘open’ means that anyone can use the format without needing a license to do so – either to encode data in that format or write software that understands the format.

1.3.3.2. Providing data as files

The simplest way to publish data through WIS2 is to persist your data as files and publish those files on a Web sserver. All these files need to be organised somehow – perhaps a flat structure or grouped into collections that resemble folders or directory structures.

To make your data usable, your users need to be able to find the specific file (or files) they need.

Naming conventions for files and/or directories are useful - but only when the scheme is understood. Assuming that your users understand your naming convention will be a barrier to widespread re-use; many users will simply treat the filename as an opaque string. Where communities commonly use file-naming conventions (e.g., with embedded metadata), these should only be used when adequate documentation is provided to users.

WIS2 does not require the use of specific naming conventions.

Another mechanism to consider is complementing the collections (i.e., directories or folders in which files are grouped) with information that describes their content. Then users,both humans and software agents, can browse the structure and find what they need. Examples of this approach include:

  • Web Accessible Folders (WAF) and "README" files: a Web-based folder structure listing the data object files by name, where each folder contains a formated "README" file describing the folder contents.

  • SpatioTemporal Asset Catalog (STAC)[16]: a community standard based on GeoJSON to describe geospatial data files which can be easily indexed, browsed, and accessed. Free and open source tools tools present STAC records (one for each data object file) through a Web-based, browse-able user interface.

When publishing collections of data it is tempting to package content into zip or SIP[17] resources - perhaps even packaging the entire collection, complete with folders, into a single resource. Similarly, WMO formats such as GRIB and BUFR allow multiple data objects (e.g., fields or observations) to be packed into a single file. Only having to download a single resource is convenient for many users, but the downside is that the user must download the entire resource and then unpack/decompress it. The convenience of downloading fewer resources needs to be balanced against the cost of forcing users to download data they may not need. Whatever your choice, you should be guided by common practice in your domain - i.e., only zip, SIP, or pack if your users expect it.

1.3.3.3. Providing interactive access to data with APIs

Interactive data access aims to support efficient data workflows by enabling client applications to request only the data that they need. The advantage with interactive data access is that it provides more flexibility. Data publishers can offer an API structured around how users want to work with the data rather than force them to work with the structure that is convenient for you as a data publisher.

But it is more complex to implement. You need a server running software that can:

  1. Interpret a user’s request;

  2. Extract the data from wherever it is stored;

  3. Package that data up and send it back to the user.

Importantly, when considering use of interactive APIs to serve your data you need to plan for costs: every request to an interactive API requires computational resources to process.

Based on the experience of data publishers who have been using Web APIs to serve their communities, this Guide makes the following recommendations about interactive APIs:

  • First, interactive APIs should be self-describing. A Data Consumer should not need to know, apriori, how to make requests from a API. They should be able to discover this information from the API endpoint itself – even if this is just a link to a documentation page they need to read.

  • Second, APIs should comply with OpenAPI[18] version 3 or later. OpenAPI provides a standardised mechanism to describe the API. Tooling (free and, commercial, etc.) is widely available that can read this metadata and automatically generate client applications to query the API.

  • Third, the Open Geospatial Consortium (OGC) have developed a suite of APIs[19] (called "OGC APIs") that are designed specifically to provide APIs for geospatial data workflows (discovery, vizualisation, access, processing/exploitation) – all of which build on OpenAPI. Among these, OGC API – Environmental Data Retrieval (EDR)[20], OGC API – Features[21], and OGC API - Coverages[22] are considered particularly useful. Because these are open standards, there is an ever-growing suite of software implementations (both free and proprietary) that support them. We recommend that data publishers assess these open-standard API specifications to determine their suitability to for publishing their datasets using APIs.

Finally, you should consider versioning your API to avoid breaking changes when adding new features. A common approach is add a version number prefix into the API path; e.g., /v1/service/{rest-of-path} or /service/v1/{rest-of-path}.

More guidance on use of interactive APIs in WIS2 is anticipated in future versions of this Guide.

1.3.3.4. Providing data in (near) real-time

WIS2 is designed to support the data sharing needs of all WMO programmes. Among these, the World Weather Watch [23] drives specific needs for the rapid exchange of data to support weather forecasting.

To enable real-time data sharing[24], WIS2 uses notification messages to advertise the availability of a new resource - data or discovery metadata - and how to access that resource. Notification messages are published to a queue on a message broker in your WIS2 Node[25] using the MQTT protocol and immediately delivered to everyone subscribing to that queue. A queue is associated with a specific topic, such as dataset.

For example, when a new temperature profile from a radio sonde deployment is added to a dataset of upper-air data measurements, a notification message would be published that includes the URL used to access the new temperature profile data. Everyone subscribing to notification messages about the upper-air measurement dataset would receive the notification message, identify the URL and download the new temperature profile data.

Optionally, data may be embedded in a notification message using a content object in addition to publishing via the data server. Inline data must be encoded as UTF-8, Base64, or gzip, and must not exceed 4096 bytes in length once encoded.

Notification messages are encoded as GeoJSON (RFC 7946) and must conform to the Manual on WIS (WMO-No. 1060), Volume II, Appendix E: WIS2 Notification Message.

The URL used in the notification message should refer only to the newly added data object rather (e.g., the new temperature profile) than the entire dataset. However, the WIS2 Notification Message specification allows for multiple URLs to be provided. If you are providing your data through an interactive API, you might provide a "canonical" link (designated with link relation: "rel": "canonical"[26]), and an additional link providing the URL for the root of the Web service from where one can interact with or query the entire Dataset.

You should include the dataset identifier in the notification message (metadata_id property). This allows data consumers receiving the notification to cross reference with information provided in the discovery metadata for the dataset, such as the conditions of use specified in the data policy, rights, or license.

Furthermore, if you have implemented controlled access to your data (e.g., the use of an API key), you should include a security object in the download link that provides the pertinent information (e.g., the access control mechanism used, and where/how a Data Consumer would need to register to request access).

To ensure that data consumers can easily find the topics they want to subscribe to, data publishers must publish to an authorized topic, as specified in the Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy.

If your data seems to relate to more than one topic, select the most appropriate one. The topic-hierarchy is not a knowledge organisation system - it is only used to ensure uniqueness of topics for publishing notification messages. Discovery metadata is used to describe a dataset and its relevance to additional disciplines; each dataset is mapped to one, and only one, topic.

If the WIS2 Topic Hierarchy does not include a topic appropriate for your data, your should publish on an experimental topic. This allows for data exchange to be established while the formalities are considered[27]. Experimental topics are provided for each Earth-system discipline at level 8 in the topic hierarchy (e.g., origin/a/wis2/{centre-id}/data/{earth-system-discipline}/experimental/). Data publishers can can extend the experimental branch with sub-topics as they deem appropriate. Experimental topics are subject to change and will be removed once they are no longer needed. For more information, see Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy, section 7.1.2 Publishing guidelines.

Whatever topic you choose, the discovery metadata you provided to the Global Discovery Catalogue must include subscription links using that topic[28]. The Global Broker will only republish notification messages on topics specified in your discovery metadata records.

1.3.3.5. Considerations when providing Core data in WIS2

Core data, as specified in the WMO Unified Data Policy (Resolution 1 (Cg-Ext 2021)) is considered essential for provision of services for the protection of life and property and for the well-being of all nations. Core data is provided on a free and unrestricted basis, without charge and with no conditions on use.

WIS2 ensures highly available, rapid access to most Core data via a collection of Global Caches (see Global Cache). Global Caches subscribe to notification messages about the availability of new Core data published at WIS2 Nodes, download a copy of that data and re-publish it on a high-performance data server, discarding it after the retention period expires - normally 24-hours[29]. Global Caches do not provide any sophisticated APIs - they publish notification messages advertising the availability of data on their cache and allow users to download data via HTTPS using the URL in the notification message.

The URL included in a notification message that is used to access Core data from a WIS2 Node, or the "canonical" URL if multiple URLs are provided, must:

  1. Refer to an individual data object; and

  2. Be directly resolvable, i.e., the data object can downloaded simply by resolving the given URL without further action.

A Global Cache will download and cache the data object accessed via this URL.

The Global Caches are designed to support Members efficiently share real-time and near real-time data; they take on the task of making sure that Core data is available to all and cover the costs of delivering data to a global community.

Unfortunately, Global Caches cannot republish all Core data: there is a limit to how much data they can afford to serve. Currently, a Global Cache expected to cache about 100GB of data each day.

If frequent updates to your dataset are very large (e.g., weather prediction models or remote sensing observations) you will need to share the burden of distributing your data with the Global Cache operators. You should work with your GISC to determine the highest priority elements of your Core datasets that will be republished by the Global Caches.

For Core data that is not to be cached, you must set the cache property in the notification message to false[30].

You must ensure that Core data that is not cached is publicly accessible from your WIS2 Node; i.e., with no access control mechanisms in place.

A Global Cache operator may choose to disregard your cache preference - for example, if they feel that the content you are providing is large enough to impede provision of caching services for other Members[31]. In such cases, the Global Cache operator will log this behaviour. In collaboration with the Global Cache operators, your GISC will work with you to resolve concerns.

Finally, please note that Global Caches are under no obligation to cache data published on experimental topics. For such data, the cache property should be set to false.

1.3.3.6. Implementing access control

Recommended data, as defined in the WMO Unified Data Policy (Resolution 1 (Cg-Ext 2021)), is exchanged on WIS2 in support of Earth system monitoring and prediction efforts and may by provided with conditions on use. This means that you may control access to Recommended data.

Access control should use only the "security schemes" for authentication and authorization specified in OpenAPI[32].

Where access control is implemented, you should include a security object in download links provided in discovery metadata and notification messages that provide the user with pertinent information about the access control mechanism used and where/how they might register to request access.

Recommended data is never cached by the Global Caches.

Use of Core data must always be free and unrestricted. However, you may need to leverage existing systems with built-in access control when implementing the download service for your WIS2 Node.

Example 1: API key. Your data server requires a valid API key to be included in download requests. The URLs used notification messages should include a valid API key.[33][34]

Example 2: Pre-signed URLs. Your data server uses a cloud-based object store that requires credentials to be provided when downloading data. The URLs used in notification message should be pre-signed with the data publisher’s credentials and valid for the cache retention period (e.g., 24-hours).[35]

In both cases, the URL provided in a notification message can be directly resolved without a user, or Global Cache, needing to take additional action such as providing credentials or authenticating.

Finally, note that if you are only publishing Core data, you may be able to entirely rely on the Global Caches to distribute your data. In such cases, your WIS2 Node may use IP-filtering to allow access only from Global Services. For more details, see section 2.6 Implementation and operation of a WIS2 Node.

1.3.3.7. Providing access to data archives

There is no requirement for a WIS2 Node to publish notification messages about newly available data - the mechanism is available if needed (e.g., for real-time data exchange). Data archives published through WIS2 do not need to provide notification messages for data unless the user community have expressed a need to be rapidly notified about changes (e.g., the addition of new records into a climate observation archive).

However, notification messages must still be used to share discovery metadata with WIS2. Given that provision of metadata and subsequent updates is likely to be infrequent, it may be sufficient to "hand-craft" a notification message and publish it locally on an MQTT broker[36] or with help from a GISC. See above for more details on publishing discovery metadata to WIS2.

Note that some data archives are categorised as Core data; for example, Essential Climate Variables. Core data may be distributed via the Global Caches. However, given that these provide only short-term hosting of data (e.g., 24-hours), Global Caches are not an appropriate mechanism to provide access to archives of Core data. The archive must be accessed directly via the WIS2 Node.

1.3.4. Further reading for data publishers

As a Data Publisher planning to operate a WIS2 Node, as a minimum you should read the following sections:

The following sections are useful for further reading:

Note that sections 4.1. Security and 5.1. Competencies reference content originally published for WIS1. These remain largely applicable and will be updated subsequent releases of this Guide.

If you are publishing aviation weather data via WIS2 for onward transmission through ICAO SWIM you should also read:

Finally, you should also review the specifications in the Manual on WIS (WMO-No. 1060), Volume II:

  • Appendix D: WIS2 Topic Hierarchy

  • Appendix E: WIS2 Notification Message

  • Appendix F: WMO Core Metadata Profile 2

2. PART II

2.1. WIS2 Architecture

WIS2 is a federated system of systems based on Web-Architecture and open standards, comprising of many WIS2 Nodes for publishing data and Global Services that enable fault tolerant, highly available, low latency data distribution.

National Centres (NC), Data Collection or Production Centres (DCPC), and Global Information System Centres (GISC) are sll types of WIS Centre.

NCs and DCPCs operate WIS2 Nodes.

GISCs coordinate the operation of WIS within their Area of Responsibility (AoR) and ensure the smooth operation of the WIS2 system.

A WIS Centre may also operate one or more Global Services.

WIS Centres shall comply with the Technical Regulations defined in the Manual on WIS (WMO-No. 1060), Volume II.

2.2. Roles in WIS2

When describing the functions of WIS2 there are four roles to consider:

  1. Data Publisher

  2. Global Coordinator

  3. Global Service Operator

  4. Data Consumer

These roles are outlined below.

2.2.1. Data Publisher

  • This role is fulfilled by NC and DCPC.

  • Data Publishers operate a WIS2 Node to share their data within the WIS2 ecosystem.

  • Data Publishers manage, curate, and provide access to one or more "Datasets".

  • For each Dataset, a Data Publisher provides:

    1. "Discovery metadata" to describe the Dataset, provide details on how it can be accessed, and under what conditions.

    2. An API or Web-service to access (or interact with) the Dataset.

    3. Notification messages advertising the availability of new data and metadata.

2.2.2. Global Coordinator

  • This role is exclusive to GISCs.

  • All GISCs supporting WMO Members in their AoR fulfil their data sharing obligations via WIS2.

2.2.3. Global Service operator

  • To ensure highly available global data exchange, a WIS Centre may operate one or more Global Services:

    1. Global Discovery Catalogue: enables users to search all Datasets provided by Data Publishers and discover where and how to interact with those Datasets (e.g., subscribe to updates, access/download/visualize data, or access more detailed information about the Dataset).

    2. Global Broker: provides highly available messaging services where users may subscribe to notifications about all Datasets provided by Data Publishers.

    3. Global Cache: provides highly available download service for cached copies of core data downloaded from Data Publishers’ Web-services.

    4. Global Monitor: gathers and displays system performance, data availability, and other metrics from all WIS2 Nodes and Global Services.

2.2.4. Data Consumer

  • This role represents anyone wanting to find, access, and use data from WIS2 – examples include (but are not limited to): NMHS, government agency, research institution, private sector organisation, etc.

  • Searches or browses the Global Discovery Catalogue (or other search engine) to discover the Dataset(s) that meet their needs (i.e., "Datasets of interest").

  • Subscribes via the Global Broker to receive notification messages about the availability of data or metadata associated with Datasets of interest.

  • Determines whether the data or metadata referenced in notification messages is required.

  • Downloads data from Global Cache or WIS2 Node.

2.3. Specifications of WIS2

Leveraging existing open standards, WIS2 defines the following specifications in support of publish, subscribe, notification, and discovery:

Table 1. WIS2 Specifications
Specification Granularity Primary WIS2 Component(s)

WMO Core Metadata Profile 2 (WCMP2)

datasets

Global Discovery Catalogue (GDC)

WIS2 Topic Hierarchy (WTH)

dataset granules

Global Broker, WIS2 Nodes

WIS2 Notification Message

dataset metadata, dataset granules

Global Broker, WIS2 Nodes

Please refer to the Manual on WIS (WMO-No. 1060), Volume II for details.

2.4. Components of WIS2

2.4.1. WIS2 Node

  • WIS2 Nodes are central to WIS2. These are operated by National Centres (NC) and Data Collection or Production Centres (DCPC) to publish their Core and Recommended data.

  • WIS2 adopts Web technologies and open standards enabling WIS2 Nodes to be implemented using freely-available software components and common industry practices.

  • WIS2 Nodes publish data as files of a Web server or using an interactive Web service.

  • WIS2 Nodes describe the data they publish using discovery metadata. See the Manual on WIS (WMO-No. 1060), Volume II, Appendix F: WMO Core Metadata Profile 2.

  • WIS2 Nodes generate notification messages advertising the availability of new data. These notification messages are published to a message broker. The WIS2 Topic Hierarchy is used to ensure that all WIS2 Nodes publish to consistent topics. The information in the notification message tells the Data Consumer where to download data from. Notification messages are also used to advertise the availability of discovery metadata. See the Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy and Appendix E: WIS2 Notification Message.

  • WIS2 Nodes may implement controlled access for the data they publish. Global Services will operate with fixed IP addresses, enabling WIS2 Nodes to easily distinguish their requests.

2.4.2. Global Broker

  • WIS2 incorporates several Global Brokers, ensuring highly resilient distribution of notification messages across the globe.

  • A Global Broker subscribes to the message broker operated by each WIS2 Node and republishes notification messages.

  • A Global broker subscribes to notifications from other Global Brokers to ensure it receives a copy of all notification messages.

  • A Global Broker republishes notification messages from every WIS2 Node and Global Service.

  • A Global Broker operates a highly available, high-performance message broker.

  • A Global Broker uses the WIS2 Topic Hierarchy enabling a Data Consumer to easily find topics relevant to their needs.

  • Data Consumers should subscribe to notifications from a Global Broker not directly to the message brokers operated by WIS2 Nodes.

2.4.3. Global Cache

  • WIS2 incorporates several Global Caches, ensuring highly resilient distribution of data across the globe.

  • A Global Cache provides a highly available data server from which a Data Consumer can download Core data, as specified in the WMO Unified Data Policy, Resolution 1 (Cg-Ext(2021)).

  • A Global Cache subscribes to notification messages via a Global Broker.

  • On receipt of a notification message, the Global Cache downloads from the WIS2 Node a copy data referenced in the notification message, makes this copy available on its data server, and publishes a new notification message advertising availability of this data at the Global Cache.

  • A Global Cache will subscribe to notification messages from other Global Caches enabling it to download and republish data it has not acquired directly from WIS2 Nodes. This ensures that each Global Cache holds data from every WIS2 Node.

  • A Global Cache shall retain a copy of core data for a duration compatible with the real-time or near real-time schedule of the data and not less than 24-hours.

  • A Global Cache will delete data from the cache once the retention period has expired.

  • Data Consumers should download data from a Global Cache when available.

2.4.4. Global Discovery Catalogue

  • WIS2 includes several Global Discovery Catalogues.

  • A Global Discovery Catalogue enables a data consumer to search and browse descriptions of data published by each WIS2 Node. The data description (i.e., discovery metadata) provides sufficient information to determine the usefulness of data and how one may access it.

  • A Global Discovery Catalogue subscribes to notification messages via a Global Broker about the availability of new (or updated) discovery metadata. It downloads a copy of the discovery metadata and updates the catalogue.

  • A Global Discovery Catalogue will amend discovery metadata records to add details of where one can subscribe to updates about the Dataset at a Global Broker.

  • A Global Discovery Catalogue makes its content available for indexing by search engines.

2.4.5. Global Monitor

  • WIS2 includes a Global Monitor service.

  • The Global Monitor collects metrics from WIS2 components.

  • The Global Monitor provides a dashboard that supports operational management of the WIS2 system.

  • The Global Monitor tracks:

    1. What data is published by WIS2 Nodes.

    2. Whether data can be effectively accessed by Data Consumers.

    3. The performance of components in the WIS2 system.

2.5. Protocols configuration

2.5.1. Publish-Subscribe protocol (MQTT)

  • The MQTT protocol[37] is to be used for all WIS2 Publish-Subscribe workflow (publication and subscription).

  • MQTT v3.1.1 and v5.0 are the chosen protocols for the WIS2 Notification Messages publication and subscription.

    • To connect to Global Brokers, MQTT v5.0 is preferred as it provides additional features such as the ability to used shared subscription.

  • The following parameters are to be used for all MQTT client/server connectivity and subscription:

    • Message retention: false

    • Quality of Service (QoS) of 1

    • A maximum of 2000 messages to be held in a queue per client

  • In order to permit authentication and authorization for users, WIS2 Node, Global Cache, Global Discovery Catalogue and Global Brokers shall use a user and password based mechanism.

  • To improve the overall level of security of WIS2, the secure version of the MQTT protocol is preferred. If used, the certificate must be valid.

  • The standard TCP ports to be used are 8883 for Secure MQTT (MQTTS) and 443 for Secure Web Socket (WSS).

2.5.2. Download protocol (HTTP)

  • The HTTP protocol (RFC 7231[38]) is to be used for all WIS2 download workflow.

  • To improve the overall level of security of WIS2, the secure version of the HTTP protocol is preferred. If used, the certificate must be valid.

  • The standard TCP port to be used is 443 for Secure HTTP (HTTPS).

2.6. Implementation and operation of a WIS2 Node

2.6.1. Practices and procedures

2.6.1.1. Registration and decommissioning of a WIS2 Node

Registration and decomissioning of WIS2 Nodes must be approved by the Permanent Representative to WMO (PR) for the country or territory in which the WIS Centre resides. Where the WIS2 Node is part of a Data Collection or Production Centre (DCPC), the sponsoring WMO Programme or Regional Association shall be consulted.

WMO Secretariat will operate a WIS2 register as an authoritative list of WIS2 Nodes and Global Services.

The registration of a WIS2 Node involves the following steps:

  • Request hosting a WIS2 Node: A request for hosting a WIS2 Node shall be put forward by WIS National Focal Point (NFP) of the country of the WIS2 Node host centre, or, in the case of international organizations, by either the Permanent Representative (PR) of the country or territory where the WIS2 Node host centre is located or the president of the relevant organization in case of WMO partner or programme designated as DCPC.

  • Assign a centre-id: The centre identifier (centre-id) is an acronym as proposed by the Member and endorsed by the WMO Secretariat. It is a single identifier comprised of a top level domain (TLD) and centre name, and represents the data publisher, distributor or issuing centre of a given dataset or data product/granule (see the Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy). See below for guidance on assigning a centre identifier ([_guidance_on_assigning_a_centre_identifier_for_a_wis2_n ode]).

  • Complete the WIS2 Register: The WIS National Focal Point shall complete the WIS2 Register operated by the WMO Secretariat.

  • Provide Global Service details: The WMO Secretariat provides connection details for the Global Services (e.g., IP addresses) so that the WIS2 Node can be configured to provide the access.

  • WIS2 Node assessment: The principal GISC verifies that the WIS2 Node is compliant with WIS2 requirements. The assessment includes:

    • verification of the compliance of the topics used by the centre with the WIS2 Topic Hierarchy (WTH) specification.

    • verification of compliance of notification messages with the WIS2 Notification Message (WNM) specification.

    • verification that the data server is correctly configured and properly functioning.

    • verification that the message broker is correctly configured and properly functioning.

  • Add new centre to WIS2: Upon completion of this verification, and confirmation that it satisfies all conditions for operating a WIS2 Node, GISC notifies WMO Secretariat and confirms that this WIS2 Node can be added to WIS2.

  • Communicate details to the Global services: WMO Secretariat provides the WIS2 Node details to the Global Brokers to subscribe to the WIS2 Node.

A diagram of the process of registering a WIS2 Node is presented below.

Adding a WIS2 Node

Once a WIS2 Node has been registered and connected with the Global Services, it can procede to register the datasets it will publish via WIS2. To register a dataset, the authorized WIS2 Node publishes discovery metadata about the new dataset. Validation of the discovery metadata is completed by the Global Discovery Catalogues and Global Brokers automatically subscribes to the topics provided in the discovery metadata record. For more information, see How to provide discovery metadata to WIS2.

Once the dataset has successfully been registered, the WIS2 Node can procede to exchange data - see [_how_to_provide_data_in_wis2].

When decommissioning a WIS2 Node operators must ensure that obligations relating to data sharing within WIS continue to be met after the WIS2 Node is decommissioned, for example, by migrating these data sharing obligations to another WIS2 Node. In the case of a DCPC, this may mean the responsibilities are transferred to another Member.

2.6.1.2. Guidance on assigning a Centre Identifier for a WIS2 Node

The Centre Identifier (centre-id) is used in WIS2 to uniquely identify a participating WIS2 Node. The Centre Identifier must conform to the specification given in the Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy, section 7.1.6 Centre identification.

The Centre Identifier comprises two dash-separated tokens.

Token 1 is a Top Level Domain (TLD) defined by IANA[39].

This is usually a simple choice for a Member. However, overseas territories require some thought. The recommended approach depends on the governance of the overseas territory. Take some French examples. Réunion is a French Department – it’s considered part of France, it uses the Euro. Here, we would use the “fr” TLD. New Caledonia is a French overseas territory with top-level-domain of “nc”. It has separate, devolved governance. The recommendation is to use “nc”. All that said, it’s a national decision which TLD to use.

Token 2 is a descriptive name for the centre and this may contain dashes (but not other special characters).

The descriptive name should be something recognisable - not only by our community, but by other users too. Basing things on the Web domain name is likely to ensure that centre identifiers remain unique within a particular country/territory. A UK example this time: the UK’s National Meteorological Service is the Met Office (http://www.metoffice.gov.uk), so “metoffice” is better than “ukmo”[40]. Using the 4-letter GTS centre identifiers (CCCC) is not recommended because people unfamiliar with the GTS do not understand them.

The Centre Identifier specification says that larger organisations operating multiple centres may wish to register separate centre-ids for each centre. This is good practice. Keeping with the UK example, Met Office operates a National Meteorological Centre (NMC), 9 DCPCs (e.g., a Volcanic Ash Advisory Centre) and a WIS2 Global Service, so it’s important to split them out. For example:

  • uk-metoffice-nmc

  • uk-metoffice-vaac

  • uk-metoffice-global-cache

Using a system name in the centre-id is not a good idea because these may change over time. Functional designations are long-term durable. Appending `-test may be used to designate test WIS Nodes.

Appending “-test” may be used to designate test WIS Nodes.

2.6.1.3. Authentication, authorization, and access control for a WIS2 Node

When configuring your WIS2 Node you need to consider how it will accessed by Global Services and Data Consumers.

Global Brokers must authenticate when they connect to the MQTT message broker in your WIS2 Node. Username and password credentials are used[41]. When registering your WIS2 Node with the WMO Secretariat, you will need to provide these credentials. The WMO Secretariat will share these credentials with the Global Service operators and store them in the WIS register. You should not consider these credentials as confidential or secret.

Given that Global Brokers will re-publish notification messages provided by your WIS2 Node, you may decide to restrict access to the MQTT message broker. Global Brokers operate using a fixed IP address which allows you to permit them access using IP filtering[42]. You must ensure that your MQTT message broker is accessible for more than one Global Broker to provide resilient transmission of notification messages to WIS2.

If your WIS2 Node is only publishing Core data[43], you may also restrict access to your data server - instead, relying on the Global Caches to distribute your data. Similarly, Global Caches also operate on fixed IP addresses allowing connections from them to be easily identified. Again, you must ensure that access is given to more than one Global Broker to ensure resilience.

During registration, the WMO Secretariat will provide host names and IP addresses of the Global Services to enable configuration of access control.

Access controls may be implemented for Recommended data. You should use only the "security schemes" for authentication and authorization specified in OpenAPI[44].

2.6.2. Performance management

2.6.2.1. Service levels and performance indicators

A WIS2 Node must be able to:

  • Publish datasets and compliant metadata and discovery metadata

    • Publish metadata to the Global Data Catalogue

    • Publish core data to the Global Cache

    • Publish data for consumer access

    • Publish data embedded in a message (i.e., CAP warnings)

    • Receive metadata publication errors from the Global Data Catalogue

    • Provide metadata with topics to Global Brokers

2.6.2.2. System performance metrics

If contacted by the Global Montior via GISC for a performance issue, the WIS2 Node should provide metrics to the GISC and Global Monitor when service is restored to indicate resolution of the issue.

2.6.3. WIS2 Node reference implementation: WIS2 in a box

To provide a WIS2 Node, members may use whichever software components they consider most appropriate to comply with WIS2 Technical Regulations.

To assist Members participate in WIS2, a free and open-source Reference Implementation is available for use. WIS2 in a box (wis2box) implements the requirements of a WIS2 Node in as well as additional enhancements. wis2box builds on mature and robust free and open-source software components that are widely adopted for operational use.

wis2box provides functionality required for both data publisher and data consumer roles. It provides the following technical functions:

  • Configuration, generation and publication of data (real-time or archive) and metadata to WIS2, compliant to WIS2 Node requirements

  • MQTT Message Broker and notification message publication (Subscribe)

  • HTTP object storage and raw data access (Download)

  • Station metadata curation / editing tools (user interface)

  • Discovery metadata curation / editing tools (user interface)

  • Data entry tools (user interfaces)

  • OGC API server, providing dynamic APIs for discovery, access, visualization and processing functionality (APIs)

  • Extensible data "pipelines", allowing for transformation, processing and publishing of additional data types

  • Provision of system performance and data availability metrics

  • Access control for recommended data publication, as required

  • Subscription to notifications and and download of WIS data from Global Services

  • Modular design, allowing for extending to meet additional requirements or integrate with existing data management systems

Project documentation can be found at https://docs.wis2box.wis.wmo.int

wis2box is managed as a free and open source project. Source code, issue tracking and discussions are hosted in the open on GitHub: https://docs.wis2box.wis.wmo.int.

2.7. Implementation and operation of a Global Service

2.7.1. Procedure for registration of a new Global Service

Successful operations of WIS will depend on having a set of Global Services running well-managed IT environments with a very high level of reliability so that all WIS Users and WIS2 Nodes will be able to access and provide data they need for their duties.

Depending on the nature of the Global Service, the following is considered to be the minimum capability of Global Service operation, so that collectively, the level of service is 100% (or very close):

  • Three (3) Global Brokers: Each Global Broker connected to at least two (2) other Global Brokers

  • Three (3) Global Caches: Each Global Cache connected to at least two (2) Global Broker and should be able to download the data from all WIS2 Nodes providing Core data

  • Two (2) Global Discovery Catalogues: Each Global Discovery Catalogue connected to at least one (1) Global Broker

  • Two (2) Global Monitors: Each Global Monitor should scrape the metrics from all other Global Services

In addition to the above, WIS architecture can accomodate adding (or removing) Global Services. Candidate WIS Centres should inform their WIS Focal Point and contact the WMO Secretariat to discuss their offer to provide a Global Service.

Running a Global Service is a significant commitment for a WIS Centre. To maintain a very high level of service of WIS, each Global Service will have a key role to play.

On receipt of an offer from a Member to operate a Global Service, WMO Secretariat will suggest which Global Service the Member may provide in order to improve WIS2. This suggestion will be based on the current situation of WIS2 (e.g., the number of existing Global Brokers, whether an additional Global Cache is needed, etc.).

The Manual on WIS (WMO-No. 1060), Volume II, the Guide and other material available will help WIS Centres in deciding the best way forward.

When decided, the WIS Focal Point will inform WMO Secretariat of its preference. Depending on the type of Global Service, WMO Secretariat will provide a checklist to the WIS Centre so that the future Global Service can be included in WIS Operations.

A WIS Centre must commit to running the Global Service for a minimum of four (4) years.

WMO Secretariat and other Global Services will make the required changes to include the new Global Service in WIS Operations.

2.7.2. Performance management and monitoring of a Global Service

2.7.2.1. Monitoring and metrics for WIS2 operations

The availability of data and performance of system components within WIS2 are actively monitored by GISCs and the Global Monitor service to ensure proactive response to incidents and effective capacity planning for future operations.

WIS2 requires that metrics are provided using OpenMetrics – the de-facto standard [45] for transmitting cloud-native metrics at scale. Widely adopted, many commercial and open-source software components already come preconfigured to provide performance metrics using the OpenMetrics standard. Tools such as Prometheus and Grafana provide aggregation and visualisation of metrics provided in this form, making it simple to generate performance insights. The OpenMetrics standard can be found at openmetrics.io [46].

The WIS2 Global Services, namely the Global Broker, Global Cache, and Global Discovery Catalogue expose monitoring metrics on their respective service to the Global Monitor.

There is no requirement on WIS2 Nodes to provide monitoring metrics. However their WIS2 interfaces may be queried remotely by Global Services, which in turn can provide metrics on the availability of WIS2 Nodes.

Metrics for the WIS2 monitoring should follow the naming convention:

wmo_<program>_<name>

where program is the name of the responsible WMO Program and name is the name of the metric. Examples for WIS2 metrics can look like

wmo_wis2_gc_downloaded_total
wmo_wis2_gb_messages_invalid_total

The full set of the WIS2 monitoring metrics is given in WMO: WIS2 Metric Hierarchy [47]

2.7.2.2. Service levels, performance indicators, and fair-usage policies
  • Each WIS Centre operating a WIS2 Node will be responsible in achieving the highest possible level of service based on their resources and capabilities.

  • All Global Services, and in particular Global Brokers and Global Caches, are collectively responsible in making the WIS a reliable and efficient mean to exchange data required for the operations of all WIS Centres. The agreed architecture provides a redundant solution where the failure of one component will not impact the overall level of service of WIS.

  • Each Global Service should aim at achieving at least 99.5% availibility of the service they propose. This is not a contractual target. It should be considered by the entity providing the Global Service as a guideline when designing and operating the Global Service.

  • A Global Broker:

    • should support a minimum of 200 WIS2 Nodes or Global Services

    • should support a minimum of 1000 subscribers.

    • should support processing of a minimum of 10000 messages per second

  • A Global Cache:

    • should support a mimimum of 100 GB of data in the cache

    • should support a minimum of 1000 simultaneous downloads

    • could limit the number of simultaneous connections from a user (known by its originating source IP) to 5

    • could limit the bandwidth usage of the service to 1Gb/s

  • A Global Monitor:

    • should support a minimum of 50 metrics providers

    • should support 200 simultaneous access to the dashboard

    • could limit the bandwidth usage of the service to 100Mb/s

  • A Global Discovery Catalogue:

    • should support a minimum of 20000 metadata records

    • should support a minimum of 50 requests per second to the API endpoint

2.7.2.3. Metrics for Global Services

In the following sections and for each Global Service, a set of metrics is defined. Each Global Service will provide those metrics. They will then be ingested by the Global Monitor.

2.7.3. Global Broker

2.7.3.1. Technical considerations
  • As detailed above, there will be at least three (3) instances of Global Broker to ensure highly available, low latency global provision of messages within WIS.

  • A Global Broker instance subscribes to messages from WIS2 Nodes and other Global Services. The Global Broker should aim at subscribing to all WIS Centres. If this is not possible, for whatever reason, the Global Broker should inform WMO Secretariat so that situation is documented.

  • Every WIS2 Node or Global Service must have subscriptions from at least two Global Brokers.

  • For full global coverage, a Global Broker instance will subscribe to messages from at least two (2) other Global Brokers.

  • When subscribing to messages from WIS2 Node and other Global Services, a Global Broker must authenticate using the valid credentials managed by the WIS Centre and available at WMO Secretariat.

  • A Global Broker is built around two software components:

    • An off the shelf broker implementing both MQTT 3.1.1 and MQTT 5.0 in a highly-available setup, typically in a cluster mode. Tools such as EMQX, HiveMQ, VerneMQ, RabbitMQ (in its latest versions) are compliant with these requirements. It must be noted that the open source version of Mosquitto cannot be clustered and therefore should not be used as part of a Global Broker.

    • Additional features including anti-loop detection, notification message format compliance, validation of the published topic, and provision of metrics are required.

  • When receiving a message from a WIS Centre or Global Service broker, The metric wmo_wis2_gb_messages_received_total will be increased by 1.

  • A Global Broker will check if the topic on which the message is received is valid (in particular, a discovery metadata record must exist with a corresponding topic in order that data can be made available using this topic). If the topic is invalid, the Global Broker will discard non-compliant messages and will raise an alert. The metric wmo_wis2_gb_messages_no_metadata_total will be increased by 1. Global Broker should not request Global Discovery Catalogue for each notification message but should keep a cache of all valid topics for every centre-id.

  • During the pre-operational phase (2024), Global Broker will not discard the message but will send a message on the monitor topic hierarchy to inform the originating centre and its GISC.

  • A Global Broker will validate notification messages against the standard format (see Notification message format and structure), discarding non-compliant messages and raising an alert. The metric wmo_wis2_gb_messages_invalid_total will be increased by 1.

  • A Global Broker instance will republish a message only once. Using the message id as defined in WIS Notification Message, the Global Broker will record id of messages already published and will discard subsequent identitical (with the same message id) messages. This is the anti-loop feature of the Global Broker.

  • When publishing a message to the local broker, the metric wmo_wis2_gb_messages_published_total will be increased by 1.

  • All aboved defined metrics will be made avalaible on HTTPS endpoints that the Global Monitor will ingest from regularly.

  • As a convention Global Broker centre-id will be tld-{centre-name}-global-broker.

  • A Global Broker should operate with a fixed IP address so that WIS2 Nodes can permit access to download resources based on IP address filtering. A Global Broker should also operate with a public resolvable DNS name pointing to that IP address. The WMO Secretariat must be informed of the IP address and/or host name, and any subsequent changes.

2.7.4. Global Cache

In WIS2 Global Caches provide access to WMO Core Data for data consumers. This allows for data providers to restrict access to their systems to Global Services and it reduces the need for them to provide high bandwith and low latency access to their data. Global Caches work transparent for end users in that they resend notification messages from data providers which are updated to point to the Global Cache data store for data, they copied from the original source. Additionally, Global Caches also resend notification messages from data providers for Core Data, that is not stored on the Global Cache, for instance if the originator indicates that a certain data set should not be cached in the notification message. In the latter case, the notification messages that a Global Cache resends are unchanged and point to the original source. Data consumers should subscribe to the notification messages from Global Caches instead of the notification messages from the data providers for WMO Core Data. When data consumers receive a notification message they should follow the URLs from that messages which either point to a Global Cache holding a copy of the data, or - in case of uncached content - point to the original source.

2.7.4.1. Technical considerations
  • A Global Cache is built around three software components:

    • A highly available data server allowing data consumers to download cache resources with high bandwidth and low latency.

    • A message broker implementing both MQTTv3.1.1 and MQTTv5 for publishing notification messages about resources that are available from the Global Cache

    • A Cache management implementing the features needed to connect with the WIS ecosystem, receive data from WIS2 nodes and other Global Caches, store the data to the data server and manage the content of the cache (i.e. expiration of data, deduplication, etc)

  • The Global Cache will aim at containing copies of real-time and near real-time data designated as "core" within the WMO Unified Data Policy, Resolution 1 (Cg-Ext(2021)).

  • A Global Cache instance will host data objects copied from NC/DCPCs.

  • A Global Cache instance will publish notification messages advertising availability of the data objects it holds. The notification messages will follow the standard structure (see Manual on WIS (WMO-No. 1060), Volume II, Appendix E: WIS2 Notification Message).

  • A Global Cache instance will use the standard topic structure in their local message brokers (see Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy ).

  • A Global Cache instance will publish on topic cache/a/wis2/…​.

  • There will be multiple Global Cache instances to ensure highly available, low latency global provision of real-time and near real-time "core" data within WIS2.

  • There will be multiple Global Cache instances may attempt to download cacheable data objects from all originating centres with "cacheable" content. A Global Cache instance will also download data objects from other Global Cache instances. This ensures the instance has full global coverage, mitigating where direct download from an originating centre is not possible.

  • A Global Cache instance will operate independently of other Global Cache instances. Each Global Cache instance will hold a full copy of the cache – albeit that there may be small differences between Global Cache instances as "data availability" notification messages propagate through WIS to each Global Cache in turn. There is no formal ‘synchronisation’ between Global Cache instances.

  • A Global Cache will temporarily cache all resources published on the metadata topic. A Global Discovery Catalogue will subscribe to notifications about publication of new or updated metadata, download the metadata record from the Global Cache and insert it into the catalogue. A Global Discovery Catalogue will also publish a metadata record archive each day containing the complete content of the catalogue and advertise its availability with a notification message. This resource will also be cached by a Global Cache.

  • A Global Cache is designed to support real-time distribution of content. Data Consumers access data objects from a Global Cache instance by resolving the URL in a "data availability" notification message and downloading the file to which the URL points. Apart from the URL it is transparent to the Data Consumers from which Global Cache they download the data. There is no need to download the same Data Object from multiple Global Caches. The data id contained within the notification messages is used by Data Consumers and Global Services to detect such duplicates.

  • There is no requirement for a Global Cache to provide a "browse-able" interface to the files in its repository allowing Data Consumers to discover what content is available. However, a Global Cache may choose to provide such a capability (e.g., implemented as a "Web Accessible Folder", or WAF) along with adequate documentation for Data Consumers to understand how the capability works.

  • The default behaviour for a Global Cache is to cache all data published under the origin/a/wis2/data/+/core topic. A data publisher may indicate that data should not be cached by adding the "cache": false assertion in the WIS Notification Message.

  • A Global Cache may decide not to cache data. For example, if the data is considered too large, or a WIS2 Node publishes an excessive number of small files. Where a Global Cache decides not to cache data it should behave as though the cache property is set to false and send a message on the monitor topic hierarchy to inform the originating centre and its GISC. The Global Cache operator should work with the originating WIS2 Node and their GISC to remedy the issue.

  • If core data is not cached on a Global Cache (that is, if the data is flagged as "cache": false or if the Global Cache decides not to cache this data), the Global Cache shall nevertheless republish the WIS2 Notification Message to the cache/a/wis2/…​ topic. In this case the message id will be changed and the rest of the message will not be modified.

  • A Global Cache should operate with a fixed IP address so that WIS2 Nodes can permit access to download resources based on IP address filtering. A Global Cache should also operate with a public resolvable DNS name pointing to that IP address. The WMO Secretariat must be informed of the IP address and/or host name, and any subsequent changes.

  • A Global Cache should validate the integrity of the resources it caches and only accept data which matches the integrity value from the WIS Notification Message. If the WIS Notification Message does not contain an integrity value, a Global Cache should accept the data as valid. In this case a Global Cache may add an integrity value to the message it republishes.

  • As a convention Global Cache centre-id will be tld-{centre-name}-global-cache.

2.7.4.2. Practices and procedures
  • A Global Cache shall subscribe to the topics origin/a/wis2/#, cache/a/wis2/#.

  • A Global Cache shall ignore all messages received on the topics origin/a/wis2/+/data/recommended/# and cache/a/wis2/+/data/recommended/# [48]

  • A Global Cache shall retain the data and metadata they receive for a minimum period of 24 hours. Requirements relating varying retention times for different types of data may be added later.

  • For messages received on topic origin/a/+/data/core/# or cache/a/+/data/core/#, a Global Cache shall:

    • If the message contains the property "properties.cache": false

      • Republish the message at topic cache/a/wis2/…​ matching +/a/wis2/…​ where the original message has been received after having updated the id of the message.

    • else

      • Maintain a list of data_ids already downloaded.

      • Verify if the message points to new or updated data by comparing the pubtime value of the notification message with the list of data_ids.

      • If the message is new or updated

        • Download only new or updated data from the href or extract the data from the message content.

        • If the message contains an integrity value for the data, verify the integrity of the data.

        • If data is downloaded successfully, move the data to the HTTP endpoint of the Global Cache.

        • Wait until the data becomes available at the endpoint.

        • Modify the message identifier and the canonical link’s href of the received message. Leave all other fields untouched.

        • Republish the modified message to topic cache/a/wis2/…​ matching the +/a/wis2/…​ where the original message has been received.

        • The metric wmo_wis2_gc_downloaded_total will be increased by 1. The metric wmo_wis2_gc_dataserver_last_download_timestamp_seconds will be updated with the timestamp (in seconds) of the last successful download from the WIS2 Node or Global Cache.

      • else

        • Drop the messages for data already present on the Cache.

  • If the Global Cache is not able to download the data the metric wmo_wis2_gc_downloaded_error_total will be increased by 1.

  • A Global Cache shall provide the metric defined in this Guide at an http(s) endpoint

  • A Global Cache should make sure that data is downloaded in parallel and downloads are not blocking each other

  • The metric wmo_wis2_gc_dataserver_status_flag will reflect the status of the connection to the download endpoint of the Centre. It values will be 1 when the endpoint is up and 0 otherwise.

2.7.5. Global Discovery Catalogue

2.7.5.1. Technical considerations
  • The Global Discovery Catalogue provides Data Consumers with a mechanism to discover and search for Datasets of interest, as well as how to interact with and find out more information about those Datasets.

  • The Global Discovery Catalogue implements the OGC API – Records – Part 1: Core standard[49], adhering to the following conformance classes and their dependencies:

    • Searchable Catalog (Deployment)

    • Searchable Catalog - Sorting (Deployment)

    • Searchable Catalog - Filtering (Deployment)

    • JSON (Building Block)

    • HTML (Building Block)

  • The Global Discovery Catalogue will make discovery metadata available via the collection identifier of wis2-discovery-metadata.

  • The Global Discovery Catalogue advertises the availability of Datasets and how to access them or subscribe to updates.

  • The Global Discovery Catalogue does not advertise or list the availability of individual Data Objects that comprise a Dataset (i.e. data files).

  • A single Global Discovery Catalogue instance is sufficient for WIS2.

  • Multiple Global Discovery Catalogue instances may be deployed for resilience.

  • Global Discovery Catalogue instances operate independently of each other; each Global Discovery Catalogue instance will hold all discovery metadata records. Global Discovery Catalogues do not need to synchronise between themselves.

  • A Global Discovery Catalogue is populated with discovery metadata records from a Global Cache instance, receiving messages about the availability of discovery metadata records via a Global Broker.

  • A Global Discovery Catalogue should connect and subscribe to more than one Global Broker instance to ensure that no messages are lost in the event of a Global Broker failure. A Global Discovery Catalogue instance will discard duplicate messages as needed.

  • A Global Discovery Catalogue will validate that a discovery metadata record identifier’s centre-id token (see Manual on WIS (WMO-No. 1060), Volume II, Appendix F: WMO Core Metadata Profile 2) matches against the centre-id level of the topic from which it was published (see Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy), to ensure that discovery metadata is published by the authoritative orgnanization.

  • A Global Discovery Catalogue will validate discovery metadata records against the WMO Core Metadata Profile 2 (WCMP2). Valid WCMP2 records will be ingested into the catalogue. Invalid or malformed records will be discarded and reported to the Global Monitor against the centre identifier associated with the discovery metadata record.

  • A Global Discovery Catalogue will only update discovery metadata records to replace links for dataset subscription and notification (origin) with their equivalent links for subscription at Global Broker instances (cache).

  • A Global Discovery Catalogue will periodically assess discovery metadata provided by NCs and DCPCs against a set of key performance indicators (KPIs) in support of continuous improvement. Suggestions for improvement will be reported to the Global Monitor against the centre identifier associated with the discovery metadata record.

  • A Global Discovery Catalogue will remove discovery metadata that is marked for deletion as specified in the data notification message.

  • A Global Discovery Catalogue should apply faceting capability as specified in the cataloguing considerations of the WCMP2 specification, as defined in OGC API - Records.

  • A Global Discovery Catalogue will provide human-readable Web pages with embedded markup using the schema.org vocabulary, thereby enabling search engines to crawl and index the content of the Global Discovery Catalogue. Consequently, Data Consumers should also be able to discover WIS content via third party search engines.

  • A Global Discovery Catalogue will generate and store a zipfile of all WCMP2 records once a day, that will be made be accessible via HTTP.

  • A Global Discovery Catalogue will publish a WIS2 Notification Message of its zipfile of all WCMP2 records on its centre-id’s metadata topic (i.e. origin/a/wis2/centre-id/metadata, where centre-id is the centre identifier of the Global Discovery Catalogue).

  • A Global Discovery Catalogue may initialize itself (cold start) from a zipfile of all WCMP2 records published.

  • As a convention Global Discovery Catalogue centre-id will be tld-{centre-name}-global-discovery-catalogue.

2.7.5.2. Global Discovery Catalogue reference implementation: wis2-gdc

To provide a Global Discovery Catalogue, members may use whichever software components they consider most appropriate to comply with WIS2 Technical Regulations.

To assist Members participation in WIS2, a free and open-source Global Discovery Catalogue Reference Implementation is made available for download and use. wis2-gdc builds on mature and robust free and open-source software components that are widely adopted for operational use.

wis2-gdc provides functionality required Global Discovery Catalogue, providing the following technical functions:

  • discovery metadata subscription and publication from the Global Broker

  • discovery metadata download the Global Cache

  • discovery metadata validation, ingest and publication

  • WCMP2 compliance

  • quality assessment (KPIs)

  • OGC API - Records - Part 1: Core compliance

  • metrics reporting

  • implementation of metrics

wis2-gdc is managed as a free and open source project. Source code, issue tracking and discussions are hosted in the open on GitHub: https://github.com/wmo-im/wis2-gdc.

2.7.6. Global Monitor

2.7.6.1. Technical Considerations
  • WIS standardises how system performance and data availability metrics are published from WIS2 Nodes and Global Services.

  • For each type of Global Service, a set of standard metrics have been defined. Global Services will implement those metrics and provide an endpoint for those metrics to be scraped by the Global Monitor

  • The Global Monitor will collect metrics as defined in the OpenMetrics standard.

  • The Global Monitor will monitor the 'health' (i.e., performance) of components at NC/DCPC as well as Global Service instances.

  • The Global Monitor will provide a Web-based ‘dashboard’ that displays the WIS2 system performance and data availability.

  • As a convention Global Monitor centre-id will be tld-{centre-name}-global-monitor.

    The main task of the Global Monitor is to regularly query the provided metrics from the relevant WIS2 entities, aggregate and process the data and then provide the results to the end user in a suitable presentation.

2.8. Operations

2.8.1. Interoperability with external systems

The WIS2 priciples enable lowering the barrier to weather/climate/water data for WMO members. Lowering the barrier is driven by international standards for data discovery, access, and visualization. In addition to Member benefits, a by-product of utilizing standards is being able to provide the same data and access mechanisms to external systems at no extra cost of implementation.

WIS2 standards are based on industry standards (OGC, W3C, IETF) and allow for broad interoperabliilty. This means that non-traditional users can also use data from WIS2 data in the same manner, without the requirement for specialized software, tools, or applications.

2.8.1.1. Publishing meteorological data through WIS2 into ICAO SWIM

Meteorological data is an essential input for public weather services and aviation services alike. WIS2 provides the mechanism for data exchange in WMO, while SWIM (System Wide Information Management) is the ICAO initiative to harmonize the provision of aeronautical, meteorological and flight information to support air traffic management (ATM).

Both WIS2 and SWIM support the similar outcomes relating to data exchange. However, there are differences in both approach and implementation.

Specifications for WIS2 are defined in the Manual on WIS (WMO-No. 1060), Volume II, and further elaborated in this Guide. Specifications for SWIM will be defined in the Procedures for Air Navigation Services – Information Management (PANS-IM) (ICAO Doc. 10199)[50].

During the WIS2 transition phase (2025-2030), meteorological data published via WIS2 will automatically be published to the GTS via the WIS2 WIS2-to-GTS Gateways.

WIS2

SWIM

Earth-system scope: weather, climate, hydrology, atmospheric composition, cryosphere, ocean and space weather data

Air traffic management scope: aeronautical, meteorological and flight information

Data centric - a consumer discovers data and then determines the services through which that data may be accessed

Service centric - a consumer discovers a service (or service provider) and determines what resources (i.e., information) is available therein

Technical protocols: MQTT, HTTP

Technical protocols: AMQP[51]

An organisation (e.g., the National Meteorological Service) that is responsible for providing meteorological data to WIS2 may be designated by the ICAO Contracting State as a responsible entity to provide aeronautical meteorological information into SWIM. Where requirements dictate, the organisation may provide regional capability on behalf of a group of countries or territories.

This section of the Guide outlines how such an organisation may efficiently fulfil the requirements in providing required data/information to the two systems. It proposes an interoperability approach between WIS2 and SWIM where meteorological data published via WIS2 can be automatically propagated to SWIM.

This Guide covers only how data from WIS2 can be published into SWIM. Consumption of information from SWIM services is not in scope.

This Guide also does not cover implementation details of the SWIM service - including, but not limited to:

  • Mechanisms used by SWIM to discover service providers and services.

  • Specification of the SWIM data message.

  • AMQP message broker configuration.

  • Operation, logging and monitoring.

  • Cybersecurity considerations for provision of SWIM services.

This Guide will be updated as more information is made available from ICAO and/or recommended practices are updated.

Finally, it should be noted that the provision of aeronautical meteorological information and its exchange via the ICAO Aeronautical Fixed Service (AFS) is also out of scope as they are solely defined under the ICAO regulatory framework.

WIS2 to SWIM Gateway

The WIS2 to SWIM interoperability approach employs a Gateway component (as per the figure below):

Schematic of interoperability approach
Figure 1. Schematic of an interoperability approach

The Gateway can operate as an "adapter" between WIS2 and SWIM by pulling the requisite meteorological data from WIS2 and re-publishing it to SWIM.

Data types and format

Specifications for aeronautical meteorological information are provided in ICAO Annex 3 and other relevant guidance materials. IWXXM format (FM 205)[52] is to be used for encoding aeronautical meteorological information in SWIM.

Publishing meteorological data via WIS2

For meteorological data to be published from WIS2 to SWIM, the organisation responsible for this provision will need to operate a WIS2 Node and comply with the pertinent Technical Regulations as specified in the Manual on WIS (WMO-No. 1060), Volume II. Onward distribution of the data by the Message Broker over SWIM can be handled by the respective Information Service Provider in accordance with ICAO Standards and Recommended Practices (SARPs).

The responsible organisation should consider whether this data should be published via an existing WIS2 Node, or whether a separate WIS2 Node should be established. For example, the data may be provided by a separate operational unit, or there may be a requirement to easily distinguish between data for SWIM and any other meteorological data.

Where a new WIS2 Node is needed, the responsible organisation must establish a new WIS2 Node and register it with WMO Secretariat. For more information, see Implementation and operation of a WIS2 Node.

Datasets are a central concept in WIS2. Where meteorological data is published via WIS2, it shall be packaged into “datasets”. The data should be grouped at the country / territory level; i.e., datasets should be published for a given country / territory, one for each datatype (e.g., aerodrome observation, aerodrome forecast and quantitative volcanic ash concentration information).

For the purposes of publishing through WIS2, Datasets containing aeronautical meteorological information should be considered as "Recommended Data", as described in the WMO Unified Data Policy, Resolution 1 (Cg-Ext(2021)). The Recommended Data category of the policy is intended to cover data that should be exchanged by Members to support Earth system monitoring and prediction efforts.

Recommended Data:

  • May be subject to conditions on use and re-use.

  • May have access controls[53][54] applied at the WIS2 Node.

  • Are not cached within WIS2 by the Global Caches[55].

The WMO Unified Data Policy requires transparency on the conditions of use for Recommended Data. Conditions regarding the use of aeronautical meteorological information are specified in ICAO Annex 3 and, optionally, by the ICAO Contracting State. Such conditions of use should be explicitly stated in the discovery metadata for each dataset as describe below.

  • The attribute wmo:dataPolicy should be set to recommended.

  • Information about conditions of use should be specified using a rights property (see example below) and/or a link object with relation license.

  • Information about access control should be specified using a security object in the link object describing the data access details.

Example expression of conditions relating to the use of aeronautical meteorological information:
"properties": {
  ...
  "rights": "This information is freely disseminated for the purposes of safety of international air navigation. ICAO Annex 3"
  ...
}

For more information on the WMO Core Metadata Profile version 2, see the Manual on WIS (WMO-No. 1060), Volume II, Appendix F.

On receipt of new data, the WIS2 Node will:

  1. Publish the data as a resource via a Web server (or Web service).

  2. Publish a WIS2 Notification Message to a local message broker that advertises the availability of the data resource.

Note that, in contrast to the GTS, WIS2 publishes data resources individually, each with an associated notification message. WIS2 does not group data resources into bulletins.

The data resource is identified using a URL. The notification message refers to the data resource using this URL[56].

For more details on the WIS2 Notification Message, see the Manual on WIS (WMO-No. 1060), Volume II, Appendix E: WIS2 Notification Message.

The notification message must be published to the proper topic on the message broker. WIS2 defines a standard topic hierarchy to ensure that data is published consistently by all WIS2 Nodes. Notification messages for aviation data should be published on a specific topic allowing a data consumer, such as the Gateway, to subscribe only to aviation-specific notifications. See the example below:

Example Topic used to publish notifications about Quantitative Volcanic Ash Concentration Information
origin/a/wis2/{centre-id}/data/recommended/weather/aviation/qvaci

For more details of the WIS Topic Hierarchy, see the Manual on WIS (WMO-No. 1060), Volume II, Appendix D: WIS2 Topic Hierarchy.

WIS Global Brokers subscribe to the local message brokers of WIS2 Nodes and republish notification messages for global distribution.

As a minimum, the WIS2 Node should retain the aviation data for a duration that meets the needs of the Gateway. The retention period of at least 24 hours is recommended.

Gateway implementation

The potential interactions between the Gateway component, WIS2 and SWIM are illustrated in the figure below[57]

Interactions between the Gateway and components of WIS2 and SWIM
Figure 2. Interactions between the Gateway and components of WIS2 and SWIM

Configuration

Dataset discovery metadata will provide useful information that can be used to configure the Gateway, e.g., the topic(s) to subscribe to plus various other information that may be needed for the SWIM service.

Discovery metadata can be downloaded from the Global Discovery Catalogue.

Functions

The Gateway component implements the following functions:

  • Subscribe to the pertinent topic(s) for notifications about new aeronautical meteorological information[58].

  • On receipt of notification messages about newly available data:

    • parse the notification message, discarding duplicate messages already processed previously;

    • download the data resource from the WIS2 Node[59] using the URL in the message - the resource should be in IWXXM format;

    • create a new "data message" as per the SWIM specifications, including the unique identifier extracted from the data resource[60], and embedding the aviation weather data resource within the data message;

    • publish the data message to the appropriate topic on the SWIM Message Broker component of the SWIM service.

The choice of protocol for publishing to the SWIM Message Broker should be based on bilateral agreement between operators of the Gateway and SWIM service.

The Gateway should implement logging and error handling as necessary to enable reliable operations. WIS2 uses the OpenMetrics standard[61] for publishing metrics and other operating information. Use of OpenMetrics by the Gateway would enable monitoring and performance reporting to be easily integrated into the WIS2 system.

Operation

The Gateway may be operated at national or regional level depending on the organisational governance in place.

SWIM service

The SWIM aviation weather information service may comprise of a Message Broker component which implements the AMQP 1.0 messaging standard[62].

The Message Broker publishes the data messages provided by the Gateway.

The Message Broker must ensure that data messages are provided only by authorized sources such as a Gateway and should validate incoming messages as aeronautical meteorological information.

2.8.1.2. The Ocean Data and Information System (ODIS)

The Ocean Data and Information System (ODIS) is a federation of independent data systems coordinated by the International Oceanographic Data and Information Exchange (IODE) of IOC-UNESCO. This federation includes continental-scale data systems as well as those of small organisations. ODIS partners use Web architectural approaches to share metadata describing their holdings, services, and other capacities. In brief, IODE publishes guidelines on how to share metadata as linked open data, serialised in JSON-LD using schema.org[63] semantics. ODIS nodes use these guidelines to publish their metadata catalogues on the Web. This allows all systems with Web connectivity to harvest and merge these catalogues, creating a global map of the ocean datascape. IODE harvests all metadata shared by ODIS partners, combines it as a knowledge graph, and processes this to export derivative products (e.g. diagnostic reports and cloud-optimised data products). The Ocean InfoHub (OIH) system is IODE’s reference implementation of a discovery system leveraging ODIS. The ODIS architecture and tools are Free and Open Source (FOSS), with regular releases published for the community.

To reach beyond the oceans domain, ODIS works with other data systems and federations, such as WIS2, to define sustainable data and metadata exchanges and - where needed - translators/converters. The resources needed to convert between such systems are developed in the open and in close collaboration with staff from those systems. These exchanges include ETL functions, to ensure that the bilateral exchange is mutually beneficial.

Cross system interoperability

Given the strong support for standards and inteoperability by both WIS2 and ODIS, data and metadata exchange is realized using Web architectural principles and approaches. The ability to discover ODIS data on WIS2 (as well as the inverse) is a goal in extending the reach of both systems and data beyond their primary communities of interest.

The WIS2 Global Discovery Catalogue will provide discovery metadata records using the OGC API - Records standard. This will include schema.org and JSON-LD annotations on WCMP2 discovery metadata in the GDC, to enable cross-pollination and federation.

ODIS dataset records will be made available using the WCMP2 standard and provided as objects available via HTTP for ingest, validation and publication to the GDC as a federated catalogue. ODIS data will be published as recommended data as per the WMO Data Policy.

WIS2 and ODIS metadata and catalogue interoperability
Figure 3. WIS2 and ODIS metadata and catalogue interoperability

As a result, federated discovery will be realized between both systems, allowing for use and reuse of data in an authoritative manner, closest to the source of the data.

3. PART III

3.1. Information management

3.1.1. Introduction

3.1.1.1. Background

The efficient and effective provision of services relying on meteorological, climatological, hydrological and oceanographic information depends on a reliable information infrastructure. This infrastructure should be guided by community best practices and standards, including recommendations and requirements on sourcing, securing, managing, archiving, exchanging, and providing easy access to information. These terms and activities can be grouped under the term "information management" and this part of the Guide aims to provide high-level guidance on those activities. This is done by identifying and describing the fundamental principles of good information management and by highlighting the different stages of the information management lifecycle.

Note: The term "information" is used in a general sense and includes data and products.

3.1.1.2. Scope

High-level guidance on information management practices that apply in the context of information related to the Earth system is provided in this part of the Guide. Detailed technical information, such as specification of data formats or quality control and assurance methods, is provided in other parts of the Guide and in other WMO publications. These are referenced where applicable.

The Principles of information management are described below. Five focus areas are described in The information management lifecycle. These are:

  1. Planning, information creation and acquisition. Creation of information using internal and external data sources and the acquisition of information from various sources.

  2. Representation and metadata. Standards to represent metadata, data and information are of primary importance to enable interoperability and long-term usability of the information.

  3. Publication and exchange of information. The creation and publication of discovery metadata in a standardized format enabling users to discover, access and retrieve the information.

  4. Usage and communication. Publication of guidance material on the use of published information, including on the limitations and suitability of the information and any licensing terms.

  5. Storage, archival and disposal. Policies and procedures for business continuity and disaster recovery, as well as retention and disposal.

3.1.1.3. Intended audience

This guidance is primarily aimed at personnel within WMO centres, with responsibility for planning and undertaking the creation or acquisition, stewardship, exchange and provision of information related to the Earth system.

Specifically, the guidance has five main target audiences across the information lifecycle:

  1. Information producers or creators (those who produce or acquire the information - they need to ensure the scientific quality of the underpinning information).

  2. Information managers (those who manage information).

  3. Information providers or publishers (those who publish the information - they are responsible for the provision of the information, and for ensuring that appropriate access is enabled, licensing agreements are in place, etc.).

  4. Service providers (those who disseminate the information - they are responsible for ensuring information availability and maintaining capability for easy and secure access to the information).

  5. Information consumers (those who utilize the information - they need to understand the restrictions, rights, responsibilities and limitations associated with the information together with the suitability for intended usage or purpose).

3.1.2. Principles of information management

Effective management of information is essential for WMO Centres to deliver operational services and information that is authoritative, seamless, secure and timely. The principles below underpin this management across the full information lifecycle and provide a framework for information management. The principles are independent of information type and are largely independent of technology, they are therefore expected to remain stable over time.

3.1.2.1. Principle 1: Information is a valued asset
  • An information asset is information that has value. This value may be related to the cost of generating and collecting the information, a value associated with the immediate use or a value associated with the longer term preservation and subsequent reuse of the information.

  • This value should be recognizable and quantifiable and the asset should have an identifiable lifecycle. Risks associated with, and to, an information asset should also be identified. As such, information management must be considered an integral part of a WMO centre’s responsibilities and needs to be adequately resourced over the full lifecycle of the information.

3.1.2.2. Principle 2: Information must be managed
  • An information asset must be managed throughout its lifecycle, from creation to use to eventual disposal, in a way that makes it valuable, maximizes its benefits and reflects its value in time and its different uses.

  • Information managers must consider the entire information lifecycle, from identifying needs and business cases to creating, quality assurance, maintenance, reuse, archiving, and disposal. Careful consideration must be given to disposal, ensuring that information is destroyed only when it has ceased to be useful for all categories of users.

  • Professionally qualified and adequately skilled staff with clear roles and responsibilities should apply a sound custodianship framework concerning security, confidentiality and other statutory requirements of different types of information.

3.1.2.3. Principle 3: Information must be fit for purpose
  • Information should be developed and managed in accordance with its function and use for internal and external users.

  • WMO Centres should regularly assess information to ensure that it is fit for its purpose and that processes, procedures, and documentation are adequate.

  • Processes should be consistent with the general provisions and principles of quality management as described in the WMO Technical Regulations (WMO-No. 49).

3.1.2.4. Principle 4: Information must be standardized and interoperable
  • Information must be stored and exchanged in standardized formats to ensure wide usability in the short and long term. It is essential for long-term archiving that information is stored in a form that can be understood and used after several decades.

  • Standardization is essential for structured information such as dataset definitions and metadata to support interoperability.

  • Interoperability is essential for users to utilize information through different systems and software. Open standards help ensure interoperability with their openness and wide adoption across various communities.

  • Which standards to use depends on the user community and organizational policies. Interoperability requirements should be considered when selecting the standard for internal use and broader dissemination.

  • The use of closed and proprietary standards is strongly discouraged.

3.1.2.5. Principle 5: Information must be well documented
  • WMO centres should comprehensively document information processes, policies, and procedures to facilitate broad and long-term use.

  • WMO centres should keep documentation up to date to ensure full traceability of processes along the information lifecycle, particularly for its creation.

  • Previous versions of the documentation should be retained, versioned, archived and made readily available for future use. In addition, versions should be assigned a unique and persistent identifier for future unambiguous identification.

3.1.2.6. Principle 6: Information must be discoverable, accessible and retrievable
  • Information should be easy to find through the Web, and for this purpose, the publisher should share discovery metadata with a catalogue service. The catalogue service should include a Web Application Programming Interface (API) to be used by other applications in order to offer user-tailored search portals.

  • For information to be easily retrievable once discovered, it should be accessible using standard data exchange protocols.

3.1.2.7. Principle 7: Information should be reusable
  • In order to maximize the economic benefits of an information asset it should be made as widely available and as accessible as possible.

  • The WMO Unified Data Policy encourages the reuse of data and information through the open and unrestricted exchange of core WMO data. The WMO encourages the free and unrestricted exchange of information in all circumstances.

  • The publisher should provide an explicit and well-defined license for each information item or dataset as part of the associated metadata.

  • The Findable, Accessible, Interoperable and Reusable (FAIR) data principles promote open data with the ultimate goal of optimizing reuse of data. These principles should be followed where possible.

Note: Information on the FAIR data principles can be found at: FAIR Principles - GO FAIR [64]

3.1.2.8. Principle 8: Information management is subject to accountability and governance.
  • Information management processes must be governed as the information moves through its lifecycle. All information must have a designated owner, steward, curator and custodian. These roles may be invested in the same person but should be clearly defined at the time of creation. A WMO centre with responsibility for managing information must ascertain:

  • information management practices, procedures and protocols, including well-defined roles, responsibilities and restrictions on managing the information;

  • definition and enforcement of appropriate retention policy, taking into account stakeholder needs and variations in value over the information lifecycle;

  • licensing and defining and enforcing any access restrictions.

  • The designated owner should have budget and decision-making authority about preservation and data usage, including passing ownership to another authority.

3.1.3. The information management lifecycle

3.1.3.1. Overview

All information should be subject to a well defined and documented lifecycle. The governance of this process is often referred to as the information management lifecycle and this process helps organizations manage information throughout its full lifecycle, from planning, creation and acquisition through usage and exchange to archival and disposal.

The following sections describe two overarching themes, governance and documentation, that apply to all stages of the information lifecycle and then provides high level guidance split into 5 aspects:

  • Planning, creation and acquisition

  • Representation and metadata

  • Publication and exchange

  • Usage and communication

  • Storage, archival and disposal

Governance covers the rules that apply to managing information in a secure and transparent manner, documentation covers the act of recording the reasons for, and detail of, all operations in the information management process.

3.1.3.2. Overarching requirements
Governance
  • Information management governance defines a set of organizational procedures, policies and processes for the management of information. This includes defining accountabilities and compliance mechanisms.

  • Effective governance helps ensure that all aspects of the information management process are conducted in a rigorous, standardized and transparent manner and that the information are secure, accessible and usable.

  • WMO centres should establish a board or leadership group to develop and regularly review such a governance structure and ensure compliance with its requirements.

Documentation
  • Documentation describing the who, what, why, when, where and how various actions are undertaken in the management of information is required to ensure the traceability and integrity of the information and to ensure operations can continue if key staff leave.

  • This documentation is required for all aspects of the information lifecycle and should be clear, well communicated, regularly updated, and easy to find. Guidance to the documentation should be provided to new staff taking on responsibilities for information management and be a key component of training.

3.1.3.3. Aspects of the information management lifecycle
Planning, information creation and acquisition

Before the creation or acquisition of new information a business case and information management plan should be developed, covering both the input information sources and any derived information. The plans should include:

  • Why the information is required

  • How it will be collected or created

  • How it will be stored

  • Whether it will be exchanged with other users and under what policy

  • Where it should be submitted for long term archival

  • Key roles and responsibilities associated with the management of the information

For externally sourced data the plans should include where the information has come from and what the licensing terms are.

Once information has been acquired it should be checked to ensure that the contents and format are as expected. This may be done using a compliance checker or validation service. Once these checks have been performed the information content should also undergo quality control checks using well documented procedures to identify any issues. A record of the checks should be kept and any issues detected should be documented and feedback to the originators. It is also important to subscribe to updates from originators so any issues identified externally can be taken into account.

Information created rather than acquired should undergo the same processes as the acquired information. The information created should undergo quality control and the resulting files checked against the specified format requirements. The results of the processes and checks should be documented.

To ensure traceability and reproducibility the information and documents at this, and subsequent stages, should be version controlled and clearly labelled with version information. Similarly, software, or computer code, used to generate or process information should be version controlled with the version information recorded in the documentation and metadata. Where possible, software should be maintained within a code repository.

Representation and metadata

The formats used to store and exchange information should be standardized to ensure its usability, both in the short and long term. It is essential that the information can be accessed many years after archival if required. To ensure this usability, the format and version information should be recorded in the metadata record for the information and should be included in the information where the format allows.

Information exchanged on the WMO Information System and between WMO centres is standardized through the use the formats specified in the Manual on Codes (WMO-No. 306), Volume I.2 and the Manual on WIS (WMO-No. 1060), Volume II. This includes the GRIB and BUFR formats for numerical weather prediction products and observational data and the WIS Core Metadata Profile for discovery, access and retrieval metadata. The format for the exchange of station and instrumental metadata, the WIGOS Metadata Data Representation, is also defined in the Manual on Codes (WMO-No. 306), Volume I.3.

These formats have been developed within the WMO community to enable the efficient exchange of information between WMO centres and for the information to be interoperable between centres and systems. The formats, including detailed technical information, have also published openly through the WMO manuals, enabling use of the formats and information by other communities, promoting reuse of the information.

The WMO formats specified in the manuals are subject to strong governance processes, and changes to the formats can be traced through the versions of the manuals. The code tables and controlled vocabularies are also maintained in a code repository. To enable future reuse, the technical information, including detailed format specifications, should be archived alongside information for future access. This includes any controlled vocabulary, such as BUFR tables or WIGOS metadata code lists, associated with the format.

Publication and exchange of information

To maximize the benefits and return on investment in the acquisition and generation of information there needs to be a clear method as to how the information will be published, exchanged and accessed by users.

Information is published on the WMO Information System through the creation of discovery metadata records. These records are publicly searchable and retrievable via WMO cataloguing services, providing access to the records via the Web and via a Web Application Programming Interface (API). The metadata records should include information on how to access the described datasets and services (see also Representation and metadata) and how to subscribe to receive updates and new data.

Guidance on the creation of these discovery metadata records is included in Part V of this Guide. Technical regulations are provided in the Manual on WIS (WMO-No. 1060), Volume II. Before exchange and publication the metadata should be assessed using the WMO Core Metadata Profile Key Performance Indicators to ensure usable and high quality metadata in addition to metadata that conforms with the technical standard.

The Web standards and protocols used should be adequately documented to enable users to find and retrieve the information. This should be possible both manually and automatically via machine-to-machine interfaces and should be standardized between centres.

Updates to the information exchanged on the WIS, including the publication of new information or the cessation of previously exchanged information, is published in the WMO Operational Newsletter.

Note: The newsletter is available from: https://community.wmo.int/news/operational-newsletter

Usage and communication

For information to have value it must inform users, aid knowledge discovery and have impact through informed decision making. Ensuring that the user can make effective use of the information is an important step in the information management lifecycle. This takes two forms:

  1. Provision of suitable information within the discovery metadata, enabling users to discover and access the information and to assess whether it meets their requirements. This should include licensing information.

  2. Provision of user guides and documentation on the suitability of the information for different uses, including any technical caveats or restrictions on the use of the information.

For common types of information the guides may be generic or link to standard documentation. Information on the observations available from the WMO Integrated Global Observing System is provided within the Manual on the WMO Integrated Global Observing System (WMO-No. 1160) and the Guide to the WMO Integrated Global Observing system (WMO-No. 1165) respectively. This includes information on the expected uses and quality of the data, either directly or through links within. Similarly, information on the data and products available through the Global Data Processing and Forecasting System is provided in the Manual on the WMO Integrated Processing and Prediction System (formerly Manual on the Global Data Processing and Forecasting System) (WMO-No. 485).

For non-standard and specialist products targeted user guides may be more appropriate. These should include a plain text summary for the non-technical user and should also be accessible and retrievable via a link within the discovery metadata. Any user guide should be in addition to the technical documentation described under _information_creation_and_acquisition.

Updates and the availability of new information should be announced and published via the WMO Operational Newsletter (see Publication and exchange of information). Other communication methods may also be used but these should not be in place of the operational newsletter. It is also recommended to allow users to subscribe to receive updates directly.

The discovery metadata should include a valid point of contact, enabling users to provide feedback and ask questions about the information provided.

Storage, archival and disposal

The type of storage used should be appropriate to the type of information stored. Core information exchanged operationally should be stored and made available via high-availability and low latency media and services. For some operation critical information, such as hazard warnings, there is a requirement for the end-to-end global distribution of the information to be completed in two minutes. For other operational data there is a requirement for the global exchange to be completed in 15 minutes.

The storage requirements for non-operational services and information may be different but the guidance provided in this section applies equally. Further information on the performance requirements is provided within the WI2S Technical Specifications listed in the Manual on WIS (WMO-No. 1060), Volume II.

Backup policies and data recovery plans should be documented as part of the information management plan. These should be implemented either before or when the information is created or acquired and should include both the information and the associated metadata. The backup and recovery process should be routinely tested. Specific guidance on the expectations and requirements for WMO centres is provided under the operational guidance in Part VII of this Guide.

Business rules governing the access to and modification of the information should be clearly documented in the information management plan. This must include the clear specification of roles and responsibilities of those managing the information. Information on who can authorize the archival and disposal of the information and the processes for doing so should be included. The roles associated with an information resource are standardized as part of the WIS Core Metadata Profile, see Part V of this Guide for further information.

The archival and long-term preservation of an information resource should be identified and included in the information management plan. This may be at a national data centre and/or a WMO centre. The WMO centres are recommended for globally exchanged core data and include those centres contributing to the Global Atmosphere Watch, the Global Climate Observing System and the Marine Climate Data System (see Manual on Marine Meteorological Services (WMO-No. 558), Volume II, as well as the WMO World Data Centres and those defined in the Manual on WIS (WMO-No. 1060), Volume II and those defined in the Manual on the WMO Integrated Processing and Prediction System (formerly Manual on the Global Data Processing and Forecasting System) (WMO-No. 485).

Earth system information, especially observational data, are often irreplaceable. Other information, whilst technically replaceable, is often costly to produce and therefore not easily replaceable. This includes output from numerical models and simulations. Before an information resource is marked for disposal careful consideration must be given to whether long term archival or disposal is more appropriate. This consideration must follow a clearly defined process documented in the information management plan.

When an information resource is marked for disposal the reasons for disposal, including the outcome of the consultation with stakeholders and users, must clearly be documented. The disposal must be authorized by the identified owner and custodian of the information. The information on the disposal must be included in the metadata associated with the information resource. The metadata must be retained for future reference.

3.1.4. Other considerations

3.1.4.1. Technology and technology migration

Information managers must be aware of the need to ensure that the technologies, hardware and software used do not become obsolete and must be aware of emerging data issues. This topic is discussed further in the WMO Guidelines on Emerging Data Issues (WMO-No. 1239).

3.1.4.2. Information security

Further information on information security and best practices can be found in the Guide to Information Technology Security (WMO-No. 1115).

4. PART IV

4.1. Security

For this initial version of the Guide to WIS2, existing guidance on information technology security (aka. cybersecurity) remains largely applicable. Please refer to:

  • Guide to Information Technology Security (WMO-No. 1115)[65]

  • Guide to the WMO Information System (WMO-No. 1061), Vol I[66], Appendix E - Annex To Paragraph 7.8 (ICT Service Incident Management), and Appendix F - WIS IT Security Incident Response Process

5. PART V

5.1. Competencies

For this initial version of the Guide to WIS2, existing guidance on competencies remains largely applicable. Please refer to Guide to the WMO Information System (WMO-No. 1061), Vol I, Appendix A - WMO Information System Training And Learning Guide[67].


1. https://community.wmo.int/governance/commission-membership/commission-observation-infrastructures-and-information-systems-infcom/commission-infrastructure-officers/infcom-management-group/standing-committee-information-management-and-technology-sc-imt
2. https://community.wmo.int/governance/commission-membership/infcom
3. Data Catalog Vocabulary (DCAT) - Version 2, W3C Recommendation 04 February 2020 https://www.w3.org/TR/vocab-dcat-2/#Class:Dataset
4. Why 5-days in this example? Because the system used to publish the data in this example only retains data for 5-days.
5. Subscribing to notifications about newly available data means that you don’t need to continually to poll the data server to check for updates.
6. Internet search engines allow Data Consumers to discover WIS2 datasets by indexing the content in the Global Discovery Catalogues.
7. IANA Link Relations https://www.iana.org/assignments/link-relations/link-relations.xhtml
8. RFC 7946 - The GeoJSON Format: https://datatracker.ietf.org/doc/html/rfc7946
9. In future, WIS2 may provide metadata publication services (e.g., through a WIS2 metadata management portal) to assist with this task. However, such a service is not available at this time.
10. OGC API - Records - Part 1: Core https://docs.ogc.org/DRAFTS/20-004.html
11. Both data and metadata publication use the same notification message mechanism to advertise the availability of a new resource.
12. Architecture of the World Wide Web https://www.w3.org/TR/webarch/
13. RFC 3986 - Uniform Resource Identifier (URI) - Generic Syntax: https://datatracker.ietf.org/doc/html/rfc3986
14. The term "Uniform Resource Locator" (URL) refers to the subset of URIs that, in addition to identifying a resource, provide a means of locating the resource by describing its primary access mechanism (e.g., its network "location"). RFC 3986
15. WIS2 strongly prefers secure versions of protocols (e.g., HTTPS) wherein the communication protocol is encrypted using Transport Layer Security (TLS)
16. Spatio Temporal Asset Catalogue (STAC) https://stacspec.org/en
17. See https://www.iasa-web.org/tc04/submission-information-package-sip or end of https://www.eumetsat.int/formats
18. OpenAPI Specification https://spec.openapis.org/oas/v3.1.0
19. Open Geospatial Consortium OGC API https://ogcapi.ogc.org/
20. OGC API - Environmental Data Retrieval (EDR) https://ogcapi.ogc.org/edr
21. OGC API - Features https://ogcapi.ogc.org/features
22. OGC API - Coverages https://ogcapi.ogc.org/coverages
23. WMO World Weather Watch https://wmo.int/world-weather-watch
24. In the context of WIS2, real-time implies anything from a few seconds to a few minutes - not the milliseconds required by some applications.
25. WIS2 ensures rapid global distribution of notification messages using a network of Global Brokers which subscribe to message brokers of WIS2 Nodes and republish notification messages (see Global Broker).
26. IANA Link Relations https://www.iana.org/assignments/link-relations/link-relations.xhtml
27. The "experimental" topic is necessary for the WIS2 pre-operational phase and future pre-operational data exchange in test mode.
28. The Global Discovery Catalogue will reject discovery metadata records containing links to topics outside the official topic-hierarchy.
29. A Global Cache provides short-term hosting of data. Consequently, it is not an appropriate mechanism to provide access to archives of Core data, such as Essential Climate Variables. Providers of such archive data must be prepared to serve such data directly from their WIS2 Node.
30. Default value for the cache property is true; omission of the property will result in the data object being cached.
31. Excessive data volume isn’t the only reason they may refuse to cache content. Other reasons include: too many small files, unreliable download from a WIS2 Node, etc.
32. OpenAPI Security Scheme Object: https://spec.openapis.org/oas/v3.1.0#security-scheme-object
33. A specific API key should be used for data publication via WIS2 so that usage can be tracked.
34. Given that users are encouraged to download Core data from the Global Cache, there will likely be only a few accesses using the WIS2 account’s API key. If the usage quota for the WIS2 account is exceeded (i.e., further data access is blocked) then this should encourage users to download via the Global Cache as mandated in the Manual on WIS (WMO-No. 1060), Volume II.
35. Working with presigned URLs on Amazon S3 https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-presigned-url.html
36. MQTT broker managed services are available online, often with a free (no cost) starter plan sufficient for infrequent publications of notifications about metadata. These provide a viable alternative to implementing an MQTT broker instance yourself.
37. MQTT Specifications: https://mqtt.org/mqtt-specification/
38. TFC 7231 - Hypertext Transfer Protocol (HTTP/1.1): https://datatracker.ietf.org/doc/html/rfc7231
39. IANA Top Level Domains https://data.iana.org/TLD
40. The “.gov” part of the domain name is superfluous for the purposes of WIS2. There is nothing preventing its use, but it doesn’t add any value.
41. The default connection credentials for a WIS2 Node message broker are username everyone and password everyone. WIS2 Node operators should choose credentials that meet their local policies (e.g., password complexity).
42. In WIS2 we use IP addresses to determine the origin of connections and therefore confer trust to remote systems. It is well documented that IP addresses can be hi-jacked and that there are alternative, more sophisticated, mechanisms available for reliably determining the origin of connections requests, such as Public Key Infrastructure (PKI). However, the complexities of such implementation would introduce a barrier to Member’s participation in WIS2. IP addresses are considered to provide an adequate level of trust for the purposes of WIS2: distributing publicly accessible data and messages.
43. In some cases, WIS2 Nodes will need to serve Core data directly (see Considerations when providing Core data in WIS2). In these situations, the WIS2 Node data server must remain publicly accessible.
44. OpenAPI Security Scheme Object: https://spec.openapis.org/oas/v3.1.0#security-scheme-object
45. OpenMetrics is proposed as a draft standard within IETF.
46. https://openmetrics.io
47. https://github.com/wmo-im/wis2-metric-hierarchy
48. It is also technically possible to filter recommended data by using a wildcard subscription such as origin/a/wis2/+/data/core/#. However, avoiding wildcard subscription is generally considered good practice as it limits the burden of the broker operated by Global Brokers.
49. OGC-API Records - Part 1 https://docs.ogc.org/DRAFTS/20-004.html
50. The PANS-IM is expected to available on ICAO NET by July 2024 and become applicable in November 2024. Information provided in herein is based on best understanding of draft proposals from ICAO.
51. AMQP 1.0 is one of the protocols proposed in the draft PANS-IM
52. IWXXM (FM205) is defined in the Manual on Codes (WMO-No. 306), Volume I.3 – International Codes
53. WIS2 follows the recommendations from OpenAPI regarding choice of security schemes for authenticated access - a choice of HTTP authentication, API keys, OAuth2 or OpenID Connect Discovery. For more information see OpenAPI Security Scheme Object: https://spec.openapis.org/oas/v3.1.0#security-scheme-object
54. WIS2 does not provide any guidance on use of Public Key Infrastructure (PKI).
55. Global Caches enable highly available, low-latency distribution of Core Data. Given that Core Data is provided on a free and unrestricted basis, Global Caches do not implement any data access control.
56. Where the data resource does not exceed 4Kb, it may additionally be embedded in the notification message.
57. Note that the figure simplifies the transmission of discovery metadata from WIS2 Node to the Global Discovery Catalogue. In reality, the WIS2 Node publishes notification messages advertising the availability of new discovery metadata resource at a given URL. These messages are republished by the Global Broker. The Global Discovery Catalogue subscribes to a Global Broker and downloads the discovery metadata from the WIS2 Node using the URL supplied in the message.
58. WIS2 recommends that one subscribes to notifications from a Global Broker. However, where both Gateway and WIS Node are operated by the same organisation, it may be advantageous to subscribe directly to the local message broker of WIS2 Node, e.g., to reduce latency.
59. The WIS2 Node may control access to data - the Gateway will need to be implemented accordingly.
60. In case a unique identifier is required for proper passing of an aviation weather message to the Gateway, one can use the GTS abbreviated heading (TTAAii CCCC YYGGgg) in the COLLECT envelop (available in IWXXM messages having a corresponding TAC message), or content in attribute gml:identifier (available in newer IWXXM messages like WAFS SIGWX Forecast and QVACI), for such purpose. There is currently no agreed definition for unique identifier of IWXXM METAR and TAF reports of individual aerodrome.
61. OpenMetrics: https://openmetrics.io
62. AMQP 1.0: https://www.amqp.org/resources/specifications
63. https://schema.org
64. https://go-fair.org
65. Guide to Information Technology Security (WMO-No. 1115): https://library.wmo.int/records/item/51145-guide-to-information-technology-security
66. Guide to the WMO Information System (WMO-No. 1061), Vol I: https://library.wmo.int/records/item/28988-guide-to-the-wmo-information-system
67. Guide to the WMO Information System (WMO-No. 1061), Vol I: https://library.wmo.int/records/item/28988-guide-to-the-wmo-information-system