This section describes why and how the organisation can monitor the use of its opened datasets.
Why should the use of opened datasets be monitored?
Investment decisions in open data publications often need to be justified by value creation, i.e. the benefits derived from the investment. The organisation may also have set quantitative targets for such benefits. It is important to measure the benefits created by open data, making it possible to report them and to use them to justify future investments.
Organisations that have already opened their data are often interested in finding out the extent to which their datasets have been viewed or downloaded in the data portal. It is important to note that while the viewing and downloading data concerning datasets published on the portal or other online service do not indicate if the dataset has actually been used and how, they do give an indication of the interest attracted by the opened dataset.
How should the use of opened datasets be measured?
Measuring open data is difficult, but the scale of its use can be measured and estimated. You can choose from a number of indicators, such as view and download statistics, or API use metrics. In addition, the utilisation of data can be measured with other factors, such as media visibility and the number of organisations and applications making use of your data. Information on open data use can, for example, be collected by means of media monitoring (keywords) and surveys or by offering various incentives to report on data use. The table below contains examples of different use-related factors and indicators.
Examples of measurement methods
Measured factor | Example indicator |
---|
Downloads | Number of files downloaded |
Views | Number of files browsed |
API use | API usage statistics |
Data utilisation | Media monitoring and press articles |
Number of organisations using the data |
Number of applications using the data |
Surveys and interviews |
Use of data in hackathons |
Contacts by higher education institutions |
Data quality | Feedback: contacts, number, content, background organisations |
Monitoring and measuring the use of opened data also depends on the format it was published in. Different statistics can be obtained on the use of data published in a dynamic form, often through an API, and its use is reasonably easy to monitor. On the other hand, monitoring the use of static data (files) is more challenging.
Measuring data published as files
Static data is usually downloaded only once, after which the user has unlimited access to it. Static data downloads do not necessarily convey any information about the extent of data use or the impact of its publication. In addition, open data licences permit the re-sharing of data, which means that other parties can also share the file in their services. For example, an individual media outlet may download the dataset once and publish a news item based on it, which spreads to several other media outlets. Consequently, even one download can have a high impact.
Measuring data published through an API
It is possible to collect analytics on API use, which can then be used to develop the API and possibly also to improve data quality. Statistics can be compiled using various applications, such as Matomo. For example, analytics can be collected on the general volume and methods of data use, API requests and user locations.
If the use of the API requires an API key, this key can be used to distinguish between data users and to monitor the API’s utilisation rate. API keys can also be distributed anonymously, allowing data users to retain their anonymity, and the use of the API key does not undermine the basic principles of openness. If someone misuses an open data API that requires an API key, the key can be used to differentiate between users and, if necessary, limit server loads.
Monitoring methods used in Suomi.fi Open Data service
The view and download rates of data published on Open Data service are displayed openly on the page for each dataset. In addition, the service’s Statistics page provides additional information on e.g. the most popular datasets. The service also has an application gallery, which maintains an overview of applications that use open data, with emphasis on the central government’s open datasets.
Monitoring methods used by the Finnish Meteorological Institute
The Finnish Meteorological Institute uses Spatineo Monitor for usage monitoring (downloads, user groups). In addition, the development of a continuous situational picture and dashboard is under way with Spatineo. The Finnish Meteorological Institute reports on the use of its open data in its data balance sheet.