GDP Data Quality Ratings

Updated: November 2025


World Economics has developed the Global GDP Data Quality Ratings to review the usefulness of official GDP data of individual countries

The Ratings currently cover five factors to determine data quality.

Each factor is evaluated to provide country scores which are then normalised using the standard deviation of the data for each factor and combined into the DQR score using an average aggregate to reflect the importance of each of the individual factors.


 

Measuring GDP Quality

World Economics has developed the Global GDP Data Quality Ratings to review the usefulness of official GDP data of individual countries. The Ratings currently cover five factors to determine data quality. Each factor is evaluated to provide country scores which are then normalised using the standard deviation of the data for each factor and combined into the DQR score using an average aggregate to reflect the importance of each of the individual factors.

These five factors used to judge data quality are:

  1. Base Year used to calculate the GDP data (chained or years out of date)
  2. Standard of National Accounts (SNA) applied
  3. Estimated Size of the Informal Economy
  4. Resources Devoted to Measuring Economic Activity
  5. A proxy measure for likely Government Interference in Economic Data production

It should be noted that there is not infrequent variation between what the World Bank and IMF list as the most recent Base Year and/or most recent SNA in use, and what countries themselves claim to be using. This is sometimes caused by often unavoidable time lags in the International organisations being informed of changes that have taken place locally and sometimes simple error is involved. Whatever the reasons, World Economics takes some trouble to find out what is the on-the-ground reality behind the figures. If we also fail to reflect the latest changes occasionally, we apologise in advance, but hope the data in this report is as correct and timely as is possible to achieve.
 

Key Variables and Methodology

 

Base Year (Range from 1990 to Chained)

Constant price estimates of GDP use the inflation adjusted price of goods and services relative to a particular year, known as a base year, to weight the volume components of output. But since the structure of production and relative prices over time are dynamic, the structure of the prices of products and the industries surveyed in the base year become less relevant over time. For some rapidly changing products (such as the smartphone in recent years) rapid technological change and relative price falls make any kind of comparison fraught with difficulty.

What is clear is that using data from 10 or 20 years ago (as many countries do) as a basis for calculations of the size and shape of economic activity, is unlikely to produce reliable estimates of GDP. In countries that revise base dates, very significant increases are usually recorded, highlighting just how inaccurate data is that has been produced using out of date base years.

When GDP is revised and the base year is updated, it allows the statistician to reweight the relative importance of the different sectors of economic activity, and further change or reconsider the methods and data sources.

The United Nations recommends updating base years every five years, although, most developed countries now adopt the practice of chaining, where relative prices are updated every year. The more out of date a country’s base year, the more inaccurate are estimates of GDP and the lower a country’s score in the World Economics Data Quality Ratings (DQR).

The base year score for each country in the DQR is a number between 0 and 100, with 100 indicating that a country is using a chaining system, where base years (or relative prices) are updated every year). Information on individual country’s base year is taken from the World Bank’s World Development Indicators (WDI), IMF’s World Economic Outlook Report, United Nations and National Statistics Offices. Base year points for use in the data ratings are then calculated by taking the range of base years used and applying a sliding scale based on the number of years out of data. 100 indicates Chained or the latest possible base year where the oldest base year used (Bolivia, 32 years) is assigned the lowest score of 0. All variations between these years are deducted points for each year out of date.
 

System of National Accounts: (SNAs used range from 1968 to 2008)

National income measurement is governed by a global standard: the United Nations System of National Accounts (SNA) - an internationally agreed standard set of recommendations on how to compile and measure economic activity and facilitate international comparability of economic statistics. The first SNA was published in 1953 and there have been three revisions SNA 1968, SNA 1993, and SNA 2008.

The longer it takes a country to update its SNA the less reliable the data becomes, particularly when used for economic comparisons to a country with a more recent SNA version. In the World Economics Data Quality Ratings, the newer the SNA version, the higher a country’s score.

The score for the SNA component is based on a scale of 0-100, with 100 points given to countries using the latest SNA version. Information on individual country’s SNA is taken from the World Bank’s World Development Indicators (WDI), IMF World Economic Outlook Report, United Nations and National Statistics Offices.

 

Informal Economy: (ranges from less than 6.9% to over 72.9% of GDP)

In many poorer countries, a very large swathe of activity can remain uncounted and even in wealthy countries, some informal activities remain outside the national accounts. But due to the nature of much informal work, ranging from housework, farming through to gambling, prostitution, drug dealing, and smuggling, calculations of the value of such activities are extremely difficult. The existence of such large amount of informal activity is so economically important that to leave it unrecorded in the official national accounts is unsatisfactory.

There have been many attempts to estimate the size of parts of the informal economy. The World Economics Data Quality Ratings employs estimates for the World Economics Quarterly Informal Econony Survey.

In constructing the data, the higher the size of the informal economy, the lower a country’s factor score. A DQR score of 100 means that a country has the lowest rate of informal sector activity as percentage of GDP.

 

Resources Available for Producing National Accounts Data

The quality of national income estimates depends to some extent on the statistical capacity and the resources available to national statistics offices. The United Nations System of National Accounts has put a global standard in place but the challenge for a local national statistics office is to produce a measure of the economy, usually with limited resources. Statistical capacity, or the ability to adhere to the global standard, depends critically on the resources and information available at any given time and place.

All other things being equal, there are a priori grounds to believe that poorer economies will have lower-quality statistics. The statistical capacity and economic resources in national statistics offices therefore matter a great deal in terms of data availability and quality of economic statistics. Data availability is subject to the number of trained staff and the level of resources available for collecting, processing and analysing the data.

As an illustration of the importance of resources in the collection of data, few have shown with greater clarity the nature of the problem than Morten Jerven in his book Poor Numbers, 2013, based on actual visits to statistics offices in Africa. To quote Jerven: “This book has shown that the most basic metric of development , GDP, should not be treated as an objective number but rather as a number that is the product of a process in which a range of arbitrary and controversial assumptions are made. As a result the metric should be used with the utmost care. The quality of this number depends on the state of the system that produces the statistics and this system is deficient in many poor countries.” This problem is not confined to Africa but is evident in countries on all continents.

The score for this component used in the World Economics GDP Data Quality Ratings is derived from the World Economics Statistical Resource Index (SRI).

The SRI is a composite indicator developed by World Economics to measure the level of resources available to national statistical systems (specifically National Accounts preparation and Population data).

It is constructed from primary data drawn from two sources: the World Bank Statistical Performance Indicators (SPI) and the Open Data Watch Open Data Inventory (ODIN). For each underlying indicator, country-level z-scores are calculated to standardise distributions across variables with differing scales and units. These z-scores are subsequently transformed onto a 0–100 scale (where 100 represents the highest observed performance). The standardised scores from the two source datasets are then aggregated through an equally weighted combination to yield two distinct sub-indices: one specifically targeted at resources supporting the production of GDP-related statistics and another focused on resources underpinning population and demographic statistics.

We use this former index as a proxy for assessing the availability of economic resources in national statistics offices. In theory, the larger the resources devoted to statistics offices, the better the quality of statistics. This is a proxy measure.

.

 

Government Interference

Governments interfere with the production and dissemination of basic economic data in many ways. Attempts in Greece to prosecute and potentially jail Andreas Georgiou, the man hired by the IMF to sort out the corrupt mess of Greek economic data is perhaps the most egregious recent example. Read: The Case of Andreas Georgiou.

The Greek instance might appear to be an extreme special case. But unfortunately there are also many occurrences of serious Government interference in the production of economic data in the Americas.

For example, in the paper (On Measuring Hyperinflation; Venezuela's Episode, by Hanke), records the following:

The Banco Central de Venezuela (BCV), like many central banks, has followed a pattern that Oskar Morgenstern elegantly documents in his classic work On the Accuracy of Economic Observations. Indeed, the BCV has failed to report data that would reflect poorly on the government, and when it has reported inflation statistics, it has lied and doctored the data. Instead of reporting Venezuela’s ‘real’ open rate of inflation, the BCV has attempted to measure suppressed inflation.

Venezuela imposes a thick blanket of price controls and a maze of subsidies over the economy. List prices are artificially held down. Yet these suppressed prices are the ones that, in principle, the BCV attempts to measure and use to construct a price index for calculating the inflation rate. But this metric misses the mark. Arbitrage opportunities prevail under the Venezuelan regime of price controls and subsidies, because there is a gap between the items under price controls and the prices of those goods and services that are actually exchanged on the black market. And it is in the black market and underground economy that most of Venezuela’s economic activity occurs. In consequence, there is a huge gap between the official inflation rate, which is based on artificially suppressed prices, and the ‘real’ open inflation rate.

And similarly, a paper by Ariel Coremberg "Measuring Argentina's GDP Growth; Myths and Facts" makes the following points:

  • Since 2007, official economic statistics in Argentina, particularly on consumer inflation and GDP, have been subject to political manipulation.
  • This paper reproduces Argentine national income from 2007 using standard methods and original sector data and finds that declared GDP is 12.2% higher in 1993 prices due to political intervention.
  • The paper finds that the distortion is mainly due to changes in accounting methodology across industries and not to changes in inflation estimates.
  • The reproduced GDP data dispels the myth that Argentina has been the fastest growing South American economy in recent years.

Governments and Government agencies manipulate GDP data directly in many ways, for example through the calculation of price indexes such as the GDP deflator which impact on GDP per capita data. They can and do stop publishing important data prior to elections. They try to abolish independent statistics bureaus. They try to add questions that will bias responses to Census data. They leave in place price indexes known to be unreliable and impacting heavily and negatively on crucial pensions systems.

This is not only a problem evident in poor countries, although countries with autocratic systems probably suffer to a greater extent. Sometimes the transgressions are deliberate, and sometimes due to incompetence or lack of resource.

Government corruption also infect all parts of an economy and its accurate measurement in systematic ways. Often a direct result of the government’s concentration of economic or political power, corruption manifests itself in many forms such as bribery, extortion, nepotism, patronage, embezzlement, and graft.

For example, excessive and redundant government regulations provide opportunities for bribery or graft. In addition, government regulations or restrictions in one area may create informal markets in another. As a result, governance and the informal economy are often correlated.

All these potential ways of corrupting data are difficult to measure directly. We have adopted a general measure of Governance as a proxy for Government interference. The score for this component of the DQR is derived directly from the World Economics Governance Rankings for over 155+ countries.

The Governance score is based on a 100-point scale in which a score of 100 indicates very good levels of governance and a score of 0 indicates very poor governance.

 

Calculating the Data Ratings

Each of the component indices is first transformed into standardised z-scores to ensure comparability across components with different scales and distributional properties. This standardisation is achieved by subtracting the global mean of each component from the country’s raw score and dividing by the global standard deviation of that component. For components where a lower raw value indicates better performance (e.g., size of the informal economy or base-year lag), the raw score is multiplied by −1 prior to standardisation so that higher z-scores consistently reflect superior data quality.

The standardised z-scores for the components are then aggregated into a composite GDP Data Quality Index using a simple unweighted average, assigning equal weight (20%) to each dimension. This equal-weighting approach reflects the view that no single component can fully substitute for the others when assessing the overall reliability and accuracy of GDP statistics.

The resulting composite index is rescaled to a uniform 0–100 range, where higher scores indicate higher GDP data quality and reliability. This rescaling is performed using the observed minimum and maximum composite z-scores in the current year (or a fixed historical reference range where greater intertemporal stability is desired), enabling intuitive interpretation and consistent cross-country and over-time comparisons.

Countries are assigned summary ratings of A through E based on their final 0–100 composite scores using a bell-curve grading approach identical to that employed for other World Economics Indexes. Rather than dividing countries into fixed quintiles or equal-sized groups, the rating boundaries are determined dynamically each year by the actual spread of composite scores as measured by the standard deviation from the global mean. This method ensures that grades reflect genuine deviations from average global performance rather than imposing an arbitrary equal distribution across categories.

The rating categories are defined as follows:

  • Grade A: As Good as it Gets
    Very high data quality and reliability. Comparable to the best statistical systems in the world. The quality of the economic data in ‘A’ graded countries is “as good as it gets” and can be used for most purposes, although the usual caution must be taken given less than full implementation of the latest standards, difficulties in estimating government output and the services sector.
     
  • Grade B: Good
    The quality of economic data in ‘B’ graded countries is generally good but should be used with some caution, it means that it provides a reasonable guide of GDP for most purposes , but it should not be used to make direct comparisons to countries ranked with A grade data. Particular attention should be made to the quality of government data and price indexes.
     
  • Grade C: Use with caution
    Moderate data quality. The quality of economic data in ‘C’ graded countries should only be used as a general guide, but some attempts are being made by many countries to improve accuracy.
     
  • Grade D: Poor
    Substantial deficiencies in methodology, coverage, or timeliness. Official GDP figures offer only a rough guide and are subject to large potential revisions or biases. The quality of economic data in ‘D’ graded countries provides a poor guide.
     
  • Grade E: Extremely Poor
    Severe limitations render official GDP data highly unreliable. The quality and reliability of economic data in ‘E’ graded countries is extremely poor and official GDP data should not be used for any purpose.

References

  • Fioramonti, Lorenzo. (2013). Gross Domestic Problem: The Politics Behind the World's Most Powerful Number.
  • Medina L. and Schneider F., Shadow Economies Around the World: What Did We Learn Over the Last 20 Years?;. International Monetary Fund, Working Paper No. WP/18/17.