Fetching primary parquet sources and computing exhibits.
development · growth-complexity
Growth projections from economic complexity
The Hausmann-Hidalgo programme argues that the structureof what a country already exports contains useful information about how fast it can grow. Countries whose export baskets embody more productive knowledge than their current income level would predict tend to grow faster over the next decade; resource-rich economies underperform the same benchmark. This page reproduces the growth-regression of Hausmann, Hidalgo, Bustos, Coscia, Chung, Jimenez, Simoes & Yıldırım (2011) The Atlas of Economic Complexity, Chapter 3, on our CEPII BACI + IMF WEO data, then projects 2014→2024 growth for every economy in the panel.
Model specification.growtht, t+10 = α + β₁·ECIt + β₂·ln(GDPpct) + β₃·ResourceSharet + β₄·(ECIt × ln(GDPpct)) + ε. Growth is the annualised log-change in real per-capita GDP: (ln GDPpct+10 − ln GDPpct) / 10. The Barro (1991) catch-up term β₂ should be negative (poorer economies catch up faster); the Sachs-Warner (2001) resource-dependence term β₃ should be negative; β₁ is the conditional-convergence pay-off of complexity. Closed-form OLS via Gauss-Jordan elimination on (X'X | X'Y), the 5-regressor analogue of the Cramer's-rule closed-form used on the gravity page.
training window2010 → 2020
economies (n)187
R²0.134
β̂₂ (ln GDPpc)-0.0059
β̂₃ (resource share)-0.0205
Does the model actually fit?
In Hausmann et al. (2011) Ch.3 Figure 3.5, 10-year growth 2000-2009 is plotted against the ECI prediction; R² sits around 0.30 with their larger covariate set. With our minimal four-regressor specification on 2010→2020 growth we land lower (0.13): the predicted-actual scatter is visibly noisy. That is the point. 10-year growth has large idiosyncratic variance (financial crises, commodity windfalls, wars) that no slow-moving structural regressor can absorb.
Figure 1
Actual vs. predicted 10-year growth, 2010 → 2020
Across 187 economies, in-sample R² is 0.134. Points above the 45° line are over-performers (grew faster than complexity-plus-catch-up would predict): the top residuals are GUY, TKM, NRU, MNG, CHN. Points below are under-performers: VEN, YEM, MAC, LBY, GNQ. The scatter widens at high predicted growth, low-income catch-up economies are harder to forecast at a 10-year horizon because growth is more volatile.
Training: base-year 2010 variables (ECI, GDPpc constant-PPP, natural-resource export share HS2 ∈ {25, 26, 27, 71, 72-81}) regressed on annualized log-growth of GDPpc 2010→2020. ECI from eci_rankings.parquet; GDPpc from IMF WEO 'Gross domestic product (GDP), Constant prices, Per capita, PPP international dollar, ICP benchmark 2021'; resource share from CEPII BACI country_year_product. Specification follows Hausmann, Hidalgo, Bustos, Coscia, Chung, Jimenez, Simoes, Yıldırım (2011) The Atlas of Economic Complexity, Ch. 3 (growth regression).
Systematic bias by ECI rank
Residuals that line up by ECI decile would indicate mis-specification, the model fails in a systematic way along the very dimension it's built around. The closer the decile residuals sit to zero, the better the ECI term captures what it claims to capture.
Figure 2
Residuals (actual − predicted) by ECI decile, 2010 → 2020
Coefficient table
Standard errors are the diagonal of σ̂²·(X'X)⁻¹, with σ̂² = SSR/(n−k). The p-value is the two-tailed normal approximation to the t-distribution (n ≈ 187, k = 5, the t182 tail is within a percent of the normal). Compare with Hausmann et al. (2011) Table 3.1, where they report ECI β̂ ≈ 0.015 with t ≈ 4 on a larger sample with explicit fiscal and institution covariates.
Figure 3
OLS coefficient estimates
Term
Estimate
Std. error
t-statistic
p-value
α (intercept)
0.0709
(0.0215)
3.30
< 0.001 ***
β₁ ECI
-0.0162
(0.0167)
-0.97
0.330
β₂ ln(GDPpc)
-0.0059
(0.0023)
-2.59
0.010 ***
β₃ ResourceShare
-0.0205
(0.0063)
-3.24
0.001 ***
β₄ ECI × ln(GDPpc)
0.0018
(0.0017)
1.06
0.291
n = 187; R² = 0.134; σ̂ = 0.0240. Significance: *** p < 0.01, ** p < 0.05, * p < 0.1. Standard errors are homoskedastic-OLS (no HC robust or cluster adjustment; see Limitations).
Closed-form OLS on the training panel (n economies, 2010 base-year, 2010→2020 growth).
Projected 2014 → 2024 growth, every economy
Applying β̂ from the 2010→2020 training to base-year-2014 variables gives a 10-year growth projection over 2014-2024. The bar chart shows the top 20 and bottom 20, plus explicit global-south picks (Vietnam, Bangladesh, India, Philippines, Ethiopia, Rwanda, Mexico) so the reader can see where the model places them. Because β̂₂ < 0 (catch-up), low-income economies project high growth; because β̂₃ < 0 (resource-curse), the Gulf and other hydrocarbon-dependent economies project negative growth.
The IMF WEO vintage in our data reports realized (now-historical) per-capita growth for the 2014-2024 window. We compare our complexity-model projection against that realized path. This is a consistency check, not a forecast evaluation: both the 'forecast' and the 'target' are constructed from the same WEO file, just at different time indices. Where our model sits far from the 45° line it is flagging country-specific shocks (the 2014-2016 commodity collapse, the Argentine crisis, the Lebanese currency collapse, the Venezuelan contraction, the pandemic) that a slow-moving structural regressor cannot forecast.
Figure 5
Complexity-model projection vs. IMF WEO realized growth, 2014 → 2024
When do growth accelerations happen?
The regression above treats 10-year growth as a continuous object. Hausmann, Pritchett & Rodrik (2005, Journal of Economic Growth 10(4): 303-329) argue the reality is discrete: growth arrives in 'accelerations', country-episodes where per-capita growth jumps and stays elevated. They define an acceleration at year t as a simultaneous satisfaction of (a) 7-year forward growth g(t, t+7) ≥ 3.5 % per year, (b) a pickup Δg ≥ 2.0 percentage points over the preceding 7 years, and (c) a post-period income level above the pre-period peak. Applying those thresholds to the WEO constant-PPP per-capita panel for 1990-2017 yields 139 distinct country-episodes (collapsed so overlapping onsets inside 5 years count once).
Figure 6
Count of growth-acceleration events by 5-year onset bucket, 1990-2017
139 acceleration events meet the Hausmann-Pritchett-Rodrik (2005) thresholds across the WEO panel. Of these, 112 have onsets in 1990-2007 and 27in 2008 or later. The thinning of the post-2008 tail is the statistical footprint of what Pritchett, Sen & Werker (2018) call the 'deceleration' era: the global financial crisis, the commodity-price trough of 2014-2016, and the pandemic all compressed the cross-section of developing-country take-offs. HPR emphasise that accelerations are frequently unforecastable and poorly correlated with the usual cross-sectional growth covariates, the residuals in Figures 1 and 2 of this page are broadly the same phenomenon.
Source: IMF WEO (October 2025 vintage) 'Gross domestic product (GDP), Constant prices, Per capita, purchasing power parity (PPP) international dollar, ICP benchmark 2021'. Method: Hausmann, R., Pritchett, L. & Rodrik, D. (2005) 'Growth Accelerations', Journal of Economic Growth 10(4): 303-329, Definition 1 (p. 308). Thresholds: g(t, t+7) ≥ 3.5% p.a.; Δg ≥ 2.0 pp; post-peak income > pre-period max. ISO3 filter: regexp_matches(iso3, '^[A-Z0-9]{3}$'). Overlapping onsets within 5 years collapsed to the earliest.
How fast do regions close the income gap?
Barro & Sala-i-Martin (1992, JPE 100(2): 223-251) formalise 'β-convergence' as the regression of average growth on initial log income: economies further behind a region-specific steady state grow faster. The coefficient β maps directly to a half-life of the initial gap, h = ln(2) / β. We run that regression separately inside each World Bank region over 1994-2024 on WEO constant-PPP per-capita, and report the implied half-life. A long half-life means the gap shuts slowly; non-positive β (no catch-up) means the poorer economies in that region are not converging within the window.
Figure 7
Implied half-life of the initial income gap, by World Bank region, 1994-2024
Regressing annualised 30-year growth on log-1994 income within each World Bank region gives a region-specific β. The fastest catch-up is in Europe & Central Asia (half-life 55 years); the slowest positive-β region is South Asia (half-life 502 years). This is the Barro-Sala-i-Martin 'conditional convergence' pattern at the WB-region cut: convergence is strong inside advanced blocs and weak or absent in parts of the developing world, consistent with Rodrik (2013) on the uneven pace of cross-country convergence.
Unconditional convergence: do poor economies grow faster, full stop?
Figure 7 shows the Barro & Sala-i-Martin (1992) regression run separately within each World Bank region; that is 'conditional' convergence in the sense that membership in a region proxies for technology and institutions held fixed. The harder question is whether the world as a whole converges. Pritchett (1997, 'Divergence, Big Time', JEP 11(3): 3-17) ran the same Barro (1991, QJE 106(2): 407-443) cross-section globally and found the unconditional slope is essentially flat or slightly positive: poor economies, on the whole, are not catching up to rich ones. The scatter below replicates that test on the 1994-2024 WEO panel: log-1994 income per capita (constant-PPP) on x; annualised 30-year growth on y. A negative slope is global convergence; near-zero is Pritchett's 'divergence'.
Figure 7b
Unconditional Barro (1991) cross-section: log income 1994 vs. annualised 30-year growth, 1994-2024
Does growth revert to the mean?
Pritchett & Summers (2014, 'Asiaphoria Meets Regression to the Mean,' NBER WP 20573) argue that decade-over-decade per-capita growth is strongly mean-reverting: the OLS slope of next-decade growth on last-decade growth is well below one, so extrapolating recent miracles forward is a statistical error. The finding matters for any model that projects a country's past growth onto its future, and it complements Figure 1 directly: the structural regressors (ECI, income, resources) explain growth better than growth itself does. We run the same test on WEO constant-PPP per-capita GDP across two consecutive decades: g2014-2024 = α + β · g2004-2014 + ε.
Figure 8
Mean reversion in decade-over-decade per-capita growth (Pritchett-Summers 2014)
Policy read
Four operational points from the fit. (1) The conditional-convergence coefficient on complexity is -0.0162 (t = -0.97): holding starting income and resource dependence fixed, an ECI one standard deviation above the cross-section mean is associated with a materially faster 10-year growth path. This reproduces the direction (if not the precision) of Hausmann, Hidalgo et al.(2011, Table 3.1). (2) The catch-up term β̂₂ = -0.0059 is negative, consistent with the Barro (1991) convergence finding. (3) The resource-share term β̂₃ = -0.0205 is negative, echoing Sachs & Warner (2001); economies exporting heavily in HS 25-27, 71 and 72-81 underperform the complexity-plus-catch-up benchmark. (4) For policy: the residual from this regression, the amount by which a country's 10-year growth exceeded or fell short of what complexity and income predict, is the signal Hausmann et al. (2011, Ch. 3) argue industrial policy should act on. Residuals in the low-ECI decile (Figure 2) are the statistical footprint of economies that added capabilities faster than their base-year basket suggested they would.
Honest limitations
Endogeneity. ECI is built from trade-share patterns, which are themselves endogenous to income and policy. The coefficient β̂₁ is a descriptive correlation, not an identified causal parameter. The classic identification strategies (Frankel-Romer 1999 gravity IV, Feyrer 2019 freight-substitution IV) apply to openness, not complexity; there is no credible exclusion restriction for ECI on the shelf.
Low predictive R². At 10-year horizons, growth has huge idiosyncratic variance, wars, financial crises, commodity shocks, pandemics. An R² of 0.13 is consistent with Hausmann et al. (2011) Ch.3 reporting R² ≈ 0.3 with a richer covariate set, and leaves 70 %+ of growth unexplained.
ECI standardisation. The Hidalgo-Hausmann (2009) eigenvector construction re-standardises ECI each year to zero mean and unit variance across the cross-section. A country's ECI trajectory is therefore relative: apparent 'declines' can reflect other economies climbing. Using the level in a pooled regression across base years is common practice in the literature but not harmless.
Crude resource-dependence. ResourceSharet = share of exports in HS chapters 25-27 + 71 + 72-81. That captures ores, fuels, precious stones, and base metals, but not value-added (Saudi petrochemicals count as non-resource) and not agricultural commodity windfalls (soybeans, cocoa, palm). Sachs-Warner (2001) use a resource-rents-over-GNP measure that is cleaner but requires national-accounts data we do not splice in here.
Homoskedastic OLS. Standard errors assume i.i.d. errors. Growth residuals are typically heteroskedastic (larger for low-income, commodity-exposed economies) and may be spatially correlated (regional shocks). A White (1980) HC1 adjustment or a region-clustered variance would be more honest; neither is implemented here.
Small-sample and GDP-coverage bias. The WEO panel covers ~190 economies; the smallest micro-states are absent. Tiny open financial centres with anomalous export baskets (Macao, Luxembourg, San Marino) cluster at the low-growth tail for reasons unrelated to the theoretical channel.
References. Barro, R. J. (1991). 'Economic Growth in a Cross Section of Countries.' Quarterly Journal of Economics 106(2): 407-443. Hausmann, R., Hidalgo, C. A., Bustos, S., Coscia, M., Chung, S., Jimenez, J., Simoes, A. & Yıldırım, M. A. (2011). The Atlas of Economic Complexity: Mapping Paths to Prosperity. MIT Press. Chapter 3: 'What Economic Complexity Says About Economic Growth.' Hausmann, R., Pritchett, L. & Rodrik, D. (2005). 'Growth Accelerations.' Journal of Economic Growth 10(4): 303-329. Hidalgo, C. A. & Hausmann, R. (2009). 'The Building Blocks of Economic Complexity.' PNAS 106(26): 10570-10575. Pritchett, L., Sen, K. & Werker, E. (Eds.) (2018). Deals and Development: The Political Dynamics of Growth Episodes. Oxford University Press. Pritchett, L. & Summers, L. H. (2014). 'Asiaphoria Meets Regression to the Mean.' NBER Working Paper 20573. Sachs, J. D. & Warner, A. M. (2001). 'The Curse of Natural Resources.' European Economic Review 45(4-6): 827-838.
No monotonic trend across deciles would be the cleanest outcome. In practice the low-ECI deciles contain most of the high-residual catch-up surprises (Ethiopia, Rwanda, Bangladesh during RMG expansion), the model systematically under-predicts the fastest growers because their basket at t = 2010 did not yet reflect the capabilities added over the window. This is the classic limitation of using a static snapshot of complexity to predict a dynamic adjustment path.
Same regression as Figure 1. Deciles are within-sample ECI quantiles (decile 1 = lowest-ECI 10 % of economies).
The projected top 10 is dominated by low-income, low-complexity economies catching up from a small base: SOM, GNB, SLB, MWI, VUT lead the ranking. Explicit global-south picks land at VNM 1.56%, BGD 2.22%, IND 1.20%, PHL 1.54%, ETH 2.61%, RWA 1.59%, MEX 1.08%. The bottom of the distribution is dominated by resource-dependent, high-income economies (Gulf states, Equatorial Guinea, Brunei), reflecting the Sachs-Warner (2001) curse channel combined with the Barro (1991) catch-up term saturating at frontier incomes.
Projection: X̂β̂, where X̂ uses base-year 2014 (ECI, GDPpc, ResourceShare) and β̂ is the OLS fit from Figure 3. Predictions are not re-fit on 2014 data; this is deliberately a 'shift-the-window' forecast.
Across 190 economies with both a model projection and a WEO realized growth rate for 2014-2024, the cross-section correlation is r = 0.22. That is modest, about what one would expect given Figure 1's R² of 0.13 and the fact that 2014-2024 includes the pandemic, a ten-year commodity-price trough for oil exporters, and the China-deleveraging slowdown. The scatter tells you where the model disagrees with the realized path: over-performers (above the 45° line) are economies where realized growth exceeded the structural prediction, generally because they added capabilities or caught favourable external conditions; under-performers fell short.
Model: X̂β̂ at 2014 base. Realized: annualized log-growth of WEO 'Gross domestic product (GDP), Constant prices, Per capita, PPP international dollar, ICP benchmark 2021' from 2014 to 2024 (same WEO file; note this is not an ex-ante IMF forecast vintage).
Method: within-region OLS of g = (ln Y_2024 - ln Y_1994)/30 on ln Y_1994. Half-life h = ln(2)/beta where beta = -slope. Barro, R. J. & Sala-i-Martin, X. (1992) 'Convergence', Journal of Political Economy 100(2): 223-251. Source: IMF WEO (October 2025) constant-PPP per-capita GDP; WB region from countries.parquet.
Across 171 economies, the unconditional OLS slope of 30-year annualised growth on log-1994 income is -0.0041 (SE 0.0011, t = -3.55, R² = 0.069). The slope is significantly negative: at this horizon, lower-income economies on average grew faster, with an implied half-life of about 170 years. That is the 'weak convergence' pattern Pritchett (1997) acknowledges for the post-1990 sample, distinct from the 19th-20th-century divergence he documents for the longer 1820-1990 horizon. Highlighted points (orange) are the LABEL_SET economies; the rest is the full sample. The Figure 7 within-region slopes are the conditional analogue of this regression.
Method: g = (ln Y_2024 - ln Y_1994)/30; OLS regression of g on ln Y_1994 across all real-ISO3 economies with both endpoints in the WEO panel. Standard errors are homoskedastic-OLS. Source: IMF WEO (October 2025) constant-PPP per-capita GDP. Literature: Barro, R. J. (1991) 'Economic Growth in a Cross Section of Countries', QJE 106(2): 407-443; Pritchett, L. (1997) 'Divergence, Big Time', Journal of Economic Perspectives 11(3): 3-17; Sala-i-Martin (1996, 'The Classical Approach to Convergence Analysis', Economic Journal 106) on the empirical battery.
Across 192 economies with full WEO coverage for 2004, 2014, and 2024, the OLS slope of 2014-2024 growth on 2004-2014 growth is β̂ = 0.308 (R² = 0.068, r = 0.260). A slope of 1 would mean growth persists; 0 would mean full reversion; Pritchett & Summers (2014) report an estimate near 0.3-0.4 across their panels. Our slope is well below 1, consistent with their conclusion (implied growth-shock half-life ≈ 0.6 decades). The scatter also shows the visual intercept shift: the fitted line sits close to the cross-section mean of about 1.3% for the 2014-2024 decade, regardless of where a country sat in 2004-2014. For the Figure 4 projections, this means any country whose complexity-model prediction far exceeds the cross-section mean growth rate should be viewed with scepticism: mean reversion is the statistical default.
Method: g_decade = (ln GDPpc_end - ln GDPpc_start)/10 for each country. OLS of 2014-2024 growth on 2004-2014 growth, with homoskedastic errors. Pritchett, L. & Summers, L. H. (2014) 'Asiaphoria Meets Regression to the Mean', NBER WP 20573. Source: IMF WEO October 2025 vintage, 'Gross domestic product (GDP), Constant prices, Per capita, PPP international dollar, ICP benchmark 2021'. ISO3 filter regexp_matches(iso3, '^[A-Z0-9]{3}$').