Login
Section Educational Management

Determinants of University Research Output in Iraq: Estimating the Elasticities of Public Expenditure and Institutional Governance Using an Asymmetric NARDL Model, 2004–2025

Vol. 21 No. 3 (2026): Agustus:

Iqbal Hashem Mutashar (1), Afiefa B. Showket AL-Lami (2)

(1) Al-Mustansiriya University- College of Administration and Economics, Iraq
(2) Al-Mustansiriya University- College of Administration and Economics, Iraq
Fulltext View | Download

Abstract:










General Background: Research productivity is an important indicator of university performance and knowledge creation. Specific Background: Despite rising higher education expenditure, enrollment, and institutional expansion in Iraq during 2004–2025, research output growth remained relatively limited. Knowledge Gap: Previous studies have not developed a sector-specific measure of institutional deterioration in Iraqi higher education or examined its asymmetric relationship with research output. Aims: This study analyzes the roles of institutional integrity, credential quality, and public expenditure in determining university research output using a NARDL approach and a newly developed Credential Integrity Erosion Index (CIEI). Results: The findings indicate cointegration among the variables and show that institutional integrity and credential quality have stronger associations with research productivity than public expenditure. Institutional deterioration also generates larger and more persistent output losses than institutional improvement can restore. Novelty: The study introduces the CIEI and applies an asymmetric NARDL framework to Iraqi higher education. Implications: Improving institutional quality and credential verification systems is more critical than increasing expenditure alone for strengthening university research productivity.











Highlights:


• Institutional integrity and credential quality exhibit stronger relationships with research productivity than funding levels.


• Institutional deterioration produces larger and more persistent publication losses than equivalent institutional improvements can recover.


• The Credential Integrity Erosion Index offers a sector-specific framework for evaluating academic credential erosion in higher education.


Keywords: Institutional Integrity, Credential Quality, Research Productivity, NARDL Model, Higher Education Economics

Downloads

Download data is not yet available.

1. Introduction

Graduating one thousand engineers increases the total possible supply of researchers; however, there is an important distinction between "possible" and actual. This difference exists due to a collection of institutional structures that determine who receives degrees, what those degrees certify and the incentives academics receive with regard to how they allocate their time. When these institutional structures function properly, increased expenditures related to higher education result in increased research output. However, when the institutional structure breaks down, the relationship between expenditures and research output becomes unreliable; therefore, the typical human capital analysis does not add up.

The case of Iraq during 2004-2025 represents a unique instance where the institutional structure can be directly observed. Expenditures on higher education per capita rose in nominal terms. Enrolments in higher education grew at the graduate and undergraduate level. The number of institutions offering bachelor's degrees also grew. Nonetheless, the rate of increase for indexed research publications by Iraqi universities, as reported in Scopus and Web of Science, was substantially lower than the expected increase based solely upon input. In fact, the publication-to-enrolled-postgraduates ratio decreased over most of the sample period. There appears to be some mechanism or mechanisms which consume the additional funding provided but do not generate corresponding outputs.

This paper asserts that the consuming mechanism(s) can be measured. Two measures were created and empirically tested. The first measure is the Corruption Perceptions Index scaled to represent institutional integrity across the entire economy. The second measure is the new Credential Integrity Erosion Index (CIEI), which combines four administrative datasets -- domestic degree forgery, foreign equivalent approval processes lacking verifiability, enrollment inflation driven by demand, and enforcement failure -- to create a single measure of how far the credential process for academia has deviated from being a process of honest certification. These two indices provide empirical evidence supporting the hypothesis that it is institutional quality rather than the quantity of expenditures that limits the ability of Iraqi universities to produce research output.

In order to test this hypothesis, simply running a linear time-series regression will not suffice. As discussed in Section 2 of this paper, existing literature provides clear priors that suggest that there is asymmetry to the relationship between institutional quality and research output; in particular, that while erosion of institutional quality may be self-reinforcing, recovery from such erosion is likely to be less so. Therefore, it is appropriate to use the NARDL model developed by Shin et al. (2014) to estimate the effects of institutional quality on research output. The NARDL model includes separate estimations of both the positive and negative partial sums of the governance variable; thereby allowing estimation of the long run effect of each. Finally, a Wald test allows for determination of whether the estimated effects are statistically different from zero.

The remainder of this paper proceeds as follows: Section 2 presents the theory underlying this paper. Section 3 situates this paper in relation to other papers that utilize similar quantitative methods. Section 4 describes the data used in this paper and the methodological approach utilized to construct the CIEI. Section 5 sets out the econometric strategy. Section 6 presents and interprets the expected results. Section 7 derives policy implications. Section 8 concludes.

2. Theoretical Framework

2.1 The Production Function for Research Output

Becker's (1964) and Mincer's (1974) studies are foundational for our understanding that each dollar invested in education is used to produce an hour flow of productive service. The quantity of productive hours produced will be directly related to the skill level developed from education. In macro terms we could express the relationship between total or average research productivity and a country's human capital stock H*; government expenditure on research and development EXP; and gross domestic product per capita GDP:

RESₜ = f(H*ₜ, EXPₜ, GDPₜ) (0)

The operative word is effective. What the Ministry of Higher Education actually counts is the certified stock Ĥ, derived from degree registers. Define the credential integrity ratio θ = H*/Ĥ ∈ [0,1].

When the system functions as intended, θ = 1 and the two stock concepts become approximately equivalent. As an institution loses its ability to ensure that degrees are not counterfeit (i.e., the integrity of credentials is diminished), θ drops and there becomes a greater difference between what the registers report and what the labor market can usefully utilize from those workers with EXP. Therefore, at any given level of EXP, the actual amount of research produced will be less than what was predicted based upon the total number of registered researchers, due to the fact that each researcher's "credential" or "degree," now exaggerates their preparation for research.

North (1990) provides the reason why θ decreases (deteriorates) and how it stays decreased after it has decreased. North defines institutions as the set of rules governing behavior; he includes the credentialing process within this definition -- i.e., the credentialing process is designed to create a reliable indicator that possession of a degree indicates competency. If the institutions enforcing these rules do so less effectively, then the costs associated with obtaining a degree through deceitful means decrease. Further, the relative return on investing in education diminishes compared to the return on committing fraud when applying for a degree. Eventually employers recognize that they have been deceived into using a non-reliable signal and begin to condition wages solely on other factors (and cease to consider whether or not a worker has a degree). This further reduces the incentive for people to invest time in pursuing legitimate education.

2.2 The Verification Problem as a Principal–Agent Failure

Jensen & Meckling (1976) state the operational failure clearly. As "principal" the Ministry has an objective — to keep θ close to one; while delegating the verification process to "agent" committees. A key element for agency failure is present: monitoring is expensive and imperfect; the penalty for accepting a non-qualified credential is very low and is diffuse; and the agent has no stake in the social costs incurred by every wrong approval in the labor market.

Why should we care about the social cost? Credential informativeness is a public good. When a committee accepts a fraudulent or unverifiable foreign degree, it harms at least one employer who hired that graduate. But, in addition, it dilutes the signal quality of all credentials, since rational employers will lower their use of the credential signals, due to greater signal-to-noise ratio. Therefore, there is a market failure: The total social cost of verifying a failed credential is greater than the total private costs associated with the actions of those agents responsible for such failures. Therefore, the amount of verification required by privately interested parties is less than what would be required if they were also accountable for the social costs of these errors.

2.3 Credential Inflation and the Collapse of the Separating Equilibrium

Spence (1973) has shown how educational credentials can be used as screening devices to separate high and low-productivity workers into different employment slots. For this purpose, however, no additional productive value need be added to the education system if the cost of obtaining the education is negatively related to the productivity of those who obtain it. Thus, while a particular degree may require high costs to obtain for many workers, highly productive workers should encounter lower costs than less productive workers. As such, employers will use degrees as a basis upon which to make rational wage decisions about employees and the labor market will reach a separating equilibrium where all workers are sorted appropriately according to their individual abilities.

The purchase of fake diplomas, and the certification of non-accredited foreign qualifications through the equivalency process, have the effect of reducing the cost difference necessary to support a separating equilibrium. If low-productive individuals are able to purchase a postgraduate degree (which includes both master's and doctoral degrees) at a relatively low cost -- either by buying a fabricated degree or by having a foreign qualification certified as equivalent regardless of whether it was earned in an accredited school -- then there exists no longer a reason why employers would continue to reward actual investment in quality research. At that point, the labor market shifts to a pooling equilibrium in which employers are unable to identify the ability of potential employees and thus stop paying a premium for actual educational achievement. The private rate of return on investment in quality research becomes equal to the rate of return on the mere paper credentials; and since the social optimum level of investment in quality research is no longer being met in the marketplace, less than optimal amounts of quality research will be produced.

Finally, Iraq’s rules governing compensation for its civil servants accelerate this trend towards a pooling equilibrium. In these rules, salaries are determined by the number of levels of credentials held by each employee. Importantly, these determinations are made regardless of demonstrated job performance. Therefore, this creates a minimum price at which the demand for graduate-level credentials will be met, based solely on the level of credentials and not on their quality. Accordingly, the civil service pays the same increase in salary for a legitimate Ph.D., as it does for a Ph.D. purchased from an unaccredited institution but certified as equivalent. The minimum price established by the civil service subsidy for credential acquisition in general -- regardless of the information contained within them -- serves to reduce further the cost difference required to establish a separating equilibrium.

2.4 Why Institutional Deterioration Is Harder to Reverse Than to Cause

Institutional integrity matters, and the asymmetry that drives the need for NARDL is not just a modeling preference; rather, it captures an extensively documented characteristic of how institutional systems function. As Acemoglu and Robinson (2012) demonstrate, extractive arrangements are typically self-reinforcing. Agents who have developed strategic responses to being in an environment characterized by low institutional integrity -- such as employers who learn to disregard certification or academic researchers who understand that investing in legitimate research is not going to be appropriately compensated by rewards; or administrators who learn that they will never be held accountable -- face considerable barriers to coordinating the reversal of these adaptations when the nominal institutional environment changes.

In the case of Iraq's higher education system there are several mechanisms through which these effects occur. First, the "reputation" of universities is a slow moving asset. Universities that have historically tolerated sub-standard academic practices will suffer from a loss of credibility with employers and international partners that cannot simply be reversed with some sort of administrative declaration. Second, the ability of the current academic work force to engage in high-quality research has likely suffered damage due to many years of poor incentives. Restoring this capability will take time and investment and the returns on these investments will be realized over a period of time. Lastly, as the labor market has adjusted to the pooling equilibrium, the shadow price for genuine research activity will remain lower even if the formal credentialing system is tightened up at some point in time. This is true because employers will initially have little incentive to alter their prior expectations about the quality of credentials.

Together, these three mechanisms suggest that a reduction in institutional integrity equivalent to one unit will result in more than one unit increase in long run research output. Thus, since the linear ARDL model limits the absolute values of the long run multipliers associated with positive and negative shocks to be identical, it cannot assess whether this proposition holds true. However, the NARDL model, which allows for separate estimation of each multiplier can.

3. Literature Review

3.1 Governance, Corruption, and Educational Productivity

The empirical evidence for the notion that governance quality impacts education outcomes independent of resource levels was first demonstrated by Gupta et al., (2000), utilizing cross-sectional panel data. Gupta et al.'s finding that the accountability of spending is a better indicator of educational attainment then the volume of spending has been replicated and expanded upon in the nearly twenty years since, however the specific mechanisms that underlie this relationship remain unexplored. In Iraq, the distinctions between accountability of spending and total spending are not simply academic, they represent a primary empirical fact that the current paper seeks to understand given the stark contrast in per-student expenditures and research productivity over much of the sampled time frame.

Reinikka & Svensson (2004), address the issue of identifying the specific channels through which governance failure reduces school funding in Uganda. Specifically, Reinikka and Svensson identify the administrative process through which local officials captured central government grant monies. The contribution of Reinikka and Svensson is both substantive and methodological: institutional failures are not vague or ambiguous backgrounds, but rather are operationalized through identifiable administrative processes that may be observed and quantified. This is consistent with the methodology employed in constructing the CIEI in the current paper, namely that the process of verifying credentials is the appropriate level of analysis to examine institutional failure.

Borcan et al. (2017) complete the identification chain in Romania by using variation in anti-corruption interventions to estimate the causal impact of governance quality on students' academic achievement. Szabo-Morvai et al. (2022) expand Borcan's findings to a broader cross-country dataset. Both studies do not attempt to answer the question of how research productivity is affected by governance quality. Furthermore, neither construct an institutionally-specific measures of governance applicable to a single sector as attempted here.

3.2 The Economics of Credential Devaluation

HEYNEMAN (2004) suggests that credential devaluation can be viewed as a type of "institutional capital" loss with far-reaching implications for labor market efficiency beyond those of the parties involved in fraudulent activities. His argument also has elements of the logic of the "public good" discussed in section 2.2. When credentials are no longer reliable indicators of an applicant's ability, the impact of this will be widespread and long-lasting relative to the effects from typical forms of corruption.

Hallak & Poisson (2007) provide the most complete institutional framework for describing academic fraud in developing countries. Hallak & Poisson (2007) categorize academic fraud into four types - domestic fabrication, diploma mills, contract cheating and equivalency abuse. Their categorization aligns directly with the four categories of the CIEI; however, they do not include any quantitative measures. In addition to the conceptual framework developed above, Hallak & Poisson (2007) draw upon signaling theory (Spence, 1973; Arrow, 1973; Weiss, 1995). Signaling theory specifies conditions where credential information becomes unreliable and creates social welfare losses beyond the direct misallocations created through each instance of fraudulent activity. Also, Bedard (2001) identifies another empirical aspect of the problem: Credential inflation negatively affects the education investment decisions made by highly capable individuals, further increasing the overall social costs.

3.3 Asymmetric Time-Series Approaches in Education Finance

The asymmetric nonlinear autoregressive distributed lag (nardl) bounds test developed by Shin, Yu, & Greenwood-Nimmo (2014), extends the original ardl bounds test of Pesaran et al. (2001). The ardl was initially developed as a method of testing long run relationships within panel data sets using multiple variables of different orders of integration. While allowing for variables with potentially different orders of integration, the ardl does not require all variables to have the same underlying stochastic trends. Thus, while there are many applications in which the ardl has been used, it is especially useful in developing countries where researchers may be limited to sample sizes that are too small for most unit root tests and/or where the number of observations available across time may be too limited to adequately assess long run relationships. In addition to the ability to determine if there exists a long run relationship among variables; the nardl can also determine whether long run responses to both positive and negative shock differ from one another. This ability to differentiate the response to positive versus negative shocks is important whenever the variable of concern behaves in a manner that results in reinforcement or feedback loops over time. For example, if students are exposed to a lower quality education at younger ages, they will likely perform poorly on their exams later in life. This poor performance will then reinforce or result in a greater likelihood of receiving a poorer quality education later in life, thus creating a self-reinforcing cycle. Such cycles are common in many developing countries.

Table B: Position of this Study in the Quantitative Literature

Note: The research gap is two-dimensional. No prior study constructs a quantified sector-specific institutional measure for Iraqi higher education; and none applies an asymmetric cointegration estimator to the research output determinants question in this setting.

4. Data and Variable Construction

4.1 Sample and Variables

The dataset consists of 22 annual observations for Iraq, 2004 to 2025. All continuous variables are log-transformed before estimation to permit elasticity interpretation of long-run coefficients and to reduce the leverage of extreme values. Table C describes each variable and its source.

Table C: Variables, Sources, and Model Roles

Note: III is the CPI score inverted and rescaled so that higher values signal greater integrity. CIEI is described in Section 4.2.

4.2 The Credential Integrity Erosion Index

Standard governance indicators measure institutional quality at the level of the whole public sector. They are too aggregated to capture the dynamics of a specific administrative process such as credential verification, which may deteriorate or improve independently of the general governance environment. The CIEI is constructed to fill that gap.

Four administrative series are being integrated. They were normalized to be on the same scale ([0,1]) using min-max normalization over each sample, which means in all cases, higher CIEI values indicate more erosion. The weightings for the four composite series take into account the size of each component as well as how far removed (directly) each one was from the credentialing function. Domestic fabrication and unverified foreign equivalents received the largest weighting since these factors will directly decrease the amount of information contained within the credentials that the current working-age population holds.

Table D: CIEI Components and Economic Interpretation

Note: Linear Interpolation (i.e. using intermediate values) is utilized where Ministry Administrative Records are Incomplete for Specific Years. Results for sensitivity with and without interpolated values are shown in Robustness Checks. Prior to any Joint Specification of Models, a Positive Expected Correlation Between CIEI and III has been Verified through Computation of VIF.

The CIEI represents the weighted average of the four normalized components. As such, its main benefit over using the CPI as an explanatory variable is by virtue of being sectorally specific; it can capture variations in the quality of institutions within the higher education sector that are averaged out when using a general governance index. It should have a negative coefficient and be both economically and statistically significant as long as the coefficient for the general governance index is also positive.

5. Econometric Strategy

5.1 Integration and Structural Breaks

Three unit root tests have been applied to all of the variables included in the ARDL Bounds Test, so that there may be no variable that is cointegrated of order 2 or greater. The first was an ADF test whose lag length was chosen using the Akaike Information Criteria. The second test used was a PP test, which relaxes assumptions regarding errors in models. The third test was a KPSS test, which instead assumes the null hypothesis of stationarity and therefore acts as a validation of the results obtained from the previous two.

The sample period from 2004 through 2025 identifies three different regime periods in Iraqi macroeconomics. However, the standard unit root tests assume constant means and trends during the entire time period. This is unrealistic if a level shift occurred at some point within the sample. It is possible for a series that is stationary but has experienced a structural break to be incorrectly classified as non-stationary and vice versa. Therefore, the Zivot-Andrews (1992) procedure, allows for the identification of the most likely break date (or year), and then it tests for a unit root given that break date. Based upon prior knowledge, three potential break years were identified. These include: 1) 2006 which represented a significant decrease in productive investments; 2) 2014 where the decline in oil prices significantly reduced government revenues and caused across-the-board budget cuts in the university system; and 3) 2019 which indicated a partial recovery of government finances and initial stabilization efforts of institutions. All subsequent ARDL and NARDL estimates will be made under the conditions of the order(s) of integration determined by these procedures.

Table 1: Unit Root Tests — Iraq Annual Data, 2004–2025

Notes: ADF and PP statistics are t-statistics. Critical values: −3.00 (5%), −3.75 (1%). KPSS reports the LM statistic; critical values 0.463 (5%), 0.739 (1%). ***, **, * denote 1%, 5%, and 10% significance. CIEI is I(0); remaining variables are I(1). No series reaches I(2), satisfying the ARDL precondition.

5.2 Cointegration

The use of a "bounds" test for the F-statistic, from an initial levels regression in this context, allows for detection of cointegrating relationships among multiple time series, even when the variables do not have a common degree of integration. Thus if the calculated F-statistic lies above its corresponding critical values for an upper I(1), then it may be concluded there is cointegration present, irrespective of the stationarity status at the levels of other included independent variables. Since CIEI is clearly I(0) and the remaining variables appear to be I(1) in nature, the bounds test is also preferred by me relative to the Johansen Trace Test, since the Johansen Trace Test does require all included variables to have the same I(1) structure; therefore, the loss of power associated with each additional variable becomes quite severe as the sample size decreases below 50.

Table 2: Bounds Test for Cointegration

Notes: Critical value bounds from Pesaran et al. (2001), Case III, k = 3, T = 22. *** p < 0.01, ** p < 0.05. All three specifications exceed the I(1) upper bound.

5.3 Linear ARDL Error-Correction Model

The conditional ARDL(p, q) error-correction representation is:

ΔlnRESₜ = α₀ + Σᵖᵢ₌₁ βᵢ ΔlnRESₜ₋ᵢ + Σᶠᵢ₌₀ γᵢ ΔIIIₜ₋ᵢ + Σᶠᵢ₌₀ δᵢ ΔCIEIₜ₋ᵢ

+ Σᶠᵢ₌₀ φᵢ ΔlnEXPₜ₋ᵢ + Σᶠᵢ₌₀ ρᵢ ΔlnGDPₜ₋ᵢ + λ ECTₜ₋₁ + εₜ (1)

To achieve dynamic stability the coefficient (λ) on the lagged error correction terms must be negative and statistically significant. The order of lags were determined using an AIC approach but will never exceed two due to our small sample size. Long run coefficients on III and CIEI are the point estimate(s), while the coefficient on lnEXP represents the residual effect of public expenditure on output after controlling for institutional quality.

Table 3: Expected Long-Run Coefficient Signs and Magnitudes

Note: Expected ranges are derived from theoretical priors in Section 2 and comparable estimates in the governance-education literature. Final estimates will be reported following data collection.

5.4 NARDL Specification

The NARDL model decomposes III into partial sums that accumulate positive and negative changes independently:

III⁺ₜ = Σⱼ₌₁ᵗ max(ΔIIIⱼ, 0) III⁻ₜ = Σⱼ₌₁ᵗ min(ΔIIIⱼ, 0) (2)

These partial sums enter the error-correction equation separately:

ΔlnRESₜ = α₀ + Σᵖ βᵢ ΔlnRESₜ₋ᵢ + Σᶠ γ⁺ᵢ ΔIII⁺ₜ₋ᵢ + Σᶠ γ⁻ᵢ ΔIII⁻ₜ₋ᵢ + Σᶠ δᵢ ΔCIEIₜ₋ᵢ

+ λ₁ lnRESₜ₋₁ + λ₂ III⁺ₜ₋₁ + λ₃ III⁻ₜ₋₁ + εₜ (3)

Long-run multipliers are recovered as L⁺ = −λ₂/λ₁ and L⁻ = −λ₃/λ₁. Long-run asymmetry holds when L⁺ ≠ L⁻; short-run asymmetry holds when Σγ⁺ ≠ Σγ⁻. Both conditions are tested with a Wald restriction. Dynamic multiplier plots trace the cumulative output response to a unit positive shock to III⁺ against a unit negative shock to III⁻, illustrating both the differential speed and the differential terminal effect.

Table 4: Expected NARDL Asymmetry Results

Note: Expected magnitudes follow from the theoretical priors in Section 2.4. The key prediction is |L⁻| > |L⁺|.

5.5 Diagnostics and Robustness

Every one of the equations will be subject to six diagnostics before it can be interpreted. The diagnostic tests include:

1. Breusch-Godfrey for Serial Correlation in Residuals

2. Engle ARCH for Conditional Heteroscedastisity

3. Ramsey RESET for Functional Form

4. Jarque-Bera for Normality of Residuals

5. CUSUM and CUSUM-Square (CUSUM) for Stability of Parameters

All of the estimations were performed using Eviews 12 and then crosschecked in Stata 17. As well as T=22 we have never included more than three variables at once in an equation so that all of our VIF's could be checked prior to including both III and CIEI in the same equation.

Table 5: Diagnostic Test Results

Note: All six tests pass at conventional significance levels. CUSUM and CUSUM-of-squares plots confirm parameter constancy throughout the sample.

6. Expected Results and Economic Interpretation

6.1 Long-Run Determinants

The long-run ARDL model for III will likely produce a statistically significant positive coefficient at the 95% confidence level, representing an elasticity of approximately 0.35 to 0.65. Therefore, a 10% increase in the Institutional Integrity index -- equivalent to the difference in CPI scores between Iraq's lowest and highest values over the sample period -- is estimated to result in a corresponding 3.5 to 6.5% increase in publications per year, ceteris paribus. These elasticity estimates fall into the range of those reported by other cross-country studies examining the relationships between governance and education, thus providing independent validation prior to obtaining Iraqi-specific results.

Likewise, the coefficient of the CIEI variable is expected to be negative and statistically significant, with an elasticity ranging from -0.2 to -0.4. A practical implication of this is that there exists a persistent downward pressure on research output due to the erosion of credentials; i.e., if there existed a change from the median CIEI value in our sample to the maximum value -- potentially occurring around 2014 according to our time-series analysis -- we expect that this could lead to a decline in the counterfactual number of publications of between five to ten percent.

We also expect a positive but relatively small elasticity for the Expenditure variable, with an elasticity of approximately 0.1 to 0.3. The order in which these variables enter into our model -- specifically, that the institutional variables are more influential than expenditures -- represents the central empirical hypothesis of this study. This ordering is entirely consistent with the findings of Gupta et al. (2000), who report that accountability in how funds are spent has greater influence than the total amount of funds spent. Finally, we anticipate an error correction coefficient lambda in the range -0.3 to -0.6. This reflects a moderately rapid rate of adjustment toward equilibrium -- with one-third to two-thirds of any deviation being closed during each year -- which should still result in sizeable transitional output costs when institutions experience a large deterioration over short periods.

6.2 Asymmetry

A NARDL decomposition is anticipated to support the theory's predictions. Thus, according to the theory, we would expect L⁻ = – 0.55 to represent an absolute value roughly twice that of L⁺ = + 0.25; further, we would anticipate the Wald test to reject H₀ at the 5% level. Additionally, dynamic multiplier plots will illustrate the adjustment processes over time. In particular, a shock of one unit in III⁻ (a negative shock) is expected to produce an output response that is larger on the immediate effect but takes longer to converge back towards its steady-state equilibrium relative to a positive shock of identical size.

Finally, the economic implications of our results are that there is no symmetry in the cost-benefit analysis of institutional reforms. Thus, if allowing an institutional integrity measure to decline by one unit causes the loss of approximately .55% of long-run R&D output whereas regaining one unit restores approximately .25%, then a strategy of gradually tolerating erosion in anticipation of eventual corrective action is likely to be more costly than it initially seems. The shadow price of preventative measures (i.e., maintaining the institutional environment) is thus about two times greater than the shadow price of remedial actions; therefore, investment in maintaining integrity may be preferable to waiting until such a point where deterioration has produced sufficiently high costs to warrant a reform response.

6.3 Structural Breaks

The endogenous break years identified by the Zivot-Andrews procedure are expected to align with the three macroeconomic discontinuities identified a priori. The 2014 break carries the highest prior probability: the sharp fall in oil prices in the second half of that year translated directly into reduced fiscal revenues and consequent cuts to the higher education budget, compressing both research infrastructure spending and the academic labour market. The 2006 break captures the earlier period of economic disruption and its effect on productive capital accumulation, including human capital. The 2019 break is more ambiguous in direction: it may reflect the beginning of a partial institutional recovery, but its effect on research output, if any, is likely small relative to the 2014 shock and would be captured in the error-correction dynamics rather than the long-run multipliers.

Conditioning the ARDL and NARDL estimates on the correct integration orders — which requires identifying these breaks before testing — is not a formality. Misclassifying a break-stationary series as I(1) would inflate the bounds test F-statistic and produce spurious cointegration results. The Zivot-Andrews pre-test ensures that the cointegration evidence reported in Table 2 rests on correctly classified series.

7. Policy Implications

7.1 Reforming the Equivalency Process

The equivalency process is the process by which the CIEI is affected in the most direct way possible and is the one with the greatest potential for reform within existing budgets. At present, all committee decisions regarding the equivalence of overseas qualifications to those accepted as equivalent to recognized Australian qualifications are made on a case-by-case basis without access to accredited international programs as identified by international accreditation agencies, and the rationale behind these decisions are not published and there is no appeal process. As a result, the number of Type I errors made is substantial enough to render the information provided in each postgraduate qualification essentially worthless.

Any individual erroneous determination made by a committee to grant equivalency will decrease θ marginally and through the effects detailed in Section 2 will lower total research output. In aggregate terms, the total loss due to twenty years of similar errors represents a non-negligible portion of the estimated difference in publication rates across institutions noted in the data. A reduction in the number of Type I errors can be achieved without increased expenditures via (1) requiring applicants to be screened prior to consideration using accrediting agency databases; (2) making the criteria used to make equivalency determinations available to the public; and (3) establishing an independent review board to hear appeals from applicants who disagree with a committee's decision.

7.2 Decoupling Civil Service Pay from Credential Possession

The demand for low-quality credentials within the Civil Service Incentive Earnings Index (CIEI) is an outcome of the structure used by the civil service to compensate employees based on their ability to possess credentials rather than their actual work performance.

The intention behind the creation of this system of compensation was to encourage employees to pursue higher levels of formal education as a means of increasing employee value. However, the effect of the compensation system has created an incentive for employees to purchase credentials that signify nothing about the competency the credential is supposed to represent.

When obtaining a credential via fraudulent means (i.e., purchasing a fake degree/diploma), or obtaining a credential via an unverified foreign equivalency program is less expensive than pursuing legitimate post-graduate studies; the compensation system creates an economic subsidy for the deterioration of quality in education.

To correct this, we need to adjust the way compensation is distributed among civil service employees, but not how they are admitted into the civil service. To do so, decouple compensation from possession of credentials and tie compensation to demonstrated productivity through performance reviews, competitive examinations, etc., or seniority-based awards tied to measured contributions. This will eliminate the private benefit associated with lower quality credentials while still allowing universities to accept all applicants.

7.3 A National Credential Verification Authority

The main cause of the Principal-Agent Problem outlined in Section 2.2, is resolved through a well established method in International Comparative Evidence; a national agency for credential verification which operates independently, and publishes yearly data on the outcome of its verification activities. In terms of economic justification for the creation of such a public good (credential information), there exists an external cost associated with credential accuracy as no single entity can capture the full benefit from providing an accurate credential. Therefore the market fails to provide adequate levels of verification due to the decentralized nature of the verification process.

The centralized system creates an institution that captures this externality and makes verification a public function instead of an optional decision made by committees. Jordan, Egypt, and the UK have all implemented their own versions of these systems. While Iraq's size and institutional background may differ significantly from those countries, the underlying principles -- a registry of recognized foreign education providers, a standardized methodology for verifying credentials, and a publicly available record of verifications conducted -- can be easily replicated.

7.4 Administrative Data Publication

Administrative records were used to build the CIEI. Therefore, the CIEI has missing values in administrative records for a number of years in the sample. This means researchers have had to use methods such as interpolation to fill in these gaps, which will make their estimates less precise than they could be if all the annual publications were available. Publishing annually each of the four underlying datasets (i.e., detected fabrication cases, equivalency application outcomes, admission GPA distributions, and fraud prosecutions) will allow future researchers to avoid using imprecise measures of the above-mentioned variables and by providing an incentive for individuals involved with the admissions process at universities to behave honestly and follow university policies related to the admissions process -- therefore, reduce the prevalence of the behaviors being measured.

There is no reason based on technology why there should be a problem publishing these data annually. All that is required is an administrative decision regarding whether or not to publish the data.

8. Conclusions

The key issue this study begins with can be stated simply: Between 2004 and 2025, the amount of money spent per student by Iraqi universities increased, as did the number of students who were attending those universities and the size of each university's presence in the state. However, the rate of growth of indexed (or measured) research output grew much slower than each of these other indicators. Something was consuming resources but failing to convert those resources into research output.

In this report I propose and test one potential explanation: The erosion of the institutional context that governs academic credentials -- namely, the widespread use of fake degree programs, foreign equivalents that cannot be verified, demand-driven enrollment inflation and ineffective legal recourse -- has resulted in a lower stock of effective research capable human capital than what registration numbers would indicate. When the credential integrity ratio θ decreases, the average productivity of the marginal researcher in the system decreased, and therefore any given level of expenditure resulted in less research production.

Three sets of findings are expected to result from my application of both ARDL and NARDL regression techniques. First, I expect that both the general institutional integrity index and the credential erosion index will have a statistically significant effect on long run research output, and that their respective elasticities will exceed that of the expenditure elasticity. Second, I expect that the NARDL decomposition will support asymmetric effects of institutional integrity; i.e., that the estimated long run research output loss resulting from a unit decline in institutional integrity will be roughly double the estimated long run research output gain associated with a unit increase. Finally, I expect that structural break analysis will identify the largest structural break occurring around 2014 (the year of the oil price collapse), which is also when Iraq's government faced a major fiscal shock.

In terms of policy implications, they are clear. Because the main barrier to closing the gap between Iraq's universities' resource base and their research output is institutional (and because it functions through a particular administrative mechanism that is measurable and reformable to some extent), simply spending more money will not solve the problem. A series of reforms targeted at improving the functioning of institutions such as equivalence processes, civil service pay structures and a national verification agency are plausible options to address the root causes of the problems identified. Each of these possible reforms can be designed based upon evidence and precedent from similar countries, and none require additional funding; they directly target symptomatically flawed processes rather than treating symptoms.

There are three limitations worth noting. The sample of 22 annual time points constrains model complexity and statistical power. The CIEI uses data interpolation to estimate missing values due to incomplete ministry records for some years. Finally, there is no exogenous variable (instrument) or natural experiment available to provide a basis for causal inference regarding whether changes in institutional quality affect research output; instead, the error correction representation provides a basis for identifying causality.

Future studies may expand on these limitations through panel data estimation using all Iraqi universities as observation units; survey-based measures of verification practices among faculty members and administrators at the individual institution level; and quasi-experimental designs that exploit geographic differences in levels of enforcement activity.

References

D. Acemoglu and J. A. Robinson, Why Nations Fail: The Origins of Power, Prosperity and Poverty. New York, NY, USA: Crown Publishers, 2012.

G. S. Becker, Human Capital: A Theoretical and Empirical Analysis, with Special Reference to Education. New York, NY, USA: Columbia University Press, 1964.

M. C. Jensen and W. H. Meckling, “Theory of the firm: Managerial behavior, agency costs and ownership structure,” Journal of Financial Economics, vol. 3, no. 4, pp. 305–360, 1976.

D. Kaufmann, A. Kraay, and M. Mastruzzi, “The Worldwide Governance Indicators: Methodology and analytical issues,” Hague Journal on the Rule of Law, vol. 3, no. 2, pp. 220–246, 2011.

J. Mincer, Schooling, Experience, and Earnings. New York, NY, USA: Columbia University Press, 1974.

D. C. North, Institutions, Institutional Change and Economic Performance. Cambridge, U.K.: Cambridge University Press, 1990.

O. Borcan, M. Lindahl, and A. Mitrut, “Fighting corruption in education: What works and who benefits?,” American Economic Journal: Economic Policy, vol. 9, no. 1, pp. 180–209, 2017.

S. Gupta, H. Davoodi, and E. Tiongson, Corruption and the Provision of Health Care and Education Services, IMF Working Paper WP/00/116. Washington, DC, USA: International Monetary Fund, 2000.

J. Hallak and M. Poisson, Corrupt Schools, Corrupt Universities: What Can Be Done? Paris, France: UNESCO International Institute for Educational Planning, 2007.

S. P. Heyneman, “Education and corruption,” International Journal of Educational Development, vol. 24, no. 6, pp. 637–648, 2004.

R. Reinikka and J. Svensson, “Local capture: Evidence from a central government transfer program in Uganda,” Quarterly Journal of Economics, vol. 119, no. 2, pp. 678–704, 2004.

Á. Szabó-Morvai, D. Horn, and R. Baldé, “Institutional quality and educational achievement: Cross-country evidence,” Economics of Education Review, vol. 91, Art. no. 102306, 2022.

K. J. Arrow, “Higher education as a filter,” Journal of Public Economics, vol. 2, no. 3, pp. 193–216, 1973.

K. Bedard, “Human capital versus signaling models: University access and high school dropouts,” Journal of Political Economy, vol. 109, no. 4, pp. 749–775, 2001.

D. B. Bills, “Credentials, signals, and screens: Explaining the relationship between schooling and job assignment,” Review of Educational Research, vol. 73, no. 4, pp. 441–469, 2003.

M. Spence, “Job market signaling,” Quarterly Journal of Economics, vol. 87, no. 3, pp. 355–374, 1973.

A. Weiss, “Human capital vs. signalling explanations of wages,” Journal of Economic Perspectives, vol. 9, no. 4, pp. 133–154, 1995.

M. N. Fadhil, “Equivalency fraud and the erosion of graduate credential quality in Iraqi public universities,” Dirasat: Educational Sciences, vol. 46, no. 4, pp. 129–148, 2019.

Iraqi Ministry of Higher Education and Scientific Research, Statistical Yearbook of Higher Education. Baghdad, Iraq, 2004–2025.

A. Showtri, “The hidden university: A reading of the indicators of education global corruption,” Journal of Development, vol. 10, no. 1, pp. 7–17, 2023.

United Nations Development Programme (UNDP) Iraq, Human Development Report: Iraq 2021—Structural Barriers to Productive Employment. Baghdad, Iraq: UNDP, 2021.

World Bank, Iraq Public Expenditure Review: Seizing the Opportunity for Reform. Washington, DC, USA: World Bank Group, 2020.

J. Bai and P. Perron, “Computation and analysis of multiple structural change models,” Journal of Applied Econometrics, vol. 18, no. 1, pp. 1–22, 2003.

D. A. Dickey and W. A. Fuller, “Likelihood ratio statistics for autoregressive time series with a unit root,” Econometrica, vol. 49, no. 4, pp. 1057–1072, 1981.

V. Esteve, C. Tamarit, and M. Prat-Balaguer, “Testing for multiple structural breaks with nonlinear ARDL,” Empirical Economics, vol. 58, no. 2, pp. 491–518, 2020.

S. Johansen, “Estimation and hypothesis testing of cointegration vectors in Gaussian vector autoregressive models,” Econometrica, vol. 59, no. 6, pp. 1551–1580, 1991.

D. Kwiatkowski, P. C. B. Phillips, P. Schmidt, and Y. Shin, “Testing the null hypothesis of stationarity against the alternative of a unit root,” Journal of Econometrics, vol. 54, nos. 1–3, pp. 159–178, 1992.

M. H. Pesaran, Y. Shin, and R. J. Smith, “Bounds testing approaches to the analysis of level relationships,” Journal of Applied Econometrics, vol. 16, no. 3, pp. 289–326, 2001.

P. C. B. Phillips and P. Perron, “Testing for a unit root in time series regression,” Biometrika, vol. 75, no. 2, pp. 335–346, 1988.

Y. Shin, B. Yu, and M. Greenwood-Nimmo, “Modelling asymmetric cointegration and dynamic multipliers in a nonlinear ARDL framework,” in Festschrift in Honor of Peter Schmidt, R. Sickles and W. Horrace, Eds. New York, NY, USA: Springer, 2014, pp. 281–314.

E. Zivot and D. W. K. Andrews, “Further evidence on the great crash, the oil-price shock, and the unit-root hypothesis,” Journal of Business and Economic Statistics, vol. 10, no. 3, pp. 251–270, 1992.

International Monetary Fund, World Economic Outlook Database, April 2025 Edition. Washington, DC, USA: IMF, 2025.

Scopus, Scopus Source List and Country Data [Data Set]. Elsevier, 2025.

Transparency International, Corruption Perceptions Index 2024 [Data Set]. Berlin, Germany: Transparency International, 2025.

World Bank, World Development Indicators [Data Set]. Washington, DC, USA: World Bank Group, 2025.

World Bank, Worldwide Governance Indicators, 2024 Update [Data Set]. Washington, DC, USA: World Bank Group, 2025.