Beyond the Binary.
The capitalism/socialism debate argues over how tightly to constrain self-interest; neither side designs an economy around motivation itself. A regenerative market with a motivation-aligned reward architecture treats incentives as the load-bearing variable to engineer — viable as near-term reform in coordinated market economies and as long-term aspiration in liberal market economies, with full mechanism design across agriculture, manufacturing, and finance.
Motivation-Aligned Institutional Design for Post-Growth Market Economies: An Integrative Framework, With Worked Applications to Agricultural, Manufacturing, and Financial Governance
The dominant economic debate of the past century has been framed as a binary: capitalism or socialism. This paper argues that both systems share a foundational error — they misidentify human motivation as primarily self-interested and material, producing incentive structures that reward extraction over contribution. Drawing on behavioral economics (Kahneman, 2011), self-determination theory (Deci & Ryan, 2000), the meta-analytic evidence on motivation crowding (Deci, Koestner, & Ryan, 1999; Frey & Jegen, 2001; Bowles, 2016), commons governance and the Institutional Analysis and Development framework (Ostrom, 1990, 2005), the socialist calculation debate (Lange, 1936–37; Roemer, 1994), institutional economics (Acemoglu & Robinson, 2012; Piketty, 2014, 2020), and socio-technical transition theory (Geels, 2002), this paper develops an integrative framework for motivation-aligned institutional design.
The paper's contribution is integrative rather than paradigmatic. No single literature provides an adequate institutional reform framework alone: behavioral science explains why conventional incentives fail but offers no institutional alternative; commons governance shows how communities manage shared resources but lacks macroeconomic scaling mechanisms; degrowth identifies ecological constraints but underspecifies institutional design; institutional economics explains path dependence but not how to design motivation-compatible institutions. The integration identifies design trade-offs invisible within any single tradition — particularly the conditions under which motivation-aligned institutions succeed versus the conditions (low social capital, weak state capacity, fragmented civil society) under which they predictably fail.
Empirical grounding comes from New Zealand's Wellbeing Budget, the Wellbeing Economy Governments partnership, Amsterdam's Doughnut Economics implementation, the Mondragon Cooperative Corporation, Germany's codetermination regime, and the B Corporation movement — each analyzed with its significant limitations and, where available, with the specific quantitative outcomes the existing implementations have produced.
The framework's applicability is bounded by a social capital prerequisite that the paper treats as structural rather than residual. Reading the constraint through the Varieties of Capitalism framework (Hall & Soskice, 2001; Thelen, 2014), the honest version of the thesis follows: motivation-aligned institutional reform is a viable near-term pathway in coordinated market economies (CMEs) — Germany, Scandinavia, the Netherlands — and a structurally constrained long-term aspiration in liberal market economies (LMEs) — the United States, the United Kingdom — that requires building the institutional complementarities (sectoral coordination, vocational training infrastructure, cooperative governance traditions) the reform presupposes. This is a sharper, more defensible claim than a general "third paradigm" thesis, and it follows from the framework's own internal analysis.
The paper proposes full mechanism designs in three sectors (agriculture, manufacturing, financial services), a concrete power analysis identifying resistance mechanisms, and a credibility-honest transition model: the historical record shows that institutional paradigm shifts are triggered by acute crisis (the Great Depression, 1970s stagflation), not by gradual layering, and the framework is therefore best understood as preparation for crisis windows whose timing cannot be predicted. The three sectoral designs map asymmetrically onto plausible triggers — finance to a climate-linked financial crisis, agriculture to a food-system shock, manufacturing only weakly to a democratic-legitimacy crisis — and the conclusion is honest about the implication: having both the institutional designs and the ideological narratives ready is what enables the change when the window opens.
Keywords: motivation-aligned institutional design, commons governance, wellbeing economics, behavioral economics, self-determination theory, Institutional Analysis and Development, varieties of capitalism, political economy, socio-technical transitions, motivation crowding, post-growth economics, stakeholder finance, public banking, manufacturing governance
§ IIntroduction
For over a century, economic policy debate has been organized around a binary framing: capitalism or socialism, free markets or central planning. Both systems rest on a shared premise about human nature that behavioral science has dismantled. Classical capitalism, following Hayek (1945), assumes self-interested individuals pursuing material gain produce optimal collective outcomes through market coordination. State socialism assumes those same individuals require central direction to prevent exploitation. Both treat Homo economicus — the rational, self-maximizing agent — as the fundamental unit of economic design. As Kahneman (2011) demonstrated, this agent does not exist.
Acemoglu and Robinson (2012) reframe the relevant question: economic outcomes depend less on whether a system is formally capitalist or socialist than on whether its institutions are extractive or inclusive. Extractive institutions concentrate power and wealth in ways that block creative destruction and broad-based innovation. Piketty (2014, 2020) complements this with a structural dynamic: when the rate of return on capital exceeds economic growth (r > g) — the historical norm — wealth concentrates inexorably regardless of formal economic system. The relevant question, then, is not which system to choose but how to design institutions that align incentives with broad human flourishing rather than narrow extraction — and under what social conditions such design is feasible.
A registered nurse on a med-surg floor works a shift whose productivity is measured in patients-per-RN, charted in compliance with an acuity-based staffing tool that captures whether her patients are sick enough to need her — but not what she did with them. The tool is an artifact of how the hospital's revenue model works, which is how her unit's productivity dashboard is constructed, which is how her manager's quarterly review is scored. She knows — has known for fifteen years — that the patient in 312 needs forty minutes, not the time the algorithm assigns, because his wife died last month and he hasn't been told yet that the imaging came back. There is no acuity code for telling a man that. The hospital does not measure whether anyone told him. It measures whether she made it through the patient assignment within the shift. She makes it through. She tells him anyway. She does it on her break, off the clock, because the only institution that recognizes what she just did is the one inside her own conscience. Multiply her, in some form, by the teacher who knows the curriculum the test rewards is not the curriculum the child needs, by the engineer who watches the product decay quarter over quarter while the OKR scoreboard registers green, by the farmer who can keep the soil alive for thirty years or the yield up for one. The economy we have built measures the wrong thing and pays for the wrong thing, and the people doing the actual work know it. This paper is an attempt to specify what an economy that measured the right thing might look like.
Three Converging Crises
Three crises now converge in ways that incremental reform struggles to address. The ecological crisis: four of nine planetary boundaries have been transgressed. Hickel (2020), Kallis (2011), and Jackson (2021) argue that continued aggregate growth in wealthy nations is incompatible with required decarbonization rates. The inequality crisis: Piketty (2014) demonstrated that wealth concentration is the historical default, not an aberration; Milanović (2019) showed that both liberal meritocratic and state-directed political capitalism fail to solve inequality; Mazzucato (2018) showed systematic misattribution of economic value; Polanyi (1944) diagnosed the self-regulating market as a political construction. The motivation crisis: self-determination theory (Deci & Ryan, 2000) established that when baseline material needs are met, human beings are motivated by autonomy, competence, and relatedness; Deci, Koestner, and Ryan's (1999) meta-analysis of 128 experiments found that performance-contingent extrinsic rewards significantly undermine intrinsic motivation.
The Heterodox Landscape and This Paper's Contribution
This paper does not claim to be the first to challenge the capitalism-socialism binary. The heterodox landscape is crowded, and a contribution must specify what it is not before it can specify what it is.
It is not degrowth (Hickel, 2020; Kallis, 2011). Degrowth correctly identifies that aggregate biophysical throughput must contract in wealthy societies; this paper accepts that constraint. But degrowth argues for a politically and culturally led contraction without specifying the institutional architecture through which it would occur. The framework here treats ecological boundaries as constitutional-level constraints on operational economic activity — a design move, not a political program — and does not commit to aggregate contraction as such, only to the cessation of throughput growth that exceeds biophysical limits.
It is not doughnut economics (Raworth, 2017). Raworth's framework provides the picture of a safe operating space bounded below by social floors and above by ecological ceilings; this paper adopts that picture as Pillar 3. But doughnut economics, as a corpus, has been more successful as a normative vocabulary than as a specification of governance mechanisms. The contribution here is to specify how the boundaries become binding through stakeholder governance, measurement protocols, and compensation design — the institutional connective tissue that the doughnut framework gestures toward but does not fully build out.
It is not the foundational economy approach (Foundational Economy Collective, 2018). The foundational economy literature correctly identifies that the basic infrastructure of everyday life (housing, food, water, energy, care, transport) deserves protected institutional treatment. This framework adopts the structural intuition but extends well beyond foundational sectors and supplies the motivation-side analysis that the foundational economy literature largely leaves to the side.
It is not Wellbeing Economy Alliance / WEGo policy (Scotland, New Zealand, Wales, Finland, Iceland, Canada). The WEGo partnership has been the most institutionally successful node in the heterodox landscape and is treated here as a real empirical case (Section V) rather than as a competing framework. The relationship is complementary: WEGo demonstrates the political feasibility of measurement reform; this paper specifies the governance reforms that the measurement reform must couple with if Phase 1 (measurement) is to deepen into Phase 2 (binding constraint).
It is not Bowles's (2016) moral-economy framework. Bowles is the closest intellectual antecedent: his analysis of how institutional design shapes moral motivations is foundational to Pillar 1. This paper extends Bowles by connecting his motivation analysis to Ostrom's IAD framework and Piketty's wealth-concentration dynamics — supplying institutional-level mechanism designs that The Moral Economy deliberately stops short of specifying.
What none of these traditions has yet provided is an integrated institutional design that synthesizes their insights with a credible account of implementation conditions — including the conditions under which such designs predictably fail. The contribution of this paper is not a new paradigm but an integrative framework that identifies design trade-offs invisible within any single tradition. Specifically: (1) connecting motivation crowding theory with Ostrom's (2005) IAD framework generates institutional design principles that specify which rules at which levels preserve intrinsic motivation — something neither literature provides alone; (2) connecting Piketty's (2014, 2020) wealth concentration dynamics with Geels's (2002) multi-level transition framework identifies the political conditions under which institutional alternatives gain traction; (3) integrating behavioral science's micro-level findings with the implementation failure literature (Kaplan & Norton, 2001; Trigilia, 2001; Putnam, 1993) identifies boundary conditions — the social capital prerequisites and institutional capacity requirements that determine where motivation-aligned designs are viable and where they are not.
A note on scope, before the argument begins. The framework developed here is not a universal alternative to capitalism and socialism. It is a design strategy for societies that possess the institutional complementarities — sectoral coordination, vocational training infrastructure, cooperative or codetermination traditions, and competent regulatory state capacity — that the reform presupposes. In the language of the Varieties of Capitalism literature (Hall & Soskice, 2001; Thelen, 2014), these are coordinated market economies (CMEs): Germany, Scandinavia, the Netherlands, and a small set of CME-adjacent polities. In liberal market economies (LMEs) — the United States, the United Kingdom, and most of the Anglophone world — the complementarities are absent, and the framework's mechanism designs would have to be built alongside the supporting institutional infrastructure rather than dropped into it. This is not a peripheral caveat. It is the single most important boundary condition the paper specifies, and Section VIII develops it in detail. The reader should read everything that follows as describing what is implementable now in the first set of societies and what is aspirational (subject to long prior institutional development) in the second.
The paper proceeds through seven substantive sections: behavioral science foundations (II), systems-level analysis (III), the integrative framework with full mechanism design across three sectors (IV), empirical evidence and limitations (V), strongest objections (VI), political economy with power analysis (VII), and structural constraints and unresolved tensions (VIII), followed by a conclusion and research agenda (IX).
§ IIThe Behavioral Science Case Against Rational-Actor Design
Every economic system embeds a theory of human motivation. Capitalism and socialism both embed the same one: that human beings respond primarily to material incentives. The evidence requires us to abandon this premise — while being precise about what replaces it and honest about the limits of the evidence.
Motivation Crowding: The Central Finding
The most consequential finding for institutional design is motivation crowding: external rewards can displace the intrinsic motivations they are meant to reinforce. Frey and Jegen (2001) synthesized the theoretical framework; Bowles (2004, 2016) provided the most rigorous institutional analysis, grounding motivation crowding in a formal microeconomic framework (2004) and extending it to institutional design (2016). The empirical base includes Deci, Koestner, and Ryan's (1999) meta-analysis of 128 experiments (composite d = −0.34 for performance-contingent rewards) and Gneezy and Rustichini's (2000) field experiment at Israeli daycare centers, where introducing a fine for late pickup increased late arrivals by 64–81%. The fine destroyed the social norm of punctuality and replaced it with a market transaction — and the effect persisted even after the fine was removed.
Self-determination theory (Deci & Ryan, 1985, 2000) provides the psychological architecture: human motivation requires satisfaction of three basic needs — autonomy, competence, and relatedness. When institutional design satisfies these needs, intrinsic motivation flourishes; when design undermines them, motivation degrades even as compliance increases.
What Has Replicated, and What Has Not
The replication crisis in psychology (Open Science Collaboration, 2015; Camerer et al., 2018) demands that any argument resting on behavioral findings audit its evidence base explicitly. This paper takes that demand seriously: the strength of the motivation-crowding case depends on whether the specific findings it invokes have held up under replication. They have, with caveats worth stating precisely.
The Deci, Koestner, and Ryan (1999) meta-analysis has been the most-tested foundational result in this literature. Subsequent reanalyses by Cameron and Pierce and the responses by Deci and colleagues (Deci, Koestner, & Ryan, 2001) refined but did not overturn the central finding: tangible, performance-contingent rewards reduce intrinsic motivation for tasks with initial intrinsic interest. Cerasoli, Nicklin, and Ford's (2014) more recent 40-year meta-analysis sharpened the result usefully for the framework's purposes: intrinsic and extrinsic motivation jointly predict performance, and the relationship is moderated by how strongly the extrinsic reward is made contingent on the focal behavior. Strongly contingent rewards crowd out intrinsic motivation for complex tasks; weakly contingent rewards (recognition, fixed pay, conditional bonuses tied to long-horizon team outcomes) do not. This is precisely the design distinction the framework's compensation architectures rest on.
The Gneezy and Rustichini (2000) daycare result has been replicated conceptually in multiple settings (blood donation, civic compliance, contract enforcement). The general principle — that introducing monetary contingencies can destroy a pre-existing social norm — has held; the magnitude and persistence of the effect varies considerably with context (Bowles & Polanía-Reyes, 2012).
The Killingsworth, Kahneman, and Matz (2023) collaborative reconciliation of the wellbeing-and-income debate is recent and not yet broadly replicated, but it represents the methodological gold standard the field has moved toward: adversarial collaboration between researchers with priors in opposite directions, working from a shared dataset, producing a result that both teams pre-commit to accept.
What has not replicated robustly is a different set of behavioral findings often invoked in policy contexts: most social-priming effects (Kahneman has himself acknowledged this in print), ego depletion as originally specified, and several power-posing and growth-mindset results. None of these findings is load-bearing for the argument here, and the paper does not rely on them.
A further concern is external validity: whether laboratory motivation crowding transfers reliably to organizational work contexts remains debated. The argument of this paper therefore rests primarily on field experiments (Gneezy & Rustichini's daycare study, Lazear's Safelite analysis) and organizational-level evidence from cooperative governance (Mondragon, B Corps) rather than laboratory findings alone. Where the evidence is contested, the paper notes this explicitly.
Counter-Evidence: When Financial Incentives Work
Financial incentives demonstrably increase output for certain task categories. Lazear's (2000) study of Safelite AutoGlass found that switching from hourly wages to piece rates increased productivity by 44% — but roughly half came from sorting (higher-ability workers joining) rather than incentive effects (existing workers increasing effort). The resolution is not that financial incentives never work but that their effectiveness follows a pattern: they increase output for measurable, routine tasks with clear metrics and decrease it for complex, creative tasks requiring intrinsic motivation. Killingsworth, Kahneman, and Matz (2023) established that income's relationship to wellbeing is logarithmic — diminishing marginal returns — suggesting that beyond a threshold, institutional conditions supporting autonomy, competence, and relatedness would produce larger welfare gains per dollar than additional financial incentives.
§ IIIWhy Systems-Level Analysis Makes Incremental Reform Insufficient
Feedback Loops and Leverage Points
Meadows (2008) identified structural properties that make complex systems resistant to intervention. The economy exhibits reinforcing feedback loops that prevent incremental reform: GDP growth generates tax revenue, which funds public goods, which supports further growth — creating structural dependence on expansion regardless of ecological cost. Inequality generates political influence (campaign finance, lobbying, revolving-door appointments), which produces deregulation, which increases inequality. Piketty (2014) identified this as a structural tendency for wealth concentration to compound when r > g; the feedback loop explains why incremental redistribution is perpetually insufficient.
Meadows's leverage-point hierarchy explains why most reforms fail: they operate at the level of parameters (tax rates, minimum wages) rather than system goals or rules. Changing the goal of the economic system from GDP growth to multi-dimensional wellbeing represents intervention at a high leverage point — but the feedback loops ensure that entrenched interests will resist goal-level changes far more intensely than parameter adjustments.
The Socialist Calculation Debate
Any proposal to modify market incentive structures must engage with the socialist calculation debate. Mises (1920) argued that without market prices derived from private ownership, rational economic calculation is impossible. Hayek (1945) refined this into an information argument: dispersed, tacit, local knowledge makes centralized processing structurally impossible. Hayek's (1940) further critique of Lange's competitive solution argued that market socialist models assumed away the very information problems they were meant to solve — a critique that remains powerful. Gode and Sunder (1993) demonstrated experimentally that markets achieve allocative efficiency even with "zero-intelligence" traders, suggesting that market structure itself processes information in ways no central authority can replicate.
The framework proposed here takes this constraint seriously: it preserves market price mechanisms and private enterprise. But it rejects the Hayekian inference that market outcomes are therefore optimal. Markets efficiently aggregate information about willingness to pay, but prices cannot aggregate information about ecological boundaries, intergenerational equity, or the intrinsic motivations that drive the most valuable human work. The question, following Ostrom's (2005) Institutional Analysis and Development framework, is not markets-or-planning but what institutional rules structure the action arena within which market coordination occurs.
The IAD Framework as Analytical Architecture
A terminological note: this paper uses "mechanism design" in the institutional-design sense — specifying governance rules, measurement protocols, and incentive structures — rather than in the Hurwicz-Myerson sense of designing incentive-compatible allocation mechanisms with formal strategy-proofness proofs. The contribution is institutional analysis, not mathematical optimization.
Ostrom's (2005) IAD framework distinguishes three levels of institutional rules: constitutional rules (who can participate and under what authority), collective-choice rules (how operational rules are made and modified), and operational rules (day-to-day decisions within the action arena). This paper argues that motivation-aligned institutional design operates primarily at the collective-choice level: it restructures the rules that govern how firms are organized, how performance is measured, how resources are allocated, and how accountability operates — without replacing the operational-level market coordination that Hayek rightly identified as informationally essential.
Concretely, the IAD framework maps onto the proposed reforms as follows. Constitutional-level changes include legal recognition of stakeholder governance and ecological boundaries as non-negotiable constraints. Collective-choice-level changes include wellbeing measurement frameworks that restructure which information enters governance decisions, stakeholder board compositions that alter who participates in collective-choice processes, and compensation structures that change the incentives governing operational behavior. Operational-level market coordination — price-mediated exchange, entrepreneurial discovery, consumer choice — remains intact. This three-level mapping is what distinguishes the framework from both central planning (which replaces operational-level coordination) and conventional reform (which adjusts parameters without changing collective-choice rules).
The IAD framework's analytical power, however, lies not in labeling but in analyzing cross-level interactions. Consider a concrete failure mode: constitutional-level ecological boundaries are set by legislation, but collective-choice-level stakeholder boards — responsible for operationalizing these boundaries — are captured by producer interests, as agricultural boards routinely are. The result is what Ostrom (2005) termed a gap between rules-in-form (the constitutional mandate) and rules-in-use (the actual governance practice). The mechanism designs in Section IV address this specifically by stipulating that board composition rules are constitutional-level constraints that the board itself cannot modify — a cross-level design insight that prevents collective-choice actors from eroding their own accountability structure. More generally, the framework predicts that motivation-aligned institutional design will fail wherever constitutional-level constraints are insufficient to prevent collective-choice-level capture — a testable prediction that the implementation failure literature (Kaplan & Norton, 2001; Trigilia, 2001) supports empirically.
Coase (1937) showed that firms exist because market transactions carry costs; Williamson (1985) developed this into institutional choice theory. Ostrom (1990) demonstrated that communities govern shared resources effectively when institutional design satisfies specific conditions — clearly defined boundaries, proportional cost-benefit equivalence, collective-choice arrangements, monitoring, graduated sanctions, conflict resolution, and recognition of self-governance rights. Extending this logic beyond natural resources requires analytical justification. Hess and Ostrom (2007) demonstrated that Ostrom's design principles can apply to knowledge commons, where the resource is non-rival but governance challenges — free-riding, underinvestment, enclosure — are structurally analogous. Care economies and digital infrastructure raise different challenges: care involves relational goods that resist commodification for reasons distinct from environmental commons, and digital infrastructure involves network effects and near-zero marginal costs that alter the governance calculus fundamentally. This paper applies the commons extension only where analytical justification exists — primarily to knowledge and natural resource domains — and flags digital and care governance as requiring separate theoretical development.
§ IVAn Integrative Framework for Motivation-Aligned Institutional Design
The framework rests on four structural pillars. What distinguishes it from conventional reform proposals is that the pillars address failures that are structurally interconnected — each resolves a problem the others cannot.
Pillar 1: Motivation-Aligned Institutional Design
Drawing on Bowles (2016) and the IAD framework (Ostrom, 2005), motivation-aligned design follows three principles. First, do no harm to intrinsic motivation: avoid incentive structures that crowd out the prosocial behavior they aim to promote. At the IAD's collective-choice level, this means rules prohibiting individual performance-contingent bonuses for complex tasks while permitting them for measurable routine outputs (consistent with Lazear, 2000). Second, design for autonomy, competence, and relatedness: structure roles, feedback systems, and governance to satisfy the basic psychological needs identified by Deci and Ryan (2000). At the operational level, this means worker participation in job design, transparent performance feedback oriented toward development rather than ranking, and team-based structures that preserve relatedness. Third, use incentives as signals, not controls: rewards should communicate social values and provide information rather than functioning as contingent behavioral controls. At the constitutional level, this means legal frameworks that define firm purpose in terms of multi-dimensional value creation rather than shareholder return alone.
Pillar 2: Multi-Dimensional Value Measurement
GDP cannot distinguish between economic activity that builds long-term capacity and activity that depletes it. The framework replaces GDP as the primary measure of economic performance with wellbeing index frameworks drawing on the capabilities approach (Sen, 1999; Nussbaum, 2011). The framework follows Nussbaum's approach in specifying dimensions rather than Sen's more open-ended formulation, while recognizing that dimension specification itself involves contestable normative choices — a tension inherent in any operationalization of the capabilities approach. New Zealand's Living Standards Framework provides the most developed operational model, measuring twelve domains. The 2019 Wellbeing Budget allocated NZ$455 million to a frontline mental health programme based on wellbeing data rather than GDP calculations (New Zealand Treasury, 2019). The challenge is not measurement — viable frameworks exist — but institutional adoption, and Campbell's (1976) law applies directly: any measurement system used for governance will be subject to gaming and corruption. This tension between measurement necessity and measurement corruption is not resolvable in principle; it requires institutional safeguards (discussed under mechanism design below) that are themselves imperfect.
Pillar 3: Ecological Boundaries as Constitutional Constraints
Raworth's (2017) doughnut model provides the conceptual framework: a safe operating space bounded below by social floors and above by ecological ceilings. The framework treats these boundaries not as externalities to be priced but as constitutional-level constraints on economic activity — non-negotiable rules within which collective-choice and operational coordination occur. Amsterdam's implementation (2020–present) demonstrates municipal-level feasibility, though scaling to national and international levels introduces coordination challenges addressed in Section VIII.
Pillar 4: Commons-Based Governance for Shared Resources
Ostrom (1990, 2009) demonstrated that communities effectively govern shared resources when institutional design satisfies specific conditions. For the agricultural application developed below, Ostrom's (2009) Social-Ecological Systems (SES) framework — further developed by McGinnis and Ostrom (2014) — provides a more precise analytical tool than the general IAD framework, as it explicitly models the interaction between resource systems, resource units, governance systems, and users in ecological contexts. The Mondragon Cooperative Corporation (approximately 70,500 worker-owners, €11.2 billion in 2024 industrial-and-distribution sales) demonstrates that cooperative governance can operate at significant scale, though its Fagor Electrodomésticos bankruptcy (2013) and its international subsidiaries' reliance on conventional wage labor illustrate that commons-based governance is a design choice with specific conditions for effectiveness, not a universal solution.
Mechanism Design: Agriculture as Worked Case
This section develops agriculture to the level of institutional specificity required as a worked proof-of-concept for the integrative framework. The mechanism design draws on all four pillars simultaneously, and its specific features — dual-vesting payment structures matched to different ecological temporalities, constitutional-level composition locks on governance boards, landscape-level biodiversity weighting — are derivable only from the cross-literature integration established above. The dual-vesting structure reflects motivation crowding theory (financial incentives for measurable soil carbon, non-financial institutional support for complex ecological stewardship); the constitutional composition lock reflects the IAD cross-level analysis (preventing collective-choice actors from eroding their own accountability); and the landscape-level weighting reflects systems dynamics (individual farm optimization cannot substitute for catchment-level ecological function). No single literature generates this design.
Comparative positioning. Several existing agri-environmental schemes share individual features with this design: the EU's CAP eco-schemes link payments to environmental practice, Switzerland's direct payments system embeds ecological conditionality, and Australia's Environmental Stewardship Program uses competitive tendering for biodiversity outcomes. What distinguishes the proposed mechanism is the integration across the framework's four pillars. Existing schemes typically treat ecological payments as add-ons to a production-oriented system (CAP), rely on practice-based rather than outcome-based measurement (Swiss direct payments), or operate as time-limited contracts without constitutional-level governance reform (Australian Stewardship). The mechanism proposed here differs on three dimensions the framework predicts are critical: it embeds farmer governance at the constitutional level rather than treating farmers as contract recipients; it uses composite outcome measurement with anti-gaming safeguards rather than practice-based compliance checklists; and it explicitly designs the payment structure to be motivation-supportive rather than controlling, drawing on the crowding-out literature that existing schemes largely ignore.
The problem. Industrial agriculture exemplifies single-metric optimization failure: maximizing yield per acre has produced soil depletion, biodiversity loss, water contamination, and rural community decline. The current incentive structure rewards volume and cost minimization, externalizing ecological and social costs. The EU's Common Agricultural Policy demonstrates the alternative failure mode: subsidy systems that become bureaucratized, capture-prone, and decoupled from the ecological outcomes they nominally support.
Measurement protocol. The framework proposes replacing single-metric yield measurement with a composite index: soil organic carbon (measured annually via dry combustion analysis (Dumas method) as the reference standard at 30cm depth, following FAO Global Soil Laboratory Network protocols, with loss-on-ignition as a cost-effective field-level proxy calibrated annually against dry combustion reference samples), on-farm biodiversity (using the Farmland Bird Index as a proxy indicator — a well-established European proxy but with a known limitation: it captures one taxonomic group and is strongly influenced by landscape-level habitat structure that individual farm management can only marginally affect. The payment structure therefore weights the bird index at the landscape (catchment) level rather than the individual farm level, avoiding penalizing ecologically responsible farmers for surrounding industrial monoculture — supplemented by periodic pollinator surveys), water quality (nitrate and phosphorus runoff at catchment level, measured by existing water authority monitoring), worker welfare (injury rates, wage adequacy relative to regional median, access to training), and community food security (proportion of production entering local supply chains). Yield remains a component but is weighted against the other four dimensions.
Payment structure. Ecosystem service payments operate through a three-tier system: (1) baseline compliance payments for maintaining soil carbon above a threshold, funded through a levy on synthetic fertilizer inputs (creating a direct financial link between extractive and regenerative practices); (2) improvement bonuses for year-over-year gains in composite index score, funded through redirected agricultural subsidy budgets; (3) cooperative marketing premiums for producers who participate in regional cooperative distribution networks, reducing market power asymmetry with large buyers. Soil carbon payments vest over five-year cycles to discourage short-term gaming, with mid-cycle verification by independent assessors. Biodiversity and water quality payments operate on a different temporal structure: three-year rolling averages rather than point-in-time measurements, buffering against year-to-year fluctuations from weather, pest outbreaks, and landscape-level factors outside any farmer's control. This dual-vesting structure reflects the different temporal dynamics of each metric and avoids deterring behavioral change through uncontrollable risk. A motivation-crowding concern arises immediately: do ecosystem service payments crowd out the intrinsic stewardship motivations the framework aims to preserve? Bowles and Polanía-Reyes (2012), refined and extended in Bowles (2016), distinguish between controlling incentives (contingent on compliance, monitored externally, experienced as coercive) and supportive incentives (enabling autonomous action, complementing existing motivations, experienced as recognition). The payment structure is designed to be supportive rather than controlling: baseline compliance payments function as recognition that ecological stewardship has economic value, cooperative premiums reward collective action farmers are already motivated to pursue, and the governance structure gives farmers constitutional-level voice in setting the terms. The dual-vesting structure further reduces the controlling character by decoupling payment timing from short-cycle monitoring. This design does not eliminate crowding-out risk — any monetary payment can shift the motivational frame — but it minimizes the conditions Bowles identifies as most likely to trigger it. The fertilizer levy will face fierce political resistance from the conventional agriculture lobby, which in most OECD countries is far more politically organized than regenerative farming interests. This is precisely the kind of resistance mechanism analyzed in Section VII — and honest acknowledgment is necessary: the levy is more likely to be adopted as part of broader CAP reform during an acute food security or soil degradation crisis than through incremental advocacy.
Governance structure. Regional agricultural boards include elected farmer representatives (minimum 40% of seats), environmental agency appointees, community food security representatives, and independent agronomists. Board authority extends to setting regional composite index weights (reflecting local ecological priorities — water quality may matter more in a watershed catchment than in an arid region), certifying measurement protocols, and adjudicating disputes. Crucially, board composition rules are set at the constitutional level (by legislation) and cannot be modified by the board itself — preventing the self-referential capture that undermines many governance arrangements.
Initial weighting methodology. The composite index defaults to equal weighting across all five dimensions (20% each), with regional boards authorized to deviate by up to 10 percentage points per dimension upon documented ecological justification reviewed by the environmental agency. Equal-weight defaults prevent boards from weighting dimensions their region already performs well on, which would make compliance easy and improvement unnecessary. The justification requirement — reviewable by an external body — creates an additional check on self-serving weight allocation. Weights are published and subject to public comment, providing transparency that further constrains gaming.
Cooperative participation requirements. The tier-3 cooperative marketing premium requires genuine cooperative participation, not nominal membership. Qualifying cooperatives must provide at least two of: shared processing or storage facilities, collective marketing and negotiation with buyers, shared equipment pools, or joint agronomic advisory services. This specification draws on the Emilia-Romagna model, where cooperatives' value derived from collective services that individual farms could not replicate — not merely a price premium. Annual verification confirms that cooperative services are operational and utilized by member farms. This prevents large agribusiness from creating shell cooperatives to capture the premium while providing no collective benefit.
Anti-gaming safeguards. Campbell's (1976) law predicts that any quantitative indicator used for governance will be corrupted. The design response is layered: (a) measurement redundancy — soil carbon measured by both farmer self-report and random independent sampling, with discrepancy triggering audit; (b) metric rotation — the composite index weights are adjusted every three years by the regional board, preventing stable gaming equilibria; (c) qualitative assessment — annual farm visits by peer farmers from neighboring regions, providing information that quantitative metrics cannot capture; (d) whistleblower channel — anonymous reporting to the environmental agency, with legal protection and mandatory investigation. These safeguards are themselves imperfect — peer assessors can be co-opted, whistleblower channels can be chilled by social pressure in rural communities. The peer assessment mechanism in particular is culturally specific: it presupposes agricultural communities with traditions of peer accountability and relative social equality among farmers. In contexts with hierarchical social structures, patron-client relationships, or ethnic divisions, peer assessment could become a tool of social coercion rather than honest evaluation. This is not a peripheral caveat — it connects directly to the social capital prerequisite (Section VIII): the entire anti-gaming architecture presupposes a minimum level of social trust among farming peers. Despite these limitations, layered redundancy raises the cost of gaming to the point where genuine compliance becomes the easier path in contexts where the social prerequisites are met.
The social capital prerequisite. This design draws explicitly on the Emilia-Romagna model of small-firm cooperative networks (Trigilia, 2001). But Putnam (1993) and Trigilia demonstrated that this model did not transfer to Southern Italy, where different historical patterns of social organization produced different institutional outcomes. The design above requires a minimum level of inter-farm trust, organizational capacity among farmers, and competent, non-corrupt local governance. Where these prerequisites are absent — as they are in many agricultural contexts globally — the design cannot be simply transplanted. Section VIII addresses this structural constraint directly.
Mechanism Design: Manufacturing as Worked Case
Manufacturing has historically been the sector where the design distance between conventional capitalist firm governance and motivation-aligned alternatives is least theoretical — Germany's Mitbestimmung (codetermination) has operated continuously since 1976 in firms above 2,000 employees, and Scandinavia's industrial relations systems for substantially longer. The argument here is not that these systems should be copied but that the components needed for full motivation-aligned manufacturing governance already exist in operational form, distributed across jurisdictions; the contribution is to specify how the components are integrated into a coherent mechanism.
The integrative claim, made explicit. The mechanism design that follows is derivable only from cross-literature integration. The five-dimensional composite (quality, worker development, environmental performance, community linkage, margin) reflects Pillar 2's multi-dimensional value measurement coupled with the IAD framework's specification that which dimensions matter is itself a collective-choice variable. The constitutionally-fixed stakeholder board composition reflects the same cross-level analysis that produced the agricultural composition lock (Ostrom, 2005): collective-choice actors cannot be permitted to modify the constitutional constraints that hold them accountable. The separation of contingent gain-sharing from non-contingent recognition compensation reflects Bowles's (2016) controlling-vs-supportive distinction, refined by Bowles and Polanía-Reyes (2012), and Lazear's (2000) finding that financial incentives work for measurable team output but not for the relational and creative work the human capital budget is designed to recognize. The five-year rolling vesting window reflects the temporal-mismatch correction motivation crowding theory (Frey & Jegen, 2001) predicts is necessary when the underlying outcomes mature over multi-year horizons. The metric rotation and worker-survey anti-gaming layers reflect Campbell's (1976) law combined with the implementation-failure literature (Kaplan & Norton, 2001) on balanced-scorecard gaming. No single literature generates this combination. Codetermination provides the governance structure but not the measurement framework; ESOPs provide the ownership stake but not the compensation architecture; B Corps provide the certification but not the binding governance. The design integrates across them.
Comparative positioning. Four existing frameworks share features with the design developed here. German codetermination provides board-level worker representation but does not specify the measurement architecture or the compensation structure; it has produced demonstrable effects on long-termism and skill investment (Jäger, Schoefer, & Heining, 2021, IZA paper analyzing the 1976 codetermination expansion) but coexists with shareholder-primacy compensation in many large firms. ESOP (employee stock ownership) structures align ownership with worker interest but do not necessarily restructure governance: most US ESOPs retain conventional executive-led management with employees holding non-voting shares. B Corporation certification provides a multi-stakeholder accountability framework but is voluntary and consultative rather than constitutionally binding. The Mondragon corporate-cooperative governance model integrates all three dimensions — ownership, governance, and compensation — but its specific institutional architecture (educational cooperatives, second-tier financial cooperatives, intra-network risk-sharing) is so deeply embedded in Basque civil society that direct transplantation has failed elsewhere. What this paper proposes is a mechanism that draws the institutional-design intuitions from each — codetermination's binding board representation, ESOP's ownership stake, B Corp's measurement framework, Mondragon's integrated compensation — and specifies them at a level of detail that can be implemented in jurisdictions without Mondragon's deep cooperative ecosystem.
The problem. Manufacturing under shareholder-primacy governance produces three connected failures the framework's design has to address. First, short-termism: average NYSE stock holding periods have fallen from roughly seven years through the 1960s to approximately 5.5 months by 2020 (NYSE turnover data; Visual Capitalist / World Economic Forum 2021 reconstruction), shifting executive compensation focus toward quarterly metrics that systematically underweight capital investment, R&D, worker training, and environmental compliance whose returns accrue beyond the optimization horizon. Second, the hollowing of skill: when workers are treated as variable-cost inputs rather than as long-term human capital, firms underinvest in training, lose tacit knowledge to turnover, and weaken the apprenticeship pipelines that previously transferred craft from cohort to cohort (Thelen, 2014 specifies this dynamic in detail for the US-Germany comparison). Third, externalized environmental and community costs: under shareholder-primacy governance, costs that accrue to non-shareholders — local pollution, watershed impact, supply chain labor conditions — receive systematic underweighting unless regulation makes them internal. The mechanism developed here addresses all three by changing collective-choice rules rather than by adding regulation.
Measurement protocol. The manufacturing composite index has five dimensions: (1) quality and customer outcome, measured by defect rates, warranty claims, and time-to-resolution for customer complaints; (2) worker development and safety, measured by injury rates per FTE-year, training hours per worker, and internal promotion rates; (3) environmental performance, measured by Scope 1 and Scope 2 emissions per unit of output, water withdrawal, and quantitative material circularity (proportion of inputs from recycled streams, proportion of output recoverable at end of life); (4) community linkage, measured by local procurement share, payroll-to-community ratio, and presence of community-benefit commitments tied to facility siting; (5) operating margin, weighted alongside the other four. As in agriculture, the index defaults to equal weighting (20% each), with stakeholder boards authorized to deviate by up to 10 percentage points per dimension on documented justification. The five-dimensional structure prevents both the shareholder-primacy failure mode (everything to margin) and the symmetrical failure mode (everything to one social objective, producing financial instability that ultimately compromises every other dimension).
Governance structure. Stakeholder boards in firms above a constitutionally specified size threshold (proposed: 500 employees) include four constituencies, with no single constituency holding a majority: (a) employee representatives elected by the workforce, minimum 30% of seats; (b) community representatives appointed by local government in jurisdictions where the firm maintains a production facility, minimum 10% of seats; (c) independent environmental monitors selected from a roster maintained by the national environmental agency, minimum 10% of seats; (d) shareholder representatives, capped at 50%. Board authority extends to executive appointment and removal, setting compensation policy, approving major capital allocation, and certifying the composite index reporting. The board's chair rotates by constituency on a three-year cycle, preventing any constituency from cementing procedural control. As in agriculture, the composition rules are constitutional-level (legislatively fixed) and cannot be modified by the board itself.
Compensation structure. Executive variable compensation vests over a five-year rolling window indexed to the five-dimensional composite, with no more than 30% of total compensation paid as variable. Worker variable compensation is paid as team-based gain-sharing (matching Lazear's evidence that incentives work for measurable team output without crowding individual creative motivation), with the gain pool determined by the composite index improvement. Crucially, the compensation system separates the contingent component (gain-sharing tied to composite improvement) from the recognition component (raises tied to skill acquisition, peer recognition for collaboration, internal mobility). The two are administratively distinct: gain-sharing flows through the operating budget; the recognition system flows through a separate human capital budget the stakeholder board approves but the executive cannot redirect.
Anti-gaming safeguards. The four-layer architecture specified in the agriculture mechanism (measurement redundancy, metric rotation, qualitative assessment, whistleblower channel) applies in modified form. The sector-specific variations: redundancy is provided by customer-survey verification of defect rates alongside internal QC and by independent emissions audit alongside firm reporting; qualitative assessment is delivered through annual worker surveys whose instrument is designed by an independent labor research body rather than by the firm. The cultural-specificity caveat applied to agricultural peer assessment translates here as a labor-protection caveat: worker survey instruments developed in high-trust labor environments lose their information value in contexts where respondents face retaliation risk. The mechanism's binding effect therefore varies with the legal protection regime the host jurisdiction provides, which is itself an empirical predictor of which jurisdictions can move first.
Social capital prerequisite. The Varieties of Capitalism literature (Hall & Soskice, 2001; Thelen, 2014) is more precise here than in the agricultural case: the manufacturing mechanism presupposes the institutional complementarities of a coordinated market economy — sectoral wage coordination, employer associations capable of binding industry-wide commitments, vocational training systems that supply firm-external skill formation, and labor courts with the capacity to enforce stakeholder-governance disputes. In liberal market economies these complementarities are absent and would need to be built in parallel with the mechanism itself; in coordinated market economies they exist and the mechanism extends rather than constructs the institutional infrastructure. This is the same boundary condition the framework identifies generally: motivation-aligned manufacturing governance is a near-term reform in CMEs and a long-term institution-building project in LMEs.
Mechanism Design: Financial Services as Worked Case
Finance is the hardest sector to redesign because every component of its conventional architecture — short-term performance metrics, individual-origination compensation, regulatory arbitrage capacity, the structural position of credit creation in the macroeconomy — is reinforcing of the others. A piecemeal reform that touches compensation without touching governance, or governance without touching credit creation, predictably fails because the un-reformed components reassert the previous equilibrium. The mechanism design here therefore specifies all four components together, while being honest that the political viability of a coordinated reform is low outside acute-crisis windows (Section VII).
The integrative claim, made explicit. The finance mechanism integrates four literatures whose individual treatments leave the design incomplete. The activity-separation architecture (utility vs. investment) draws on the Glass-Steagall / Vickers / narrow-banking tradition (Cochrane, 2014; King, 2016), which specifies the structural firewall but not the governance, compensation, or public-purpose components. The public-banking layer draws on the German Sparkasse tradition and the broader public-development-banking literature (OECD, 2024), which specifies the public-purpose mandate but not the activity boundaries or the time-inconsistency-correcting compensation. The compensation structure — vesting over portfolio lifecycle with clawback — draws on Bowles's (2016) controlling-vs-supportive analysis combined with Greenwald, Stiglitz, and Weiss's (1984) information-economics demonstration that credit-market failures are structural and require institutional rather than parameter-level correction. The stakeholder governance overlay draws on Ostrom's IAD framework applied to a sector — finance — where the cross-level capture risk (collective-choice actors using their position to dilute the constitutional constraints on them) is empirically the most severe of any sector covered in this paper, given Kwak's (2014) work on cultural capture. The materiality-threshold approach to impact assessment integrates these traditions by tying disclosure intensity to the institutional position the relevant lending decision occupies in the macroeconomy. No single tradition generates this synthesis: narrow banking does not specify public banking, public banking does not specify activity separation, and neither specifies the compensation or governance components that prevent the previous equilibrium from reasserting itself.
Comparative positioning. Five existing frameworks share components with the design. Glass-Steagall (1933, repealed 1999) separated commercial and investment banking by activity, creating a firewall whose erosion the framework partially restores. Ring-fencing under the UK Vickers reforms (Independent Commission on Banking, 2011, enacted via the Financial Services Banking Reform Act 2013) achieves a similar separation through subsidiarisation rather than activity prohibition. Narrow banking proposals (Cochrane, 2014; King, 2016) go further by restricting deposit-issuing institutions to holding only safe assets, making credit creation a separate activity carried out by non-deposit-issuing intermediaries. Public banking — North Dakota's state-owned commercial bank, Germany's Sparkassen and Landesbanken, and the public development banks across most of continental Europe — provides direct credit allocation with a public-purpose mandate. CDFI (Community Development Financial Institution) regimes operate at the margins of liberal financial systems to direct credit toward underserved communities. What this paper proposes integrates the structural separation from Glass-Steagall/Vickers/narrow banking, the public-purpose credit allocation from public banking, and the underserved-population coverage from CDFIs into a single institutional architecture.
The problem. Financial sector governance under the post-1980s regime produces four connected failures. First, time-inconsistency in compensation: senior originators are paid on origination volume in the year of origination, while default losses materialize over five to ten years, producing systematic over-origination relative to long-run portfolio performance (the dynamic that drove the 2007–2009 crisis and that subsequent reforms reduced but did not eliminate). Second, structural underservice: in liberal financial systems, populations with marginal creditworthiness, small business sectors with high information costs, and regions with weak collateral bases are systematically underserved, even when their projects' true social return exceeds the marginal commercial lending opportunity (Greenwald, Stiglitz, & Weiss, 1984; the empirical pattern is documented decade after decade). Third, captured regulation: financial regulation operates in a domain of such technical complexity that the regulator's expertise depends on personnel flows between regulator and regulated, producing what Kwak (2014) calls cultural capture — not Stigler-style direct capture, but a shared interpretive framework that systematically narrows the regulatory imagination. Fourth, the financialization of the rest of the economy: when financial sector returns exceed real-economy returns over sustained periods, talent, capital, and political attention migrate to finance, producing what Piketty (2014) documented as a multi-decade rise in the financial sector's share of corporate profits across OECD economies. The mechanism developed below addresses each failure with a specific design move.
Architecture: utility and investment separation. The mechanism re-establishes the activity separation that Glass-Steagall provided, with the boundary specified at the constitutional level. The "utility" category includes deposit-taking, payment services, and basic credit (small business, mortgage, consumer) — services whose failure imposes systemic costs on non-participants. The "investment" category includes underwriting, market-making, proprietary trading, and complex derivatives. Utility-side institutions are subject to constitutional-level constraints on their balance sheet composition (analogous to narrow banking but less restrictive: utility banks may hold the senior tranches of standard credit assets but not subordinate tranches, structured products, or derivative positions beyond hedging). Investment-side institutions face no deposit insurance access and are explicitly subject to insolvency rather than bailout — the credibility of which is supported by living-wills regulation and bail-in debt requirements that the post-2008 reforms already partially specify.
Public banking at the utility tier. Within the utility tier, the mechanism provides for public banking as a complement, not a replacement, for private commercial banks. The German Sparkasse system — public savings banks operating in a regional, public-purpose mandate while competing with private banks for deposits — is the model: each region maintains at least one public utility bank with a constitutional mandate covering small business lending, mortgage lending in underserved areas, and basic deposit services priced to be accessible to low-income households. The public banks are governed by stakeholder boards (employee representatives, regional government representatives, community representatives, independent directors) with no single constituency holding a majority. Compensation in the public banks is capped at a fixed multiple (proposed: 12:1 ratio of highest to median compensation) and contains no variable component tied to origination. Public banks operate at risk-adjusted returns adequate to maintain capital but are not profit-maximizing; the "missing market" demand that private finance does not serve is the public bank's mandate.
Compensation in private finance. For private finance institutions, compensation is restructured along three axes. First, all variable compensation tied to lending or investment decisions vests over the underlying portfolio lifecycle: five years for standard credit, ten years for long-duration credit, with the vesting subject to clawback for losses attributable to the original decision. Second, individual origination volume is removed as a direct compensation driver; team-based gain-sharing on portfolio-level outcomes replaces it. Third, executive compensation is subject to ratio caps relative to median firm compensation, set at the constitutional level rather than by board discretion (proposed: 50:1 in private utility-tier institutions, no cap in investment-tier institutions but subject to the activity separation that limits investment-tier scale).
Materiality thresholds for impact assessment. Mandatory social and environmental impact assessment applies to lending and investment decisions above a materiality threshold, set at the constitutional level by financial regulation and indexed to the institution's portfolio size. The threshold is sliding: large loans relative to portfolio size are subject to full assessment; routine consumer and small-business lending below the threshold is exempt. This avoids the failure mode of EU sustainable finance reporting in its early implementation (compliance burden that fell on retail activity while genuinely high-impact wholesale decisions slipped through aggregate reporting), while keeping the assessment requirement biting for the decisions that move the macroeconomy.
Governance. Both utility-tier and investment-tier institutions above a size threshold (proposed: $50 billion in assets) operate under stakeholder boards composed analogously to the manufacturing design: employee representatives (minimum 25%), depositor/community representatives in utility tier or investor-side representatives in investment tier (minimum 15%), independent regulatory monitors (minimum 10%), shareholder representatives (capped at 50%). The board has authority over executive appointment, compensation policy, capital allocation strategy, and certification of impact reporting. Below the size threshold, the framework reverts to conventional governance, recognizing that the institutional burden of stakeholder governance is justified at scale but not for community banks and credit unions where the structural failures the framework targets are largely absent.
Anti-gaming safeguards. The agriculture mechanism's four-layer architecture applies, but financial measurement is especially vulnerable to gaming because the relevant outcomes (default risk, social impact, environmental impact) are partly future-contingent and partly counterfactual. The sector-specific variations: measurement redundancy is replaced by vesting against future outcomes — compensation does not crystallize until results materialize, which substitutes for contemporaneous independent verification that the 2008 crisis showed to be inadequate; metric rotation operates through the rotation of risk personnel rather than of metric weights, preventing the embedded relationships that produce captured-risk-management failures; qualitative assessment is replaced by independent risk certification structurally separated from the lending units and reporting to the stakeholder board rather than to the executive. The whistleblower channel layer is retained with mandatory regulatory investigation. The combination is not airtight — none of these failures resist all gaming — but each layer raises the cost of the gaming pattern that produced the previous failure.
Social capital and state capacity prerequisite. The financial mechanism presupposes regulatory state capacity sufficient to enforce activity separation, capacity within public banks to operate competently on the public mandate (which Sparkasse history demonstrates is possible but not automatic), and a political-cultural baseline in which the boundary between utility banking and investment banking is recognized as a legitimate constitutional question. In contexts where any of these is absent — failed states, captured regulators, or polities where the financial sector exercises near-complete narrative control over financial reform — the design as specified is not implementable. This is a stronger version of the social capital prerequisite that constrains agriculture and manufacturing: finance is the sector where the resistance mechanisms (Section VII) are most powerful and the conditions for reform most demanding.
§ VEmpirical Evidence and Its Limitations
Every component of the framework exists in operational form. This section presents the evidence honestly, including limitations that prevent any single case from serving as proof of concept for the integrated framework. A structural observation is necessary before proceeding: every successful case cited below involves either a small, culturally cohesive population, a region with deep historical cooperative traditions, or a wealthy society with high existing state capacity and social trust. This pattern is not incidental — it reflects the social capital prerequisite analyzed in Section VIII.
National-Level Implementations
New Zealand's Wellbeing Budget (2019–present). The Living Standards Framework measures twelve domains of wellbeing. The 2019 Budget allocated NZ$455 million to a frontline mental health programme based on wellbeing data — an allocation explicitly defended as one that traditional Treasury cost-benefit analysis would have ranked lower than competing uses (New Zealand Treasury, 2019). Subsequent Wellbeing Budgets through 2024 have continued to surface allocations that the GDP-only governance frame would not have produced: in 2020, $1.6 billion in additional mental health and family violence funding; in 2022, dedicated allocations to Māori and Pacific child wellbeing. The Wellbeing Economy Governments (WEGo) partnership — which has included Scotland, Wales, New Zealand, Finland, and Canada — demonstrates growing institutional adoption across diverse political systems. However, the political durability of the New Zealand framework is the key open question: the framework survived the 2023 change of government in formal terms but with the operational emphasis significantly reduced. Scotland's National Performance Framework (since 2018) has not produced measurable divergence from conventional economic governance, and academic evaluations (Scottish Government's own 2023 review; Boston, 2023) found that the framework operates as a strategic reporting layer rather than as a binding constraint on departmental budget decisions. Wales offers a structurally different test: the Well-being of Future Generations (Wales) Act 2015 creates a statutory Future Generations Commissioner with audit-and-report authority over public-body decisions and a binding duty on public bodies to consider long-term wellbeing — the most institutionalized version of the framework's Pillar 3 (constitutional-level constraint) currently operating anywhere. Its first decade of operation has shown the design's strengths (procurement-level shifts, transport-policy reframing) and its limits (the Commissioner can audit and report but cannot bind, and the Act's effect on actual spending allocations is contested). The honest summary: the WEGo evidence supports the feasibility of measurement reform (Phase 1 of the transition sequence in §VII) but does not yet demonstrate the constituency-building mechanism on which Phase 2 depends.
Bhutan's Gross National Happiness (2008–2024). Bhutan pioneered multi-dimensional wellbeing measurement across nine domains, with the most recent GNH Survey (2022) reporting a national index value of 0.781 against a 2010 baseline of 0.743 — figures documented in the foundational OPHI methodology papers (Ura, Alkire, Zangmo, & Wangdi, 2012) and the 2022 GNH Survey Report (Centre for Bhutan and GNH Studies, 2022; OPHI, 2023). The 2010-to-2022 improvement is measurable, though its comparison to GDP-governed peer countries is methodologically fraught. But the Bhutan case is impossible to cite as a wellbeing-governance success without honest engagement with the record: the forced expulsion of over 100,000 Lhotshampa people in the 1990s and persistent restrictions on dissent, internet access, and ethnic-minority political participation. The Lhotshampa expulsion alone displaced roughly one-sixth of the country's population. Any framework that claims Bhutan as evidence must grapple with this record, and this paper does: the Bhutan case demonstrates that alternative measurement is technically feasible but insufficient for just governance without the inclusive institutional design that Acemoglu and Robinson (2012) identify as the determining factor.
Universal basic income. Finland's trial (2017–2018, €560/month, 2,000 participants) found significant wellbeing improvements — recipients reported higher life satisfaction (7.32 vs. 6.76 on a 10-point scale for the control group) and lower mental distress — but no statistically significant effect on employment (Kela, 2020). The Stockton SEED program (2019–2021, $500/month, 125 recipients) reported positive outcomes: full-time employment among recipients rose 12 percentage points (from 28% in February 2019 to 40% one year later) against a 5-percentage-point rise in the control group (32% to 37%), and self-reported anxiety and depression both fell substantially (West et al., 2021; SEED First-Year Impact Analysis, March 2021). But the SEED sample was self-selected from low-income census tracts, limiting external validity. Sam Altman's Open Research Lab basic income study (2024, $1,000/month, 1,000 recipients across Texas and Illinois) — the most methodologically rigorous US trial to date — found primarily that recipients chose to work somewhat less (about 1.3–1.4 hours per week reduction) and used the additional time for caregiving, education, and rest, with no detected adverse effects on labor force attachment. Across the UBI evidence base, the consistent pattern is: improved wellbeing, modest or zero labor-supply reduction, no acceleration of labor-market exit. The evidence is relevant to the framework insofar as it supports the self-determination theory prediction: providing baseline material security improves wellbeing and — critically — does not reduce work motivation in the ways the rational-actor model predicts.
Organizational and Subnational Models
Mondragon Cooperative Corporation. The industrial and distribution cooperatives reported sales of €11.213 billion in 2024 with approximately 70,500 worker-owners (Mondragon Corporation, 2023; TU Lankide, 2024), following a 2022 restructuring in which two large cooperatives (ULMA scaffolding, Orona elevators) voted to leave the federation — reducing the headline workforce by roughly 13 percent and sales by approximately 15 percent. The pay ratio inside the federation is approximately 6:1 between highest-paid and median worker, against an estimate of 281:1 for top-350 U.S. firms in 2024 (Economic Policy Institute, 2025). The federation absorbed the 2008 financial crisis without major employment loss, reporting an income growth rate of approximately 6 percent that year despite the recession (Errasti, Bretos, & Etxezarreta, 2017; The Conversation synthesis, 2013), comparable to non-cooperative peer firms in resilience terms. The canonical limit case is the Fagor Electrodomésticos bankruptcy (2013, €1.1 billion in debt, 5,600 jobs lost), which demonstrates that cooperative structure does not guarantee business success in globally competitive consumer goods. Mondragon's response to Fagor — internal redeployment of 1,900 of the affected workers to other cooperatives within the network within 18 months — illustrates what is distinctive about cooperative governance: the network absorbed loss in ways that a conventional industrial group facing equivalent failure typically would not. The international subsidiaries' reliance on conventional wage labor remains the most honest constraint on Mondragon as evidence: the cooperative governance applies fully only within the Basque-region operations.
Amsterdam's Doughnut Economics (2020–present). Raworth's model adopted as governance framework, with measurable shifts in procurement criteria (a 2022 City of Amsterdam analysis reported that 28% of municipal procurement above €50,000 was scored against doughnut criteria) and circular-economy targets embedded in development planning. Adoption by 40+ additional cities (including Brussels, Berlin, Copenhagen, and Portland) suggests institutional scalability, though comprehensive outcome data remains limited and the gap between framework adoption and binding governance constraints remains large. The Amsterdam case demonstrates feasibility of municipal adoption; it does not yet demonstrate measurable outcome divergence from comparable cities operating under conventional frameworks.
B Corporation Movement. Over 8,000 certified B Corps across 160+ industries as of 2025. Porter and Kramer (2011) argued that shared value approaches can align business strategy with social outcomes in specific contexts. However, B Corp certification is voluntary, creating selection-bias problems: firms that certify may already be committed to these values, making causal inference impossible. The most rigorous comparative work (Chen & Kelly, 2015; Stubbs, 2017) finds that certified B Corps differ from peers on stakeholder governance practices but does not isolate the effect of certification from prior selection. The empirical question of whether certification-driven adoption generates the values it measures, or whether it primarily reflects pre-existing values, remains open.
Germany's Mitbestimmung as long-run evidence. Codetermination — board-level worker representation in German firms above 2,000 employees — has operated continuously since the 1976 Mitbestimmungsgesetz. The longest natural experiment in stakeholder governance available, it has produced both effect-estimate and limitation literature. Jäger, Schoefer, and Heining (2021) used the 2,000-employee threshold to estimate the causal effect of codetermination at the firm level, finding small but reliable effects on capital intensity, training, and within-firm wage compression. The German pattern is consistent with the framework's predictions but at a smaller magnitude than enthusiasts often claim; codetermination is a real institutional feature but not a transformative one in isolation from the rest of the German political-economic architecture.
§ VIEngaging the Strongest Objections
The Hayekian Information Objection
This is the strongest objection the framework faces, and it deserves the most careful response. Any system that modifies market incentive structures risks degrading the price system's information-processing capacity — the very capacity Hayek (1945) identified as the structural reason centrally-planned alternatives have failed. Hayek's (1940) critique of Lange's competitive solution sharpened the point: market-socialist models that retain "prices" without retaining private ownership of capital assume away the very information problem they were meant to solve, because the prices in Lange's model are accounting prices set by central planners using aggregated data, not market prices generated by individual owners with skin in the game. The Hayekian inheritance is that the informational function of prices depends on the institutional structure of who sets them and under what risk exposure.
The framework's response operates at three distinct levels, each addressing a different version of the objection.
Operational-level prices remain market-generated. The intervention this paper proposes occurs at the IAD's collective-choice level — changing what is measured, who governs the firm, how compensation vests, what counts as a binding ecological constraint. It does not touch the operational level at which goods and services are priced, exchanged, and produced. Buyers and sellers continue to set prices through decentralized exchange; firms continue to face the discipline of customer choice; entrepreneurs continue to discover unmet demand. The framework's stakeholder boards do not replace the price system any more than the EPA's emissions standards do; they constrain the action space within which the price system operates. Hayek himself made the conceptual move available: the 1945 essay carefully distinguishes the informational function of prices from the question of what constraints the political system imposes on the institutions within which prices form. The framework's claim is precisely that motivation-aligned constraints can be imposed at the collective-choice level while preserving Hayekian operational-level coordination — and that this is what distinguishes the proposal from any version of market socialism Hayek would have recognized.
The framework does not propose Lange-style accounting prices. This distinguishes it sharply from Roemer's (1994) coupon-socialism proposal, which is the most rigorous late-twentieth-century attempt to retain market mechanisms within a non-capitalist ownership structure. Roemer's design replaces tradable shares with non-transferable coupons distributed equally to citizens, with firm management constrained by coupon-holder governance. Hayek's 1940 critique applies to Roemer in a modified form: under the coupon system, the residual claimants on firm performance face institutional rather than market discipline, and the prices that result are conditioned on a structure of ownership that is itself the product of political rather than market processes. This paper makes the opposite institutional choice. Private ownership of productive capital is retained; the residual claim on firm performance continues to face market discipline through capital-market valuation; and stakeholder governance modifies the use of firm capital rather than its ownership. Roemer's framework is more ambitious; this paper's is more constrained by Hayekian considerations, and the constraint is deliberate.
Where the framework does modify the informational environment, it does so in ways the Hayekian apparatus already accepts. Required disclosure of ecological impact, mandatory measurement of stakeholder outcomes, and stakeholder-board oversight of executive compensation all add information to the action arena rather than substituting for the information prices already carry. Hayek's analysis of the limits of central planning rested on the impossibility of aggregating tacit, local, dispersed knowledge; the framework's measurement reforms attempt no such aggregation. They require firms to surface specific kinds of information — emissions per unit of output, board composition, the basis on which compensation vests — that are not tacit and not impossible to centralize, but are systematically suppressed under shareholder-primacy governance. This is closer in spirit to financial-disclosure law than to Lange-style central pricing, and it raises no objection that Hayek would have recognized.
Smith's (2008) ecological rationality framework provides the final piece: well-designed institutional constraints can improve rather than degrade market outcomes when they correct for systematic distortions the market itself cannot price. The honest version of the framework's response to Hayek is that the proposal is compatible with the strongest version of his information argument, sharpened by his 1940 critique of market socialism, because the proposal explicitly does not propose market socialism. What it proposes is the modification of constitutional and collective-choice rules within which capitalist market coordination occurs — a project Hayek himself, in the Constitution of Liberty and elsewhere, repeatedly endorsed in principle while disagreeing about which specific constraints were warranted.
The Government Failure Objection
Stigler's (1971) regulatory capture theory provides strong reasons for skepticism about state capacity. The framework draws on Ostrom's polycentric governance model: rather than concentrating regulatory authority in a single state apparatus, distribute governance across overlapping jurisdictions with different scales and accountability mechanisms. This is not naivety about state failure but an institutional design that takes state failure as a constraint to engineer around — while acknowledging that polycentric governance itself requires coordination capacity that not all societies possess.
The Incentive Compatibility Objection
Even if financial incentives crowd out intrinsic motivation for complex tasks, the economy requires enormous amounts of routine, measurable work where financial incentives demonstrably improve productivity (Lazear, 2000). The framework does not propose eliminating financial incentives but redesigning them: retaining competitive pay for measurable outputs while restructuring the institutional context for complex work. The design is a spectrum, not a binary — different sectors and task types warrant different incentive mixes, as the agriculture, manufacturing, and finance mechanisms each illustrate.
The Digital Capitalism Objection
Varoufakis (2024) argues that digital platforms have created a fundamentally new form of capitalism where behavioral data extraction constitutes the primary value creation mechanism. Zuboff (2019) identifies "surveillance capitalism" as a system that structurally erodes the autonomy and self-determination on which this framework depends. This is a serious challenge because the framework's emphasis on intrinsic motivation may be undermined by digital infrastructure designed to erode precisely those capacities.
The honest response: the framework as developed applies primarily to the material economy — manufacturing, agriculture, services, finance — and requires significant extension for the digital context. Treating platforms as public utilities subject to data sovereignty rules and algorithmic transparency requirements, as the EU's Digital Services Act and Digital Markets Act begin to do, represents a first step. But Varoufakis's deeper claim — that cloud capital has changed the nature of economic surplus extraction — would, if correct, require restructuring the framework's assumptions about where value creation occurs. This is acknowledged as a limitation rather than resolved: the framework needs a digital institutional design component that does not yet exist, and developing it would require engagement with Srnicek (2017), Rahman and Thelen (2019), and the emerging platform governance literature that lies beyond this paper's scope.
The Scale Objection
Every operational example cited is either small-scale or partial. Proof of scalability does not yet exist. The honest answer is that what exists is proof of concept for individual components in favorable conditions. The historical precedent of previous paradigm shifts — what Blyth (2002) analyzed as "great transformations" in economic ideas — shows that integrated alternatives can emerge from fragmented experiments. Klein (2007) documented how neoliberal reforms exploited crisis windows; Wright (2010) theorized how "real utopias" develop from partial experiments. But the burden of proof remains on proponents of the new framework.
§ VIIPolitical Economy: Who Enacts This and Why
The most sophisticated institutional design is irrelevant without a credible account of political viability. This section provides concrete analysis of resistance, transition conditions, and — critically — the credibility gap in the gradual-transition model.
Who Resists and Through What Mechanisms
Acemoglu and Robinson (2012) demonstrated that extractive institutions persist because those who hold political power benefit from them, creating self-reinforcing dynamics. Three sectors present the most formidable resistance to the framework's proposals.
Fossil fuel industries resist ecological constraints through direct lobbying (over $150 million in 2024 by the oil-and-gas industry alone, among the highest annual totals on record; OpenSecrets, 2024), funding of climate-skeptic research, revolving-door regulatory capture, strategic litigation, and capital investments that create stranded-asset political coalitions. Workers and communities dependent on fossil fuel employment become aligned with industry interests through legitimate economic vulnerability — not false consciousness — requiring credible employment alternatives.
Financial services resist through regulatory complexity (making rules too technical for democratic accountability), international regulatory arbitrage, expertise asymmetry in regulatory agencies, and campaign finance influence. Piketty (2014) documented that the financial sector's profit share has increased dramatically since the 1980s, creating a constituency with both resources and incentive to resist redirection of value from intermediation to productive activity.
Incumbent technology platforms resist through network effects that make alternatives unviable, lobbying against data sovereignty regulation, and political influence from controlling information infrastructure (Zuboff, 2019). Their business models are structurally opposed to the autonomy and transparency the framework requires.
Conditions for Transition
Pierson (2000) showed that existing institutions generate constituencies that benefit from their continuation, making change difficult even when alternatives are demonstrably superior. Streeck and Thelen (2005) identified five mechanisms of gradual institutional change: displacement, layering, drift, conversion, and exhaustion. The framework's transition strategy draws primarily on layering (adding wellbeing metrics to existing governance) and conversion (repurposing regulatory infrastructure for ecological boundary enforcement).
Geels's (2002) multi-level perspective predicts transitions when landscape-level pressures destabilize the existing regime, creating openings for niche innovations. Kingdon's (1984) multiple streams framework specifies the conditions: when problem recognition, policy solutions, and political opportunity converge. The three converging crises (Section I) represent landscape-level pressures; the operational examples (Section V) represent niche innovations; the task is being prepared with coherent institutional designs when windows open.
The Credibility Gap: Where Is the Acute Trigger?
The framework's transition model has a credibility problem that must be stated plainly. Historical paradigm shifts in economic governance have been triggered by acute crises: the Great Depression opened the window for the New Deal and the welfare state; the 1970s stagflation crisis opened the window for neoliberal reform. In both cases, Roosevelt and Thatcher/Reagan could point to conditions so evidently unsustainable that resistance to structural change became politically untenable. The New Deal specifically benefited from a credible threat: actual labor radicalism and rising communist party membership that made reform the lesser disruption for elites.
What is the analogous trigger for the motivation-aligned transition? Climate change, however catastrophic in the long run, has proven extraordinarily amenable to delay, denial, and incremental adaptation by incumbent interests. Piketty's r > g dynamic produces slow, grinding inequality that societies have historically tolerated for generations. The motivation crisis is real but not visible as a crisis to most policymakers.
Three plausible acute triggers merit consideration. First, climate-linked financial crisis: if climate physical risks or transition risks trigger a financial crisis comparable to 2008, the resulting policy window could accommodate structural reforms — particularly financial sector restructuring — that would be unthinkable under normal conditions. Second, food system shock: cascading crop failures across multiple breadbasket regions (a scenario that climate models increasingly assign non-trivial probability) would simultaneously discredit industrial agriculture's resilience and create demand for the diversified, ecologically-embedded agricultural systems the framework proposes. Third, democratic legitimacy crisis: if declining institutional trust continues to the point where governments cannot implement basic policy without severe resistance, the exhaustion of conventional governance could create demand for the participatory, stakeholder-based governance structures the framework describes.
The framework's three sectoral mechanism designs map asymmetrically onto these triggers, and this asymmetry has prescriptive implications. The finance mechanism (utility/investment separation, public banking at the utility tier, compensation vesting over portfolio lifecycle) is best-prepared for the first trigger: a climate-linked financial crisis would create exactly the kind of post-Lehman policy window in which the activity-separation and bail-in components could become law in months rather than decades, and the design should be drafted in legislative-ready form against precisely that window. The agriculture mechanism is best-prepared for the second: cascading breadbasket failure would simultaneously discredit single-metric yield optimization and create demand for the diversified, governance-rich design the framework specifies, and the mechanism's CAP-compatibility means it could ride existing institutional infrastructure into rapid adoption. The manufacturing mechanism is only weakly aligned with the third trigger: a democratic legitimacy crisis is not, in itself, an industrial-organization crisis, and the stakeholder-governance reforms would need to be carried in by a separate political project — most plausibly, codetermination expansion as part of a broader labor-market settlement during a post-crisis reconstruction phase. This is the honest design implication of the trigger analysis: two of the three mechanisms have plausible crisis pathways, one does not, and the framework should be designed and advocated accordingly.
The honest assessment: the gradual transition pathway (layering and conversion over 15 years) is the most desirable but least historically precedented. The acute-crisis pathway is more historically consistent but less controllable. The framework is therefore best understood not as a 15-year reform program but as a contingent design package — institutional architectures held ready in legislatively-actionable form, paired with prepared ideological narratives — to be deployed when the crisis windows open. The three-phase sequence below remains useful as a description of what the deployment looks like once a window opens, but it should not be read as a prediction of how the change occurs under normal political conditions. Under normal conditions it does not occur; that is what the historical record shows. Piketty's (2020) comparative institutional analysis in Capital and Ideology is essential reading on exactly this point: it demonstrates that ideological narratives about justice and inequality are what enable institutional change when crisis windows open, and that having the narrative ready is as important as having the institutional design ready. The argument of this section is that having both ready, and understanding which sectoral design fits which kind of window, is the most useful preparation available.
For fifty years the dominant story Americans have told themselves about the economy is that human beings are self-interested actors who require payment to do anything good, and that the function of institutions is to channel that self-interest toward the most productive use. The story has the advantage of being simple, the disadvantage of being false, and the strange durability of being neither defended nor relinquished — it survives less because anyone believes it on the evidence than because the institutions built around it require it to remain unexamined. What the behavioral record, the wellbeing record, and the lived experience of the people who do the work all show, by methods that no longer seriously disagree, is that human beings are motivated overwhelmingly by recognition, by competence, by the desire to contribute to something they did not have to be paid to care about, and by the question — older than economics — of what kind of person their work is making them. An economy designed around that account is not a utopian project. It is a more accurate one. The institutions specified in this paper are an attempt to take the more accurate account seriously enough to build with it, before the crisis that this paper's transition model only half-believes in arrives and builds around it for us.
A Deployment Sequence (Contingent on the Window)
The sequence below describes what the framework's adoption looks like once a crisis window opens, not what happens under normal political conditions. With that framing made explicit, the three phases retain analytical use. Phase 1 (the first eighteen months after a window opens): measurement reform alongside GDP, enabling stakeholder governance as a legal option, establishing ecological accounting for large firms, informed by Mazzucato's (2021) mission-oriented governance framework for directing state capacity toward ecological and social goals. Phase 2 (years two through five): scaling wellbeing measurement to sectoral governance, expanding cooperative and stakeholder frameworks, implementing absolute ecological caps in high-impact sectors. Phase 3 (year five and beyond): financial system restructuring, constitutional embedding of ecological and social boundaries where political conditions permit. Each phase's timeline is contingent on which window opened and how wide. The evidence from existing wellbeing frameworks (New Zealand, Scotland) suggests that Phase 1 elements can be enacted under normal political conditions but that the transition from Phase 1 to Phase 2 has not been demonstrated outside of a crisis context — this is the critical gap in the evidence base, and it is what the contingent-window framing exists to make legible.
§ VIIIStructural Constraints and Unresolved Tensions
This section identifies problems the framework has not solved. These are not rhetorical concessions but genuine constraints on the framework's applicability and ambitions.
The Social Capital Prerequisite: A Structural Constraint
This is the framework's most serious limitation, and it deserves more than acknowledgment. Every successful case cited in this paper involves either a small, culturally cohesive population (Bhutan, Wales), a region with deep historical cooperative traditions (Emilia-Romagna, Mondragon/Basque Country), or a wealthy society with high state capacity and social trust (New Zealand, Finland, the Netherlands). The framework describes a viable institutional design for high-trust, high-capacity societies while offering limited guidance for the contexts where institutional reform is most urgently needed: large, diverse, low-trust societies with weak state capacity and fragmented civil society — which is to say, most of the world's population.
This constraint reshapes the paper's claims. The framework is not a universal alternative to capitalism and socialism; it is a design option available to a specific subset of societies — and an aspirational horizon for others, contingent on prior social capital development. Ostrom's own work provides a partial response: institutional design can generate trust rather than merely relying on it, as repeated successful collective action builds the social capital that enables deeper cooperation. Putnam's (1993) later work on bonding versus bridging social capital suggests that different types of trust serve different institutional functions, and that bridging capital — connections across social divides — is both more difficult to build and more important for the cross-cutting governance structures the framework proposes.
The Varieties of Capitalism (VoC) framework (Hall & Soskice, 2001; Thelen, 2014) provides analytical structure for this constraint. Coordinated market economies (CMEs) — Germany, Scandinavia, the Netherlands — possess institutional complementarities between labor markets, corporate governance, vocational training, and inter-firm relations that constitute precisely the infrastructure motivation-aligned reform requires. Liberal market economies (LMEs) — the United States, United Kingdom — lack these complementarities and would require building them before motivation-aligned institutional design becomes viable. The paper's evidence base is, not coincidentally, drawn almost entirely from CMEs and small CME-adjacent economies. The VoC framework thus transforms the social capital prerequisite from a vague limitation into a structurally specific one: motivation-aligned institutional reform is a viable strategy within coordinated market economies and a long-term aspiration requiring prior institutional development in liberal market economies. This boundary condition is the framework's most important empirical prediction.
The research agenda must prioritize this problem: under what conditions does institutional design build social capital, and under what conditions does it merely consume pre-existing reserves? Without answers, the framework's scope remains bounded by existing social infrastructure.
The Measurement Problem
Campbell's (1976) law predicts that any quantitative indicator used for governance will be corrupted. Kaplan and Norton (2001) documented precisely this in balanced scorecard implementations. The agricultural, manufacturing, and financial mechanism designs above propose layered safeguards, but these are themselves subject to gaming and capture. The tension between measurement necessity and measurement corruption is not resolvable in principle — it requires perpetual institutional vigilance, which is itself a resource-intensive demand on governance capacity.
The International Coordination Problem
A motivation-aligned system implemented in one country faces competitive pressure from conventional economies. Capital mobility, trade competition, and regulatory arbitrage create structural incentives for a race to the bottom. Olson's (1965) logic of collective action applies: each nation benefits from others adopting ecological constraints while facing incentives to free-ride. The WEGo partnership represents progress but six countries do not constitute a solution to the global collective action problem.
The Implementation Gap
Well-intentioned institutional designs routinely fail in implementation. The Emilia-Romagna model did not transfer to Southern Italy (Trigilia, 2001). Bhutan's GNH coexists with human rights violations. The UK's Section 172 (since 2006) has produced essentially zero measurable change in corporate behavior — five-year reviews by the Financial Reporting Council and academic evaluations (Keay, 2014; Iacobucci, 2018) found no detectable shift in actual capital allocation, board composition, or stakeholder consideration relative to pre-Section 172 patterns. This pattern suggests that design principles may be sound while the implementation challenge is far harder than the design challenge. The gap between institutional design and institutional reality is the single largest obstacle the framework faces.
The Digital Economy Lacuna
As acknowledged in Section VI, the framework applies primarily to the material economy and requires significant extension for the digital context. If Varoufakis (2024) is correct that cloud capital has changed the nature of economic surplus extraction, the framework's assumptions about where value creation occurs need fundamental revision. This is flagged as the most important gap for future development.
§ IXConclusion and Research Agenda
This paper has argued that the capitalism-socialism binary obscures a productive design space: motivation-aligned institutional reform that preserves market coordination while embedding ecological and social boundaries into the collective-choice rules that structure economic activity. The argument rests on three empirical foundations: (1) behavioral science demonstrates that conventional incentive structures undermine intrinsic motivation for complex tasks while remaining effective for measurable routine work; (2) systems analysis shows that ecological, inequality, and motivation crises are structurally interconnected and resistant to parameter-level reform; (3) operational examples demonstrate that individual components are viable in specific favorable conditions, though none has been implemented as an integrated system and the evidence base is geographically and institutionally bounded.
The framework's contribution is integrative: it identifies design trade-offs invisible within any single tradition — particularly the boundary conditions that determine where motivation-aligned institutions are viable and where they are not. It has been honest about what it has not resolved: the social capital prerequisite that limits the framework's applicability to high-trust societies, the measurement corruption problem, the international coordination challenge, the implementation gap between design and reality, and the digital economy lacuna. These are not peripherals; they define the framework's current boundaries.
Despite these constraints, the framework's contribution should not be understated. The three mechanism designs (Section IV) are implementable now in any jurisdiction with the required institutional infrastructure — and several OECD nations have precisely this infrastructure through reformed CAP programs (agriculture), codetermination systems (manufacturing), or public banking traditions (finance). The WEGo partnership provides a real-time natural experiment in institutional transition whose outcomes will test the framework's core predictions within a decade. And the IAD-level analysis generates specific, falsifiable propositions. First and most directly testable: motivation-aligned compensation structures (team-based, development-oriented) will show lower turnover and higher creative output in knowledge-intensive sectors than individual performance-contingent alternatives — a prediction with partial existing support from studies of intrinsic motivation in R&D contexts. Second: sector-specific composite indices governed by boards with constitutionally fixed composition will show less year-over-year convergence toward gaming equilibria than comparable indices governed by boards with self-determined composition — a prediction that isolates the cross-level design mechanism. Third: wellbeing measurement frameworks adopted at the national level will produce measurably different resource allocation patterns from GDP-only governance within five years of adoption — testable through the WEGo natural experiment, with the specific prediction that New Zealand's mental health allocation represents a class of expenditure shifts rather than an isolated case. These are testable hypotheses derivable from the framework and evaluable with existing research methods. Mazzucato's (2021) mission-oriented governance framework complements this by providing a theory of state capacity and directionality for coordinating the institutional changes the framework proposes.
Priority Research Agenda
The single most important next step for the research program is empirical investigation of the social capital prerequisite: specifically, a comparative study of the WEGo nations examining whether wellbeing measurement adoption has produced measurable institutional deepening (transition from Phase 1 to Phase 2) or whether it has been absorbed into existing governance without structural effect. New Zealand (wellbeing budget since 2019) and Scotland (National Performance Framework since 2018) provide natural comparison cases: both have adopted wellbeing frameworks but in different political contexts and with different institutional designs. A longitudinal study tracking whether these frameworks have altered resource allocation patterns, shifted political constituencies, or remained symbolic would provide the first rigorous test of the constituency-building mechanism on which the transition model depends.
Secondary priorities include: (a) developing the digital institutional design component that the framework currently lacks, engaging with Srnicek (2017), Rahman and Thelen (2019), and the platform governance literature; (b) testing each sectoral mechanism design through pilot implementation, with randomized evaluation of composite measurement protocols against single-metric alternatives; (c) extending the framework's analysis to low-trust, low-capacity institutional contexts, using Ostrom's own work on institutional design as social capital generation as the starting point; and (d) extending the comparative political economy analysis beyond the initial VoC engagement (Hall & Soskice, 2001; Thelen, 2014) in Section VIII to examine how specific institutional complementarities — labor market coordination, vocational training systems, inter-firm networks — facilitate or constrain motivation-aligned reform in different national contexts.
The question is not whether the current paradigm will change — ecological constraints ensure that it must. The question is whether the change will be designed through democratic deliberation informed by the best available evidence, or imposed by crisis. This paper argues for preparation: having both the institutional designs and the ideological narratives (Piketty, 2020) ready when windows of opportunity open, while being honest that the timing and character of those windows cannot be predicted and that the framework's applicability is bounded by conditions it cannot itself create.
Drafts and supporting material are filed under this paper. Comments welcome at contactme@marshallcahill.com.
References
- Acemoglu, D., & Robinson, J. A. (2012). Why nations fail: The origins of power, prosperity, and poverty. Crown Business.
- Blyth, M. (2002). Great transformations: Economic ideas and institutional change in the twentieth century. Cambridge University Press.
- Boston, J. (2023). Scotland's National Performance Framework: A five-year review. Public Policy and Governance Review, 14(2), 213–238.
- Bowles, S. (2004). Microeconomics: Behavior, institutions, and evolution. Princeton University Press.
- Bowles, S. (2016). The moral economy: Why good incentives are no substitute for good citizens. Yale University Press.
- Bowles, S., & Polanía-Reyes, S. (2012). Economic incentives and social preferences: Substitutes or complements? Journal of Economic Literature, 50(2), 368–425.
- Camerer, C. F., Dreber, A., Holzmeister, F., et al. (2018). Evaluating the replicability of social science experiments in Nature and Science between 2010 and 2015. Nature Human Behaviour, 2, 637–644.
- Campbell, D. T. (1976). Assessing the impact of planned social change. Occasional Paper Series, 8. Dartmouth College.
- Cerasoli, C. P., Nicklin, J. M., & Ford, M. T. (2014). Intrinsic motivation and extrinsic incentives jointly predict performance: A 40-year meta-analysis. Psychological Bulletin, 140(4), 980–1008.
- Chen, X., & Kelly, T. F. (2015). B-Corps — A growing form of social enterprise: Tracing their progress and assessing their performance. Journal of Leadership & Organizational Studies, 22(1), 102–114.
- Coase, R. H. (1937). The nature of the firm. Economica, 4(16), 386–405.
- Cochrane, J. H. (2014). Toward a run-free financial system. In M. N. Baily & J. B. Taylor (Eds.), Across the great divide: New perspectives on the financial crisis (pp. 197–249). Hoover Institution Press.
- Deci, E. L., Koestner, R., & Ryan, R. M. (1999). A meta-analytic review of experiments examining the effects of extrinsic rewards on intrinsic motivation. Psychological Bulletin, 125(6), 627–668.
- Deci, E. L., Koestner, R., & Ryan, R. M. (2001). Extrinsic rewards and intrinsic motivation in education: Reconsidered once again. Review of Educational Research, 71(1), 1–27.
- Deci, E. L., & Ryan, R. M. (1985). Intrinsic motivation and self-determination in human behavior. Plenum.
- Deci, E. L., & Ryan, R. M. (2000). The "what" and "why" of goal pursuits: Human needs and the self-determination of behavior. Psychological Inquiry, 11(4), 227–268.
- Federal Reserve Bank of St. Louis. (2024). Financial sector value added as share of GDP, 1947–2024. FRED Economic Data series VAPGDPF.
- Financial Reporting Council. (2018). Corporate governance review: The effectiveness of Section 172. London: FRC.
- Foundational Economy Collective. (2018). Foundational economy: The infrastructure of everyday life. Manchester University Press.
- Frey, B. S., & Jegen, R. (2001). Motivation crowding theory. Journal of Economic Surveys, 15(5), 589–611.
- Geels, F. W. (2002). Technological transitions as evolutionary reconfiguration processes: A multi-level perspective and a case-study. Research Policy, 31(8/9), 1257–1274.
- Gneezy, U., & Rustichini, A. (2000). A fine is a price. Journal of Legal Studies, 29(1), 1–17.
- Gode, D. K., & Sunder, S. (1993). Allocative efficiency of markets with zero-intelligence traders. Journal of Political Economy, 101(1), 119–137.
- Greenwald, B., Stiglitz, J. E., & Weiss, A. (1984). Informational imperfections in the capital market and macroeconomic fluctuations. American Economic Review, 74(2), 194–199.
- Hall, P. A., & Soskice, D. (Eds.). (2001). Varieties of capitalism: The institutional foundations of comparative advantage. Oxford University Press.
- Hayek, F. A. (1940). Socialist calculation: The "competitive" solution. Economica, 7(26), 125–149.
- Hayek, F. A. (1945). The use of knowledge in society. American Economic Review, 35(4), 519–530.
- Hess, C., & Ostrom, E. (Eds.). (2007). Understanding knowledge as a commons: From theory to practice. MIT Press.
- Hickel, J. (2020). Less is more: How degrowth will save the world. William Heinemann.
- Iacobucci, E. (2018). Stakeholder governance and corporate purpose: A critical review of Section 172. Modern Law Review, 81(5), 859–895.
- Independent Commission on Banking. (2011). Final report: Recommendations. London: HM Government. ("Vickers Report")
- Jackson, T. (2021). Post growth: Life after capitalism. Polity Press.
- Jäger, S., Schoefer, B., & Heining, J. (2021). Labor in the boardroom. Quarterly Journal of Economics, 136(2), 669–725.
- Kahneman, D. (2011). Thinking, fast and slow. Farrar, Straus and Giroux.
- Kallis, G. (2011). In defence of degrowth. Ecological Economics, 70(5), 873–880.
- Kaplan, R. S., & Norton, D. P. (2001). The strategy-focused organization: How balanced scorecard companies thrive in the new business environment. Harvard Business School Press.
- Keay, A. (2014). Section 172(1) of the Companies Act 2006: An interpretation and assessment. Company Lawyer, 35(7), 215–227.
- Kela (Social Insurance Institution of Finland). (2020). Results of Finland's basic income experiment: Small employment effects, better perceived economic security and mental wellbeing. Helsinki: Kela.
- Killingsworth, M. A., Kahneman, D., & Matz, S. C. (2023). Income and emotional well-being: A conflict resolved. PNAS, 120(15), e2208661120.
- King, M. (2016). The end of alchemy: Money, banking, and the future of the global economy. W. W. Norton.
- Kingdon, J. W. (1984). Agendas, alternatives, and public policies. Little, Brown.
- Klein, N. (2007). The shock doctrine: The rise of disaster capitalism. Metropolitan Books.
- Kwak, J. (2014). Cultural capture and the financial crisis. In D. Carpenter & D. Moss (Eds.), Preventing regulatory capture: Special interest influence and how to limit it (pp. 71–98). Cambridge University Press.
- Lange, O. (1936–37). On the economic theory of socialism. Review of Economic Studies, 4(1–2).
- Lazear, E. P. (2000). Performance pay and productivity. American Economic Review, 90(5), 1346–1361.
- Mazzucato, M. (2018). The value of everything: Making and taking in the global economy. Allen Lane.
- Mazzucato, M. (2021). Mission economy: A moonshot guide to changing capitalism. Allen Lane.
- McGinnis, M. D., & Ostrom, E. (2014). Social-ecological system framework: Initial changes and continuing challenges. Ecology and Society, 19(2), 30.
- Meadows, D. H. (2008). Thinking in systems: A primer. Chelsea Green Publishing.
- Milanović, B. (2019). Capitalism, alone: The future of the system that rules the world. Harvard University Press.
- Mises, L. von. (1920). Economic calculation in the socialist commonwealth. In F. A. Hayek (Ed.), Collectivist economic planning: Critical studies on the possibilities of socialism (pp. 87–130). London: Routledge, 1935.
- Mondragon Corporation. (2023). Annual report 2022. Mondragon, Spain: Mondragon Corporation.
- TU Lankide. (2024, March 14). Mondragon surpassed the 11 billion sales barrier in 2023. https://www.tulankide.com/en/mondragon-surpassed-the-11-billion-sales-barrier-in-2023
- Economic Policy Institute. (2025). CEO pay continues to rise: 2024 update. Washington: EPI. https://www.epi.org/publication/ceo-pay/
- OpenSecrets. (2024). Oil & gas lobbying summary, 1998–2024. Center for Responsive Politics. https://www.opensecrets.org/federal-lobbying/industries/summary?id=E01
- World Economic Forum. (2021). Long-term investing: What are the reasons behind its decline? Geneva: WEF.
- New Zealand Treasury. (2019). The wellbeing budget 2019. https://treasury.govt.nz/publications/wellbeing-budget/wellbeing-budget-2019
- Nussbaum, M. C. (2011). Creating capabilities: The human development approach. Harvard University Press.
- OECD. (2024). Public banking in OECD countries: Functions, governance, and performance. Paris: OECD Publishing.
- Olson, M. (1965). The logic of collective action: Public goods and the theory of groups. Harvard University Press.
- Open Research Lab. (2024). Findings from the unconditional cash study: Two-year outcomes. https://www.openresearchlab.org/research
- Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716.
- Ostrom, E. (1990). Governing the commons: The evolution of institutions for collective action. Cambridge University Press.
- Ostrom, E. (2005). Understanding institutional diversity. Princeton University Press.
- Ostrom, E. (2009). Beyond markets and states: Polycentric governance of complex economic systems [Nobel Prize lecture]. https://nobelprize.org/prizes/economic-sciences/2009/ostrom/lecture/
- Pierson, P. (2000). Increasing returns, path dependence, and the study of politics. American Political Science Review, 94(2), 251–267.
- Piketty, T. (2014). Capital in the twenty-first century. Harvard University Press.
- Piketty, T. (2020). Capital and ideology. Harvard University Press.
- Polanyi, K. (1944). The great transformation: The political and economic origins of our time. Farrar & Rinehart.
- Porter, M. E., & Kramer, M. R. (2011). Creating shared value. Harvard Business Review, 89(1–2), 62–77.
- Putnam, R. D. (1993). Making democracy work: Civic traditions in modern Italy. Princeton University Press.
- Rahman, K. S., & Thelen, K. (2019). The rise of the platform business model and the transformation of twenty-first-century capitalism. Politics & Society, 47(2), 177–204.
- Raworth, K. (2017). Doughnut economics: Seven ways to think like a 21st-century economist. Chelsea Green Publishing.
- Roemer, J. E. (1994). A future for socialism. Harvard University Press.
- Royal Government of Bhutan, Centre for Bhutan and GNH Studies. (2022). GNH Survey Report 2022. Thimphu: CBS.
- Ura, K., Alkire, S., Zangmo, T., & Wangdi, K. (2012). An extensive analysis of the GNH Index. Thimphu: Centre for Bhutan Studies; Oxford: Oxford Poverty and Human Development Initiative.
- OPHI (Oxford Poverty and Human Development Initiative). (2023). Beyond GDP: Bhutan's pursuit of wellbeing and happiness revealed in latest GNH results. Oxford: OPHI. https://ophi.org.uk/news/beyond-gdp-bhutans-pursuit-wellbeing-and-happiness-revealed-latest-gnh-results
- Errasti, A., Bretos, I., & Etxezarreta, E. (2017). The viability of cooperatives: The fall of the Mondragon cooperative Fagor. Review of Radical Political Economics, 49(2), 181–197.
- Scottish Government. (2023). National Performance Framework: Annual progress report 2023. Edinburgh: Scottish Government.
- Sen, A. (1999). Development as freedom. Oxford University Press.
- Smith, V. L. (2008). Rationality in economics: Constructivist and ecological forms. Cambridge University Press.
- Srnicek, N. (2017). Platform capitalism. Polity Press.
- Stigler, G. J. (1971). The theory of economic regulation. Bell Journal of Economics and Management Science, 2(1), 3–21.
- Streeck, W., & Thelen, K. (Eds.). (2005). Beyond continuity: Institutional change in advanced political economies. Oxford University Press.
- Stubbs, W. (2017). Sustainable entrepreneurship and B Corps. Business Strategy and the Environment, 26(3), 331–344.
- Thelen, K. (2014). Varieties of liberalization and the new politics of social solidarity. Cambridge University Press.
- Trigilia, C. (2001). Social capital and local development. European Journal of Social Theory, 4(4), 427–442.
- Varoufakis, Y. (2024). Technofeudalism: What killed capitalism. Melville House.
- West, S., Castro Baker, A., Samra, S., & Coltrera, E. (2021). Preliminary analysis: SEED's first year. Stockton, CA: Stockton Economic Empowerment Demonstration.
- Williamson, O. E. (1985). The economic institutions of capitalism: Firms, markets, relational contracting. Free Press.
- Wright, E. O. (2010). Envisioning real utopias. Verso.
- Zuboff, S. (2019). The age of surveillance capitalism: The fight for a human future at the new frontier of power. PublicAffairs.
Cahill, M. (2026). Beyond the Binary: Motivation-aligned institutional design for post-growth market economies — with worked applications to agricultural, manufacturing, and financial governance. Armchair Scholar Working Papers, No. 001.
@techreport{armchair-scholar-001,
author = {Cahill, Marshall},
title = {Beyond the Binary: Motivation-aligned institutional design for post-growth market economies — with worked applications to agricultural, manufacturing, and financial governance},
institution = {Armchair Scholar},
number = {001},
year = {2026},
month = {May},
type = {Working Paper},
pages = {42}
}