
Apr 27, 2025


Data as Capital in TAI Economies: A Biomorphic Framework

Isobel (Bella) Smith

This research introduces a novel theoretical framework for understanding data's role in Transformative Artificial Intelligence (TAI) economies. Traditional economic models inadequately capture data's unique properties in TAI systems: its non-rivalry, increasing returns, variable depreciation, and context-dependent value. We develop a "biomorphic framework" that conceptualizes data as both economic capital and a quasi-biological entity, drawing from economics, biology, quantum physics, and complex systems theory. Through case studies in financial markets and healthcare, we demonstrate the framework's explanatory power and derive policy implications for data governance that account for data's unique properties in advanced AI systems.


Reviewer's Comments

Axel Backlund

An interesting perspective on how data is a unique type of good, and how its properties in a TAI economy differ from those of capital!

The primary strength lies in the observations made about how data is unique compared to capital, particularly the section on why traditional economic models fail. The statements are easy to understand, backed with good examples, and form a strong framework that is used throughout the report.

I also find the policy and governance implications interesting, although some more detail here would have been great – I would have recommended expanding this section at the expense of the metaphors described in section 3 (the biology analogy is sufficient, having the quantum example is not equally value-adding). The text would also have benefitted from being more consistent in what terms are used. For example, in the case studies, both "value superposition" and "multi-domain value" are used, and seem to mean similar things. Also, I am not sure if it is due to a tech issue, but I can't find the references being used in the text. This would have further strengthened the credibility of the analysis.

The relevance of the work is nevertheless high. Data will indeed be a cornerstone of a TAI economy, and it is essential to have a good economic model of it to develop effective governance, as the article states.

Yulu Pi

The biomorphic framing and incorporation of biological metaphors are creative. However, the argument would be stronger with a closer engagement with existing literature. There’s already a substantial body of work that explores data’s unique role in the modern economy—especially around platform capitalism and data as labor—that’s not fully acknowledged here. Several references are listed in the bibliography but are not clearly integrated into the main text, limiting their impact.

The discussion of the direct impact on AI risk reduction should be strengthened. Governance recommendations are offered, but they remain at a high level. A more detailed explanation of how the proposed policies would reduce specific risks, particularly economic risks such as the effects of monopoly and data misuse on market dynamics, would enhance the paper.

This is a really creative piece of work. Still, the framework remains mostly theoretical, and that limits how far the argument can go. The case studies in finance and healthcare are interesting, but they feel a bit too surface-level to really support the claims being made. Adding more detail, evidence, or even some light modeling would go a long way in making the framework feel more robust and ready for broader application.

Joel Christoph

The submission offers an ambitious reimagining of data in Transformative AI economies, presenting a biomorphic framework that treats data as capital with quasi-biological traits. By drawing on biology, quantum physics, and complex systems theory, the paper highlights non-rivalry, combinatorial value creation, and disequilibrium dynamics that standard capital models overlook. The section on nuanced forms of data scarcity and the policy matrix that balances excludability with innovation are original contributions that help policymakers think beyond current data governance debates.

Despite this conceptual novelty, the argument relies mainly on metaphors and narrative case studies. There are no formal definitions, mathematical expressions, or simulation results that operationalize the proposed framework. As a result, it is difficult to test falsifiable claims or compare the approach with existing models of data value and access. The literature review lists relevant OECD and academic sources, but engagement is largely descriptive and omits recent quantitative work on data as an intangible asset, platform competition, and measurement of data externalities.

Links to AI safety are indirect. The paper notes that data governance shapes incentives for advanced AI systems, yet it stops short of analyzing how data capital affects alignment risks, model interpretability, or malicious use. Concrete pathways connecting the biomorphic perspective to safety mechanisms such as auditing, dataset provenance, or red-team incentives would improve the impact on the safety agenda.

Technical quality and documentation remain limited. The case studies in finance and healthcare are illustrative rather than empirical, and the text does not specify selection criteria, data sources, or analytical methods. No code, datasets, or reproducibility materials accompany the submission, which restricts external validation and future extensions by other researchers.

To strengthen the work, the author should:

1. Formalize key concepts with clear definitions and, if possible, simple models or agent-based simulations.

2. Provide at least one quantitative example that traces data accumulation, depreciation, and value creation under the biomorphic rules.

3. Expand the review to cover empirical studies on data markets and recent AI safety papers linking data governance to catastrophic risk reduction.

4. Release a minimal open-source notebook that reproduces any illustrative result or visual.

5. Clarify how the proposed governance mechanisms would mitigate specific safety risks, for example by preventing data poisoning or ensuring verifiable data lineage.
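As one illustration of what the quantitative example in recommendation 2 might look like, the sketch below traces a data stock through accumulation, depreciation, and value creation. The update rule, the power-law value function, and the names `simulate_data_stock`, `depreciation_rate`, and `value_exponent` are all assumptions made for this sketch; none of them come from the submission.

```python
def simulate_data_stock(inflows, depreciation_rate, value_exponent, initial_stock=0.0):
    """Trace a data stock and the value it generates over discrete periods.

    Assumed (hypothetical) rules, not taken from the paper:
      stock[t] = (1 - depreciation_rate) * stock[t-1] + inflows[t]
      value[t] = stock[t] ** value_exponent
    A value_exponent above 1 stands in for increasing returns to data.
    """
    if not 0.0 <= depreciation_rate <= 1.0:
        raise ValueError("depreciation_rate must lie in [0, 1]")
    stock = initial_stock
    path = []
    for inflow in inflows:
        # Old data loses part of its usefulness, then new data arrives.
        stock = (1.0 - depreciation_rate) * stock + inflow
        path.append((stock, stock ** value_exponent))
    return path


if __name__ == "__main__":
    # Illustrative numbers only: constant inflow, 50% depreciation per period.
    for period, (stock, value) in enumerate(simulate_data_stock([10, 10, 10], 0.5, 2), start=1):
        print(f"period {period}: stock={stock:.2f}, value={value:.2f}")
```

Varying `depreciation_rate` by context, or swapping in a different value function, would be one simple way to compare the biomorphic rules against a standard capital-accumulation baseline.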

Cecilia Wood

An interesting discussion of the limits of standard economic assumptions when modelling data as a productive asset. Many of the individual points raised felt reasonable and correct. In addition, introducing case studies is a good way to make this feel more tangible. However, there didn’t seem to be a concrete alternative framework proposed, beyond gesturing towards taking insights from other fields. I definitely think there could be interesting research that comes from this direction, but this paper doesn’t suggest a concrete alternative approach. One way you could push this research forward instead is to adapt existing economic models, using insights from this paper to guide changes to the standard assumptions (e.g. allowing for increasing returns to scale and non-rivalry).

Some specific comments below.

Increasing returns is a good point, and a good reason for considering systems out of equilibrium. We could reframe this as long run (LR) versus short run (SR), where we look at the SR equilibrium, transitions over time, and then what happens in the LR, either if all data is used or if synthetic data is introduced.

I wouldn’t say ‘economic theory assumes…’, as economic theory refers to the tools. I would say ‘most of the literature considers models where…’. Then you’re arguing that we should use economic theory to consider different scenarios, i.e. non-rivalrous, increasing RTS, etc.

Often we might think about data as information, i.e. once you know it, you can’t get any further use from it. But it might be interesting to think about data as a source of information, where the more you study it, the more information you can draw from it. (Then you could include a function which takes data as an input and produces value; considering different functions then allows you to explore the points you consider in ‘Variable and Context-Dependent Depreciation’ and elsewhere.)
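A minimal sketch of the kind of value function suggested here, under assumptions that are mine rather than the paper's: information extracted from a dataset rises with study effort but is bounded by the dataset's size, and value decays with age at a context-specific half-life. The function names, the exponential forms, and every parameter value are hypothetical.

```python
import math


def extracted_information(data_size, study_effort, extraction_rate=0.5):
    """Information drawn from a dataset: grows with study effort, never exceeds data_size.

    The saturating exponential form is an assumption chosen for simplicity.
    """
    return data_size * (1.0 - math.exp(-extraction_rate * study_effort))


def data_value(data_size, study_effort, age, half_life):
    """Value of the extracted information after context-dependent depreciation.

    half_life is the (hypothetical) time after which the data is worth half as much
    in a given context; different contexts simply use different half-lives.
    """
    decay = 0.5 ** (age / half_life)
    return extracted_information(data_size, study_effort) * decay


if __name__ == "__main__":
    # Illustrative contexts only: same data and effort, different half-lives.
    for label, half_life in [("short half-life context", 0.1), ("long half-life context", 10.0)]:
        print(label, round(data_value(100.0, 2.0, age=1.0, half_life=half_life), 3))
```

Replacing either function with a different shape (for example, no saturation, or depreciation that depends on how many others hold the same data) is where the paper's depreciation arguments could be made testable.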

‘Complex Adaptive System Dynamics’ – this section makes interesting claims, and I would be keen to see more discussion here.

It’s not true that economic theory doesn’t allow for inputs to be complements (capital and labour typically are). You could separate ‘data’ into different categories that are complements (so just a modelling choice). (re. Combinatorial value creation)
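To illustrate the modelling choice, here is a small sketch that treats two categories of data as complementary inputs in a Cobb-Douglas function, a standard textbook form. The split into `data_a` and `data_b` and the exponent values are assumptions for illustration, not something taken from the paper.

```python
def combined_output(data_a, data_b, alpha=0.5, beta=0.5):
    """Cobb-Douglas output from two data categories (hypothetical split)."""
    return (data_a ** alpha) * (data_b ** beta)


def marginal_gain_from_a(data_a, data_b, step=1.0, alpha=0.5, beta=0.5):
    """Extra output from adding `step` units of category A, holding category B fixed."""
    return (combined_output(data_a + step, data_b, alpha, beta)
            - combined_output(data_a, data_b, alpha, beta))


if __name__ == "__main__":
    # The same extra unit of A is worth more when more of B is available,
    # which is the sense in which the two inputs are complements.
    print(marginal_gain_from_a(1.0, 1.0))
    print(marginal_gain_from_a(1.0, 2.0))
```

Setting `alpha + beta` above 1 in the same function would also give increasing returns to scale, so combinatorial value creation and increasing returns can be explored within one conventional specification.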

The section on scarcity seems right.

Cite this work

@misc{
  title={Data as Capital in TAI Economies: A Biomorphic Framework},
  author={Isobel (Bella) Smith},
  date={4/27/25},
  organization={Apart Research},
  note={Research submission to the research sprint hosted by Apart.},
  howpublished={https://apartresearch.com}
}
