Open-source sustainability: who pays for the code everyone uses?

Simor Consulting | 22 Jun, 2026 | 05 Mins read

A critical open-source library used by thousands of companies, including several Fortune 500 firms, is maintained by one person in their spare time. This is not a hypothetical. It is a description of dozens of projects that form the foundation of modern software infrastructure. The maintainers of these projects are volunteers. The companies that depend on their work generate billions of dollars in revenue from that dependence.

The open-source sustainability problem is not new. But the AI era has made it more acute, because AI systems depend on open-source software at every layer — from the operating systems that run training clusters to the frameworks that orchestrate model training to the libraries that handle data preprocessing. The AI industry is built on open-source infrastructure, and the people who maintain that infrastructure are, in most cases, uncompensated.

The free rider problem at scale

Open-source software creates a classic free rider problem. The benefits of the software are non-excludable — anyone can use it. The costs of producing and maintaining the software fall on the maintainers. When the software is used by individuals and small companies, the free rider problem is manageable because the maintenance burden is small. When the software is used by large enterprises at scale, the maintenance burden grows with the user base, but the compensation does not.

The asymmetry is stark. A company that saves millions of dollars annually by building on open-source infrastructure contributes nothing to the maintenance of that infrastructure. The maintainer fixes bugs, reviews pull requests, handles security vulnerabilities, and manages community dynamics — all without compensation. When the maintainer burns out and abandons the project, the company that depended on it discovers, suddenly and expensively, that the “free” software they were using was subsidized by someone else’s unpaid labor.

This dynamic plays out repeatedly. The left-pad incident in 2016, where one developer’s removal of a small npm package broke thousands of projects, was a visible demonstration of the fragility. The Log4j vulnerability in 2021 showed the security consequences of undermaintained critical infrastructure. The xz utils backdoor attempt in 2024 showed what happens when a critical project is maintained by a single person who becomes a target for social engineering.

AI’s compounding dependency

AI systems deepen the open-source sustainability problem in three ways.

The dependency stack is deeper. A typical AI application depends on an operating system, container runtime, orchestration platform, programming language runtime, ML framework, data processing library, serving infrastructure, monitoring tools, and dozens of smaller libraries. Each layer of this stack includes open-source components maintained by people who are not compensated by the companies running AI workloads on their software.

The performance demands are higher. AI workloads push infrastructure to its limits, which means they expose edge cases and performance bottlenecks that casual users do not encounter. Maintainers of infrastructure software receive bug reports and feature requests from AI companies that are using their software at scales the maintainers never anticipated and cannot test against. The maintenance burden increases with the most demanding users, and the most demanding users are often the least likely to contribute.

The talent drain is accelerating. The same companies that depend on open-source infrastructure are hiring open-source maintainers away from their projects. When a maintainer joins a company, their open-source maintenance typically decreases or stops. The project loses its most experienced contributor. The company gains an employee who understands the project but is now paid to work on the company’s internal systems, not on the open-source project the company depends on.

Models that partially work

Several models for sustaining open-source development have been tried. None has solved the problem completely, but each offers lessons.

Corporate sponsorship. Companies like Google, Microsoft, and Meta fund significant open-source development. This works for high-profile projects that align with the sponsor’s strategic interests. It does not work for the long tail of critical-but-unsexy infrastructure projects that do not have a corporate champion.

Open-core licensing. Projects offer a free open-source core and a paid commercial version with additional features. This works for projects where the commercial features are valuable enough to justify the cost. It does not work for libraries and frameworks where the core functionality is what everyone needs and there is no natural boundary between free and paid.

GitHub Sponsors and donation platforms. Individual developers receive small recurring donations from users of their software. This works for developers with large, engaged communities. It does not work for maintainers of critical-but-invisible infrastructure — the kind of project that everyone uses but no one thinks about until it breaks.

Venture-funded open source. Companies build open-source products with venture capital, intending to monetize through cloud services, support, or enterprise features. This works for the venture-backed company. It often does not work for the open-source community, because the venture investors’ incentives eventually diverge from the community’s interests — typically around the point where the company needs to show revenue growth and starts restricting the open-source offering.

What organizations should do

If your organization depends on open-source software — and it does, whether you know it or not — you have a responsibility to contribute to the sustainability of that software. The contribution can take several forms.

Fund the maintainers. Identify the critical open-source dependencies in your stack. Determine who maintains them. If those maintainers have sponsorship programs, contribute. If they do not, reach out and offer support. The cost of maintaining a dependency is a fraction of the cost of replacing it when the maintainer burns out.

Contribute engineering time. Assign engineers to contribute to critical open-source projects. Not as a side project during free time, but as a formal allocation with management support and performance evaluation. This benefits both the organization — which gains influence over the projects it depends on — and the project — which gains additional maintenance capacity.

Contribute to foundations. Organizations like the Apache Software Foundation, the Linux Foundation, and the Python Software Foundation provide governance, legal support, and funding for open-source projects. Contributing to these foundations is an efficient way to support the open-source ecosystem without needing to evaluate individual projects.

Do not extract without contributing. This is the ethical minimum. If your organization generates revenue from software that depends on open-source infrastructure, the open-source ecosystem is a supplier. Treat it like one. Negotiate sustainable terms, which in the open-source context means funding or labor contributions proportional to your dependence.

The provocation

The open-source sustainability problem will not be solved by appeals to generosity. It will be solved when organizations recognize that open-source infrastructure is not free — it is subsidized by maintainers’ unpaid labor, and that subsidy has a limit. When the subsidy runs out — when maintainers burn out, when critical projects go unmaintained, when security vulnerabilities go unpatched — the cost falls on the organizations that benefited from the subsidy.

The organizations that will be least affected are the ones that invested in sustainability before the crisis. The organizations that will be most affected are the ones that treated open-source as a costless input to their business. The difference between these two positions is not luck. It is a choice that every organization using open-source software is making, whether it acknowledges the choice or not.

Shipping a production AI system?

Find the control gaps before they turn into incidents. Take the AI Production Scorecard for a fast baseline across the seven layers, or book an architecture review and we will turn it into a hardening plan.

Take the AI Production Scorecard Book an Architecture Review

This comment section requires JavaScript.

Enable JavaScript in your browser to use this feature.

Similar Articles

AI Ethics Machine Learning

Privacy-Preserving Machine Learning Techniques

30 Jan, 2024 | 03 Mins read

ML models require data to train effectively, but this data often contains sensitive personal information. Privacy-preserving ML (PPML) techniques enable organizations to build effective models while s

Thought Leadership Organizational Design

Why most AI transformations fail (it's not the technology)

20 Apr, 2026 | 04 Mins read

The CTO of a mid-size financial services firm told me they had spent $4 million on AI tooling in eighteen months. They had three large language model providers under contract, a vector database cluste

Thought Leadership Data Culture

The case for AI skepticism in your data strategy

27 Apr, 2026 | 04 Mins read

I was in a strategy session where a VP of Data told the room that generative AI would "eliminate the need for data analysts within two years." The room nodded. Budget was reallocated. Three analyst po

Thought Leadership Organizational Design

What we can learn from the DevOps revolution applied to AI

04 May, 2026 | 04 Mins read

In 2009, deploying software to production was an event. It involved a change request, a maintenance window, a runbook, and a prayer. Developers wrote code, then threw it over the wall to operations, w

Thought Leadership Data Culture

Building a data-driven culture: lessons from 50 engagements

13 May, 2026 | 05 Mins read

The phrase "data-driven culture" has been emptied of meaning by overuse. It appears in every strategy deck, every job posting, every conference talk. Everyone claims to want it. Almost no one can desc

Thought Leadership AI Ethics

The ethics of training on copyrighted data — a nuanced take

18 May, 2026 | 05 Mins read

The legal system has not caught up with the practice of training AI models on copyrighted data, and the people building AI systems are not waiting for it. Models trained on books, articles, code repos

Thought Leadership AI Ethics

Why your AI team needs philosophers, not just engineers

25 May, 2026 | 05 Mins read

A hiring manager at a large tech company told me they had four hundred engineers working on their AI platform and zero people with training in philosophy, ethics, or the social sciences. When I asked

Trends Thought Leadership

The great model commoditization: what happens when everyone has GPT-5

30 May, 2026 | 03 Mins read

OpenAI shipped GPT-5. Anthropic shipped Claude 4. Google shipped Gemini Ultra 2. Within six weeks of each other, the three leading model providers released frontier models that are, by most benchmarks

Thought Leadership Organizational Design

The paradox of AI automation: more tools, less productivity?

01 Jun, 2026 | 05 Mins read

A data engineering team I worked with had adopted six AI-powered tools in twelve months. An automated code reviewer, a data quality scanner, a pipeline orchestrator with intelligent retry, a natural l

Thought Leadership Career

Career paths in AI data engineering: 2026 edition

08 Jun, 2026 | 04 Mins read

Three years ago, "data engineer" was a coherent job title. You built pipelines, managed infrastructure, and moved data from where it was to where it needed to be. The role required SQL, Python, and a

Thought Leadership Career

Books every AI leader should read this year

10 Jun, 2026 | 04 Mins read

Most reading lists for AI leaders are assembled by people who sell AI. The lists are full of books about machine learning techniques, deep learning architectures, and the latest framework documentatio

Thought Leadership Data Culture

The invisible infrastructure: why data plumbing matters more than models

15 Jun, 2026 | 05 Mins read

A Fortune 500 company hired a team of twelve machine learning engineers and tasked them with building a predictive maintenance system for their manufacturing floor. The ML team spent four months evalu

Trends Thought Leadership

Why 'AI engineer' is the fastest-growing job title (and what it means)

17 Jun, 2026 | 04 Mins read

LinkedIn's latest workforce report shows "AI engineer" as the fastest-growing job title for the third consecutive quarter. Job postings containing the title increased 280% year-over-year. The growth r

Thought Leadership Data Culture

Why I stopped chasing the latest AI framework

29 Jun, 2026 | 04 Mins read

In 2023, I rewrote a data pipeline three times because the framework landscape kept shifting. First it was built on LangChain. Then the team wanted to switch to LlamaIndex because it handled retrieval

Thought Leadership Career

The loneliness of being the only data engineer on the team

06 Jul, 2026 | 05 Mins read

There is a version of the data engineering career that nobody warns you about. It is not the startup grind or the big-company bureaucracy. It is being the only data engineer on a team of people who do

Thought Leadership Data Culture

Technical debt in ML systems: a honest accounting

13 Jul, 2026 | 05 Mins read

Google's 2015 paper "Hidden Technical Debt in Machine Learning Systems" described a problem that has only gotten worse in the decade since. The paper's central observation was that the model itself is

AI Ethics Responsible AI

Responsible AI: Bias Detection and Mitigation

07 Aug, 2024 | 12 Mins read

# Responsible AI: Bias Detection and Mitigation AI systems influence critical decisions in healthcare, finance, hiring, and criminal justice. When these systems produce unfair outcomes, they can perp

AI Ethics Decision Systems

Ethical Considerations in AI-Powered Decision Systems

17 Nov, 2024 | 03 Mins read

AI increasingly powers high-stakes decision systems across industries. Organizations deploying AI-powered decision systems face complex questions about fairness, transparency, privacy, and accountabil

Trends Thought Leadership

2025 Year-in-Review & 2026 Trends in Data & AI Architecture

19 Dec, 2025 | 03 Mins read

2025 was the year AI moved from experimentation to industrialization. While 2024 saw the explosion of generative AI capabilities, 2025 was about making those capabilities production-ready, cost-effect

AI Operating System Thought Leadership

The AI Operating System: Why Companies Need an AI Foundation Layer

05 Jan, 2026 | 16 Mins read

A financial services firm spent eight months building an AI-powered document analysis system. When it came time to deploy, they discovered their retrieval system had no governance layer, their agent h

AI Enablement Thought Leadership

AI Enablement Programs: Building Organizational Capability, Not Just Technology

19 Mar, 2026 | 11 Mins read

A technology company built an impressive AI platform. They had GPU clusters, fine-tuning pipelines, evaluation frameworks, and a growing model registry. They opened access to any team that wanted to u