Before GitHub
Summary
The author reflects on the evolution of Open Source software development, contrasting the pre-GitHub era with the platform's rise and current perceived decline. Before GitHub, projects often ran on self-hosted infrastructure like Trac and Subversion, fostering a smaller, more reputation-driven community where dependencies were carefully considered. GitHub centralized this, making project creation, discovery, and contribution frictionless, leading to an explosion of micro-dependencies and forming a significant part of the Open Source social infrastructure. While acknowledging GitHub's immense contributions, such as archival work and normalizing open collaboration, the author expresses concern over its current instability, product churn, and unclear leadership, which are causing prominent projects like Ghostty, Strudel, and Tenacity to consider moving. This shift away from GitHub, while potentially decentralizing and restoring autonomy, risks losing valuable social context and historical data if not properly archived.
Key takeaway
For CTOs and VPs of Engineering evaluating their Open Source strategy, recognize that relying solely on a single commercial platform like GitHub for project hosting and archival introduces significant risk. Actively explore and support decentralized alternatives and public archiving initiatives to ensure the long-term discoverability and preservation of your projects' code, issues, and social context, mitigating the impact of platform instability or business model shifts on your software supply chain.
Key insights
Centralized platforms like GitHub profoundly shaped Open Source, but their decline necessitates new archival solutions.
Principles
- Centralization creates discoverable memory.
- Frictionless publishing leads to dependency explosion.
- Distributed systems can still centralize around services.
In practice
- Mirror social context of projects.
- Preserve release artifacts and metadata.
- Support public Open Source archives.
Topics
- GitHub
- Open-Source Ecosystem
- Code Hosting Platforms
- Software Dependencies
- Distributed Version Control
Best for: CTO, VP of Engineering/Data, AI Architect, Software Engineer, Director of AI/ML, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Armin Ronacher's Thoughts and Writings.