Own or Be Owned: Why Every Company Needs Its Own AI Model (Yash Patil, Co-Founder & CEO of Applied Compute)
Summary
Yash Patil, Co-Founder & CEO of Applied Compute, advocates for companies to develop and own custom AI models, citing the "own or be owned" thesis. His \$1.3 billion company trains specialized models on proprietary business data, leveraging open-weight models and advanced post-training techniques. This approach addresses issues like frontier model guardrails, anti-competitive concerns, and the high cost of general-purpose AI, which Patil likens to "cooking with a blowtorch." Applied Compute's method focuses on optimizing for specific tasks, cost, latency, and data ownership, enabling clients like DoorDash to achieve superior performance for niche applications. Patil also predicts AI's economic rollout will span decades due to change management and data readiness challenges within large organizations.
Key takeaway
For Directors of AI/ML or AI Product Managers evaluating their AI strategy, relying solely on large frontier models risks dependency, high costs, and limited differentiation. You should explore custom model development on your proprietary data to gain control over capabilities, optimize for specific business tasks, and build a competitive advantage. Prioritize internal data readiness and invest in robust ML infrastructure to support continuous model improvement.
Key insights
Companies must own custom AI models to control capabilities, optimize costs, and differentiate from competitors.
Principles
- Frontier models are often the wrong tool for specific business tasks.
- Proprietary evals define model value and differentiation for enterprises.
- Continuous training loops are essential for model adaptation and improvement.
Method
Applied Compute trains custom, task-specific models on customer data using open-weight models, serving them at scale, and continuously improving them via production usage data and advanced ML infrastructure.
In practice
- Develop task-specific models to optimize for cost, latency, and performance.
- Codify internal procedures into proprietary evals for model training.
- Implement asynchronous sampling and training for efficient GPU utilization.
Topics
- Custom AI Models
- Open-Weight Models
- Post-Training
- ML Infrastructure
- AI Strategy
- Data Ownership
- Compute Optimization
Best for: Investor, CTO, VP of Engineering/Data, Director of AI/ML, AI Product Manager, Entrepreneur
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Generalist.