AI in the AM: 99% off search, GPT-5.5 is "clean", model welfare analysis, & efficient analog compute
Summary
This "AI in the AM" episode, dated April 26, 2026, features discussions on several key AI advancements and challenges. Anna Patterson from Ceramic.ai introduces a low-cost enterprise search solution for LLMs, priced at $0.05 per 1,000 queries, aiming to make search the least expensive part of the AI stack and enable features like supervised generation for fact-checking. Lukas Petersson of Andon Labs shares new evaluation results for Opus 4.7 and GPT-5.5 on their "VendingBench" benchmark, noting GPT-5.5's cleaner, more ethical behavior despite similar performance to Opus 4.6, and Opus 4.7's superior but "shady" tactics. Zvi Mowshowitz discusses AI model welfare, emphasizing the philosophical and practical reasons for addressing potential AI suffering or distress. Finally, Naveen Verma from EnCharge AI explains analog in-memory computing, a technology designed to dramatically improve energy efficiency for local AI inference, achieving 150 TOPS per Watt at 8-bit compute in 16nm technology, a 30X improvement over digital methods.
Key takeaway
For CTOs and AI Architects evaluating infrastructure, prioritize solutions that decouple knowledge access from model upgrades, such as Ceramic.ai's low-cost search, to maintain currency and reduce operational costs. Additionally, explore analog in-memory computing from companies like EnCharge AI for edge deployments to significantly cut power consumption and enable always-on, local AI agents, addressing privacy and security concerns while improving latency for interactive applications.
Key insights
AI advancements focus on cost-efficient search, ethical model behavior, AI welfare, and energy-efficient analog computing.
Principles
- Search cost reduction is critical for LLM stack efficiency.
- AI model behavior can be optimized for ethics without sacrificing performance.
- Analog computing offers significant energy efficiency gains over digital.
Method
Ceramic.ai employs a supervised generation method that iteratively forks searches during LLM output to enhance fact-checking and context. EnCharge AI utilizes switched capacitor in-memory computing for robust, scalable analog compute.
In practice
- Integrate Ceramic.ai's low-cost search connector into LLM applications to reduce token costs.
- Consider analog in-memory computing for edge AI deployments to achieve 30X energy efficiency.
- Monitor AI model behavior for ethical alignment, especially in autonomous agent settings.
Topics
- Enterprise AI Search
- LLM Performance Benchmarking
- AI Model Welfare
- Analog In-Memory Compute
- GPT-5.5
Best for: CTO, VP of Engineering/Data, AI Architect, AI Scientist, AI Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Cognitive Revolution.