AZ Computer Guru Radio Show Prep
Saturday, April 25, 2026 — AI Arms Race Edition
COMMON THREAD
"Three AI Heavyweights in One Week — And What Normal People Actually Get Out of It"
Three of the biggest names in AI dropped major new models this week. OpenAI launched GPT-5.5 (nicknamed "Spud") on Thursday April 23. DeepSeek — the Chinese startup that shocked Silicon Valley a year ago — unveiled V4-Pro and V4-Flash on Friday April 24. And Anthropic released Claude Opus 4.7 on April 16, just ahead of the pack. Meanwhile, Amazon announced it's putting $33 billion total into Anthropic — the biggest AI investment commitment in history.
The press is breathless. The tech forums are arguing about benchmarks. But most people in Tucson just want to know: does any of this actually help me? Today's show answers that question. We're going to skip the jargon and explain what these models can actually do — not for PhD researchers, but for regular people: the homeowner, the small business owner, the retiree who just wants to know if this stuff is worth paying for.
The punchline up front: yes, something genuinely changed this week. These new AI models don't just answer questions. They plan, use tools, and finish multi-step tasks without you holding their hand. That's a real shift — and Fortune magazine put it best: AI model launches are starting to look like software updates. Which means this technology is maturing faster than the internet did in the 1990s.
SEGMENT 1: "The New AI Does Your Errands — Not Just Your Homework" (14-16 min)
Opening
"So I want to start today by asking: how many of you have tried ChatGPT or one of these AI tools, typed in a question, and thought — okay, that's impressive, but it didn't actually DO anything for me? It told me things. It wrote stuff. But it didn't go figure it out. That's changing. This week. And that is a genuinely big deal."
Story 1: What "Agentic AI" Means — And Why It Matters to You
Key Facts
- "Agentic" comes from the word "agency" — the capacity to act independently toward a goal
- Previous AI: you ask, it answers. You ask again, it answers again. You do the work.
- Agentic AI: you give it a goal, it figures out the steps, takes the actions, and reports back
- Gartner prediction: 40% of enterprise applications will include AI agents by end of 2026 — up from less than 5% in 2025
- BCG research: workers using 4+ AI tools simultaneously see productivity DROP — agentic AI solves this by being the single agent coordinating the tools
Gartner: 40% of enterprise apps will have AI agents built in by end of 2026. In 2025 it was under 5%. That's an 8x jump in one year.
Talking Points
- The old way: you ask ChatGPT how to plan a trip, it gives you a list, you spend two hours executing it
- The new way: you tell it "plan me a 4-day Tucson-to-Sedona road trip, book under $120/night, avoid highway driving after 3pm" — it searches, compares, drafts an itinerary, and hands you something actionable
- Small business example: instead of typing in a customer complaint and asking "what should I say," the AI drafts a reply, checks your refund policy, and prepares a response for you to approve
- Retiree example: tell it "I need a specialist who takes Medicare, within 15 miles of 85719, accepting new patients" — it searches, filters, gives you a shortlist
- Old AI: "here's a recipe." New AI: "here's a recipe, I've ordered the ingredients to your Instacart, and set a timer reminder for Thursday"
- You don't have to be a tech person. You just have to be willing to describe what you want clearly.
- The people who learn to describe problems clearly are going to get a real advantage — everyone else will keep doing things the hard way
Why This Matters
This is the leap that moves AI from "interesting toy" to "tool that saves you hours every week." Every single model released this week is built specifically around this agentic capability. This is the whole game right now.
Story 2: GPT-5.5 "Spud" — OpenAI's Thursday Drop
Key Facts
- Released: Thursday, April 23, 2026 — two days ago
- Internal codename: "Spud" (yes, like a potato — OpenAI staff have a sense of humor)
- First fully retrained base model since GPT-4.5 — not just a tune-up, a ground-up rebuild
- Natively omnimodal: handles text, images, audio, and video in a single unified architecture
- Terminal-Bench 2.0 score: 82.7% — leading benchmark for real-world autonomous computer tasks
- OSWorld-Verified: 78.7% — tests whether AI can operate software like a human would
- Long-context recall (MRCR v2): jumped from 36.6% to 74.0% — a 37-point improvement in remembering long conversations
- API Pricing: $5 per million input tokens, $30 per million output tokens (doubled from previous version)
- Available now: ChatGPT Plus, Pro, Business, Enterprise — $20/month
Long-context recall: up 37 points. That's the AI finally remembering what you told it 30 exchanges ago. A real fix for a real frustration.
Talking Points
- "Spud" because it's a step between GPT-5.4 and GPT-6 — a mid-cycle release, like a point upgrade on your iPhone
- Can browse the web, write and debug code, analyze data, fill spreadsheets, AND coordinate across multiple software tools — all in one uninterrupted session
- That 37-point jump in long-context recall means: it actually remembers the beginning of the conversation when you're 30 exchanges deep — a pain point that's finally fixed
- OSWorld 78.7%: nearly 4 out of 5 times, it can operate software like a human (clicking, typing, navigating menus) without being shown how
- Price doubled on the API side, but OpenAI says it uses fewer tokens to do more work — net cost to most users is flat or lower
- Fortune's exact framing: "AI model launches are starting to look like software updates" — that's a GOOD thing for consumers
- "Spud" is a reminder: the most serious tech in the world is made by humans who name their projects after potatoes
Why This Matters
GPT-5.5 is what you'll be using on ChatGPT for the next several months. If you're paying $20/month for Plus, this is your new version — and it's genuinely better at doing actual tasks, not just generating text. It's the first version where "autonomous computer use" is a real feature, not a lab demo.
Time: 14-16 minutes
SEGMENT 2: "China Just Fired Back — And They're Charging 10 Cents on the Dollar" (12-14 min)
Opening
"About a year ago, a Chinese AI startup called DeepSeek dropped a model that sent shockwaves through Silicon Valley. OpenAI stock dropped. NVIDIA shares tumbled. The whole tech industry had to confront the fact that maybe you don't need a hundred million dollars and a warehouse full of chips to build world-class AI. This week, DeepSeek is back — with two new models, a million-token context window, and prices so low they almost seem like a mistake."
Story 1: DeepSeek V4-Pro and V4-Flash — The China Response
Key Facts
- Released: Friday, April 24, 2026 — yesterday
- Two models: V4-Pro (flagship) and V4-Flash (fast/cheap)
- V4-Pro: 1.6 trillion total parameters, 49 billion active (mixture-of-experts architecture)
- V4-Flash: 284 billion total parameters, 13 billion active
- Context window: 1 million tokens on BOTH — roughly 750,000 words; fits an entire novel or large codebase
- Benchmarks: V4-Pro beats all rival open-source models on math and coding; trails only Google's Gemini 3.1-Pro among closed commercial models
- Training hardware: Huawei Ascend AI processors — not NVIDIA chips subject to US export restrictions
Pricing Comparison (per million output tokens)
- GPT-5.5 (OpenAI): $30.00
- Claude Opus 4.7 (Anthropic): $25.00
- DeepSeek V4-Pro: $3.48
- DeepSeek V4-Flash: $0.28
DeepSeek Pro is roughly 8-10x cheaper than the US competitors for similar quality work.
Talking Points
- Let that sink in: DeepSeek charges $3.48 for a million output tokens. OpenAI charges $30. Anthropic charges $25. For similar quality.
- For small businesses using AI in workflows — customer service, drafting contracts, writing descriptions — the cost difference is massive at scale
- The "mixture of experts" thing explained: imagine a hospital with 100 specialists. You don't see all 100 when you walk in — a smart receptionist routes you to the right 3 or 4. DeepSeek's model works the same way. Huge overall but efficient per request.
- The Huawei chip angle: US restricted export of NVIDIA's best chips to China to slow AI development there. DeepSeek just proved the restriction isn't working — they built a frontier model on Chinese hardware.
- V4 trails GPT-5.5 by about 3-6 months of development. But at 10 cents on the dollar, "almost as good for way less" is a compelling pitch.
- The 1 million token context window: you can hand it an entire legal contract, a full codebase, or a year's worth of customer emails and ask it to analyze the whole thing at once.
Why This Matters
DeepSeek is forcing down the price of intelligence. Every time they release a cheap, capable model, US companies have to respond. That's competition working — the consumer wins. And the fact that China can build frontier AI on domestic chips despite US export restrictions is a geopolitical development that will drive policy decisions in Washington for years.
Story 2: The China vs. US AI Race — Where It Actually Stands
Key Facts
- Current gap: US models still lead benchmarks but China is closing fast — estimated 3-6 months behind
- US private AI investment 2025: $285.9 billion (Stanford AI Index 2026)
- China private AI investment 2025: $12.4 billion — 23x less
- But China leads: AI research publications, citations, patents, industrial robot installations
- Different strategies: US = private sector bets; China = state-directed investment with national mandate
- Timeline: DeepSeek-R1 shocked the world January 2025; V4 a year later shows this wasn't a one-time miracle
Talking Points
- This isn't the Cold War space race where one side had a clear lead. This is neck-and-neck and tightening.
- China's strategy: build models that match US quality at a fraction of the cost, make them open-source, let the world adopt them — soft power through AI
- The Huawei chip achievement: US assumed the chip export ban would slow China's AI by years. DeepSeek just cut that estimate to months.
- Good news for consumers: competition keeps prices down and innovation up
- For policymakers: the competitive lead is narrower than anyone in Washington wants to admit
Time: 12-14 minutes
SEGMENT 3: "Amazon Just Bet $33 Billion That Anthropic Wins — Here's What That Means for You" (12-14 min)
Opening
"Last Monday, Amazon made what is almost certainly the largest AI investment in history. Thirty-three billion dollars. To put that in perspective: that's more than the entire GDP of Iceland. And they're putting it all into one AI company — Anthropic, the maker of Claude. The announcement barely made the news because GPT-5.5 dropped Thursday and everyone shifted. But this deal will shape the AI you use for the next decade."
Story 1: The $33 Billion Deal — What Amazon Is Actually Buying
Key Facts
- Announcement date: Monday, April 20, 2026
- Total commitment: $33 billion ($8B previous + $5B immediate + up to $20B more tied to milestones)
- Anthropic's current valuation: $380 billion
- In exchange: Anthropic committed to spend more than $100 billion on Amazon Web Services over the next decade
- Compute secured: up to 5 gigawatts of Amazon's Trainium AI chips — spanning Trainium2 through the not-yet-released Trainium4
- Timeline: nearly 1 gigawatt of Trainium capacity online for Anthropic by end of 2026
- AWS customers using Claude: over 100,000 businesses already
- Cloud coverage: Claude available on AWS Bedrock, Google Cloud Vertex AI, AND Microsoft Azure Foundry — the only frontier model on all three
$33 billion. The entire Apollo moon program cost about $25 billion in today's dollars. Amazon is spending more on one AI company than we spent getting to the Moon.
Talking Points
- Why Amazon? They want Claude baked into AWS — the cloud platform that runs a huge chunk of the internet
- The $100 billion in cloud spending going back to Amazon is key: this isn't just investment, it's a strategic lock-in. Anthropic goes all-in on Amazon's infrastructure, Amazon goes all-in on funding Anthropic.
- Anthropic's candid statement: "unprecedented consumer growth has placed an inevitable strain on our infrastructure" — Claude has been slow and unreliable at peak times. This money fixes that.
- What it means for Claude users: faster responses, less downtime, more capacity during peak hours
- Trainium chips are Amazon's answer to NVIDIA — designed specifically for AI training and inference. Amazon is betting it can build its own AI chip ecosystem.
- $380 billion valuation: a company founded just a few years ago is now worth more than Ford, General Motors, and Harley-Davidson combined.
Why This Matters
Amazon is essentially saying: AI is the next AWS. They built their whole cloud business on making computing cheap and accessible. Now they want to do the same with intelligence. If you're a small business on AWS, Claude is about to get much more tightly integrated into the tools you already use.
Story 2: The Anthropic Story — Why This Company Has Amazon's Attention
Key Facts — Claude Opus 4.7 (Released April 16)
- SWE-bench Verified: 87.6% — nearly 9 out of 10 real software bugs fixed correctly on first try
- SWE-bench Pro: 64.3% — up 10.9 points from previous version
- Vision improvement: 3.26x higher resolution image understanding
- New "xhigh" effort level: finer control between fast/cheap and slow/thorough
- Key feature: ability to "double-check its own work" before reporting back — fewer hallucinations
- Pricing unchanged: $5 input / $25 output per million tokens
- Benchmark wins: 12 of 14 reported head-to-head tests against GPT-5.5
- Available on: claude.ai (free + $20/month Pro), API, Amazon Bedrock, Google Vertex AI, Microsoft Foundry
Talking Points
- Anthropic's whole pitch: we're the "safety-first" AI company — we build the best models AND we don't ship the ones that scare us
- Claude Opus 4.7 now "double-checks its own work" — it generates an answer, then runs a verification pass before sending it back. Fewer wrong answers stated confidently.
- 87.6% coding benchmark means: near Ph.D. level software engineering performance. The AI that companies are trusting to write production code.
- The 3.26x vision improvement is practical: hand it a blurry photo of a receipt, a complex chart, a hand-drawn diagram — it reads it reliably
- "xhigh" effort: think of it as choosing between a quick Google search and a thorough research report. More control is useful for professionals.
- Amazon's bet is partly about safety: they want an AI company that won't have a catastrophic public failure. Anthropic's methodical approach is a feature, not a limitation, from an enterprise perspective.
Story 3: The "Software Update" Moment
Key Facts
- Fortune headline, exact quote: "GPT-5.5 is here — and AI model launches are starting to look like software updates"
- Three frontier models this April week: Claude Opus 4.7 (Apr 16), GPT-5.5 (Apr 23), DeepSeek V4 (Apr 24)
- March 2026: 30+ new AI model releases in one month
- Adoption speed: generative AI hit 53% population adoption in 3 years; internet took 7+ years (Stanford AI Index 2026)
- BCG research: workers using 4+ AI tools simultaneously see productivity DROP — proliferation problem is real
Talking Points
- The "software update" analogy is exactly right — and it's actually good news. Remember when Windows updates were once-a-year, terrifying events? Now they're invisible background installs. That's where AI is heading.
- Developers are fatigued — there's a running joke that AI tool fatigue now happens daily, where JavaScript framework fatigue used to happen monthly
- But for end users? More competition = better tools, same or lower prices
- The same AI that cost $100/month a year ago is now free or $20/month
- 53% adoption in 3 years: TV took decades, internet took 7 years, smartphones took 5 years. AI hit half the population in 3. Fastest tech adoption in human history.
Why This Matters
We are past the "neat demo" phase. This week's releases — three frontier models, $33 billion in investment, the Fortune "software update" comparison — are signs that AI is becoming infrastructure. Like electricity or the internet: you don't marvel at it, you just use it. The question is no longer "is this real?" — it's "how do I use it before my competition does?"
Time: 12-14 minutes
SEGMENT 4: "What Normal People Should Actually DO With All of This" (12-14 min)
Opening
"Okay, so we've covered GPT-5.5, DeepSeek, Claude, Amazon's $33 billion bet. Here's the part of the show that matters most: what do YOU do with any of this? Not a software engineer. Not a tech investor. A regular person in Tucson with actual problems to solve. Let's get practical."
Story 1: Five Real Tasks You Can Do Right Now — Free or $20/Month
Your Options
- ChatGPT (chat.openai.com): Free tier = GPT-4o. $20/month Plus = GPT-5.5.
- Claude (claude.ai): Free tier = Claude Sonnet. $20/month Pro = Opus 4.7.
- DeepSeek (chat.deepseek.com): Free. Impressive. But note: Chinese servers — don't paste sensitive business data.
Five Tasks Worth Trying This Week
- Homeowners: "Write a letter to my HOA about the drainage issue behind my house." Hand it a photo of the relevant rules, describe the problem, ask for a firm but professional letter. 2 minutes. Previously: half a day.
- Small business owners: "Analyze these 47 customer reviews and tell me the top 3 complaints and what to do about each." Paste them all in. Get a prioritized action list. No consultant required.
- Retirees / medical: "Explain this Medicare Advantage summary of benefits in plain English. What's my out-of-pocket maximum for specialist visits?" Hand it the PDF or paste the text. 30 seconds.
- Anyone: "Compare these three home insurance quotes. Tell me which one is actually better and what I'm trading off." Paste all three. Get a real comparison with tradeoffs called out.
- Small business: "Draft a response to this negative Yelp review that's professional, doesn't admit liability, and invites them to call us directly." Paste the review. Done.
Why This Matters
These are all tasks where you previously either struggled through it yourself, paid someone, or just didn't do it. Now they take minutes. The people who figure out how to describe their problems clearly and give AI the relevant context are going to save 5-10 hours a week. That's not a technology story — that's a quality-of-life story.
Story 2: What to Watch Out For — The Three Real Risks
Key Facts
- Hallucination rate: still exists; models confidently generate wrong facts — dropped significantly but not zero
- Privacy: free tier conversations may be used for training; opt-out available in settings
- DeepSeek data: Chinese-owned, Chinese servers, subject to Chinese law — not for sensitive business data
- BCG study: workers using 4+ AI tools simultaneously see measurable productivity drop
Talking Points
- Hallucination warning: These systems are confident liars when they don't know something. Never trust a specific number, date, legal interpretation, or medical fact without checking. Use it for drafting and organizing, not as the final authority.
- Privacy 101: ChatGPT's free tier uses your conversations for training by default. You can opt out in Settings > Data Controls. Claude has similar controls. If you're pasting business contracts or personal financial data — use the paid tier.
- DeepSeek caveat: Free and impressive. But it's Chinese-owned, data goes to Chinese servers, company is subject to Chinese law. For personal curiosity tasks? Fine. For sensitive business information? Use Claude or ChatGPT.
- The right mindset: AI is best at "get me most of the way there" tasks where you then review and edit. It's a very capable first draft machine, not a finished product machine.
- The cost question: If you use it more than 3x per week for work tasks, the $20/month subscription pays for itself. If it's occasional curiosity, stay free.
- "It's just a tool" reminder: a hammer doesn't do your home renovations. AI doesn't do your thinking. It does the mechanical part — the drafting, the summarizing, the formatting — and you bring the judgment.
Story 3: The Bigger Picture — What This Week Means for Tucson
88% organizational adoption (Stanford AI Index 2026): most businesses are already using some form of AI. Not experimenting — using.
Talking Points
- Tucson specific: The service businesses that dominate our local economy — contractors, restaurants, medical offices, real estate — all have repeatable communication tasks that AI handles well
- The "your competition is already using it" argument: 88% organizational adoption means the HVAC company across town, the competing dental practice, the other real estate agent — they're already drafting their emails with AI. The question is whether you keep spending 40 minutes on a task that takes them 4.
- The cost argument: A business sending 200 customer emails a month saves 10+ hours if each one takes 3 minutes with AI instead of 30 without. At any reasonable hourly rate, that's thousands of dollars in recovered time per year.
- Realistic prediction: Within 18 months, AI-written first drafts will be standard in most businesses, the way spell-check became standard. The people who learned it early will be faster and more capable.
- Final encouragement: You don't need to understand the technology to use it. You just need to be willing to describe your problem in plain English and try. The barrier is lower this week than it was last week.
Why This Matters
The AI arms race covered today — GPT-5.5, DeepSeek, Claude, Amazon's $33 billion — is ultimately a competition to win your time. Every company in this race is trying to prove they can save you the most hours per week. The winner of that competition is you, as long as you actually pick up the tool.
Time: 12-14 minutes
SHOW WRAP & TAKEAWAYS
Summary
"So here's what happened this week in AI, in plain English. OpenAI dropped GPT-5.5 on Thursday — a rebuilt model that can plan, use tools, and finish multi-step tasks without you guiding it every step of the way. Fortune called it: AI launches now look like software updates. DeepSeek fired back from China on Friday with V4-Pro and V4-Flash — near-frontier quality at roughly one-tenth the price, built on Chinese Huawei chips that weren't supposed to be capable of this. And Amazon announced Monday it's committing $33 billion total to Anthropic — the biggest AI investment in history — partly because Claude's infrastructure was straining under too many users, partly because Amazon wants AI baked into everything on AWS.
The takeaway for regular people: these models actually do things now. They plan. They use tools. They finish tasks. They don't just write essays. And the competition between them is driving prices down and quality up at a pace faster than any technology we've ever seen."
Final Thought
"In 1995, you could have explained the internet to somebody and they would have said 'sounds interesting, I'll check it out when it's more useful.' The people who said that in 1999 were still catching up in 2005. We're at a similar moment with AI, except the timeline is compressed by a factor of three. The window for early adoption is shorter. Pick one task you hate doing every week. Give it to ChatGPT or Claude. See what you get back. That's all. You can evaluate the whole AI arms race from that one experiment."
What You Can Do This Weekend
- Try ChatGPT free: chat.openai.com — GPT-5.5 with Plus ($20/month)
- Try Claude free: claude.ai — Opus 4.7 with Pro ($20/month)
- Best first task: something you have to write. An email, a letter, a complaint, a review response. Paste in context, describe what you want, see what you get back.
- For small businesses: ask AI how to respond to your last negative Yelp or Google review. Then ask it what your top customer complaint pattern is based on your reviews.
- Privacy tip: in ChatGPT settings, go to Data Controls and disable "Improve the model for everyone" if you don't want your conversations used for training.
- The $20 question: if you use it more than 3x per week for work tasks, the subscription pays for itself. If it's occasional curiosity, stay free.
- DeepSeek for the curious: chat.deepseek.com — impressive and free, but keep sensitive info off it.
SOURCES
GPT-5.5 "Spud"
DeepSeek V4
Amazon / Anthropic $33B Deal
Claude Opus 4.7
Agentic AI / Context
AI Adoption / Fatigue Research
NOTES FOR NEXT SHOW
Follow-Up Stories to Track
- GPT-6 timeline — OpenAI has signaled summer 2026; watch for leaks
- Amazon Trainium2/3 capacity coming online for Anthropic — any reliability improvements for Claude users?
- DeepSeek V4 adoption: do US businesses actually switch to save 8-10x on API costs?
- China chip export policy response — will the US tighten restrictions after Huawei V4 success?
- Anthropic Mythos / Project Glasswing: any public reports of vulnerabilities discovered?
- AI adoption in Tucson/Arizona small businesses — any local angle worth pursuing?
- BCG study follow-up on AI tool fatigue and productivity impact
Possible Future Show Angles
- "AI at your doctor's office: what's actually happening in Arizona healthcare" — real examples of AI in local medical practices
- "The privacy episode" — deep dive on what happens to your data in every major AI tool
- "Small business AI toolkit" — build a practical hour on specific tools for Tucson business owners
- "When AI gets it wrong" — hallucination examples, legal cases, what happens when you trust it too much
Research Date: April 25, 2026
Show Date: April 25, 2026
Format: 4 segments, 52-58 minutes total
Research Method: Live web search of breaking news from April 16-25, 2026