How to Build a "Down-to-Earth" AI Startup: From Google X Researcher to $1M ARR in 6 Months — A Conversation with Changyin Zhou, Founder of Vozo

How to Build a "Down-to-Earth" AI Startup: From Google X Researcher to $1M ARR in 6 Months — A Conversation with Changyin Zhou, Founder of Vozo

March 2, 2025

Last week, we used 20 questions to map out the past year of AI video models with Barkley, the sole product manager at Luma.ai.

This week, we invited Changyin Zhou, founder of Vozo — a one-click video remixing tool — to share the story of building Vozo: it can redub videos, translate and edit them, topped Product Hunt for three consecutive days at launch, and hit $1 million ARR in six months.

In this conversation, we also used Vozo as a thread to dive deep with Changyin about his previous work at the famed Google X lab, his first startup in Silicon Valley, and how he began his entrepreneurial journey back in China with the surprisingly down-to-earth "live streaming box."

We had assumed that leaving behind established ways of working would require tremendous courage. But Changyin told us that compared to sunk costs, he cares more about whether he's doing something no one has done before.

He also shared observations about the work habits of exceptionally smart people he's known — we hope you'll find them valuable.

🟢 Part 1: The Story of Building Vozo

01:53 How was Vozo, the one-click video remixing tool, built?

04:34 Rewrite feature: modify videos with a simple prompt

05:15 After Rewrite, launching the Translate feature

06:24 First impression of Vozo: remixing Leonardo DiCaprio's scene from The Wolf of Wall Street

07:08 Vozo's new LipSync feature

09:52 Thinking through the product development-to-launch path

12:22 Among other video tools, preferring HeyGen and Dzine

13:31 Two Product Hunt launches: the biggest value was refining the product and rapidly completing cold start

14:43 Product Hunt launches: product innovation vs. operational tactics — which is more decisive?

16:00 Besides Product Hunt success, what else was done right?

17:20 "PMF is a feeling"

18:33 Vozo launched in July 2024 — why start early but enter AI relatively late?

19:50 "Detours" taken in entrepreneurship

23:04 Funding situation + team size

24:15 A 40-person team, $1M ARR, break-even through another already-revenue-generating app

25:03 Not an app factory model; existing app will integrate with Vozo

27:58 The relationship between Vozo's launch and Sora

31:13 Features users don't perceive but require heavy investment: translation length across languages, lip-syncing, emotion

35:09 Approaches to challenges: engineering solutions + product solutions

37:44 Technological progress and breakthroughs making the impossible possible

40:33 "Models as products, swallowing features that were once ornamental add-ons on top of models" will likely happen, but applications will have their own moats

42:21 Promising direction: glasses + low-latency large models

🟢 Part 2: From Google X to Founding Vozo

43:39 From Google X to entrepreneurship: from pursuing research breakthroughs to getting very down-to-earth

47:48 Google X's "ideal state": recruiting broadly, almost no budget constraints

50:39 First entrepreneurial experience: using VR to let people in different places meet

52:13 Reflections on the VR startup: even with Apple Vision Pro, getting ordinary people to adopt it still requires a complete commercial chain

54:25 Inspiration for building Vozo: starting from doing it yourself

58:03 From persuading users to use your product, to observing on the ground and interviewing users about their needs

59:58 After interviewing over a dozen MCNs in Hangzhou, built a live streaming box — but still had business problems

1:00:54 Getting even more down-to-earth: building a teleprompter app

1:02:49 Years of research insights vs. down-to-earth apps: a sense of dissonance

1:04:00 Still wanted to try something bold, so built a research lab

1:06:17 From Google X researcher to teleprompter app builder: "I hadn't done these things before, and honestly I was pretty excited doing them"

1:09:09 Looking at problems from a business perspective rather than technological breakthroughs: 1) be a good product manager, 2) understand the market

1:11:27 Shedding your ego: after being wrong a few times, you learn what to do

1:12:16 Behavioral change: used to do everything yourself, now what's more important is letting go

1:14:04 New understanding of entrepreneurship after letting go

1:17:12 Always making unexpected decisions: undergrad in management, grad school in computer science, PhD in research — sunk costs don't matter

1:19:58 Goal of entrepreneurship: impact more people; using video to transmit information is inevitable, hoping to be someone who pushes it forward

1:21:51 Top performers vs. less top performers: what's the difference in work habits?

1:25:01 How to distinguish what matters most? How do you keep learning?

1:30:19 Vozo Chinese version launching soon


Welcome to subscribe to the "Crossroads" podcast 🚦 We follow the new industry shifts and entrepreneurial opportunities brought by the new wave of AI technology. Crossroads is Steve Jobs' metaphor for Apple — standing at the intersection of technology and liberal arts, where great products are born. AI is transforming industries across the board. We seek out, interview, and bring together "active doers" of the AI era. Together with them, we explore and embrace new changes and new possibilities.

👦🏻 Host Koji: Co-founder of The Fair and Tangping Island. I believe technology, especially AI, will fundamentally transform society and empower humanity. Welcome to chat with me, bounce ideas, and connect on what's next. Koji's Jike, Koji's website

👧🏻 Host Ronghui: Works at a tech VC, former Silicon Valley correspondent for Yicai. Ronghui's Jike

Welcome to join the "Crossroads" member group ☀️ First-hand AI news and insights

👫🏻 We encourage you to date / make friends / find future kindred spirits

🦀 Add assistant on WeChat to join: Rwkfbcianvd, or scan the QR code below

🎄 This podcast is supported by "The Fair Sound Forest Podcast Program."