
How to Build a "Down-to-Earth" AI Startup: From Google X Researcher to $1M ARR in 6 Months — A Conversation with Changyin Zhou, Founder of Vozo
March 2, 2025
Last week, we used 20 questions to map out the past year of AI video models with Barkley, the sole product manager at Luma.ai.
This week, we invited Changyin Zhou, founder of Vozo — a one-click video remixing tool — to share the story of building Vozo: it can redub videos, translate and edit them, topped Product Hunt for three consecutive days at launch, and hit $1 million ARR in six months.
In this conversation, we also used Vozo as a thread to dive deep with Changyin about his previous work at the famed Google X lab, his first startup in Silicon Valley, and how he began his entrepreneurial journey back in China with the surprisingly down-to-earth "live streaming box."
We had assumed that leaving behind established ways of working would require tremendous courage. But Changyin told us that compared to sunk costs, he cares more about whether he's doing something no one has done before.
He also shared observations about the work habits of exceptionally smart people he's known — we hope you'll find them valuable.
🟢 Part 1: The Story of Building Vozo
01:53 How was Vozo, the one-click video remixing tool, built?
04:34 Rewrite feature: modify videos with a simple prompt
05:15 After Rewrite, launching the Translate feature
06:24 First impression of Vozo: remixing Leonardo DiCaprio's scene from The Wolf of Wall Street
07:08 Vozo's new LipSync feature
09:52 Thinking through the product development-to-launch path
12:22 Among other video tools, preferring HeyGen and Dzine
13:31 Two Product Hunt launches: the biggest value was refining the product and rapidly completing cold start
14:43 Product Hunt launches: product innovation vs. operational tactics — which is more decisive?
16:00 Besides Product Hunt success, what else was done right?
17:20 "PMF is a feeling"
18:33 Vozo launched in July 2024 — why start early but enter AI relatively late?
19:50 "Detours" taken in entrepreneurship
23:04 Funding situation + team size
24:15 A 40-person team, $1M ARR, break-even through another already-revenue-generating app
25:03 Not an app factory model; existing app will integrate with Vozo
27:58 The relationship between Vozo's launch and Sora
31:13 Features users don't perceive but require heavy investment: translation length across languages, lip-syncing, emotion
35:09 Approaches to challenges: engineering solutions + product solutions
37:44 Technological progress and breakthroughs making the impossible possible
40:33 "Models as products, swallowing features that were once ornamental add-ons on top of models" will likely happen, but applications will have their own moats
42:21 Promising direction: glasses + low-latency large models
🟢 Part 2: From Google X to Founding Vozo
43:39 From Google X to entrepreneurship: from pursuing research breakthroughs to getting very down-to-earth
47:48 Google X's "ideal state": recruiting broadly, almost no budget constraints
50:39 First entrepreneurial experience: using VR to let people in different places meet
52:13 Reflections on the VR startup: even with Apple Vision Pro, getting ordinary people to adopt it still requires a complete commercial chain
54:25 Inspiration for building Vozo: starting from doing it yourself
58:03 From persuading users to use your product, to observing on the ground and interviewing users about their needs
59:58 After interviewing over a dozen MCNs in Hangzhou, built a live streaming box — but still had business problems
1:00:54 Getting even more down-to-earth: building a teleprompter app
1:02:49 Years of research insights vs. down-to-earth apps: a sense of dissonance
1:04:00 Still wanted to try something bold, so built a research lab
1:06:17 From Google X researcher to teleprompter app builder: "I hadn't done these things before, and honestly I was pretty excited doing them"
1:09:09 Looking at problems from a business perspective rather than technological breakthroughs: 1) be a good product manager, 2) understand the market
1:11:27 Shedding your ego: after being wrong a few times, you learn what to do
1:12:16 Behavioral change: used to do everything yourself, now what's more important is letting go
1:14:04 New understanding of entrepreneurship after letting go
1:17:12 Always making unexpected decisions: undergrad in management, grad school in computer science, PhD in research — sunk costs don't matter
1:19:58 Goal of entrepreneurship: impact more people; using video to transmit information is inevitable, hoping to be someone who pushes it forward
1:21:51 Top performers vs. less top performers: what's the difference in work habits?
1:25:01 How to distinguish what matters most? How do you keep learning?
1:30:19 Vozo Chinese version launching soon
Welcome to subscribe to the "Crossroads" podcast 🚦 We follow the new industry shifts and entrepreneurial opportunities brought by the new wave of AI technology. Crossroads is Steve Jobs' metaphor for Apple — standing at the intersection of technology and liberal arts, where great products are born. AI is transforming industries across the board. We seek out, interview, and bring together "active doers" of the AI era. Together with them, we explore and embrace new changes and new possibilities.
👦🏻 Host Koji: Co-founder of The Fair and Tangping Island. I believe technology, especially AI, will fundamentally transform society and empower humanity. Welcome to chat with me, bounce ideas, and connect on what's next. Koji's Jike, Koji's website
👧🏻 Host Ronghui: Works at a tech VC, former Silicon Valley correspondent for Yicai. Ronghui's Jike
Welcome to join the "Crossroads" member group ☀️ First-hand AI news and insights
👫🏻 We encourage you to date / make friends / find future kindred spirits
🦀 Add assistant on WeChat to join: Rwkfbcianvd, or scan the QR code below
🎄 This podcast is supported by "The Fair Sound Forest Podcast Program."