A Three-Way Dialogue on AIGC Creativity, Product, and Investment: Will the Future Pixar Be Born Inside an AI Company?

April 22, 2025·7·0

At the end of last month, OpenAI announced a stunning update that sent ripples across the internet: a brand-new image generation model integrated into GPT-4o. 4o doesn't just offer stronger editing and compositing capabilities — it also weaves deep textual understanding into image generation, replacing complex creative workflows that previously required stitching together multiple AI models and tools with a single line of text instruction. There's no doubt: we are witnessing the dawn of an entirely new visual era.

Since Sora's release as a starting point, AI content generation has undergone staggering development — from initial seconds-long clips to today's higher-resolution, more powerful and controllable long-form video generation with precise editing capabilities. AI is gradually becoming the core engine of content production.

In this episode of ZhenTalk, we've invited Barkley, a product manager at Luma.ai, a leading Silicon Valley video generation startup, alongside two of China's most active top AIGC creators: Haixin and A Wen. This is a cross-disciplinary conversation spanning creative, product, and investment perspectives, structured around three themes:

First, an overall observation and technical retrospective on the current AI video generation industry. How do frontier creators understand and leverage video generation tools? How will next-generation AI creative workflows shift with 4o's release? Where will the industry's next breakthrough come from? And how far is the AI video field from its AGI moment?

Guest Introductions

04:45 Haixin: Filmmaker turned AI content creator, using AI-generated video to build games

05:46 A Wen: PPT designer making striking collage-style animations with AI

07:17 Barkley: Product manager at Luma, formerly at TikTok

After 4o Image Generation's Release

11:38 Deconstructing image layers: generating transparent PNGs, unlocking productivity

13:26 Google Gemini vs. GPT-4o image generation experience comparison

20:13 How autoregressive models reversed previous diffusion model advantages

22:40 How far apart are pixel distance and semantic distance?

30:23 4o converts images into language — visual understanding matters as much as visual generation

32:15 Foundational model research ultimately drives change at the technology, application, and product layers

33:23 Domestic video models deliver real productivity gains; expectations for Sora were actually too high

Stunning Moments in AIGC Progress Over the Past Year

34:42 From Sora to Keling AI, from Midjourney to Google Whisk to GPT-4o

36:46 When unified models become powerful enough, workflows get replaced outright

38:22 What 4o still can't do: highly customized assets won't extend, face-swapping only recognizes celebrities, etc.

42:42 Building an "agent" for video generation may be premature

AIGC Isn't Just Cost Reduction — It's a New Art Form

45:05 Every model has different strengths; subscribing to all of them gets expensive fast

48:37 Complex shot composition and audiovisual language require sufficient data and training time

54:54 Production needs unmet today will likely be solved sooner than expected

57:20 For learning AI creation, go straight to primary sources and filter out information noise

59:57 Has life become happier since AI emerged? AI isn't just about efficiency — it's a new art form

01:03:02 AI video generation applications: response speeds in 3D animation, film, and advertising industries

What Is the AGI Moment for Video?

01:13:13 Single-point tools like rotoscoping are most vulnerable to AI disruption

01:14:24 Adobe is actually used more than before: AI handles the messy work, pros stitch it together

01:16:30 The future Pixar might be born inside an AI company

01:18:22 Creation is no longer a privilege reserved for the wealthy and powerful

ZhenFund's 21st cohort of ZhenInterns is now recruiting. Partners mentor directly, with dual experience in entrepreneurship and investing, plus conversion opportunities. Interested candidates may apply at zhenfund.jobs.feishu.cn

Luma AI is currently hiring for AI data roles — data PM/engineer and model infra positions are open, with remote work available from China. Self-referrals and recommendations welcome via email: barkley@lumalabs.ai

The puzzle game Haixin mentioned at the beginning, aka Rust Shanghai — Xiaohongshu

Also follow Haixin and A Wen and their future creations: Weibo, Xiaohongshu, Channels, and Jike: search "海辛Hyacinth" and "Simon_"; X: search "ring_hyacinth" and "simonxxoo"

Executive Producers: Jiamin, Zoe, Wendi

Post-production: Yanaego

ZhenTalk is a general business podcast produced by ZhenFund, where the investment team shares the latest hot topics and industry insights with leaders across various fields.

Founded in 2011, ZhenFund is one of China's earliest angel investment institutions. Since its inception, ZhenFund has been actively seeking out the most outstanding entrepreneurial teams and era-defining investment opportunities in artificial intelligence, chips and semiconductors, robotics and hardware, healthcare, enterprise services, new energy, cross-border expansion, consumer lifestyle, and beyond.

ZhenFund — Your First Stop for Entrepreneurship!