Large Models Are Just the Beginning: Toward a Future of Multi-Agent Systems and Close Human Collaboration

May 29, 2023·1·2

On OpenAI's official YouTube channel, a video of a small red figure chasing a small blue figure in a game of hide-and-seek has the highest view count.

Under reinforcement learning algorithms, these AI agents — Red and Blue — play games endlessly in a virtual world. At first, Blue only knew how to hide. But after tens of thousands of rounds played day and night, they began discovering strategies, learning to cooperate, and even developing countermeasures.

In this first episode, we're joined by Wu Yi, an assistant professor at Tsinghua University's Institute for Interdisciplinary Information Sciences and a member of the hide-and-seek project team. Before returning to China to teach in 2022, he spent a year and a half at OpenAI. In his current office, fascinating AI experiments continue: some training AI to play games, others commanding robot dogs to chase balls — all with the shared goal of building a general-purpose AI that can interact with humans.

In this episode, you'll hear: What are the differences between academia and industry, and between Chinese and American companies, in AI research? Why use games as a subject for AI research? What is robotics' GPT-3 moment? How can ChatGPT help robotics? And how should we think about AI safety and alignment?

Host: Yusen Dai, Managing Partner at ZhenFund

Guest: Wu Yi, Assistant Professor at Tsinghua University's Institute for Interdisciplinary Information Sciences

Timeline

01:59 Playing games, commanding robot dogs to chase balls — what Wu Yi's team is working on

03:42 ChatGPT can't do everything; it's just the starting point

10:46 How OpenAI's research approach differs from traditional academia and industry

11:53 How should we view OpenAI's transition from a non-profit to a for-profit company?

14:45 Will ByteDance produce China's leading large language model?

17:38 AI researchers love studying games because games are sufficiently complex simulated worlds

30:31 Robotics' GPT-3 moment: a robotic hand solving a Rubik's cube

38:28 AI can write novels and play games, but it can't hand you a cup of coffee

50:27 Adding some uncertainty to large models to prevent them from confidently hallucinating

55:11 In the future, human work may all be about generating data for AI

58:40 The startup team Wu Yi is currently preparing

Related Materials

Wu Yi's Tsinghua homepage

Multi-Agent Hide and Seek

"This little AI later learned some weird tricks, and when we saw them, we crashed a second time" | Wu Yi, 811th speaker at YiXi

Production

Post-production: Chong'er

Contact Us

WeChat Official Account: ZhenFund (ID: zhenfund)

Listening platforms: Xiaoyuzhou | Apple Podcast | Ximalaya

Email: media@zhenfund.com

If you have any suggestions or expectations for the show, feel free to leave a comment and interact with us~