如何在 AI 时代取胜:每周发布一个功能、接受技术债、无情削减范围,并创造竞争对手无法复制的魔法 | Gaurav Misra(Captions CEO 兼联合创始人)
How to win in the AI era: Ship a feature every week, embrace technical debt, ruthlessly cut scope, and create magic your competitors can’t copy | Gaurav Misra (CEO and co-founder of Captions)
Interview Transcript
Gaurav Misra: There’s rarely a time like this where so much is possible. Even like five, seven years ago, it’s so hard to start a company. Everything feels like it’s done, someone else is working on it. Suddenly, it’s a time, right now, which I’ve never even experienced, where everything you try just works.
Lenny Rachitsky: With people constantly hearing about all the things happening. Is there any tools or processes or approaches you’ve figured out to help stay focused?
The Golden Age of Startups
Gaurav Misra: Our engineering goal is every engineer should ship a marketable product every week.
Lenny Rachitsky: I love just how wild that sounds. How do you maintain quality and make it all cohesive?
Startups Should Embrace Tech Debt
Gaurav Misra: I actually think as a startup your job is to take on technical debt because that is how you operate faster than a bigger company. Bigger companies don’t take contact technical debt, they pay it usually right away, or they’re paying back technical debt from the days when they were a startup.
Lenny Rachitsky: Is there anything else that in how you operate and the way you build product that you think is really unique and interesting?
Public vs. Secret Roadmaps
Gaurav Misra: We have what we think of as the public roadmap. This is basically what people have asked us for. There’s all these surface areas where we receive user feedback, but these are all features that every competitor knows about. If a user is asking us for it, they’re asking everybody for it.
It’s not going to be a game changer in terms of winning against your competition. So we have a second roadmap which we think of as a secret roadmap.
Guest Intro: Gaurav Misra
Lenny Rachitsky: Today my guest is Gaurav Misra. Gaurav was an early employee at Snap where he led the design engineering team, which he explains in the conversation. He’s also an engineer at Microsoft and a couple other companies. Most recently, he’s the co-founder and CEO of Captions, one of the most successful and cutting-edge consumer AI products, which lets you generate and edit talking videos with AI. They have over 10 million users and have raised over a hundred million dollars.
In our conversation, we essentially do an archeology of how a modern AI oriented startup operates, including how every single engineer at their company ships a marketable product or feature every single week. Why they have a secret roadmap, in addition to a regular roadmap. We also get in-depth about how Snap as a product team operated. What he’s learned about what it takes to build a successful consumer and social app, why they had no PMs and how designers ran the show, which may or may not have been a great idea. And also what happens in a world where AI video is so good that you have no idea if it’s real or not. This episode is for anyone that is building a product on top of AI. If you enjoy this podcast, don’t forget to subscribe and follow it in your favorite podcasting app. And also, I just launched an insane deal for subscribers of my newsletter. Every yearly subscriber now gets a year free of Notion, Perplexity, Superhuman, Linear and Granola. Learn more at lennysnewsletter.com. With that, I bring you Gaurav Misra.
Leading AI companies like AI21, you.com, 11X, and Copy.ai are already shipping new integrations seven times faster with Paragon keeping their engineers focused on core product development. Ready to accelerate your AI roadmap this year, visit useparagon.com/lenny to get a free MVP of your next product integration.
Gaurav, thank you so much for being here and welcome to the podcast.
Gaurav Misra: Thank you. Thanks for having me. Excited.
The Era of Infinite Possibilities
Lenny Rachitsky: I very rarely have early stage founders on the podcast, but I wanted to chat with you because you’re at the center of so much of what is top of mind for a lot of builders these days, AI and video, and just consumer and social apps.
Also going viral and finding new marketing channels. So I think there’s a lot that people can learn from the way you approach product, the way you’ve built product and the way you just think about where things are going. So again, thank you for being here.
Gaurav Misra: Appreciate it. Honestly, it’s an exciting time. I got to say like there’s rarely a time like this where so much is possible. In normal times, if you think about even five, seven years ago, it’s so hard to start a company. It’s so hard to come up with an idea. It’s just like everything feels like it’s done, someone else is working on it. Or it’s like, oh it’s been tried three times and failed three times. And suddenly, it’s a time, right now, which I’ve never even experienced honestly in my career, where everything you try just works.
There’s so many possibilities. There’s not enough people in the world to work on them. Honestly. There’s more things that can be done than there’s people available to do them. It is just such a rare thing. And honestly, it’s not going to last forever. We are going to catch up to this, but just feels lucky to be part of that movement. It’s awesome.
Tech Is Ready, Attention Is Scarce
Lenny Rachitsky: When you said everything is working, I think what’s an important distinction there is the building of the tool works. The tech is now there to build all these things that have not been possible before. The thing that is increasingly, difficult, and I want to get your take on this, is getting anyone to pay attention and stick with your thing because it’s so easy to build stuff and everything is just awesome and interesting. It’s harder to get people to pay attention and stick with your product.
So I guess is there anything there, you’ve learned that you’ve built a number of successful products. We’ll talk about Snap and what you’re doing now, about just, I don’t know, what you need to think about these days to get anyone to pay attention and then stick around.
Gaurav Misra: Yeah, I mean honestly it’s a great point and I think there is a lot of hype obviously, and part of it, that’s what’s driving a lot of this growth for a lot of companies. And I think from a user acquisition/marketing perspective, in a world five or seven years ago, if you were making something novel and you went to users and it was like, “Oh, we got something better.” People are going to be like, “Well, whatever. Everybody says they got something better. I don’t care.” But today, and this is not probably the way you should do it, but you can go and just say, “We’ve rethought this thing with AI.” And a bunch of people will just be like, “Well, how?” Or “Maybe I should check this out.”
They’ll just try it. Obviously, you have to deliver on the promises. If you don’t deliver, people will come in, they’ll play around a bunch and then just leave. But if you can truly deliver on the promises, there’s great opportunities to require users at scale. So I think that’s slightly different. And I don’t know how long that lasts, but it is definitely a different time from that perspective. I do think also at the core of building products is solving problems. I think a lot of people sort of get caught up in this, well, it’s cool and people will come for the cool right now. People will come in and be like, “Well, let me check it out. It’s cool.”
But at the end of the day, if you’re just building a playground and people play around in the playground and then they leave after playing around, it’s not a business. So I think that is still key. You have to be solving real problems.
Keeping Users Engaged
Lenny Rachitsky: As we were talking, I’m thinking about every day there’s something that would maybe a few years ago be news for a year. Holy shit, this is now possible. Now it’s like every day something like that happens and then we’re like, all right, so what I think about is like, we’ll have AGI one of these days or super intelligence and everyone’s going to be, “Oh, amazing.” And then, “Okay, what’s for dinner?”
Prioritization: Validating Demand via Virality
Gaurav Misra: Isn’t that already happening? Think about, in a way, I self-reflect on this sometimes if like, you’ve seen Iron Man and stuff, they have the J.A.R.V.I.S thing and you’ve seen Interstellar and they have the TARS machine. They’re talking back and forth with these things like bouncing ideas. That is science fiction. That’s literally science fiction. Okay, it’s not perfect, but it exists in a way that nobody could have imagined. That’s science fiction has become reality and I feel like nobody cares.
In a way, you would’ve expected the world to be turned upside down, but it feels like almost in a way so slow and people are like, yes, adoption is happening, but I feel like it’s almost a shocking development in a way.
Lenny Rachitsky: It feels like you guys have done a good job staying top of mind and continuing to get people excited because to your point, there’s so much happening. How do you get people to continue to be like, “Oh, okay, wow, what their building is actually is interesting and continues to be interesting.”
Anything you’ve learned about just what it takes to stay top of mind and continue to pull people back and get people re-excited over and over?
Shipping Marketable Features Weekly
Gaurav Misra: Hundred percent. I mean, I think honestly it just comes down to not just AI for the sake of AI or AI for the sake of excitement or hype or novelty or whatever that is, it’s actually effective AI like AI that solves real problems, practical problems. And the fundamentals haven’t changed. In a way, there’s three steps to building products. You identify a user problem, you apply some technology to solve that problem, but then finally you have some mechanism to find people who have that problem. If you can do all three of those things, then in any environment you can create great products. But I think right now what’s different is so much is changing on the technology side that you can create products that could not have been created before and solve problems that could not have been solved before. And that’s creating the opportunity.
And for us, especially in the video space, it’s truly endless. We’ve just begun, our goal specifically for video is not to build professional tools. We’re not building for professionals at all. We’re building for the person who could not have created video before. They didn’t have the tools, the skills, the means to be able to create video and now they can because they’re able to jump over that skill gap or that time gap. Maybe they’re business owners, they don’t have time, they want results, and honestly a lot to solve there just tons.
Speed Through Scope, Not Quality
Lenny Rachitsky: Solve people’s problems. Easier said than done, but it’s a good reminder. In the end that’s all that matters. Something that I always think about with people in your shoes is just how do you not get overwhelmed and how do you know what to pay attention to? How do you stay focused?
Any tips there for folks that are just reading every day, a new announcement, and then just like I just, how do I? What do I do? There’s too much.
Gaurav Misra: It is the new problem of product development in a way. There’s too many possible paths you can go down. There’s too many ideas, there’s too many things you could do. And I mean obviously, prioritization is always an important skill set and has always been, but it’s become an even more important skill set right now because you have to figure out what not to pay attention to. Our general framework for it is to look for user demand, and actually the easiest way to check for user demand is to just see what has virality.
Usually, what has virality and what people want to share and talk about, there’s something at the core of it that actually is interesting. Now, it may not always be interesting in a way that’s like maybe it’s a one-time use case. Maybe it’s not something that people would do repeatedly. Maybe it’s not something you could build like a subscription business off of, but oftentimes there’s some things, some core element of it that has resonated with people. And if you can identify that core and then mold it into fitting into your business, it’s actually a great way to identify what actually works. And we have these tools right now. We don’t have to build anything. You can just kind talk about it and people will share it, share the idea.
And you can measure how well the product might be received even before you built anything. So it’s a great tool we use for prioritization. We spend a lot of time on social media. Obviously, our app is often used for social media, so a lot of our employees will spend a lot of time on social media. We look at what the trends are, what’s happening, and based on that we can get a pretty good read of what might resonate well with people.
The Seemingly Infinite Roadmap
Lenny Rachitsky: So as a leader of a company with people constantly hearing about all the things happening, is there any tools or processes or approaches you’ve figured out to help people continue, stay focused, not get excited about every shiny new object and actually ship things? I
Gaurav Misra: I mean honestly, it’s all about incrementality in a way. I think we do aim to ship every week. Our engineering goal is every engineer should ship a marketable product every week. And so what’s a marketable product is a product that you can show to users and the user might subscribe or pay for the app just for that or come to the app essentially just for that. And that’s why table stakes features, let’s say we’re talking about word processor or something. If you had auto format or just table stakes stuff like justify alignment or something, no one’s going to come to your word processor for justify alignment. You can market that because it’s obvious, of course it exists, but if you did something unique that nobody else has done, you can go and show that to people and people will come to your app just for that.
And even if your app doesn’t have a lot of the obvious stuff, maybe it doesn’t have justify alignment, people will jump over that just to use these new tools and new abilities that you might be building and marketing. So we try to do every engineer one marketable feature per week, and a lot of that stuff may not work, but a lot of it does work and we can figure out obviously, where to put in more effort, things that start to work, we double down on those things, build more. People often complain because think about it, in one week where we’re shipping, it’s not complete, it’s MVP, truly.
And we slice the hell out of it. We take the design and we cut, cut, cut until we can really say that it’s going to be useless if we cut anymore. We get that out and people come in. And if things are going well, people will use it despite all the problems that it might have, and now people will complain and we’ll have a list of problems and we know what to do next. That’s a starting point essentially. As long as we’re shipping one a week, we get a ton of volume of features and products and directions we’re releasing, cut a lot of that. What remains expand from there. So it works really well, and it keeps people focused.
Long-Term Projects and Backend Work
Lenny Rachitsky: I love the simplicity of that. I love just how wild that sounds for a lot of companies I imagine. Every engineer ships a marketable feature or product every week.
Gaurav Misra: Yes.
AI Tools and Engineering Efficiency
Lenny Rachitsky: There’s some people listening to this and are just completely stressed out by this idea and there’s some people listening who are like, this is exactly how I want to work. This is how every company should build.
Gaurav Misra: Yep.
Unique Product Development Process
Lenny Rachitsky: How do you maintain quality and make it all cohesive? I imagine that’s the big trade-off. Just, any tricks there for folks that want to maybe start operating this way.
Public vs. Secret Roadmaps
Gaurav Misra: Quality is not something you compromise on most of the time. I think yes, there’s strategic compromises in quality, but most of the time what you want to do is have a bar for quality where people should come in and if they’re using the feature, it should work, right? Of course. And the way to cut down on time, and I think this is a mistake people make a lot of the time, is when time is being pressured downward, a lot of times engineers, PMS, designers, they will cut on quality rather than cutting on scope. And actually you can cut on scope. It’s actually, the method that we use is we look at every element that’s going to take any time to build and we just say, what if we remove this? Is the product still useful?
And we keep repeating that until we remove whatever’s left and we say it’s going to be useless at this point. And that becomes the one-week project, right? It actually really works. It narrows down to the core of what you’re really trying to ask. So for example, let’s say we wanted to build something to add an image on your video or something like that, and this is a really basic idea. I just made it up right now. And you might imagine a design in which you import your image from your camera roll, but before it lands in your video you might want to remove the background. You might want to change the hue and saturation or something like that. And you might expect a designer to design all of those features and you let it design, but you really quickly realize that you can cut all of that stuff.
You can cut the background or you can cut the hue saturation. All you really need is pick. And then there might be a picker. We need a picker with a library, with a lot of different type. What if you want to pull from the cloud? What if you want to pull from the drive or something like that? Cut all of that, right? And essentially come down to the core, which is just native picker from the camera, lens, straight in the video, no UI. And that is already, that should be useful. If that’s not useful, then anything else built on top of that is also useless. So that’s how we might go about it.
Lenny Rachitsky: That last sentence is so key to this. It’s the core idea of ship small iterative features before you invest a lot in something to figure out is there anything there, is this worth spending weeks on?
Secret Roadmap Examples
Gaurav Misra: Totally. And I think the coolest part of this method is the first thing that the users will come in, they’ll use the thing, they’ll import images and the first thing they’ll complain about is what bothers them the most? Is it human saturation? Is it background removal? Is it picking from the cloud? You’ll just get the most complaints about that thing.
People will be like, and people will be honest about it or they’ll be like, “This sucks. It doesn’t even have background removal. What kind of image thing is this?” And you have to take that feedback and just next week you can ship in a single week all the things that the user’s complaining about.
AI-Assisted Brainstorming
Lenny Rachitsky: And then they’re like, wow, this team is shipping like crazy.
Snap as a Camera Company
Gaurav Misra: Yes. Exactly.
Why Focus on Snap
Lenny Rachitsky: Solve all my problems. So responsive. This connects a common sign of product market fit, which is when people are complaining about the thing that means they actually care enough to complain and that’s a really good sign if they’re complaining about something.
Snap’s PMF and Continuous Innovation
Gaurav Misra: It’s very true, very true. If nobody complains, it’s almost red flag.
Snap’s Unique Design-Led Architecture
Lenny Rachitsky: A lot of this is turning into an archeology of a modern product team and startup. So I want to keep digging. This is not where I was planning to go, but this is awesome. I love that this approach of every engineer shipping something every week that’s marketable connects directly to where I started this conversation, which is how do you stay above the noise?
And part of the answer is just ship stuff constantly, and just continue to impress people. Like, “Here’s a new amazing video feature.” “Look at this thing.”
Gaurav Misra: Exactly. Yep. I think it’s definitely key, right? And there’s enough area and enough scope for that to happen. I think truly in normal times it may not be possible to create that much roadmap that quickly, but I think because there’s so much innovation underlying all this, there is that scope available. The roadmap almost seems unlimited, just truly.
Single Decision Maker in Consumer Products
Lenny Rachitsky: Okay. The other question I imagine people would be wondering is how do you work on longer-term projects that take many weeks? There’s also infrastructure, I guess, back-end stuff. So maybe answer those questions.
How do you think about long-term stuff and then how do you deal with back-end stuff that isn’t a feature that anyone would care for?
Gaurav Misra: Yep. Usually, we’ll dedicate time to that separately. For example, usually Q4 for us is infrastructure quarter. We just go and build all the infrastructure. Q4 is generally, we’ve already delivered a ton of products and stuff. We’re feeling pretty good about the rest of the year. Things are winding down. Obviously, holidays and stuff coming up. And so we spend all that time paying the technical debt.
I actually think there’s a unique thing to think here about technical debt in general. And as a startup, your job is to take on technical debt because that is how you operate faster than a bigger company. Bigger companies don’t take on technical debt, they pay it usually right away. Or they’re paying back technical debt from the days when they were a startup, and they took on a lot of it. I mean Snaps, I used to work at Snap and there was a lot of examples of that over there, and I’m sure it happens at every other company.
And we think about it as like, well, is this a problem we need to solve today or is this a problem that the 50th engineer or the hundredth engineer or the 500th engineer can solve? And if it is a problem that a future engineer can solve, we should use that future engineer now. Essentially, that’s what we’re doing. And we’re saying we’re going to push this to somebody in the future. And by the way, if the company fails, that engineer will never be hired and all this won’t matter anyways. So it’s like financial debt in many ways. Financial debt is taken on to create leverage. It can be a good thing like if you’re buying a house, you take on debt and you can buy something probably more than you can afford without taking on debt.
And it’s the same thing. You can create products that you wouldn’t be able to build with a small team that you have by taking on strategic technical debt. It’s very positive actually.
The High Bar for Snap Designers
Lenny Rachitsky: Wow, this is such a cool idea. And where my mind goes is that future engineer may be an AI agent engineer.
Design Engineering: Merging Design and Engineering
Gaurav Misra: Exactly, yeah.
Lenny Rachitsky: Just solving problems, just on technical debt in you.
Internal Comms: Prototyping for Alignment
Gaurav Misra: Exactly. Some engineer in the future Five-hundred engineer many years from now will get a promotion because they solve this big problem that those really bad early engineers created.
Prototyping and Rapid Validation
Lenny Rachitsky: So obviously, there’s a line to this. There’s only so much debt you can take on before you become a big problem.
Is there any thoughts on just that balance of just how much is too much and how if it’s enough for a net feature engineer or just-
Gaurav Misra: Yeah, I mean I think generally the rule of thumb is every piece of debt that you take on you have to pay interest on. So if there is debt that you’ve taken on, there’s 1% or 2% of your time that is going to be taken away every day in maintaining bugs and issues and restarts and crashes and things that are happening with that. Because you did it the fast way, something’s going to go wrong with it. Every day. 1% of your time will be taken away. If you take on enough debt, you’ll be paying 80 or 90% interest and you’ll not have any time to do anything new. You’ll just be paying interest. That’s all.
And that’s when you get into the mode of like, oh, we’re just keeping the lights on. We don’t have any engineers to do anything. We’re just keeping the lights on. That’s the failure case for a startup. So in a way, you have a technical debt runway. Once you run out, once you’ve taken on too much debt. And if you haven’t delivered value in that time, enough value to hire the engineers to pay the interest or just pay off the debt, you’ll get in trouble.
Product Team Triangle: PM, Eng, Design
Lenny Rachitsky: I love that. That’s such a nice heuristic of how to think about when to invest in something. I don’t want to go down this too far, but just a thought I have is … because sometimes there’s big technical decisions you got to make that impact the way everything builds or is built in the future. I imagine those you spend more time on and take really seriously.
PMs Should Understand Marketing
Gaurav Misra: Definitely. Yeah, I mean I think as long as it’s possible for wherever it’s like a two-way door, you can do whatever you want. I mean this is a classic methodology. If it’s A one-way door, it’s worth thinking about and sort of doing correctly at least as much as the one-way door would matter to you in the future.
User Research and Ev’s Product Philosophy
Lenny Rachitsky: How much do your engineers use Cursor and tools like that to build? How much is AI helping your team move?
The Boundaries of Snap’s Mission
Gaurav Misra: A hundred percent, yeah. I mean everybody’s using it. It’s super helpful. I mean even I’m using it honestly. Yeah, it’s a huge multiplier for the team, no doubt.
Lenny Rachitsky: And is a Cursor specifically. Is there anything else that you guys found useful?
The Short-Video Opportunity Snap Missed
Gaurav Misra: Yeah, we are using Cursor. Yep. We’ve tried all the different tools. We were using Devin as well, which is another, you know? That’s more advanced, I guess. It’s solving bugs for you.
Lenny Rachitsky: Yeah, Devin’s basically, I think it’s 500 bucks a month and it’s like an AI engineer that you just chat within Slack.
The Bifurcation of Social Media
Gaurav Misra: Exactly, yeah. In a way, these are the types of things that us as a startup can do that bigger companies can’t just, you know, they can’t just pull in Devin. They have to get 30 lawyers in the room first before that happens.
Lenny Rachitsky: And they’re all called Devin, these are like agents. Everyone’s going to have hundreds of Devins working at their company.
AI Video and the Authenticity Crisis
Gaurav Misra: Exactly. You can have multiple Devins. I actually heard you can have a manager of Devins who’s managing Devins.
Lenny Rachitsky: I love that managers are all getting layered, like unlayered and then they’re going to have AI managers. That’s the ultimate bait and switch.
The Authenticity Challenge of AI Video
Gaurav Misra: Yep.
Lenny Rachitsky: Okay. Is there anything else that in how you operate and build the way you build product or set up the way you build product that you think is really unique and interesting that other people might be able to learn from?
The Bifurcation of Tech Paths
Gaurav Misra: Our process is a bit interesting in that way. We have a design team, we have a PM team. We’re very early on those teams right now. And obviously, we have engineering. And we have all the different surface areas. So iOS, Android, web. There’s backend team, machine learning team, research team. So generally, when we’re developing products, we may start off with a PM first approach where we’re finding some sort of overall issue that we want to take on some new area or pillar we want to take on and then creating sort of product specs from there.
But a lot of times we’ll also start the opposite way. We’ll first design something without even having any idea of what or why we’re doing it, but we’ll design a bunch of different things and then we’ll sit down with the PMs and look at the designs and just go over one and the next and the next until we find interesting things and ideas that pop out of that.
And a lot of times that leads to us discovering things that we wouldn’t have discovered if we were just too focused on the metrics and the numbers and things like that. So it’s almost reversing the process a little bit and starting with design first, but it can often result in finding unique ideas basically. I also think that we have a unique setup in how we create our roadmap. So normally you have a single roadmap and we actually divide a roadmap into two different roadmaps. So we have what we think of as the public roadmap. This is basically what people have asked us for. So there’s all these surface areas where we receive user feedback and we look at all that feedback and people will ask for features. They’ll ask for, I want background removal, I want to undo and redo, I want to upload longer videos, whatever it is, a bunch of different features.
And we’ll just make a list of that. And just like anything else, we’ll prioritize it and we’ll look at how many people it affects and what the possible markets are and just get it done basically one at a time.
But these are all features that every competitor knows about. These are public. If a user’s asking us for it, they’re asking everybody for it. And every team has essentially more or less the same list and everybody’s prioritizing it. And yeah, sure you can win a little by extra nicely prioritizing it or winning a little in prioritization or execution or something, but it’s not going to be a game changer in terms of winning against your competition. So we have a second roadmap which we think of as a secret roadmap. So this is a roadmap that nobody asked for anything on this like literally, nobody has ever asked for it.
And if a user were shown something on it, they might be like, “I don’t need this. I don’t know what this is.” But given our unique vantage point, our unique understanding of the problem set, the user space and the technology, we’ve come up with some special ideas that we think will completely revolutionize how something is used where we can truly change the behavior of the user. I think that’s what at is at the core of. It’s like people are doing things one way if we’re able to show them another way. And once they try it, they never go back. That’s what a product is, that’s success. And those are the types of ideas that we put on the secret roadmap. These are things we never talk about publicly, never tell anybody about, and we announce them and just give them to users and see the effects.
A lot of this we come up with through brainstorming. So we do actually do quarterly brainstorming, company-wide, everybody’s included like everybody from. It’s not just a product team thing, it’s like engineering, recruiting, everybody’s included in. And we all come up with marketing, obviously, everybody comes up with ideas, we vote on the ideas, rank the ideas, and then the product team takes over from there and thinks about like feasibility and technology and what the different things could be. So this is a way where we can take all that noise that people are getting, everybody’s browsing social media, seeing all these different things that are blowing up, these models and advancements and we can get all that information together and provide a unique internal roadmap where how are we going to approach and create value out of all of these different advances that are happening.
So that’s our general methodology. And a lot of times the biggest wins will come from the secret roadmap. That’s the game-changing stuff. It’s not going to be the user requests usually that are going to do that.
Lenny Rachitsky: I love just how calling it the secret roadmap makes it extra interesting. [inaudible 00:30:07]
Safety Frameworks: Documentary vs. Narrative
Gaurav Misra: Exactly, yeah. It’s a secret.
Lenny Rachitsky: It’s a secret. I’m not even going to ask you what’s on that secret roadmap. You can’t tell me.
What’s an example of feature that came out of that secret roadmap that’s been a big deal for you guys?
The Endgame of AI Video Marketing
Gaurav Misra: Tons. I mean, I’ll give you an example from a long time ago. One of the first AI features we added after the app initially took off was this feature called eye contact. So this was a feature where if you’re recording something, oftentimes people who are new to recording a video might read from a script or a teleprompter or something like that and they might have that off-screen. So it looks like you’re reading and it’s not great from the perspective of the video itself or the viewer of the video. So we had this feature where it basically shifts your eyes to look at the camera.
And we were actually the first company to build this. We worked with Nvidia on this. It’s actually really interesting because when we originally reached out to Nvidia about this. They were not sure why we needed this. And they actually gave it to us pretty openly and were excited about some sort of partnership of how can we get this technology into something that could be useful.
But we saw this creator use case which was unique, and it was one of the ideas that came out of the brainstorm and we threw it on there, we launched it. It was a huge success. I mean, I’ll be honest, the video, the ad that we made, a social media post that demonstrates this was so viral, it was made in basically every language around the world. It still till today gets millions of views. We find reposts and reposts of that thing that other people have created that get millions and millions and millions of views because people are like, “Wow, this is a great idea.”
And now it’s been copied the hell out of, I think it’s available basically on every app you can imagine. For good reason of course. But that’s one of the ideas that came out of it.
Lenny Rachitsky: You talked about how you come up with these secret roadmap ideas. I’m just intrigued by this. I’m going to spend a little more time here.
Does your team ever work with an AI LLM to help brainstorm? I imagine that’s where things will go, where you’re actually jamming. The AI agent is brainstorming along with you.
AI Ads: Rejected to Outperforming Live-Action
Gaurav Misra: Honestly, I would like for it to go there. It hasn’t gone there yet. We haven’t done that exactly, because the problem is context. And I think just the context of understanding the user, the use case, it’s so abstract. Even right now, I feel like I understand our users obviously, but I can’t exactly verbalize why that is or how that is, a little bit abstract. And I spend a lot of time with RPMs and designers imparting anything that I understand and I’ve learned over the many years I’ve been working on this, how do I impart this to them? But then it’s a challenge because I can’t even verbalize it myself. And so it’s an extra hard challenge to figure out how do I put this context? How do I make it available to an LLM when I can’t even put it into words exactly. And honestly, this is probably my own feeling but, and I need to work on this, but there is something to it.
I do remember at Snap for example, I think one of the most unique things about Snap and the CEO Evan Spiegel was that he had an unmatched understanding of the user. I think years and years and years of the company’s existence past, almost a decade. And nobody understood the user like he did. He would come up with ideas that everybody would disagree with and we would launch them and there would be hits, just hits after hits. And nobody would understand why. Everyone would line up and be like, “Great.” Round of applause for everyone, but no one knew why.
A great example of that is a lot of this was figured out in retrospect too. I think there was a point at which Snap declared that they’re a camera company and a lot of people laugh at them and said, “Camera. What are we making digital cameras or something?” Or, “Why is it a camera company?”
But I think at the core of it was this idea that Snapchat opens to the camera and that was actually the differentiator. That actually that small decision was holding the entire company against all competition because when the moment passes where your friend is doing something funny and you need to capture it, you’re not going to open Instagram or anything else because it doesn’t open to the camera. You’re going to open Snapchat because you can capture it right away. And Instagram can never copy that because all their metrics are going to go down as soon as they do that. So that is a fundamental understanding. And I figured this out much later, actually, but it’s such a powerful idea.
Dystopian Future: Fully AI-Generated Social Networks
Lenny Rachitsky: I’m glad you talked about Snap. That’s where I definitely wanted to go. This is where I was going to start. So I’m glad we circled back to your experience at Snap. So the reason I am interested in this is if you think about social networks like Snap is basically the last social network to have launched and stuck around other than TikTok, which I don’t think is a social network. I think it’s just this content platform. I don’t think you’re really interacting with people really. And that was 2011 when it launched. So it’s been like 15 years since the last social network launch that has worked.
And I think it’s interesting also because there’s rarely been a lot of insight into just how Snap operates. You were there really early. You’re a big deal at Snap. You built a lot of really important features. So I wanted to spend a little time here, and it feels like a lot of things you learn from Snap you’re bringing to your company now. So let me just ask, I think you may have answered this, but I’m curious if there’s something else here just broadly maybe other than Ev’s brain, what do you think was core to Snap being a successful consumer social product?
The Failure Corner
Gaurav Misra: There were a couple of different things that went well. I do think for a company like Snapchat or Social Network, the core product market fit can be extremely strong. Essentially, the reason that people are downloading it, the way that it’s spreading, the way that it’s distributing, the way that it’s inviting friends or sending Snaps or whatever it is, that product market fit can be so strong sometimes that it can be hard to actually build something because you actually can’t tell if what you’re building is what’s responsible for growing the thing or if it’s actually hurting it and it’s growing despite what you’re doing basically.
And I think because of that, it actually sometimes teaches people the wrong things. It teaches people that the contrarian thing that they were doing was right when it was actually just wrong and the company just grew despite it. And I think some of the things that Snap did well and it needed to do really was to continue innovating, right?
Because for a company like Snap, it has a ton of competition. Social networks are monopolies by nature and there’s a lot of reasons for Facebook or any other social network to stop the growth of Snapchat. And they tried, they tried really, really hard. And the way that Snap was avoiding that was by innovating. I think the core of it was the setup that they had, which was very unique. I’ve never seen anything like it. I’ve worked at a bunch of different companies, but obviously there’s a CEO and the CEO was very product-led, his designer himself, but he surrounded himself with the design team. That was sort of the central team in the company. And the design team was like 10, 12 people. Basically, pretty small, even at 5, 6,000 employees it was that small still.
Lenny Rachitsky: Oh wow. At 6 or 6,000 employees. The design team was, you said how many, five or six people?
Wasting a Year and a Half
Gaurav Misra: 10, 12 people.
Lenny Rachitsky: 10, 12. And to add to that, there’s no PMs really for a long time. That was before.
Attempting to Build a Social Network
Gaurav Misra: For a long time, yeah.
The Power of Network Effects
Lenny Rachitsky: Big difference.
Gaurav Misra: Initially, there were no PMs at all. PMs were introduced with monetization. Once monetization was a big sort of element, that’s where PMs came in. Today, I think there’s a ton of, or there’s an adequate number of PMs across the company, but there was a long period of time, especially when the innovation was happening, when there were a much, much smaller number of PMs and it was very designer led. But at the same time, I think that’s slightly misleading in the way that these weren’t your sort of average designers.
These were designers who were actually PMs as well. That’s what the secret sauce was. They were able to not just design but also do the PM part which is a big responsibility. It’s a lot of work, especially for that many employees, but it gave the CEO a way to have granular control over what exactly was being launched in which part of the app at all times.
Because he could meet with a set of 10 or 12 people and know every change that was happening that was user impacting. A lot of changes were being worked on that were infrastructure and types of things that keep going on in the back end where you’re improving ranking and whatever that might be, performance and things like that. And those were not usually his concern. He was concerned with what UI are we adding where? And if you needed to add UI to the app, you needed it designed. And if there’s no designers in the company, except for a handful who talk directly to the CEO, you create a very granular control over what’s being launched in the company. So everything needed to be approved by Evan. If you hadn’t approved it, it’s not going out. So the design team actually held a lot of power in that.
Rapid Fire
Lenny Rachitsky: This is awesome. So what I’m hearing partly is, I don’t know if this is true, but it feels true that to make a consumer app that is successful and breaks through, you almost need a singular mind that continues to stay in the weeds on everything. And the way Evan did that is very close to the design team who basically ran product.
Gaurav Misra: That’s very true. Yeah, it’s very true. And he was able to keep the context of the entire app in his head at the same time. He knew the interdependencies and what we’re doing and why we’re doing it. And so that gave him just very granular control over the company’s product roadmap.
Lenny Rachitsky: It makes me think about Brian Chesky and Airbnb is a consumer product, it’s not a social network, but I wonder if that’s just an interesting insight just for consumer products. They will generally do better if there’s one person with a really … the right sort of combination of experiences, insights, and just they continue to run and own every detail.
Gaurav Misra: Definitely. And also the ability to bring about change, the ability to truly energize an entire organization to do something that’s not just incremental but fundamental.
<< Founder mode >>
Lenny Rachitsky: Exactly.
Gaurav Misra: Founder mode. That’s what we’re getting to, basically.
Lenny Rachitsky: Yeah, ever heard of it. Okay. And then you said that these designers, so I know it’s famous that Snap had no PMs for a long time. Designers were PMs. This point you made about the designers where PME is really important. I think a lot of people look at this, they’re like, “Amazing. We’re just going to hire just designers. We don’t need all these PMs. Slow everything down. Just tell us what not to build.” Can you just talk about the level of these designers? What allowed them to be as successful as they were without any PMs?
Gaurav Misra: Yeah, I mean I think what was expected from the designers now was not just the ability to design, the skill set of designing, which all of them were IC designers by the way. And there were no reports, so they weren’t allowed to have reports actually. And so they were designing everything themselves, but they also had to have the leadership skills to go figure out the roadmap, write all the documents, work with the different teams, figure out shipping schedules and just know everything, not just the technical and the engineering part, but the UX and the UI and the product needs and why are we doing this.
The roadmap, there’s just a ton to keep in mind. And that means that it was a job that was just highly … it was very high workload. No doubt, very high workload. These people work really hard and they were paid highly too. For what it’s worth, they were paid way higher than you would expect designers or PMs or engineers to be paid with quarterly bonuses and all kinds of things.
Lenny Rachitsky: That’s interesting. And it reminds, people always say, “Why do you need PMs?” There’s like someone has to do the work that a PM does. They’re not sitting around doing nothing. And it’s important to note the person that will take on the PME work, they have to be good at it and enjoy it. And a lot of designers don’t want to be doing writing docs and organizing stakeholders and getting alignment and …
Gaurav Misra: 100%, 100%. That’s why it was so hard to find those people who were able to do two things. I actually think there’s an insight in there is innovation between when you’re merging craft right between two different functions. And I do think there’s something special about one person doing two different functions or at least being able to do. And I think a lot of unique insight and innovation can come from that.
I actually think on my personal side, I eventually joined the design team. I started at Snap on the engineering team. I eventually joined the design team over the last two years that I was at Snap. And a big part of what I did there was create this function called design engineering. And that was actually a different combination. It wasn’t the designer PM. It was the designer engineer. The person who can think of the UX design it and also build it and launch it, all of those things.
And we saw both the ability to take designers and teach them engineering and take engineers and teach them design as part of that. Obviously, the reason that we created that function was very different. It was actually to continue innovating as the company got bigger. One of the problems that we identified was that as the company got bigger and bigger and there’s like 500 engineers, 1,000 engineers, 2,000 engineers, 3,000, suddenly it just becomes very difficult to do everything.
Everything is a six-month project or a one-year project. Every product is a massive investment of 500 engineers and a lot of time. And so you really have to pick your bets. If you get it wrong, if you are innovating and trying to create new products and you spend 500 engineers for a year and it doesn’t work, it’s a big problem. You’re going to be in trouble, especially if we’re coming like Snap where everybody was copying what they’re doing so they had to constantly innovate, create new stuff and push the bounds.
I think Evan’s philosophy was always he didn’t fight the things that were getting copied, right? Stories got copied pretty much straight up. A lot of things that Snap created got copied, but he was more of the mindset of like, “Let’s expand the pie, do something new and push the boundaries.” We’ll keep innovating basically. And so to do that with that scale of a company becomes really hard. And so we had this idea of let’s create a small team where we can go and pretest a lot of these ideas because we had a lot of ideas and we can’t go and build all of these things. So the idea was create a small team of these design engineers, people who are able to do the entire product design engineering process in their head and can put together early versions of the product, which we would actually bake into the Snapchat app itself.
And we were able to even test, for example, run a test in Australia, see how it’s performing. Run a test in a couple of high schools, just a couple of high schools, see how people behave. And that way we already have data on how this might perform in a real world environment, but we haven’t built it to production level. It’s a prototype, essentially. It’s how a startup might build something.
The same idea of what we’re doing at our company now, build fast, get it out there, get feedback, understand whether it works or not, and then work with the engineering team to build it at a scale. Once we understand the product and the dynamics, then it makes sense to put on 500 engineers for six months to build it.
So that was a big part of it. I think the nice thing that came out of it that was completely unexpected but actually transformational for me in a way was obviously in big organizations, alignment is a big issue. How do you get everybody on the same page? And a big part of a PM’s job is actually to create alignment and it can be a lot of work because you go talk to all these stakeholders and get them on the same page.
But one of the insights that we had, which was unique was as the company gets bigger, you can actually create alignment by causing internal virality. If there’s enough people in the company, it actually starts acting like a consumer base might. If you share something interesting with someone, they will share it with somebody else because they think it’s interesting and you can actually create virality inside a company.
So one thing that we would do is we would create these prototype products. We would just go into an area, redo a bunch of stuff, create these prototype products that didn’t exist in Snapchat normally, and then we would just share the build and it would explode. It would just go viral inside the company. Day after day we would hear from engineers, then managers, then VPs, then eventually from Evan being like, “Oh my God, everyone’s talking about this. Why am I the last one to hear about it?”
So it would create instant alignment across the company of this is exciting, this is something that we want to get behind. And everyone would be asking, “When are we doing this? When is this happening? I see someone’s already working on it.” So it was a great way to do that. And once we really understood that the product actually had good dynamics and we had tested it, it was a great way to get it out in front of everybody and create this idea of, “Hey, we’re all working on this. This is the future.”
Lenny Rachitsky:
Imagine starting a project at work and your vision is clear. You know exactly who’s doing what and where to find the data that you need to do your part. In fact, you don’t have to waste time searching for anything because everything your team needs from project trackers and OKRs to documents and spreadsheets lives in one tab all in Coda. With Coda’s collaborative all in one workspace, you get the flexibility of docs, the structure of spreadsheets, the power of applications, and the intelligence of AI all in one easy to organize tab.
Like I mentioned earlier, I use Coda every single day. And more than 50,000 teams trust Coda to keep them more aligned and focused. If you’re a startup team looking to increase alignment and agility, Coda can help you move from planning to execution in record time. To try it for yourself, go to coda.io/lenny today and get six months free of the team plan for startups. That’s coda.io/lenny to get started for free and get six months of the team plan. coda.io/lenny.
Another thread I want to follow up on is prototyping. It feels like that is where a lot of PM work is going is getting straight to a prototype versus design or versus PRDs. And it feels like that’s something that you did and worked super well. Basically, it’s a team to prototype ideas that in theory now you can just build really quickly with AI. So I think that’s really interesting, seeing where the feature’s going, just …
Gaurav Misra: 100%. Getting things in people’s hands, trying it out. Oftentimes, unless you truly try it out, in design, it can, in theory, look good with all the perfect conditions, but when you actually use it, you realize it’s actually not that useful, for example. Or when you give it to users. And some of this is intuition, honestly, just like anything else, but there’s nothing like getting something in the hands of users at the end of the day.
Lenny Rachitsky: I love how many of these things you brought over to your current company, and I’m trying to think about, one is this idea of just constantly innovating feels like that’s informed and tell me what I’m missing, but that feels like that’s informed. The ship market will feature every single week. This idea of getting design, starting almost with design versus PM a lot of times. I’m curious why you don’t even go straight to prototype in those cases. Is it just the tools aren’t there yet or?
Gaurav Misra: I mean, I think our shipping process is fast enough that within a week we can get it out anyways. So that way we just get user feedback, which is even better.
Lenny Rachitsky: And then the other really interesting thing, I’m trying to visualize that triangle of a product team, the triad of PM, engineer, design. Feels like you guys at Snap took the corners, not the corners, the line of that triangle. And you have design engineers. You have design PMs. I imagine engineers were PME already. They’re very product oriented PMs. Did you have a function called design PMs? Probably not.
Gaurav Misra: I mean honestly, it’s interesting.
Lenny Rachitsky: Sorry, engineer PMs.
Gaurav Misra: Yeah, engineer PMs should be a thing, I feel like, or every engineer should strive to understand the product, right?
Lenny Rachitsky: Yeah. A lot of companies operate that way. Like Stripe, I think they had hundreds of engineers before they hired the first PM because I think the engineers were doing what they did at Snap to do the PM work. So it feels like at your company you don’t operate that way. It feels like you have PMs, engineers, designers. Talk about why you decided not to approach things that way.
Gaurav Misra: I do think PM is a very valuable function. I think it may be actually, and maybe I’ll get roasted for this, but I think at the end of the day, not hiring PMs at Snap might’ve been one of those decisions where it actually succeeded despite that and because someone needs to do that work. If you don’t have enough people to do it, then nobody truly owns it and then it doesn’t really happen. Or if it doesn’t happen, no one’s responsible, which is not the right structure you want in an organization. So I think though, that being said, there was something unique to be said about what if a designer had the PM mindset.
It’s actually the same idea as what if an engineer had the PM mindset and then you get even crazier. What if the PM had a design and engineering mindset? I think all we’re talking about is everybody truly understanding all the functions that they’re working with. Having a fundamental, broad understanding of the functions they’re working with.
At Captions, we’re actually going even one step further than that. Why shouldn’t the PM understand marketing? I think that’s actually the biggest opportunity for PMs to understand is how do we actually find the users who have this problem? I think that’s a big part of solving the problem. I have a unique take on this in terms of I actually think PMs should own all the way to marketing in a way. And the reason is that if you think about marketing, it’s expanding the surface area of the product, right? It’s like search marketing is just placing a button to your product in Google.
Facebook ads is just placing a button to your app in Facebook. It’s almost like you work at Facebook. You work at Facebook, you have a button in the app somewhere, you make a specific thing and people show up. The funnel begins there and you have all the metrics all the way from the beginning, all the way from when the user tapped on the button in Facebook and then they went down all the steps and then they landed on some onboarding screen and they did the thing, they used the application.
That’s where the journey begins. And all of that is, in a way, it’s a product. It’s the same skillset. Understanding users from that point on is I think that’s fundamental. How do we not do that today? We should be. So that’s how we think about stuff. But I think the core idea is that every function should understand every other function deeply as much as possible and maybe even to the level where they can operate in that function. And that just increases the likelihood that all decisions being made in the company at the micro level will be optimized for all possible parts of the funnel that different people are essentially looking after. That’s something we think about quite a bit.
Lenny Rachitsky: I completely agree with that take. It’s interesting that at Airbnb, Brian was famous for changing the titles of all product managers to product marketing manager for exactly this point because he’s like-
Gaurav Misra: Makes sense.
Lenny Rachitsky: … you should be doing the marketing, you shouldn’t just be building the thing. And to me, I’ve always assumed as a PM, your job is for this thing to grow and to get adopted and be loved.
Gaurav Misra: Of course.
Lenny Rachitsky: So it’s interesting people don’t already think of it that way.
Gaurav Misra: I agree.
Lenny Rachitsky: But obviously, it’s hard to learn the skills of being awesome and paid growth and SEO and product marketing, messaging, positioning, but I completely agree. That’s such an important element of building a product. You’re not just building a thing, hope it works. Goodbye. So I love that that’s how you think about it. And so I guess when you hire PMs, it sounds like you look for marketing instinct and some experiences.
Gaurav Misra: 100%. And at least the ability and instinct to be able to learn it.
Lenny Rachitsky: Yeah. Okay, so I’m going to share one other thing that I thought as you were talking that I think is really interesting and it comes up a bunch on this podcast and this connects back to Ev and what we can learn from his success. So Patrick Olson once tweeted this tweet that has really stuck with me, which is it was around user research and the way he described it is user research isn’t go to user research that informs what you build and then you build that. It’s instead you do user research, it informs the mental model you have as a leader, a product builder of what your customers need and what pains they have, and you adjust that model in your head and then that’s how you decide what to build. And it feels like Ev is very much that. His head was learning what people need, teens in particular, and it just worked.
Gaurav Misra: Yeah, I think it’s very spot on. I would say though Snap didn’t like user research as a function for the longest time. I think there was one user researcher in the company until, again, 5,000 employees, the post IPO basically. But I think the people that were making a lot of the product decisions and the CEO himself, of course, were very steeped in how the user behaves and how they operate. They understood that.
I do think Snap also had a unique way of thinking about how to determine if a product is within scope or out of scope of what their mission was. And I think a lot of companies use the cyber framework and we try to as well, but essentially, the idea at the core was that they want to enable private sharing in a safe way. So I think that makes it clear that certain things just are out of scope for Snap.
It’s actually one of the reasons why Snap wasn’t the company to discover “short form video,” TikTok style stuff because it was just against the nature of the company to even try something. It was against the mission of the company. Public sharing means possibly bullying and bad behaviors, which is exactly what Snap was trying to avoid. We don’t want those behaviors to develop on the app. So for example, on Instagram stories, you can share somebody else’s stories to your followers. I can take your story and share it to my followers. You can’t do that on Snap.
And there was a discussion about should we do this? No, because it can enable bullying. Essentially, you’re not consenting to your thing being shared to my followers and that’s essentially bad. So a lot of it was done based on this type of pillar-based thinking of this is our mission, this is what we’re trying to do, does it fit within or is it outside? If it’s outside, we don’t do it no matter what the cost of it is, no matter how exciting it is.
And even on Spotlight, the big challenge was like how do you take something like that and put that inside the Snap mission? So that was something we worked on quite a bit. Yeah, I mean I think there’s tons of stories about earlier versions. I mean Snap almost had essentially what is TikTok earlier than TikTok existed and it died out because it didn’t align with the mission essentially, but happy to get into it.
Lenny Rachitsky: Yeah, that actually would be really interesting, because interesting that these things are important. It’s important to have these clear values in the mission of the company and to not focus on things that are outside that. And then you hear these stories of they had TikTok potentially. So yeah, whatever you can share there, that’d be awesome.
Gaurav Misra: Yeah, I mean, I don’t know if you remember this, but there was this product called Our Stories, and essentially it was MyStory, but it was a public story. And it started off with this idea of campus stories where you can post to your campus and other people can see it. And that actually started creating a lot of virality because essentially people would post. There was viral moments truly where people would post stuff like, “Oh, I think two people fell in love on it or something like that.” Those types of things really went viral and it had really good engagement.
But at the end of the day, the problem was that we were against algorithmic essentially ranking of those types of things. So there was a curation team that was looking through every single one so that there’s no negative behaviors happening essentially on the app. That was just not scalable. Even though it had really high engagement and was doing well, it just wasn’t feasible to have a person looking at every single thing posted to determine whether it’s appropriate or not.
It ended up dying out, but it looked like what was an early version of TikTok before it had launched. So I think in a way though it was a good thing because I think Snap does have a mission and I think it is solving a problem. I do think there is a bifurcation of social media at this point. There is what you traditionally think of as social networking where you share things with your friends. And by the way, remember the days where that used to be the way that apps would go viral. You would share things with your friends and then they would share with their friends and everybody was worried about friend sharing and how do you send to a friend and can I text message my friend or whatever.
That time is over. Virality now happens through a completely different mechanism. It happens through essentially algorithms that are deciding whether your piece of content is worth showing to an arbitrary number of people, and this is the new age of social media. It’s TikTok, it’s YouTube Shorts and Instagram Reels and so on. And I think actually it’s changing the fundamental nature of how people interact, fundamental nature of how things go viral. And I actually think from a regulatory perspective, we should be thinking these as differently.
On one side, you have something where you’re deciding who sees something and then on the other side, you have something where the company is deciding which means that it’s semi curated, right? It’s actually the company’s voice. So yeah, I don’t know, should Section 230 apply to that? I have no idea. Or maybe not. Maybe we’re thinking about this the wrong way, so it should be interesting.
Lenny Rachitsky: Wow. All right. Well, I’m out of my depth on the legality decision, so I’m going to not follow that thread, but I imagine there’s something really interesting there actually. So you’ve been talking about just how much things are changing and I just wanted to follow that thread and specifically, you guys are at the cutting edge of what is possible with AI video.
Gaurav Misra: Yes.
Lenny Rachitsky: It feels like we’re approaching and maybe we’re there. This world where you have no idea if it’s real or AI. I’m curious, first of all, just how far you think we are from that and second of all, the implications on the world where you can just generate any video you want.
Gaurav Misra: It’s fundamental. At the end of the day, a time where video images, audio can’t be trusted actually hasn’t existed for a while. If you think about … I mean there was a world in the 1800s where there was no video or audio or images and everything was proven by he said, she said for the most part. And it’s possible that if everything can be generated and anything can be created and it looks just as real as if it were real and there’s no way to tell, then we might actually return to that world where there’s no way to prove anything besides physical evidence or he said, she said.
And I think that’s scary, but also possibly opens a bunch of new opportunity for someone to figure out how to solve this problem. I think it’s going to be a big problem. I do think today, we are almost there in terms of creating absolutely photorealistic video. I mean the very recent models, a very cutting edge is just about … It feels like a few centimeters away from achieving it, but I do think to fully get there to the point where it cannot be differentiated at all, it’s still a couple of years away.
I also think that it is use case driven in a way. I think thinking about Captions for a second, we take a unique view on what type of video we want to focus on. Video generation and text to video generation. If you look at it today, it’s all silent video. There’s no audio and it’s often what you think of as stop video or B-roll, right? You can actually make a movie with B-roll. And a lot of a movie or a TV show or a social media post or an ad actually is dialogue or monologue. That’s actually what it is is people talking to each other, to the camera, interacting. That’s actually what makes true story.
B-roll is supportive elements that are showing up to set the scene or something like maybe before the scene opens, you see a few shots of New York City or LA or something, and then you jump into the room and now two people are talking. So our goal is to solve the talking video problem. How do we create video where people are delivering dialogue or monologue or things like that? And that’s what we focus on purely. And there actually isn’t a lot of work happening in that area today and it’s not a solved problem. We’re getting there, we’re getting closer and closer, but today’s models actually bifurcate a little bit.
So there’s a set of companies today that are able to create these types of what we’re talking about is avatar videos. They’re using this technology called neural rendering. It’s actually not a technology that’s affected by the transformer and diffusion model revolution or the large model revolution, essentially. This is a technology that existed separately and it doesn’t have anything to do with the AI growth happening right now. It just happens to produce semi-realistic outputs, but it actually stops at some point because it’s not clear how it becomes generalizable in every situation.
It has to be trained on people individually. So you might ingest a little bit of video of you and then you can generate you. And so it’s a different technology and a different outcome, essentially. And a bunch of companies using this type of model, a bunch of companies are doing general text to video with no audio today. These are large generative models and they have the capability to do more, but that frontier just hasn’t been reached yet. I think there’s no doubt in anybody’s mind on the research side that it is 100% solvable. It’s just like somebody has to go do it and we haven’t gotten there yet. Nobody has had the time to go and do that yet. So that’s where we’re at, essentially.
We’re working purely on large generative models for talking videos. So that’s our core focus. I do think though, from a safety perspective, we have a unique framework or how we think about it. So generally videos divide into two categories. So for us, we think on one side of what is documentation, so this is the type of video that it could be a personal video where you’re taking a video with your friends and you’re hanging out, you’re at a restaurant. It’s documenting what happened. You had fun, whatever it was, it’s for your memories. And there’s a non-personal version of this which is like, oh, it’s like a reporter documenting a crime or something that happened or whatever it is and who was involved, where was it? Maybe it was a natural disaster or something, and this is for history. We want to see what happened.
And there’s actually no benefit to AI-generated video in any of this. Actually, all of this, it’s just negative. It’s all negative. If we are generating fake versions of reality to fool people, there’s just nothing good about that. And we want to stay away from that, essentially. We want to design products and build products that make it difficult to use for that particular use case, for anything that falls within that. And on the other side, you have what we think of as storytelling.
Now this could be ads, it could be social media posts, it could be TV, movies. All of these things are storytelling. They’re designed for entertainment, they’re designed for fun. And nobody believes if you watch a Geico commercial, you’re not thinking that the gecko is real selling insurance somewhere out there. You know that this is fabricated and it’s for entertainment. And same with reality TV even, right? It’s called reality TV. It’s definitely not reality and social media, ads, all this stuff falls in the category.
And if we can enable more people to tell stories and entertain other people and get their message out there, that is pure positive. This is where we want to focus. And a lot of our effort in the product and design process goes into how do we design products and build products that specifically make it really hard to use on one side and really easy to use on the other side. And that’s the real challenge.
Lenny Rachitsky: That’s really helpful. Something that I’m really curious about as you’re chatting is ByteDance just released a really amazing model. I was actually just looking at it where you put a photo in, I think, and it just creates a video of this person talking in all these different ways. Where does that fall amongst the buckets you just described?
Gaurav Misra: I think that falls exactly in the area that we’re in, which is talking people and that’s what they’re going after as well there. So that’s actually one of the first examples of a large model that a larger company has released where it’s able to do these dialogue or monologue videos. And I mean you yourself, you’ve seen it, so I’m not going to describe it too much, but as you know, it’s highly expressive. It doesn’t look like an avatar video. It looks like …
Lenny Rachitsky: Yeah, it’s wild.
Gaurav Misra: And that’s because of the technology that’s used is fundamentally different. It’s just like this is using a true large diffusion model is what they use. Whereas most companies that are working on avatar technology are actually using something pretty basic in comparison.
Lenny Rachitsky: How long has it been since that Will Smith spaghetti video? Just to give us a reference of how fast things are moving?
Gaurav Misra: Oh my god, it’s been so fast, right?
Lenny Rachitsky: I think it’s a year.
Gaurav Misra: Amazing.
Lenny Rachitsky: Or is it like two years?
Gaurav Misra: I think it’s probably about a year and a half, two years. Right?
Lenny Rachitsky: Wow. We’ll link to that video and then you could tell basically that video is the state of the art of AI video one to two years ago. And then we’ll link to this other Omni something. I forget what it’s called. I’m just showing what it’s like today.
Geez, Louise. Okay, final question and this is around something that I know you have a really interesting insight on, which is that you see marketing using AI video basically as the final frontier of how people will experience AI is marketing, is seeing it in marketing channels. Talk about why you think that’s the case and just what that looks like.
Gaurav Misra: It comes back to what we were talking about before where the reality is that no matter how interesting, advanced and amazing a technology is, science fiction has become reality. We were talking about this. What was literally science fiction on TV is real now and most people still don’t even know about it, to be honest.
My parents live in India and they are the only ones in the neighborhood that know about ChatGPT and they write these amazing notes to the community just with all these words. And people are just like, “How did you get so good at writing?” And they’re not telling anybody, but there’s still a ton of people who don’t even know that these advancements have happened. And so adaption is actually much slower, even for the most exciting things. Of course, in tech circles, everybody’s talking about it, but the reality is it takes a while to get out there.
And I think for companies that are going to succeed, they’re going to have to figure out how to market these products so that they can be the ones to reach all these people that have the problems that they’re now able to solve. And we think about that every day. So on that note, as a consumer product, we spend a bunch of time and money on marketing our products, and we often use performance channels and all kinds of things, but about a year ago, we would run AI video in ads and things like that, and we would get all these comments of people being like, “Oh my God, this is so fake. Don’t show me this.”
And around that time, the technology got just about good enough that suddenly, those comments stopped happening and suddenly, you could get performance that was even better than actually recording with a person because you could just try more things. You could just generate 30, 40 possibilities and one of them would win and it would win more than the one creative you can get from a person. And more interestingly, when you think about localization, you’re going to go do that in every language. Once you discover winning creative, now you have to go localize that in every market and rebuild it from scratch.
It’s just a ton. And oftentimes it doesn’t perform as well because it’s been rethought essentially. But we found that just translating it with AI was able to get performance almost as good as the original, in the original language. So this is going to fly to the entire market. I think wherever there’s dollars to be made, saved, it is inevitable. It will be consumed and it will very quickly be a lot of social media.
I mean, you could imagine a social network of the future where, and this is dystopian by the way, so watch out. You could imagine a social network of the future where all content is generated. None of the people are real. I mean, the algorithm isn’t tailoring whose content to show you, but it’s purely generating content that is completely catered to you, with people and everything completely catered to you. I don’t think it’s out of the question. It almost seems inevitable in a way, but that’s not too far away, I think. That’s actually very possibly real in five years or something like that.
Lenny Rachitsky: What I’m imagining, because it’s hard to imagine a social network where it’s people because usually we want to know who these people are. I don’t care random sharing status updates, but I can see a TikTok that is all AI.
Gaurav Misra: Exactly, exactly.
Lenny Rachitsky: Wow, just content tuned to your loves and interests.
Gaurav Misra: Exactly.
Lenny Rachitsky: And just random videos. Wow.
Gaurav Misra: Yep. Because do you know, you see a TikTok feed, you don’t even know who’s real or not today, right? It’s not like we-
Lenny Rachitsky: Right. That’s how I would approach it. I would just join TikTok and start uploading videos that are AI generated.
Gaurav Misra: Exactly.
Lenny Rachitsky: And then build a whole network of that. Oh my god, the future is wild.
Let’s go to failure corner. Something that I try to do with this podcast is share moments where things didn’t go well. There’s all these stories of everything’s going great all the time. All this foundries killing it, building a billion-dollar company. Oh, so awesome. But they don’t know all the things that go wrong. So let me ask you, is there a story you can share of when things didn’t work out? When you failed?
Gaurav Misra: At the beginning of the company, we actually had a bunch of time where we spent figuring out what we wanted to do, and I think it’s an unconventional story almost in a way because we started off the company, the first thing we did was build the Captions app. We launched the app. That was the first thing we did. Took two days to build it. We put it out there and it immediately took off. It was absolutely shocking because I built it on a weekend. We put it out there, I called my co-founder on Monday. I’m like, “It’s at the top of the app store. We’re getting like 600 videos a day.”
Lenny Rachitsky: Top of the app store, holy shit.
Gaurav Misra: And we didn’t do anything to enable that. It just happened on its own. It was almost anti-climactic in a way because we thought it would be a lot more time spent figuring out the product before that would happen. And so it felt like, “Wait, this can’t be it, right? It can’t be this fast. How did this happen?” So we got distracted because of that, because we were like, “Oh, okay. Well, maybe … This is cool. It’ll work. That’s great, but we got to figure out what the product is.”
And so we spent at least a year, year and a half thinking about building social networks and all kinds of things when we should have been working on Captions because there was product market fit there. And how we figured that out is Captions was sitting on my personal account, so I wasn’t checking that a lot. About a year and a half into the company, as we were working on other projects and stuff, I went back to my personal account, just opened it, and I saw that there was $500,000 in there.
I looked at a chart and it was just growing. The revenue was just growing completely on its own. No employees, no releases, no bug fixes, no customer support. There was like 2,000 open support tickets that were unanswered for a year and a half, and great reviews. It’s just going completely on its own. And so that was a clear sign to me. It was like, “Oh my God, you should have been working on that. That product works.”
And so we immediately had a meeting. I mean, it was tough to figure out what the right path was at that point because we’d invested so much time in other things as well, but reset, and we got back on the track with Captions, and literally as soon as we started releasing the first features into it, it blew up. What looked like a vertical line at that time became a horizontal line, and the new vertical line was so vertical that the old vertical line became a horizontal line, essentially. And it’s continued since then, which is crazy. So we basically wasted about a year and a half.
Lenny Rachitsky: I love that new way of thinking about a hockey stick moment where not only is it going vertical, but the rest of the chart is now just flat along the bottom of the axis.
Gaurav Misra: Exactly. Yeah.
Lenny Rachitsky: For people that may not know what Captions is, I try to describe it at the beginning and we’ll link to it and stuff, but basically, the reason you thought it was nothing is it just adds captions to a video that you record.
Gaurav Misra: It does.
Lenny Rachitsky: Added captions.
Gaurav Misra: Exactly, yep. So I think we wanted … Our thought was we’re going to build a social network, but first we got to build a creation tool for the social network. And we knew that we wanted to use AI to create video, and it seemed obvious that, “Oh, speech to text, a solved problem, we should start with that.” So that’s why we decided to start with Captions because it was a solved problem at the time. What was funny is that once GPT and stuff started coming out, a lot of the things that were unsolved became solved very quickly. So timing was almost perfect.
Lenny Rachitsky: And that aligns to something you shared earlier. Just so many of these problems that were not yet solved are now possible, and the companies that are in the right place at the right time benefit greatly who’ve been just waiting for this part.
The other thing that I think is interesting about that story is you try to build a social network. I think it was around high schools and things like that. As we’ve seen, it’s very difficult to build a new social network. So let me just get your sense. Do you think it’s possible for somebody to come around and build a new, the next Facebook, the next Snap, the next whatever?
Gaurav Misra: I think it’s definitely possible. I do think … Let me tell you something crazy, actually. The social network that we had at the time, we actually remove it from the app store, so it’s not available anymore. But til today, there are people, there are thousands of people that are using it, posting on it, and all the different things, which actually speaks to the power of the social network in a way. It is hard to create and hard to kill. I mean, I think X is actually a great example of that too. A lot of movement happened there and it continues to work, I guess somehow. So testament to that.
Lenny Rachitsky: The power of network effects, especially someone once described this so well, they’re like Twitter/X. They changed the brand, they changed the team building it. They changed the URL. Like everything changed about it except the network effect of the people in it.
Gaurav Misra: It’s true. It’s true.
Lenny Rachitsky: I just saw a story that they’re making billions of dollars. He’s actually turned it around. It’s actually becoming a really profitable company.
Gaurav Misra: Wow.
Lenny Rachitsky: Yeah. It just came out the other day. So Elon did it. Well, with that, we’ve reached our very exciting lightning round. Are you ready?
Gaurav Misra: I’m ready. Let’s do it.
Lenny Rachitsky: What are two or three books that you have recommended most to other people?
Gaurav Misra: I have to say here that I actually don’t read books. It’s actually something that I decided on purpose where I decided I don’t want to build my skill in reading, and I want to build it in listening and watching instead, because I think that’s the future.
Lenny Rachitsky: I love how intentional that is, and I love how it’s a really cool way of saying, I don’t read books. The future isn’t reading, but I love that you have books behind you, so [inaudible 01:20:58].
Gaurav Misra: I do. Yeah.
Lenny Rachitsky: [inaudible 01:20:59]
Gaurav Misra: [inaudible 01:21:00] didn’t read, they’re back there.
Lenny Rachitsky: That’s funny. Okay, cool. I want to ask more questions, but I’m going to keep going. Lightning around. Speaking of watching and listening, do you have a favorite recent movie or TV show you’ve really enjoyed?
Gaurav Misra: I like Silo and Severance. I mean, obviously, I think everyone’s watching these. There’s a book around Silo too.
Lenny Rachitsky: I read that. I read all of them. There’s three of them.
Gaurav Misra: There are.
Lenny Rachitsky: It sucks to watch the show because you know all the tricks that are about to happen, and I’m just like, “Why am I watching this? I know where this will go.
Gaurav Misra: Yeah. I mean, for what it’s worth, it does seem like the show is going on a slightly different path.
Lenny Rachitsky: It is. That was also what annoyed me. Just like, “What the heck? This is made up. All is made up shit.” I don’t like that when I watch the show. So two reasons I’m not watching it but [inaudible 01:21:40].
Gaurav Misra: Don’t worry about it. I didn’t actually read the book. My wife read the book and then she told me the story.
Lenny Rachitsky: Okay, okay. I was worried. I was worried. Okay, cool, and Severance. Okay, great. I love Severance.
Next question. Do you have a favorite product you’ve recently discovered that you really like?
Gaurav Misra: My favorite product, honestly, is Linear. I’m not going to lie, just because it’s so well-designed and it’s so easy to use. I also like Superhuman. I mean, these are obvious answers, but I do use these things every day and it’s hard to create products that you use every day and don’t hate. So props for them.
Lenny Rachitsky: Cool. I haven’t announced this on the podcast yet, but this is a good time, whoever’s listening right now, is I just launched a bundle where if you become a paid subscriber to my newsletter, you get, listen to this, a year free of Linear and Superhuman and Notion and Granola, which is incredible AI app for note-taking and Perplexity, Perplexity Pro.
Gaurav Misra: Ooh. Nice.
Lenny Rachitsky: $2,000 in value for the price of my newsletter, 200 bucks, going to that.
Gaurav Misra: Damn, that’s real value.
Lenny Rachitsky: It’s an unbelievable deal, and it’s a no-brainer at this point to buy a subscription, but this isn’t an ad for my newsletter. I’ll keep going.
Next question. Do you have a favorite life motto that you often find yourself coming back to, sharing with friends and family in work or in life?
Gaurav Misra: I actually learned this because someone else told me that I keep repeating this thing, but I have this framework of how I want to operate at work, basically. Right? I think I love to compete and to win at the end of the day. And I think that to win, you have to be the best. But I also think the easiest way to be the best is to be the first, and that actually is key.
Lenny Rachitsky: And so is the motto the easiest way? Is that the-
Gaurav Misra: That’s it. The easiest way to be the best is to be first.
Lenny Rachitsky: Be the first. Interesting. Okay. I have to resist following threads here because I want to make this lightning round.
Okay. Final question, just for fun. What’s the coolest, most wild AI video you’ve seen recently? Is there one that comes to mind of like, “Wow, that was something”?
Gaurav Misra: I mean, honestly, I got to say the OmniHuman stuff was pretty cool.
Lenny Rachitsky: The ByteDance video that we talked about.
Gaurav Misra: Yeah, exactly. I mean, the broccoli talking. I don’t know if you saw that one. There was a little broccoli delivering a little speech.
Lenny Rachitsky: Interesting.
Gaurav Misra: Yeah, it looked like it was animated by an animator.
Lenny Rachitsky: Just imagine being a kid these days and just seeing stuff like that.
Gaurav Misra: I think you’re probably just used to it, right? You’re just like, this is just normal.
Lenny Rachitsky: Mm-hmm. It’s just like we were saying, AGI is just going to come around.
Gaurav Misra: Exactly.
Lenny Rachitsky: All right. Cool. What’s for dinner? Cool. That’s great.
Gaurav Misra: Yep.
Lenny Rachitsky: Amazing. Gaurav, this was incredible. It was so insightful on so many levels. Two final questions. Where can folks find you and what you’re building if they want to learn more? And then how can listeners be useful to you?
Gaurav Misra: Awesome. Yeah, I mean, definitely find me on LinkedIn. That’s where I live most of the time. My DMs are open, et cetera, et cetera. So feel free to send me a message. And I think what’ll be useful, I mean, we’re building out our early product and design team, so if AI video is interesting, if consumer apps are interesting, now’s the time to join. We’re really small, early, we work together across the team, so there’s going to be no better time to join, basically.
Lenny Rachitsky: And you get to ship a marketable feature every week.
Gaurav Misra: Exactly. I mean, that’s the PM’s dream. Think about it. Right?
Lenny Rachitsky: The PM’s dream. Yeah, I like that that’s a filter. The people that get excited about that, great fit. The people that are stressed out by that, not the place to be.
Gaurav Misra: Exactly.
Lenny Rachitsky: So awesome. All right. Gaurav, thank you so much for being here.
Gaurav Misra: No, thank you. Appreciate it.
Lenny Rachitsky: Bye everyone.
Thank you so much for listening. If you found this valuable, you can subscribe to the show on Apple Podcasts, Spotify, or your favorite podcast app. Also, please consider giving us a rating or leaving a review as that really helps other listeners find the podcast. You can find all past episodes or learn more about the show at lennyspodcast.com. See you in the next episode.
Reformatted by reformat_english_direct.py
如何在 AI 时代取胜:每周发布一个功能、接受技术债、无情削减范围,并创造竞争对手无法复制的魔法 | Gaurav Misra(Captions CEO 兼联合创始人)
访谈记录
创业的黄金时代
Gaurav Misra: 很少有这样的时刻,有这么多可能性。哪怕五七年前,创业都非常困难。每件事都感觉已经被人做了,别人已经在做了。突然之间,现在这个时代——说实话我职业生涯中从未经历过——你尝试的每件事都能跑通。
Lenny Rachitsky: 人们不断听到各种新鲜事发生。你有没有找到什么工具、流程或方法来帮助保持专注?
Gaurav Misra: 我们的工程目标是,每个工程师每周都应发布一个可推向市场的产品。
Lenny Rachitsky: 我太喜欢这句话了,听起来太疯狂了。那你们如何维持质量,让一切保持协调一致?
创业公司就该主动承担技术债务
Gaurav Misra: 我其实认为,作为一家创业公司,你的工作就是承担技术债务(technical debt),因为这就是你比大公司跑得更快的方式。大公司不承担技术债务,他们通常会立刻偿还,或者正在偿还当年还是创业公司时留下的技术债务。
Lenny Rachitsky: 在你们的运营方式和产品构建方法中,还有什么你觉得特别独特和有趣的?
公开路线图与秘密路线图
Gaurav Misra: 我们有一个所谓的公开路线图(roadmap)。基本上就是用户向我们提出的需求。我们有各种触点接收用户反馈,但这些功能每个竞争对手都知道。如果一个用户向我们提出某个需求,他也会向所有人提出。
这并不会在竞争中成为制胜关键。所以我们还有第二个路线图,我们称之为秘密路线图。
嘉宾介绍:Gaurav Misra
Lenny Rachitsky: 今天的嘉宾是 Gaurav Misra。Gaurav 是 Snap 的早期员工,在那里他领导了设计工程团队,他会在对话中详细解释。他还在微软和其他几家公司担任过工程师。最近,他是 Captions 的联合创始人兼 CEO——这是目前最成功、最前沿的消费级 AI 产品之一,让你可以用 AI 生成和编辑口播视频。他们拥有超过一千万用户,融资超过一亿美元。
在我们的对话中,我们基本上做了一次考古,探究一家现代 AI 导向的创业公司是如何运作的——包括他们公司的每位工程师每周如何发布一个可推向市场的产品或功能;为什么他们除了常规路线图之外还有一份秘密路线图。我们还深入讨论了 Snap 的产品团队是如何运作的;他对于打造成功的消费级和社交应用需要什么条件的理解;为什么他们没有产品经理(PM),以及为什么设计师主导一切——这到底是不是个好主意。另外,在一个 AI 视频逼真到真假难辨的世界里会发生什么。这期节目适合所有在 AI 之上构建产品的人。如果你喜欢这档播客,别忘了在你最喜欢的播客应用中订阅和关注。
好了,下面有请 Gaurav Misra。
(广告段落已跳过)
一切皆有可能的时代
Lenny Rachitsky: Gaurav,非常感谢你来到这里,欢迎来到播客。
Gaurav Misra: 谢谢,感谢邀请,很激动。
Lenny Rachitsky: 我很少在播客上邀请早期创业者,但我想和你聊聊,因为你正处于当下很多构建者最关心的领域的核心——AI、视频,还有消费级和社交应用。还有病毒式传播和寻找新的营销渠道。所以我认为人们可以从你做产品的方式、你构建产品的方式,以及你对未来趋势的思考中学到很多。再次感谢你来到这里。
Gaurav Misra: 感谢。说实话,这是一个激动人心的时代。我得说,很少有这样一个时代,有这么多可能性。在正常时期,你回想一下,哪怕五七年前,创业都非常难。想出一个创意都很难。感觉每件事都已经被做了,别人已经在做了。或者就像,这个已经试过三次、失败了三次。突然之间,现在这个时代,说实话我职业生涯中从未经历过——你尝试的每件事都能跑通。
可能性太多了,世界上的人手根本不够。说实话,能做的事情比能做这些事的人还多。这真的是极其罕见的事情。而且说实话,这不会永远持续下去。我们最终会追上这个阶段,但能参与其中真的很幸运。太棒了。
技术已就位,注意力成为稀缺资源
Lenny Rachitsky: 你说一切都行得通,我觉得一个重要的区分是:构建工具这件事确实行得通了。现在的技术已经能够构建很多以前不可能实现的东西。而越来越难的是——我想听听你的看法——是让别人注意到你的产品并留下来。因为构建东西变得如此容易,一切都很酷、很有趣,让人们关注并坚持使用你的产品变得更难了。
所以我想问,你在这一点上有什么心得吗?你已经打造了多款成功的产品——我们后面会聊到 Snap 和你现在做的事情——关于在当今这个环境下,你需要思考什么才能让人注意到你的产品,然后留下来。
Gaurav Misra: 说实话,这个观点非常好。确实,现在有很多炒作,而部分炒作也正是驱动许多公司增长的原因。从用户获取和营销的角度来看,放在五七年前,如果你做出了一个新东西,跑去跟用户说”我们有更好的产品”,人们的反应通常是”行吧,谁都说自己有更好的产品,我不在乎”。但今天——虽然这可能不是你该采用的方式——你可以直接说”我们用 AI 重新设计了这个东西”,然后就会有一堆人问”怎么做的?“或者”也许我应该试试看。”
他们真的会去试。当然,你必须兑现承诺。如果兑现不了,人们来了,玩一圈就走了。但如果你真的能兑现承诺,就有大规模获取用户的绝佳机会。所以我觉得情况稍微有些不同。我不知道这能持续多久,但从这个角度来看,现在确实是一个不一样的时代。同时我也认为,做产品的核心还是解决问题。很多人会陷入这样一种心态——“这很酷,人们会为酷买单”。人们确实会来,说”让我看看,挺酷的。”
但归根结底,如果你只是建了一个游乐场,人们在里面玩了一圈就离开了,这不叫生意。所以我觉得这一点仍然至关重要——你必须解决真正的问题。
Lenny Rachitsky: 我们聊着聊着,我就在想,现在每天都会出现一些放在几年前能上一年头条的东西。天哪,这居然变成可能的了。而现在,这种事每天都在发生,然后大家的反应就是”好吧”。我有时候会想,有一天我们会迎来 AGI 或超级智能,所有人都会说”哇,太厉害了”,然后紧接着说”好吧,晚饭吃什么?”
Gaurav Misra: 这不是已经在发生了吗?想想看——我有时候会自我反思——你看过《钢铁侠》之类的电影,里面有 J.A.R.V.I.S.,还有《星际穿越》里有 TARS。人和这些东西之间有来有回地对话、碰撞想法。那是科幻,名副其实的科幻。好吧,现在还不够完美,但它已经以谁都无法想象的方式存在了。科幻变成了现实,但我觉得没人在乎。
在某种程度上,你本来会以为世界会被颠覆,但感觉一切发生得那么缓慢,人们的反应也很平淡。是的,普及确实在进行,但我觉得这某种程度上是一个令人震惊的进展。
保持用户持续关注的方法
Lenny Rachitsky: 感觉你们在让用户持续记住你们、不断激发大家的热情方面做得很好。因为正如你所说,现在发生的事情太多了。你是怎么让人们持续觉得”哦,他们在做的东西确实很有意思,而且一直很有意思”的?
关于如何保持在用户心中的存在感、不断把人们拉回来、一次又一次地重新激发他们的兴趣,你有什么经验吗?
Gaurav Misra: 当然。说实话,归根结底不应该是为了 AI 而 AI,也不是为了兴奋、炒作或新奇而 AI,而是要打造真正有效的 AI——解决实际问题的 AI。基本面没有变。做产品本质上分三步:找到一个用户问题,用某种技术去解决它,然后找到有这个问题的人群。如果你能同时做好这三件事,在任何环境下都能做出好产品。但我觉得现在的不同之处在于,技术端变化如此之快,你能做出以前根本不可能做出的产品,解决以前根本不可能解决的问题。而这正是机遇所在。
对我们来说,尤其是在视频领域,机会真是无穷无尽。我们才刚刚开始。我们在视频领域的目标不是打造专业工具——我们根本不是为专业人士打造的。我们面向的是那些以前无法制作视频的人。他们没有工具、没有技能、没有条件去制作视频,而现在他们可以了,因为他们能跨越技能差距或时间差距。也许他们是企业主,没有时间,想要直接出结果,老实说,这个领域有太多问题需要解决。
Lenny Rachitsky: 解决人们的问题。说起来容易做起来难,但这是一个很好的提醒。归根结底,这才是最重要的。我一直在想的是,像你这样角色的人,如何不被各种信息淹没?你怎么知道该关注什么?你怎么保持专注?
对于那些每天看到新公告就觉得”我该怎么办?该做什么?信息太多了”的人,你有什么建议吗?
优先级排序:用传播力验证用户需求
Gaurav Misra: 在某种程度上,这是产品开发的新问题。你能走的路太多了,想法太多,能做的事太多。当然,优先级排序一直都是一项重要的能力,过去也是如此,但现在变得更加关键了,因为你必须弄清楚不该关注什么。我们的一般框架是寻找用户需求,而检验用户需求最简单的方法就是看什么东西有传播力。
通常来说,有传播力的东西、人们愿意分享和讨论的东西,其核心一定有真正有趣的东西。当然,这个”有趣”不一定能持续——也许只是一个一次性用例,也许不是人们会反复使用的功能,也许不能支撑一个订阅制业务——但很多时候,其中确实有某些核心元素打动了人们。如果你能识别出这个核心,然后把它塑造成适合你业务的形式,那这其实是一个很好的发现路径,帮你判断什么真正有效。而且我们现在有这些工具,不需要真正构建什么,你只需要把它讲出来,人们就会分享、传播这个想法。
你甚至可以在构建任何东西之前,就衡量这个产品可能被接受的程度。这是我们用来排优先级的好方法。我们花很多时间在社交媒体上。我们的产品本身就经常被用于社交媒体,所以很多员工会花大量时间在社交媒体上。我们观察趋势动态、正在发生什么,据此就能比较准确地判断什么东西可能引起用户共鸣。
Lenny Rachitsky: 那么作为一个公司领导者,面对团队里的人每天都在听说各种新鲜事,你有没有找到什么工具、流程或方法,帮助大家保持专注、不被每个闪亮的新东西分散注意力,真正把东西做出来?
每周交付可营销的功能
Gaurav Misra: 说实话,这在某种程度上就是关于增量迭代。我们的目标是每周都发布产品。我们的工程目标是每位工程师每周都应该交付一个可营销的产品。所谓可营销的产品,就是你可以展示给用户看、用户可能仅仅因为这个功能就愿意订阅或付费、或者就是冲着这个功能来用你的产品的产品。这就是为什么基本配置型的功能——比如说我们在做文字处理器之类的——如果你有自动格式化,或者像两端对齐这样的基本功能,没有人会专门为了两端对齐而来用你的文字处理器。你可以宣传它,因为它确实存在,但如果你做了一些独一无二的、别人没有做过的东西,你就可以拿去展示,人们会仅仅为了那个功能而来用你的产品。
即使你的应用缺少很多显而易见的功能——也许它连两端对齐都没有——人们也会为了使用这些你在构建和推广的新工具、新能力而跳过这些缺失。所以我们努力让每位工程师每周交付一个可营销的功能,其中很多可能不奏效,但很多确实有效,我们显然能判断出哪些方向值得投入更多精力——那些开始见效的,我们就加倍投入,继续构建。人们经常会抱怨,因为想想看,在一周的交付周期里,做出来的东西是不完整的,是真正的 MVP。
我们对它进行极致的裁切。我们拿到设计方案,然后一砍再砍,直到再砍下去就没法用了为止。然后把这个版本发布出去,用户就会进来。如果一切顺利,尽管可能存在各种问题,用户还是会使用它,然后人们会抱怨,我们就有了问题清单,就知道接下来该做什么。这本质上是一个起点。只要我们保持每周发布一个,就能产生大量的功能、产品和方向的发布量,砍掉其中大部分,剩余的再逐步扩展。所以这个方法非常有效,也能让大家保持专注。
Lenny Rachitsky: 我喜欢这种简洁性。我想对很多公司来说这听起来很疯狂。每位工程师每周交付一个可营销的功能或产品。
Gaurav Misra: 是的。
Lenny Rachitsky: 有些听众听到这个会感到压力很大,也有些听众会觉得,这正是我想要的工作方式,每家公司都应该这样做。
Gaurav Misra: 对。
在精简范围而非牺牲质量中求速度
Lenny Rachitsky: 那你如何保持质量、让一切保持一致?我想这是最大的取舍。对于想要尝试这种工作方式的人,有什么诀窍吗?
Gaurav Misra: 质量在大多数时候是不能妥协的。确实存在策略性的质量妥协,但大多数时候你要守住一条质量底线——用户进来使用功能时,它应该是能正常工作的,当然。而在时间上节省的方式——我认为这是人们经常犯的一个错误——当时间受到压缩时,很多时候工程师、产品经理、设计师会削减质量而不是削减范围(scope)。实际上你是可以削减范围的。我们使用的方法是,审视每一个需要花时间构建的元素,然后问:如果把这个去掉,产品还有用吗?
然后不断重复这个过程,直到再删就真的没用了为止。这就成了那一周的项目。这个方法确实非常有效。它能帮你缩小到真正核心的东西。比如,假设我们想在视频上添加图片之类的功能——这是一个很基础的例子,我刚编的。你可能会设想一个设计,用户从相册导入图片,但在图片进入视频之前,你可能想去掉背景、调整色相和饱和度之类的。你可能会让设计师把所有这些功能都设计出来,但你很快就会意识到,这些都可以砍掉。
可以砍掉背景处理,可以砍掉色相饱和度调整。你真正需要的只是选择图片。然后可能需要一个选择器,带图片库的,支持多种类型。如果用户想从云端拉取呢?想从网盘拉取呢?把这些全部砍掉。最终回归到核心——就是用系统原生的图片选择器,从相机直接选,放进视频里,没有额外的 UI。这本身就已经应该是有用的了。如果这都没用,那在此基础上构建的任何东西也同样没用。这就是我们的做法。
Lenny Rachitsky: 最后这句话非常关键。这就是核心思想——在大量投入之前先交付小的迭代功能,看看这条路是否行得通,是否值得花几周时间去做。
Gaurav Misra: 完全同意。我觉得这个方法最妙的地方在于,用户第一次进来使用这个功能、导入图片之后,他们最先抱怨的就是最困扰他们的东西——是色相饱和度?是背景去除?还是从云端选取?你会得到关于那个问题最多的抱怨。
人们会很直接地说,“这也太差了,连背景去除都没有,这算什么图片功能?“你接收这个反馈,然后下周就能在一周内把用户抱怨的那些东西全部交付出来。
Lenny Rachitsky: 然后他们就会说,哇,这个团队发布速度太疯狂了。
Gaurav Misra: 是的,没错。
Lenny Rachitsky: 把我所有问题都解决了,响应太快了。这其实指向了一个常见的产品市场契合(product-market fit)信号——当人们在抱怨某个东西时,意味着他们真的在乎到愿意抱怨的程度,如果他们在抱怨什么,这其实是一个很好的信号。
Gaurav Misra: 确实如此,确实如此。如果没人抱怨,反而几乎是危险信号。
路线图似乎无限
Lenny Rachitsky: 这很多内容已经变成了一场对现代产品团队和创业公司的”考古挖掘”。所以我想继续深挖。这不是我原计划的方向,但太精彩了。我特别喜欢”每位工程师每周交付一个可营销的功能”这个做法,它直接呼应了我开始这段对话时的问题——你如何在喧嚣中保持领先?
答案的一部分就是持续不断地发布东西,持续给人们带来惊喜。“看,这是一个全新的视频功能。""看看这个东西。”
Gaurav Misra: 没错,我认为这确实是关键所在。而且有足够的领域和空间让这一切发生。在正常时期,可能不太可能这么快地创建那么密集的路线图,但因为这一切底层有如此多的创新,确实有这个空间可用。路线图几乎是无限的,真的是这样。
长期项目与后端工作
Lenny Rachitsky: 好。我想大家还会好奇的另一个问题是,那些需要很多周的长期项目怎么处理?另外还有基础设施,我想还有后端的东西。所以请回答这些问题——你如何看待长期项目,以及如何处理那些不是用户会关心的功能的后端工作?
Gaurav Misra: 是的。通常我们会单独为此留出时间。比如,对我们来说 Q4 通常是基础设施季度,我们就专门去搭建各种基础设施。Q4 的时候,我们已经交付了大量的产品和功能,对这一年的其余部分感觉相当不错,节奏也在放缓。加上假期也快到了。所以我们会把所有时间都花在偿还技术债务上。
我其实觉得这里有一个关于技术债务的独特的思考角度。作为一家创业公司,你的工作就是承担技术债务,因为这就是你能比大公司运作得更快的办法。大公司不会主动承担技术债务,它们通常会在第一时间偿还。或者它们正在偿还的还是自己当年作为创业公司时累积的大量技术债务。我在 Snap 工作的时候就见过很多这样的例子,我相信每家公司都是如此。
我们思考的方式是,这是不是一个我们今天需要解决的问题?还是一个可以留给第 50 个工程师、第 100 个工程师或第 500 个工程师来解决的问题?如果是一个未来的工程师可以解决的问题,我们就应该利用那个”未来的工程师”。本质上我们就是这么做的——我们把这个推给未来的某个人。而且说句实在话,如果公司失败了,那个工程师永远不会被招进来,这一切也就无关紧要了。所以这在很多方面就像金融债务。金融债务是为了创造杠杆而承担的,它可以是一件好事——比如买房,你承担了债务,就可以买到可能不用债务就买不起的东西。
道理是一样的。通过有策略地承担技术债务,你可以用你现有的小团队创造出本不可能构建的产品。这其实是非常积极的。
Lenny Rachitsky: 哇,这个想法真的很妙。我想到的是,那个未来的工程师可能就是一个 AI agent 工程师。
Gaurav Misra: 完全正确,没错。
Lenny Rachitsky: 就是来解决问题,专门帮你清技术债务的那种。
Gaurav Misra: 没错。某个未来的工程师——很多年后的第 500 号工程师——会因为这个大问题获得晋升,而这个问题正是那些糟糕的早期工程师制造出来的。
Lenny Rachitsky: 所以很显然,这里有一条界限。你能承担的债务是有限的,过多就会变成大问题。关于这个平衡——多少算太多,以及判断是否足够让一个新工程师来接手的门槛——你有什么想法吗?
Gaurav Misra: 是的,我觉得通常的经验法则是,你承担的每一笔债务都要支付利息。如果你承担了某笔债务,每天大概会有 1% 或 2% 的时间被消耗在维护 bug、处理问题、重启、崩溃等因快速实现而产生的各种状况上。因为你用了快的方式,就一定会出问题。每一天,1% 的时间就这么没了。如果你承担了足够多的债务,你会支付 80% 甚至 90% 的利息,然后就没有任何时间做新的事情了,全部精力都在还利息。
这时候你就会进入那种状态:我们只是在维持运转,没有工程师能做任何新东西,只是在维持运转。这就是创业公司的失败模式。所以在某种程度上,你有一个技术债务的跑道。一旦耗尽——一旦你承担了过多的债务——而在那段时间里又没有交付足够的价值,没有足够的价值来招聘工程师去支付利息或者直接偿还债务,你就会陷入困境。
Lenny Rachitsky: 我很喜欢这个说法。这是一个非常好的启发式方法,帮助思考什么时候该在某件事上投入。我不想在这个话题上走太远,但我确实有个想法——因为有时候你需要做一些重大的技术决策,这些决策会影响未来所有东西的构建方式。我猜这些决策你们会花更多时间,也会更严肃地对待。
Gaurav Misra: 当然。我的看法是,只要是双向门(two-way door)的情况,你怎么做都行。这是一个经典的方法论。但如果是单向门,那就值得深思熟虑,至少要认真做好——做到与这个单向门在未来对你的重要程度相匹配。
AI 工具与工程效率
Lenny Rachitsky: 你们的工程师在多大程度上使用 Cursor 和类似的工具来构建产品?AI 对你们团队的加速作用有多大?
Gaurav Misra: 百分之百,每个人都在用。非常有帮助。说实话,连我自己都在用。它确实是团队的一个巨大乘数效应,毫无疑问。
Lenny Rachitsky: 具体是 Cursor 吗?你们还发现了其他有用的工具吗?
Gaurav Misra: 是的,我们在用 Cursor。我们试过所有不同的工具。我们也在用 Devin,它是另一个——更高级一些吧,可以帮你解决 bug。
Lenny Rachitsky: 对,Devin 基本上是——我记得是每月 500 美元,就像一个 AI 工程师,你在 Slack 里跟它对话就行。
Gaurav Misra: 没错。在某种程度上,这就是我们作为创业公司能做而大公司做不到的事情。大公司不能直接引入 Devin,他们得先召集 30 个律师坐在一起讨论才行。
Lenny Rachitsky: 而且它们都叫 Devin,这些都是 agent。未来每家公司都会有几百个 Devin 在工作。
Gaurav Misra: 没错,你可以有多个 Devin。我其实听说还可以有一个管理 Devin 的管理者,专门管理这些 Devin。
Lenny Rachitsky: 我觉得管理层被一层层地剥离这件事太有意思了,然后他们又会有 AI 管理者。这就是终极的偷梁换柱。
Gaurav Misra: 是的。
产品开发流程的独特之处
Lenny Rachitsky: 好。在你运营和构建产品的方式上,或者你搭建产品开发体系的方式上,还有没有什么你觉得非常独特和有趣、其他人可以借鉴的地方?
Gaurav Misra: 我们的流程在这方面确实有点特别。我们有设计团队,有 PM 团队。这些团队目前都还处于早期阶段。当然,我们还有工程团队。我们有各种不同的触点:iOS、Android、Web。还有后端团队、机器学习团队、研究团队。所以通常来说,当我们开发产品时,可能会先采用 PM 主导的方式——找到某个我们想要解决的总体问题,或者某个我们想要开拓的新领域、新支柱,然后从中创建产品规格。
但很多时候我们也会反过来。我们会先设计东西,甚至完全不知道我们在做什么、为什么做,但会设计一堆不同的方案,然后和 PM 坐在一起,逐一审视这些设计,一个接一个地看,直到从中发现有趣的东西和想法。
Gaurav Misra: 而很多时候,这让我们发现了那些如果只盯着指标和数据就根本不会发现的东西。所以这几乎是在某种程度上反转了流程,从设计出发,但往往能因此找到真正独特的创意。
公开路线图与秘密路线图
我也觉得我们在创建路线图的方式上有独特的做法。通常来说你只有一个路线图,而我们实际上把它分成了两个不同的路线图。一个是我们所说的公开路线图,基本上就是用户向我们提出的需求。我们通过各种触点收集用户反馈,审视所有这些反馈,人们会要求各种功能:我想要背景移除,我想要撤销和重做,我想要上传更长的视频,诸如此类,一大堆不同的功能。
我们就把这些列成一个清单。和所有事情一样,我们会进行优先级排序,看它影响多少用户,潜在市场有多大,然后基本上一个一个地去完成。
但这些都是每个竞争对手都知道的功能。这些都是公开的。如果用户向我们提出,他们也会向所有人提出。每个团队基本上都有差不多的清单,每个人都在做优先级排序。当然,你可以在优先级排序上做得更精一点,或者在执行上赢一点优势,但这不会成为战胜竞争对手的决定性因素。
所以我们有第二个路线图,我们称之为秘密路线图。这个路线图上没有任何人提出过任何需求——真的是字面意义上的,从来没有人要求过这些东西。
如果把上面的某个东西展示给用户看,他们可能会说:“我不需要这个。我不知道这是什么。“但凭借我们独特的视角,对问题领域、用户空间和技术的独特理解,我们想出了一些特殊的创意,我们认为这些创意将彻底革命性地改变某些东西的使用方式,真正改变用户的行为。我认为这本质上就是——人们一直用一种方式做事,如果我们能向他们展示另一种方式,一旦他们尝试了,就再也回不去了。这就是产品,这就是成功。而这些就是我们会放到秘密路线图上的那种创意。这些东西我们从不在公开场合谈论,从不告诉任何人,直接发布出来给用户用,然后看效果。
其中很多创意来自头脑风暴。我们确实会做季度头脑风暴,全公司范围的,所有人都参加——不只是产品团队的事,工程团队、招聘团队、所有人都会参与进来。当然市场团队也会参加,所有人都会提出创意,我们对创意进行投票、排名,然后产品团队接手,思考可行性、技术实现,以及各种不同的可能性。这样一来,我们就能把所有人接收到的那些信息汇总起来——大家都在刷社交媒体,看到各种爆火的内容、各种模型和进展——我们可以把这些信息全部整合,形成一个独特的内部路线图:面对所有这些正在发生的进步,我们该如何切入并创造价值。
这基本上就是我们的方法论。而且很多时候,最大的胜利会来自秘密路线图。那些才是改变游戏规则的东西。通常不会是用户请求带来这些。
Lenny Rachitsky: 我太喜欢”秘密路线图”这个叫法了,光是这个名字就让它变得更加有趣了。[听不清]
Gaurav Misra: 没错,是的。这是个秘密。
Lenny Rachitsky: 是个秘密。我甚至不会问你秘密路线图上有什么。你不能告诉我。能举个来自秘密路线图的功能例子吗?哪个对你们来说影响很大?
秘密路线图的案例
Gaurav Misra: 很多。我给你举个很久以前的例子。应用最初走红后,我们添加的第一批 AI 功能之一是一个叫 eye contact 的功能。这个功能是这样的:当你在录制视频时,很多刚入门的人可能会照着脚本或提词器读,而这些东西可能在屏幕外面,所以看起来你像在读东西,从视频本身或观众的角度来说效果不太好。所以我们做了这个功能,基本上可以让你的眼神看起来像是在看镜头。
实际上我们是第一家做这个功能的公司。我们和英伟达(Nvidia)合作开发的。这其实很有意思,因为当初我们联系英伟达提出这个需求时,他们并不确定我们为什么需要这个。但他们其实很开放地把技术给了我们,并且对某种合作方式感到兴奋——怎么把这项技术变成真正有用的东西。
但我们看到了这个创作者使用场景,这是独特的。这是头脑风暴中涌现的创意之一,我们把它放到了路线图上,然后发布了。效果非常成功。说实话,我们拍的那则展示这个功能的社交媒体帖子传播力极强,基本上被翻译成了世界上的每一种语言。直到今天它仍然能获得数百万的浏览量。我们不断发现人们二次发布、再转发,每次都能获得几百万几百万的浏览量,因为人们觉得:“哇,这个创意太棒了。”
而现在它已经被大量模仿了,基本上你能想到的每个应用都有了。当然这是有道理的。但那就是从秘密路线图出来的创意之一。
AI 辅助头脑风暴
Lenny Rachitsky: 你谈到了你们是如何想出这些秘密路线图创意的。我对这个确实很感兴趣,我想在这上面多花一点时间。你们团队有没有用 AI 大语言模型来辅助头脑风暴?我能想象未来会朝那个方向发展——你真正地与 AI 协作,AI 代理和你一起头脑风暴。
Gaurav Misra: 说实话,我希望它能朝那个方向发展。但目前还没有走到那一步。我们没有完全那样做,因为问题在于上下文。我觉得对用户、对使用场景的理解,这个上下文太抽象了。即使是现在,我觉得我当然理解我们的用户,但我没法确切地说清楚为什么、怎么理解的——有点抽象。我花很多时间和 RPM 以及设计师们在一起,把我理解到的、这么多年做这个产品学到的任何东西传授给他们。但这就很有挑战性,因为我连自己都没法把它说清楚。所以更大的挑战是:我怎么把这些上下文提取出来?怎么让它对一个大语言模型可用,当我自己都没法把它精确地用语言表达出来的时候。说实话,这可能是我自己的感受,我也需要改进这一点,但这里面确实有些道理。
我确实记得在 Snap 的时候,我觉得 Snap 和 CEO Evan Spiegel 最独特的一点就是他对用户有着无与伦比的理解。公司存在了很多很多年,将近十年,没有人像他那样理解用户。他会提出所有人都不同意的创意,然后我们发布出来,结果一个个都命中,连续命中。没有人理解为什么。所有人都会排着队说:“太棒了。“为大家鼓掌,但没人知道为什么。
Snap 作为相机公司的洞察
Gaurav Misra: 一个很好的例子是,很多东西也是后来才想明白的。我记得 Snap 曾在某个时点宣布自己是一家相机公司,很多人笑了,说”相机?我们是在做数码相机之类的吗?“或者,“为什么是一家相机公司?”
但我认为其核心在于这样一个理念:Snapchat 打开就是相机,这实际上就是差异化所在。那个小小的决策实际上支撑了整个公司抵御所有竞争——因为当你朋友正在做什么有趣的事、那个瞬间稍纵即逝需要捕捉的时候,你不会打开 Instagram 或其他任何应用,因为它们不会直接打开相机。你会打开 Snapchat,因为你可以立刻捕捉下来。而 Instagram 永远无法复制这一点,因为一旦他们那样做,他们所有的指标都会下降。这就是一种根本性的理解。实际上我也是很后来才想明白这件事的,但这确实是一个非常有力的想法。
为什么关注 Snap
Lenny Rachitsky: 很高兴你谈到了 Snap。这确实是我想要聊的方向,我本来就想从这里开始的。所以我很高兴我们回到了你在 Snap 的经历。我对这个感兴趣是因为,如果你想想社交网络,Snap 基本上是最后一个成功发布并存活下来的社交网络了——除了 TikTok,但我不认为它是一个社交网络,我觉得它只是一个内容平台,你并不是真的在和别人互动。Snap 是 2011 年发布的,所以距离上一个成功的社交网络发布已经过去了大约 15 年。
而且我觉得有趣的是,外界很少能看到 Snap 内部到底是怎么运作的。你很早就加入了,你在 Snap 地位很重要,做了很多非常重要的功能。所以我想在这里多花一些时间。感觉你从 Snap 学到的很多东西现在都带到了你自己的公司。所以我想问,也许你已经回答了这个问题,但我很好奇是否有其他补充——宽泛地讲,除了 Evan 的大脑之外,你认为 Snap 成为一个成功的消费类社交产品的核心是什么?
Snap 的产品市场契合与持续创新
Gaurav Misra: 有几件事做得很好。我确实认为对于 Snapchat 这样的公司或社交网络来说,核心的产品市场契合可以极其强大。基本上,用户下载它的原因、它传播的方式、分发的方式、邀请朋友或发送 Snap 的方式等等——这种产品市场契合有时候可以强到让你很难真正构建东西,因为你其实分辨不出来你正在做的东西到底是促成增长的功臣,还是在帮倒忙——产品只是在顶着你的操作增长。
而我认为正因为如此,它有时候反而教会了人们错误的东西。它让人们以为他们做的那个逆向操作是对的,但实际上它是错的,公司只是顶着它增长而已。我认为 Snap 做得好的地方,也确实需要做的,就是持续创新。
因为对于 Snap 这样的公司来说,它面临着海量的竞争。社交网络本质上是垄断性的,Facebook 或其他任何社交网络都有很多理由去遏制 Snapchat 的增长。他们确实试过了,非常非常努力地试过了。而 Snap 抵御这一切的方式就是创新。
Snap 独特的设计主导架构
我认为核心在于他们的组织架构,那是非常独特的,我从未见过类似的东西。我在好几家不同的公司工作过,但很显然那里有一位 CEO,而这位 CEO 非常以产品为导向,他自己就是设计师出身,但他身边围绕着一个设计团队。那是公司的核心团队。设计团队大概十到十二个人。基本上非常小,即使公司有五六千名员工时,它还是那么小。
Lenny Rachitsky: 哇,六千名员工的时候,设计团队你说多少人来着,五六个人?
Gaurav Misra: 十到十二个人。
Lenny Rachitsky: 十到十二个。而且补充一下,很长一段时间里基本上没有 PM。那是在那之前的事。
Gaurav Misra: 很长一段时间里,是的。
Lenny Rachitsky: 这是很大的不同。
Gaurav Misra: 最初完全没有 PM。PM 是随着商业化引入的。一旦商业化成为一个重要元素,PM 才出现了。今天,我觉得公司有大量的,或者说有合适数量的 PM 遍布各处,但在很长一段时间里,尤其是在创新正在发生的那段时期,PM 的数量要少得多得多,而且是非常以设计师为主导的。但同时,我觉得这样说稍微有些误导,因为这些可不是一般意义上的设计师。
这些设计师实际上同时也是 PM。这就是秘诀所在。他们不仅能做设计,还能承担 PM 的职责,这是一项很大的责任。工作量非常大,尤其是在那样规模的公司里,但这给了 CEO 一种方式,可以精细地控制每个时刻应用中每个部分到底发布了什么。
因为他可以和十到十二个人开会,就知道每一个影响用户的变更。当然也有很多正在进行的基础设施类的变更,后端一直在做的各种事情,比如改善排序、性能之类的。那些通常不是他关注的重点。他关注的是我们在哪里添加了什么 UI。而在公司里,如果你需要给应用添加 UI,你需要设计。如果公司里除了少数几个直接和 CEO 沟通的设计师之外没有别的设计师,你就创造了一种对产品发布的精细化控制。所以一切都必须经过 Evan 批准。没有他批准的东西,就上不了线。设计团队因此在其中拥有很大的权力。
消费产品中的单一决策者
Lenny Rachitsky: 这太精彩了。所以我听下来部分感受是——我不知道这是否完全正确,但感觉是对的——要做一个成功的、能脱颖而出的消费应用,你几乎需要一个始终深入每个细节的单一决策大脑。Evan 的做法就是和基本上在运营产品的设计团队保持非常紧密的关系。
Gaurav Misra: 非常对,确实如此。而且他能够把整个应用的上下文同时装在脑子里。他知道各个部分之间的相互依赖关系,知道我们在做什么以及为什么做。这给了他对公司产品路线图非常精细的控制力。
Lenny Rachitsky: 这让我想到 Brian Chesky 和 Airbnb——Airbnb 是一个消费产品,不是社交网络,但我怀疑这是否是消费产品的一个有趣洞见:如果有一个人拥有正确的经验与洞察组合,并且持续运营、把控每一个细节,消费产品通常会做得更好。
Gaurav Misra: 肯定是的。还有就是推动变革的能力,真正激发整个组织去做不只是渐进式的、而是根本性改变的能力。
Lenny Rachitsky: 没错。
Gaurav Misra: Founder mode。基本上我们聊到的就是这个。
Snap 设计师的超高要求
Lenny Rachitsky: 听说过。那你提到这些设计师——我知道 Snap 长期没有 PM 这件事很有名,设计师就是 PM。你刚才讲到设计师的那些观点,PM 的职能确实很重要。我想很多人看到这个会想,“太棒了,我们就只招设计师,不需要那些 PM。PM 什么都拖慢,就告诉我们别做什么就行了。” 你能谈谈这些设计师的水平吗?是什么让他们在没有 PM 的情况下依然如此成功?
Gaurav Misra: 我觉得当时对设计师的期望,已经不仅仅是设计能力、设计这项技能本身了。顺便说一下,他们全都是 IC(独立贡献者)设计师。而且不允许带下属,实际上他们是没有直属汇报对象的。所以他们所有东西都得自己亲手设计,同时还得具备领导力——自己去规划路线图,撰写所有文档,与不同团队协作,确定发布节奏,什么都得了解:不仅是技术和工程部分,还包括 UX、UI、产品需求,以及我们为什么要做这件事。
路线图本身就有大量东西需要统筹。这意味着这份工作负荷极高——毫无疑问,工作负荷非常高。这些人工作非常拼,薪酬也很高。值得一提的是,他们的薪酬远超你对设计师、PM 或工程师的一般预期,还有季度奖金之类的各种福利。
Lenny Rachitsky: 有意思。这也让人想起,大家总说”为什么需要 PM?“但总得有人去做 PM 做的那些事。他们不是在那无所事事。而且很关键的一点是,承担 PM 工作的那个人,必须擅长这件事并且享受它。而很多设计师并不想写文档、协调利益相关者、推动共识……
Gaurav Misra: 百分之百同意。这也正是为什么找到能同时做好两件事的人如此困难。我其实认为这里有一个洞察:当你在两个不同职能之间融合专业技艺时,创新就在其中产生。我真的觉得,一个人同时承担两个不同职能——或至少有能力这样做——会带来一些特别的东西。我认为大量独特的洞察和创新正源于此。
设计工程:设计师与工程师的融合
就我个人而言,我后来转到了设计团队。我刚加入 Snap 时在工程团队,在 Snap 的最后两年我转到了设计团队。我在那里做的一件很重要的事,是创建了一个叫做”设计工程”(design engineering)的职能。这其实是另一种组合——不是设计师加 PM,而是设计师加工程师。一个人能思考 UX、做出设计,还能把它构建出来并上线,全部搞定。
我们在其中既尝试过教设计师学工程,也尝试过教工程师学设计。当然,我们创建这个职能的原因很不一样。它实际上是为了在公司变大之后仍然保持创新。我们发现的一个问题是,随着公司越来越大——500 名工程师、1000 名、2000 名、3000 名——突然之间所有事情都变得极其困难。
所有事情都变成了六个月或一年的项目。每个产品都是 500 名工程师加大量时间的巨大投入。所以你必须非常谨慎地下注。如果你判断失误——你在创新、尝试做新产品,投入了 500 名工程师一年时间,结果没做成——那就是个大问题,你会陷入困境。尤其对 Snap 这种情况来说,所有人都在抄袭他们的东西,所以他们必须不断推出新东西、突破边界。
Evan 的理念一向是,他不跟那些被抄袭的东西较劲。Stories 被几乎原封不动地抄了,Snap 创造的很多东西都被抄了,但他更倾向于”把蛋糕做大,做新的东西,突破边界”。就是持续创新。但在公司达到那个规模后,要做到这一点变得非常难。于是我们有了这个想法:创建一个小团队,去提前测试大量想法,因为我们有很多想法,不可能把它们全都做出来。思路就是创建一个由设计工程师组成的小团队——这些人能把整个产品设计工程流程装在脑子里,能做出产品的早期版本,而且我们会直接把它嵌入到 Snapchat 应用里。
我们甚至可以进行测试——比如在澳大利亚试运行,看它的表现;或者在几所高中测试,就几所学校,观察用户行为。这样我们就能获得真实世界环境下产品表现的数据,但并不需要把它做到生产级别。它本质上是一个原型。它的做法和创业公司构建产品的方式一样。
内部传播:用原型驱动组织对齐
这和我们公司现在做的事情思路一样——快速构建、推出去、获取反馈、判断是否可行,然后与工程团队合作做规模化开发。等我们理解了产品和它的运作机制后,再投入 500 名工程师花六个月去构建,这才说得通。
这其中有很大一部分意义。我觉得有一个意外收获,对我来说甚至是颠覆性的——大家都知道,在大型组织中,对齐是个大问题。怎么让所有人步调一致?PM 很大一部分工作其实就是创造对齐,而这可能工作量极大,因为你要找所有利益相关者谈,把他们拉到同一页上。
但我们有一个独特的洞察:随着公司变大,你实际上可以通过制造内部传播力来实现对齐。如果公司里人数足够多,它其实会像一个消费者群体那样运转。如果你把某个有趣的东西分享给某个人,他会觉得有意思再分享给下一个人——你真的可以在公司内部制造传播力。
所以我们做的就是:创建这些原型产品。我们直接进入某个区域,改一堆东西,造出 Snapchat 里本来不存在的原型产品,然后把那个构建版本分享出去——它就炸了。在公司内部直接病毒式传播。一天接一天,先是工程师来找我们,然后是经理,然后是 VP,最后 Evan 都会说:“天哪,所有人都在聊这个,为什么我是最后一个知道的?”
这就瞬间在整个公司创造了对齐——“这个东西很令人兴奋,我们应该全力投入。“所有人都在问:“我们什么时候做这个?什么时候上线?我看到已经有人在做了。“这是一个非常棒的方式。而且等我们真正理解了产品确实有好的数据表现、也测试过了之后,这也是把它展示给所有人、建立”我们都在做这件事,这就是未来”这种共识的绝佳途径。
原型与快速验证
Lenny Rachitsky: 我想顺着原型这个话题再聊一下。感觉 PM 的工作正在往这个方向走——直接做原型,而不是先做设计或者写 PRD。感觉这正是你做过、而且效果非常好的事情。本质上就是组建一个团队来把想法原型化,现在理论上用 AI 很快就能搭出来。所以我觉得这特别有意思,看到这个功能的发展方向……
Gaurav Misra: 百分之百同意。要把东西放到人们手里让他们试用。很多时候,除非你真的上手试过,否则在设计中理论上看起来一切完美,但当你真正使用的时候,才会发现它其实没那么有用,比如这样。或者当你把它交给用户的时候。这其中有一部分靠直觉,说实话,就像其他事情一样,但归根结底,没有什么比把东西交到用户手上更好的了。
产品团队三角:PM、工程师、设计的融合
Lenny Rachitsky: 我很喜欢你把这么多经验带到了现在的公司。我在想,其中一点就是持续创新的感觉——这似乎影响了,你告诉我我有没有遗漏——但感觉这就是你们每周都发布新功能的原因。还有从设计开始而不是从 PM 开始的思路。我很好奇,那些情况下你为什么不直接跳到原型?是因为工具还不够成熟吗?
Gaurav Misra: 我的意思是,我们的发布流程已经足够快,一周之内就能上线,所以我们直接拿到用户反馈,那比原型更好。
Lenny Rachitsky: 还有一件特别有意思的事。我试着在脑海中想象产品团队那个三角形——PM、工程师、设计的三人组。感觉你们在 Snap 把三角形的三条边都占了。你们有设计工程师,有设计 PM——不对,工程师应该已经是那种产品导向很强的了吧。你们有一个叫”设计 PM”的职能吗?应该没有。
Gaurav Misra: 说实话,这很有意思。
Lenny Rachitsky: 抱歉,我说的是工程师 PM。
Gaurav Misra: 对,我觉得工程师 PM 应该成为一个正式角色,或者说每个工程师都应该努力去理解产品,对吧?
Lenny Rachitsky: 对,很多公司就是这么运作的。比如 Stripe,我印象中他们在招第一个 PM 之前就有几百号工程师了,因为我觉得那些工程师就是在做你们在 Snap 做的事——承担 PM 的工作。所以感觉你的公司不是这么运作的,你们有 PM、有工程师、有设计师。聊聊你为什么决定不采用那种方式。
Gaurav Misra: 我确实认为 PM 是一个非常有价值的职能。我觉得——也许我会因此被喷——但说到底,Snap 不招 PM 这个决定,可能属于那种实际上尽管如此仍然成功了的情况,因为总得有人来做这份工作。如果你没有足够的人来做,那就没有人真正拥有它的所有权,事情也就不会真正发生。或者就算没发生,也没人负责,这不是一个组织该有的结构。但话虽如此,如果设计师拥有 PM 的思维方式,确实有一些独特的价值。
其实这和”如果工程师拥有 PM 的思维方式”是同一个道理,然后你还可以更疯狂一点——如果 PM 拥有设计和工程的思维呢?我认为我们谈论的核心就是:每个人都应该真正理解与自己协作的所有职能,对自己合作的领域有一种基础性的、广泛的理解。
PM 应该理解营销
Gaurav Misra: 在 Captions,我们其实又往前走了一步。为什么 PM 不应该懂营销?我觉得这其实是 PM 最应该去理解的东西——我们怎么找到那些有这个问题的用户?我认为这是解决问题的一个很大部分。我对此有一个独特的看法——我其实认为 PM 应该一直管到营销。原因是,如果你想想营销是什么,它就是在扩展产品的触点。比如搜索营销,无非就是在 Google 上放一个跳转到你产品的按钮。
Facebook 广告就是在 Facebook 里放一个跳转到你应用的按钮。这几乎就像你在 Facebook 工作一样——你在 Facebook 工作,你在应用里某个地方放一个按钮,做了一个特定的东西,用户就来了。漏斗从那里就开始了,你拥有从最开头开始的所有的指标——从用户在 Facebook 上点了那个按钮开始,然后走过所有步骤,最后落在某个 Onboarding 页面上,完成了操作,使用了应用。
旅程从那里就开始了。而所有这一切,在某种意义上,它就是产品。它是同一套技能。从那个点开始理解用户,我认为这是根本性的。我们为什么现在不这样做?我们应该这样做。这就是我们思考问题的方式。但我认为核心理念是:每个职能都应该尽可能深入地理解其他每一个职能,甚至达到能在那个职能中运作的程度。这样就能提高公司里所有微观决策对所有漏斗环节进行优化的可能性,因为不同的人各自关注着不同的部分。这是我们非常重视的事情。
Lenny Rachitsky: 我完全同意这个观点。有意思的是,在 Airbnb,Brian Chesky 曾把所有产品经理的 title 都改成了产品营销经理,正是因为这个原因——因为他说——
Gaurav Misra: 说得通。
Lenny Rachitsky: 你应该做营销,而不应该只是构建东西。对我来说,我一直认为作为 PM,你的工作就是让这个东西增长、被采用、被喜爱。
Gaurav Misra: 当然。
Lenny Rachitsky: 所以有意思的是,人们并没有已经这样去想。
Gaurav Misra: 我同意。
Lenny Rachitsky: 但显然,要学好付费增长、SEO、产品营销、信息传达、定位这些技能是很难的,但我完全同意。这是构建产品中极其重要的一环。你不是光构建一个东西、希望它好用、然后说再见。所以我很喜欢你是这样思考的。那我想当你招 PM 的时候,你应该是会看中营销直觉和一些相关经验。
Gaurav Misra: 百分之百。至少要有学习和掌握它的能力和直觉。
用户研究与 Ev 的产品哲学
Lenny Rachitsky: 好,我再分享一个你刚才说话时我想到的东西,我觉得很有意思,而且在这个播客上经常被提及,这也和 Ev 的成功经验能关联起来。Patrick Olson 曾经发过一条推文,让我一直印象深刻。他谈的是用户研究,他的说法是:用户研究不是”去做用户研究,研究结果告诉你该做什么,然后你就去做那个东西”。而是你做用户研究,它塑造的是你作为领导者、作为产品构建者心中的心智模型——关于用户需要什么、有什么痛点,你不断调整头脑中的那个模型,然后基于这个模型来决定要构建什么。我觉得 Ev 非常符合这个模式。他的大脑一直在学习人们——尤其是青少年——需要什么,然后就这么奏效了。
Gaurav Misra: 对,我觉得说得很准确。不过我要补充一点,Snap 在很长一段时间里其实并不认可用户研究这个职能。公司大概到五千人、上市之后,才只有一位用户研究员。但做产品决策的很多人,当然也包括 CEO 本人,对用户的行为方式和操作习惯是非常沉浸其中的,他们确实理解这些。
Snap 的使命边界
我觉得 Snap 还有一种独特的思维方式来判断一个产品是否在使命范围之内。很多公司使用 cyber 框架,我们也尝试这样做,但 Snap 的核心理念是:他们想以安全的方式实现私密分享。这就让很多事情很明确地落在 Snap 的范围之外。
实际上,这也是为什么 Snap 没有成为发现”短视频”——TikTok 那类东西——的公司,因为这本身就违背了公司的天性,甚至违背了公司的使命。公开分享意味着可能产生欺凌和不良行为,而这恰恰是 Snap 一直想避免的。我们不希望这些行为在应用中滋长。比如说,在 Instagram Stories 上,你可以把别人的 Story 分享给自己的粉丝,我可以把你的 Story 转发给我的粉丝。但 Snap 上你做不到这一点。
当时确实讨论过要不要做这个功能?结论是不做,因为它会助长欺凌。本质上,你的内容被分享给我的粉丝,你并没有同意,这本身就是不好的。所以很多决策都是基于这种支柱式思维:这是我们的使命,这是我们想做的事,它是否符合范围?如果不符合,不管代价多大、多诱人,我们都不做。
即便是 Spotlight,最大的挑战也是:如何把那样一种产品形态纳入 Snap 的使命框架之内?这是我们花了大量精力的事情。我觉得关于早期版本有太多故事了。Snap 在 TikTok 出现之前几乎就已经有了 TikTok 本质上的东西,但因为不契合使命,最终消亡了。如果感兴趣我可以展开讲讲。
Lenny Rachitsky: 好,那确实很有意思,因为这些东西确实很重要。拥有清晰的价值观和公司使命、不去碰范围之外的东西,这些都很重要。然后你听到这些故事——他们曾经有可能做出 TikTok。所以你愿意分享的话就太好了。
Snap 错过的”短视频”
Gaurav Misra: 不知道你还记不记得,当时有一个产品叫 Our Stories,本质上是 My Story 的公开版本。它最初的想法是校园故事——你可以发布到自己的校园,其他人可以看到。这确实开始带来了很多传播力,因为人们会发布内容,真的出现了病毒式时刻,比如”哦,有两个人在上面相爱了”之类的。这类内容确实会疯传,参与度也非常高。
但最终的问题在于,我们本质上反对用算法来对这类内容进行排序。所以有一个内容审核团队会人工查看每一条内容,确保应用内不会出现负面行为。但这根本无法规模化。尽管参与度很高、表现很好,让一个人逐条审核判断是否合适,就是不可行的。
它最终消亡了,但它看起来就是 TikTok 上线之前的一个早期版本。不过我觉得从某种角度来说这也是件好事,因为 Snap 确实有自己的使命,也确实在解决一个问题。
社交媒体的分叉
我认为当前的社交媒体确实出现了分叉。一种是你传统认知中的社交网络,你在里面和朋友分享东西。顺便说一句,还记得那个应用通过这种方式疯传的时代吗?你和朋友分享,他们再和自己的朋友分享,所有人都在担心好友分享的问题——怎么发给好友、能不能给朋友发短信之类的。
那个时代已经结束了。现在的传播力通过一个完全不同的机制运作。它本质上是通过算法来决定你的内容是否值得展示给任意数量的人,这是社交媒体的新纪元。就是 TikTok、YouTube Shorts、Instagram Reels 等等。我认为这实际上正在改变人们互动方式的根本性质,也改变了内容疯传的根本机制。而且我觉得从监管的角度来看,我们应该把它们视为不同的东西。
一边是你自己决定谁来看某个内容,另一边是公司在决定——这意味着它本质上是半策划的,对吧?它其实是公司的声音。所以我不知道,Section 230 是否应该适用?我不知道。也许不是。也许我们思考这个问题的方式本身就不对,所以值得关注。
Lenny Rachitsky: 哇,好。嗯,法律判断这块我完全外行,就不追这个话题了,但我感觉里面确实有非常有趣的东西。你刚才一直在说变化有多大,我想顺着这条线聊下去——具体来说,你们在 AI 视频领域是走在最前沿的。
AI 视频与真实性危机
Gaurav Misra: 是的。
Lenny Rachitsky: 感觉我们正在接近——也许已经到达了——一个你分不清是真实还是 AI 生成的世界。我好奇的是,首先你觉得我们离那个状态还有多远?其次,在一个你可以生成任何想要的视频的世界里,这意味着什么?
Gaurav Misra: 这是根本性的。说到底,视频、图像、音频不可信的时代,其实已经存在一段时间了。你想……十九世纪的时候没有视频、没有音频、没有图像,一切都靠”他说、她说”来证明。如果所有东西都可以被生成、任何内容都可以被创造出来,而且看起来和真实的一模一样,没有任何办法区分,那我们可能真的会回到那个时代——除了物证和”他说、她说”之外,没有办法证明任何事情。
AI 视频的真实性挑战
Gaurav Misra: 我觉得这很可怕,但同时也可能为有人想出如何解决这个问题打开了大量新的机会。我认为这会是一个大问题。我确实认为,在创造绝对逼真的视频方面,我们今天已经几乎达到了那个水平。最新的模型,最前沿的技术,距离实现它可能只有几厘米之遥。但我确实认为,要完全达到根本无法区分的程度,还需要几年的时间。
我也认为这在一定程度上是由使用场景驱动的。回想一下 Captions,我们对要聚焦的视频类型有着独特的视角——视频生成和文本生成视频。如果你看今天的情况,生成的全是无声视频,没有音频,而且通常是你所认为的定格视频或 B-roll,对吧?你不可能只用 B-roll 就做出一部电影。而一部电影、一部电视剧、一条社交媒体帖子或一支广告,很大一部分实际上是对话或独白。真正的内容是人们在互相交谈、对着镜头说话、互动交流。这才是真正构成故事的元素。
B-roll 是辅助性素材,用来交代场景或类似的作用——比如在场景开始之前,你会看到几个纽约或洛杉矶的镜头,然后画面切入房间,两个人开始对话。所以我们的目标是解决”说话视频”的问题——如何创建人们正在对话、独白或类似场景的视频。这是我们纯粹专注的方向。实际上目前在这个领域并没有太多的工作在进行,它也远未成为一个被解决的问题。我们正在接近,越来越近了,但今天的模型实际上出现了分叉。
技术路线的分叉
目前有一批公司能够创建我们所说的这类——也就是数字人视频。它们使用的是一种叫 neural rendering(神经渲染)的技术。这项技术实际上并没有受到 transformer 和 diffusion model 革命、或者说大模型革命的影响。它是一项独立存在的技术,与当前正在发生的 AI 增长浪潮没有关系。它碰巧能产生半逼真的输出,但到某个阶段就会触及瓶颈,因为目前尚不清楚如何使其在各种场景下具备泛化能力。
它需要针对每个人单独训练。所以你可能先输入一些你自己的视频,然后就能生成你的画面。所以这是一种不同的技术,产生的是不同的结果。一批公司使用这类模型,另一批公司则在从事无音频的通用文本生成视频。这些是大型生成模型,它们有潜力做得更多,但那个前沿尚未被触及。我认为在研究层面上,没有人怀疑这是百分之百可以解决的——只是需要有人去做,而我们还没有到达那一步。还没有人有时间去完成这件事。所以这就是我们目前所处的位置。
我们完全专注于用大型生成模型来做说话视频,这是我们的核心方向。不过我认为,从安全角度来看,我们有一个独特的框架或者说思考方式。一般来说,视频可以划分为两个类别。对我们来说,一边是”记录类”——这类视频可能是个人视频,比如你和朋友们出去玩、在餐厅聚会,记录发生了什么,你很开心,不管是什么,这些都是为了你的记忆。也有非个人版本,比如记者记录一起犯罪事件或其他事件——谁牵涉其中、在哪里发生的,可能是一场自然灾害之类的,这是为了历史记录,我们想知道发生了什么。
安全框架:记录类与叙事类
而在所有这些场景中,AI 生成的视频没有任何益处。实际上全是负面效应。如果我们生成虚假的现实来欺骗人们,没有任何好处。我们想远离这些东西。我们想要设计和构建的产品,要让这类用途变得困难——任何属于这个范畴的使用场景,都要很难用它来做。
另一边则是我们所认为的”叙事类”。
这可以是广告,可以是社交媒体帖子,可以是电视节目、电影。所有这些都是叙事。它们是为了娱乐、为了趣味而设计的。没有人会看 Geico 的广告时真的认为那只壁虎在某个地方卖保险。你知道这是虚构的,是为了娱乐。真人秀也一样,对吧?它叫”真人秀”,但绝对不是真实——社交媒体、广告,所有这些都属于这个范畴。
如果我们能让更多人讲述故事、娱乐他人、传递他们的信息,那就是纯粹的正向价值。这是我们要专注的方向。我们在产品和设计流程中投入了大量精力,研究如何设计和构建产品——让它在一边的用途上特别难用,在另一边的用途上特别好用。这才是真正的挑战。
Lenny Rachitsky: 这个框架非常有用。我特别好奇的一件事是——字节跳动刚刚发布了一个非常惊艳的模型,我刚才还在看——你放一张照片进去,然后它就能生成这个人以各种方式说话的视频。在你刚才描述的框架中,这个模型落在哪个位置?
Gaurav Misra: 我认为它恰好落在我们所处的领域,也就是”说话的人”——他们也在朝这个方向发力。这其实是大公司发布的第一个大型模型的例子,能够做这种对话或独白视频。你自己也看到了,所以我不需要过多描述,但正如你所知,它的表现力非常强。它看起来不像数字人视频,它看起来就像……
Lenny Rachitsky: 对,非常疯狂。
Gaurav Misra: 这是因为所用的技术有根本性的不同。他们用的是真正的大型 diffusion model。而大多数从事数字人技术研究的公司,实际上用的技术相比之下相当基础。
Lenny Rachitsky: 那个 Will Smith 吃意大利面的视频到现在多久了?给我们一个参考,感受一下事情发展得有多快?
Gaurav Misra: 天哪,发展得太快了,对吧?
Lenny Rachitsky: 我觉得是一年。
Gaurav Misra: 不可思议。
Lenny Rachitsky: 还是两年?
Gaurav Misra: 我觉得大概一年半到两年左右吧?
Lenny Rachitsky: 哇。我们会在节目笔记里附上那个视频链接,你就能看到那就是一到两年前 AI 视频的最前沿。然后我们也会附上这个新的 Omni 什么的——我忘了叫什么名字了。我会展示一下今天是什么样的水平。
AI 视频营销的终局
我的天。好,最后一个问题,围绕一个我知道你有非常独到见解的话题——你认为人们体验 AI 的最终前沿将是 AI 视频在营销中的应用,也就是在营销渠道中看到它。聊聊你为什么这么认为,以及那会是什么样子。
Gaurav Misra: 这又回到了我们之前讨论的话题。现实是,无论一项技术多么有趣、先进、令人惊叹——科幻已经变成了现实。我们之前聊过这个,电视里那些曾经纯粹是科幻的东西如今已成真,但老实说,大多数人甚至都不知道。
我父母住在印度,他们是社区里唯一知道 ChatGPT 的人。他们会用各种华丽的词藻给社区写这些非常精彩的公告。大家都说,“你怎么写得这么好?“他们也不告诉任何人。但仍然有大量的人甚至不知道这些进步已经发生了。所以,普及速度实际上比我们想象的慢得多,即便是对最令人兴奋的事物也是如此。当然,在科技圈里,人人都在谈论这些,但现实是,它需要一段时间才能真正触达到更广泛的人群。
我认为,对于那些将会成功的公司来说,他们必须想清楚如何营销这些产品,才能成为触达那些有问题需要解决、而这些问题现在已经可以被解决的人群的那一家。我们每天都在思考这个问题。所以,作为一个消费级产品,我们花了大量时间和金钱在产品营销上,常常使用效果广告渠道和各种手段。但大约一年前,当我们在广告中使用 AI 视频时,会收到大量评论,人们会说,“天哪,这太假了,别给我看这个。“
AI 广告从被嫌弃到超越实拍
大约在那个时候,技术刚好成熟到了一个临界点——突然之间,那些负面评论消失了,而且你获得的效果甚至可以超过真人拍摄,因为你可以尝试更多方案。你可以直接生成三四十个版本,其中一个会胜出,而且它的表现会超过你从真人那里得到的单一创意。更有意思的是,当你考虑本地化的时候,你需要在每种语言中都重新做一遍。一旦你发现了获胜创意,现在你需要在每个市场进行本地化,从零开始重新制作。
这个工作量是巨大的。而且往往本地化版本的效果不如原版,因为本质上它被重新构思过了。但我们发现,仅仅用 AI 进行翻译,就能获得几乎和原始语言原版一样好的效果。所以这一切将会席卷整个市场。我认为,只要有钱可以赚、可以省,这就是不可避免的。它会被大量消费,而且会很快占领大量社交媒体。
反乌托邦的未来:全 AI 生成的社交网络
我的意思是,你可以想象一个未来的社交网络——顺便说一句,这是反乌托邦的,所以做好准备。你可以想象一个未来的社交网络,所有内容都是生成的,所有人物都不是真实的。算法不再是根据真人的内容来决定展示什么给你,而是纯粹生成完全为你量身定制的内容——人物、一切,都完全为你定制。我认为这并非不可能。在某种意义上它几乎是必然的,而且我认为这并不遥远。这在五年左右的时间内就非常可能成为现实。
Lenny Rachitsky: 我的想象是这样的——因为很难想象一个全部是”人”的社交网络,毕竟我们通常想知道这些人是谁。我不在乎随机的人发什么状态更新,但我能想象一个全是 AI 的 TikTok。
Gaurav Misra: 没错,完全没错。
Lenny Rachitsky: 哇,完全根据你的喜好和兴趣定制的内容,就是纯粹的视频。哇。
Gaurav Misra: 没错。
Lenny Rachitsky: 就是一堆随机视频。哇。
Gaurav Misra: 对。因为你知道吗,你看 TikTok 的信息流,你今天甚至都分不清谁是真人谁不是,对吧?又不是我们——
Lenny Rachitsky: 没错。如果是我的话,我就会直接加入 TikTok,开始上传 AI 生成的视频。
Gaurav Misra: 没错。
Lenny Rachitsky: 然后围绕它建立起一整个网络。天哪,未来太疯狂了。
失败角落
让我们进入”失败角落”。我在这个播客中一直尝试分享那些事情进展不顺的时刻。大家听到的都是一切顺利的故事——创始人做得风生水起,正在打造一家十亿美元公司,哦,太棒了。但他们不知道那些出错的事情。所以让我问你,你能不能分享一个事情没有成功、你失败了的经历?
Gaurav Misra: 在公司成立之初,我们其实花了很多时间去摸索到底要做什么。我觉得在某种程度上这是一个不太常规的故事,因为我们创立公司后做的第一件事就是构建 Captions 应用。我们发布了这个应用,这是我们的第一个产品。花了两天时间构建。我们把它放出去,它立刻就起飞了。这太令人震惊了,因为我是用一个周末做出来的。我们把它放出去后,周一我给联合创始人打电话说,“它在 App Store 排行榜顶端,我们每天收到大概 600 个视频。”
Lenny Rachitsky: App Store 榜首,我去。
Gaurav Misra: 而我们什么都没做来促成这件事,它完全是自发发生的。在某种程度上这甚至有点反高潮,因为我们以为在那之前需要花更多时间来打磨产品。所以我们觉得,“等等,不可能就这样吧?不可能这么快。这是怎么发生的?“所以我们因此分心了,因为我们想,“哦,好吧,这个……很酷,能用,很好,但我们得弄清楚真正的产品是什么。“
浪费了一年半
于是我们花了至少一年到一年半的时间去思考构建社交网络以及各种其他东西,而我们本应该一直在做 Captions,因为那里已经有产品市场契合了。我们是怎么发现的呢——Captions 一直挂在我的个人账户下,所以我不怎么查看它。在公司成立大约一年半后,当我们在做其他项目的时候,我回到了我的个人账户,打开一看,发现里面有 50 万美元。
我看了下图表,收入就在一直增长,完全是自发增长。没有员工维护,没有版本更新,没有 bug 修复,没有客户支持。有大概 2000 个未回复的支持工单,积压了一年半,但好评如潮。它完全靠自己运转。所以对我来说这是一个明确的信号。我当时就想,“天哪,你应该一直在做这个。那个产品是对的。”
所以我们立刻开了个会。说实话,当时很难确定正确的路径是什么,因为我们在其他项目上也投入了大量时间,但我们重新调整了方向,回到了 Captions 的轨道上。而且就在我们开始往里面发布第一批新功能的那一刻,它就爆发了。那时候原来看起来像垂直线的东西变成了水平线,而新的垂直线如此陡峭,以至于旧的垂直线相形之下变成了水平线。从那以后一直在持续增长,这太疯狂了。所以我们基本上浪费了大约一年半。
Lenny Rachitsky: 我很喜欢这种描述曲棍球棒时刻的新方式——不仅仅是在垂直上升,而是图表上其余部分现在看起来就像贴在横轴底部的平直线。
Gaurav Misra: 完全正确。是的。
Lenny Rachitsky: 对于可能不了解 Captions 是什么的人,我在开头尝试描述过了,我们也会在节目说明中放链接。但简单来说,你觉得它没什么的原因是它就是给录制的视频加上字幕。
Gaurav Misra: 没错。
Lenny Rachitsky: 就是加字幕。
创办社交网络的尝试
Gaurav Misra: 没错,是的。所以我们的想法是……我们的计划是先做一个社交网络,但在那之前得先为这个社交网络做一个创作工具。我们很明确要用 AI 来创作视频,而语音转文字在当时已经是被解决的问题,所以从那里切入看起来是显而易见的选择。这就是我们决定从 Captions 开始的原因——因为它在当时是个已经解决的问题。有趣的是,当 GPT 这类东西出来之后,很多原本未解决的问题很快就变成了已解决的。所以时机几乎完美。
Lenny Rachitsky: 这和你之前分享的一件事是吻合的——太多原本未解决的问题现在变得可行了,而那些在正确时间出现在正确位置、一直在等待这一刻的公司,会极大地受益。
另一件我觉得这个故事里很有意思的事是,你们当时试图做一个社交网络,大概是围绕高中之类的场景。正如我们所见,做一个新的社交网络非常困难。所以我想听听你的看法——你觉得有没有可能有人冒出来,做出下一个 Facebook、下一个 Snap、下一个什么?
Gaurav Misra: 我觉得完全有可能。不过让我告诉你一件疯狂的事。我们当时做的那个社交网络,后来已经从 App Store 下架了,现在已经不可用了。但直到今天,仍然有人在用它,有数千人还在上面发帖、做各种各样的事情。这其实从某种角度说明了社交网络的力量——难创建,也难杀死。我觉得 X 其实也是个很好的例子,那里发生了那么多变动,但它居然还在运转,不知道怎么做到的。这就是一个证明。
网络效应的力量
Lenny Rachitsky: 网络效应的力量,尤其有人曾经描述得特别好——他们说 Twitter/X 换了品牌,换了开发团队,换了 URL,所有东西都变了,唯独里面那些人形成的网络效应没变。
Gaurav Misra: 确实,确实。
Lenny Rachitsky: 我刚看到一条消息说他们在赚几十亿美元。他真的扭转了局面,它正在变成一家非常赚钱的公司。
Gaurav Misra: 哇。
Lenny Rachitsky: 对,前几天刚出来的新闻。Elon 做到了。好吧,说到这里,我们进入了非常激动人心的快问快答环节。准备好了吗?
Gaurav Misra: 准备好了,来吧。
快问快答
Lenny Rachitsky: 有哪两三本书是你最常推荐给别人的?
Gaurav Misra: 我得在这里坦白,我其实不读书。这是我刻意做的决定——我决定不在阅读能力上积累技能,而是把技能建立在听和看上,因为我认为那才是未来。
Lenny Rachitsky: 我喜欢这种刻意的态度,我也喜欢这是一种很酷的”我不读书”的说法——未来不是阅读。不过我喜欢你身后有书,所以……
Gaurav Misra: 我确实有。是的。
Lenny Rachitsky: 那些书没读过,就摆在那后面。
Gaurav Misra: 没读过,就放在后面。
Lenny Rachitsky: 有意思。好的,酷。我还想追问更多,但我继续往下走。快问快答。说到看和听,你最近有没有特别喜欢的电影或电视剧?
Gaurav Misra: 我喜欢 Silo 和 Severance。当然,我觉得大家都在看这些。Silo 也有原著小说。
Lenny Rachitsky: 我读过,全部都读了,一共三本。
Gaurav Misra: 是的。
Lenny Rachitsky: 看剧就很难受,因为你知道接下来所有要发生的桥段,我就一直在想,“我为什么在看这个?我知道会走到哪里。”
Gaurav Misra: 是的,不过话说回来,剧的走向似乎确实和书略有不同。
Lenny Rachitsky: 是的,这也是让我恼火的地方。就像,“搞什么?这都是编的。全是编的。“我看剧的时候不喜欢这种感觉。所以两个原因让我看不下去,但是……
Gaurav Misra: 别担心,我其实没读过原著。我太太读了然后讲给我听的。
Lenny Rachitsky: 好好好,我刚才还担心了。好的,Severance,很棒。我也很喜欢 Severance。
下一个问题。你最近有没有发现一个特别喜欢的产品?
Gaurav Misra: 说实话,我最喜欢的产品是 Linear。不骗你,就是因为它设计得特别好,用起来也特别顺手。我也喜欢 Superhuman。这些都是很显而易见的答案,但我确实每天都在用这些东西。能做到让人每天用还不讨厌,是很难的。所以要给他们点赞。
Lenny Rachitsky: 酷。我还没在播客上公布过这件事,但现在是个好时机——所有正在听的人,我刚刚推出了一个捆绑包:如果你成为我 Newsletter 的付费订阅者,你听好了——可以免费获得一年的 Linear、Superhuman、Notion、Granola(一个非常棒的 AI 笔记应用)以及 Perplexity Pro。
Gaurav Misra: 噢,不错。
Lenny Rachitsky: 价值 2000 美元的东西,只要我 Newsletter 的价格,200 美元。
Gaurav Misra: 天,这是真金白银的价值。
Lenny Rachitsky: 难以置信的划算,现在买订阅完全是毫无疑问的选择。不过这不是给我 Newsletter 打广告,我继续往下走。
下一个问题。你有没有一个最喜欢的人生格言,在工作和生活中经常回想、经常和朋友家人分享的?
Gaurav Misra: 其实我是从别人那里知道的,因为有人告诉我我一直在重复这句话。但我有一个关于工作中如何运作的框架。怎么说呢,我喜欢竞争,归根结底喜欢赢。而要赢,你就得做到最好。但我也认为,做到最好的最简单方法就是做第一个,这才是关键。
Lenny Rachitsky: 所以格言就是”最简单的方法”?就是那句——
Gaurav Misra: 就是这样。做到最好的最简单方法就是做第一个。
Lenny Rachitsky: 做第一个。有意思。好的。我得忍住不往下追问,因为要保持快问快答的节奏。
好的,最后一个问题,轻松一下。你最近见过的最酷、最疯狂的 AI 视频是什么?有没有一个让你觉得”哇,这真的了不起”的?
Gaurav Misra: 说实话,我得说 OmniHuman 的东西确实挺酷的。
Lenny Rachitsky: 就是我们之前聊到的字节跳动那个视频。
Gaurav Misra: 对,就是那个。一棵西兰花在说话,不知道你看了没有——一棵小西兰花在发表演讲。
Lenny Rachitsky: 有意思。
Gaurav Misra: 是的,看起来就像是由动画师做出来的。
Lenny Rachitsky: 想象一下现在的孩子们,看到这种东西。
Gaurav Misra: 我觉得大概就是习以为常了吧?就是觉得这很正常。
Lenny Rachitsky: 嗯,就像我们说的,AGI 就要来了。
Gaurav Misra: 没错。
Lenny Rachitsky: 好的,酷。晚饭吃什么?好的,太棒了。
Gaurav Misra: 是的。
Lenny Rachitsky: 太棒了。Gaurav,这次对话非常精彩,在好多层面都很有启发。最后两个问题:想了解更多的话,大家可以在哪里找到你和你们正在做的事情?另外,听众怎样才能帮到你?
Gaurav Misra: 好的。大家可以在 LinkedIn 上找到我,我大部分时间都在那里。私信是开放的,随时可以给我发消息。至于什么会有帮助——我们正在组建早期的产品和设计团队,所以如果你对 AI 视频感兴趣,对消费类应用感兴趣,现在是加入的最佳时机。我们团队很小,很早期,所有人紧密协作,基本上不会再有比现在更好的加入时机了。
Lenny Rachitsky: 而且你每周都能上线一个可营销的功能。
Gaurav Misra: 没错。这简直是产品经理的梦想,你想想看,对吧?
Lenny Rachitsky: 产品经理的梦想。对,我觉得这也是一个很好的筛选标准——对此感到兴奋的人,非常匹配;对此感到焦虑的人,这里就不太适合。
Gaurav Misra: 完全同意。
Lenny Rachitsky: 太好了。好,Gaurav,非常感谢你来。
Gaurav Misra: 不,应该我谢谢你。很感激。
Lenny Rachitsky: 大家再见。
感谢大家收听。如果你觉得这期节目有价值,可以在 Apple Podcasts、Spotify 或你喜欢的播客应用上订阅。也请考虑给我们评分或写评论,这对其他听众发现这个播客真的很有帮助。你可以在 lennyspodcast.com 找到所有往期节目或了解更多关于这个节目的信息。下期再见。
术语表
| 原文 | 中文 |
|---|---|
| alignment | 对齐(组织内部共识与步调一致) |
| avatar video | 数字人视频 |
| B-roll | B-roll(辅助镜头/空镜,保留原文) |
| Brian Chesky | Brian Chesky(Airbnb CEO,保留原文) |
| ByteDance | 字节跳动 |
| creative | 创意(广告创意素材) |
| design engineering | 设计工程(design engineering,指设计师与工程师融合的职能) |
| diffusion model | diffusion model(扩散模型,首次出现保留原文) |
| Evan Spiegel | Evan Spiegel(Snap CEO) |
| eye contact | 眼神接触(产品功能名称) |
| Founder mode | Founder mode(创始人模式,保留原文) |
| Geico | Geico(美国保险公司,保留原文) |
| Head of Growth | 增长负责人 |
| hockey stick moment | 曲棍球棒时刻(指增长曲线突然急速上升的关键节点) |
| IC | IC(独立贡献者,Individual Contributor,首次出现保留原文) |
| marketable | 可营销的 |
| monetization | 商业化 |
| MyStory | MyStory(Snap的功能名称,保留原文) |
| neural rendering | neural rendering(神经渲染,首次出现保留原文) |
| Nvidia | 英伟达 |
| Onboarding | Onboarding(用户引导,首次出现保留原文) |
| one-way door | 单向门(不可逆决策) |
| Our Stories | Our Stories(Snap的功能名称,保留原文) |
| paid growth | 付费增长 |
| Patrick Olson | Patrick Olson(保留原文) |
| performance channel | 效果广告渠道 |
| PM | 产品经理 |
| PRD | PRD(产品需求文档,Product Requirements Document,首次出现保留原文) |
| product-market fit | 产品市场契合 |
| roadmap | 路线图 |
| RPM | RPM(可能指研究型产品经理或特定职位,保留原文) |
| runway | 跑道(可用余量/缓冲期) |
| Section 230 | Section 230(《美国通信规范法》第230条,保留原文) |
| SEO | SEO(搜索引擎优化,Search Engine Optimization,首次出现保留原文) |
| Spotlight | Spotlight(Snap的短视频功能名称,保留原文) |
| support ticket | 支持工单 |
| surface area | 触点 |
| table stakes | 基本配置型 |
| technical debt | 技术债务 |
| transformer | transformer(保留原文) |
| two-way door | 双向门(可逆决策) |
| virality | 传播力 |
| Will Smith | Will Smith(美国演员,保留原文) |
此文档由 AI 分片翻译(translate_long_document)