产品科学、重大押注,以及 AI 如何影响音乐的未来 | Gustav Söderström
The science of product, big bets, and how AI is impacting the future of music | Gustav Söderström
Gustav Söderström (00:00:00): The internet started with curation, often user curation. So you took something, some good like people or books or music, and you digitize it and you put it online and then you ask users to curate it. And that was your Facebook, Spotify, and so forth. And then after a while, the world switched from curation to recommendation, where instead of people doing that work, you had algorithms. And that was a big change that required us and others to actually rethink the entire user experience and sometimes the business model as well. And I think what we’re entering now is we’re going from your curation to recommendation to generation. And I suspect it will be as big of a shift that you will eventually have to rethink your products. We have to rethink the user interface and the experience for recommendation first era. And so what does that mean in the generative era? No one really knows yet.
Lenny: Welcome to Lenny’s Podcast where I interview world-class product leaders and growth experts to learn from their hard one experiences building and growing today’s most successful products. Today, my guest is Gustav Söderström. Gustav is a product legend and he’s now the co-president, chief product and chief technology officer at Spotify, where he’s responsible for Spotify’s global product and technology strategy and oversees the product design data and engineering teams at the company. I’ve had Gustav on my wish list of dream guests to have on this podcast since the day I launched the podcast and I’m so happy we made it happen.
In our conversation, we dig into what Gustav has learned about taking big bets and what to do when they don’t work out, how Spotify moved away from squads and how they structure their teams now, how AI is already impacting their product, and also the future of music generated by AI. Also, why all great products need to pull some magic trick, how accurately succession represents Swedish business culture, and his hilarious analogy of peeing in your pants. Enjoy this episode with Gustav Söderström after a short word from our sponsors.
Eppo lets you go beyond basic click-through metrics and instead use your north star metrics like activation, retention, subscription, and payments. Eppo supports test on front end, on the backend, email marketing, even machine learning claims. Check out Eppo at getE-P-P-O.com. That’s geteppo.com. And 10X, your experiment velocity.
Gustav, welcome to the podcast.
Gustav Söderström (00:04:11): Thanks for having me, Lenny. Pleasure to be here.
It’s my pleasure to have you on. So at this point, you’ve been at Spotify for over 14 years, which is a rare feat in the tech world, and you’ve held a lot of different roles while you’ve been at Spotify. Can you just start off by giving us a sense of what these various roles and what you’ve done over the years at Spotify and then just what are you up to these days? What are you responsible for now?
Gustav Söderström (00:04:33): So I came into Spotify in early 2009, late 2008. And my job then, I had been an entrepreneur, started some of my own companies in the, back then, very, very early feature phone, smartphone space. So I had a bunch of knowledge there. I had sold the company to Yahoo in the mobile space. I worked there for a while. I came back to Sweden and then I met through a mutual friend, Daniel Ek, the CEO and co-founder of Spotify. And they had built the desktop product already, the free streaming desktop product, and it was amazing and I could try it, but they needed someone to figure out what to do with mobile. And because I had been an entrepreneur in that space, I got that job.
Gustav Söderström (00:05:20): So my job was to head up mobile for Spotify and figure out what the mobile offering would be, which was a challenge because, obviously, Spotify desktop was a free on-demand streaming application and back then, specifically with edge networks, you couldn’t really stream at all in real time. The performance wasn’t there and also you couldn’t fund that with an ads model. So it was a product and business model innovation that was a lot of fun. So that’s how I started. Then after a few years, I took on all of product development for Spotify. And then a few years later, I actually took on the technology responsibility, the CTO role for Spotify as well. And recently, my official title is co-president of Spotify together with Alex Nordstrom. So we run half of the company each. I run the product and technology side and he runs for the business and content side. So that’s the super fast version.
Gustav Söderström (00:06:20): Aside from getting more responsibilities taking on the technology department, it has been the same job by title. I’ve always reported to Daniel, but because Spotify’s grown so much every six to 12 months, it’s been starting at a new company. First, it was sort of a Swedish Nordic challenge and then it was a European challenge and then it was getting into the US and then we became a public company. So it’s as if I had jumped around between a lot of jobs actually, even though it was largely the same title and role.
Your story makes me think of the classic be careful what you’re good at because you end up taking on more and more. And clearly, you’ve been given more and more responsibility over the years and so clearly things are going well and you’re doing well.
Lenny: Shifting a little bit. So you’re on my podcast currently. You actually have your own podcast, which was this limited series on the product story of Spotify, which I listened to and loved, and it’s surreal to listen to your voice in real time because I’ve been listening to that recently in preparation for this conversation. Two questions, just what made you decide to launch your own podcast knowing you had a full-time job and a lot going on and the production value for your podcast was very high for what I could tell? And then two, just what did you learn from that experience in terms of the product you ended up a building and just empathizing with the podcast creator side of?
Gustav Söderström (00:07:41): There were a bunch of different reasons why I did that. One is, and not a small one, I think, like you, I love writing and I have this secret creator dream in me. I used to write blog posts a long time ago and I write internally a lot. You can’t write that much externally when you work at a company like this.
Yeah.
Gustav Söderström (00:08:02): But I love writing and talking and presenting. So there was certainly that. And then no small part was, from a product point of view, to empathize with one of our main constituents, the podcast creator. I’m, unfortunately, not a great musician. I try to play instruments and so forth, but I don’t have any records. I don’t sing very well. But I decided to make a podcast and that taught me a huge amount about what it’s like to be a creator, creating different styles of podcast.
Gustav Söderström (00:08:33): For example, we wanted to do a higher production cost podcast with music and then right away you run into a bunch of problems as Spotify is actually pretty well positioned to solve, but still, it’s really hard to have music in a podcast from a rights perspective. So you understand all these problems that podcasters have and you can be better at solving them. But the biggest benefit and the real reason for doing the public podcast was that I have actually done an internal podcast through a hack where we could gate the podcast-only employees. And I tried to figure out internally how to build more culture around Spotify and help define for new employees and existing employees, who we are, the mistakes we did, the successes we had, and how we think about strategy specifically in product strategy because we were quite well known externally for technology and the squads and all of these things, not so much for product strategy.
Gustav Söderström (00:09:36): And because I love storytelling more than Google Docs, I decided to do an internal podcast and I went around and I interviewed actually Daniel’s direct reports. So the CMO, the CHRO and CFO. And just ask them about a bunch of stuff. And the idea was to make them more approachable for employees because I felt listening to podcasts, even these people that have no idea who I am because I’ve never met them, I feel like I know them, I feel like I know how they think and I just like them much more. So the secret idea was, what if you could get to know your leaders much better than you do through occasional meetings or some town hall? So I did that internally, and because I’m a product person, we ended up talking a lot about product and product strategy. And people internally really like that.
Gustav Söderström (00:10:26): So next time, the question was, what if people that don’t even work at Spotify yet could feel as if they knew people at Spotify? That’d be great because most leaders in most companies are very opaque and appear as some otherworldly creatures that aren’t really real, I think, when you see them in business papers or something. So what if you have heard them talk for an hour or so? So that was general idea. So a combination of recruitment tool, sharing more about how we think about product strategy and just because I think it was a lot of fun. I got to interview a bunch of smart and interesting people both externally and internally.
Did it have the effect that you were hoping after looking back?
Gustav Söderström (00:11:12): I think it did. The podcast did well and, no, we did not give it our own promotion. I had to compete as everyone else, which also gives you a lot of empathy for the problem of like, okay, now you have a product, what about user acquisition? How do you actually get people to listen to it? So it did achieve what I wanted in the sense that we have this thing called intradays where especially in the past few years when we’ve hired a lot, we actually fly people to Stockholm for an onboarding session to learn about Spotify. And the leadership is on stage, talking about what they do and the departments and strategy and so forth. And it’s very common that people come and tell me that, “Oh, I listened to this podcast or this and the episode and it’s at least one of the key reasons why I joined or sometimes the reason why I joined.” So it’s anecdotal, but it may be in the many tens of people, at least, who have said it. So that seems to work.
That’s really interesting. Just again, and this comes up a few times on the podcast, is just the power of content in all these different ways for hiring for culture building. And it sounds like the original goal was just internally build this clinic culture and strategy.
Gustav Söderström (00:12:24): That was the original goal, make senior leadership more approachable and reduce the distance and then also share more of the thinking in an entertaining way rather than just through docs that people end up not reading.
I love that. So I was listening to it, as I said, and what was really interesting is I think episode four was actually all about AI, and I think your first attempts at leveraging machine learning in AI within Spotify. And I think that’s what led to Discover Weekly and a few other tools. And that was years ago. And it’s interesting listening to it now where AI is, again, a huge deal. And so I’m curious very tactically on the product team what you advise product managers and product teams on how to think about AI in their product thinking and also just in their day-to-day work.
Gustav Söderström (00:13:12): I can give a few examples there and I don’t know that we’re more sophisticated than anyone else, but we’ll be doing at least the traditional machine learning for quite a long time. And I think in the podcast, I think I talked about the journey of the internet in stages. And one way to think about it is that the internet started with curation of the user curation. So you took something, some good, like people or books or music and you digitize it and you put it online, and then you ask users to curate it. And that was your Facebook, Spotify and so forth. And then after a while, the world switched from curation to recommendation, where instead of people doing that work, you had algorithms. And that was a big change that required us and others to actually rethink the entire user experience and sometimes the business model as well.
Gustav Söderström (00:14:04): And I think what we’re entering now is we’re going from your curation to recommendation to generation. And I suspect it will be as big of a shift that you will eventually have to rethink your products. So that’s one lens. So I tend to talk to my teams about, even though it’s all machine learning, I ask them to think of this as something completely different. The recommendation error was one type of machine learning. The generation error is a different type, so don’t think of it as just more of the same, think of it as something actually completely new instead. And what we learned in … Well, a few things. So if you look at this new era of large language models and the fusion models and so forth, there are two types of applications. As I said for the recommendation error, we had to rethink the user interface and the experience for recommendation first error.
Gustav Söderström (00:14:57): And so what does that mean in the generative area? No one really knows yet. As usual, there are a bunch of iterative improvements. So we use these large language models to improve our recommendations. You can have bigger vectors that can have more cultural knowledge. You can use it for safety classification on podcasts that no one has listened to yet and so forth. So there’s lots of obvious improvements and we’re doing those. But so far, we’ve only really done one real generative product in the hard definition, which is a product that couldn’t have existed without generative AI, and that is the AI DJ. So that’s a concept that we’ve been thinking about for a very long time. And the AI DJ is you press a button, a digitized person, there’s a real person named X, digitized X. So he’s now an AI, comes on and talks to you about music that you like and suggests music, and you can listen to it. And if you don’t like it, you can just call him back and he says, “Okay, now, let’s listen to something maybe from a few summers ago,” or “Here’s some new stuff that were trending yesterday in The Last of Us episode or something like that.”
Gustav Söderström (00:16:10): So that product couldn’t have existed without generative AI, both generating the voice and generating the content of what the voice says. So you can have individualized, personalized voice at the scale of half a billion people. And so we had the use case we have seen for many, many years. Sometimes people call it the radio use case. We called it the zero intent use case internally when you actually don’t know what you want to listen to at all.
Gustav Söderström (00:16:40): Spotify wasn’t that good. Spotify was good when, at least roughly, you knew the use case of what you want to do, if it was a workout or dinner. We had lots of options for all of those. But if you really didn’t know at all, it was hard to open Spotify and stare at it. And people used to say longingly that this was the one thing that radio was good at. Radio was quite bad, to be honest. I mean, it’s not personalized to you at all. It’s not on demand. You come in in the middle of things, it’s actually terrible in many ways. But people still often say that there was something good about it. And I think that’s something was the fact that you had a knob and you could just switch between contexts. It’s like no, boring, boring, boring, boring, okay, this is good.
Gustav Söderström (00:17:23): And Spotify never had that mode of, I don’t know what I want, but I want to cycle through things until I find something that I like. And I think with the AI DJ, that’s actually the use case we managed to solve. So X comes on and says, “I’m going to suggest something to you that you can listen to.” And if you like it, you can keep listening, but if you don’t like it, you bring him back again and you change channel. And for one reason or another, we tried to solve that for many times for a long time, but just starting to play a random song without any context as to why you would hear this, it just never worked. So that was our first foray into a product that couldn’t exist before. And I think to your question of principles around that, there are a few pretty distinct principles that we’ve learned.
Gustav Söderström (00:18:09): One that I really like that is not my principle at all, I think it is straight from Chris Dixon, is the principle of fault-tolerant user interfaces. So I can’t say how many times during the early machine learning era when we said we’re moving from curation to recommendation. I saw a design sketch that was a single big play button because clearly that is the simplest user interface you can do, but if you don’t understand the performance of your machine learning, you can’t design for it. The quality of your machine learning, if you’re going to have a single play button, needs to be literally 100% or zero prediction error, and that’s never the case. So let’s say that you have a one in five hits, four out of five things are done, then you need a UI that probably at least shows five things at the same time on screen. So you have a one in five of something being relevant on screen.
Gustav Söderström (00:19:03): So you need to understand the performance of your machine learning to design for it. It needs to be fault tolerant and often you need an escape hatch for the user. So you make a prediction. But if you were wrong, it needs to be super easy for the user say, “No, you’re wrong, I want to go to my library or to this or to that.” So we have that principle of having fault-tolerant user interface and a user interface that corresponds to the current performance of your algorithms. And I think that is going to be true for generative machine learning as well. I think a very clear example actually is Mid Journey. If you think about the early Mid Journey user interface inside the Discord channel, actually generating an image was very, very slow.
Gustav Söderström (00:19:48): It took a long time to generate high-quality image and they could have built the silver button thing where you put in a prompt, you wait for minutes, you get an image, and I think one out of four times, it’s going to be bad. So you would’ve been disappointed three out of four times and it’s a minute each, so like four minutes later, you’d be, “This is a shitty product.” What they did was they generated four simultaneous low-res images very quickly and you could say, “So apparently, their performance was probably one in four, that’s why they showed four and not six.” And so one in four was usually pretty good. You click that one and either continue to iterate or scale it up. So that’s also an example of, I think, people understanding where the performance of generative AI was when they built the UI. So that’s something that I would be inspired by.
Gustav Söderström (00:20:37): And for the AI DJ specifically, another principle is to try to avoid this urge of just wanting to show off the technology and then have this voice that talk and talk and talk and talk. You have to remember that people came there for the music. So the principle for the AI DJ coming from the team, by the way, this was a bottoms-up product actually, it required a lot of support. We actually acquired big companies and so forth to be able to build it. But the idea had been built by teams bottom up. So the principle there was literally to do as little as possible and get out of the way. And I think that was really helpful. It’s not telling you what the weather is and what happened in the news and going on and on and on about this band. It is trying to get you to the music and I think that’s why it’s working because it is working very well for us.
Lenny: I love this distinction between recommendation and generation. And this begs the question of, there’s this trend that I imagine you’re seeing of people autogenerating music using artists catalog. There’s this Drake and The Weeknd thing that came out a week or two ago. Where do you think this ends up going and how do you think artists adjust to this world where music can just be autogenerated? This play button is all of it is generated versus just like the DJ in between the songs.
Gustav Söderström (00:21:53): First, big caveat, this is just super early. No one knows anything about how this is going to play out or the legal landscape and so forth, but I think it’s going to have a lot of impact. And I think if we talk about two things, one is what it could do for music, the other is the right situation, and if rights-holders are getting compensated and so forth. So we talk about the first thing in isolation. I think an interesting example is right about when I grew up, Avicii came along. And it’s interesting to think about because Avicii was not really considered by the existing music industry as a real artist because he couldn’t really play an instrument and he couldn’t sing, and he was just sitting with this computer in this DAW, digital audio workstation. And so it wasn’t really considered real music. And I think now all of us consider it very real music and that he had tremendous real musical talent.
Gustav Söderström (00:22:51): So I think right now, we’re probably in the face where people say this isn’t real music and it’s somehow fake. I think the way to think about these diffusion models if and when they get good enough at generating music is probably the same like an instrument. It’s just a much more powerful instrument and we’ll probably see a new type of creator that wasn’t proficient at any instrument and they couldn’t assemble a full orchestra and do the thing that they had in their head and they can now generate very new things. I also think, by the way, that there is this distinction between AI music and real music that doesn’t exist. For sure, very talented real musicians are using AI to get better and to help create new ideas. So that distinction doesn’t really exist. It’s all going to be AI. The question is what percentage, which makes the problem harder because you can’t talk about if it should exist or not.
Gustav Söderström (00:23:50): You have to talk about what percentage should exist and who gets to use it or not. But I think the way to think about it is probably as an instrument and that could help create a huge amount of art. And I think this is not news to you who probably use these things a lot, but I think if you don’t use these generative models, there is the perception that you tell it to create a hit and you will get that. That’s not how it works. Actually, what these models do is because they’ve been listening to a lot of music, they are very good at doing something that sounds very similar to what already exists. Actually being original is very hard. And from one point of view, as it now gets easier to create more generic music, it will actually be more difficult than ever to be truly unique.
Gustav Söderström (00:24:39): So I still think there would be tremendous skill in creating something truly unique. And my hope would be that what happened with the DAW and that technology jump was you got a whole new genre like EDM that you couldn’t really produce it with an orchestra or live. And maybe we’ll see completely new music styles with these technologies. I think that would be very exciting. So that’s on the positive side, but then you have the rights issue, which I have a lot of empathy for. And Spotify specifically has seen this before. So we had a different technology shift like this, which was the technology shift to online downloads of music and piracy and peer to peer. So first, it was a big technology shift in peer to peer and it was exciting for consumers. More consumers started listening to more music than ever. And I think that’s where we are now with generative AI.
Gustav Söderström (00:25:31): There’s a new technology, but it also required a new business model before creators and industry could actually participate and benefit from this. And that’s, obviously, self-serving to say because we were a big part of innovating that business model. But I still think that’s what’s necessary and I hope that that’s what I and we could be part of. So I think we’ve seen the first part, the technology shift, and there will probably be a lot of discussion and chaos here which have a lot of empathy for, but I think we haven’t seen the second part yet. What is a model where this could be a benefit? What actually happened after piracy is that the music industry got bigger than ever, not just as big but bigger than ever. And I think that could happen with this technology as well. But we’re right in the beginning.
So along the same lines, something else you teach is this idea of all truly great products have to pull some magic trick. This comes up in your podcast a lot and I think you mentioned this other places, and thinking about all the stuff you’re talking about here, it feels like, in a sense, everything’s going to feel like magic because AI’s baked into it.
Gustav Söderström (00:26:38): I think when we did the AI DJ, we did a small version of that. When people first listened to it, we could see that reaction in use of testing when they’ve wondered … So the magic trick there was that how could they record this person saying so many different things because it’s talking about my music. So the magic trick was, obviously, didn’t record a person saying, it’s generated, and that magic trick wears off. You hear it all the time now and so forth, but it was one of those magic tricks. So I still think that concept is important and it seems to correlate with products going viral and taking off.
Gustav Söderström (00:27:15): And I think it was the same using something like Dall-E or Stable Diffusion or Mid Journey the first time. It completely seemed like a magic trick. And, obviously, there is no magic, it’s just data and statistics. But I think getting to that point and iterating a product to the point where it feels like magic the first time is very helpful. And it’s often a question of just getting the performance to a certain level, scoping down, removing things. There’s a lot of fine-tuning, I think, that makes you cross that line from it’s cool and impressive but not magic to it feels like magic. I don’t understand how this could be done.
Yeah, it reminds me of the launch of GBT which ended up being the biggest, most fastest growing product in history, and it’s like the epitome of a magic trick. It feels like actual magic.
Gustav Söderström (00:28:09): Absolutely, absolutely. And to most people, it is still very … Actually to a lot of ascent, even to researchers, it’s a little bit magical. No one really understands fully. So I guess there’s maybe some magic left in the world.
Absolutely. And I think a lot of people are worried about not understanding what’s going on there. Shifting to the way you all build product at Spotify. So Spotify is famous for popularizing this idea of squads and tribes. And correct me if I’m wrong, but you guys have moved away from that approach.
Gustav Söderström (00:28:39): Yeah, that’s right.
Okay. So I’d love to understand just why you shifted and what you learned from that approach to building product, and then just like how do you organize the teams now? What do you do now?
Gustav Söderström (00:28:51): This was something that we focused a lot on early and it turned out to be smart of us to name these things into squads and chapters and so forth. It wasn’t really … Well, maybe it was deliberately branding, but it wasn’t for purposes of branding that we made it up. We made it up because we thought it was a good structure to use and we needed names for things and this was the early internet eras you were allowed to make things up. And so it was very good for where we were at the time and it certainly helped us in recruiting. It’s become a little bit of a cost to us because people still think that we organize that way and it’s not a very efficient way of being organized at this scale or maybe even if you started over right now because we’ve learned more.
Gustav Söderström (00:29:35): But I think the big difference is the idea with the squad specifically was twofold. They were supposed to be small and full stack. So squad should be about seven people and it should have front and backend, mobile, QA, agile coaches and so forth, and it should be very autonomous was the idea. And that’s really what we shifted. So, first of all, as you grow the company, scaling in increments of seven engineers just creates a ton of overhead. So, obviously, our teams now tend to be much bigger, maybe two, three times that at least per manager to maybe have 14 or something instead of seven and just less overhead roles. So that’s one. It looks more traditional as you learn more and is reasonable as you scale. The second big thing I think we struggle with was back then when I joined, the average age at Spotify was … I mean, I was the oldest and this was 14 years ago. So I think the average age was probably under 30 or something and it wasn’t most tech companies.
Gustav Söderström (00:30:46): And so we had coming from Sweden, which is a different culture than the US, and I love a lot of things about Swedish culture and I think we managed to keep the best parts, but Sweden is a very bottoms-up autonomous culture. There’s this famous drawing on how you make decisions in Sweden. In the US, I think it’s just a hierarchy. In Sweden, it’s a circle. It’s in a circle, no one is in the middle, there is no leader and so forth.
Lenny: Interesting.
Gustav Söderström (00:31:14): So I think by culture, we’re very inspired by this super autonomous thing. And I think the idea with autonomy is very reasonable and the right one, which is we work and we are hiring the smartest people we can find and we pay high salaries for that. So if you’re hiring smart people, one way to think about it is you’re renting brain power.
Gustav Söderström (00:31:39): So if you’re renting all of this expensive brain power and then you give them no room to think for themselves, that doesn’t sound smart, then you should actually hire less smart people and keep your costs down or something. So I think you have to give a bunch of autonomy to actually maximize the value of the investment you’re making. So that’s very reasonable that you would give a lot of space for people to use as much of their talent and capacity as possible. But the problem with that is if you put autonomy very far towards the leaves of the organization, and also if you combine that with having a very junior organization, which we did back then, there’s a fair chance that you’re just going to produce heat. You’re going to have a hundred squads with a hundred strategies running in a hundred directions. And Spotify has been there in that camp.
Gustav Söderström (00:32:29): I mean, we managed to get somewhere, for sure, in spite of this, but I’d struggle to say we were efficient in doing that. So we’ve done a few things. The team structure is more traditional, larger teams, less overhead. And we’ve been specifically working with where in the org do we put the autonomy because the extremes are at the leaves and we were there. The other extreme may be at the top, let’s say maybe some Twitter, there’s one person. Both have problems. If you have it at the leaves, you’re going to produce a lot of heat. If you have it at the top, you need someone with a lot of capacity and Elon has a lot of capacity, but you are, by definition, going to bottleneck. All decisions have to go through there. And Daniel, it’s not his personality that he even wants to make all the decisions.
Gustav Söderström (00:33:19): He wants to maximize throughput rather than to bottleneck the throughput. So the question is, if it’s not at the top and not at the very bottom, where do you put it? And what we’ve found, which I don’t think is very contrarian at all, I think this is the case in most companies, is around the VP level. So if you have Daniel, then you have the C level, myself and others, then you have the VP level, that is a good mix of instead of having one person in the company think, so only Daniel then and the rest just do, you have on the VP level in the company this many tens to maybe hundreds of people that have a lot of autonomy to think. So you get a good amount of freedom of thought and people thinking in different directions, but it’s not like 8,000 people. And these people on the VP level are both quite a lot of them, but they’re also usually quite senior. They have a lot of pattern recognition.
Gustav Söderström (00:34:16): So I think that solves for, it’s a good … If you think of it as an optimization problem, it’s a good optimization space. So the autonomy level in Spotify now tends to be quite high at the VP level and then lower around those levels.
And when you say autonomy, what does that actually mean? Is it the VP of, say, the podcasting product has a lot of say over what happens and there’s not a ton of … I don’t know how involved are people above? And I know Maya’s the VP of product, I believe, for the podcast product.
Gustav Söderström (00:34:48): Exactly.
Who I think is going to come on the podcast someday. What does that mean in terms of Tommy for her, what practically?
Gustav Söderström (00:34:54): So it means that I would ask Maya to define a strategy for what we do in podcasting, how are we going to be different, why would a podcaster want to be here? Whereas another company, I will make that strategy or another company, Daniel would make that strategy. Same with … The AI DJ, for example, came from our personalization team. And so that was a bet that they made. So they have autonomy to make those kinds of bets and define strategies. Same with the user interface, we have an experienced team, can talk about the org structure later, but I put a lot of autonomy on the VP of experience to define and suggest what it is that we want to do. And in other companies, I would define all of that myself, for example.
Just going even a little bit further here, I know you have just strong opinions on the way to organize teams and how the organization helps you optimize for specific things. What are your just thoughts along those lines and what have you learned about how the impact of organization and what you’re optimizing for?
Gustav Söderström (00:36:03): Yeah. So I talk about an idealized spectrum or maybe not idealized but exaggerated spectrum. Nothing is really true, but you create extremes to make a point. So on one spectrum, you have something like Amazon, which is known for two-pizza teams, no dependencies. You try to minimize dependencies so you can run in parallel. Teams compete with each other even on the same project and so forth. But they have direct access to the user.
Gustav Söderström (00:36:37): And so the benefit here is if you have an idea, the time to get the user is very low and it has worked for them. It’s produced Kindle, it produced Alexa, it’s produced a lot of very novel things. There are a few interesting downsides here. One downside that I’m extremely impressed with Jeff Bezos’ foreseeing is if you have teams that compete with each other, the incentives are to hide your results, hide your code. And that should make for an organization that gets no platform leverage because no one’s corporating. And I think either he had that insight or because he saw this, he had to do this, but he’s well known for pushing extremely hard on hard APIs. If you don’t create hard APIs to your technology, you’re out. And if you think about it, it has to be that way because otherwise no one would do it.
Lenny: And a hard API is essentially everyone knows how to use this API and connect to this team to interface with.
Gustav Söderström (00:37:38): Exactly. You have to expose your technology to others. You have to maintain those APIs and they have to be very structured because otherwise the whole thing would collapse as everyone’s supposed to compete because there are no incentives. You have to centrally force that. And interestingly, even though theoretically then they’re the worst position to have a structured platform, I think, because they forced it so hard, they were the ones who did Amazon Web Services because they had such hard defined APIs because of this rule that it was easier for them to turn it inside out and expose it the rest of the world. Whereas if you look at someone like Google, I think they struggled more with externalizing their APIs maybe because it is so friendly and soft. So they didn’t need as hard APIs on the inside because there was no competition. People could just go into each other’s code.
Gustav Söderström (00:38:18): So it’s interesting anecdote around it, but the main point is you’re faster there, but it’s going to be hard to corporate. And so you will see something like maybe exaggerating a bit. Sometimes you’ll see multiple search boxes on the same page from different teams. And this has been true in Spotify, by the way, as well. You’ve seen multiple toasters on the Now Playing view coming up from different teams because they’re working. When we were in the autonomous mode, everyone running. And then … So you get the benefit of speed, but you get the drawback of shipping your org chart and shipping complexity to the end user. But clearly, that’s been the right choice for Amazon because they’re a trillion-dollar company. But then on the other spectrum, you have something like Apple who’s also a trillion-dollar company. So clearly, both models work, where you would never see two search boxes from the same team popping up on an iPhone. That is centrally organized by something that is close to single individual.
Gustav Söderström (00:39:20): So they are instead in what is probably the world’s biggest largest functional org, they’re doing as much. If you think about what goes into the Apple, I mean, they certainly do everything we do. They have music service, podcast service, audiobooks, and they have a billion other services. So it’s not like they have an easier problem. And yet they build something that feels more like it was built by a single developer for a single user. So they centralize and they have this bottlenecking function that everything has to go through and be decided how it fits with everything else. And so that has the benefit of the user experience being simpler and not shipping the org chart and increasing complexity. But it also has the drawback of speed without having facts on it. I’ve heard people working at Apple have said, “Yeah, it took seven years to get that thing to market,” because you just had to wait in the pipeline.
Gustav Söderström (00:40:17): So you have these extremes. And I think the most interesting example, I think, to think about is when you double click the power button on an iPhone, the Apple Pay comes up. That decision, how did that happen? You can imagine that all the services team would like to pop up when you double click that button. And so someone had to decide, should music come up, should payments come up, should something else come up? And so they have a different structure there. And on that spectrum of centralized versus decentralized, because of our strategy, which is we’re a single application, trying to add or not trying to, we have added multiple types of content with actually very different business models on the backend, rev shares and royalties and book deals and so forth into single user experience. That is our strategy. We think the user experience in keeping that simple is the most important thing.
Gustav Söderström (00:41:12): So we’ve chosen more of the centralized model, where these different vertical businesses, if you think about it, the music business, podcast, audiobooks business, they have it to go through a single recommendation organization because that’s another problem. Which one do you recommend to which user? Should be a book or podcast or music? And how do you weigh them against each other? And also the user interface could easily get incredibly complicated if everyone built their own UI. The music team built their UI and then someone added features on top. So that’s how we chose to optimize. But it is based on our strategy and I think both models work.
Eco is working to make these points the most rewarding points ever. So it pays to be early. Sound too good to be true? Go to eco.com/lenny, sign up for an onboarding and find out why it isn’t. Lenny’s Podcast listeners who attend an Eco welcome session will get an exclusive 4% APY on deposits over $1,000. Learn more at eco.com/lenny, that’s E-C-O.com/lenny.
It’s interesting these two examples you gave, Apple and Amazon, they’re two of the biggest companies in the world and they’re like at the extremes of these two into the spectrum. And it’s interesting, most companies are somewhere in the middle. I wonder if there’s just a benefit to being in an extreme and that ends up being really important.
Gustav Söderström (00:43:21): I think so. In almost all industries, you have this smiling curve concept, where you want to be at the extremes on the smiling curve, and that’s where big business opportunities are but not in the middle. So it’s probably true in terms of organizational models as well.
Lenny: Speaking of extremes, I want to talk a bit about taking big bets. So you guys had this big launch event recently where you basically redesigned the whole primary feed of Spotify to make it feel more like where apps are going, like TikTok reels feel of just stream, and you start hearing videos and music starts playing and some people loved it, some people did not. And I’m curious as a product leader, how you think about thinking long term and dealing with people that are just like, “What the hell’s … I hate change, stop changing things.” How do you think about that? Who do you listen to? Who do you ignore? How do you know to stay the course? How do you approach that?
Gustav Söderström (00:44:10): Yeah, you’re being very kind. There was a lot of negative feedback on Twitter on some of that. So let me actually dig into some detail because I think for product people listening to this, this is an interesting lesson that I think few companies talk about because you want talk about everything that went exactly as you thought they would and you don’t want to talk about the things that didn’t go exactly as you thought they would.
Gustav Söderström (00:44:39): So I’ll go through what we are trying to achieve and what we learned. So Spotify is mainly a background application, and for a long time, we’ve been considered very good at background music and podcast recommendation. When the phone is in your pocket and you’re listening to an EDM playlist or pop playlist or something, we’re really good at inserting another EDM track there or another pop track there or something like that in the background.
Gustav Söderström (00:45:09): What we hear from users again and again, though, is that they say that they get trapped in a taste bubble. So I love my Spotify, I love this, but I’m a little bit bored with EDM now and Spotify’s not suggesting something completely new. And if you think about that problem, it may sound similar to the recommendation problem, it’s just another recommendation problem, but it’s actually fundamentally different because when you’re recommending another EDM track inside the EDM playlist, you have a lot of signal from that user that they like EDM. But if you’re going to recommend a completely new genre, by definition, you have no idea. Because if you had an idea, it wasn’t new to them. So you can’t know anything. So back to hit rate, your hit rate is going to be incredibly low when you suggest something completely new to the user.
Gustav Söderström (00:46:03): So this problem of helping people get out of the taste bubble isn’t that easy as it sounds. And we can’t really take some genre that maybe isn’t typical. So I’m a big fan of Reggaeton, for example. It’s not typically … It’s not that common in Sweden. And if you would look the rest of my profile, it’s EDM heaviness, you probably wouldn’t have guessed it. And Spotify wouldn’t have guessed it. So if I’m listening to my favorite EDM playlist in the background or maybe my metal playlist, metal is very big in Sweden, it’s really hard for us to just insert a Reggaeton track in the middle of that. Most people are going to think Spotify’s broken.
Yeah.
Gustav Söderström (00:46:41): What the hell are they thinking? So that doesn’t really work. So in order to help people break out of their taste bubbles, you need something different. You need something where your hit ratio can be very low and you need people to expect it to be very low.
Gustav Söderström (00:47:00): So when we recommend things in the background, our hit ratio needs to be at least nine out of 10, maybe one dud is okay, but if you get five duds, you’re going to think we broke your playlist and your session. We need something where one out of 10 is a success. If you find one gem out of 10 tries, you’re very happy. So you need a completely different paradigm. And you also need to be able to go through many candidates quickly because the hit rate is so low. You can’t take three minutes per item. It’s like, “Okay, I didn’t like this,” and it’s still like two minutes left before the next one comes on. You need to quickly say, “No, no, no.” So the obvious candidates for this are these feed-type experience, where you can go through lots of content, you’re expecting the hit ratio to be much lower. And if you don’t like it, the cost is very low, you just swipe.
Gustav Söderström (00:47:50): And then this is the reason why people have been … When they want to break out of their taste bubbles or when they come into Spotify and listen to something completely new, it is usually because they found it on one of these services, like a TikTok or YouTube or something, where they get exposed to lots of new content. So people were asking us for these tools and so that’s what we wanted to solve for. And so we built a bunch of features, feed-like structures, where you can go through either a new genre with many tracks or a podcast channel with genre with many episodes or even full playlists. And we implemented those and we put them in something called subfeeds. So in the current experience, and this is roll out worldwide, if you click the podcast subfeed, you get a feed of podcast episodes. Click the music subfeeds, you get a feed of playlists where you can go through many playlists. And if you don’t understand the name, you can quickly hear what they sound like and check out a few tracks and understand if this is for you. And if you go through the search and browse page, you can find completely new genres that you can quickly go through.
Gustav Söderström (00:48:59): And so those are working as we intended. People go to them when they want to find new music. They browse through them and they save new songs. So they’re working as we intended. The thing that didn’t work as we intended was when users asked us for this again and again, we took the sum of these things and we put it on Home because people ask so much about discovery and we can see clearly how correlated discoveries with retention on Spotify and so forth. But what we misjudged or failed or rather learned about our own homepage is that the way it works right now, and this is what you can see in the Twitter comment, if you remove the angry voices and try to see what they’re saying, they’re saying the following, which is actually quite clear in the quantitative data as well, that if you look at what people do on Spotify’s homepage, the current one, it is almost 90% what we call recall.
Gustav Söderström (00:49:59): So it is either getting to a session that you’re already in or a specific playlist that you know you want to get to or at least a specific use case. So you come in with a high intent, you actually knew what you wanted, and maybe only 10% of the time as a true discovery, like I don’t know what I want. So if you think about, that is 90% recall and 10% discovery. When we tested the design … So the subfeeds were working and not working, but when we tested some of them on Home, we switched it from 90/10 to 10/90. So 10% recall, 90% discovery. And while people want discovery, they probably don’t want 90% discovery, instead of 90% recall. So if you then look at the comments on Twitter, what they’re saying is like, “Hey, I can’t find my playlists anymore. Where are these things?”
Gustav Söderström (00:50:47): They’re not really complaining about the discovery, they’re complaining about the things they don’t get anymore. And we can see this in the quant data as well. And you can see traffic shifting from home into search and into library, which is a clear sign people are trying to find the things they can’t find anymore. And you can even see people then trying to use these discovery tools which are optimized for quickly understanding new things to do the recall. Where’s that workout playlist I know I want? And it’s actually very bad UI for recall, it’s like a slot machine, right? Very unpredictable if you ever get to that workout playlist. It was optimized for finding new things, not for recall of existing things. When you do recall, you want the dense UI with many items on screen because you know what it is you’re looking for. So you don’t need a lot of real estate when you’re doing discovery of new things. You want a lot of pixels and you probably want sound because you don’t know what it is.
Gustav Söderström (00:51:40): So what we learn about our UI, and I think there’s maybe a little bit of product jealousy here, you always look at other experiences. And if you look around, it could be forgiven for thinking that most other products, if you look at something like YouTube, for example, their homepage is exactly that. It’s a huge single-item discovery feed with only new items. And people don’t seem to tweet angrily about how angry they are at you to say they love YouTube and it’s a big product. And I think what we discovered was that we actually did something really well on our homepage, which was supporting you being inside a multiple sessions at the same time. So you could be in the middle of two podcasts and an audiobook and also them actually I just want to get to that workout playlist. I don’t remember the name of it, but I know it’s workout.
Gustav Söderström (00:52:31): We actually did that part really well. I would venture just say much better than the other experiences where you literally have to go to some tab and into library and start browsing to get back to where you were. And so maybe it’s path dependent. Because we have done recall pretty well, people got, I think, reasonably upset when they couldn’t do the recall anymore. And we didn’t want lose that because it was one of the things we did well and underestimated. And my takeaway is actually we do it better than other experiences. So we certainly want to keep that. So what we did was now we’re just updating their hypothesis to achieve the same goal, which is these things are working and when people want to discover, they use them and they seem to work, they can also get better.
Gustav Söderström (00:53:20): You’re on this hill-climbing journey from a machine learning point of view, but the question is, how do you make sure that whenever people feel that they are in that I’m trapped in my taste bubble, they understand that these things are there and they’re easy to use? So now we have a version of Home that we are also testing, obviously, where these things are very available but voluntary and you can still do all of the recall. And so from my point of view, this is the reason we A/B test because you want to be scientific about it and you want to learn as much as possible about your own product and your users. And now I’m sharing a lot of the learnings. Maybe we should keep them to ourselves, but my hunch is that it’s going to make it a much better product.
Gustav Söderström (00:54:09): But what I told my teams when we went into this, because I’ve done this a few times, agree to signing, I think there are two fundamentally different types of product development. One is designing a new feature. It is hard, but it’s voluntary for people to use. So you do the AI DJ. Some people love it, that’s fine. If you don’t like it, it didn’t make it worse for you. But when you redesign, it is much more tricky because it’s not voluntary to participate in the redesign. So there’s a cost even for people who don’t like it. Then you have a very tricky problem here, which is there are going to be two types of feedback. One is you did something and it was right, but people are upset because you changed stuff. The other is you did something and it wasn’t right, and people are also upset but for good reasons.
Gustav Söderström (00:55:08): And so how do you separate these two? Because I think I explained this to … When we talk through this with my teams, I think the analogy to think about is you have your desktop, your physical desktop, you have your computer in one place, you have your pencil over here, you have your notebook over there, and I come in and I just rearrange all of it. And you have spent, in our case, maybe 12 years with that setup. It doesn’t matter if I have a lot of quantitative data that my new setup is better, you’re going to get upset because you are effective in this old setup. And it’s hard to tell those apart. The most classic use case is the Facebook newsfeed, which people are very upset about when it became a single newsfeed. But it turned out to solve a lot of user problems that you didn’t have to run around all of Facebook collecting events yourself.
Gustav Söderström (00:55:58): So there are some ways of understanding if you made it better, but people’s habits are broken or if it’s not better. And one thing is, for example, to look at new user cohorts that don’t have that behavior versus all user cohorts and so forth. So we went through all of this with the teams. Before we did it, I said, “This is going to be painful.” There’s probably going to be a lot of tweets because chances that we get it exactly right are very low. So for that reason, it hasn’t been very hard on the team. It is hard … You want to respond to people, but the right way to do it is to listen, understand, try new hypothesis to really figure out what’s going on. So I think I’ve done it maybe three or four times now. Three maybe. One unsuccessful, two successfully. So kind of knew what I was getting into.
Gustav Söderström (00:56:45): So it’s almost like you punish yourself, very painful, but also the most exciting things. And I think any product person knows that the easiest, the most straightforward thing to do is to iterate around where you are. There’s no risk. You’re not going to get fired, no user is going to get angry. But everyone also knows that eventually if you don’t adapt new technologies, new paradigms, et cetera, you’re going to get replaced. You have to find this balance of trying new things. And when you work in software, you have this tool of A/B testing and being scientific about it. When you build hardware, it’s worse. If you’re wrong, you’re wrong. You can’t update.
Lenny: I love this story. I so appreciate you sharing it. I imagine also with a big launch like this, you can’t actually A/B test it ahead of time because of the press season. They’re like, “Oh my God, look what Spotify’s doing.” And so you’re limited there. Imagine, right? You couldn’t really test this ahead of time.
Gustav Söderström (00:57:40): The hardest thing about this is if you’re trying something completely new, the MVP needs to be very big so you can build a new IU, but if you didn’t do algorithms for single item feed, you can’t tell if it was the right idea but poor machine learning, right? UI poor machine learning. Or you have to build a lot and that gets quite expensive. That’s actually … The biggest why it’s painful is not really the feedback from the outside. It is the cost you have to take on the inside. You incur a lot of costs as you’re really hoping you’re right.
Gustav Söderström (00:58:15): And in our cases, the changes on the homepage aren’t that hard for us to do. The important thing is that the underlying hypothesis of, can we help you break out of your taste bubble actually works and then you update the acquisition funnels into that experience. But I think the problem is that you need to get so many things in place to be able to say, “You might get a false negative,” just because you didn’t do it well or not. That’s the biggest challenge, I think, with these big rewrites where everyone has to update everything before you can know if you’re right or wrong.
Lenny: What was that process like of helping you understand what is not working and what is working and what you wanted to change? I imagine there’s a bunch of data you’re looking at, some tweets, things like that. What was the tactical, “Oh, shoot, something’s not going the way we expected, here’s what we should do?”
Gustav Söderström (00:59:08): Well, the feeds, we tested, but the home feed, we rolled out and tested afterwards. And we tested out on users, a few different variants of it. And then we got the data back and we looked more at the quantitative data. And we do a lot of user research where people sit and use the feeds to understand and build our own theorem mind of what is working and what is not working. And then, obviously, you look at user feedback, of course, and some users are very good at expressing what is of it that isn’t working, others are not as good as expressing what isn’t working. So it can be hard to parse that, but certainly, that’s a factor as well.
Gustav Söderström (00:59:49): And so then once you do that, then you have quantitative data to look at. And then you sit in recent through, what do you think is right and wrong? What are the different hypotheses? What is working, what is not working? And then just update and test again and again until you prove or disprove your hypothesis. Trying to be as scientific as possible about it. And also I think the biggest risk also when you’ve invested so much time in something is getting precious about things. You have to just be brutal. You have to believe in things 100% until the data says no and then you believe in something else 100%. That sounds easy. It’s very hard to do, to the extent that people get upset when you do it because, for some reason, people don’t like when people change their minds. It is what we should want from everyone. I would love a politician who said, “I’d looked at the data and I realized actually this is right and now I believe this.” But we hate politicians that do that. They feel untrustworthy and we ridicule them.
Gustav Söderström (01:00:56): So I think that’s the biggest risk with anyone. You just have to be unemotional and just look at the proof and the data. And then if you do that, you just move on and then you get to where you want to be, and you solve the same problem but you adapt.
I really like that philosophy. Essentially, it’s the idea of strong opinions loosely held. Is that-
Gustav Söderström (01:01:19): Exactly. Exactly what it is. And it sounds so easy, but it’s hard.
Right? Because to your point, people don’t respect someone changing their mind. They’re like, “Oh, I see, they were wrong the whole time and they were so confident about being wrong.”
Gustav Söderström (01:01:31): Yeah, exactly. And it’s unclear why it is what we should want, but I think it has something to do with human psychology. We actually tend to love profits and people who hold very strong opinions with very little data. Those are the people we like. People will look at a lot of data and actually that, we don’t like. Not sure why.
We’re flawed creatures.
Gustav Söderström (01:01:59): For sure.
Is there something that you’ve recently changed your mind about along these same lines that maybe comes to mind of like, “Oh, yeah?”
Gustav Söderström (01:02:06): No, I think these learnings about the science system and homepage does really well, maybe better than others, that we don’t want to wash out with a bath water or whatever the [inaudible 01:02:20] expression is. I think that’s the biggest current learning I’m actually very happy about.
Yeah, I love learning that we’re doing some really well that we didn’t really realize necessarily and maybe we should lead into that more.
Gustav Söderström (01:02:34): Exactly.
Going in a somewhat different direction. Shishir Mehrotra suggested to ask you something. He’s on your board, I believe.
Gustav Söderström (01:02:41): Yes.
And he suggested to ask you about your 10% planning time. What is that about?
Gustav Söderström (01:02:46): This is a concept that I think Shishir has used for a long time ever since he worked at YouTube. And the idea is that, roughly, you shouldn’t be spending more than 10% of your time planning versus executing or building, which means that if you’re working quarterly 10 weeks, you should spend one week planning as we work in six-month increment. So we try to spend two weeks planning and roughly successful. And this is … Actually, when we talk about org models, give a shout-out to Brian Chesky at Airbnb, who is actually one of the first, I think, to have these more contrarian old models. He’s much more applesque than most of Silicon Valley. He also works in six-month increments. He has a lot of experience in that as well. So that’s what the 10% planning time is. And I think if you find yourself planning much more than that, you’re either planning too much or your execution period is just too short for that amount of planning. It’s a rule of thumb, but I find that it works.
I asked a few PMs what I should ask you, PMs that work at Spotify actually, that I haven’t told you. And someone pointed out that you always bring a lot of energy and clarity to a room. That’s something they see you as really strong at. What have you learned about just the importance of that or just how to do that well as a leader?
Gustav Söderström (01:04:11): Well, that’s great to hear. I didn’t know that so I’m trying to figure out what to answer. I think that the energy, I don’t know. I guess I’m just excited about what I do. I’ve always been excited about technology. I love seeing new things. My core drive is still this notion of you see something which I think you’ll empathize with that doesn’t exist yet. And you’re like, “Wow, I wonder if that could exist. That would be so cool.” And then in order to get people to do it, you try to share that excitement. So I don’t think I can bring a lot of energy for something I’m not excited about. So I have to work on things I actually believe in and that I’m excited about. And so maybe then the energy comes more naturally. Unfortunately, for me, so far, Spotify has been in this phase where a lot of innovation is allowed and I’m even asked to try to do new cool things.
Gustav Söderström (01:05:09): Maybe I would have less energy for a pure optimization phase. On the clarity, I’ve always liked trying to explain things. It’s a well-known fact that the best way to understand something is to try to explain to someone else. So I go around explaining things to people who didn’t ask for it and not to sound smart, but to see if I actually understood it. And so maybe it’s that practice. And on that note, I actually do ask my leaders that work for me and I ask them to ask their leaders to always explain themselves. And I think when … We talked a little bit about autonomy and so forth, we don’t promise everyone that they have to agree, but I think the promise we should make to all employees is that even if they don’t agree, they should be entitled to understand why you’re making the decision.
Gustav Söderström (01:06:06): What I don’t think is acceptable is to say, “No, we’re going to do it this way because I’m more senior. I’ve seen this a bunch of times. You are not smart enough.” All of those things. I think you have to explain yourself so you owe an explanation. And I find that valuable back to the only way to understand something is to explain it because it usually turns out that if you can’t explain it yourself, you probably don’t really even understand it yourself. Sometimes I think it’s possible that you can have product instincts that are good but you can’t express them. But most of them, when people say there’s something there but they can’t explain it, they actually don’t understand it themselves. And many times, there actually isn’t anything there. And also if you can explain it as a product person, that knowledge is now shared. So it just becomes much more effective for the organization. So sometimes I try to provoke people a little bit and say … When people ask how much is art versus science, I say, “It’s 0% art, 0% magic, and 100% science.” And that’s because I want to force people to try to explain it. I think we use the word art and magic. We have historically used the word art and magic for anything that we couldn’t yet explain.
Gustav Söderström (01:07:33): Genetics was magic and art until it was science. And quantum physics was magic until it was science. And most recently, actually, intelligence and creativity was art and magic until it was statistics in an LLM. So I think I try to push people to say, “Are you sure you can explain this?” because that forces people to think through. So maybe I like it and I try to force it on people. So maybe that’s why people think I sometimes bring clarity.
Lenny: I love that. Question along those lines, is there a system or an approach to explaining that you recommend? Is it just write it out in a document? Is it explaining in a certain style or is it just however is natural to the person?
Gustav Söderström (01:08:20): I used to write everything and then write and rewrite and make it more and more condensed. So that worked for me. I don’t write as much anymore. Now, I tend to walk and talk in my head myself. What I actually do is I … And I found this different for different people and a lot of people want to bounce something with someone else, that’s how they think. You repeat the same thing again and again and you get some feedback on it. And so I used to write a lot. I sometimes do when it’s an idea I want to understand better. And at some point in my life, I would love to write something real like a book or something. But what I do increasingly now is I do my one-on-ones with peers or people who report to me or something, and I just put on AirPods and do a distributed walk and talk.
Gustav Söderström (01:09:12): Both people are walking but in different locations and you spend an hour discussing something. That has actually turned out to be very, very fruitful. So then you get the power of you’re not alone so you get more brain power than your own. And I don’t think there is strong evolutionary proof for this, but there’s certainly indications that you’re thinking better when you’re walking, whether it’s because you’re oxygenating your brain or because it’s evolutionary for some other reason, I’m not sure. But I found that walking, talking, and thinking actually even if you’re not in person, just over AirPods, it’s super effective. It was the pandemic that forced us. I thought we would get less creative and strategizing will suffer during the pandemic and I found the opposite. We had more of this than ever and I started thinking about why, and I think it’s all of these walk and talks that we did.
You threw out there that you want to write a book someday. What do you think your book would be about?
Gustav Söderström (01:10:11): I have no idea. I have no idea. Statistically, it’s probably going to be about something that I did a lot, so it has to be about something with technology or product or something. But I would love to write something fictional. That’d be a lot of fun.
Oh, boy. I’ll pre-order as soon as that’s up. Another concept I wanted to touch on that another PM suggested, which is he called it the P in the pants analogy. Does that ring a bell? And is that interesting to talk about?
Gustav Söderström (01:10:40): I don’t know exactly which occasion this person is referring to, but I know I’ve used that analogy a few times.
Okay. Promising.
Gustav Söderström (01:10:50): I don’t know if it’s like a Swedish analogy because I thought it was more widely known. But the idea is that you do something … So the saying is that’s like peeing in your pants in cold weather. It feels really warm and nice to begin with. And then after a while, you start to regret it. It’s about being short term, basically. So now I just say that’s like peeing in the pants inside because people know what I mean. It’s a short-term thing.
That’s a hilarious way of communicating that idea. Must be a Swedish thing.
Gustav Söderström (01:11:25): Yes, I think Swedish people do it for some reason, apparently others don’t.
Maybe because it’s cold a lot of times of the year.
Gustav Söderström (01:11:33): Yes. That’s probably it. This is a saying in cold climate. In the warm, it doesn’t help. No one understands what you mean.
Speaking of Sweden, do you watch Succession?
Gustav Söderström (01:11:42): Yes, I do.
Okay. So Sweden’s become a big part of the show, specifically the company trying to … I guess I don’t want to spoil, but there’s a character that’s really important. Yes, exactly. That is Swedish. And so I’m curious just what do you think of the way they portray the Swedish culture and Swedish business dealings?
Gustav Söderström (01:11:59): It’s super fun to see this as a Sweden. And I guess, first and foremost, like anyone or any person or any country that gets represented by super tall, well-built, great looking Alexander Skarsgård should probably be pretty happy. So that’s good. Then I think there’s this episode where they are in Norway, without giving away too much.
Yep.
Gustav Söderström (01:12:26): There are elements that are authentic. There’s a lot of, I think, paid brand positioning from a Swedish brand named Fjällräven, which I think means arctic fox, which is actually a very popular outdoor brand in Sweden. So that’s authentic. The sauna things and so forth are authentic. So it’s real, but it’s exaggerated. Actually, the thing that isn’t very authentic is his negotiation style. Swedish people tend to be serious, cautious, and this guy’s more of a player. So he’s not the typical Swedish businessman from a negotiation tactic point of view, I think.
Yeah, it doesn’t make me think of the way you described it where in Sweden, people sit in a circle and no one’s in the center.
Gustav Söderström (01:13:15): No, exactly. He’s very much in the center.
And then when people go saunas, there are just like a chant, sauna, sauna [inaudible 01:13:23].
Gustav Söderström (01:13:23): Exactly.
The last episode.
Gustav Söderström (01:13:25): It’s a great show. I love it.
I love it. This season is insane. I’m so curious where it all goes. Maybe just the last question before very exciting lightning round. Spotify is, at this point, the biggest podcasting platform for me specifically and I think globally, and I love using it. It works great. I’m curious just what’s next for Spotify and specifically Spotify podcasting.
Gustav Söderström (01:13:48): There are two sides to it. It’s for Spotify creators and for Spotify listeners. For Spotify creators, there are two things. One is, and this is what we talked about at Stream On, we talked about it also for music discovery, but it’s the same problem and even harder for podcast. So we’re still focused very heavily on helping podcast creators find more audience. This is … Like I said, it’s even a bigger problem to break out of your habits and your bubbles in podcasting. Such a big investment to find a new podcast. And so that is something, I think, we could and should do really well. So we keep investing a lot there. And as I said, you’ll see more as we roll up more features now.
Gustav Söderström (01:14:37): The other big need for creators is monetization and you can monetize today in many ways with DEI and Spotify SEI and so forth. But we’re working hard to expand that and make it better because the industry is starting to mature and I think this is one of the biggest needs and the biggest things we could do for creators to help them monetize better, actually both free and paid. We also have paid podcast. So that’s on the creator side.
Gustav Söderström (01:15:07): On the consumer side, I don’t want to share too much. We’ve shown that we’re investing a lot in discovery. I want to keep some secrets for when they roll out, but we are investing a lot in the user expense itself. I think it’s far from optimal yet what it could be. One thing that I can share that we’re investing a lot in is just the ubiquity and playback across different devices and in cars and all these things that we’ve done well for music. But I think the listening experience can get a lot more seamless. I think search can get better. The data about podcasts and … Well, I don’t want to say too much, but looking at AI and generative technology, there is a lot that can be done.
Lenny: All right. Well, I’ll take what I can get. With that, we’ve reached our very exciting lightning round. I’ve got six questions for you, Gustav. Are you ready?
Gustav Söderström (01:16:01): I think I am. Let’s do it.
Okay, we’ll find out. What are two or three books that you’ve recommended most to other people?
Gustav Söderström (01:16:08): Okay. This is why I try to squeeze in seven into two and three. So if we start on product, I think it’s well known, but one that I would recommend product people to read is 7 Powers by Hamilton Helmer, and Netflix has used a lot. We use a lot. It’s just if you’re starting out, it’s great to have a strategy framework. No strategy framework is right, but having one is better than none.
Gustav Söderström (01:16:31): Another in the space of mental models and frameworks, I think, is The Complete Investor by Charlie Munger. So, yes, it’s about investment, but really it’s a bunch of mental models that he uses. And I think the key takeaway is you have a problem, you should always apply three different models to it because what models do is they simplify and reduce dimensionality. The world has probably infinite dimensions and they reduces to maybe three or four. And the risk with that is you happen to get rid of a really important dimension, maybe pandemic diseases or something. But if you use three models that have different dimensions and was reduced in different ways, statistically, and it comes to the same conclusion, even the second model you apply vastly increases your chances that you’re right. So that was a good book to read.
Gustav Söderström (01:17:24): Then I think if we go outside of product, I’m very interested in just science and mathematics. So a few quick ones. The Mystery of the Aleph, an amazing book. Something Deeply Hidden by Sean Carroll on the interpretation of quantum mechanics. Helgoland by Carlo Rovelli on the relational interpretation of quantum mechanics. The Beginning of Infinity and The Fabric of Reality by David Deutch. The Case Against Reality by Donald Hoffman on the evolution versus truth and that evolution doesn’t optimize for seeing the truth, just for fitness. Gödel’s Proof, I think, is an amazing book on his incompleteness theorem, that in any axiomatic systems, there will be true statements that can never be proven, which is a weird thing to think about. And then maybe one of my favorites is The Demon in the Machine by Paul Davies that, I think, is lesser known on how information is really just entropy and this concept of information engines, that you can power something by just information and exhaust is also information. That was not a quick list.
No, I was just going to say you’ve set the record for the most number of books, but it also shows how you’ve become so insightful and wise just reading books like these. And so I think if people are looking to get to a place that you’re at now, I think there’s the lesson.
Gustav Söderström (01:18:55): I’ll keep the artist much shorter, I promise.
It’s all good. We got time. Okay, next question. What’s a favorite recent movie or TV show?
Gustav Söderström (01:19:03): So we talked about Succession and it is a recent favorite. So I’ll just frivolously take something that isn’t recent but is an absolute favorite, which is Halt and Catch Fire, which I think is on FX. Amazing show. If you ever worked in technology, kind of starts out in the Silicon Prairie in the ’80s and follows up to present day. Amazing show.
Halt and Catch Fire. Yeah, I watched some of it. I actually fell off of it, but it’s a good reminder to go check it out.
Gustav Söderström (01:19:29): Got to go back.
I’m going to go back. What’s a favorite recent interview question you like to ask?
Gustav Söderström (01:19:34): I don’t ask it, but my favorite question is Lex Fridman’s small ending question that is usually something like, so what’s the meaning of it all? I like that. It’s a tough question to get.
I’m so tempted to ask you, but-
Gustav Söderström (01:19:48): No, don’t.
Okay. Let’s move on. That’ll be another … That’ll be our second take at this.
Gustav Söderström (01:19:53): Yes.
What are some favorite products you’ve recently discovered that you love?
Gustav Söderström (01:19:58): The obvious one is ChatGPT GPT-4 and just playing around with that, trying to create bots for yourself that do different things for you and so forth. But I don’t think that’s probably true for everyone. The other really favorite is something you’ve written about and talked about, which is Duolingo, which I think is both very impressive from a product point of view, the execution and what they’ve done. It is also insanely used in my family. We have a family account and everyone is using it and competing every day. So I’m both impressed by the product and I also use the product quite a lot.
What languages are folks learning within your family?
Gustav Söderström (01:20:38): In my family, it’s Spanish right now.
How’s it going?
Gustav Söderström (01:20:41): Bien.
You get a gold star.
Gustav Söderström (01:20:47): I only have a few thousand XP. I’m not that good yet.
No, I don’t know if that’s good. That sounds pretty good. Next question, what’s something relatively minor you’ve changed in your product development process that’s had a tremendous impact on your team’s ability to execute?
Gustav Söderström (01:21:01): I’m not sure I’ve done anything minor that had a tremendous impact. Usually, it takes something bigger to get a big impact. I think maybe one thing that I’ve tried to do back to clarity and so forth is this thing I mentioned about I’m trying to push a lot for what I call Socratic debate, where the idea is obviously that the best idea wins, not the most senior idea and so forth. And trying to push for this notion of having people explain themselves, not saying I think there’s something there or have a feeling or something like that. And apparently, as you said, that has had some impact because people apparently say that about me. So that’s probably the biggest thing.
Lenny: Final question, what is one fun ritual of the Spotify product team, and is it saunas?
Gustav Söderström (01:21:59): So Spotify is so big now that it’s quite local actually, different parts of Spotify, different product rituals. I accidentally created one ritual many years ago, maybe 12 years ago, when we talked about which phase a product is in. And it was … We needed some definition. So I think off the cuff, I said, “Well, it’s four phases.” It’s think it, build it, ship it, tweak it.” And then the think it phase, it should be cheap, not a lot of money spent. In the build it phase, you’re going to start spending a lot of money. So then you must have reduced the risk in the think it phase that you’re right. And then you have the ship it phase and then you go over and tweak it. And it was something that wasn’t that thought through, but it’s funny because I still hear it sometimes even from other companies like, “Oh, we’re in the think it phase,” or “We’re in the tweak it phase.” So it stuck. I don’t know if it’s very good, but it’s stuck.
It is catchy. I think anything getting stuck in people’s head is a success. Gustav, thank you so much for being here. We are two for two for Swedish people. Gustaf, with an F, Alströmer was on the podcast.
Gustav Söderström (01:23:10): Who is also an amazing person.
Also an amazing person. I feel very jealous of people that get to work with you and for you. Thank you again for being here. Two final questions, where can folks find you online if they want to learn more, maybe reach out, ask some questions.
Gustav Söderström (01:23:23): [inaudible 01:23:23] @GustavS.
Okay. Say it again
Gustav Söderström (01:23:28): @GustavS.
Awesome. And then final question is just how can listeners be useful to you?
Gustav Söderström (01:23:33): Just reach out. I do read feedback and I try to remove the angry comments and understand what they’re actually thinking and why they’re upset or what’s not working.
And then the reaching out, would you recommend an angry tweet at you or more of a email to that email address you shared?
Gustav Söderström (01:23:50): Well, the @GustavS is the Twitter handle, so just tweet at me.
Okay.
Gustav Söderström (01:23:56): You can be nice as well.
Okay.
Gustav Söderström (01:23:57): It’s okay.
Amazing. Gustav, thank you so much for being here.
Gustav Söderström (01:24:01): Thank you for having me, Lenny. It’s been a pleasure.
Bye, everyone.
Thank you so much for listening. If you found this valuable, you can subscribe to the show on Apple Podcast, Spotify, or your favorite podcast app. Also, please consider giving us a rating or leaving a review as that really helps other listeners find the podcast. You can find all past episodes or learn more about the show at lennyspodcast.com. See you in the next episode.
Glossary
| English | 中文 |
|---|---|
| acquisition funnels | 获客漏斗 |
| Alex Nordström | 保留原文(Spotify 联合总裁) |
| Avicii | 艾维奇(瑞典 DJ、音乐制作人) |
| Brian Chesky | 保留原文(Airbnb CEO 兼联合创始人) |
| Charlie Munger | 保留原文(投资家、Berkshire Hathaway 副董事长) |
| ChatGPT | 保留原文(原文转录为 GBT,实指 ChatGPT) |
| Chris Dixon | 保留原文(科技投资人、a16z 合伙人) |
| contrarian | 反传统的 |
| Dall-E | 保留原文(AI 图像生成工具) |
| Daniel Ek | 保留原文(Spotify CEO 兼联合创始人) |
| DAW (digital audio workstation) | 数字音频工作站 |
| dead clicks | 无效点击 |
| diffusion models | 扩散模型 |
| Discord | 保留原文(通讯平台) |
| discovery | 发现(与召回相对,指推荐用户未知的新内容) |
| Drake | 德雷克(加拿大说唱歌手) |
| dud | 次品(指推荐中不符合用户期望的曲目) |
| Duolingo | 保留原文(语言学习应用) |
| EDGE networks | EDGE 网络 |
| EDM (electronic dance music) | 电子舞曲 |
| Eppo | 保留原文 |
| escape hatch | 退出通道 |
| false negative | 假阴性 |
| fault-tolerant user interfaces | 容错用户界面 |
| Gustav Söderström | 保留原文(Spotify 首席研发官) |
| Halt and Catch Fire | 保留原文(FX 电视剧) |
| Hamilton Helmer | 保留原文(投资 strategist、《7 Powers》作者) |
| hard APIs | 硬 API(强制要求团队暴露和结构化维护的技术接口) |
| hit rate | 命中率 |
| Jeff Bezos | 保留原文(Amazon 创始人) |
| Lenny | 保留原文(播客主持人) |
| Lex Fridman | 保留原文(播客主持人) |
| Microsoft Clarity | 保留原文 |
| Midjourney | 保留原文(AI 图像生成工具) |
| MVP (minimum viable product) | 最小可行产品 |
| north star metrics | 北极星指标 |
| rage clicks | 愤怒点击 |
| recall | 召回(推荐系统中指用户已知内容的推荐/回访) |
| Reggaeton | 雷鬼顿(拉丁音乐流派) |
| Shishir Mehrotra | 保留原文(Spotify 董事会成员,曾任 YouTube 高管) |
| smiling curve | 微笑曲线 |
| squads | 小队(Spotify 曾采用的团队组织模式) |
| Stable Diffusion | 保留原文(AI 图像生成模型) |
| Stream On | 保留原文(Spotify 年度发布活动) |
| strong opinions loosely held | 坚定观点,松散持有 |
| subfeeds | 子信息流 |
| Succession | 《继承之战》(HBO 剧集) |
| taste bubble | 品味泡沫 |
| The Weeknd | 威肯(加拿大歌手) |
| tribes | 部落(Spotify 曾采用的团队组织模式) |
| two-pizza teams | 两个披萨团队(Amazon 提倡的小型团队理念) |
| zero intent use case | 零意图用例 |
Reformatted by reformat_english.py
产品科学、重大押注,以及 AI 如何影响音乐的未来 | Gustav Söderström
文字记录
Gustav Söderström (00:00:00): 互联网始于策展,通常是由用户来策展。你把某些东西——一些优质的东西,比如人、书籍或音乐——数字化后放到网上,然后让用户去策展。这就是你的 Facebook、Spotify 等等。之后,世界从策展转向了推荐——不再由人来做这件事,而是由算法来完成。那是一个巨大的变化,要求我们和其他公司重新思考整个用户体验,有时还包括商业模式。而我认为我们现在正在进入的阶段,是从策展到推荐再到生成的转变。我猜测这将会是一场同样巨大的变革,最终你将不得不重新思考你的产品。我们不得不为以推荐为先的时代重新设计用户界面和体验。那么在生成时代,这意味着什么?目前还没有人真正知道。
Lenny (00:00:47): 欢迎来到 Lenny 的播客,在这里我采访世界级的产品领导者和增长专家,从他们打造和增长当今最成功产品的宝贵经验中学习。今天我的嘉宾是 Gustav Söderström。Gustav 是一位产品界的传奇人物,他现在是 Spotify 的联合总裁、首席产品官兼首席技术官,负责 Spotify 的全球产品和技术战略,并 overseeing 公司的产品设计、数据和工程团队。自从我推出这档播客的第一天起,Gustav 就在我梦想嘉宾的愿望清单上,我非常高兴我们终于实现了。
Lenny (00:01:19): 在我们的对话中,我们深入探讨了 Gustav 关于做出重大押注的心得,以及当押注不成功时该怎么办,Spotify 是如何告别小队(squads)模式以及他们现在如何组建团队,AI 已经如何影响他们的产品,以及 AI 生成音乐的未来。此外,为什么所有伟大的产品都需要变一个魔术,剧集《继承之战》(Succession)对瑞典商业文化的还原度有多高,以及他那令人捧腹的”尿裤子”比喻。在短暂赞助商介绍之后,请欣赏这期与 Gustav Söderström 的节目。
Lenny (00:01:51): 本期节目由 Microsoft Clarity 赞助。Clarity 是一个免费且易用的工具,能够捕捉真实用户实际使用你网站的方式。你可以观看实时会话回放,发现用户在哪些流程上畅行无阻,在哪些地方遇到困难。你可以查看即时热力图,看到用户在页面上的哪些部分有互动,哪些内容被忽视。你还可以通过非常酷的挫败感指标来精确定位用户的困扰,比如愤怒点击(rage clicks)、无效点击(dead clicks)等等。如果你听这档播客,就会知道我们多频繁地谈论了解用户的重要性——通过观察用户真实体验你产品的方式,你可以发现产品机会、转化提升点,并找到你想象中人们使用产品的方式与他们实际使用方式之间的巨大差距。
Lenny (00:02:33): Microsoft Clarity 用一组简单却异常强大的功能让这一切成为可能。你会惊叹于 Clarity 使用起来有多简单,而且完全免费、永久免费。你永远不会遇到流量限制,也不会被迫升级到付费版本。它同时支持应用和网站。别再猜测了,获取 Clarity。请访问 clarity.microsoft.com 了解 Clarity。
Lenny (00:02:59): 本期节目由 Eppo 赞助。Eppo 是一个新一代 A/B 测试平台,由前 Airbnb 员工为现代增长团队打造。DraftKings、Zapier、ClickUp、Twitch 和 Cameo 等公司都依赖 Eppo 来驱动他们的实验。无论你在哪里工作,运行实验都越来越不可或缺,但目前没有商业工具能与现代增长团队的技术栈集成。这导致你要么浪费时间搭建内部工具,要么试图通过笨拙的营销工具来运行自己的实验。当我在 Airbnb 工作时,我最喜欢的事情之一就是我们的实验平台——我可以按设备类型、国家、用户阶段对数据进行切片和钻取。Eppo 能做到这一切甚至更多——快速交付结果,避免漫长的分析周期,并帮助你轻松找到所发现问题的根本原因。
Lenny (00:03:44): Eppo 让你超越基本的点击率指标,转而使用你的北极星指标(north star metrics),如激活、留存、订阅和支付。Eppo 支持前端测试、后端测试、邮件营销,甚至机器学习模型。请访问 geteppo.com 了解 Eppo。没错,就是 geteppo.com。让你的实验速度提升 10 倍。
加入 Spotify 的经历
Lenny (00:04:08): Gustav,欢迎来到播客。
Gustav Söderström (00:04:11): 谢谢你邀请我,Lenny。很高兴来到这里。
Lenny (00:04:13): 能有你来做客是我的荣幸。到现在为止,你在 Spotify 已经待了超过 14 年,这在科技界是很罕见的成就,而且你在 Spotify 期间担任过很多不同的角色。能不能先给我们讲讲这些年你在 Spotify 担任的各种角色和做的事情,然后再说说你现在在做什么?你现在负责什么?
Gustav Söderström (00:04:33): 我是在 2008 年底、2009 年初加入 Spotify 的。我那时的工作——我之前一直是创业者,在那个年代还非常早期的功能机、智能手机领域创办过自己的公司。所以我在那个领域积累了大量知识。我把公司卖给了 Yahoo,在移动领域。我在那里工作了一段时间后回到瑞典,然后通过一个共同的朋友认识了 Daniel Ek,Spotify 的 CEO 和联合创始人。他们当时已经做出了桌面端产品,那个免费的流媒体桌面应用,非常惊艳,我可以试用,但他们需要有人来搞清楚移动端该怎么做。因为我曾在那个领域创业,所以我得到了这份工作。
Gustav Söderström (00:05:20): 所以我的工作是负责 Spotify 的移动业务,搞清楚移动端的产品形态是什么。这是一个挑战,因为很显然,Spotify 桌面端是一个免费按需流媒体应用,而在当时,特别是在 EDGE 网络环境下,你根本无法进行实时流媒体播放。性能跟不上,而且你也无法用广告模式来支撑它。所以那是一个产品和商业模式的创新,非常有意思。这就是我开始的方式。几年后,我承担了 Spotify 所有产品开发的职责。再过几年,我实际上又接管了技术方面的职责,也就是 Spotify 的 CTO 角色。最近,我的正式头衔是与 Alex Nordström 一起担任 Spotify 的联合总裁。我们各管公司一半——我负责产品和技术这边,他负责业务和内容那边。这就是超快版的故事。
Gustav Söderström (00:06:20): 除了因承担更多职责而接管技术部门之外,我的职位名称其实一直是同一份工作。我一直向 Daniel 汇报,但因为 Spotify 增长得太快,每六到十二个月就像加入了一家新公司。一开始是一个瑞典和北欧的挑战,然后变成了一个欧洲的挑战,然后是进入美国,然后我们成了一家上市公司。所以就好像我在很多不同的工作之间跳来跳去一样,尽管头衔和角色大体上是相同的。
Lenny (00:06:55): 你的故事让我想到那句经典的话——小心你擅长的事,因为你最终会被分配越来越多。显然,这些年来你被赋予了越来越多的责任,说明一切进展顺利,你做得很好。
Lenny (00:07:08): 换个话题。你现在正在我的播客上做客。你其实有自己的播客,是关于 Spotify 产品故事的限定系列,我听了也很喜欢。实时听到你的声音有一种超现实的感觉,因为我最近为了准备这次对话一直在听你的播客。两个问题:第一,明知自己有全职工作、事务繁多,是什么让你决定推出自己的播客?而且据我观察,你播客的制作水准非常高。第二,就你最终打造的产品而言,以及从播客创作者的角度去共情,你从这段经历中学到了什么?
Gustav Söderström (00:07:41): 我做这件事有很多不同的原因。其中一个,而且不是小原因——我想和你一样,我喜欢写作,我内心有一个隐秘的创作者梦想。我很久以前写过博客,在公司内部也写很多。但当你在这样一家公司工作时,你不能在外部写太多东西。
Lenny (00:08:01): 对。
Gustav Söderström (00:08:02): 但我喜欢写作、交谈和演讲。所以这肯定是原因之一。然后同样重要的一个原因是,从产品的角度,去共情我们的核心利益相关方之一——播客创作者。很遗憾,我不是一个出色的音乐人。我尝试过演奏乐器之类的,但我没有出过唱片,唱歌也不太行。但我决定做一档播客,这让我深刻了解了作为一个创作者、制作不同风格的播客是什么体验。
作为创作者的切身感受
Gustav Söderström (00:08:33): 比如,我们想做一档制作成本更高的播客,加入音乐,然后你立刻就会遇到一堆问题——而 Spotify 实际上在解决这些问题上处于很好的位置——但从版权的角度来看,在播客中使用音乐真的很困难。所以你能理解播客创作者面临的所有这些难题,从而更好地去解决它们。但最大的好处,也是我做公开播客的真正原因,是我之前其实通过一个 hack 做过一档内部播客,我们可以把它设置为仅限员工收听。我试图在内部围绕 Spotify 建立更强的文化,帮助新老员工认识我们是谁、我们犯过的错误、取得的成功,以及我们如何思考战略,特别是产品战略,因为我们在外部以技术和 squads 等等这些东西而闻名,但在产品战略方面并不那么出名。
内部播客的起源
Gustav Söderström (00:09:36): 因为我喜欢讲故事胜过写 Google Docs,所以我决定做一档内部播客。我四处走访,采访了 Daniel 的直属下级——也就是 CMO、CHRO 和 CFO——问他们各种各样的事情。初衷是让他们对员工来说更平易近人,因为我觉得听播客的时候,即使那些完全不知道我是谁的人——因为我从未见过他们——我也会觉得我认识他们,了解他们的想法,而且更喜欢他们了。所以我的秘密想法是,如果你能比通过偶尔的会议或全体大会更好地了解你的领导层,会怎样?所以我在内部做了这件事。而因为我是一个做产品的人,我们最终聊了很多关于产品和产品战略的话题。内部的人非常喜欢。
从内部到公开播客
Gustav Söderström (00:10:26): 所以下一次,问题变成了:如果那些甚至还没在 Spotify 工作的人,也能觉得自己认识 Spotify 的人呢?那会很棒,因为大多数公司的大多数领导都非常不透明,当你看到他们出现在商业报纸上什么的,他们就像是来自另一个世界的、不太真实的生物。如果你听过他们聊一个小时左右呢?所以这就是基本想法。是招聘工具、分享更多我们关于产品战略的思考,以及——纯粹因为我觉得这很好玩——的结合体。我采访了一群聪明又有趣的人,有外部的也有内部的。
Lenny (00:11:09): 回顾来看,它达到了你期望的效果吗?
Gustav Söderström (00:11:12): 我觉得达到了。播客表现不错,而且我们没有给它自己的推广资源。我不得不像其他人一样竞争,这也让你对这个问题产生了很多共情——好,你现在有了一个产品,那用户获取呢?你到底怎么让人来听?所以它确实达成了我想要的。我们有一个叫 intradays 的活动,尤其是在过去几年我们大量招聘的时候,我们实际上会飞人到斯德哥尔摩参加入职培训,了解 Spotify。领导层会在台上讲述他们负责的工作、各个部门和战略等等。经常有人来跟我说,“哦,我听过这个播客,或者那期节目,这至少是我加入的关键原因之一,有时候就是加入的原因。“所以这是轶事性的,但至少有几十个人这么说过。看起来是有效的。
内容在招聘与文化中的力量
Lenny (00:12:12): 这真的很有意思。再说一次,这在播客中出现过好几次,就是内容在所有这些不同场景中的力量——招聘、文化建设。听起来最初的目标就是内部建立文化和战略。
Gustav Söderström (00:12:24): 最初的目标就是让高管层更平易近人,缩短距离,然后也用更有趣的方式分享更多的思考,而不是仅仅通过最终没人去读的文档。
Lenny (00:12:38): 我喜欢这个。所以我一直在听你的播客,正如我所说,真正有趣的是,第四集其实全是关于 AI 的,我觉得讲的是你们最初在 Spotify 内部利用机器学习和 AI 的尝试。我想那最终催生了 Discover Weekly 和其他一些功能。那是很多年前的事了。而现在 AI 再次成为一件大事,在当下听这些内容很有意思。所以我很好奇,非常具体地在产品团队层面,你对产品经理和产品团队有什么建议,关于如何在产品思考中考虑 AI,以及在日常工作中的使用?
AI 与产品思考
Gustav Söderström (00:13:12): 我可以举几个例子。我不知道我们是不是比别人更先进,但我们确实在传统机器学习方面已经做了相当长一段时间了。在播客里,我讲过互联网发展的几个阶段。一种理解方式是,互联网始于用户策展。就是你把某个东西——好的东西,比如人、书籍或音乐——数字化并放到网上,然后让用户来策展。这就是你的 Facebook、Spotify 等等。然后过了一段时间,世界从策展转向了推荐,由算法代替人来完成这些工作。这是一个重大变化,要求我们和其他人重新思考整个用户体验,有时甚至包括商业模式。
Gustav Söderström (00:14:04): 我认为我们现在正在进入的,是从策展到推荐再到生成的阶段。我猜想这将是一次同样巨大的转变,最终你将不得不重新思考你的产品。这是其中一个视角。所以我通常和我的团队谈论的是,即使这些都是机器学习,我让他们把它当作完全不同的东西来思考。推荐时代是一种类型的机器学习。生成时代是另一种类型,所以不要把它当作更多相同的东西,而要当作实际上完全崭新的东西来思考。而我们从中学到的……嗯,有几件事。如果你看看大语言模型和融合模型等新纪元,有两类应用。正如我所说,在推荐时代,我们不得不为推荐优先的时代重新思考用户界面和体验。
Gustav Söderström (00:14:57): 那么在生成时代这意味着什么呢?目前还没有人真正知道。和往常一样,有一系列的渐进式改进。我们用这些大语言模型来改进推荐。你可以使用更大的向量,拥有更多的文化知识。你可以用它对还没有人听过的播客做安全分类等等。所以有很多显而易见的改进,我们也在做这些。但到目前为止,我们只真正做了一个严格意义上的生成式产品——也就是一个没有生成式 AI 就不可能存在的产品——那就是 AI DJ。这是一个我们思考了很长时间的概念。AI DJ 就是你按下一个按钮,一个数字化的人,有一个叫 X 的真人,被数字化了的 X。他现在是一个 AI,过来说跟你聊聊你喜欢的音乐并推荐音乐,你可以听。如果你不喜欢,你可以把他叫回来,他会说,“好的,现在让我们来听听大概几个夏天前的音乐,“或者”这是昨天在《最后生还者》某一集里正在流行的一些新东西,诸如此类的。”
Gustav Söderström (00:16:10): 所以那个产品没有生成式 AI 就不可能存在,既需要生成语音,也需要生成语音所说的内容。这样你就能在五亿人的规模上实现个性化的语音。这个用例我们已经看到了很多很多年。有时候人们称之为电台用例。我们在内部称之为零意图(zero intent)用例——就是你真的完全不知道自己想听什么的时候。
Gustav Söderström (00:16:40): Spotify 在这方面并不擅长。Spotify 擅长的是,你至少大致知道自己的使用场景——是锻炼还是晚餐。我们为所有这些场景提供了很多选择。但如果你真的完全不知道,打开 Spotify 对着屏幕发呆是很困难的。人们过去常常怀念地说,这是电台唯一比我们做得好的地方。说实话,电台其实相当糟糕。它根本不是为你个性化的。它不是点播的。你从中间开始听,在很多方面它其实很糟糕。但人们仍然经常说电台有某些好的东西。我认为那个好的东西就是,你有一个旋钮,可以在不同的上下文之间切换。就像——不,无聊,无聊,无聊,无聊,好,这个不错。
Gustav Söderström (00:17:23): Spotify 从来没有过这种模式——我不知道自己想要什么,但我想不断切换直到找到我喜欢的东西。我认为通过 AI DJ,我们实际上解决了这个用例。所以 X 出来说,“我要给你推荐一些你可以听的东西。“如果你喜欢,你可以继续听,但如果不喜欢,你把他叫回来,换个频道。出于某种原因,我们尝试了很多次、很长时间来解决这个问题,但是直接播放一首随机歌曲而没有任何你为什么会听到这首歌的上下文,就是从来没有奏效过。这是我们第一次涉足一个以前不可能存在的产品。回到你关于原则的问题,我们学到了几个非常明确的原则。
容错用户界面
Gustav Söderström (00:18:09): 一个我非常喜欢、但完全不是我的原则——我认为它直接来自 Chris Dixon——就是容错用户界面(fault-tolerant user interfaces)的原则。在早期机器学习时代,当我们说从策展转向推荐的时候,我不知道看了多少次这样的设计草图:一个单一的大播放按钮,因为显然那是你能做到的最简单的用户界面,但如果你不了解你的机器学习的性能表现,你就无法为它做设计。如果你要设置一个单一播放按钮,你的机器学习质量需要真正达到 100%,即零预测误差,而这是永远不可能的。所以假设你有五分之一的命中率,五个里面有四个是不行的,那么你可能需要一个至少同时在屏幕上展示五个东西的 UI。这样你在屏幕上就有五分之一的机会看到相关的内容。
Gustav Söderström (00:19:03): 所以你需要理解你的机器学习的性能表现,才能为它做设计。它需要容错,而且你通常需要给用户一个退出通道(escape hatch)。你做了一个预测,但如果预测错了,用户需要能非常容易地说,“不,你错了,我想去我的音乐库,或者去这里、去那里。“所以我们有这样一个原则:拥有容错的用户界面,以及与算法当前性能表现相匹配的用户界面。我认为这对于生成式机器学习同样适用。一个非常清晰的例子实际上是 Midjourney。如果你回想 Midjourney 早期在 Discord 频道里的用户界面,实际上生成一张图像非常、非常慢。
Gustav Söderström (00:19:48): 生成一张高质量图像需要很长时间。他们本可以做一个那种银色按钮的东西——你输入一个 prompt,等上几分钟,得到一张图像,而且每四次就有一次是不好的。所以你会四次里有三次失望,每次一分钟,四分钟后你会觉得,“这是个垃圾产品。“而他们的做法是同时快速生成四张低分辨率图像,你可以说,“所以很明显,他们的性能表现大概是四分之一,这就是为什么他们展示四张而不是六张。“四张里通常有一张相当不错。你点击那张,然后要么继续迭代,要么放大。这也是一个我认为理解了生成式 AI 在构建 UI 时的性能表现的例子。这是我会从中获得灵感的东西。
Gustav Söderström (00:20:37): 就 AI DJ 而言,另一个原则是尽量避免那种只想炫耀技术、然后让这个声音一直说个不停的冲动。你必须记住,人们来这里是为了听音乐的。所以 AI DJ 的原则——顺便说一下,这其实是一个自下而上的产品,它需要大量的支持。我们实际上收购了大公司等等才能把它做出来。但这个想法是团队自下而上构建的。那里的原则就是尽可能少做,然后让开。我认为这真的很有帮助。它不会告诉你天气如何、新闻里发生了什么、也不会滔滔不绝地讲这个乐队的事情。它致力于把你带到音乐那里,我认为这就是它有效的原因——它确实对我们非常有效。
AI 生成音乐:乐器还是威胁?
Lenny (00:21:24): 我很喜欢你关于推荐和生成之间区别的论述。这自然引出了一个问题——我猜你也看到了这个趋势——人们正在使用艺人的作品目录来自动生成音乐。大概一两周前有个 Drake 和 The Weeknd 的东西出来了。你觉得这最终会走向何方?在音乐可以被自动生成的世界里,你认为艺人如何适应?这个播放按钮背后全部是生成的内容,而不像 DJ 那样只在歌曲之间做衔接。
Gustav Söderström (00:21:53): 首先,大的前提是,现在还非常早期。没有人知道这一切会如何演变,法律环境等等也都还不确定。但我认为它会产生巨大的影响。我想如果我们分两方面来看,一是它对音乐本身能做什么,二是权利状况,比如版权方是否能得到报酬等等。我们先单独谈第一个方面。我觉得一个很有意思的例子是,正好在我成长的年代,Avicii 出现了。回想这件事很有意思,因为当时的音乐行业并不真正把 Avicii 视为一个真正的艺人,因为他不会真正演奏乐器,也不会唱歌,他就坐在那里对着电脑和 DAW(数字音频工作站)。所以那不被认为是真正的音乐。而我认为现在我们所有人都认为那是非常真实的音乐,他拥有极其出色的真正的音乐才华。
Gustav Söderström (00:22:51): 所以我认为现在,我们大概正处于人们说”这不是真正的音乐”、“这 somehow 是假的”的阶段。我认为看待这些扩散模型——如果且当它们足够擅长生成音乐的话——大概应该和看待一件乐器一样。它只是一件强大得多的乐器,我们很可能会看到一种新型的创作者——他们不精通任何乐器,也无法组建一整支管弦乐队来实现脑中的想法,但他们现在可以生成非常新颖的东西。另外,顺便说一下,我认为所谓的 AI 音乐和真正的音乐之间的区分其实并不存在。非常肯定的是,非常有才华的真正的音乐人正在使用 AI 来做得更好、帮助创作新的想法。所以这种区分并不真正存在。一切都会是 AI。问题只是比例是多少,这让问题变得更难,因为你不能讨论它该不该存在。
Gustav Söderström (00:23:50): 你必须讨论的应该是多大比例应该存在、谁有权使用它。但我认为看待它的方式大概是把它当作一件乐器,它可以帮助创作大量的艺术作品。我觉得这对你们来说大概不是什么新闻,你们可能经常使用这些东西,但我认为如果你不用这些生成式模型,会有一种认知:你让它创作一首热门歌曲,它就会给你一首。实际上并非如此。这些模型做的事情是——因为它们听过大量的音乐——它们非常擅长做出听起来和已有作品非常相似的东西。真正做到原创其实非常难。从某种角度来看,随着创作普通音乐变得更容易,做到真正的独一无二反而会比以往更难。
权利问题与行业变革
Gustav Söderström (00:24:39): 所以我仍然认为,创作出真正独一无二的东西需要极大的技巧。我的希望是,就像 DAW 和那次技术飞跃带来了一种全新的流派——EDM(电子舞曲)——你不可能用一支管弦乐队或现场演出来呈现它——也许我们会借助这些技术看到全新的音乐风格。我觉得那将非常令人兴奋。这是积极的一面,但还有权利问题,对此我深表同理。而且 Spotify 之前经历过类似的事情。我们经历过一次类似的技术变革,就是音乐在线下载和盗版以及点对点传输的技术变革。首先是点对点传输的重大技术变革,对消费者来说很令人兴奋。比以往更多的消费者开始听更多的音乐。我认为这就是我们现在在生成式 AI 阶段所处的位置。
Gustav Söderström (00:25:31): 一项新技术出现了,但它还需要一种新的商业模式,创作者和行业才能真正参与其中并从中受益。当然,这说法有自利之嫌,因为我们在创新那种商业模式方面扮演了重要角色。但我仍然认为这是必要的,我希望这也是我和我们能参与其中的事情。所以我认为我们已经看到了第一部分——技术变革——接下来可能会有很多讨论和混乱,对此我深表理解,但我认为我们还没有看到第二部分。什么样的模式能让这成为一种利好?盗版之后实际发生的事情是,音乐产业变得比以往更大——不是恢复到从前那么大,而是比以往任何时候都更大。我认为这项技术也可能做到这一点。但我们正处于最开端。
产品中的”魔术效应”
Lenny (00:26:20): 顺着这个思路,你还有一个观点是,所有真正伟大的产品都必须变某种魔术。这个说法在你的播客里经常出现,我觉得你在其他地方也提到过。结合你在这里谈到的所有东西,感觉在某种意义上,一切都会让人觉得像魔法,因为 AI 已经融入其中了。
Gustav Söderström (00:26:38): 我觉得我们在做 AI DJ 的时候,就做了一次小版本的魔术。当人们第一次听到它时,我们在用户测试中就能看到那种反应——他们会想……所以那个魔术是:他们怎么可能录下这个人说这么多不同的话,因为它在谈论我的音乐。那个魔术,显然,不是录了一个人说这些话,而是生成的。而那个魔术感会消退。你现在随时都能听到它等等,但它确实是那种魔术之一。所以我仍然认为这个概念很重要,而且它似乎和产品的病毒式传播、起飞是相关的。
Gustav Söderström (00:27:15): 我觉得第一次使用 Dall-E 或 Stable Diffusion 或 Midjourney 也是一样的。它完全就像一个魔术。当然,不存在什么魔法,它只是数据和统计。但我认为达到那个点,把一个产品迭代到第一次使用时就让人觉得像魔法的程度,是非常有帮助的。而且这通常只是一个把性能提升到一定水平、缩小范围、去掉一些东西的问题。我认为有很多微调工作能让你跨越那条线——从”很酷、很令人印象深刻但不是魔法”到”感觉像魔法。我不明白这是怎么做到的。”
Lenny (00:28:00): 对,这让我想起 ChatGPT 的发布,它最终成为历史上最大、增长最快的产品,这就是魔术效应的典型。它感觉像真正的魔法。
Gustav Söderström (00:28:09): 完全同意,完全同意。而且对大多数人来说,它仍然非常……实际上在很大程度上,甚至对研究者来说,它也有一点魔法的味道。没有人真正完全理解。所以我想这个世界上也许还有一些魔法存在。
Lenny (00:28:23): 确实。我觉得很多人正因为不理解其中发生了什么而感到担忧。换个话题——转到你们在 Spotify 打造产品的方式。Spotify 因为推广了小队和部落这个理念而闻名。如果我理解有误请纠正,但你们已经放弃了那种做法?
Gustav Söderström (00:28:39): 是的,没错。
Lenny (00:28:41): 好的。我很想了解你们为什么转变,从那种产品构建方式中学到了什么,以及现在你们怎么组织团队?你们现在是怎么做的?
Gustav Söderström (00:28:51): 这是我们早期非常投入的一件事,后来证明把这些东西命名为小队、分会等等确实很明智。它其实不完全是……嗯,也许算是刻意品牌化吧,但我们发明这些称谓并不是出于品牌化的目的。我们发明它是因为我们认为这是一种好的组织结构,需要给各种东西取名字,而那还是互联网早期,什么东西都可以自己发明。所以它在当时对我们所处的阶段非常有效,当然也在招聘上帮了大忙。但现在它反而有点成了我们的负担,因为人们仍然以为我们是那样组织的,而在我们目前的规模下,那并不是一种很高效的组织方式——甚至如果你现在从头开始,鉴于我们学到了更多,可能也不会那么做了。
Gustav Söderström (00:29:35): 但我认为最大的区别在于,小队的理念具体来说是两方面的。它们应该小而全栈。一个小队大约七个人,应该包含前端和后端、移动端、QA、敏捷教练等等,核心理念是高度自治。而这正是我们真正转变的地方。首先,随着公司成长,以七名工程师为一个单位来扩张会产生大量的管理开销。所以很明显,我们的团队现在要大得多——每个管理者手下可能至少是原来的两到三倍,也许是十四个人而不是七个,同时减少了那些纯开销性质的岗位。这是第一点。随着你学到更多,随着规模扩大,组织结构看起来会更传统也更合理。第二件大事,我认为我们当时面临的挑战是,我刚加入的时候,Spotify 的平均年龄是……我是说,我是年纪最大的,那是十四年前的事了。所以平均年龄大概不到三十岁,这在大多数科技公司里也不多见。
Gustav Söderström (00:30:46): 我们来自瑞典,那里的文化和美国不同。我很喜欢瑞典文化的很多东西,我认为我们保留了其中最好的部分。但瑞典是一个高度自下而上、崇尚自治的文化。有一幅很著名的画,讲的是瑞典人怎么做决策的。在美国,我觉得就是层级制。在瑞典,是一个圆圈。大家围成一圈,中间没有人,没有所谓的领导。
Lenny (00:31:13): 有意思。
Gustav Söderström (00:31:14): 所以从文化上讲,我们深受这种超级自治理念的影响。我认为自治的理念是非常合理、正确的——我们的工作方式是,尽力招募我们能找到的最聪明的人,并为此支付高薪。所以如果你在招聪明人,一种理解方式是你在租用脑力。
Gustav Söderström (00:31:39): 所以如果你租用了这么多昂贵的脑力,却不给他们任何自主思考的空间,这听起来就不太明智——那你不如招没那么聪明的人,把成本降下来之类的。所以我认为你必须给予一定程度的自治,才能真正最大化你所做投资的价值。所以给予人们大量空间去尽可能发挥他们的才智和能力,是非常合理的。但问题在于,如果你把自治权放在组织非常靠下的末端,而且同时你的组织又非常年轻——我们当时就是这样——那很有可能你只是在产生热损耗。你会有一百个小队,带着一百套策略,朝着一百个方向跑。Spotify 确实经历过那个阶段。
从”产生热损耗”到”优化自治层级”
Gustav Söderström (00:32:29): 我是说,我们最终确实走到了某个地方,尽管如此,但我很难说我们的效率很高。所以我们做了几件事。团队结构更加传统了,团队更大,开销更少。而且我们一直在专门研究一个问题:自治权应该放在组织的哪个层级?极端情况之一是在最末端,我们曾经就在那里。另一个极端可能是在最顶层,比如说 Twitter,就一个人说了算。两种都有问题。如果你放在末端,你会产生大量热损耗。如果你放在顶层,你需要一个能力极强的人,Elon 确实能力很强,但从定义上讲,这就一定会形成瓶颈。所有决策都必须经过他。而 Daniel 的性格也不是想要亲自做所有决策的那种。
Gustav Söderström (00:33:19): 他想要的是最大化吞吐量,而不是成为吞吐量的瓶颈。所以问题是,如果不是在顶层,也不是在最底层,那该放在哪里?我们找到的答案——我觉得这并不算什么反常识的观点,我认为大多数公司都是这样——是在 VP 层级。也就是说,有 Daniel,然后是 C 层级,包括我和其他人,然后是 VP 层级,这是一个很好的平衡点。与其只有公司里一个人思考——也就是只有 Daniel 思考,其他人只管执行——不如在 VP 层级上有几十甚至上百人拥有高度自治的思考权。这样你获得了充分的思想自由和不同方向的思考,但又不是八千个人各想各的。而且这些 VP 层级的人既有一定的数量,通常也都相当资深。他们拥有很强的模式识别能力。
Gustav Söderström (00:34:16): 所以我认为这解决了一个问题——如果把它看作一个优化问题的话,这是一个很好的优化空间。所以 Spotify 现在的自治层级在 VP 层相当高,然后越往下自治度越低。
Lenny (00:34:33): 你说到自治,实际上意味着什么?比如说播客产品的 VP 对做什么有很大的决定权,而上面的人不会过多介入?我知道播客产品的 VP 是 Maya,对吧。
Gustav Söderström (00:34:48): 没错。
Lenny (00:34:49): 我想她以后也会来做播客嘉宾。那对她来说,实际操作上意味着什么?
Gustav Söderström (00:34:54): 意味着我会请 Maya 来制定播客业务的策略——我们要怎么做,怎么做出差异化,为什么播客创作者愿意选择我们这里。换作别的公司,这个策略可能由我来制定,或者由 Daniel 来制定。再比如 AI DJ,它就出自我们的个性化团队。那是他们下的一注赌注。他们有自治权来做这类赌注、制定策略。用户界面也是一样,我们有一个体验团队,组织结构可以之后再说,但我给了体验 VP 很大的自治权去定义和建议我们想做什么。在别的公司,这些可能全部由我自己来定。
组织设计的极端光谱
Lenny (00:35:45): 再进一步问——我知道你对团队组织方式以及组织结构如何帮助你优化特定目标有很强的观点。你在这方面的思考是什么?关于组织结构的影响和你到底在优化什么,你学到了什么?
Gustav Söderström (00:36:03): 好的。我经常讲一个理想化的光谱,或者说不一定是理想化,而是夸张化的光谱。没有什么绝对正确,但你通过创造极端来阐明观点。光谱的一端是类似 Amazon 这样的,以两个披萨团队、零依赖著称。你尽量最小化依赖,这样各团队可以并行运作。团队之间甚至会在同一个项目上相互竞争等等。但他们能直接触达用户。
Gustav Söderström (00:36:37): 所以这样做的好处是,如果你有一个想法,触达用户的时间非常短,这对他们来说是行之有效的。它催生了 Kindle,催生了 Alexa,催生了很多非常新颖的东西。但也有一些有趣的弊端。其中一个弊端——我对 Jeff Bezos 的前瞻性预见力极为钦佩——如果你有相互竞争的团队,激励机制会促使你隐藏自己的成果,隐藏自己的代码。这应该会导致一个毫无平台协同效应的组织,因为没有人会合作。我认为要么是他有这个洞察,要么是因为他看到了这个问题,所以不得不这么做,但他以极其强硬地推行硬 API 而闻名。如果你不为自己的技术创建硬 API,你就出局了。你仔细想想,必须是这样的,因为否则没有人会去做。
Lenny (00:37:32): 硬 API 本质上就是所有人都知道如何使用这个 API,知道如何连接到这个团队进行交互。
Gustav Söderström (00:37:38): 没错。你必须把自己的技术暴露给其他人。你必须维护这些 API,而且它们必须非常结构化,因为否则整个体系就会崩溃——因为大家都被要求竞争,没有任何合作的激励。你必须通过中央强制来实现。有趣的是,尽管理论上他们是处于最不利的位置来构建一个结构化的平台——我认为,正因为他们强制推行得如此坚决,反而他们做出了 Amazon Web Services,因为他们的硬性定义 API 如此严格,正是因为这条规则,所以他们更容易将其由内向外翻转,暴露给全世界。而如果你看 Google 这样的公司,我认为他们在对外暴露 API 方面反而更加挣扎,也许是因为内部环境太友好、太松散了。他们不需要在内部设置那么硬的 API,因为内部没有竞争。人们可以直接进入彼此的代码。
速度与一致性的权衡
Gustav Söderström (00:38:18): 所以这是一个有趣的小故事,但核心观点是你在那一端更快,但协作会很困难。所以你会看到——也许有点夸张——有时候你会在同一个页面上看到来自不同团队的多个搜索框。顺便说一下,这在 Spotify 也是真实发生过的。你会在”正在播放”界面上看到来自不同团队的多个弹窗提示,因为大家各做各的。当我们处于自治模式时,每个人都在各自为战。然后……所以你获得了速度的好处,但代价是把你的组织架构图发布出去,把复杂性推给了终端用户。但显然,这对 Amazon 来说是正确的选择,因为他们是一家万亿级的公司。而在光谱的另一端,你有像 Apple 这样同样是万亿级的公司。所以显然,两种模式都行得通——在 Apple 那里,你永远不会在 iPhone 上看到来自同一功能的两个搜索框。那里的一切都是由接近于单一决策者的角色来集中组织的。
Gustav Söderström (00:39:20): 所以他们实际上是处于世界上可能最大规模的功能型组织之中,他们在尽可能多地做事情。你想想 Apple 里面包含了什么——他们当然做了我们所做的一切。他们有音乐服务、播客服务、有声书,还有十亿个其他服务。所以并不是说他们面对的问题更简单。然而他们打造出来的产品感觉更像是由一个开发者为一个用户构建的。所以他们集中管理,有一个瓶颈式的功能——所有东西都必须经过它,被决定如何与其他一切协调配合。这样做的好处是用户体验更简洁,不会把组织架构图推给用户,不会增加复杂性。但它也有速度上的弊端——虽然我没有一手数据——我听在 Apple 工作过的人说过:“是的,那个东西花了七年才上市,“因为你只能排队等待。
双击电源键的决策
Gustav Söderström (00:40:17): 所以你有这两个极端。我认为最值得深思的例子是,当你双击 iPhone 的电源键时,Apple Pay 会弹出来。那个决策是怎么做出来的?你可以想象,所有服务团队都想在双击那个按钮时弹出自己的东西。所以必须有人来决定:应该是音乐弹出来?应该是支付弹出来?还是别的什么?所以他们在那里有一种不同的决策结构。在这个集中化与去集中化的光谱上,基于我们的战略——我们是一个单一应用,试图把我们实际上已经在做的、把多种内容类型——背后商业模式截然不同的内容类型——整合到一个单一的用户体验中,涉及不同的收入分成、版税和图书版权交易等等。这就是我们的战略。我们认为保持用户体验的简洁是最重要的。
Gustav Söderström (00:41:12): 所以我们选择了更偏集中化的模式,这些不同的垂直业务——你想想看,音乐业务、播客、有声书业务——它们都必须经过一个统一的推荐组织,因为这是另一个问题。你向哪个用户推荐什么?应该是一本书还是一个播客还是音乐?你如何在它们之间做权重权衡?而且用户界面也很容易变得极其复杂,如果每个团队都构建自己的 UI 的话。音乐团队构建自己的 UI,然后其他人在上面叠加功能。所以这就是我们选择优化的方式。但这是基于我们的战略,而且我认为两种模式都行得通。
Lenny (00:41:50): 本期节目由 Eco 赞助播出。上个月,Eco 用户平均赚取了 84 美元的现金返还奖励。怎么做到的?通过 Eco——个人理财的未来。Eco 是对一个错位的金融系统的升级。它提供了一款像你的银行一样运作的应用,但去除了几乎所有中间环节,帮助即便是最优秀的理财优化者也能在更短时间内自动完成优化。如果你付房租就能获得奖励呢?如果点外卖和网购也能获得奖励呢?甚至每月储蓄也能获得奖励呢?然后再想象一下,获得奖励这件事本身还能再次获得奖励。通过 Eco,你可以在一些最喜欢的商家消费并自动获得 5% 的现金返还,而且 Eco 的年化收益看起来更像是 80 美元,而不是 80 美分。然后还有 Eco 积分——世界上首个开放的奖励系统。你在 Eco 应用中做几乎任何事情都能赚取积分。
Lenny (00:42:40): Eco 正在努力让这些积分成为有史以来最超值的积分。所以越早加入越划算。听起来好得难以置信?访问 eco.com/lenny,注册参加一次入门介绍,看看为什么并非如此。参加 Eco 欢迎会的 Lenny’s Podcast 听众,将获得存款超过 1,000 美元部分的独家 4% 年化收益。了解更多请访问 eco.com/lenny,拼作 E-C-O.com/lenny。
微笑曲线与极端的优势
Lenny (00:43:05): 有趣的是,你举的这两个例子,Apple 和 Amazon,它们是世界上最大的两家公司,而它们恰好处于这两个极端光谱的两端。有意思的是,大多数公司处于中间某个位置。我在想,是不是处于极端本身就有一种好处,而这最终变得非常重要。
Gustav Söderström (00:43:21): 我认为是的。在几乎所有行业中,你都有这个微笑曲线的概念,你希望站在微笑曲线的两端,那里才是巨大的商业机会所在,而不是在中间。所以在组织模式上大概也是如此。
下大赌注与重大改版
Lenny (00:43:34): 说到极端,我想聊聊下大赌注这件事。你们最近搞了一场大型发布会,基本上把 Spotify 的主信息流整个重新设计了,让它更接近应用未来的走向——类似 TikTok 短视频那种流式体验,你开始听到视频的声音、音乐开始播放,有些人很喜欢,有些人则完全不喜欢。我很好奇,作为一名产品领导者,你怎么思考长期主义?怎么应对那些直接说”这他妈的什么……我讨厌变化,别再改东西了”的用户?你怎么看待这件事?你听谁的?忽略谁的?你怎么判断要坚持到底?你的思路是什么?
Gustav Söderström (00:44:10): 你说的已经很客气了。Twitter 上关于这件事有大量负面反馈。让我实际上深入讲一些细节,因为我觉得对于听这个播客的产品人来说,这是一个很有意思的教训,很少有公司愿意谈论这个,因为你只想谈那些完全按你预期发展的事情,而不想谈那些没有完全按预期发展的事情。
品味泡沫问题
Gustav Söderström (00:44:39): 所以我来过一遍我们想解决的问题和我们学到的东西。Spotify 主要是一个后台应用,长期以来,我们被认为在后台音乐和播客推荐方面做得非常好。当手机在你口袋里,你正在听一个电子舞曲播放列表或者流行音乐播放列表之类的,我们非常擅长在里面插入另一首电子舞曲,或者另一首流行歌曲,诸如此类,在后台默默地完成。
Gustav Söderström (00:45:09): 然而,我们从用户那里一次又一次听到的反馈是,他们说自己被困在了一个品味泡沫(taste bubble)里。我很喜欢我的 Spotify,很喜欢这些,但我现在对电子舞曲有点腻了,而 Spotify 没有给我推荐完全新的东西。如果你仔细想想这个问题,它听起来可能和推荐问题很相似——不过是又一个推荐问题而已——但实际上它在根本上完全不同。因为当你在电子舞曲播放列表里推荐另一首电子舞曲时,你拥有大量关于这个用户喜欢电子舞曲的信号。但如果你要推荐一个全新的流派,根据定义,你什么都不知道。因为如果你知道,那它对用户来说就不是新的了。你不可能知道任何东西。所以回到命中率(hit rate),当你向用户推荐完全新的东西时,你的命中率会极其低。
Gustav Söderström (00:46:03): 所以帮助人们走出品味泡沫这个问题,并不像听起来那么简单。而且我们不能随便插一个不属于典型口味的流派进去。比如我非常喜欢 Reggaeton,这在瑞典并不常见。如果你看我的其他档案,全是电子舞曲,你大概不会猜到我听 Reggaeton。Spotify 也猜不到。所以如果我正在后台听我最喜欢的电子舞曲播放列表,或者我的金属乐播放列表——金属乐在瑞典很流行——我们真的很难在中间插入一首 Reggaeton。大多数人会觉得 Spotify 坏了。
Lenny (00:46:41): 是的。
Gustav Söderström (00:46:41): 他们到底在想什么?这真的行不通。所以为了帮助人们打破品味泡沫,你需要一种不同的方式。你需要一种命中率可以非常低的机制,而且用户也预期命中率会非常低。
信息流范式
Gustav Söderström (00:47:00): 当我们在后台推荐东西时,命中率至少需要十分之九,一首次品(dud)也许可以接受,但如果你出了五首次品,你就会觉得我们毁了你的播放列表和你的收听体验。我们需要一种十分之一成功就行的机制。如果你试了十次找到了一首好歌,你就非常开心了。所以你需要一种完全不同的范式。而且你还需要能够快速浏览大量候选内容,因为命中率太低了。你不能每个条目花三分钟。就像,“好吧,我不喜欢这个,“但距离下一个来还有两分钟。你需要能快速说”不、不、不”。所以显而易见的候选方案就是这种信息流式的体验,你可以在里面浏览大量内容,你预期命中率会低很多。如果你不喜欢,成本也很低,滑走就行。
Gustav Söderström (00:47:50): 这也解释了为什么人们……当他们想走出品味泡沫,或者当他们来到 Spotify 听一些全新东西的时候,通常是因为他们先在 TikTok 或 YouTube 这类服务上发现的,在那上面他们接触到了大量新内容。所以用户一直在向我们要求这类工具,而这正是我们想要解决的问题。于是我们构建了一系列功能,信息流式的结构,你可以在里面浏览一个新流派的多首曲目,或者一个播客频道的多集内容,甚至完整的播放列表。我们实现了这些功能,把它们放在叫做子信息流(subfeeds)的地方。在当前的体验中——这已经在全球范围内推出了——如果你点击播客子信息流,你会看到一个播客剧集的信息流。点击音乐子信息流,你会看到一个播放列表的信息流,你可以在里面浏览很多播放列表。如果你不了解某个播放列表的名字,你可以快速听听它是什么感觉,试听几首曲目,判断这是否适合你。如果你去搜索和浏览页面,你可以找到全新的流派,快速浏览。
召回与发现的平衡
Gustav Söderström (00:48:59): 这些东西按我们的预期在工作。人们想找新音乐的时候会去这些地方,在里面浏览,保存新歌曲。所以它们按我们的预期在运作。没有按我们预期运作的是——用户一次又一次地向我们要求这些功能,我们把这些东西的汇总放到了首页上,因为用户对发现的需求如此强烈,而且我们可以清楚地看到发现与 Spotify 留存率之间的强相关性等等。但我们在自己的首页上误判了——或者说是失败了——或者更准确地说是学到了一课,那就是它目前的运作方式。你在 Twitter 的评论中可以看到这一点,如果你过滤掉愤怒的声音,试着去看他们到底在说什么,他们说的内容其实在量化数据中也表现得非常清晰:如果你看人们在 Spotify 首页上的行为,当前的首页,几乎 90% 是我们所说的召回(recall)。
Gustav Söderström (00:49:59): 也就是回到你正在进行中的收听会话,或者你知道自己想去的特定播放列表,或者至少是一个特定的使用场景。所以你带着高意图进来,你其实知道自己想要什么,而可能只有 10% 的时间是真正的发现——就是那种”我不知道我想听什么”的状态。所以你想想,这是 90% 召回,10% 发现。当我们测试新设计的时候……子信息流本身是有效的,但当我们在首页测试其中一些的时候,我们把比例从 90/10 翻转成了 10/90。也就是 10% 召回,90% 发现。虽然人们想要发现功能,但他们大概不想要 90% 的发现,取代 90% 的召回。所以如果你去看 Twitter 上的评论,他们说的就是:“嘿,我找不到我的播放列表了。这些东西都在哪?”
Gustav Söderström (00:50:47):
他们其实不是在抱怨发现功能,他们抱怨的是那些他们再也找不到的东西。这一点在量化数据中也能看到。你可以看到流量从首页转移到了搜索和资料库,这明显是用户在试图找那些他们找不到的东西。你甚至能看到人们试图用那些为快速了解新事物而优化的发现工具来做召回。“我那个想去的健身播放列表在哪?” 这其实是非常糟糕的召回 UI,就像老虎机一样,对吧?你能不能找到那个健身播放列表完全不可预测。它是为发现新事物优化的,不是为召回已有事物的。做召回的时候,你想要的是那种屏幕上密密麻麻排列很多项目的 UI,因为你知道自己在找什么。所以做发现的时候你不需要很多展示面积。你需要很多像素,而且你大概需要声音,因为你不知道那是什么。
UI 设计的经验与产品嫉妒
Gustav Söderström (00:51:40):
所以我们从 UI 中学到了这些,我觉得这里可能还有一点产品嫉妒,你总会去看其他产品体验。如果你环顾四周,你很容易就会觉得大多数其他产品——比如你看 YouTube——他们的首页就是那样的。一个巨大的单条目发现信息流,全是新内容。而且人们似乎不会愤怒地发推文说他们有多爱你、YouTube 是多大的产品之类的。我觉得我们发现的是,我们在首页上其实有一件事做得非常好,那就是支持你同时处于多个收听会话之中。你可以同时正在听两档播客和一本有声书,同时还有”我真的只想找到那个健身播放列表,我不记得它的名字了,但我知道是健身类的”。
Gustav Söderström (00:52:31):
这部分我们确实做得非常好。我敢说比其他体验好得多——在那些产品里你真的要切换到某个标签页,进入资料库,开始浏览才能回到你之前的状态。所以这可能跟路径依赖有关。因为我们的召回一直做得不错,当用户没法再召回的时候,他们的不满程度我觉得是合理的。我们不想丢掉这个优势,因为这是我们做得好的事情之一,而我们低估了它的价值。我的结论其实是我们比其他产品做得更好。所以我们当然想保留这一点。所以我们做的事情是,现在我们只是在更新假设来实现同一个目标——这些功能是有效的,当人们想要发现的时候,他们会使用这些功能,而且看起来是有效的,它们也还有提升空间。
Gustav Söderström (00:53:20):
从机器学习的角度来看,你正在这条不断攀峰的旅程上,但问题是,你如何确保当人们觉得自己被困在品味泡沫中的时候,他们知道这些东西就在那里,而且很容易使用?所以现在我们有了一个新版本的首页,当然我们也在测试,这些东西非常容易接触到但是是自愿使用的,而且你仍然可以做所有的召回。所以从我的角度来看,这就是我们做 A/B 测试的原因——因为你想用科学的方法来做这件事,你想尽可能多地了解你自己的产品和你的用户。现在我分享了很多这些经验教训。也许我们应该留给自己,但我的直觉是这会让产品变得更好。
重新设计的两种产品开发
Gustav Söderström (00:54:09):
但我告诉团队的是,当我们开始做这件事的时候——因为这种事我已经经历过几次了——我认为有两种根本不同的产品开发方式。一种是设计新功能。这很难,但它是用户自愿使用的。所以你做了 AI DJ,有些人喜欢它,没问题。如果你不喜欢它,它也不会让你的体验变差。但当你做重新设计的时候,就棘手得多,因为参与重新设计不是自愿的。所以即使对不喜欢它的人来说也有代价。然后你就面临一个非常棘手的问题,就是你将会收到两种反馈。一种是你做的事情是对的,但人们很生气因为你改了东西。另一种是你做的事情确实不对,人们也很生气,但这次是正当的理由。
Gustav Söderström (00:55:08):
那你怎么区分这两种情况呢?因为我觉得我向……当我们和团队讨论这些的时候,我觉得可以这样类比——你想想你的桌面,你物理上的桌面,电脑放在一个地方,铅笔在那边,笔记本在那边,然后我进来把所有东西重新排列了一遍。而在我们的场景中,你已经用了那个布局也许 12 年了。就算我有大量量化数据证明我的新布局更好,你还是会生气,因为你在旧布局中是高效的。这两者很难区分。最经典的案例就是 Facebook 的动态消息,当它变成单一的动态消息流时,人们非常愤怒。但结果证明它解决了很多用户问题——你不用再跑遍 Facebook 各处自己去收集动态了。
Gustav Söderström (00:55:58):
所以确实有一些方法可以判断你是确实做得更好了但人们的习惯被打乱了,还是你并没有做得更好。比如一个方法是看新用户群组——他们没有那种旧行为——和全部用户群组的对比,等等。所以我们和团队走完了整个流程。在做之前,我就说了,“这会很痛苦。” 大概率会有很多推文,因为我们一次性做对的概率非常低。所以正因为如此,团队并没有承受太大的压力。这确实很难……你想回应人们,但正确的做法是倾听、理解、尝试新的假设来真正搞清楚发生了什么。所以我觉得我大概做过三四次这种事了。三次可能。一次不成功,两次成功。所以多少知道自己在面对什么。
Gustav Söderström (00:56:45):
所以这几乎像是在惩罚自己,非常痛苦,但也是最令人兴奋的事情。我觉得任何做产品的人都知道,最简单、最直接的做法就是在现有的基础上迭代。没有风险。你不会被开除,也不会有用户生气。但每个人也都知道,如果你最终不去适应新技术、新范式等等,你会被取代。你必须找到这种尝试新事物的平衡。而在软件行业工作,你有 A/B 测试这个工具,可以用科学的方法来做。做硬件就更惨了。如果你错了,那就是错了。没法更新。
Lenny (00:57:26):
我很喜欢这个故事。非常感谢你分享它。我猜对于这样的大型发布,你其实没法提前做 A/B 测试,因为媒体会泄露消息。他们会说,“天哪,看看 Spotify 在做什么。” 所以你在那方面是受限的。对吧?你没法真正提前测试这个东西。
Gustav Söderström (00:57:40):
最困难的是,如果你在尝试一个全新的东西,MVP 需要做得非常大——你可以构建一个新的 UI,但如果你没有为单条目信息流做算法,你就没法判断到底是方向对了但机器学习没做好,对吧?是 UI 的问题还是机器学习的问题。或者你必须构建很多东西,而这就变得非常昂贵。实际上……之所以痛苦的最大原因并不是来自外部的反馈。而是你在内部必须承担的成本。你投入了大量成本,因为你真心希望自己是对的。
Gustav Söderström (00:58:15):
在我们的情况下,首页的改动其实没那么难做。重要的是底层的假设——我们能否帮你打破品味泡沫——确实有效,然后你再把获客漏斗更新到那个体验中。但我觉得问题在于,你需要把太多东西就位,才能判断”你可能得到一个假阴性”,仅仅因为你做得不够好或者做得不好。我认为这是这些大规模重写中最大的挑战,每个人都必须更新所有东西,然后你才能知道自己是对还是错。
Lenny (00:58:52):
帮助你们理解什么有效、什么无效以及你想改变什么,那个过程是什么样的?我想你们会看大量数据,一些推文之类的。具体的操作是什么样的——“糟糕,事情没按我们预期的发展,这是我们应该做的”?
Gustav Söderström (00:59:08):
信息流方面,我们做了测试,但首页信息流,我们是先上线再测试的。我们在用户身上测试了几个不同的变体,然后拿到数据,更多看的是定量数据。我们也做了大量用户研究,让用户坐下来使用信息流,来理解并建立我们自己的心智模型,判断什么有效、什么无效。然后当然,你也会看用户反馈,有些用户很善于表达哪里不好用,有些则不太善于表达。所以有时候很难解读,但这当然也是一个因素。
定量验证与假设迭代
Gustav Söderström (00:59:49):
然后一旦你做了这些,你就有定量数据可以看。然后你坐下来梳理,你觉得什么是对的、什么是错的?有哪些不同的假设?什么有效、什么无效?然后就更新,再测试,反复迭代,直到你证明或否定你的假设。尽量以科学的态度来做这件事。而且我觉得,当你在某件事上投入了那么多时间之后,最大的风险是对它产生珍惜心理。你必须毫不留情。你必须对一件事百分之百地相信,直到数据说不行,然后你就百分之百地相信另一件事。这听起来很简单,但做起来非常难,难到当你这么做的时候人们会不高兴,因为出于某种原因,人们不喜欢别人改变主意。而这恰恰是我们应该希望每个人都做到的。我很愿意看到一位政治家说:“我看了数据,我意识到实际上这才是对的,所以我现在相信这个。“但我们讨厌那样做的政治家。他们看起来不可信赖,我们会嘲笑他们。
Gustav Söderström (01:00:56):
所以我觉得这对任何人来说都是最大的风险。你必须保持不情绪化,只看证据和数据。然后如果你这么做了,你就继续前进,最终到达你想去的地方,你解决的是同样的问题,但你不断适应调整。
Lenny (01:01:14):
我很喜欢这个理念。本质上就是”坚定观点,松散持有”的想法。是吧——
Gustav Söderström (01:01:19):
没错。就是这个意思。听起来很简单,但很难做到。
Lenny (01:01:23):
对吧?因为就像你说的,人们不尊重改变主意的人。他们会说:“哦,看吧,他们一直都是错的,还那么自信地错着。”
Gustav Söderström (01:01:31):
对,没错。而且不太清楚为什么我们不愿意看到这种事,但我觉得这和人类心理有关。我们实际上倾向于喜欢那些几乎没什么数据却持有非常强硬观点的先知和人物。那些才是我们喜欢的人。而那些看了大量数据并据实改变想法的人,我们反而不喜欢。不太清楚为什么。
Lenny (01:01:57):
我们是有缺陷的生物。
Gustav Söderström (01:01:59):
确实。
近期的认知转变
Lenny (01:02:00):
沿着这个思路,你最近有没有改变主意的事情?比如让你想到”哦,对了”的那种?
Gustav Söderström (01:02:06):
没有,我觉得关于科学系统和首页方面的这些经验教训,确实做得很好,也许比别人做得更好,我们不想把它们连同洗澡水一起倒掉,不管那个表达怎么说。我认为这是我目前最大的收获,我实际上对此非常高兴。
Lenny (01:02:26):
是啊,我很高兴了解到我们有些事情做得很好,而我们之前未必意识到,也许我们应该在这方面更加发力。
Gustav Söderström (01:02:34):
没错。
10% 规划时间
Lenny (01:02:35):
换个方向。Shishir Mehrotra 建议我问你一个问题。他应该是你们董事会的成员。
Gustav Söderström (01:02:41):
是的。
Lenny (01:02:41):
他建议我问你关于你的 10% 规划时间。这是怎么回事?
Gustav Söderström (01:02:46):
这是一个概念,我觉得 Shishir 从他在 YouTube 工作时起就用了很久了。核心思想是,大致来说,你不应该把超过 10% 的时间花在规划上,而不是执行或构建上。这意味着如果你按季度工作,十个星期的话,你应该花一个星期来规划。我们按六个月的周期工作,所以我们试着花两周来规划,大致上还算成功。这个嘛……其实当我们谈到组织模型时,要提一下 Airbnb 的 Brian Chesky,他实际上是最早采用这些更反传统组织模型的人之一,我觉得。他比硅谷大多数人更苹果式。他也是按六个月周期工作的。他在这方面也有很多经验。所以这就是 10% 规划时间。我认为如果你发现自己规划的比这多得多,要么是你规划太多了,要么是你的执行周期对于那个规划量来说太短了。这是一个经验法则,但我觉得它很管用。
作为领导者如何带来能量与清晰度
Lenny (01:03:53):
我问了几位 PM 我应该问你什么——是在 Spotify 工作的 PM,我之前没跟你说过。有人指出你总是能给一个房间带来很多能量和清晰度。这是他们认为你非常擅长的事情。关于这件事的重要性,或者作为领导者如何做好这件事,你学到了什么?
Gustav Söderström (01:04:11):
嗯,听到这个真好。我不知道这件事,所以我在想该怎么回答。关于能量,我不知道。我想我就是对我做的事情很兴奋。我一直对技术很兴奋。我喜欢看到新东西。我的核心驱动力仍然是这种感觉——你看到一个还不存在的东西,你会有共鸣,你会想:“哇,不知道那能不能实现。那该多酷。“然后为了让人们去做这件事,你试着分享那种兴奋。所以我觉得我没办法对我自己不兴奋的事情带来很多能量。所以我必须做我真正相信的、让我兴奋的事情。也许这样能量就来得更自然。对我来说幸运的是,到目前为止,Spotify 一直处于一个允许大量创新的阶段,甚至有人要求我尝试做新的酷东西。
Gustav Söderström (01:05:09):
也许在一个纯粹的优化阶段,我的能量会少一些。关于清晰度,我一直喜欢尝试解释东西。一个众所周知的事实是,理解某件事最好的方式就是试着把它解释给别人听。所以我到处跟没有主动问我的人解释各种东西,不是为了显得聪明,而是为了检验我是否真的理解了。也许就是这种练习。顺便说一下,我确实会要求为我工作的领导者,并让他们也要求他们的下属,始终解释自己的决策理由。我觉得当我们……我们之前谈到过自主权之类的,我们不承诺所有人都要同意,但我认为我们应该向所有员工承诺的是,即使他们不同意,他们也应该有权理解你为什么做出这个决定。
Gustav Söderström (01:06:06):
我认为不可接受的是说,“不,我们要这样做,因为我资历更深。这种情况我见过很多次了。你还不够聪明。“诸如此类。我认为你必须解释自己的理由,所以你有义务给出解释。而且我发现这很有价值,回到刚才说的,理解一件事的唯一方式就是解释它,因为通常结果会是,如果你自己都解释不了,那你大概自己也没有真正理解。有时候我认为可能存在一种情况:你有好的产品直觉,但表达不出来。但大多数时候,当人们说”有什么东西在那儿”但又解释不了的时候,他们其实自己也没有理解。而且很多时候,那里其实什么都没有。另外,如果你作为产品人能把一件事解释清楚,这个知识就被共享了。所以这对整个组织来说效率会高得多。所以有时候我会故意激一下人们,说……当人们问这有多少是艺术、多少是科学时,我会说,“0% 的艺术,0% 的魔法,100% 的科学。“这是因为我想要逼人们去试着解释清楚。我觉得我们使用”艺术”和”魔法”这样的词。我们历史上一直在用”艺术”和”魔法”来称呼任何我们还无法解释的东西。
Gustav Söderström (01:07:33):
遗传学曾经是魔法和艺术,直到它变成了科学。量子物理学曾经是魔法,直到它变成了科学。而最近,实际上,智能和创造力曾经是艺术和魔法,直到它变成了大型语言模型中的统计学。所以我觉得我在推动人们说,“你确定你能解释这件事吗?“因为这会迫使人们去深入思考。所以也许我就是喜欢这样做,并且试着把它强加给别人。所以也许这就是为什么人们觉得我有时候能带来清晰度。
Lenny (01:08:07):
我很喜欢这一点。沿着这个方向有一个问题,你有没有推荐的系统或方法来解释事情?是把它写在一个文档里吗?是用某种特定的风格来解释,还是说顺其自然就好?
Gustav Söderström (01:08:20):
我以前什么都写下来,然后写了又写,不断改,让它越来越精炼。这个方法对我来说是有效的。我现在不像以前写那么多了。现在,我倾向于一边走一边在脑子里自己跟自己对话。我实际上的做法是……我发现这对不同的人是不同的,很多人想要跟别人碰撞想法,那是他们思考的方式。你一遍又一遍地重复同一件事,然后从中得到一些反馈。所以我以前写了很多。有时候当我想更好地理解一个想法时,我还是会写。在我人生的某个阶段,我很想写点真正的东西,比如一本书之类的。但我现在越来越多做的事情是,我跟同级或者向我汇报的人做一对一沟通,我就戴上 AirPods,做一个分布式的边走边聊。
Gustav Söderström (01:09:12):
两个人都在走,但在不同的地方,花一个小时讨论某个事情。这实际上被证明非常、非常有效。这样你就获得了不孤军奋战的优势,你有比自己更多的脑力。我觉得这方面没有很强的进化论证据,但确实有迹象表明你在走路的时候思考得更好,无论是因为你在给大脑供氧,还是因为其他什么进化原因,我不确定。但我发现边走、边说、边思考其实非常有效,即使你们不在同一个地方,只是通过 AirPods 也一样。是疫情迫使我们这样做的。我本来以为疫情期间我们的创造力会下降,战略制定会受到影响,但我发现了相反的情况。我们做的这类事情比以往任何时候都多,我开始思考为什么,我觉得就是因为我们做了所有这些边走边聊。
Lenny (01:10:07):
你刚才提到想有一天写本书。你觉得你的书会是关于什么的?
Gustav Söderström (01:10:11):
我不知道。不知道。从统计上来看,它大概是关于我做了很多的事情,所以一定跟技术或产品之类的有关。但我很想写点虚构类的。那会很有趣。
Lenny (01:10:26):
哦,天哪。一上架我就预购。我想触及的另一个概念是另一位产品经理提出的,他称之为”裤子里的 P”这个比喻。你有印象吗?聊聊这个有趣吗?
Gustav Söderström (01:10:40):
我不太确定这个人指的是哪一次,但我知道我用过那个比喻好几次。
Lenny (01:10:49):
好的。有希望。
Gustav Söderström (01:10:50):
我不知道这是不是一个瑞典的比喻,因为我以为它更广为人知。但它的意思是,你做了一件事……这个说法是这样的,就像在冷天尿在裤子里。一开始感觉很暖和很舒服。然后过了一会儿,你就开始后悔了。本质上就是说目光短浅。所以现在我就说这就是裤子里面撒尿,因为人们知道我的意思。这是一个短期行为。
Lenny (01:11:21):
这是一种表达那个想法的极其搞笑的方式。肯定是瑞典人的做法。
Gustav Söderström (01:11:25):
是的,我觉得瑞典人因为某种原因会这样做,显然其他人不会。
Lenny (01:11:30):
可能是因为一年中很多时候都很冷。
Gustav Söderström (01:11:33):
是的。大概就是这个原因。这是一个寒冷气候下的谚语。在温暖的地方它没用。没人知道你在说什么。
《继承之战》与瑞典文化
Lenny (01:11:39):
说到瑞典,你看《继承之战》吗?
Gustav Söderström (01:11:42):
看的。
Lenny (01:11:43):
好的。瑞典在剧中成了一个很大的部分,特别是那家试图……我不想剧透,但有一个很重要的角色。对,没错。是瑞典人。所以我很好奇,你觉得他们描绘瑞典文化和瑞典商业方式怎么样?
Gustav Söderström (01:11:59):
作为一个瑞典人看到这些觉得非常有趣。而且我想,首先,任何一个被超级高、身材健硕、帅气逼人的 Alexander Skarsgård 所代表的人或国家,大概都应该相当满意。所以这点很好。然后有一集他们在挪威,在不剧透太多的情况下。
Lenny (01:12:25):
对。
Gustav Söderström (01:12:26):
有一些元素是很真实的。有很多,我觉得是来自一个叫 Fjällräven 的瑞典品牌的付费品牌植入,这个名字我觉得是北极狐的意思,这确实是瑞典非常流行的户外品牌。所以那是真实的。桑拿之类的东西也是真实的。所以它是真的,但被夸张了。实际上,不太真实的是他的谈判风格。瑞典人倾向于严肃、谨慎,而这个家伙更像是个玩家。所以从谈判策略的角度来看,他不是典型的瑞典商人,我觉得。
Lenny (01:13:11):
对,这让我不像你描述的那种,在瑞典人们围坐成一圈,没有人在中心。
Gustav Söderström (01:13:15):
对,没错。他非常在中心。
Lenny (01:13:19):
然后人们去桑拿的时候,就像念咒语一样,sauna,sauna [听不清 01:13:23]。
Gustav Söderström (01:13:23):
没错。
Lenny (01:13:24):
最后一集。
Gustav Söderström (01:13:25):
很棒的剧。我很喜欢。
Lenny (01:13:26):
我很喜欢。这一季太疯狂了。我太好奇最终会怎样了。也许在进入非常刺激的快问快答之前最后一个问题。Spotify 现在是对我来说也是全球最大的播客平台了,我很喜欢用它。它运行得很好。我很好奇 Spotify 的下一步是什么,特别是 Spotify 播客的下一步。
Gustav Söderström (01:13:48):
这要从两个方面来看:针对 Spotify 创作者和 Spotify 听众。对于 Spotify 创作者来说,有两件事。其一,就是我们在 Stream On 上谈到的——我们也讨论过音乐发现,但播客面临的是同样的问题,而且更难。所以我们仍然非常专注地帮助播客创作者找到更多受众。这……就像我说的,在播客领域打破你的习惯和泡沫是一个更大的问题。找到一个新播客需要很大的投入。所以我认为,这是我们能够也应该做好的事情。因此我们持续在这方面大力投入。正如我所说,随着我们陆续推出更多功能,你会看到更多。
Gustav Söderström (01:14:37):
创作者的另一个大需求是变现,现在你可以通过 DEI 和 Spotify SEI 等多种方式变现。但我们正在努力扩展和改进,因为行业正在走向成熟,我认为这是最大的需求之一,也是我们能为创作者做的最大的事情——帮助他们更好地变现,实际上包括免费和付费两个方面。我们也有付费播客。这是创作者这边的情况。
Gustav Söderström (01:15:07):
在消费者方面,我不想透露太多。我们已经展示了在发现方面的大量投入。我想保留一些秘密等到推出时再说,但我们确实在用户体验本身上投入很大。我认为目前的体验还远未达到理想状态。有一件事我可以分享,那就是我们正在大量投入的是跨设备的无处不在性和播放体验——在车上以及所有我们在音乐方面已经做得很好的场景。但我认为收听体验可以变得更加无缝。搜索可以变得更好。关于播客的数据以及……好吧,我不想说太多,但如果看 AI 和生成式技术,还有很多可以做的事情。
快问快答
Lenny (01:15:52):
好的,那我就先收下这些。接下来,我们进入了非常刺激的快问快答环节。我为你准备了六个问题,Gustav。准备好了吗?
Gustav Söderström (01:16:01):
我想我准备好了。来吧。
Lenny (01:16:03):
好,我们试试看。你向别人推荐最多的两三本书是什么?
Gustav Söderström (01:16:08):
好的。这就是为什么我试图把七本塞进两三本里。如果从产品方面开始,我觉得这本书大家可能都知道,但我会推荐做产品的人去读的是 Hamilton Helmer 的《7 Powers》,Netflix 大量使用了其中的理念,我们也用了很多。如果你刚起步,有一个战略框架是很好的。没有哪个战略框架是完全正确的,但有一个总比没有好。
Gustav Söderström (01:16:31):
另一本在心智模型和框架领域的,是 Charlie Munger 的《The Complete Investor》。是的,它是关于投资的,但本质上是他使用的一系列心智模型。我觉得关键要点是,当你面对一个问题时,你应该总是套用三个不同的模型,因为模型的作用是简化和降维。世界大概有无限个维度,而模型把它们缩减到也许三四个。这样做的风险在于,你可能恰好丢掉了一个非常重要的维度,比如大流行病之类的。但如果你使用三个以不同方式降维的模型,而且从统计学上讲,它们得出相同的结论——即使只是套用第二个模型,也能大幅提高你判断正确的概率。所以这是一本很值得读的书。
Gustav Söderström (01:17:24):
然后如果跳出产品领域,我对科学和数学非常感兴趣。所以快速列几本。《The Mystery of the Aleph》,一本很棒的书。Sean Carroll 的《Something Deeply Hidden》,关于量子力学的诠释。Carlo Rovelli 的《Helgoland》,关于量子力学的关系诠释。David Deutsch 的《The Beginning of Infinity》和《The Fabric of Reality》。Donald Hoffman 的《The Case Against Reality》,讲进化与真理——进化并不优化对真相的感知,只优化适应性。还有《Gödel’s Proof》,我觉得是一本关于他的不完备定理的了不起的书——在任何公理系统中,都会存在永远无法被证明的真命题,这是一件很值得深思的事。最后一本,也许也是我的最爱之一,是 Paul Davies 的《The Demon in the Machine》,这本书可能没那么知名,讲的是信息本质上就是熵,以及”信息引擎”的概念——你可以仅凭信息来驱动某些东西,而排放出来的也是信息。这清单可不短啊。
Lenny (01:18:39):
我正想说你创下了推荐书目数量的纪录,但这也说明了你是如何通过阅读这样的书变得如此有洞察力和智慧的。所以如果有人想要达到你现在的高度,我想这就是启示。
Gustav Söderström (01:18:55):
下一个关于艺术家的问题我会简短得多,我保证。
Lenny (01:18:57):
没事没事。我们有时间。好,下一个问题。最近最喜欢的电影或电视剧是什么?
Gustav Söderström (01:19:03):
我们刚聊过《继承之战》,那确实是最近的最爱。那我就随便挑一部不算最近的、但绝对是最爱的——Halt and Catch Fire,应该是在 FX 播出的。很棒的剧。如果你在科技行业工作过,它从 80 年代的”硅草原”讲起,一直跟到现在。非常精彩的剧。
Lenny (01:19:24):
Halt and Catch Fire。对,我看过一些。我其实后来没追下去,但这是个很好的提醒,让我回去看看。
Gustav Söderström (01:19:29):
得回去看。
Lenny (01:19:30):
我会回去看的。你最近喜欢问的面试问题是什么?
Gustav Söderström (01:19:34):
我自己不问这个问题,但我最喜欢的问题是 Lex Fridman 结尾时的那个小问题,通常是类似于——所以这一切的意义是什么?我喜欢这个问题。很难得到好的回答。
Lenny (01:19:46):
我好想问你,但是——
Gustav Söderström (01:19:48):
别,别问。
Lenny (01:19:50):
好的,我们继续。那留到下次——我们下次再来一轮。
Gustav Söderström (01:19:53):
好的。
Lenny (01:19:55):
你最近发现的、让你爱不释手的产品有哪些?
Gustav Söderström (01:19:58):
最明显的是 ChatGPT GPT-4,用它玩一玩,给自己创建各种机器人来做不同的事情等等。但我觉得这对所有人来说可能都一样。另一个非常喜欢的是你写过也聊过的——Duolingo,我觉得无论是从产品角度还是执行层面和他们的成果来看,都非常令人印象深刻。而且在我家里也被疯狂使用。我们有家庭账号,每个人每天都在用,每天都在竞赛。所以我既对这个产品印象深刻,自己也大量使用。
Lenny (01:20:35):
你家里人在学什么语言?
Gustav Söderström (01:20:38):
在我家,现在是西班牙语。
Lenny (01:20:41):
学得怎么样?
Gustav Söderström (01:20:41):
Bien.
Lenny (01:20:44):
给你一颗金星。
Gustav Söderström (01:20:47):
我才几千 XP,还没那么厉害。
Lenny (01:20:51):
不,我不知道这算不算好。听起来还不错。下一个问题,你在产品开发流程中做了什么相对较小的改变,却对团队执行力产生了巨大的影响?
Gustav Söderström (01:21:01):
我不确定自己做过什么小的改变却产生了巨大的影响。通常,要产生大的影响需要更大的举措。我觉得可能有一件事——回到之前说的清晰性等等——就是我一直大力推动的所谓苏格拉底式辩论,核心理念显然是让最好的想法胜出,而不是最有资历的人的想法。我努力推动一个观念,就是要求人们解释自己的观点,而不是说”我觉得那里有点什么”或者”我有一种感觉”之类的。显然,正如你所说,这确实产生了一些影响,因为人们似乎会这样评价我。所以这可能是我做的最重要的一件事。
Spotify 产品团队的有趣仪式
Lenny (01:21:53):
最后一个问题,Spotify 产品团队有什么有趣的仪式吗,是桑拿吗?
Gustav Söderström (01:21:59):
Spotify 现在规模很大了,所以实际上不同部门的仪式各不相同,相当本地化。很多年前,大概 12 年前,我无意中创造了一个仪式,当时我们在讨论产品处于哪个阶段。我们需要一些定义,所以我随口说了,“嗯,有四个阶段:想出来、做出来、推出去、调到位。“(think it, build it, ship it, tweak it。)在”想出来”阶段,成本应该很低,不花太多钱。到了”做出来”阶段,你会开始花很多钱,所以你必须在”想出来”阶段就把风险降下来,确保自己是对的。然后是”推出去”阶段,再进入”调到位”阶段。这其实并不是经过深思熟虑的东西,但有意思的是,我有时候甚至从其他公司也能听到,“哦,我们正处于想出来阶段”或者”我们正处于调到位阶段”。所以它流传开来了。我不知道它是不是很好,但它就是流传开来了。
Lenny (01:22:57):
确实朗朗上口。我觉得任何能在人们脑海中扎根的东西都是成功的。Gustav,非常感谢你来参加节目。我们连续两期都是瑞典嘉宾了。Gustaf,带 F 的那个 Alströmer,之前也上过播客。
Gustav Söderström (01:23:10):
他也是一位非常了不起的人。
Lenny (01:23:12):
确实是非常了不起的人。我非常羡慕那些能和你共事、为你工作的人。再次感谢你来参加节目。最后两个问题,如果大家想了解更多信息或者联系你、提问题,可以在哪里找到你?
Gustav Söderström (01:23:23):
[听不清 01:23:23] @GustavS。
Lenny (01:23:24):
好的,再说一次。
Gustav Söderström (01:23:28):
@GustavS。
Lenny (01:23:29):
很好。最后一个问题就是,听众怎样才能帮到你?
Gustav Söderström (01:23:33):
直接联系我就好。我确实会看反馈,我会试着忽略那些愤怒的评论,去理解他们到底在想什么,为什么生气,或者什么东西不好用。
Lenny (01:23:44):
那关于联系方式,你推荐愤怒地 @你一条推文,还是发邮件到你分享的那个邮箱地址?
Gustav Söderström (01:23:50):
@GustavS 是我的 Twitter 账号,直接 @我就行。
Lenny (01:23:54):
好的。
Gustav Söderström (01:23:56):
你也可以友善一点。
Lenny (01:23:57):
好的。
Gustav Söderström (01:23:57):
没关系的。
Lenny (01:23:58):
太好了。Gustav,非常感谢你来参加节目。
Gustav Söderström (01:24:01):
谢谢你邀请我,Lenny。很荣幸。
Lenny (01:24:03):
大家再见。
结尾
Lenny (01:24:05):
非常感谢收听。如果你觉得这期节目有价值,可以在 Apple Podcast、Spotify 或你喜欢的播客应用上订阅。也请考虑给我们评分或留下评论,这真的能帮助更多听众发现这个播客。你可以在 lennyspodcast.com 找到所有往期节目或了解更多关于这个节目的信息。下期再见。
术语表
| 原文 | 中文 |
|---|---|
| acquisition funnels | 获客漏斗 |
| Alex Nordström | 保留原文(Spotify 联合总裁) |
| Avicii | 艾维奇(瑞典 DJ、音乐制作人) |
| Brian Chesky | 保留原文(Airbnb CEO 兼联合创始人) |
| Charlie Munger | 保留原文(投资家、Berkshire Hathaway 副董事长) |
| ChatGPT | 保留原文(原文转录为 GBT,实指 ChatGPT) |
| Chris Dixon | 保留原文(科技投资人、a16z 合伙人) |
| contrarian | 反传统的 |
| Dall-E | 保留原文(AI 图像生成工具) |
| Daniel Ek | 保留原文(Spotify CEO 兼联合创始人) |
| DAW (digital audio workstation) | 数字音频工作站 |
| dead clicks | 无效点击 |
| diffusion models | 扩散模型 |
| Discord | 保留原文(通讯平台) |
| discovery | 发现(与召回相对,指推荐用户未知的新内容) |
| Drake | 德雷克(加拿大说唱歌手) |
| dud | 次品(指推荐中不符合用户期望的曲目) |
| Duolingo | 保留原文(语言学习应用) |
| EDGE networks | EDGE 网络 |
| EDM (electronic dance music) | 电子舞曲 |
| Eppo | 保留原文 |
| escape hatch | 退出通道 |
| false negative | 假阴性 |
| fault-tolerant user interfaces | 容错用户界面 |
| Gustav Söderström | 保留原文(Spotify 首席研发官) |
| Halt and Catch Fire | 保留原文(FX 电视剧) |
| Hamilton Helmer | 保留原文(投资 strategist、《7 Powers》作者) |
| hard APIs | 硬 API(强制要求团队暴露和结构化维护的技术接口) |
| hit rate | 命中率 |
| Jeff Bezos | 保留原文(Amazon 创始人) |
| Lenny | 保留原文(播客主持人) |
| Lex Fridman | 保留原文(播客主持人) |
| Microsoft Clarity | 保留原文 |
| Midjourney | 保留原文(AI 图像生成工具) |
| MVP (minimum viable product) | 最小可行产品 |
| north star metrics | 北极星指标 |
| rage clicks | 愤怒点击 |
| recall | 召回(推荐系统中指用户已知内容的推荐/回访) |
| Reggaeton | 雷鬼顿(拉丁音乐流派) |
| Shishir Mehrotra | 保留原文(Spotify 董事会成员,曾任 YouTube 高管) |
| smiling curve | 微笑曲线 |
| squads | 小队(Spotify 曾采用的团队组织模式) |
| Stable Diffusion | 保留原文(AI 图像生成模型) |
| Stream On | 保留原文(Spotify 年度发布活动) |
| strong opinions loosely held | 坚定观点,松散持有 |
| subfeeds | 子信息流 |
| Succession | 《继承之战》(HBO 剧集) |
| taste bubble | 品味泡沫 |
| The Weeknd | 威肯(加拿大歌手) |
| tribes | 部落(Spotify 曾采用的团队组织模式) |
| two-pizza teams | 两个披萨团队(Amazon 提倡的小型团队理念) |
| zero intent use case | 零意图用例 |
此文档由 AI 分片翻译(translate_long_document)