How to measure AI developer productivity in 2025 | Nicole Forsgren

Nicole Forsgren 2.0 2025-10-19

Full Interview Transcript

Lenny Rachitsky: A lot of companies are trying to measure productivity for their teams.

Nicole Forsgren: Most productivity metrics are a lie. If the goal is more lines of code, I can prompt something to write the longest piece of code ever. It’s just too easy to game that system.

The Lies of Productivity Metrics

Lenny Rachitsky: How do I know if my eng team is moving fast enough, if they can move faster, if they’re just not performing as well as they can?

Nicole Forsgren: Most teams can move faster. But faster for what? We can ship trash faster every single day. We need strategy and really smart decisions to know what to ship.

Introducing Our Guest Today

Lenny Rachitsky: One of the biggest issues we’re going to probably have with AI is learning how much to trust code that it generates.

Nicole Forsgren: We can’t just put in a command and get something back and accept it. We really need to evaluate it. Are we seeing hallucinations? What’s the reliability? Does it meet the style that we would typically write?

The Main Interview Begins

Lenny Rachitsky: So much of the time is now going to be spent reviewing code versus writing code.

Nicole Forsgren: There’s some real opportunity there to not just rethink workflows, but rethink how we structure our days and how we structure our work. Now, we can also make a 45-minute work block useful because getting into the flow is actually kind of handed off, at least in part, to the machine, or the machine can help us get back into the flow by reminding us of context and generating diagrams of the system.

What is DevEx Exactly

Lenny Rachitsky: What’s just one thing that you think an eng team, a product team can do this week, next week to get more done?

Nicole Forsgren: Honestly, I think the best thing you can do-

Flow State and AI Impact

Lenny Rachitsky: Today, my guest is Nicole Forsgren. With so much talk about how AI is increasing developer productivity, more and more people are asking, “How do we measure this productivity gain? And are these AI tools actually helping us or hurting how our developers work?” Nicole has been at the forefront of this space longer than anyone. She created the most used frameworks for measuring developer experience called DORA and SPACE. She wrote the most important book in the space called Accelerate and is about to publish her newest book called Frictionless, which gives you a guide to helping your team move faster and do more in this emerging AI world. Her core thesis is that AI indeed accelerates coding. But developers aren’t speeding up as much as you think because they still have to deal with broken builds and unreliable tools and processes, and a bunch of new bottlenecks that are emerging.

In our conversation, we chat about her current best and very specific advice for how to measure productivity gains from AI, signs that your team could be moving faster, what companies get wrong when trying to measure engineering productivity, how AI tools are both helping and hurting engineers, including getting into flow states, her seven-step process for setting up a developer experience team at your company, how to get buy-in and measure the impact of a team like this, and a ton more. This episode is for anyone looking to improve the performance of their engineering teams. If you enjoy this podcast, don’t forget to subscribe and follow it in your favorite podcasting app or YouTube. It helps tremendously. Also, if you become an annual subscriber of my newsletter, you get a year free of 15 incredible products including Lovable, Replit, Bolt, n8n, Linear, Superhuman, Descript, Wispr Flow, Gamma, Perplexity, Warp, Granola, Magic Patterns, Raycast, ChatPRD, and Mobbin. Head on over to lennysnewsletter.com and click product pass. With that, I bring you Nicole Forsgren.

Whether you’re a seed-stage startup trying to land your first enterprise customer or a unicorn expanding globally, WorkOS is the fastest path to becoming enterprise-ready and unlocking growth. They’re essentially Stripe for enterprise features. Visit workos.com to get started or just hit up their Slack support where they have real engineers in there, who answer your questions super fast. WorkOS allows you to build like the best with delightful APIs, comprehensive docs, and a smooth developer experience. Go to workos.com to make your app enterprise-ready today.

Nicole, thank you so much for being here and welcome to the podcast.

Nicole Forsgren: Thank you. It’s so good to be here.

Measuring Productivity and Common Pitfalls

Lenny Rachitsky: It’s so good to have you back. I was just watching our first episode, which we did two and a half years ago. I was watching it, and I was both shocked and not shocked that we barely talked about AI. The episode was called How to Measure and Improve Developer Productivity, and we got to AI barely like an hour in and we’re just like, “Hmm, I wonder what’s going to happen with AI and productivity.” Does that just blow your mind?

New Contexts for Code Survival Metrics

Nicole Forsgren: Yeah. Because it was just hitting the scene, it was the topic of so much conversation, and at the same time, so many things don’t change. So many things are still important, so many things are the same. Yeah. It’s also a little wild that it’s been two and a half years. Where does time go? Time is a social construct?

Understanding DORA Metric Boundaries

Lenny Rachitsky: Yeah. Most of our conversation was just questions like, “Well, how might this impact people? How will we change the way we build product?” It was barely a thing back then. Now, it’s the only thing that I imagine people want to talk about when they talk about engineering productivity. That’s where we’re going to be spending a lot of our time focusing on today. The reason I’m excited about this conversation, it feels like there’s been so much money poured into AI tools increasing productivity. The fastest growing companies in the world are these engineering AI tools. And now, more and more people are just asking this question of just, “What gains are we getting out of this? How much is this actually helping us be more productive? How do we become more productive?”

You’ve been at the center of this world for longer than anyone. You’ve invented so many of the frameworks that people rely on now. So I’m really excited to have you back to talk about this stuff. I want to start with just this term DevEx, it’s something that comes up a lot in this whole space, and we’re going to hear this term a bunch in this conversation. Can you just explain what is DevEx, this term DevEx?

SPACE Framework in the AI Era

Nicole Forsgren: DevEx is developer experience. And when we think about developer experience, we’re really talking about what it’s like to build software, day to day, for a developer. So the friction that they face, the workflows that they have to go through, any support that they have. It’s important because when DevEx is poor, everything else just isn’t going to help. The best processes, the best tools, the best… whatever magic you have, if the DevEx is bad, everything kind of takes-

Trust Issues and Code Reviews

Lenny Rachitsky: Within DevEx is productivity, and I think the key insight that you and other folks in the space had is that it’s not just productivity, but there’s also engineering happiness. We’re going to get into a lot of these parts, but just maybe speak to… there’s productivity and there’s broader components to engineers being successful at a company.

Nicole Forsgren: Yeah. I love that point because productivity, first of all, is hard to define anyway. But if you’re just looking at output, you can get there in a lot of different ways. But if you’re getting there in ways that are high toil or high friction, then at some point, a developer is going to burn out. Or if it’s super high cognitive load, if it’s hard to even think about what you’re doing because you’re concentrating on the mechanics of… the plumbing of something, then you don’t have the brain space left to come up with really innovative solutions and questions. So I love that it’s kind of this self-reinforcing loop in terms of, “You do more work, you do better work.” And it’s better for people, it’s better for the systems, it’s better for our customers.

Deep Work and Attention Restructuring

Lenny Rachitsky: I was going to get to this later, but I want to actually get to this right now, this idea of flow state for engineers. I was an engineer, actually, early in my career. I went to school for computer science. I was an engineer for 10 years. The best part of the job for me was just this flow state you enter when you’re coding and building, and things just feel so fun. It feels like AI is making that harder in a lot of ways because there’s all these agents you’re working with now, there’s all this code that’s kind of being written for you. Talk about just the importance of flow state to developer happiness, developer productivity, and just how you’ve seen AI impacting that.

Nicole Forsgren: Well, there are lots of different ways to talk about DevEx. One way to talk about it is kind of three key things that have components that are important of themselves, and they also kind of reinforce each other. Flow state is one of them, cognitive load is another, and then feedback loops are another. I think when you touch on this… Your question about flow state is a really good one, and I’ll admit we’re just a few years into this. We’re still figuring out what the best flow state and cognitive requirements are for people in this because, to your point, sometimes we’re getting interrupted all the time. You don’t just get in the flow and lock down, and write a whole bunch of code and do the typing of a whole bunch of code as much anymore. Instead, you’re kind of creating a prompt, getting some code back and reviewing the code, trying to integrate what’s happening in the system, and that can really interrupt.

At the same time though, it can contribute to flow if… I’ve seen some senior engineers pull together some tool chains that are really incredible, where they figured out how to keep the flow going. The fast feedback loops really, really work well for them. They can kind of assign out different pieces to agents. It helps them keep in the flow in terms of… Instead of details and line-by-line writing, they’re in the flow in terms of, “What’s my goal? What are the pieces that I need to get there? How quickly can I get there? So then, I can step back and kind of evaluate everything, and then dive back in and fix some pieces.”

Engineers Becoming AI Managers

Lenny Rachitsky: Is there anything more you could say about this engineer that figured out this really cool workflow, about just what that looks like?

Nicole Forsgren: I’ve spoken with a handful of them, and I’ve kind of watched them work. I haven’t built it myself yet. It’s on my list. They’ve been able to set up this really incredible workspace and workflow where… Right now, a lot of us play around with tools and… We’ll put in a prompt and we’ll get a few lines back or maybe we’ll put in a prompt and we’ll get whole programs back. Well, what they can do is they can… Many times I’ll see them say, to help prime it, “This is what I want to build. It needs to have these basic architectural components. It needs to have this kind of a stack. It needs to follow this general workflow. Help me think that through,” and it’ll kind of design it for them. And then for each piece, it’ll assign an agent to go work on each piece in parallel, and then they’ll say upfront, “These need to be able to work together, make sure it’s architected correctly. Make sure we use appropriate APIs and conventions.”

Then at the end, they can let it run for a few minutes. They can think through something else that’s interesting or they anticipate is going to be hairy, and they come back to something that’s probably a little better than vibe coded. Because they were so systematic about it upfront, they’re much closer to something that looks like production code.

Why Companies Must Prioritize DevEx

Lenny Rachitsky: So what I’m hearing is spending a little time upfront planning, what all these AI engineers are doing, versus just powering through and just figuring out as you go.

Nicole Forsgren: Yeah.

One Actionable Step for This Week

Lenny Rachitsky: Okay, cool. Let me get to this quite core question that I think is on a lot of people’s minds. A lot of companies are trying to measure productivity for their teams, “Is this improving our productivity? Is this hurting our productivity?” So let me just start with this question, how are people doing this wrong currently when they try to measure their productivity gains with AI?

Nicole Forsgren: I’ll say most productivity metrics are a lie. It’s really tricky because, historically… Now, look, lines of code has always been a bad metric, but many folks still use lines of code-

The Most Common DevEx Improvement

Lenny Rachitsky: [inaudible 00:12:37].

Nicole Forsgren: … yeah, as some proxy for output or productivity or complexity or something. Well, now, for many of the systems that use lines of code, which people would sometimes whisper about and not really talk about, it’s just blown out of the water because, “What do you mean by lines of code?” If the goal is more lines of code, I can prompt something to write the longest piece of code ever and add tons of comments. We know that agents and LLMs tend to be very verbose by definition, and so it’s just too easy to game that system and then introduce complexity and technical debt into all of the work that you’re doing. I will say there are some things that we can kind of watch and pay attention to because… So lines of code as a productivity metric isn’t great, it’s pretty bad. But now, it’s kind of more relevant if we can tease out which code came from people and which code came from AI because now we can answer downstream questions.

“What is the code survivability rate? What is the quality of our code? Is our code being fed back into trained systems? And for that code that’s retraining systems later, especially if we’re doing fine-tuning and local tuning, how much of that is machine generated? What types of loops is that creating, and what types of patterns or biases might it be inadvertently introducing?” On the one hand, it’s not good as a productivity metric, but it can be useful. I’ll even say the same for DORA. I’ve done the DORA metrics: there are speed metrics, there are stability metrics. If that’s all you’re looking at, it’s not going to be sufficient anymore because AI has now changed the way we think about feedback loops. They need to be much faster. Now, what DORA’s meant for, kind of assessing the pipeline overall in terms of speed and stability, that still works. But we can’t just blindly apply the existing metrics we’ve used before because we’ll miss super important phenomena and changes in the way people work.
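The transcript names "code survivability rate" without defining it. As a minimal sketch, assume it means the share of added lines still present in the codebase after some chosen period, split by whether a person or an AI tool wrote them. The `Change` record, its field names, and the sample numbers below are all hypothetical, and the sketch assumes authorship can actually be attributed.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Change:
    # Hypothetical record; real data would come from version-control history.
    origin: str           # "human" or "ai" (assumes authorship is attributable)
    lines_added: int
    lines_surviving: int  # lines still present after the chosen period


def survival_rate_by_origin(changes: list[Change]) -> dict[str, float]:
    """Share of added lines that are still present, grouped by origin."""
    added = defaultdict(int)
    surviving = defaultdict(int)
    for c in changes:
        added[c.origin] += c.lines_added
        surviving[c.origin] += c.lines_surviving
    # Skip origins with no added lines to avoid dividing by zero.
    return {o: surviving[o] / added[o] for o in added if added[o] > 0}


# Made-up sample data, for illustration only.
changes = [
    Change("human", 100, 80),
    Change("ai", 400, 200),
    Change("ai", 100, 50),
]
rates = survival_rate_by_origin(changes)
print(rates)
```

Splitting by origin is the point here: a single blended rate would hide exactly the human-versus-AI difference the downstream questions are about.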

How to Know if Teams Are Fast

Lenny Rachitsky: Interesting. You invented DORA, that was kind of the main framework people used for a long time to measure productivity. And then there’s SPACE, there’s Core 4, there’s probably others. So what I’m hearing here is all these are kind of out of date now, where AI is contributing large portions of code.

Integrating AI into Engineering Strategy

Nicole Forsgren: I will say if it is a prescriptive metric, it needs to be used only in the way it was prescribed.

How Much Does AI Boost Productivity

Lenny Rachitsky: So

Nicole Forsgren: DORA 4, there are four key metrics. There’s two speed metrics, deployment frequency and lead time. So code commit to code deploy. There’s stability metrics, MTTR and change fail rate. If those are used to assess the speed of the pipeline and the general performance of the pipeline, that’s great. If you’re trying to use those to understand… Because implied in that is feedback loops, right, because you used to kind of get feedback from customers. But we can’t just use that blindly now when we’re using AI, as an example, because we have feedback loops much earlier and not even just at the local build and test phase. We have feedback loops throughout, and even sometimes in the middle of some of the pipeline, that we really want to leverage in ways that weren’t as useful before. I won’t say they weren’t possible, but we just didn’t really focus there.
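The four metrics listed above can be sketched as a small calculation. This is an illustration only, assuming a hypothetical `Deployment` record with commit, deploy, and restore timestamps; the field names, the use of a median for lead time, and the sample data are assumptions, not something specified in the conversation.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import median
from typing import Optional


@dataclass
class Deployment:
    # Hypothetical record; real data would come from CI/CD and incident tooling.
    committed_at: datetime
    deployed_at: datetime
    failed: bool = False
    restored_at: Optional[datetime] = None  # set when a failed deploy was recovered


def dora_metrics(deploys: list[Deployment], window_days: int) -> dict:
    """Summarize the two speed and two stability metrics over a time window."""
    if not deploys:
        raise ValueError("need at least one deployment")
    lead_times = [d.deployed_at - d.committed_at for d in deploys]
    failures = [d for d in deploys if d.failed]
    restores = [d.restored_at - d.deployed_at for d in failures if d.restored_at]
    return {
        # Speed: how often we deploy, and time from code commit to code deploy.
        "deployment_frequency_per_day": len(deploys) / window_days,
        "median_lead_time": median(lead_times),
        # Stability: share of deploys that failed, and time to recover from them.
        "change_fail_rate": len(failures) / len(deploys),
        "mean_time_to_restore": (
            sum(restores, timedelta()) / len(restores) if restores else None
        ),
    }


# Made-up sample data: four deploys over a two-day window, one of which failed.
t0 = datetime(2025, 1, 1, 9, 0)
deploys = [
    Deployment(t0, t0 + timedelta(hours=2)),
    Deployment(t0, t0 + timedelta(hours=4), failed=True,
               restored_at=t0 + timedelta(hours=5)),
    Deployment(t0, t0 + timedelta(hours=6)),
    Deployment(t0, t0 + timedelta(hours=8)),
]
metrics = dora_metrics(deploys, window_days=2)
print(metrics)
```

Everything here is measured at the deploy end of the pipeline, which is the limitation being described: feedback loops that happen earlier, such as while reviewing AI-generated code, do not show up in these numbers.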

So those are prescriptive metrics. When we think about SPACE, SPACE is a framework. It doesn’t tell you what metric to use. So I’ll say, sometimes people get real frustrated because I didn’t tell them what to measure. But now, I think that’s the power of it. We’re actually seeing that SPACE applies fairly well in these new emerging contexts like AI because we still want to look at… SPACE is an acronym. We still want to look at satisfaction. We still want to look at performance, what’s the outcome. We still want to look at activity. Yes, in some ways, lines of code and number of PRs can be useful for something, or number of alerts or number of things, activities or counts. C is communication and collaboration, this is also super important and useful because it’s how our systems communicate with each other, and also how our people do. “What proportion of work is being offloaded to a chat bot versus talking to a senior engineer on the team?” More isn’t always better and less isn’t always better, it depends.

And then efficiency and flow, “Can people get in the flow? How much time does it take to do things? What is the flow like through our system?” Here, I would probably add a couple of dimensions. So, chatting with some of the early authors, I’d say trust. Not to say trust wasn’t important before, but now it’s very, very front of mind. Right? Before, you build your code; if the compile comes back, you’re fine. And that’s the way it is. LLMs are non-deterministic, right? Now, we can’t just put in a command and get something back and accept it. We really need to evaluate it, so, “Are we seeing hallucinations? What’s the reliability? Does it meet the style that we would typically write? And if it doesn’t, is that fine?” So it depends on… Prescriptive. You got to make sure you’re using it fit for purpose. Right?

Evaluating AI Capabilities in Debugging

Lenny Rachitsky: We’re going to get to your current thinking on the best way to do this stuff. You have a book coming out that explains how to do this well, so we’re going to get to that. One thing I wanted to highlight in our last chat that we had, you highlighted that one of the biggest issues we’re going to probably have with AI is trust, understanding and learning how much to trust the code that it generates, and also how much… you said this, two and a half years ago, that so much of the time is now going to be spent reviewing code versus writing code. That’s exactly what I’m hearing.

Nicole Forsgren: I think it’ll be interesting to see how that impacts the way we structure work moving forward. We were talking about flow state and cognitive load. Now that our attention has to focus on things at certain times and it’s broken up from how we used to do it, I think there’s some real opportunity there to not just rethink workflows, but rethink how we structure our days and how we structure our work.

Introducing the New Book Frictionless

Lenny Rachitsky: Can you say more about that? Just what is that? What are you thinking will be happening? Where do you think things go? What are you seeing working?

Nicole Forsgren: This is purely speculative. But for example, Gloria Mark has done some really good work on attention and deep work, and humans can get about four hours of good deep work a day. That’s about it.

Discussing the DX Company Acquisition

Lenny Rachitsky: Yeah. I feel that.

Nicole Forsgren: That’s kind of the upper limit-ish for the most part, and I’m sure people are going to be like, “Well, I am superhuman and I can do-

Explaining the Seven-Step Framework

Lenny Rachitsky: What if you take 20 grams of creatine?

Nicole Forsgren: Right. What if we microdose?

Developer Experience Versus Developer Productivity

Lenny Rachitsky: Yeah, exactly.

Nicole Forsgren: Yeah. So in the context of knowing we have about four hours of good deep work… I’m sure many of us have probably hit this, right? We have good periods. Maybe it’s morning, maybe it’s afternoon for folks. And then you hit a time where you’re like, “I’m going to clean up my inbox because that is all I can do right now. I can be functional, but I’m not going to come up with my best innovative, problem solving, authoring, code writing work.” A lot of times, the way to do that and to get into it is to have these long chunks to get into flow and to get that deep work. Usually, I’m [inaudible 00:19:43] two hours-ish. An hour can be tricky because it could take time to get into that state. Okay. Well, when we think about what it used to be like, back in the old days, three years ago, three and a half years ago, we could block off four hours of time and we could probably get two or three hours of really good work done. Because we were just focused, right? There were no interruptions, minimal interruptions.

Now, the nature of writing code and systems itself is interrupt driven or full of interruptions, at least, because you start something and then it interjects. So how do we think about that? Does that mean that a four-hour work block is still useful? Probably. But does that mean that now we can also make a 45-minute work block useful? Because getting into the flow is actually kind of handed off, at least in part, to the machine, or the machine can help us get back into the flow by reminding us of context and generating diagrams of the system and all the things. So I think that’s a really, really interesting area that’s just ripe for questions and opportunity. And please, folks, do this research and come back to me because… It might not make my list, but it’s such a great question.

Local Versus Global Improvement Examples

Lenny Rachitsky: That is so interesting. Essentially, every engineer is turning into an EM, engineering manager, coordinating all of these junior AI engineers. So your point is even if you have a 30-minute block, you can’t get deep into code, but you can unblock all these AI engineers that are running off doing tasks. Plus, your point is they remind you of just like, “Here’s where you left off. Okay. You can just jump into this code, maybe make some tweaks.”

Tailoring Communication for Different Audiences

Nicole Forsgren: Yeah.

Effective Methods to Quantify Benefits

Lenny Rachitsky: So interesting. Let me zoom out a little bit and… Before we get into your framework for how to approach developer experience, the latest thinking you’ve got, beyond just obviously engineers doing more is great, what’s your best pitch for why companies should really, really focus on developer experience?

Nicole Forsgren: I hate to say return on investment, but the business value is… the opportunity here is huge. In general, we write software for fun and for hobbies, but we also have software because it meets a business need. It helps us with market share, it helps us attract and retain customers, it helps us do all of these things. And I think DevEx is important because it enables all of that software creation, it enables all of that problem solving. It enables the super rapid experimentation with customers that… Before, you’d need a while for a prototype and maybe a little bit longer to actually flight it through an A/B test on a production system. You can do it in hours, right now.

Measuring AI Tool Impacts on Productivity

Lenny Rachitsky: Maybe the opposite end of the spectrum, getting very tactical, before we get into the larger framework, what’s just one thing that you think an eng team, a product team can do this week, next week to help their developer experience maybe get more done?

Nicole Forsgren: Honestly, I think the best thing you can do is go talk to people and listen. I love that the audience of this podcast is primarily PMs because they tend to be really good at this. And I would say start with listening and not with tools and automation. So many times companies are like, “Well, I’m just going to build this tool,” or, “I’m going to build this thing.” Often you build a thing that you yourself have had a challenge with or that is easy to do, easy to automate. And if you just go talk to people and ask the developers like, “Think of yesterday, what did you do yesterday? Walk me through it. What were the points that were just delightful? What were the points that were really difficult? Where did you get frustrated? Where did you get slowed down? Where was there friction?” If you go talk to a handful of people, a lot of times, you can surface a handful of things that are relatively low lift and still have impact or you can identify a process that’s unnecessarily complex and slow.

Measuring Developer Experience from Scratch

Lenny Rachitsky: So what I’m hearing is, if you want to help your teams move faster and be happier eng teams, your advice is just, “Before you do anything, just go ask them what is bothering them.”

Key Considerations for Survey Design

Nicole Forsgren: Go ask them, yeah. And trust me, most developers are going to be more than happy to tell you what’s broken and what’s bad. I’ll say, there was one company that I had worked with. I remember they had a process that was really difficult and it was on an old mainframe system, and they were going to have to replat the whole thing and so they never went to work on it or talk about it. Everyone hated it because it was this huge delay. I mean, all they had to do was change a process. Sometimes all you have to do is change a process. And they changed it so that instead of… I think someone had to print it out and walk it down three or four flights, and they get approval. And then someone else had to walk it back up, and so it was just that interim. They didn’t replat anything. They didn’t redesign anything major. They just sent an email.

Developer Satisfaction Versus Developer Happiness

Lenny Rachitsky: Let me push on that and… I’m curious just what are the most common things people do. If you’re just starting on, “Okay, we need to focus on engineering experience,” what do you find are the most… two or three most common improvements companies need to make?

Nicole Forsgren: I’ll say, I’ll kind of echo that process, there’s almost always a process that can be improved and that can be improved without a lot of engineering lift or a lot of engineering headcount. Most large companies, in particular, have something that is several, several steps. It’s the way it is because it’s the way it is, but that’s no longer the way it is. And even at small companies, sometimes it’s just a little too YOLO, and you don’t know what it is and you’re kind of chasing everyone around. So if you can create a very lightweight process, that can also be helpful. That can be one of the best places to start, especially if you have limited exposure to the whole rest of the org. Sometimes just a team process can help.

I will say from a business leader’s standpoint, a lot of what you can do is provide structure and support for this organizational change. Communicate what you’re doing, communicate what the priorities are, communicate why this is important, celebrate wins. Because if folks try to do this as just a one-off, fully isolated side project, it’s really challenging to get some good momentum, to get people to care, and to get them to stay involved. Because it feels like it’s just another internal project that isn’t going to matter or that isn’t going to get celebrated, but it has these huge upside potential returns for the business.

Top Recommended Developer Tools

Lenny Rachitsky: It’s interesting, what I’m hearing here is nothing about tools or technologies. It’s not like move to this cloud, it’s not like install this new deployment system, it’s processes and people and org and morale.

Applying Product Thinking to DevEx

Nicole Forsgren: Yeah. Now, there will be technical pieces that are very important, especially now with AI, where we’re rethinking how build and test systems work. We’re rethinking feedback to users so that it’s very, very customized in terms of what is shared and when it is shared. There are a lot of technical pieces that are involved, but that’s not the only thing. It’s necessary but not sufficient, and that doesn’t have to be the place that you start.

AI Corner: AI for Home Design

Lenny Rachitsky: I have a hard question I want to ask you that I thought of as you were talking. I feel like this is the question that most founders and heads think about. And the question is just like, how do I know if my eng team is moving fast enough, if they can move faster, if they’re just not performing as well as they can? What are just maybe smells, signs that tell you, “Yeah, my team should be moving faster,” versus, “This is just the way it works. This is as fast as they can move”?

Nicole Forsgren: Most teams can move faster, right? Also, given what we know about cognitive load, not all speed gains are necessarily good. Or the upside is going to be kind of limited once you hit kind of a certain point, and most people are not even near that point. I don’t know a single team, frankly. But how do you know? You know if you’re always hearing about builds breaking, flaky tests, overly long processes, if you have to request a new system or if you need to provision a new environment, or if it’s really, really hard to switch tasks or switch projects. So if someone has an opportunity to go work in another part of an org and they don’t for reasons that are unclear, and not political, and anyone says anything about the system, that’s usually a pretty good smell that there’s friction somewhere.

Because once you finally figure out your system and you’re able to get work done, the switching costs can often be really, really high to go anywhere else. So sometimes people will do that. But I’ve worked with companies where switching orgs within the company, you had to basically pay the same tax as a new hire because the systems were so different and they were so full of friction, and it was so difficult to do so many things.

Rapid Fire Q&A Session

Lenny Rachitsky: I love the first part of your answer especially, which is you can always move faster. I think every founder is going to love hearing that. To your point though, there’s diminishing returns over time?

Nicole Forsgren: Yeah. And you don’t know about the quality, right? So I think that’s the other side is that you can always move faster, but faster for what? Are we making the right business decisions? And I think that’s especially where PMs come in. We can ship trash faster every single day. We need strategy and really smart decisions to know what to ship, what to experiment with, what features we want to do in what order and what rollout. The strategy is the core piece, and then think about speeding that up. If we don’t have the other pieces in place, I mean, garbage in, garbage out.

My Current Favorite Product

Lenny Rachitsky: I want to follow that thread, but before I do that, just to mirror back what you shared. So signs that there’s a lot of low-hanging fruit to improve the productivity of your team: builds are always breaking. Tests are flaky and constantly incorrect, with false positives. It’s hard to context switch between different projects. You just hear people talking about the system, that it’s just really hard to work with. Is that roughly right?

Nicole Forsgren: Yeah.

My Personal Life Motto

Lenny Rachitsky: Cool, okay. So going back to the point you just made, there’s a sense that AI is making teams so much faster because it’s writing all this code for them. You’re going to have all these asynchronous agents, engineers working for you. It feels like a core part of your message is that’s just one part of engineering work and there’s so much more, including figuring out what to build… and alignment internally. Maybe just speak to just… There is a lot of opportunity to improve engineering performance productivity, but there’s so many other elements that are not improved through AI?

Nicole Forsgren: Yes. Or could be in the future, right?

Discussing the New Google Role

Lenny Rachitsky: Mm-hmm.

Nicole Forsgren: I think there are a lot of ways that we can pull in AI tools to help us refine our strategy, refine our message, think about the experimentation methods or targets of experimentation, or think about our total addressable market, but we need to have that strategy and plan fairly well aligned or at least have two or three alternatives that you want to test. Because now, the engineering can go, or at least the prototyping especially, much, much faster. We can throw out prototypes. We can run any tests and experiments that are customer facing, assuming that we have the infrastructure in place, which allows us to learn and progress much faster than before. In some places, it used to take months to get something through production to do A/B testing and get feedback. We can do this in a day or two, definitely under a week. But we want to make sure that we’re building and testing the right things, “Are we partnering with the right… Do we have the data that we need?”

And I will say AI can actually be a pretty good partner there if you have a good conversation with it, and then also check with your experts, “What type of data should I be looking at? What type of instrumentation do I need? What type of analysis can I do?” Because then, you can also go to your data science team and say, “I’m planning on doing this. I’d like to…” Let’s not just YOLO A/B tests because that can be… It’s a shame to do a large test and end up disrupting users or disrupting customers, or breaking privacy or security protocols and also end up with data that’s unusable because you just can’t get the signal that you’re looking for. But now, I’m also seeing people kind of accelerate that into a few days versus a few weeks. So they can start those key stakeholder discussions from a much more informed kind of filled out space.

Lenny Rachitsky: I love that you work with a bunch of different companies and a bunch of different types of businesses. I think very few people get to see inside a lot of different places. What kind of gains are you just seeing in terms of increased productivity with AI? How big of a gain have you seen?

Nicole Forsgren: I’d say it’s real, and I would also say we don’t have great measures for it yet. We’re still trying to figure out what to measure and what that looks like. One of the best is going to be velocity, all the way through the system: how quickly can you get a feature or a product or something through the system so that you can then experiment and test, either from idea to final end or even kind of a feature and a piece through the system so we can test. That’s really good. Now, that’s also hard to tie back directly to a particular AI tool in the hands of a particular developer. But there are some other things that we can look at and we can see, and that I’ve seen is, again, this kind of rapid prototyping.

I hate lines of code, but I’m going to use lines of code. We do see… I know I worked with some folks who had kind of a whole set of companies they were looking at, and they found that AI was generating significantly more code for the people who were using it regularly. But then, they also found that for folks who were regular users of AI coding environments, AI IDEs, the tool kind of gave them more code. And then the engineers themselves, the increase was double what the coding agent had given them. So one, I’d say, probably it’s kind of a secondary or knock-on effect or just a smell: it can unblock you. It can speed up the work that you would already do. I know sometimes when I work, the first few minutes, it’s hard for me to start. But once I get started, I’m there. So they’re really good at unblocking and unlocking that.

Lenny Rachitsky: Something I’ve seen people on Twitter sharing is how good OpenAI Codex, especially, is at finding really gnarly bugs. And I think it was Karpathy that shared it. He was so stuck on a bug, and no AI tool could figure it out. And then the latest version of Codex spent an hour or something looking into it, and found it for him.

Nicole Forsgren: Yeah. I’m hearing incredible things like that, right? Well, and even also writing unit tests and spinning up unit tests, and creating documentation and cleaning up documentation because I know now people are like, “Oh. Well, we have agents. I don’t need to read the docs because there’s the code there.” It turns out, agents rely on good data because it’s all about how they’ve been trained or how they’ve been grounded. And better data gives you better outcomes, and some of that data includes documentation and comments. The better documentation and the better comments you have, the better performance you’re going to get out of your AI tools.

Lenny Rachitsky: And AI can help you write that documentation. I’ve been working with Devin a little bit, and it’s really good at that stuff.

Nicole Forsgren: Yeah.

Lenny Rachitsky: Okay. Let’s talk about this framework, this book. So you’re publishing a book called Frictionless, which sounds like a dream, “How do you create a dev team that’s frictionless?” It’s called Frictionless: 7 Steps to Remove Barriers, Unlock Value, and Outpace Your Competition in the Age of AI. There’s a seven-step process to this. Walk us through this and maybe give us just context on this book, who it’s meant for, what problem it solves, and then the seven steps.

Nicole Forsgren: I will say, I also wrote this with Abi Noda who has just… of DX. He has incredible experience in the space. He’s worked with hundreds of companies and so it was kind of nice bouncing ideas off of him. Also, thanks to all of the engineering leads and DevEx leads, and CTOs, and engineers that we talked to to make sure that our smells were right. So who is this book for-

Lenny Rachitsky: Let me take a tangent on Abi, and DX, since you mentioned him. This is super interesting, and I think it connects so directly with this conversation. Abi started this company called DX, which is such a great name for a company around developer experience. They just sold the company for a billion dollars to Atlassian. It’s a very high multiple on their ARR. It, to me, shows exactly why this conversation is so valuable, just how much value companies are putting into improving developer experience. Atlassian would spend a billion dollars on this. It’s an early stage-ish startup. It was doing really well and people loved it, but it was like early stage-ish, a billion dollars. And the idea is they have all these companies working using Jira and all their products. They’re all trying to figure out how do we measure productivity. It’s worth a lot of money to them. And I know you were an early advisor to them too, so-

Nicole Forsgren: Yeah.

Lenny Rachitsky: … it just shows us how important this is.

Nicole Forsgren: Yeah. Well, I think it also shows us how much value you can get out of this. There’s so much low-hanging fruit, there’s so much unlocked potential, and it’s hard to know where to start a lot of times even in… I’ve been at large companies that have a lot of expertise and a lot of really, really smart people. But if you haven’t kind of been in this space and thinking about it this way, it’s hard to know where to start or it’s easy to make simple mistakes up front that mean you kind of need to start over later. So I guess it also brings us back to, “Who is this book for?” It’s for anyone that cares about DevEx, so definitely technology leaders, anyone who’s trying to kick off a DevEx program, or is working on a DevEx improvement program. I think it’s particularly relevant for PMs because if you’re PMing something that involves software building and creating software, improving DevEx will only help your team. And also, you have key skills and insights and instincts that are so important to DevEx that many times, I will say, I’ve seen engineering teams just miss.

Lenny Rachitsky: Okay. What’s the framework? What are the steps? Where do people start?

Nicole Forsgren: The book goes through a seven-step process, and then also kind of provides some key kind of principles at the end. Step one is to start the journey. So assuming you’re kicking off, you can start the journey. And this involves what we have already talked about. Go talk to people, have a listening tour, synthesize what you learn, visualize the workflow and tools, get a handle on what the current state is. Step two is to get a quick win. So start small, get a quick win, pick the right projects, share out what you’ve done. Step three is using data to optimize the work. So establish some of your data foundation, find the data that’s there, start collecting new data, use some surveys for some really fast insights, and we include example surveys. Step four then is to decide strategy and priority. Once you have some data, then you need to know, of all the things that are potentially broken, and if you’ve already gotten your quick win, of all the things that are left, “What should I do next?” So we walk through some evaluation frameworks there.

Step five is to sell your strategy. Once you’ve decided, now you have to kind of convince everyone else. So now you want to get feedback, you want to share why this is the right strategy right now. Step six is to drive change at your scale. So here, we address folks that have local scope of control. If you’re starting on just a dev team, you want to do it yourself, kind of grassroots effort or global scope of control. If you’re the VP of developer experience or something, there are some things that you can leverage for a top down, and then how do you drive change when you’re kind of somewhere in the middle, because you can leverage both types of strategies. And then step seven is to evaluate your progress and show value, and then kind of loop back around.

I will say that we wrote this so that you could kind of jump into any step wherever you are right now. If you’re kicking off a team or an initiative, you’ll probably want to start at step one. You should definitely start at step one. If you’re joining an existing initiative, you could jump into picking the priority or implementing the changes. So those are the seven steps. Beyond the seven steps, there are a few practices that we also recommend. So thinking about resourcing it, change management, making technology sustainable, and then also bringing a PM lens to this, “How can we think about developer experience as a product, and how do we think about the metrics that we have as a product?”

Lenny Rachitsky: Awesome, okay. I have questions. Point people to the book real quick. What’s the URL? How do they get it? When does it come out?

Nicole Forsgren: Yeah, developerexperiencebook.com. Right now, you can sign up for the mailing list. We’ll let you know when it’s out on pre-order, and we’ll also be sharing pieces of the workbook. So we’ve got almost a hundred page workbook that goes along with the book, and then it should be out by end of year.

Lenny Rachitsky: Okay. So one piece of this is just this term developer experience feels very intentional in that it’s not developer productivity, developer work. It’s how do we make developer experiences better at our company, which includes they get more done, but also they’re happier and things like that. So I think that’s an important element of this, right?

Nicole Forsgren: Yeah, absolutely.

Lenny Rachitsky: Okay.

Nicole Forsgren: Because, again, it’s not just about productivity. We talked about this from the frame and the lens of, “We need to be building the right thing.” And you want to be productive, but you also want to be thinking about… and this is what engineers are also just really incredibly good at, give them a problem and don’t tell them how to solve it, and then they can solve it better. They have the freedom, they have the innovation, they have the creativity so that they can solve this problem. If it’s only about productivity, then it’s just lines of code or number of PRs or whatever. But we really want to talk about value and how do we unlock value, and how do we get value faster. And that involves, yes, making them more productive and removing friction because then, they have the flow and the cognitive load and the things that we kind of talked about.

Lenny Rachitsky: Awesome, okay. And then say someone wants to start this team, what does it usually look like. At Airbnb, I remember this team forming. It was just like an engineer or two, getting it started and taking charge. What do you recommend as the pilot team, and then what does it look like as it grows?

Nicole Forsgren: There are a few ways to do this, right? So if you’re doing it yourself, you could do it with a couple of engineers, maybe a PM or a PGM or a TPM to kind of help communicate. Because really, comms plans are just so important here. On a small scale, what we want to do is look for those quick wins, look for things that you can do at small scale. Some folks call them things like paper cuts. They’re small things that you can do to help people see the value and feel the benefit themselves, “How can a developer’s work get better? How can their day-to-day work get better?” And then kind of build momentum from there. If you’re working from a top-down structure and you have the remit, you still want some quick wins, but those quick wins can look a little more global in scale because you have the infrastructure or the backing to make different types of changes that aren’t only local.

So an example of a small local change could be just cleaning up your tests, your test suites. Any team could do that, any team could do that. At more global scale, it might be changing organization-wide process that is just overly cumbersome or throwing some resourcing into cleaning up the provisioning environment.

Lenny Rachitsky: Okay. What kind of impact have you seen from teams like this forming, on the engineering teams at their companies?

Nicole Forsgren: I’ll say I’ve seen a huge impact: for smaller companies, hundreds of thousands of dollars; for large companies, in the billions. Well, also, we need to learn how to communicate that, “What does the math look like?” Many times, we can look at saving time, we can look at saving costs, we can look at a lot of different things. We can look at speed to value or speed to market. We can look at risk reduction, but the gains really are there. I will mention that it tends to follow something like the J-curve. So you’ll have a couple of quick wins and it’ll look like a big win, and then you’ll hit kind of a little divot where suddenly the really obvious projects, the low-hanging fruit, are handled. So now, we need to do a little bit of work. We might need to build out a little bit more infrastructure. We might need to build out a little more telemetry, so that we can capture the things we want to capture. And then once we get that done, then we start to see those benefits really compound.

Lenny Rachitsky: So going back to that measurement number, what do you recommend? How do people find these numbers? Because I think that’s so much of the power of this is like, “We saved a million dollars doing this.” What do you look at to figure that out?

Nicole Forsgren: I think there are a few different things to keep in mind, like who is our key audience, and we usually have a few key audiences. We really want to be able to speak to developers because they’re the ones that are going to be using the systems. They’ll be partnering with you on either building them or at least providing feedback about what you’re doing. So for them, we often want to frame this in terms of things they care about. So time savings. If something gets faster, they can save time. They don’t spend time doing setup when they don’t need to anymore. Related to that is reduced toil. So compliance and security are super important. Also, many times they require several manual steps that… I won’t say they’re not value add. They’re not value add from an individual human perspective. If we can automate as much as possible, that’s great. And improved focus time.

That’s from the developer point of view. Leadership often cares about… They care about those things, but they often care more about other things. So we could talk about usually costs in dollars, “Can we accelerate revenue? What does our time to value look like? What is our velocity? How quickly can we get feedback from customers?” And for folks and organizations that are in really competitive environments, that can be really compelling because it’s all about speed. We could talk about saving money. Here, we can look at maybe quantifying savings. One example is test and build. If we can clean up a test and build suite, a developer really wants to hear about time saved and more reliable systems. There’s less toil because they don’t have to keep re-running tests or kind of go clean up test suites.

From the business perspective, cleaning up a test and build suite can be cloud cost savings because all of those tests are running somewhere on a cloud. And if they always fail or if it’s just kind of a waste of spend, that can be useful, recovering some capacity. We can always talk about time and productivity gains, “How much equivalent developer time are we losing on things that are not necessarily value add?” And then sometimes we can correlate to business outcomes, and correlate is usually the best we can do here, but there can be some pretty compelling correlations in terms of speeding up time to value and increased market share, for example.
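As a rough illustration of the test-and-build framing above, the same cleanup can be expressed once in cloud dollars for leadership and once in hours for developers. This is only a sketch: the function name, parameters, and every number below are hypothetical placeholders, not figures from the interview.

```python
def weekly_rerun_waste(reruns_per_week, minutes_per_run,
                       dollars_per_ci_minute, dev_minutes_lost_per_rerun):
    """Estimate what re-running flaky tests costs per week.

    Returns (cloud dollars spent on reruns, developer hours lost).
    All inputs are assumptions you would replace with your own data.
    """
    ci_dollars = reruns_per_week * minutes_per_run * dollars_per_ci_minute
    dev_hours = reruns_per_week * dev_minutes_lost_per_rerun / 60
    return ci_dollars, dev_hours


# Hypothetical inputs: 200 reruns a week, 15 CI minutes each,
# $0.02 per CI minute, 6 minutes of developer attention lost per rerun.
ci_dollars, dev_hours = weekly_rerun_waste(200, 15, 0.02, 6)
print(f"Cloud spend on reruns: ${ci_dollars:.2f}/week")
print(f"Developer time lost:   {dev_hours:.1f} hours/week")
```

The two return values map onto the two audiences: the dollar figure speaks to cost and recovered capacity, while the hours figure speaks to time saved and reduced toil.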

Lenny Rachitsky: Let me follow that thread and come back to this, what I think is the biggest question people have right now with AI and productivity, and I don’t think anyone has the answer yet, but I’m curious to get your take of just what should people do today? What’s the best approach to understanding what impact AI tools are having on their productivity? Because they’re spending all this money on there. I don’t know, what are we getting out of this? So I guess things are moving faster, but I don’t know. So if someone had to just like, “Okay, here’s what I should probably try to do,” what would be your best advice here for measuring the impact of AI tools on productivity?

Nicole Forsgren: I would say it depends. In part, it depends on what your leadership chain really cares about. We are usually pretty good at figuring out what matters to developers and we could communicate that to them. But if we’re trying to just identify two or three data points to really kind of focus on, because when we’re first starting with data, sometimes it can be challenging, what do they care about? Think about the messaging you’ve been hearing. Have they been talking about market share? Losing market share or competitiveness in the marketplace? If that’s it, focus on speed. Think about ways that you can capture metrics for speed from feature to production or feature to customer or feature to experiment and what that feedback loop looks like.

If they’re talking about profit margin all the time... Now, we always talk about money because this is business. But if that seems to be an overarching narrative, look for ways that you can save money and then translate that into recovered and recouped headcount cost. Or sometimes you’ll reinvent, change a process, and then you no longer need as many vendors. So reductions in vendor spend can also help there. I say also it depends because sometimes they’ll say something, leadership will say something, and it kind of comes up as a theme. If you could solve a problem that they have or it’s something that they’re focused on, if you can slightly reframe it even, like if they’re calling everything developer productivity, go ahead and call it productivity. If they’re calling it velocity, and velocity is what matters to them, think about how to frame this in terms of velocity. If they’re talking about transformation or disruption, how does this help with the disruption? Because then, it will resonate with them. We don’t want to make them work to understand what it is that we’re doing and the value that we provide.

Lenny Rachitsky: That is such good advice. Just to reflect back, the advice here is if your company’s trying to figure out what sort of impact are AI tools having on our company, first, it’s just like, what does the company care about most? What do leaders care about most? Could be market share, could be profit margin, could be velocity. We need higher velocity or we need to transform, transformation. So your advice there is figure that out based on words and phrases you’re hearing. Then figure out ways to measure that, ways to measure market share growing, profit margin increasing. I love these examples, like time from feature idea to production or to experiment, so maybe start tracking that. If it’s margin, it’s money saved by fewer tests failing or some vendor you don’t have to pay for, things like that. And then velocity, I imagine that’s where things like DORA come in of just speed of engineering, shipping, or… What would you think about there for velocity?

Nicole Forsgren: I would say it’s actually one of those… I would pick as broad a swathe as you can. So if you can go from idea to customer or idea to experiment, how long does that take? How long does it typically take, and how long can it take, and does it take now with improved use of AI tooling and reduction in friction? That’s where I will say, we talk about this a little bit in the book, how do we deal with attribution challenges? What was responsible for this? Was it the DevEx or was it AI? Go ahead and disclose that. Say, “Yes, we rolled out AI tools. We also had this effort in DevEx. They partnered very closely together.” Both of them probably contributed to this, right? If we had AI tools without the DevEx improvements, we probably would’ve had some improvements, but not nearly as much.
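One way to make the "idea to experiment" measure concrete is to record two dates per feature and compare typical lead time before and after a change. The sketch below assumes you can get those two dates for each feature; the data is invented for illustration, and, in line with the attribution caveat above, the comparison cannot say whether AI tooling or DevEx work caused the difference.

```python
from datetime import date
from statistics import median


def lead_times_days(features):
    """Days from idea logged to experiment live, for (idea, live) date pairs."""
    return [(live - idea).days for idea, live in features]


# Hypothetical features shipped before the AI tooling and DevEx effort.
before = [
    (date(2025, 1, 6), date(2025, 2, 17)),
    (date(2025, 1, 13), date(2025, 3, 3)),
    (date(2025, 2, 3), date(2025, 3, 10)),
]
# Hypothetical features shipped after both efforts rolled out.
after = [
    (date(2025, 6, 2), date(2025, 6, 9)),
    (date(2025, 6, 9), date(2025, 6, 12)),
    (date(2025, 6, 16), date(2025, 6, 21)),
]

median_before = median(lead_times_days(before))
median_after = median(lead_times_days(after))
print(f"Median idea-to-experiment: {median_before} days before, "
      f"{median_after} days after")
```

The median is used here rather than the mean so that one unusually slow feature does not dominate the summary; that is a design choice for this sketch, not something prescribed in the conversation.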

Lenny Rachitsky: If people were starting to do this today, say they’re just like, “I want to start measuring developer experience,” are there two or three metrics everybody basically needs, that they should just start measuring ASAP?

Nicole Forsgren: If you’re just starting today and if you have nothing at all, talk to people, obviously. After that, I would do surveys because surveys can give you a nice overall view of the landscape quickly so that you know where the big kind of challenges are. I say that because if you’re just starting, you might not have instrumentation through your system, all the metrics. And if you do already, it might not be what you think you want. Metrics that were designed without purpose, questionable. Metrics that were designed for another purpose, they might work for what you want, but they might not, so we can’t just assume we have them. That’s one reason I like surveys, and we include an example in the book. You can just ask a few questions, “How satisfied are you? What are the biggest barriers to your productivity, or what are the biggest challenges to getting work done?” and let them pick either from a set of tools or maybe a set of processes and then say… Let them pick three, just three.

Of those three, how often does this affect you? Is this hourly? Is this daily? Is this weekly? Is this quarterly? Because sometimes it hits you every single day, and you’re just mad about it. Sometimes it only hits you once a quarter because it’s end of quarter, but it’s so onerous, and then kind of open text, like, “Is there anything else we should know?” That can give you incredible signal because by making folks prioritize the top three things… If you let them pick everything, it makes the data super, super messy. But three things and how often, you can just come up with a score or a weighted score if you want, and then go kind of dig into, where should that data be? What data do we need? But also, then you’ve got at least some kind of baseline. It’ll be a subjective baseline, but now you’ll know what the biggest challenges are.
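The "pick three, then say how often" survey described above can be turned into a weighted score in a few lines. This is a minimal sketch: the frequency weights, barrier names, and responses are all hypothetical, and the weighting scheme is an assumption rather than the one used in the book.

```python
from collections import defaultdict

# Hypothetical weights: more frequent pain counts for more.
FREQUENCY_WEIGHTS = {"hourly": 4, "daily": 3, "weekly": 2, "quarterly": 1}


def score_barriers(responses):
    """Rank barriers by summed frequency weight across respondents.

    Each response is a list of up to three (barrier, frequency) picks.
    """
    scores = defaultdict(int)
    for picks in responses:
        if len(picks) > 3:
            raise ValueError("each respondent may pick at most three barriers")
        for barrier, frequency in picks:
            scores[barrier] += FREQUENCY_WEIGHTS[frequency]
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Invented survey responses for illustration only.
responses = [
    [("flaky tests", "daily"), ("slow builds", "hourly"),
     ("env provisioning", "quarterly")],
    [("flaky tests", "daily"), ("code review wait", "weekly")],
    [("slow builds", "daily"), ("flaky tests", "weekly"),
     ("env provisioning", "weekly")],
]

ranking = score_barriers(responses)
for barrier, score in ranking:
    print(f"{score:>3}  {barrier}")
```

Capping picks at three keeps the data clean, and the resulting ranking is the subjective baseline: it tells you where to go dig for instrumentation, not what the underlying cause is.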

Lenny Rachitsky: I love how all this just comes back just starting by talking to people and asking them these things, which is very similar to product management and just building great products is, have you talked to your customers? Everyone thinks they’re doing this, but most people are not doing this enough.

Nicole Forsgren: And I will say one thing that’s challenging when you think about getting data, so interviews are data and that’s important, surveys are a little more quantified because we can turn it into counts, but that’s where we also want to be careful. A lot of folks go to write a survey question and they’ll say something like, “Were the build and test system slow or complicated in the last week?” You’re asking four different questions there. If someone answers yes, was it the build? Was it the test? Was it slow or was it flaky or complicated or something? So it can be really difficult to untangle what the signal is you’re actually getting there, and so it is worth the time chatting with someone who’s familiar with survey design, having a conversation with Claude or Gemini or ChatGPT around, “Here are the survey questions. Or can you propose some?” And then make sure you take a couple of rounds. Is this a good survey question? What questions can I answer from the data that I get? What problems could I solve? If you can’t answer a question with data, don’t get it.

Lenny Rachitsky: And you have example surveys in your book for folks that want to just copy and paste and not have to think about this much.

Nicole Forsgren: Yeah, example surveys, a lot of example questions. We even recommend what the format, what the flow should look like, how long it should be, how long it should not be.

Lenny Rachitsky: One thing that I was reading is that you don’t love happiness surveys specifically, asking engineers how happy they are, is that true? If so, why is that?

Nicole Forsgren: I don’t, no. Well, I’ll say I don’t love a happiness survey because there are too many things that contribute to happiness. Happiness is a lot, right? So happiness is work, happiness is family, happiness is hobbies, happiness is weekends, happiness… There are so many things that contribute to happiness. Now, that doesn’t mean I don’t care about happiness. I think happiness surveys are not particularly useful here. What can be helpful is satisfaction and people are like, “That’s the same thing.” It’s not because you can ask, “Are you satisfied with this tool?” and then ask some follow-up questions. Now, those two are related because the more satisfied you are with your job and your tools and the work and your team, it contributes to happiness. I used to joke… Remember the golf commercials like, “Happy cows make happy cheese”?

Lenny Rachitsky: No.

Nicole Forsgren: I had a Calabrian. That was the best. Happy devs make happy code. They write better programs, they do better work, they’re better team members and collaborators. But capturing and trying to directly influence happiness, that’s not what we are here for. It’s too challenging, it’s too all-encompassing. Satisfaction can give us some signal.

Lenny Rachitsky: In a totally different direction, in terms of just tools you see people using, are there any that just like, “Oh, yeah, this one’s really commonly great.” For people, this is just a tool people are finding a lot of success with. There’s the common ones, Copilot, Cursor. I don’t know. Is there anything that stands out that you want to share, just like, “Hey, you should check this tool out. People seem to love it”?

Nicole Forsgren: I think they’re huge, right? Copilot, Cursor, Gemini.

Lenny Rachitsky: Claude Code.

Nicole Forsgren: Yep, Claude Code. I love Claude Code.

Lenny Rachitsky: I have a whole post coming on ways to use Claude Code for non-engineering use cases.

Nicole Forsgren: Cool. Nice.

Lenny Rachitsky: It’s so interesting. For example, Claude Code, “Find ways to clean up storage on my laptop,” and it just tells you there’s a bunch of files. It’s just like ChatGPT running on your computer and it could do all kinds of crazy stuff on your computer for you, like a mini God.

Nicole Forsgren: I’m going to do that now. This is great.

Lenny Rachitsky: It’s so good. Yeah, that’s why I’m writing this. Dan Shipper was on the podcast and he said Claude Code is the most underrated AI tool out there because people don’t realize what it’s capable of. It’s not just for coding, and that’s what I’m trying to explore more and more. Okay. Is there anything else that you think would be valuable to help people improve their developer experience, help them adapt to this new world of AI and engineering that we haven’t covered?

Nicole Forsgren: I think something that’s important to think about in general is to bring a product mindset to any type of DevEx improvements that are happening, and also the metrics that we collect and capture. By that, I mean we want to identify a problem, make sure we’re solving a problem for a set of users. We want to think about creating MVPs and experiments and get fast feedback, do some rapid iteration. We want to have a strategy. We want to know who our addressable market is. We want to know what success is. We want to basically have a go-to-market function. We need to have comms. We need to get continuous feedback from our customers. We want to keep improving. And, at some point, we want to think about sunsetting something. Is it in maintenance mode? Is it sunsetting?

And I think that’s important in general, but I think it’s extra important now because when we have AI tools, we’re using AI tools, we’re embedding AI into our products, things are changing so rapidly that it can be really important to take half a beat and say, “Okay, what’s the problem I’m trying to solve right here? Is this metric that we’ve had for the last 10 years still important or should this be sunset because it’s not really important anymore? It’s not driving the types of decisions and actions that I need.”

Lenny Rachitsky: Before we get to our exciting lightning round, I want to take us to AI Corner, which is a recurring segment on this podcast. Is there some way that you’ve found a use for an AI tool in your life, in your work that you think might be fun to share, that you think might be useful to other people?

Nicole Forsgren: I have been working on some home design and redecorating rooms and stuff. I’m working with a designer because I know what I like, but I don’t know how to get there, I’m not good at this. But I’ve really been loving ChatGPT and Gemini especially to render pictures for me, so I can give it the floor plan, I can give it one shot of the room that’s definitely not what it’s supposed to look like, and then I can give it pictures of a couple different things, and then I can just tell it change the walls or change the furniture layout or change something. It helps me and it’s relatively quick. It helps me kind of visualize the things… Again, I know what I like, but I don’t know how to get there, so I know if I like it or not, which is probably a very random use, but it’s fun for now.

Lenny Rachitsky: My wife does exactly the same thing. She’s sending me constantly, “Here’s what this rug will look like in our living room. Here’s this water feature.” It’s so good and it keeps getting better. It’s just like, “Wow, that’s exactly our house with this new rug,” and all you do is just upload these two photos and just like, “Cool. How would this look in our room?”

Nicole Forsgren: Yeah, I’ve been impressed a couple times. Definitely the machines are listening to us. It’s given me a mock-up of a room or something and then it throws in a dog bed, because I have dogs. I’m like, “I did not tell you to do that, but yeah, that’s probably the color and style of dog bed that I should have in this room.”

Lenny Rachitsky: Speaking of that, have you tried this use case, ask ChatGPT, “Generate an image of what you think my house looks like based on everything you know about me.”

Nicole Forsgren: I haven’t.

Lenny Rachitsky: Because it has memory and it remembers everything you’ve talked about, and it’s hilarious. You got to do it.

Nicole Forsgren: Okay, that’s on my to-do list.

Lenny Rachitsky: There we go. Bonus use case. Nicole, with that, we’ve reached our very exciting lightning round. I’ve got five questions for you. Are you ready?

Nicole Forsgren: Awesome. Let’s go.

Lenny Rachitsky: What are two or three books that you find yourself recommending most to other people?

Nicole Forsgren: Outlive by Peter Attia is fantastic. Another one that’s I guess maybe related, I hurt my back so it’s not great, Back Mechanic by Stuart McGill is incredible. Shout out to anyone who has hurt their lower back. It’s for a lay person to read through and figure out how to fix lower back problems. It’s kind of a random one. I will say I love How Big Things Get Done. I can’t pronounce the names. I think one’s Scandinavian. It kind of dissects really large projects through recent-ish history and where they failed and why. And I think it’s really interesting for us to think about, especially now in this AI moment where basically all of our software systems, at least, are going to be changing. So how do we think about approaching what is essentially going to be a very large project? And then, sorry, I’m going to throw in a bonus one, The Undoing Project by Michael Lewis. Matt Velloso recommended it to me, and it’s so good.

Lenny Rachitsky: Yes, I read that-

Nicole Forsgren: I audibly gasped at the last sentence.

Lenny Rachitsky: Oh. I was like, “What?”

Nicole Forsgren: I was [inaudible 01:03:48]. Yeah, I was not expecting it.

Lenny Rachitsky: I read that and I do not remember that last sentence. Oh, man. Okay, cool. Next question. Do you have a favorite movie or TV show you recently watched and enjoyed?

Nicole Forsgren: I’ll say I watch Love Is Blind. If I’ve got to shut down at the end of the day, Love Is Blind is fun.

Lenny Rachitsky: There’s a new season out.

Nicole Forsgren: Yeah, very excited… and Shrinking. Have you seen Shrinking?

Lenny Rachitsky: No. I think I started The Therapist and yeah, I gave it a shot.

Nicole Forsgren: Strongly recommend it. It’s cute.

Lenny Rachitsky: Sweet. Is there a product you’ve recently discovered that you really love? Could be an app, could be some kitchen gadgets, some clothing.

Nicole Forsgren: Yeah, the Ninja Creami is-

Lenny Rachitsky: Did you say this last time?

Nicole Forsgren: I don’t know. I may have. I don’t think so.

Lenny Rachitsky: Somebody said this and I still remember it. It’s like-

Nicole Forsgren: It’s so good.

Lenny Rachitsky: … you make ice cream and stuff with it, right?

Nicole Forsgren: Yeah, and you can basically freeze a protein shake and then it turns it into ice cream-

Lenny Rachitsky: Oh, man.

Nicole Forsgren: … which is delicious. Another one is a Jura coffee maker. I love good coffee and I’m not great at making it, so I can just push the button and it’ll give me anything I want, including lattes, cappuccinos or anything. So that’s kind of fun.

Lenny Rachitsky: Sweet, okay. Do you have a favorite-

Nicole Forsgren: Just sugar and caffeine. I just need to power through the day.

Lenny Rachitsky: There’s the engineering productivity 101.

Nicole Forsgren: Yes.

Lenny Rachitsky: Oh, man. Okay, two more questions. Do you have a favorite life motto that you often find useful in work or life and come back to in various ways?

Nicole Forsgren: Yeah, I think one that’s come up a couple times, it’s not a verbatim thing, I think it’s more the vibe, hindsight is 20/20, but it’s also really dumb. I think if we made the best decision we could at the time with the information that we had available, then it is what it is. If you make a bad decision because you made a bad decision and you knew better, you had the information, not great. I don’t think we give ourselves or other people enough grace because we always end up finding more information out later.

Lenny Rachitsky: Hear, hear. Final question. I was going to ask you something else, but as we are preparing for this, you shared that you have a new role at Google. Maybe just talk about that, what you’re up to there, why you joined Google, anything folks should know.

Nicole Forsgren: Sure. I am senior director of developer intelligence and core developer. It’s super exciting and super fun because of all of these things we’ve been talking about. It’s focused on Google and all their properties and their underlying infrastructure: how can we improve developer experience, developer productivity, velocity, all of these things we’ve been talking about. And, because I’m kind of the numbers person, how do we want to think about measuring it, how does measurement change, how do feedback loops change, how can we improve the experience throughout and then kind of drive that change through an organization in ways that are meaningful and impactful and faster than they’ve been before.

Lenny Rachitsky: Nice job, Google, getting Nicole. What a win. I need to get some more Google stock ASAP. Okay, two follow-up questions. Where can folks find you online and find your book online if they want to dig deeper? And how can listeners be useful to you?

Nicole Forsgren: Online, you can find the book at developerexperiencebook.com, I’m at nicolefv.com, and LinkedIn occasionally. Sometimes it’s a mess. I try to wade through all of the noise. And to be useful, sign up for the book and the workbooks. The workbooks are free. I’d love to get any kind of feedback on what works, what doesn’t. I always love hearing those kinds of stories.

Lenny Rachitsky: Nicole, thank you so much for being here.

Nicole Forsgren: Thanks for having me, Lenny.

Lenny Rachitsky: My pleasure. Thanks, again. Bye, everyone.

Thank you so much for listening. If you found this valuable, you can subscribe to the show on Apple Podcasts, Spotify, or your favorite podcast app. Also, please consider giving us a rating or leaving a review as that really helps other listeners find the podcast. You can find all past episodes or learn more about the show at lennyspodcast.com. See you in the next episode.


如何衡量 2025 年 AI 开发者生产力 | Nicole Forsgren


文字稿

生产力指标的谎言

Lenny Rachitsky： 很多公司都在尝试衡量团队的生产力。

Nicole Forsgren： 大多数生产力指标都是一种谎言。如果目标是产出更多代码行数，我可以用提示词让 AI 写出史上最长的一段代码。这种指标太容易被钻空子了。

Lenny Rachitsky： 我怎么知道我的工程团队速度够不够快，能不能更快，或者他们是否没有发挥出最佳水平？

Nicole Forsgren： 大多数团队都可以更快。但更快是为了什么？我们可以每天更快地交付垃圾。我们需要战略和真正明智的决策，才能知道该交付什么。

Lenny Rachitsky： AI 可能带来的最大问题之一，就是学会在多大程度上信任它生成的代码。

Nicole Forsgren： 我们不能只是输入一条命令，拿回一个猜测结果就照单全收。我们真的需要去评估它。是否出现了幻觉？可靠性如何？是否符合我们通常的编码风格？

Lenny Rachitsky： 现在大量时间将花在审查代码上，而不是写代码上。

Nicole Forsgren： 这里确实存在一个真正的机会，不仅是重新思考工作流，更是重新思考我们如何安排每天的时间、如何组织工作。现在，我们也可以让 45 分钟的工作时段变得有用，因为进入心流状态这件事实际上已经被部分地交给了机器，或者机器可以通过提醒我们上下文、生成系统图表来帮助我们重新进入心流。

Lenny Rachitsky： 你认为工程团队、产品团队本周或下周可以做的一件事是什么，来提高产出？

Nicole Forsgren： 说实话，我觉得你能做的最好的事情是——

嘉宾介绍

Lenny Rachitsky： 今天的嘉宾是 Nicole Forsgren。随着关于 AI 如何提升开发者生产力的讨论越来越多，越来越多的人开始问：“我们如何衡量这种生产力提升？这些 AI 工具到底是在帮助我们，还是在损害开发者的工作方式？” Nicole 在这个领域的前沿深耕时间比任何人都长。她创建了使用最广泛的开发者体验衡量框架 DORA 和 SPACE。她撰写了该领域最重要的著作《Accelerate》，并即将出版她的新书《Frictionless》，为团队在这个新兴的 AI 世界中如何加速运转、实现更多提供了指南。她的核心论点是：AI 确实加速了编码。但开发者的提速并没有你想象的那么多，因为他们仍然要应对失败的构建、不可靠的工具和流程，以及一系列正在浮现的新瓶颈。

在我们的对话中，我们聊到了她目前关于如何衡量 AI 生产力提升的、最佳且非常具体的建议，团队可以更快的信号，公司在衡量工程生产力时犯的错误，AI 工具如何在帮助和损害工程师（包括进入心流状态），她在公司搭建开发者体验团队的七步流程，如何获得支持并衡量这样一个团队的影响力，以及大量其他内容。本期节目适合所有希望提升工程团队表现的人。如果你喜欢这档播客，别忘了在你最喜欢的播客应用或 YouTube 上订阅和关注，这对我们帮助极大。

正式对话

Lenny Rachitsky： Nicole，非常感谢你的到来，欢迎来到播客。

Nicole Forsgren： 谢谢，很高兴来到这里。

Lenny Rachitsky： 很高兴你再次做客。我刚看了我们两年半前做的那一期节目。看的时候我既惊讶又不惊讶——我们几乎没有谈到 AI。那期节目叫”如何衡量和提升开发者生产力”，我们聊了快一个小时才提到 AI，然后就只是说了句”嗯，不知道 AI 和生产力会怎样发展”。这是不是让你觉得不可思议？

Nicole Forsgren： 是的。因为那时候 AI 刚刚进入人们的视野，是很多讨论的话题，但与此同时，很多东西并没有改变。很多东西依然重要，很多东西还是一样的。而且，两年半就这么过去了，也有点不可思议。时间都去哪了？时间是一种社会建构？

Lenny Rachitsky： 是的。我们当时大部分对话都是一些问题，比如”这可能会怎样影响人们？我们会怎样改变构建产品的方式？“那时候这还几乎算不上什么。而现在，我猜想当人们谈论工程生产力时，这是他们唯一想谈的话题。这也是我们今天要花大量时间聚焦的方向。我对这次对话感到兴奋的原因是，感觉有大量资金涌入了旨在提升生产力的 AI 工具。世界上增长最快的公司正是这些工程 AI 工具公司。现在，越来越多的人开始追问这个问题：“我们到底从中获得了什么收益？这在多大程度上真正帮助我们提升了生产力？我们如何变得更加高效？“

什么是 DevEx

Lenny Rachitsky： 你在这个领域深耕的时间比任何人都长，你发明了许多如今人们赖以使用的框架。所以我很高兴能再次邀请你来聊这些话题。我想先从一个词开始——DevEx，这个词在整个领域里经常出现，我们在今天的对话中也会反复提到。能不能先解释一下，DevEx 到底是什么？

Nicole Forsgren： DevEx 就是开发者体验。说到开发者体验，我们真正关心的是开发者在日常构建软件时的感受——他们面临的摩擦、必须经历的流程、能获得的支撑。这件事之所以重要，是因为当 DevEx 很差的时候，其他一切都没用。最好的流程、最好的工具、最好的……不管你有什么法宝，如果 DevEx 不行，一切都会——

Lenny Rachitsky： 在 DevEx 中包含了生产力，而且你和这个领域的其他同仁提出的一个关键洞察是，这里不仅仅是生产力的问题，还涉及工程师的幸福感。我们会深入讨论这些方面，但也许你可以先谈谈——除了生产力之外，工程师在公司里取得成功还有哪些更广泛的要素？

Nicole Forsgren： 对，我很喜欢这个观点，因为首先，生产力本身就很难定义。如果你只看产出，达成目标的路径有很多。但如果你是通过高强度的苦工或高摩擦的方式来实现产出，那么开发者迟早会倦怠。又或者，如果认知负荷极高——光是想清楚自己在做什么就很费力，因为注意力全被管道连接之类的机械性工作占满了——那就没有剩余的心智空间去想出真正创新的解决方案和问题。我喜欢这个概念的一点在于，它形成了一个自我强化的循环：“你做更多的工作，你做更好的工作。“这对人更好，对系统更好，对我们的客户也更好。

心流状态与 AI 的影响

Lenny Rachitsky： 我本来打算稍后再谈这个，但我现在就想聊聊——工程师的心流状态这个概念。其实我职业生涯早期就是工程师，我学的是计算机科学，做了十年工程师。对我来说，这份工作最美好的部分就是编程和构建时进入的那种心流状态，一切都觉得非常有趣。我感觉 AI 在很多方面反而让这变得更难了，因为你现在要与各种 agent 协作，有大量代码是”替”你写的。能谈谈心流状态对开发者的幸福感、对开发者生产力有多重要吗？以及你所观察到的 AI 对此的影响？

Nicole Forsgren： 谈论 DevEx 有很多不同的方式。一种方式是把它归结为三个关键要素——它们各自都很重要，同时又相互强化。心流状态是其中之一，认知负荷是另一个，反馈回路是第三个。你提到心流状态的问题非常好，我承认我们还处于早期阶段——才几年时间。我们仍在摸索在这种新环境下，什么样的心流状态和认知要求对人们来说是最优的，因为正如你所说，现在我们经常被不断打断。你不再像过去那样进入心流、锁住自己、一口气写下一大堆代码。取而代之的是，你构建一个提示词，拿到一些代码回来审阅，再把它们整合到系统中——这个过程确实容易打断心流。

与此同时，它也有可能促进心流。我见过一些资深工程师搭建起非常厉害的工具链，他们想出了如何保持心流的方法。快速的反馈回路对他们来说效果非常好。他们可以把不同的部分分配给各个 agent 去做，这帮助他们在心流中保持运转——只不过不再是纠结细节和逐行编写，而是处于这样的心流中：“我的目标是什么？达成目标需要哪些部分？多快能到那一步？“然后可以退后一步评估整体，再深入修复某些部分。

Lenny Rachitsky： 关于那位想出了非常酷的工作流的工程师，你能再多说说具体是什么样的吗？

Nicole Forsgren： 我和好几位这样的工程师聊过，也观察过他们工作的方式。我自己还没有搭建过这样的环境，但已经在计划列表上了。他们能够搭建出非常出色的工作空间和工作流。现在，我们大多数人使用工具的方式是——输入一个提示词，拿回几行代码，或者输入一个提示词，拿回一整个程序。而他们的做法是——很多时候我会看到他们先做一个引导性的说明：“这是我要构建的东西。它需要具备这些基本架构组件，需要用这样的技术栈，需要遵循这样的大致流程。帮我想清楚这件事。“然后系统会为他们做一个设计。接着，对于每个部分，他们会分配一个 agent 并行处理，而且他们会事先声明：“这些部分需要能协同工作，确保架构正确，确保使用恰当的 API 和规范。“然后他们可以让它跑几分钟。在此期间，他们可以思考其他有意思的问题，或者预判可能会比较棘手的部分。等他们回来的时候，得到的结果大概比随意让 AI 生成的好不少。因为他们在前期做了系统性的规划，最终产出已经非常接近生产级代码了。

Lenny Rachitsky： 所以我听到的是——这些 AI 工程师会在前期花一点时间做规划，而不是一路硬推、边做边想。

Nicole Forsgren： 对。

如何衡量生产力——以及常见的误区

Lenny Rachitsky： 好，接下来我想聊一个很核心、很多人都在想的问题。很多公司试图衡量团队的生产力：“这有没有提升我们的生产力？这有没有损害我们的生产力？“先问这个：当人们试图衡量 AI 带来的生产力提升时，目前最常见的做法有哪些是错的？

Nicole Forsgren： 我想说，大多数生产力指标都是假象。这真的很棘手，因为从历史上看……当然，代码行数一直都不是一个好指标，但很多人仍然在用代码行数——

Lenny Rachitsky： [听不清]

Nicole Forsgren： ——把它当作产出、生产力或复杂度的某种代理指标。而现在，对于那些系统中——有些系统之前可能不太声张地使用代码行数作为指标——这件事已经彻底崩了。因为”代码行数到底意味着什么？“如果目标是更多行代码，我可以让 AI 写出史上最长的代码，再加一大堆注释。我们都知道 agent 和 LLM 天生就非常冗长，所以游戏化这个指标太容易了，而且会在所有工作里引入复杂度和技术债。我想说的是，确实有一些东西是我们可以关注和追踪的。代码行数作为生产力指标不太好，相当差。但现在，如果我们能区分出哪些代码是人写的、哪些代码是 AI 生成的，它反而变得更有意义了，因为这样我们就能回答下游的问题。

代码存活率与指标的新语境

Nicole Forsgren： “代码存活率是多少？代码质量如何？我们的代码是否被回灌到训练系统中？对于那些后续用于重新训练系统的代码，尤其是当我们做微调和本地调优时，其中有多少是机器生成的？这会形成什么样的循环，又会无意中引入什么样的模式或偏见？” 一方面，它作为生产力指标不太好，但也可以有用。我对 DORA 也是同样的看法。我做了 DORA 指标，包括速度指标和稳定性指标。如果你只看这些，现在已经不够了，因为 AI 改变了我们对反馈回路的方式。反馈回路需要快得多。DORA 的初衷是从速度和稳定性角度评估整条流水线，这一点依然成立。但我们不能盲目套用以前用过的指标，因为我们会错过极其重要的现象和工作方式的转变。
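The human-versus-AI split and the code survival question raised above can be made concrete with a small calculation. The sketch below is illustrative only. It assumes each change already carries a provenance label (`origin`) and that "survival" means the share of added lines still present after some observation window. Neither the `Change` record nor that definition comes from the interview; both are assumptions.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Change:
    origin: str           # "human" or "ai"; assumes provenance is recorded somewhere
    lines_added: int      # lines introduced by this change
    lines_surviving: int  # of those, lines still present after the observation window


def survival_by_origin(changes: list[Change]) -> dict[str, float]:
    """Return the share of added lines that survived, grouped by origin."""
    added: dict[str, int] = defaultdict(int)
    surviving: dict[str, int] = defaultdict(int)
    for change in changes:
        added[change.origin] += change.lines_added
        surviving[change.origin] += change.lines_surviving
    return {origin: surviving[origin] / total for origin, total in added.items() if total > 0}


# Hypothetical sample data, not figures from the interview.
sample = [Change("human", 100, 80), Change("ai", 200, 100), Change("ai", 50, 50)]
rates = survival_by_origin(sample)
```

Read this way, raw line counts stay a poor productivity number, but the per-origin ratio starts to answer the downstream quality question.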

Lenny Rachitsky： 有意思。DORA 是你发明的，长期以来它是大家衡量生产力的主要框架。然后还有 SPACE、Core 4，可能还有别的。所以我的理解是，现在 AI 在贡献大量代码的情况下，这些框架都有些过时了。

Nicole Forsgren： 我想说的是，如果是一个规范性指标（prescriptive metric），就只能按照它所规定的方式使用。

DORA 指标的适用边界

Lenny Rachitsky： 所以——

Nicole Forsgren： DORA 4，有四个关键指标。两个速度指标：部署频率和交付周期，即从代码提交到代码部署。还有稳定性指标：MTTR（平均恢复时间）和变更失败率。如果用它们来评估流水线的速度和整体表现，那很好。但如果你试图用它们来理解……因为其中隐含了反馈回路，对吧，因为你以前主要从客户那里获得反馈。但现在我们不能在使用 AI 时盲目套用，因为反馈回路出现得更早了，而且不仅仅是在本地构建和测试阶段。我们在整个流程中都有反馈回路，甚至在流水线中间有时也有，我们真的想以以前不那么重视的方式来利用它们。不能说以前做不到，只是我们确实没有聚焦在那里。
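The four metrics named above (deployment frequency, lead time from commit to deploy, MTTR, change fail rate) can be computed from a plain list of deployment records. This is a minimal sketch, not an official DORA implementation: the `Deployment` fields, the choice of median for lead time, and the use of hours as the unit are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import median
from typing import Optional


@dataclass
class Deployment:
    committed_at: datetime                  # when the change was committed
    deployed_at: datetime                   # when it reached production
    failed: bool = False                    # did the change cause a production failure?
    restored_at: Optional[datetime] = None  # when service was restored, if it failed


def hours(delta: timedelta) -> float:
    return delta.total_seconds() / 3600


def dora_four(deploys: list[Deployment], window_days: int) -> dict:
    """Summarize speed and stability for deployments observed over window_days."""
    if not deploys or window_days <= 0:
        raise ValueError("need at least one deployment and a positive window")
    lead_times = [hours(d.deployed_at - d.committed_at) for d in deploys]
    failures = [d for d in deploys if d.failed]
    restores = [hours(d.restored_at - d.deployed_at) for d in failures if d.restored_at]
    return {
        "deployments_per_day": len(deploys) / window_days,
        "median_lead_time_hours": median(lead_times),
        "change_fail_rate": len(failures) / len(deploys),
        "mean_time_to_restore_hours": sum(restores) / len(restores) if restores else None,
    }


# Hypothetical two-day window with four deployments, one of which failed.
t0 = datetime(2025, 1, 1)
day = timedelta(days=1)
h = timedelta(hours=1)
summary = dora_four(
    [
        Deployment(t0, t0 + 2 * h),
        Deployment(t0, t0 + 4 * h, failed=True, restored_at=t0 + 5 * h),
        Deployment(t0 + day, t0 + day + 6 * h),
        Deployment(t0 + day, t0 + day + 8 * h),
    ],
    window_days=2,
)
```

As the passage cautions, these numbers describe the pipeline as a whole. They say nothing about the earlier feedback loops that AI tools introduce before a commit ever exists.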

SPACE 框架在 AI 时代的适用性

所以那些是规范性指标。而当我们想到 SPACE，SPACE 是一个框架。它不告诉你具体用什么指标。所以我想说，有时候人们会很沮丧，因为我没有告诉他们该衡量什么。但现在，我认为这正是它的力量所在。我们实际上看到 SPACE 在 AI 等新兴语境下适用得相当好，因为我们仍然想关注……SPACE 是一个缩写。我们仍然想关注满意度（Satisfaction）。我们仍然想关注绩效（Performance），即产出结果是什么。我们仍然想关注活动（Activity）。是的，在某些方面，代码行数和 PR 数量在某些事情上可能有用，或者告警数量、事件数量——活动或计数。沟通与协作（Communication and collaboration），这也超级重要和有用，因为这是我们的系统之间相互通信的方式，也是人与人之间沟通的方式。“有多大比例的工作被交给聊天机器人，而不是与团队中的高级工程师讨论？” 不是越多越好，也不是越少越好，取决于具体情况。

然后是效率与心流（Efficiency and flow），“人们能否进入心流状态？做事情需要多少时间？我们系统中的流转状态是什么样的？” 在这里，我可能会增加几个维度。所以我正在跟一些早期作者交流，想说加入信任（trust）这个维度。不是说信任以前不重要，而是现在它变得非常、非常突出。对吧？以前你构建代码，编译通过了，就没问题了。事情就是这样。但 LLM 是非确定性的。现在我们不能只是输入一个命令，猜一个结果回来就接受它。我们真的需要评估它，所以，“我们有没有看到幻觉？可靠性如何？它是否符合我们通常的编码风格？如果不符合，这样可以吗？” 所以这取决于……规范性。你必须确保按照适合的目的来使用。对吧？

信任问题与代码审查

Lenny Rachitsky： 我们稍后会谈到你目前对最佳实践的想法。你有一本即将出版的书讲如何做好这件事，我们会聊到那个。但我想强调一下我们上次聊天时你说的一点——你指出我们在 AI 方面可能面临的最大问题之一就是信任：理解并学会在多大程度上信任它生成的代码，以及有多少……你说过这话，那是两年半以前了——现在大量时间将花在审查代码而不是编写代码上。这正是我现在听到的状况。

Nicole Forsgren： 我认为观察这将如何影响我们未来的工作组织方式会很有意思。我们之前谈到心流状态和认知负荷。现在我们的注意力必须在特定时刻聚焦于某些事情，而且这种方式跟我们以前做的不同了。我认为这里确实存在真正的机会，不仅是重新思考工作流，而且是重新思考我们如何安排一天的时间、如何组织我们的工作。

深度工作与注意力结构的重塑

Lenny Rachitsky： 能再多说一些吗？具体是什么情况？你觉得会发生什么？你认为事情会走向何方？你看到哪些做法是有效的？

Nicole Forsgren： 这纯属推测。但举个例子，Gloria Mark 在注意力和深度工作方面做了一些非常好的研究，人类每天大约能做四个小时的高质量深度工作。差不多就这么多。

Lenny Rachitsky： 是的，我有同感。

Nicole Forsgren： 这基本上是大多数人的上限了，我确定会有人说，“但我超乎常人，我可以做到——”

Lenny Rachitsky： 要是你服用 20 克肌酸呢？

Nicole Forsgren： 对。要是我们微量用药呢？

Lenny Rachitsky： 哈哈，没错。

Nicole Forsgren： 是的。所以在知道我们大约有四个小时高质量深度工作的前提下……我确定我们很多人大概都有过这种体验，对吧？我们有状态好的时段。也许是上午，对某些人来说是下午。然后你会到达一个时间点，心想，“我要去清理收件箱了，因为这是我目前唯一能做的事。我可以保持基本运转，但我没法产出最好的创新性工作、问题解决、写作或编码工作了。” 很多时候，要做到这一点并进入状态，需要有大段连续的时间来进入心流、完成深度工作。通常是两个小时左右。一个小时可能比较勉强，因为进入那种状态本身就需要时间。

好，那我们想想以前是什么情况——回到”过去”，三年前、三年半前——我们可以划出四个小时的整块时间，大概能完成两到三个小时真正高质量的工作。因为我们就是专注的，对吧？没有中断，或者说中断很少。

而现在，编写代码和构建系统这件事本身的性质就是中断驱动的，或者至少充满了中断，因为你开始做一件事，然后它就会插入打断。那我们该怎么看待这件事？这是否意味着四个小时的工作块仍然有用？很可能。但这是否也意味着现在 45 分钟的工作块也能派上用场了？因为进入心流这件事，至少在一定程度上被交接给了机器，或者说机器可以通过提醒我们上下文、生成系统图等等来帮助我们重新回到心流状态。所以我觉得这是一个非常、非常有趣的领域，充满了问题和机遇。拜托各位，去做这些研究吧，然后把结果告诉我，因为……它可能排不进我的研究清单，但这确实是一个极好的问题。

工程师变成了 AI 的管理者

Lenny Rachitsky： 这太有意思了。基本上，每个工程师都在变成 EM——工程经理——在协调所有这些初级 AI 工程师。所以你的观点是，即使你只有 30 分钟的时间块，你也可以深度进入代码，同时还能为那些跑去做各种任务的 AI 工程师扫除障碍。而且，你的观点是它们还能提醒你，“这是你上次做到的地方。好，你可以直接跳进这段代码，可能做一些调整。”

Nicole Forsgren： 没错。

Lenny Rachitsky： 太有意思了。

为什么公司应该关注开发者体验

Lenny Rachitsky： 让我把视角拉远一点。在我们讨论你关于如何推进开发者体验的框架和最新思考之前——显然工程师能做得更多是好事——但你最有力的话术是什么？为什么公司应该真正、真正地把重心放在开发者体验上？

Nicole Forsgren： 我不太想说”投资回报率”这个词，但商业价值……这里的机会是巨大的。总体来说，我们写软件有时候是为了好玩、当作爱好，但我们拥有软件更根本的原因是它满足了一个业务需求。它帮我们获取市场份额，帮我们吸引和留住客户，帮我们做所有这些事情。我认为开发者体验之所以重要，是因为它是所有这些软件创造的基础，它支撑着所有这些问题解决的工作。它使得与客户进行超快速实验成为可能——以前你可能需要一段时间来做原型，再花更长时间才能在生产系统上跑 A/B 测试。现在你几个小时就能搞定。

一个本周就能做的行动

Lenny Rachitsky： 也许换个完全不同的角度，说得非常具体、非常战术化——在我们进入更大的框架之前——你觉得一个工程团队、一个产品团队在本周或下周可以做的一件事是什么？来改善他们的开发者体验，也许让他们能做成更多事情？

Nicole Forsgren： 说实话，我觉得你能做的最好的事情就是去找人聊天，然后倾听。我很高兴这个播客的听众主要是产品经理，因为他们通常很擅长这件事。我想说的是，从倾听开始，而不是从工具和自动化开始。很多次公司会说，“好吧，我就去建一个工具，“或者，“我要去做这么一个东西。“通常你构建的都是你自己曾经遇到过的困难，或者是容易做、容易自动化的东西。但如果你只是去找人聊聊天，问开发者，“回想一下昨天，你昨天做了什么？带我走一遍。哪些环节让你觉得很愉快？哪些环节真的很困难？你在哪里感到了挫败？你在哪里被拖慢了？哪里有摩擦？“如果你去找一小群人聊聊，很多时候你能挖掘出一些相对低成本但仍然有影响力的事情，或者你能识别出一个不必要地复杂和缓慢的流程。

Lenny Rachitsky： 所以我听到的”倾听”几乎就是——如果你想帮助你的团队跑得更快、成为更快乐的工程团队——你的建议就是，“在做任何事情之前，先去问他们什么在困扰他们。”

Nicole Forsgren： 去问他们，没错。相信我，大多数开发者都非常乐意告诉你什么东西坏了、什么东西不好用。我想说，我曾经合作过一家公司，我记得他们有一个流程非常麻烦，运行在一个老旧的主机系统上，如果要重新平台化整个东西需要很大投入，所以他们一直没有去碰它，也不去谈论它。所有人都讨厌这个流程，因为它造成了巨大的延迟。其实他们只需要改一下流程。有时候你只需要改一个流程。他们的改动是——本来需要有人把它打印出来，走三四层楼梯去拿审批，然后另一个人再走楼梯送回来，所以整个中间环节就是这样。他们没有重新平台化任何东西，没有做任何重大的重新设计，他们只是发了一封邮件。

最常见的改进

Lenny Rachitsky： 让我在这方面再追问一下。我很好奇大家最常做的事是什么。如果你刚开始说，“好，我们需要关注工程体验，“你觉得公司最需要做的两三个最普遍的改进是什么？

Nicole Forsgren： 我想说，我还是会回到流程这个话题上——几乎总有一个流程是可以改进的，而且可以在不需要大量工程投入或大量工程人力的前提下改进。尤其是大多数大公司，都有某个东西是好几个、好几个步骤的。它之所以这样是因为”一直都是这样的”，但它已经不必再是这样了。即使是小公司，有时候也太随意了，你不知道流程是什么，到处追着人跑。如果你能创建一个非常轻量的流程，那也会很有帮助。这可能是最好的起点之一，特别是当你对组织的其他部分了解有限的时候。有时候仅仅一个团队级别的流程就能起作用。

从业务领导者的角度来说，你能做的很多事情是为这种组织变革提供结构和支持。沟通你们在做什么，沟通优先事项是什么，沟通为什么这很重要，庆祝胜利。因为如果大家只是把它当作一个一次性的、完全孤立的边缘项目来做，很难建立起好的势头，很难让人在意并持续参与。因为它感觉就像又一个不会有什么影响的内部项目，或者不会被重视的内部项目，但它对业务有着巨大的潜在回报。

Lenny Rachitsky： 有意思的是，我在这里听到的内容没有一个是关于工具或技术的。不是说迁移到这个云平台，不是说安装这个新的部署系统——而是流程、人员、组织和士气。

Nicole Forsgren： 是的。当然，会有一些技术层面的东西非常重要，特别是现在有了 AI，我们正在重新思考构建和测试系统的工作方式。我们正在重新思考给用户的反馈机制，让反馈在共享什么内容以及何时共享方面变得非常、非常定制化。有很多技术层面的问题是需要涉及的，但这不是唯一的事情。技术是必要的，但不是充分的，而且它不一定是你的起点。

如何判断团队是否足够快

Lenny Rachitsky： 我有一个难题想问你，是你刚才说的时候我想到的。我觉得这是大多数创始人和负责人都在思考的问题。问题就是：我怎么知道我的工程团队跑得够不够快？他们能不能更快？他们是不是没有发挥出最好的水平？有没有一些信号或迹象告诉你，“嗯，我的团队应该能跑得更快，“对比”这就是正常的状态，这就是他们能达到的最快速度了”？

Nicole Forsgren： 大多数团队都能跑得更快，对吧？另外，鉴于我们对认知负荷的了解，并不是所有的提速都一定是好事。或者说一旦达到某个临界点，收益就会比较有限，而大多数人甚至离那个点都还很远。坦白说，我不认识任何一个已经达到那个点的团队。那你怎么判断呢？如果你总是听到关于构建失败、不稳定的测试、过长的流程，如果你需要申请一个新系统或需要配置一个新环境非常困难，或者切换任务、切换项目非常非常难。如果有人有机会去组织的另一个部分工作，但因为一些说不清楚的原因而没去——不是政治原因，而且大家都在抱怨系统——那通常就是一个很好的信号，说明某个地方存在摩擦。

Nicole Forsgren： 因为一旦你终于搞明白了你的系统，能够开展工作之后，切换到其他地方的成本往往会非常、非常高。所以有时候人们会选择留在原地。但我曾与一些公司合作过，在那几家公司里，要在内部不同组织之间切换，你基本上要支付和新人入职一样的”税”，因为系统差异如此之大，充满了摩擦，做很多事情都非常困难。

Lenny Rachitsky： 我特别喜欢你回答的前半部分，就是你总是可以跑得更快。我想每个创始人都会爱听这句话。不过按照你说的，收益会随时间递减？

Nicole Forsgren： 是的。而且你不知道质量如何，对吧？所以我认为另一面是，你总是可以跑得更快，但更快是为了什么？我们是否在做正确的商业决策？我觉得这正是 PM 需要发挥作用的地方。我们可以每天源源不断地生产垃圾。我们需要战略和真正明智的决策，来知道该交付什么、该实验什么、功能以什么顺序做、怎样逐步推出。战略是核心，然后再考虑加速。如果我们没有把其他部分安排好，那就是垃圾进，垃圾出。

Lenny Rachitsky： 我想顺着这个话题继续聊，但在此之前，让我先复述一下你刚才分享的内容。所以，团队效率有待提升的信号是……构建总是在失败，不稳定的测试频繁出现误报，在不同项目之间切换上下文很困难，你听到人们在抱怨系统，说用起来真的很痛苦。大致是这样吗？

Nicole Forsgren： 是的。

Lenny Rachitsky： 好。那回到你刚才提到的观点，现在有一种感觉是 AI 正在让团队变得更快，因为它在替他们写代码。你会有各种异步 agent，有工程师在为你工作。但你传达的一个核心信息似乎是，这只是工程工作的一部分，还有太多其他的工作，包括确定要构建什么……以及内部的对齐。也许你可以谈谈……工程绩效和生产力确实有很大的提升空间，但还有很多其他要素是 AI 无法改善的？

AI 如何融入工程战略与实验

Nicole Forsgren： 是的。或者说将来有可能，对吧？我认为有很多方式可以让 AI 工具帮助我们优化战略、打磨信息，思考实验方法或实验目标，或者思考我们的总可寻址市场。但我们需要把战略和计划对齐得比较好，至少要有两三个想要测试的备选方案。因为现在，工程的推进——尤其是原型制作——可以快得多。我们可以快速推出原型，可以运行任何面向客户的测试和实验，前提是基础设施已经到位，这使我们能够比以前更快地学习和推进。在过去有些地方，要把一个东西推过生产环境做 A/B 测试并获取反馈可能需要几个月。现在我们一两天就能完成，一周之内肯定能搞定。但我们要确保我们在构建和测试正确的东西——“我们是否与正确的伙伴合作……我们是否有所需的数据？”

我想说，AI 实际上可以成为一个很好的伙伴，如果你跟它有很好的对话，然后再去和你的专家确认——“我应该关注什么类型的数据？我需要什么样的埋点？我可以做什么样的分析？“因为这样你就可以去找数据科学团队说，“我打算做这件事，我希望……”我们不要只是凭冲动去做 A/B 测试，因为这样做一次大规模测试可能会扰乱用户或客户，或者破坏隐私或安全协议，而且最终得到的数据可能根本无法使用，因为你无法从中提取你要找的信号。但现在，我也看到人们把这个流程从几周压缩到了几天。所以他们可以在与关键利益相关者讨论时，从一个更加了解情况、更加充分准备的状态出发。

AI 带来的生产力提升有多大

Lenny Rachitsky： 我很喜欢你能接触到许多不同的公司和不同类型的业务。我觉得很少有人能看到这么多不同组织的内部。就 AI 带来的生产力提升而言，你观察到了什么样的收益？你看到的增幅有多大？

Nicole Forsgren： 我会说这个提升是真实的，但我也想说我们还没有很好的衡量方式。我们仍在摸索该测量什么以及它看起来是什么样子。最好的衡量指标之一将是速度，贯穿整个系统的速度——你能多快地把一个功能或产品从系统中推出去，然后进行实验测试，从想法到最终交付，甚至是系统中的某个功能片段，以便我们能够测试。这非常好。不过，这也很难直接回溯到某个特定开发者手中的某个特定 AI 工具。但还有其他一些我们可以观察的东西，我确实看到的，还是这种快速原型制作。

我讨厌代码行数，但我还是要用代码行数来说。我们确实看到……我知道我与一些人合作过，他们观察了一组公司，发现 AI 为经常使用它的人生成了显著更多的代码。但他们还发现，对于经常使用 AI 编码环境——AI ADE 的用户，工具给了他们更多的代码。然后工程师自己产出的代码增量，是编码 agent 所提供增量的两倍。所以，我想说，这可能是一种间接的、连锁的效应，或者说一种迹象——它可以帮你解除阻塞。它可以加速你本来就要做的工作。我知道有时候我工作时，最初的几分钟很难开始。但一旦开始了，我就进入状态了。所以 AI 真的很擅长帮你解除阻塞、打开局面。

AI 在调试中的能力

Lenny Rachitsky： 我看到人们在 Twitter 上分享，OpenAI Codex 在查找那些非常棘手的 bug 方面有多么出色。我记得好像是 Karpathy 分享的经历——他被一个 bug 卡住了，没有任何 AI 工具能解决。然后最新版的 Codex 花了大概一个小时去排查，帮他找到了问题所在。

Nicole Forsgren： 是的，我也听到了很多类似这样不可思议的故事。还有编写单元测试、快速搭建测试套件，以及创建文档和整理文档——因为我知道现在很多人会说，“哦，既然有了 agent，我就不需要读文档了，因为代码就在那里。” 但事实证明，agent 依赖优质的数据，因为一切取决于它们的训练方式或基础数据的支撑。更好的数据带来更好的结果，而这些数据中就包括文档和注释。文档和注释写得越好，你的 AI 工具表现就越好。

Lenny Rachitsky： 而且 AI 可以帮你写这些文档。我最近在用 Devin，它在这些方面真的很擅长。

Nicole Forsgren： 是的。

《Frictionless》新书介绍

Lenny Rachitsky： 好，我们来聊聊这个框架、这本书。你正在出版一本叫《Frictionless》的书，听起来像是一个梦想——“如何打造一个无摩擦的开发团队？” 全名是《Frictionless: 7 Steps to Remove Barriers, Unlock Value, and Outpace Your Competition in the Age of AI》。里面有一个七步流程。请带我们走一遍，也可以先介绍一下这本书的背景——它是写给谁的，解决什么问题，然后是那七个步骤。

Nicole Forsgren： 我想说，这本书是我和 Abi Noda 一起写的——他是 DX 公司的创始人，在这个领域有着非常丰富的经验，与数百家公司合作过，所以和他碰撞想法非常有收获。同时，也要感谢所有与我们交流过的工程负责人、DevEx 负责人、CTO 和工程师们，他们帮助我们确认了判断的方向是否正确。那么，这本书是写给谁的——

关于 DX 公司的收购

Lenny Rachitsky： 既然你提到了，让我岔开一下聊聊 Abi 和 DX。这件事非常有趣，而且和我们这次对话的主题直接相关。Abi 创办了一家叫 DX 的公司——对于一家做开发者体验的公司来说，这个名字太棒了。他们刚刚以十亿美元的价格把公司卖给了 Atlassian。相对于他们的 ARR 来说，这是一个非常高的倍数。对我来说，这恰恰说明了我们这次对话为什么如此有价值——企业在改善开发者体验上投入的价值有多高。Atlassian 愿意花十亿美元来收购。这是一家尚处早期阶段的创业公司，发展得很好，用户也很喜欢，但仍然属于早期阶段——十亿美元。背后的逻辑是，他们有大量的客户在使用 Jira 和他们的各种产品，这些客户都在想办法如何衡量生产力。这对他们来说价值巨大。而且我知道你也是他们的早期顾问——

Nicole Forsgren： 是的。

Lenny Rachitsky： 所以这恰恰说明了这件事有多重要。

Nicole Forsgren： 是的。我觉得这也说明了你能从中获得多大的价值。有太多的低垂的果实，太多的未释放的潜能，而很多时候你甚至不知道该从哪里开始。即使在我待过的大公司里，有很多专家和非常非常聪明的人，但如果你没有深入过这个领域、没有用这种方式思考过，就很难知道从何入手，或者很容易在初期犯一些简单的错误，导致后来不得不推倒重来。所以我想这也引回了那个问题——“这本书是写给谁的？” 它面向所有关心 DevEx 的人：技术领导者当然是核心读者，任何正在启动 DevEx 项目或推动 DevEx 改进计划的人也很适合。我认为对 PM 来说尤其相关，因为如果你管理的产品涉及软件开发和构建，改善 DevEx 只会对你的团队有帮助。而且，PM 拥有一些关键的能力、洞察和直觉，这些对 DevEx 来说非常重要，而我经常看到工程团队恰恰缺失这些东西。

七步框架详解

Lenny Rachitsky： 好，那框架是什么？步骤有哪些？人们应该从哪里开始？

Nicole Forsgren： 这本书提供了一个七步流程，最后还提供了一些关键原则。第一步是启动旅程。假设你正在起步，可以开始这段旅程。这一步包括我们之前谈到的一些内容——去和人交谈，做一轮倾听，综合你听到的信息，可视化工作流程和工具，摸清当前的现状。第二步是获得快速胜利。从小处着手，取得一个快速胜利，挑选正确的项目，分享你做了什么。第三步是用数据优化工作。建立数据基础，找到已有的数据，开始收集新数据，用一些调研来获取快速洞察——书中还包含了一些调研示例。第四步是决定策略和优先级。当你有了一些数据之后，需要知道在所有可能存在问题的事情中——如果你已经拿到了快速胜利，在剩余的事情里——“我接下来应该做什么？” 我们在这里介绍了一些评估框架。

第五步是推销你的策略。一旦你做出了决定，现在要让其他人也信服。所以你要收集反馈，说明为什么现在这是正确的策略。第六步是在你的规模上推动变革。这里我们针对不同范围的控制权进行了讨论。如果你只是在某个开发团队内部，想自己做这件事，属于自下而上的努力；如果你是开发者体验副总裁之类的角色，则是自上而下的范围，有些东西你可以利用；如果你处于中间位置，也可以结合两种策略来推动变革。第七步是评估你的进展并展示价值，然后循环回去。

我想说的是，我们写这本书的方式允许你从当前所在的任何步骤切入。如果你正在启动一个团队或一项新计划，你可能应该从第一步开始——你绝对应该从第一步开始。如果你加入的是一个已有的计划，可以直接跳到选择优先级或实施变革的步骤。这就是七个步骤。除了七个步骤之外，我们还推荐了一些实践，包括资源配置、变革管理、让技术可持续发展，以及用 PM 的视角来看待这个问题——“我们如何把开发者体验当作一个产品来思考，如何把我们手中的指标当作一个产品来思考？”

Lenny Rachitsky： 太好了。我有问题想问。先告诉大家怎么找到这本书。网址是什么？怎么购买？什么时候出版？

Nicole Forsgren： 网站是 developerexperiencebook.com。现在可以注册邮件列表，预购开放时我们会通知你。我们还会分享配套工作手册的部分内容——我们有一份将近一百页的配套工作手册，书应该会在年底前出版。

开发者体验 vs 开发者生产力

Lenny Rachitsky： 好。其中有一点是，“开发者体验”这个说法感觉是非常刻意的——它不是”开发者生产力”，也不是”开发者工作”，而是”我们如何让开发者在我们公司的体验变得更好”，这包括他们能完成更多工作，也包括他们更快乐等等。所以我觉得这是一个很重要的元素，对吧？

Nicole Forsgren： 是的，完全正确。

Lenny Rachitsky： 好的。

Nicole Forsgren： 因为，再说一次，这不只是关于生产力的问题。我们之前也从”我们需要构建正确的东西”这个框架和视角讨论过这一点。你当然希望高效，但你同时也需要思考……而这恰恰也是工程师们极其擅长的事情——给他们一个问题，不要告诉他们怎么解决，他们反而能解决得更好。他们拥有自由、创新和创造力，因此能够解决这些问题。如果仅仅关乎生产力，那就只是代码行数、PR 数量之类的东西了。但我们真正想讨论的是价值——如何释放价值，如何更快地获取价值。这当然包括提升他们的生产力、减少摩擦，因为这样他们才能获得心流状态、降低认知负荷，以及我们之前讨论过的那些要素。

Lenny Rachitsky： 很好。那如果说有人想组建这样的团队，通常是什么样的？我记得在 Airbnb 时见过这样的团队成形，最初就是一两个工程师牵头启动、承担责任。你对初始团队有什么建议？后续扩展又是什么样的？

Nicole Forsgren： 有几种做法。如果是自己动手的话，可以搭配几个工程师，也许再加一个 PM、PGM 或 TPM 来协助沟通。因为在这里，沟通计划真的非常重要。在小规模上，我们要做的是寻找那些快速见效的东西，寻找能在小范围内做到的改进。有些人称之为”纸割伤”式的问题——那些你能做的小事，帮助人们看到价值、亲身体验到好处：开发者的工作怎么变得更好？日常体验怎么改善？从这里开始逐步积累势头。如果你是在自上而下的结构中工作，并且有授权，你同样需要快速见效，但这些见效可以是在更大范围上的，因为你有基础设施或有后台支撑，能够做出不只是局部层面的不同类型的改变。

局部改进与全局改进的例子

举个例子，一个小的局部改进可能就是清理你的测试、你的测试套件。任何团队都能做这件事，任何团队都可以。而在更大范围上，可能是改变一个组织级流程——那些过于繁琐的流程——或者投入一些资源去清理配置环境。

Lenny Rachitsky： 好的。你看到过这种团队成形后对公司工程团队产生了什么样的影响？

Nicole Forsgren： 我可以说，对于小公司我看到了巨大的影响，对于大公司则是数十万甚至数十亿美元的量级。当然，我们也需要学会如何传达这些成果——“数学是什么样的？“很多时候，我们可以看节省的时间，可以看节省的成本，可以看很多不同的东西。我们可以把价值交付的速度看作抢占市场的速度，可以看风险降低——但收益确实是实实在在存在的。我想提一下，它通常遵循类似 J 曲线的形态。一开始你会取得几个快速见效的成果，看起来像是巨大的胜利，然后你会遇到一个小低谷——因为那些显而易见的项目、那些低垂的果实已经被处理完了。现在我们需要做一些更深入的工作，可能需要搭建更多基础设施，可能需要建设更多遥测能力，以便捕获我们想要捕获的数据。而一旦完成了这些，我们就开始看到那些收益真正地复利增长。

Lenny Rachitsky： 那回到那个量化数字的话题，你有什么建议？人们怎么找到这些数字？因为我觉得这件事的力量很大程度上就在于此——“我们通过做这件事省了一百万美元。“你用什么指标来得出这个结论？

面向不同受众的沟通方式

Nicole Forsgren： 我觉得有几件事需要记住，比如我们的关键受众是谁，而我们通常有几个关键受众。我们非常需要能够与开发者对话，因为他们是使用这些系统的人。他们会与你们合作——要么参与构建，要么至少提供关于你正在做的事情的反馈。所以对他们来说，我们通常要用他们关心的事情来表达。比如节省时间——如果某个环节变快了，他们就能省下时间。他们不再需要花时间做不必要的配置工作，这跟减少重复性劳动是相关的。合规和安全也非常重要，而且很多时候它们需要好几个手动步骤……我不是说这些没有价值，而是从个人角度来看它们不产生增值。如果我们能尽可能自动化，那就太好了——还有改善专注时间。

从开发者这边来看是这样。领导层关心的……他们也关心这些事情，但通常更关心其他的东西。所以我们通常可以谈以美元计算的成本——“我们能加速收入吗？我们的价值交付周期是什么样的？我们的速度如何？我们能多快从客户那里获得反馈？“对于处于高度竞争环境中的组织来说，这可以非常有说服力，因为一切都在拼速度。我们可以谈省钱，这里可以看量化的节省。一个例子是测试和构建。如果我们能清理测试和构建套件，对开发者来说，他们真正想听到的是节省了时间和更可靠的系统。重复性劳动减少了，因为他们不需要反复重跑测试或者去清理测试套件。

量化收益的方法

从业务角度来看，清理测试和构建套件可以节省云成本，因为所有这些测试都在云上某处运行。如果它们总是失败，或者只是在浪费支出，那这里就有用武之地了——可以回收一些容量。我们总是可以谈时间和生产力的收益——“我们在那些不一定产生增值的事情上损失了多少等效的开发者时间？“然后有时候我们可以与业务成果做关联——关联通常是我们在这里能做到的最好的程度——但在加快价值交付速度与增加市场份额之间，可以有一些非常有说服力的关联关系。
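The two business-side arguments above, developer time lost to non-value-add work and cloud spend burned by failing test runs, reduce to simple arithmetic. The sketch below shows one way to frame it. Every input (head count, hours lost, loaded hourly cost, run counts, cost per run, working weeks) is a hypothetical placeholder, not a figure from the interview.

```python
def annual_toil_cost(
    developers: int,
    hours_lost_per_dev_per_week: float,
    loaded_hourly_cost: float,
    working_weeks: int = 46,  # assumed working weeks per year
) -> float:
    """Yearly cost of developer time spent on reruns, waiting, and cleanup."""
    return developers * hours_lost_per_dev_per_week * working_weeks * loaded_hourly_cost


def annual_wasted_ci_spend(
    runs_per_month: int,
    flaky_failure_rate: float,  # share of runs that fail for non-code reasons
    cost_per_run: float,
) -> float:
    """Yearly cloud spend on CI runs that fail only because the suite is flaky."""
    return runs_per_month * flaky_failure_rate * cost_per_run * 12


# Hypothetical organization: 50 developers each losing 2 hours a week.
toil = annual_toil_cost(developers=50, hours_lost_per_dev_per_week=2, loaded_hourly_cost=100)
ci_waste = annual_wasted_ci_spend(runs_per_month=10_000, flaky_failure_rate=0.05, cost_per_run=0.5)
```

Numbers like these are estimates for a conversation with leadership, and any link to market share remains a correlation, as noted above.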

如何衡量 AI 工具对生产力的影响

Lenny Rachitsky： 让我顺着这个线索继续，回到我认为当下人们对 AI 和生产力最大的疑问上——我觉得还没有人有答案，但我很想听听你的看法：人们现在应该怎么做？理解 AI 工具对生产力的影响的最佳方法是什么？因为他们在这些工具上花了很多钱。我不确定，我们到底从中得到了什么？感觉事情确实变快了，但我不确定。如果有人必须说，“好吧，以下是我大概应该尝试做的事情，“对于衡量 AI 工具对生产力的影响，你最好的建议是什么？

Nicole Forsgren： 我会说，要看情况。部分取决于你的领导层真正关心什么。我们通常很擅长弄清楚什么对开发者重要，并且能够把这些传达给他们。但如果我们只是想确定两三个数据点来真正聚焦——因为当我们刚开始使用数据的时候，有时候会有挑战——他们关心什么？想想你一直在听到的话语。他们一直在谈论市场份额吗？是失去市场份额还是市场竞争力的下降？如果是这样，就聚焦于速度。想想如何捕获从功能到生产环境、或从功能到客户、或从功能到实验的速度指标，以及那个反馈循环是什么样的。如果他们一直在谈论利润率……

Nicole Forsgren： 我们总是会谈到钱，因为这是商业。但如果这似乎是一个贯穿始终的主题，那就寻找省钱的方法，然后把它转化为回收的人头成本。或者有时候你会重新改造、变更某个流程，然后就不再需要那么多供应商了。所以供应商支出的减少也能在这方面有所帮助。我说”看情况”也是因为，有时候领导层会说一些话，然后某种主题就会浮现出来。如果你能解决他们面临的问题，或者他们正在关注什么，甚至可以稍微调整一下表述框架——比如如果他们把什么都叫”开发者生产力”，那你也叫它生产力。如果他们叫”速度”，而速度对他们来说很重要，那就想想怎么用速度来表述这件事。如果他们在谈论转型或颠覆，那这件事怎么帮助实现颠覆？因为这样才会与他们产生共鸣。我们不想让他们费力气去理解我们在做什么、我们提供的价值是什么。

Lenny Rachitsky： 这个建议太好了。让我复述一下，这里的建议是，如果你的公司想要搞清楚 AI 工具对公司有什么影响，首先要做的就是——公司最关心什么？领导层最关心什么？可能是市场份额，可能是利润率，可能是速度。我们需要更高的速度，或者我们需要转型。所以你的建议是，根据你听到的话语和措辞来判断他们关心什么。然后找到衡量那些东西的方法，衡量市场份额增长的方法，衡量利润率提高的方法。我很喜欢你举的这些例子，比如从功能、想法到生产环境或到实验的时间，也许可以开始追踪这个。如果是利润率，那就是通过更少的测试失败或某些供应商不再需要付费来节省的钱，诸如此类。而速度，我猜那就是 DORA 派上用场的地方了，就是工程交付的速度……关于速度你会怎么看？

Nicole Forsgren： 我会说，这其实是那种……我会选择尽可能宽泛的跨度。如果你能衡量从想法到客户或从想法到实验的过程，这需要多长时间？通常需要多长时间？可以多快？在使用 AI 工具改进并减少摩擦之后，现在需要多长时间？这里我要说，我们在书中也讨论了一点，就是我们如何应对归因挑战？是什么导致了这个结果？是 DevEx 还是 AI？大胆地公开说明这一点。说，“是的，我们推出了 AI 工具。我们同时也有 DevEx 方面的改进。它们紧密配合在一起。”两者可能都有贡献，对吧？如果我们只有 AI 工具而没有 DevEx 改进，可能也会有一些提升，但远不如现在这么多。
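The widest-span measure described above, time from idea to customer or idea to experiment, can be tracked with nothing more than two dates per item. This is a rough sketch under stated assumptions: the `(started, shipped)` pairs are hypothetical, and the median is one reasonable summary among several. It deliberately does not try to separate the AI effect from the DevEx effect, in line with the attribution caveat in the passage.

```python
from datetime import date
from statistics import median


def median_cycle_days(items: list[tuple[date, date]]) -> float:
    """Median elapsed days between an idea being started and reaching customers."""
    return median((shipped - started).days for started, shipped in items)


# Hypothetical samples from before and after the tooling and DevEx changes.
before = [
    (date(2025, 1, 1), date(2025, 1, 31)),
    (date(2025, 1, 10), date(2025, 2, 19)),
    (date(2025, 2, 1), date(2025, 2, 21)),
]
after = [
    (date(2025, 3, 1), date(2025, 3, 4)),
    (date(2025, 3, 3), date(2025, 3, 8)),
    (date(2025, 3, 10), date(2025, 3, 12)),
]

baseline_days = median_cycle_days(before)
current_days = median_cycle_days(after)
speedup = baseline_days / current_days
```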

从零开始衡量开发者体验

Lenny Rachitsky： 如果人们今天就要开始做这件事，比如他们说，“我想开始衡量开发者体验”，有没有两三个基本上每个人都需要的指标，应该立刻开始衡量的？

Nicole Forsgren： 如果你今天刚开始，如果你什么都没有，显然先去跟人交流。之后，我会做问卷调查，因为问卷可以快速给你一个整体的全景视图，让你知道大的挑战在哪里。我这么说是因为，如果你刚开始，你的系统中可能还没有埋点，没有各种指标。即使你已经有了，也可能不是你真正想要的。没有明确目的而设计的指标，存疑。为其他目的设计的指标，可能对你的需求有用，但也可能没用，所以我们不能想当然地认为自己已经有了。这是我喜欢问卷的原因之一，我们在书中也提供了一个示例。你只需问几个问题：“你的满意度如何？你生产力最大的障碍是什么？“或者”完成工作最大的挑战是什么？“然后让他们从一组工具或一组流程中选择，让他们选三个，就三个。

在这三个中，这对你影响的频率如何？是每小时？每天？每周？还是每季度？因为有时候它每天都困扰你，你为此很恼火。有时候它一个季度才出现一次，因为那是季度末，但负担特别重。然后可以加一些开放性的文本，比如”还有什么我们应该知道的？“这能给你带来非常有价值的信号，因为让人们优先排列前三件事……如果让他们全部都选，数据会变得非常非常混乱。但选三件事再加上频率，你就可以得出一个分数，或者如果你愿意的话一个加权分数，然后去深入挖掘——那些数据应该在哪里？我们需要什么数据？而且，这样你至少有了一个基准。它会是一个主观的基准，但你现在知道了最大的挑战是什么。
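The survey shape described above (pick your top three barriers, then say how often each one hits you) lends itself to a simple score. The sketch below counts picks and adds a frequency-weighted total. The weights in `FREQUENCY_WEIGHT` are an arbitrary illustration, not values from the book or the interview, and a real survey might weight a rare but heavy quarterly burden differently.

```python
from collections import Counter

# Assumed weights: more frequent pain scores higher. Purely illustrative.
FREQUENCY_WEIGHT = {"hourly": 4, "daily": 3, "weekly": 2, "quarterly": 1}


def score_barriers(responses: list[dict[str, str]]) -> list[tuple[str, int, int]]:
    """Rank barriers by weighted score; each response maps barrier -> frequency."""
    counts: Counter = Counter()
    weighted: Counter = Counter()
    for picks in responses:
        if len(picks) > 3:
            raise ValueError("each respondent may pick at most three barriers")
        for barrier, frequency in picks.items():
            counts[barrier] += 1
            weighted[barrier] += FREQUENCY_WEIGHT[frequency]
    # Highest weighted score first; ties broken alphabetically for stable output.
    ordered = sorted(weighted, key=lambda b: (-weighted[b], b))
    return [(b, counts[b], weighted[b]) for b in ordered]


# Hypothetical responses from three developers.
ranking = score_barriers(
    [
        {"flaky tests": "daily", "slow builds": "hourly", "approvals": "weekly"},
        {"flaky tests": "daily", "provisioning": "quarterly"},
        {"slow builds": "daily", "flaky tests": "weekly"},
    ]
)
```

The result is a subjective baseline: it shows where to dig for system data next rather than replacing that data.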

Lenny Rachitsky： 我很喜欢这一切最终都回到最基本的一步——先去跟人交流，问他们这些问题。这和产品管理以及打造优秀产品非常相似：你跟你的客户聊过吗？每个人都觉得自己在这么做，但大多数人做得不够。

问卷调查设计的注意事项

Nicole Forsgren： 我还要说，在获取数据时有一个挑战。访谈是数据，这很重要，问卷则更加量化，因为我们可以把它转化为计数。但这正是我们需要小心的地方。很多人去写问卷问题时会说类似这样的话：“过去一周构建和测试系统是否缓慢或复杂？“你在这里问了四个不同的问题。如果有人回答”是”，那是构建的问题？还是测试的问题？是慢？还是不稳定？还是复杂？还是别的什么？所以要理清你实际获得的信号是什么会非常困难。所以，花时间与熟悉问卷设计的人聊一聊，与 Claude 或 Gemini 或 ChatGPT 讨论一下：“这是我的问卷问题。你能帮我提一些吗？“然后确保你进行几轮迭代。这是一个好的问卷问题吗？从获得的数据中我能回答什么问题？我能解决什么问题？如果你不能用数据回答一个问题，就不要收集它。

Lenny Rachitsky： 你的书里有示例问卷，给那些想直接复制粘贴、不想花太多心思的人。

Nicole Forsgren： 是的，示例问卷，很多示例问题。我们甚至推荐了格式、流程应该是什么样的，应该多长，不应该多长。

满意度 vs 幸福感

Lenny Rachitsky： 我在读你的内容时注意到一点，你不太喜欢”幸福感”问卷，就是问工程师他们有多幸福，这是真的吗？如果是的话，为什么？

Nicole Forsgren： 我不喜欢，确实。我想说我不喜欢幸福感问卷，因为影响幸福感的因素太多了。幸福感涵盖的范围太广了。幸福感来自工作，来自家庭，来自爱好，来自周末，幸福感……有太多东西构成幸福感。但这不意味着我不关心幸福感。我认为幸福感问卷在这里并不是特别有用。有帮助的是满意度，人们会说“这不是一回事吗？”并不是，因为你可以问“你对这个工具满意吗？”然后再问一些后续问题。这两者是相关的，因为你对工作、工具、你所做的事情和团队的满意度越高，就越有助于幸福感。我以前开玩笑说……记得那个加州奶酪广告吗，“快乐的奶牛产好奶酪”？

Lenny Rachitsky： 不记得。

Nicole Forsgren： 我用过加州奶酪广告。那个最经典了。快乐的开发者写出快乐的代码。他们写更好的程序，做更好的工作，成为更好的团队成员和合作者。但是去捕捉并试图直接影响幸福感，这不是我们要做的事。这太难了，涵盖面太广了。满意度能给我们一些有用的信号。

用产品思维做 DevEx 改进

Nicole Forsgren： 我觉得总体来说很重要的一点是，把产品思维带入任何类型的 DevEx 改进中，也包括我们收集和采集的指标。我的意思是，我们要识别一个问题，确保我们在为一群用户解决一个问题。我们要考虑创建 MVP 和实验，获取快速反馈，进行快速迭代。我们要有策略。我们要知道我们的目标用户是谁。我们要知道怎样才算成功。我们基本上需要有推向市场的功能。我们需要有沟通机制。我们需要持续从客户那里获取反馈。我们想要不断改进。而且到了某个阶段，我们要考虑淘汰某些东西。它是不是已经进入维护模式了？是不是该日落了？

我觉得这在一般情况下很重要，但现在尤其重要，因为当我们使用 AI 工具、将 AI 嵌入到产品中时，一切都在快速变化，所以花半拍时间停下来想一想很关键：“好吧，我现在要解决的问题到底是什么？这个用了十年的指标还有意义吗，还是应该淘汰它，因为它已经不重要了？它已经不能推动我所需要的那种决策和行动了。“

AI Corner：用 AI 做家居设计

Lenny Rachitsky： 在进入精彩的快问快答之前，我想带大家进入 AI Corner，这是本播客的一个固定环节。你有没有在生活或工作中发现某种 AI 工具的用法，觉得分享出来可能很有趣，对其他人也有用？

Nicole Forsgren： 我最近在做家居设计，重新装修房间什么的。我在跟一个设计师合作，因为我知道自己喜欢什么，但不知道怎么实现，我不擅长这个。但我真的很喜欢用 ChatGPT，尤其是 Gemini 来帮我渲染图片。我可以给它户型图，给它一张房间的照片——虽然完全不是它最终应该有的样子——然后我可以给它几张不同东西的照片，接着我就能告诉它改变墙壁、或者改变家具布局、或者改变某些东西。它帮了我，而且速度相对很快。它帮我可视化这些东西……再说一次，我知道自己喜欢什么，但不知道怎么达到那个效果，所以至少我能判断喜不喜欢。这大概是一个很随机的用法，但挺好玩的。

Lenny Rachitsky： 我妻子做的完全一样的事。她不断给我发，“这是这块地毯在我们客厅里的样子。这是这个水景。“效果太好了，而且越来越好。就是那种，“哇，那真的是我们的房子，放着这块新地毯”，你只需要上传两张照片，然后说，“好的，这个在我们房间里会是什么样？”

Nicole Forsgren： 对，有好几次我被惊到了。机器绝对在监听我们。它给我生成了一个房间的效果图什么的，然后突然加了个狗窝，因为我养狗。我就想，“我根本没让你加这个，但确实，那大概就是应该放在这个房间里的狗窝的颜色和风格。”

Lenny Rachitsky： 说到这个，你试过这个用例吗——问 ChatGPT，“根据你对我了解的所有信息，生成一张你认为我家长什么样的图片。”

Nicole Forsgren： 我还没试过。

Lenny Rachitsky： 因为它有记忆功能，记得你聊过的所有内容，结果非常好笑。你一定要试试。

Nicole Forsgren： 好的，这已经加到我的待办列表上了。

Lenny Rachitsky： 好了。额外用例。Nicole，话说到这里，我们到了非常精彩的快问快答环节。我有五个问题。准备好了吗？

Nicole Forsgren： 太棒了，来吧。

快问快答

Lenny Rachitsky： 你最常推荐给别人的是哪两三本书？

Nicole Forsgren： Peter Attia 的《Outlive》非常棒。另一本可能相关的，我伤了背所以不太妙，Stuart McGill 的《Back Mechanic》简直不可思议。推荐给所有伤过腰的人。这本书是给普通人读的，帮助弄清楚怎么解决腰部问题。算是个比较冷门的书。我还想说我很喜欢《How Big Things Get Done》。作者名字我不会读。好像有一个是……有一个是斯堪的纳维亚人。它通过近现代历史拆解了一些非常大的项目，分析它们在哪里失败以及为什么。我觉得这对我们现在的思考很有启发，特别是在 AI 时代，我们几乎所有的软件系统都即将发生变化。那么我们该如何思考去应对本质上将是一个非常庞大的项目？还有，抱歉，我再额外加一本，Michael Lewis 的《The Undoing Project》。Matt Velloso 推荐给我的，太好看了。

Lenny Rachitsky： 对，我读过——

Nicole Forsgren： 我读到最后一句时惊呼出声。

Lenny Rachitsky： 哦。我当时就想，“什么？”

Nicole Forsgren： 我当时……对，完全没想到。

Lenny Rachitsky： 我读过那本书但完全不记得最后那句。天哪。好吧。下一个问题。你最近看过并喜欢的电影或电视剧是什么？

Nicole Forsgren： 我看《Love Is Blind》。如果一天结束想放空一下，《Love Is Blind》挺好玩的。

Lenny Rachitsky： 新一季出了。

Nicole Forsgren： 对，非常期待……还有《Shrinking》。你看过《Shrinking》吗？

Lenny Rachitsky： 没有。我好像开始看《The Therapist》了，看了一集。

Nicole Forsgren： 强烈推荐。很可爱。

最喜欢的产品

Lenny Rachitsky： 好。你最近有没有发现什么特别喜欢的产品？可以是 App、厨房小工具，什么都行。

Nicole Forsgren： 有的，Ninja Creami 算一个——

Lenny Rachitsky： 你上次说过这个吗？

Nicole Forsgren： 不确定，可能说过。我觉得没有。

Lenny Rachitsky： 有人提到过这个，我现在还记得。就是——

Nicole Forsgren： 真的很好用。

Lenny Rachitsky： 可以用它做冰淇淋之类的东西，对吧？

Nicole Forsgren： 对，基本上你可以把蛋白奶昔冻起来，然后它就能打成冰淇淋——非常好吃。还有一个是 Jura 咖啡机。我很喜欢好咖啡，但不太擅长做，所以按一下按钮就行，想喝什么都行，拿铁、卡布奇诺什么的都可以。挺有意思的。

Lenny Rachitsky： 不错。你有没有最喜欢的——

Nicole Forsgren： 就是糖和咖啡因，靠着撑过一整天。

Lenny Rachitsky： 这就是工程生产力入门第一课。

Nicole Forsgren： 没错。

人生座右铭

Lenny Rachitsky： 天哪。好，还有两个问题。你有没有一个特别喜欢的人生座右铭，在工作或生活中经常觉得有用，会反复想到？

Nicole Forsgren： 有的，有一个说过几次了，不是原话，更多是一种感觉——后见之明总是很清晰，但其实也很蠢。我觉得如果我们当时在已有信息的基础上，做出了能做的最好决定，那就这样了。如果你做了一个糟糕的决定，而且你明知故犯，当时就有信息却没有好好利用，那确实不好。我觉得我们对自己、对别人都不够宽容，因为我们事后总会发现更多信息。

Lenny Rachitsky： 说的太对了。

新角色与 Google

Lenny Rachitsky： 最后一个问题。我本来想问别的，但我们准备这次访谈的时候，你提到自己在 Google 有了新角色。简单聊聊吧，你在那里做什么，为什么加入 Google，大家需要了解什么。

Nicole Forsgren： 好的。我是开发者智能和核心开发团队的高级总监。这份工作非常令人兴奋，也非常有趣，正是因为我们聊的这些内容。它聚焦于 Google 及其所有产品和底层基础设施——如何改善开发者体验、开发者生产力、速度，所有我们讨论过的东西。然后，因为我是个数据人，我们怎么思考度量这件事，度量方式如何变化，反馈循环如何变化，如何持续改善体验，然后以一种有意义、有影响力、比以往更快的方式推动整个组织变革。

Lenny Rachitsky： Google 真会挑人，把 Nicole 挖到了。太赚了。我得赶紧买点 Google 股票。好，两个后续问题。大家在网上哪里能找到你、找到你的书？听众怎么帮到你？

Nicole Forsgren： 网上可以在 developerexperiencebook.com 找到这本书，我的个人网站是 nicolefv.com，LinkedIn 上偶尔也会出现。有时候那里挺乱的，我尽量从噪音中筛选有用的东西。欢迎大家去注册获取这本书和工作手册。工作手册是免费的。我很希望收到各种反馈，哪些有用，哪些没用。我也一直很喜欢听大家的故事。

Lenny Rachitsky： Nicole，非常感谢你来参加节目。

Nicole Forsgren： 谢谢你邀请我，Lenny。

Lenny Rachitsky： 我的荣幸。再次感谢。大家再见。

感谢大家的收听。如果你觉得这期内容有价值，可以在 Apple Podcasts、Spotify 或你喜欢的播客 App 上订阅节目。也请考虑给我们评分或写评论，这真的能帮助更多听众发现这个播客。你可以在 lennyspodcast.com 找到所有往期节目或了解更多关于节目的信息。下期再见。

术语表

原文	中文
A/B test	A/B 测试
Abi Noda	Abi Noda（DX 公司创始人）
Accelerate	《Accelerate》（开发者生产力领域著作）
AI ADE (AI Assisted Development Environment)	AI ADE（AI Assisted Development Environment，AI 辅助开发环境）
ARR	ARR（Annual Recurring Revenue，年度经常性收入）
Atlassian	Atlassian（企业软件公司，Jira 等产品母公司）
Back Mechanic	《Back Mechanic》（Stuart McGill 著腰部康复指南）
change fail rate	变更失败率
cognitive load	认知负荷
Dan Shipper	Dan Shipper（播客嘉宾、Every 公司 CEO）
deployment frequency	部署频率
DevEx	DevEx（Developer Experience，开发者体验）
Devin	Devin（AI 编码 agent 产品）
DORA	DORA（DevOps Research and Assessment，衡量软件交付速度与稳定性的指标体系）
DX	DX（开发者体验领域的创业公司）
EM (Engineering Manager)	EM（Engineering Manager，工程经理）
flaky tests	不稳定的测试
flow state	心流状态
Frictionless	《Frictionless》（Nicole Forsgren 即将出版的著作）
Gloria Mark	Gloria Mark（注意力与深度工作研究者）
go-to-market	推向市场
How Big Things Get Done	《How Big Things Get Done》（Bent Flyvbjerg 与 Dan Gardner 著，大型项目管理著作）
instrumentation	埋点
J-curve	J 曲线（一种先下降后上升的收益曲线形态）
Jura	Jura（瑞士高端全自动咖啡机品牌）
Karpathy	Karpathy（AI 领域知名研究者、前 Tesla AI 总监）
lead time	交付周期（从代码提交到部署的时间）
Love Is Blind	《Love Is Blind》（Netflix 恋爱真人秀）
low-hanging fruit	低垂的果实（容易实现的目标）
Matt Velloso	Matt Velloso（Nicole Forsgren 的朋友/同事）
MTTR	MTTR（Mean Time To Restore，平均恢复时间）
MVP	MVP（Minimum Viable Product，最小可行产品）
Ninja Creami	Ninja Creami（一款家用冰淇淋机品牌）
Outlive	《Outlive》（Peter Attia 著健康类畅销书）
paper cuts	”纸割伤”式问题（指那些虽小但令人烦恼的摩擦点）
PGM	PGM（Program Manager，项目经理）
provisioning environment	配置环境
Shrinking	《Shrinking》（Apple TV+ 喜剧剧集）
SPACE	SPACE（一种开发者体验衡量框架）
sunset	淘汰/日落（停止维护和运营）
telemetry	遥测（数据采集与监控能力）
The Undoing Project	《The Undoing Project》（Michael Lewis 著，关于行为经济学的非虚构作品）
toil	重复性劳动
total addressable market	总可寻址市场
TPM	TPM（Technical Program Manager，技术项目经理）
YOLO	凭冲动/随意（YOLO 原意为 You Only Live Once，此处指不经思考就行动）
