AI Claude Code DevOps Opinion

Help, Claude Code Is Smarter Than My Co-worker

2026-03-30

I am going to make a claim that sounds worse than it is, and then I am going to explain what it actually means. Here it is: Claude Code is better at certain parts of my job than some of the people I have worked with. If that sentence made you uncomfortable, good. It should. It made me uncomfortable when I first caught myself thinking it.

The interesting thing about that sentence is not the comparison to AI. It is what the comparison reveals about engineering as a profession. Claude Code did not get smarter this year. What changed is that Claude Code got good enough to expose a distinction that the industry has been quietly avoiding for a decade: the difference between engineers who understand what they are building and engineers who are very good at looking like they do. That distinction used to be invisible in most day-to-day work, because the output looked the same from the outside. AI makes it visible, because AI can produce the looking-like-it output cheaply, and the engineers who were surviving on that alone suddenly have nothing left to trade.

This post is about what Claude Code actually does well, what it cannot do, and what the gap between those two things says about the next five years of this profession. The goal is not to scare you. It is to help you figure out which side of the gap you have been building your career on. That answer matters more now than it did in 2024.

Where Claude Actually Wins

The clearest way to explain what Claude Code is good at is to describe a specific task I ran through it recently. I needed a Terraform module that wraps an Azure Verified Module for web apps, adds CIS compliance preconditions, and includes test files with expect_failures blocks so the preconditions stay enforced across refactors. That is the kind of work that usually requires reading the AVM interface documentation, finding the relevant CIS controls, translating them into Terraform precondition syntax, and then writing tests that prove the preconditions actually fire. A careful engineer takes a day.

Claude Code produced the module, the tests, and caught a variable naming inconsistency I would have shipped without noticing. Four minutes. The module compiled, the tests passed, and when I read the output line by line I could not find anything I would have done differently. That is not a story about AI being magic. It is a story about the task having a specific shape that AI handles well. The shape is this: the work requires cross-referencing multiple sources of documentation, applying well-known patterns, and being consistent about naming and structure. None of those parts require knowing anything about my specific environment. Every part of the work exists in public documentation somewhere, and the engineer is essentially doing translation and pattern application.

That is the category where Claude Code consistently performs. Boilerplate with patterns, cross-referencing multiple docs, catching inconsistencies that humans skim over. If the task has been done a thousand times before in public repositories, Claude will nail it, because it has seen the thousand previous examples and can produce the next one almost instantly. The speed is not the interesting part. The interesting part is that the output is often indistinguishable from what a careful human engineer would write for the same task, and in some cases better, because Claude reads every line where a human would start skimming around line two hundred.

This is the half of engineering work that has quietly gotten cheap. It used to take hours of careful attention. Now it takes a prompt and a review pass. The engineers who were fast at this kind of work had a real skill, and that skill is now a commodity.

Where Claude Falls Flat

Later that same week I ran into a problem Claude could not touch. An Azure DevOps pipeline was failing, but only on Tuesdays, and only on self-hosted agents in one specific network segment. Not every Tuesday. Not every self-hosted agent. Just this combination. I described the symptom to Claude and asked for debugging directions. It confidently gave me three suggestions. Increase the pipeline timeout. Check the agent pool for resource pressure. Look at the service connection configuration. All three were reasonable based on the symptom. All three were wrong, and not in a small way. They were fundamentally wrong about what kind of problem this was.

The actual cause was a weekly compliance scan that ran Tuesday mornings on the subnet that hosted the agent. The scan consumed enough bandwidth that pipelines hitting the Azure REST API from inside that subnet timed out on calls they would complete instantly on other days. You would only figure that out by knowing three things: that the scan existed, that the scan ran on Tuesday, and that the scan lived in a subnet that happened to also host this agent. None of those three facts are in any documentation or any public repository anywhere. They live in the heads of the security team and the network team, and they only connect if someone with access to both conversations correlates them.

Claude Code has no access to those conversations. That is not a bug in Claude Code. That is the structural limit of pattern matching. You cannot pattern match against information that was never written down. And in real operations work, a huge fraction of the information that matters for a given incident was never written down, because nobody had a reason to write it down until the moment you need it. By then, the people who knew are on different teams or different jobs, and the knowledge is sitting in slack threads and meeting notes and other people’s memories.

The category Claude is useless on is the category where the answer depends on knowing your specific environment, your specific history, your specific organizational context. Not “the answer is in documentation somewhere, you just need to find it”. The answer is not in documentation. It is in the accumulated context that only humans working in your organization hold.

Claude cannot help you debug the effect of the firewall rule Henk from networking added six months ago as a temporary fix and then forgot about. It cannot help you understand why the staging environment has a DNS override from 2024 that nobody remembers adding. It cannot tell you that the PM’s actual requirement is different from what they wrote in the ticket, because they never thought to explain the unwritten constraint that makes the ticket make sense to them.

This is the other half of engineering work, and it is the half that has gotten more valuable, not less, because AI amplifies the engineers who have this kind of context without being able to replace them. A senior engineer plus Claude Code is significantly more productive than a senior engineer alone. A junior engineer plus Claude Code, without the context, produces polished output for the wrong problem.

The Split Is What Matters

Take those two halves and put them next to each other. One half is cheap now. The other half is expensive. The cheap half was what a lot of engineers built their careers on without realizing it. They were fast at writing boilerplate, quick at finding the right Stack Overflow answer, reliable at turning a ticket into a working pull request without asking too many questions. Those are real skills and they used to be enough to get you promoted, because nobody else could do them faster than you. They are not enough anymore, because a language model does them in seconds for free.

The engineers who were surviving on that combination are in a harder spot than they know. Their output is suddenly a commodity. The work they were fastest at is now the work a four-dollar-a-month subscription handles better than they do. And the response from some of them, which I have seen in multiple teams now, is to use Claude Code to generate more of the cheap work at higher volume, which makes the problem worse rather than better. They are trying to outrun a treadmill that only speeds up.

The engineers who built their careers on context, judgment, and knowing where the bodies are buried are in a better spot than they realize. Their work is the half that did not get cheap. If anything it got harder to do well, because the volume of code being generated around them means more things to evaluate, more decisions to make, and more opportunities for subtle wrongness to slip into production. But they are also the only people who can evaluate, decide, and catch the wrongness. The market for their judgment just expanded.

I was in the first group for years. I remember what it felt like to be the fast-and-reasonable engineer who could ship anything the ticket asked for. It felt like competence, and it was competence, until the definition of competence quietly moved and nobody sent out a memo.

I only noticed because I started reaching for Claude Code more and more for exactly the kind of work I used to feel proud of. If a machine could do it in four minutes, the thing I was proud of was never the work. It was the speed and the polish and the consistency of the output. The actual thinking was somewhere else, and I had not been doing enough of it.

What to Do About It

If you are reading this and wondering which side of the split you have been working on, here is the honest diagnostic. Spend a week working with Claude Code for everything you reasonably can. At the end of the week, notice two things. First, how much of your normal workload felt noticeably faster. Second, how much of your normal workload felt unchanged. The ratio between those two is approximately the ratio between cheap work and expensive work in your current role.

For me that ratio came out to about seventy-thirty. Seventy percent of my week sped up meaningfully with AI in the loop. The other thirty percent did not move, because it was the part that depended on context Claude could not access. That thirty percent is where I now earn my salary in 2026, and it is the exact skill set I have written about before as slowly disappearing from the industry as engineers outsource their debugging reflexes to machines that do not have local context to debug with. Disappearing from the industry, but not disappearing from the value mix. Exactly the opposite.

If your ratio is closer to ninety-ten, where almost everything you do feels faster with Claude, that is worth paying attention to. It does not mean you are in trouble today. It means the work you are doing is work a language model can do, which means your employer is paying you for a skill that has a clear substitute. The next five years of your career should involve deliberately moving toward the thirty percent, not optimizing the ninety.

If your ratio is closer to fifty-fifty or worse, you are probably fine. The work that did not speed up is the work AI cannot replicate, and you are spending half your time on it, which is where the value is concentrated now.

The prediction I will put on this: in five years, the engineers who did well are the ones who used Claude Code to spend less time on the commodity half and more time on the context half, because the context half is where humans will still matter and where the 2026 version of Claude Code still guesses. The engineers who did badly are the ones who used Claude Code to spend more time on the commodity half and let the context half atrophy, because that is the path that feels productive in the short term and quietly hollow in the long term.

And yes, I am including myself in this assessment. I catch myself on the wrong path more often than I would like to admit. That is the honest part. Claude Code did not just make me faster. It made me look at my own work with a question I did not have a clean answer to: which half of my job would actually be hard to automate, and am I spending enough of my day there? The question is uncomfortable because the answer is not automatic. Which is the whole point. If the answer were automatic, I would not need to think about it, and thinking about it is exactly the work that Claude Code still cannot do for me.

The Non-Traditional Path to DevOps Engineering

Azure Infrastructure: Lessons from the Field That Microsoft Doesn't Teach You

← All posts