Back to archive

Engineering

Why Platform Teams Accidentally Become Gatekeepers

How internal platforms drift from leverage engines into approval queues, and how to pull them back.

Why Platform Teams Accidentally Become Gatekeepers

Platform teams are usually created to increase leverage. They centralize hard problems, reduce duplicated effort, and let product teams move with better defaults. Yet many platforms drift into gatekeeping. The team that was supposed to make delivery easier becomes the team everyone must wait for.

The drift is rarely malicious. It is usually a consequence of unclear product boundaries, risk ownership, and incentives.

The thesis

A platform becomes a gatekeeper when it owns decisions that should be encoded into self-service product, policy, or paved paths. The fix is not to make the platform team "more responsive." The fix is to remove them from low-value approval loops.

Responsiveness helps symptoms. Product thinking fixes the system.

The production pattern

At first, the platform team handles a shared problem. Maybe deployment, observability, identity, data access, build infrastructure, or service templates. Early users need help, so the platform team provides hands-on support. The platform improves. More teams adopt it.

Then adoption creates queues. Product teams need exceptions, configuration, capacity, permissions, migrations, debugging help, and roadmap commitments. The platform team becomes the route to production. It accumulates tribal knowledge and approval power. Engineers start asking, "Can platform do this for us?" instead of "How do we safely do this ourselves?"

The platform has won adoption and lost leverage.

The model

I use a gatekeeping diagnostic with five questions.

Can users complete the common path without talking to the platform team? If not, the platform is a service desk with software attached.

Are exceptions rare, named, and reviewed? If every serious use case needs an exception, the paved path is too narrow or the product model is wrong.

Does the platform encode policy or merely enforce it through humans? Human approval may be necessary for high-risk actions, but repeated approvals should become automated checks, quotas, templates, or delegated ownership.

Are platform priorities tied to user outcomes or internal neatness? A platform can be technically elegant while making product teams slower.

Does the platform make the safe path the easy path? If teams bypass it to move quickly, the platform has not earned authority.

These questions separate platform leverage from platform control.

Where this goes wrong

The counterpoint is that some gates are legitimate. Security boundaries, regulatory constraints, data access, production safety, and large cost commitments may require review. A completely self-service platform with no brakes can distribute risk faster than the organization can absorb it.

The issue is not the existence of gates. The issue is unnamed gates, inconsistent gates, and gates where the platform team substitutes judgment for missing product design.

Another failure mode is turning platform into an internal vendor with no architectural authority. Product teams get what they ask for, the platform becomes a feature factory, and the shared system fragments under customization. A healthy platform is opinionated. It should just be opinionated through defaults, contracts, and clear extension points rather than constant human mediation.

What I do now

When I review platform work, I ask what user action becomes self-service after the project lands. If the answer is "platform can do it faster for them," that is support efficiency, not platform leverage.

I ask for a paved path, an escape hatch, and an ownership model. The paved path is the default workflow most teams should use. The escape hatch is how advanced teams handle unusual needs without forking reality. The ownership model says who supports what when something breaks.

I also push for product metrics that reflect independence: time to create a service, time to restore a failed deploy, percentage of changes completed without platform intervention, number of active exceptions, age of exceptions, and support requests by category. Exact numbers matter less than trend and classification.

The best platform teams I have seen are comfortable deleting their own tickets by turning them into product capabilities. They protect shared standards, but they do not confuse being necessary with being effective.

I pay special attention to the language around platform work. If every roadmap item is described as "support team X," the platform is drifting toward bespoke service. If roadmap items are described as "make this class of action safe for any qualified owner," the platform is building leverage. The difference seems semantic, but it changes prioritization. One creates a queue. The other creates capability.

There is also a trust loop. Product teams trust platforms that let them move without surprises. Platform teams trust product teams that stay inside clear contracts. Gatekeeping grows when either side does not trust the other to manage risk. A good platform reduces the amount of trust required by making safe behavior obvious, observable, and enforceable.

Closing takeaway

A platform is working when it turns expert judgment into safe self-service defaults, not when every important decision has to pass through the platform team.