Platform Engineering | DevSecOps

Operational Guardrails for Multi-Tenant PostgreSQL

2026-01-06T00:00:00-08:00

Context

Running PostgreSQL in a multi-tenant configuration is a powerful cost-optimization strategy—especially in environments where dozens or hundreds of isolated workloads coexist. But as I wrote previously in Operational Realities of Running PostgreSQL, security isolation is only half the story.

PostgreSQL is extremely stable when respected, but it has sharp edges when pushed into resource contention. Multi-tenant architectures amplify those failure modes. Even with perfect security isolation (role-per-tenant, database-per-tenant, schema hardening), tenants still share a single set of physical resources:

CPU
memory
IOPS
WAL throughput
background workers
connection slots

If one tenant misbehaves, it can degrade the experience for all others—even without violating a single privilege boundary.

This post explains the operational guardrails required to ensure safe, predictable, and compliant multi-tenant PostgreSQL deployments. All guardrails described here are fully implemented and verifiable in the accompanying project:

Project: Multi-Tenant PostgreSQL Security & Operational Isolation

Why Operational Guardrails Matter

Multi-tenant PostgreSQL is only viable when both of these are true:

1. Security boundaries must be provable

No tenant should ever be able to read or affect another tenant’s data.

2. Operational behavior must be controlled

No tenant should be able to destabilize the shared database server.

The first requirement is handled by:

database-per-tenant
role-per-tenant
schema-per-tenant
hardened public schema
restricted search_path
default privilege hardening
extension restrictions
negative security tests

The second requirement requires operational guardrails, which this post covers in detail.

Both sets of controls are implemented and actively tested in the project linked above.

Operational Risks in Multi-Tenant PostgreSQL

1. Connection Exhaustion — The Classic Failure Mode

Every PostgreSQL instance has a global connection budget (max_connections). All tenants draw from this shared pool.

A single tenant with:

an oversized ORM pool
idle-in-transaction leaks
a bug sending excessive connections

…can exhaust all connections and knock the instance offline.

Guardrail: Per-role connection limits

ALTER ROLE tenant_a_app CONNECTION LIMIT 2;

Small limits dramatically reduce blast radius.
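One way to sanity-check a set of limits is to confirm that their sum still leaves headroom under max_connections. The sketch below is only an illustration: the role names, the limits, and the reserved-slot count are assumptions, not values taken from the project.

```python
def connection_headroom(max_connections, role_limits, reserved=3):
    """Return the slots left if every tenant role hits its CONNECTION LIMIT at once.

    role_limits maps role name -> CONNECTION LIMIT (hypothetical values).
    reserved models slots held back for superusers (assumed to be 3 here).
    """
    if any(limit < 0 for limit in role_limits.values()):
        # -1 means "no limit" in PostgreSQL, so the worst case is unbounded.
        raise ValueError("every tenant role needs an explicit limit")
    return max_connections - reserved - sum(role_limits.values())


limits = {"tenant_a_app": 2, "tenant_b_app": 2, "tenant_c_app": 5}
headroom = connection_headroom(100, limits)
print(headroom)
```

A negative result means the tenants could collectively exhaust the instance even though each one stays within its own limit.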

In the project

The test suite opens multiple concurrent connections and confirms that the first connection beyond the limit is rejected.

2. Runaway or Long-Running Queries

A single long query—or a stuck transaction—can tie up CPU, I/O, locks, and memory.

Guardrail: Per-role timeouts

ALTER ROLE tenant_a_app SET statement_timeout = '3s';
ALTER ROLE tenant_a_app SET lock_timeout = '2s';
ALTER ROLE tenant_a_app SET idle_in_transaction_session_timeout = '10s';

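These serve as circuit breakers against runaway behavior. Note that role-level settings are session defaults: a session can still override them with SET, so they protect against accidents rather than against a tenant that deliberately changes them.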
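When many tenants need the same guardrails, the statements can be generated rather than typed by hand. This is a minimal sketch, assuming simple lowercase role names; the helper name and the default values (copied from the examples above) are illustrative, not the project's actual tooling.

```python
import re

# Hypothetical defaults mirroring the values shown above.
DEFAULT_GUARDRAILS = {
    "statement_timeout": "3s",
    "lock_timeout": "2s",
    "idle_in_transaction_session_timeout": "10s",
}


def guardrail_sql(role, connection_limit=2, settings=DEFAULT_GUARDRAILS):
    """Render the ALTER ROLE statements for one tenant role."""
    # Only accept simple identifiers so the role name can be used unquoted.
    if not re.fullmatch(r"[a-z_][a-z0-9_]*", role):
        raise ValueError(f"unsupported role name: {role!r}")
    statements = [f"ALTER ROLE {role} CONNECTION LIMIT {int(connection_limit)};"]
    for name, value in settings.items():
        statements.append(f"ALTER ROLE {role} SET {name} = '{value}';")
    return statements


for line in guardrail_sql("tenant_b_app"):
    print(line)
```

Generating the statements from one table of defaults keeps every tenant on the same baseline and makes deviations easy to spot in review.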

In the project

SELECT pg_sleep(10) is used to confirm the timeout fires predictably.

3. Lock Contention → Autovacuum Starvation → Bloat

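Long-running transactions, and the locks they hold, stop autovacuum from doing its job: an open transaction holds back the cleanup horizon so dead tuples cannot be removed, and conflicting locks can make autovacuum skip or abandon a table. The result: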

rising dead tuples
bloated indexes
WAL amplification
I/O latency spikes

In multi-tenant environments, all tenants suffer.

Guardrails

enforce idle transaction timeouts
surface lock metrics
alert on autovacuum lag

These are documented operational expectations for production RDS deployments.
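As a rough illustration of what alerting on autovacuum lag can look like, the sketch below flags tables by dead-tuple share. It assumes rows shaped like the pg_stat_user_tables view (relname, n_live_tup, n_dead_tup); the thresholds and sample numbers are made up for the example.

```python
def tables_needing_attention(rows, max_dead_ratio=0.2, min_dead=1000):
    """Flag tables whose dead-tuple share exceeds a threshold.

    rows: dicts with relname, n_live_tup, n_dead_tup, shaped like
    pg_stat_user_tables. Thresholds are illustrative, not recommendations.
    """
    flagged = []
    for row in rows:
        dead, live = row["n_dead_tup"], row["n_live_tup"]
        total = dead + live
        if dead >= min_dead and total > 0 and dead / total > max_dead_ratio:
            flagged.append((row["relname"], round(dead / total, 2)))
    return flagged


# Hypothetical sample data standing in for a real query result.
sample = [
    {"relname": "orders", "n_live_tup": 90_000, "n_dead_tup": 60_000},
    {"relname": "users", "n_live_tup": 10_000, "n_dead_tup": 200},
]
print(tables_needing_attention(sample))
```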

4. Shared WAL, Checkpoints, and I/O

PostgreSQL’s background processes operate at the instance level:

checkpointer
WAL writer
autovacuum workers

A high-churn tenant can degrade performance for everyone.

Guardrails

WAL monitoring
Instance sizing
Enforced workload limits

5. Backups and Snapshots Include All Tenants

On AWS RDS, a snapshot contains all tenant databases.

Guardrails

strict IAM permissions for snapshot creation/restoration
KMS key policy constraints
auditing of all snapshot actions

This is essential for IL4 workloads.

Guardrails Implemented in the Project

Security Controls

role-per-tenant
database-per-tenant
schema-per-tenant
hardened public schema
restricted search_path
enforced default privileges
blocked extension creation
negative cross-tenant isolation tests

Operational Controls

per-role connection limits
per-role statement timeouts
per-role lock timeouts
per-role idle-in-transaction timeouts

Automated Tests

connection-limit exceedance validated via concurrency
long-query timeout enforcement
concurrency behaviors tested safely and repeatably

Compliance Documentation

NIST 800-53 mapping
FedRAMP Moderate alignment
DoD IL2/IL4 considerations
pgAudit integration strategy

When Not To Use Multi-Tenant PostgreSQL

Avoid multi-tenant PostgreSQL when:

tenants require strict performance isolation
tenants need independent backup/restore capabilities
tenants have materially different compliance requirements
tenant load is unpredictable or unbounded
applications cannot follow connection pool discipline

These constraints are architectural realities, not limitations of PostgreSQL itself.

Conclusion

Multi-tenant PostgreSQL can be secure, cost-effective, and IL4-aligned — but only when operational guardrails are enforced. These include:

per-tenant connection limits
per-tenant timeouts
lock and idle-in-transaction protection
shared resource awareness (WAL, checkpoints, autovacuum)
auditable configuration

The accompanying project provides a complete, reproducible reference architecture.

Upcoming work: Terraform integration using the PostgreSQL provider, RDS automation, and CI/CD validation pipelines.

Recovering from Toolchain Drift on macOS

2024-03-11T01:00:00-07:00

Context

Modern development environments move fast.

Package managers update. Libraries deprecate APIs. Defaults change. Previously working builds suddenly fail.

This post documents a real case of toolchain drift on macOS involving Homebrew and OpenSSL, and—more importantly—how to reason through recovery when the ecosystem moves out from under you.

This is not a recommendation to stay on old versions indefinitely.
It’s about getting unstuck responsibly.

What Toolchain Drift Looks Like

Toolchain drift usually presents as:

build failures after an unrelated update
cryptic linker or compilation errors
software that worked yesterday but not today
incompatibilities between system libraries and expected versions

In this case, the symptoms appeared after routine updates to Homebrew and OpenSSL.

Nothing in the application code changed.

The environment did.

Why This Happens

On macOS, Homebrew:

aggressively tracks upstream releases
removes or unlinks deprecated versions
updates formulae with breaking changes

OpenSSL:

has major version boundaries
breaks ABI compatibility across those boundaries (for example, between 1.1 and 3.x)
is depended on implicitly by many tools

When those two collide, downstream tooling often breaks first.

This is not negligence. It’s the cost of a fast-moving ecosystem.

The Immediate Constraint

At the moment of failure:

the project needed to build and run
rewriting dependencies was not an option
upgrading the application code was non-trivial
time mattered

The goal was restoration of functionality, not architectural perfection.

The Pragmatic Recovery Strategy

The chosen approach was to:

temporarily switch to older, compatible versions
restore a known-good toolchain
unblock work
document the decision

This is a containment strategy, not a permanent fix.

Reverting Homebrew and OpenSSL Versions

The recovery involved:

installing an older OpenSSL version
ensuring it was correctly linked
preventing accidental upgrades during the recovery window

Commands like the following were used during diagnosis and recovery:

brew info openssl
brew install openssl@1.1
brew unlink openssl
brew link openssl@1.1 --force

The exact commands matter less than the intent:

Restore the environment the software was built against.

Once the expected library versions were present again, the failures disappeared. Nothing about the application itself had changed. The mismatch between the application’s expectations and the system-provided libraries was the entire problem.

Why This Worked

Most native tooling is sensitive to ABI and linking changes. Tools often assume specific library versions, paths, or symbols will exist. When those assumptions are violated, failures surface in ways that look unrelated to the actual cause.

By reverting to a known-good toolchain, the assumptions held again. The system returned to a stable state without modifying application code.

This is why toolchain drift so often manifests as “random” build failures. The failure is deterministic, but the dependency chain is opaque.

Risks and Tradeoffs

Downgrading or pinning dependencies is not without cost.

It can:

delay security updates
make future upgrades more difficult
introduce divergence between machines
hide underlying upgrade work that still needs to happen

This approach should always be treated as temporary. It is a recovery technique, not a long-term strategy.

The important part is not the downgrade itself, but the discipline around it: documenting the change, understanding why it was necessary, and planning how to move forward.

What I’d Do Differently Next Time

With more time and less pressure, better long-term solutions include:

containerizing the build environment
explicitly versioning toolchains
documenting expected dependency versions
avoiding reliance on system-wide libraries
making upgrades deliberate rather than incidental

The goal is not to freeze the environment forever, but to control when and how it changes.
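Documenting expected versions is more useful when something checks them. The sketch below uses Python's own OpenSSL linkage purely as an illustration of the idea; the expected version is a hypothetical value, and a real project would check whichever toolchain it actually builds against.

```python
import ssl


def openssl_matches(expected_major_minor, version_info=None):
    """Compare the OpenSSL this Python is linked against with an expected (major, minor)."""
    if version_info is None:
        version_info = ssl.OPENSSL_VERSION_INFO
    return tuple(version_info[:2]) == tuple(expected_major_minor)


EXPECTED = (1, 1)  # hypothetical: the version the project was built against
print(ssl.OPENSSL_VERSION)
if not openssl_matches(EXPECTED):
    print(f"warning: expected OpenSSL {EXPECTED[0]}.{EXPECTED[1]}.x")
```

A check like this, run at the start of a build, turns a cryptic linker failure into an explicit message about what drifted.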

Practical Guidance

When toolchain drift causes failures:

identify what actually changed
avoid trial-and-error fixes
restore a known-good state first
document the deviation
plan a proper upgrade path

Stability first. Improvements second.

Closing Thought

Fast-moving ecosystems are powerful, but unforgiving.

Toolchain drift is not a personal failure or a lack of skill. It is a reminder that reproducibility is something you must design for.

Sometimes the correct move is not forward.

It is back — briefly, intentionally, and with full awareness of why.

Operational Realities of Running PostgreSQL

2024-03-08T00:00:00-08:00

Context

PostgreSQL is often treated like a dependency:

install it
point an app at it
scale when it gets slow

In reality, PostgreSQL is a stateful system with strong opinions about memory, disk, and durability. When those expectations aren’t met, performance problems and outages tend to look mysterious.

This post captures practical realities of running PostgreSQL in production—especially in containerized and Kubernetes environments—without turning into a tuning checklist.

PostgreSQL Is a System, Not a Library

PostgreSQL:

runs multiple cooperating processes
manages its own memory aggressively
assumes durable storage
trades performance for correctness by default

You don’t “embed” Postgres. You host it.

Treating it like a stateless service almost always leads to surprises.

Memory: Connections Matter More Than Queries

One of the most common misconceptions is that PostgreSQL memory usage scales primarily with data size or query complexity.

In practice, it scales with connections.

Each connection:

consumes memory
is served by its own backend process
increases scheduling and locking overhead

A large number of idle connections can be just as harmful as active ones.

This is why:

connection pooling matters
unbounded client connections are dangerous
“it works locally” doesn’t translate to production
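The arithmetic behind this is simple but easy to skip. As a sketch with hypothetical services and pool sizes, the worst case is every replica filling its pool at the same time:

```python
def peak_connections(replicas, pool_size, overflow=0):
    """Worst-case connections one service can open: every replica fills its pool."""
    return replicas * (pool_size + overflow)


# Hypothetical services: (replicas, pool size per replica, allowed overflow)
services = {"api": (12, 10, 5), "worker": (4, 5, 0)}
total = sum(peak_connections(*spec) for spec in services.values())
max_connections = 100  # PostgreSQL's default
print(total, "of", max_connections)
```

In this made-up case the applications can ask for twice what the server will accept, which is exactly the gap a connection pooler is meant to close.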

CPU Is Rarely the First Bottleneck

When PostgreSQL is slow, adding CPU is often the first instinct.

In reality, PostgreSQL performance issues are more commonly caused by:

disk I/O latency
WAL contention
excessive connections
lock contention
memory pressure

CPU becomes a bottleneck after those are addressed.

Disk and WAL Are Central to Performance

PostgreSQL’s durability guarantees rely heavily on the Write-Ahead Log (WAL).

This means:

every commit waits for its WAL records to be flushed to disk (with default settings)
latency matters more than raw throughput
storage performance directly affects commit speed

Slow or inconsistent disks show up as:

slow transactions
replication lag
unexplained query latency

This is especially important in virtualized or networked storage environments.

Containers Don’t Change the Fundamentals

Running PostgreSQL in a container does not change how PostgreSQL works.

It still:

writes to disk
uses shared memory
expects predictable I/O
assumes stable filesystem semantics

Common container mistakes include:

ephemeral storage for data directories
ignoring filesystem sync behavior
assuming resource limits replace tuning
placing Postgres on storage designed for stateless workloads

Containers change packaging, not physics.

Kubernetes Adds Indirection, Not Immunity

Kubernetes can help manage PostgreSQL, but it does not remove operational requirements.

In Kubernetes:

PersistentVolumes define durability
StorageClasses define behavior
the underlying storage still matters
noisy neighbors still exist

If the storage layer is slow or misconfigured, PostgreSQL will faithfully surface those problems.

Defaults Are Conservative (for a Reason)

PostgreSQL defaults prioritize:

correctness
durability
broad compatibility

They are intentionally conservative.

This is good for safety, but it means:

defaults are rarely optimal for high-throughput systems
tuning should be intentional and informed
copying random config snippets is risky

Understanding why a setting exists matters more than memorizing values.

Monitoring Tells the Truth

PostgreSQL is verbose when asked correctly.

Key signals include:

connection counts
transaction duration
lock waits
WAL write latency
disk I/O wait times

When Postgres is unhealthy, it usually tells you—just not always in the place people look first.

Common Anti-Patterns

A few patterns show up repeatedly in troubled deployments:

treating Postgres like stateless infrastructure
scaling application replicas without considering DB impact
ignoring connection pooling
placing data on slow or inconsistent storage
assuming Kubernetes abstracts database concerns away

None of these fail immediately. They fail under load.

Practical Guidance

plan connections before planning CPU
treat storage latency as a first-class concern
assume containers do not change database fundamentals
understand WAL behavior before tuning performance
observe before optimizing

PostgreSQL rewards understanding. It punishes assumptions.

Closing Thought

Most PostgreSQL outages aren’t caused by bugs.

They’re caused by mismatches between:

what PostgreSQL expects
and what the platform provides

Once you treat Postgres as a system with real physical constraints, its behavior becomes predictable—and manageable.

Kubernetes ServiceAccount Tokens and CI/CD Authentication

2024-03-05T00:00:00-08:00

Context

CI/CD systems frequently need non-interactive access to Kubernetes clusters.

Historically, this was straightforward:

create a ServiceAccount
bind it with RBAC
extract a token
embed it in a kubeconfig
deploy

In Kubernetes 1.24 and later, that workflow quietly broke.

How CI/CD Auth to Kubernetes Works

In a CI/CD environment:

the job runs outside the cluster
it uses a kubeconfig file
the kubeconfig authenticates as a ServiceAccount
Kubernetes evaluates RBAC rules for that identity

Traditionally, this has relied on a long-lived credential.

How ServiceAccount Tokens Used to Work

Before Kubernetes 1.24:

ServiceAccounts automatically created token Secrets
tokens were long-lived
stored as Kubernetes Secrets
easy to extract for CI usage

Many pipelines relied on this behavior.

What Changed in Kubernetes 1.24

Starting in Kubernetes 1.24:

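token Secrets are no longer auto-created
pods receive bound ServiceAccount tokens through projected volumes
those tokens are short-lived and rotated automatically
they are not stored as Secrets
tokens for use outside a pod must be requested explicitly (for example with kubectl create token), and they also expire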

This improves security but breaks external CI workflows.

Why CI/CD Pipelines Break

CI systems:

run outside the cluster
cannot receive projected tokens
have no built-in way to refresh short-lived credentials

The ServiceAccount exists. RBAC is correct. But no token exists to authenticate with.

Detecting the Issue

You can confirm this by inspecting the ServiceAccount:

kubectl get serviceaccount deployer-service-account -o jsonpath='{.secrets}'

If the output is empty, the ServiceAccount references no token Secret.

Bound Tokens vs Secret Tokens

Bound tokens:

short-lived
pod-scoped
secure by default
unsuitable for external CI

Secret-based tokens:

long-lived
manually created
usable by CI systems
require explicit lifecycle management

Creating a Token Secret Explicitly

When CI access is required, a token Secret can be created manually:

kubectl apply -f - <



Kubernetes will populate the token automatically.
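As a rough illustration of what such a manifest generally contains, the sketch below builds one as JSON. The Secret type and annotation key are the standard ones for ServiceAccount tokens; the Secret name and namespace are hypothetical, and the ServiceAccount name is the one used in the detection example.

```python
import json


def token_secret_manifest(service_account, secret_name, namespace="default"):
    """Build a Secret manifest that asks Kubernetes to mint a long-lived token."""
    return {
        "apiVersion": "v1",
        "kind": "Secret",
        "type": "kubernetes.io/service-account-token",
        "metadata": {
            "name": secret_name,
            "namespace": namespace,
            "annotations": {
                # Links the Secret to the ServiceAccount it authenticates as.
                "kubernetes.io/service-account.name": service_account,
            },
        },
    }


manifest = token_secret_manifest(
    "deployer-service-account",
    "deployer-service-account-token",  # hypothetical Secret name
)
print(json.dumps(manifest, indent=2))
```

Because kubectl accepts JSON as well as YAML, the printed document can be piped to kubectl apply -f - unchanged.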



Security Implications

Manually created tokens:


  reintroduce long-lived credentials
  require rotation discipline
  increase blast radius if leaked


They should be treated as exceptions, not defaults.



Modern Alternatives

More robust approaches include:


  OIDC federation
  cloud IAM integrations
  exec-based kubeconfig plugins
  workload identity systems


These eliminate static tokens entirely.



Practical Guidance


  do not assume ServiceAccounts have tokens
  distinguish authentication from authorization
  validate permissions with kubectl auth can-i
  treat long-lived tokens as transitional


The system did not break.
The defaults changed.
The model finally caught up with security reality.



Creating and Understanding kubeconfig Files

2024-03-02T00:00:00-08:00

Context

kubectl feels simple once it works.

But when access breaks—or when you need to create access from scratch—the kubeconfig file suddenly becomes mysterious. Tokens, certificates, contexts, users, clusters: everything is there, but rarely explained clearly.

This post explains what a kubeconfig actually is, how it’s structured, and how to create or modify one intentionally instead of relying on magic.



What a kubeconfig Really Is

A kubeconfig file is not itself a credential, although it often embeds or points to one.

It is a configuration document that tells kubectl:


  which cluster to talk to
  how to authenticate
  which identity to use
  which context ties those together


Think of it as a connection profile, not a secret store.



The Four Core Concepts

Every kubeconfig is built from four pieces.

Cluster

Defines:


  API server endpoint
  CA certificate used to trust the server


User

Defines:


  how authentication happens
  certificates, tokens, or exec plugins


Context

Binds:


  one cluster
  one user
  optionally a namespace


Current Context

Tells kubectl which context to use by default.

Nothing works unless all four line up.
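To make "line up" concrete, here is a small sketch that models a kubeconfig as a Python dict and resolves the current context the way the four pieces relate to each other. The layout follows the kubeconfig file structure; the names and the helper function are placeholders for illustration only.

```python
def resolve_current_context(config):
    """Return (cluster, user, namespace) for current-context, or raise if pieces don't line up."""
    contexts = {c["name"]: c["context"] for c in config.get("contexts", [])}
    clusters = {c["name"] for c in config.get("clusters", [])}
    users = {u["name"] for u in config.get("users", [])}

    name = config.get("current-context")
    if name not in contexts:
        raise KeyError(f"current-context {name!r} is not defined")
    ctx = contexts[name]
    if ctx["cluster"] not in clusters:
        raise KeyError(f"unknown cluster {ctx['cluster']!r}")
    if ctx["user"] not in users:
        raise KeyError(f"unknown user {ctx['user']!r}")
    return ctx["cluster"], ctx["user"], ctx.get("namespace", "default")


# Same layout as a kubeconfig file, written as a Python dict with placeholder names.
kubeconfig = {
    "apiVersion": "v1",
    "kind": "Config",
    "clusters": [{"name": "example-cluster",
                  "cluster": {"server": "https://api.example.internal:6443"}}],
    "users": [{"name": "example-user", "user": {}}],
    "contexts": [{"name": "example-context",
                  "context": {"cluster": "example-cluster", "user": "example-user"}}],
    "current-context": "example-context",
}
print(resolve_current_context(kubeconfig))
```

A context that names a missing cluster or user fails here for the same reason kubectl would: the binding points at something that is not defined.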



Inspecting an Existing kubeconfig

To see what kubectl is currently using:

kubectl config view


To see only the active context:

kubectl config current-context


To list all contexts:

kubectl config get-contexts


These commands are often enough to diagnose access confusion.



Creating a kubeconfig Manually (Step by Step)

Creating a kubeconfig intentionally makes the model click.

Step 1: Define the Cluster

kubectl config set-cluster example-cluster \
  --server=https://api.example.internal:6443 \
  --certificate-authority=/path/to/ca.crt


This tells kubectl where the API server is and how to trust it.



Step 2: Define the User

Example using a client certificate:

kubectl config set-credentials example-user \
  --client-certificate=/path/to/client.crt \
  --client-key=/path/to/client.key


Other authentication methods exist, but the structure is the same.



Step 3: Create a Context

kubectl config set-context example-context \
  --cluster=example-cluster \
  --user=example-user \
  --namespace=default


This binds identity to destination.



Step 4: Activate the Context

kubectl config use-context example-context


At this point, kubectl is fully configured.



Where kubeconfig Files Live

By default, kubectl looks for:

~/.kube/config


You can override this with:

KUBECONFIG=/path/to/config kubectl get pods


Multiple kubeconfig files can be merged automatically by listing them in the KUBECONFIG environment variable, separated by colons on Linux and macOS.
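The lookup and merge behavior can be sketched in a few lines. This is a simplified model, not kubectl's implementation: it assumes the documented rule that the first file to define a named entry wins, and both helper names are hypothetical.

```python
import os


def kubeconfig_paths(env=None):
    """List the files kubectl would consult, in precedence order."""
    env = os.environ if env is None else env
    raw = env.get("KUBECONFIG", "")
    paths = [p for p in raw.split(os.pathsep) if p]
    return paths or [os.path.join(os.path.expanduser("~"), ".kube", "config")]


def merge_named(entries_per_file):
    """Merge lists of {'name': ...} entries; the first file to define a name wins."""
    merged = {}
    for entries in entries_per_file:
        for entry in entries:
            merged.setdefault(entry["name"], entry)
    return list(merged.values())


print(kubeconfig_paths({"KUBECONFIG": os.pathsep.join(["/tmp/dev.yaml", "/tmp/prod.yaml"])}))
```

The first-wins rule is why two files that both define a context called "default" can silently point kubectl at the cluster you did not intend.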



Why Contexts Matter More Than Credentials

Most access mistakes are context mistakes, not auth failures.

Common issues include:


  talking to the wrong cluster
  using the wrong namespace
  reusing similarly named contexts
  assuming the current context is what you think it is


Always check the context before acting.



How This Relates to RBAC

A kubeconfig:


  defines how you authenticate
  does not define what you can do


Authorization is enforced by:


  Kubernetes RBAC
  Roles and RoleBindings
  ClusterRoles and ClusterRoleBindings


If access is denied, the kubeconfig is usually fine—the permissions are not.



Verifying Access with kubectl auth can-i

Once authentication is working, the next question is authorization.

kubectl auth can-i answers a simple but critical question:


  “Is this identity allowed to do this action?”


Basic check

kubectl auth can-i get pods


This checks whether the current context’s user is allowed to get pods in the current namespace.

Explicit namespace check

kubectl auth can-i create deployments -n example-namespace


This avoids false assumptions caused by the active namespace.

Cluster-scoped permissions

kubectl auth can-i list nodes


This verifies permissions that are not namespace-bound.

Why This Command Is So Valuable

kubectl auth can-i is the fastest way to distinguish between:


  authentication problems (kubeconfig, credentials)
  authorization problems (RBAC)


If the command returns no, authentication succeeded but permissions are insufficient.

If the command errors, the kubeconfig itself may be misconfigured.
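In automation, the same three-way distinction can be made from the command's output. The sketch below assumes you have already captured stdout from kubectl auth can-i; the function name and the category labels are illustrative.

```python
def classify_can_i(stdout):
    """Interpret the captured stdout of `kubectl auth can-i`.

    Returns "allowed", "denied" (an RBAC issue), or "error"
    (likely kubeconfig, credentials, or connectivity).
    """
    words = stdout.strip().lower().split()
    first = words[0] if words else ""
    if first == "yes":
        return "allowed"
    if first == "no":
        return "denied"
    # Anything else means the question never got a yes/no answer.
    return "error"


print(classify_can_i("no\n"))
```

Keeping "denied" and "error" separate tells a pipeline whether to look at RoleBindings or at the kubeconfig.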

Make It a Habit

Before debugging:


  forbidden errors
  CI/CD access failures
  “works for me” discrepancies
  broken automation


Run:

kubectl auth can-i VERB RESOURCE


It turns RBAC from guesswork into something concrete.



Practical Guidance


  treat kubeconfig as connection metadata
  keep contexts clearly named
  never assume the current context
  avoid sharing kubeconfig files directly
  regenerate credentials instead of copying them


Understanding kubeconfig reduces both mistakes and anxiety.



Why This Mental Model Scales

Once this clicks:


  EKS/GKE/AKS configs make sense
  CI/CD kubeconfigs are less scary
  access rotation becomes manageable
  multi-cluster workflows are predictable


The file didn’t change—your understanding did.


Understanding Byte Size Units (Without Overthinking Them)

2024-02-28T00:00:00-08:00

Context

Most engineers know that there’s a difference between decimal and binary byte units.

Fewer engineers can confidently say:


  which one a given system is using
  when the distinction matters
  when it’s safe to ignore


This post explains byte size units in the way that’s actually useful in practice—without turning it into a standards lecture.



The Two Systems You’ll Encounter

There are two byte size systems in common use:

Decimal (Base-10)

Used primarily for:


  disk marketing
  network throughput
  vendor specifications


1 KB = 1,000 bytes
1 MB = 1,000,000 bytes
1 GB = 1,000,000,000 bytes
1 TB = 1,000,000,000,000 bytes


These scale cleanly by powers of 10.



Binary (Base-2)

Used primarily by:


  operating systems
  memory reporting
  filesystems
  low-level tooling


1 KiB = 1,024 bytes
1 MiB = 1,048,576 bytes
1 GiB = 1,073,741,824 bytes
1 TiB = 1,099,511,627,776 bytes


These scale by powers of 2.



Why the Names Look So Similar

The confusion comes from history.

For years, binary quantities were labeled using decimal names:


  “KB” meant 1024 bytes
  “MB” meant 1024² bytes


That shorthand stuck—long after it became misleading.

The IEC standard introduced:


  KiB, MiB, GiB, TiB


Not to complicate things—but to be precise.



Where This Actually Matters

In practice, you’ll most often see:


  Disks advertised in GB/TB (decimal)
  Many operating systems and tools reporting GiB/TiB (binary), sometimes still labeled GB/TB
  Memory measured in GiB
  Network speeds measured in Gb/s (decimal bits)


This is why a “1 TB disk” often doesn’t show up as “1 TB” in your OS.

Nothing is missing. Nothing is broken.

The units changed.



A Practical Example

A disk advertised as 1 TB contains:

1,000,000,000,000 bytes


Your OS reports in GiB:

1,000,000,000,000 ÷ 1,073,741,824 ≈ 931 GiB


That ~7% difference is expected.

It’s not overhead. It’s arithmetic.
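The same arithmetic as a short sketch, in case you want to check other sizes. The constant and function names are chosen for the example.

```python
GIB = 2 ** 30  # 1,073,741,824 bytes
TB = 10 ** 12  # 1,000,000,000,000 bytes


def decimal_to_gib(n_bytes):
    """Express a byte count in GiB."""
    return n_bytes / GIB


disk_gib = decimal_to_gib(1 * TB)
print(f"{disk_gib:.1f} GiB")              # → 931.3 GiB
print(f"{(1 - 10**9 / GIB) * 100:.1f}%")  # → 6.9%
```

The second line is the gap between one GB and one GiB, which is where the roughly 7% figure comes from.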



When You Should Care

You should pay attention to units when:


  capacity planning
  comparing vendor claims
  sizing storage or memory limits
  troubleshooting “missing” space
  interpreting monitoring metrics


This is especially true in:


  Kubernetes resource limits
  cloud storage pricing
  filesystem usage reports




When You Can Mostly Ignore It

You can often ignore the distinction when:


  working at small scales
  eyeballing approximate usage
  doing relative comparisons within the same system


Just don’t mix unit systems mid-calculation.



Practical Guidance

A simple rule of thumb:


  If it’s hardware, bandwidth, or marketing → decimal
  If it’s an OS, memory, or filesystem → binary


When precision matters, check the unit label explicitly.

If the tool says GiB, believe it.



Why This Is Still Worth Knowing

This confusion persists because:


  both systems are valid
  both are widely used
  tools are inconsistent about labeling


Understanding the distinction once prevents years of second-guessing.

It’s a small mental model with a long shelf life.


Installing the NFS Subdir External Provisioner with Helm

2024-02-26T00:00:00-08:00

Context

Understanding how Kubernetes storage works is one thing.

Actually enabling that capability in a cluster is another.

If you want dynamic NFS-backed Persistent Volumes, Kubernetes needs a component that can:


  watch for PersistentVolumeClaims
  create directories on an NFS server
  register those directories as PersistentVolumes


That component is the NFS Subdir External Provisioner.

This post focuses on installing it intentionally using Helm—and understanding what you’re enabling when you do.



What This Provisioner Does

The NFS Subdir External Provisioner:


  runs as a pod in your cluster
  listens for PVCs referencing its StorageClass
  creates subdirectories on an external NFS server
  dynamically provisions PersistentVolumes


Kubernetes can mount NFS volumes, but it has no built-in dynamic provisioner for NFS.

This provisioner is the bridge.



Prerequisites

Before installing anything, you need:


  a reachable NFS server
  an exported directory writable by the provisioner
  network connectivity from cluster nodes to the NFS server
  Helm installed and configured


If the NFS server isn’t healthy, this installation will succeed—but provisioning will not.



Adding the Helm Repository

First, add the Helm repository that hosts the chart:

helm repo add nfs-subdir-external-provisioner \
  https://kubernetes-sigs.github.io/nfs-subdir-external-provisioner/

helm repo update


This makes the chart available locally.



Installing the Provisioner

The core installation uses helm install with a small but important set of values.

Example:

helm install nfs-provisioner \
  nfs-subdir-external-provisioner/nfs-subdir-external-provisioner \
  --namespace storage-system \
  --create-namespace \
  --set nfs.server=192.0.2.50 \
  --set nfs.path=/exports/kubernetes \
  --set storageClass.name=managed-nfs


Key values explained:


  nfs.server: address of the external NFS server
  nfs.path: base directory where subdirectories will be created
  storageClass.name: the StorageClass that PVCs will reference


This command installs the provisioner and registers a new StorageClass.



What Helm Is Actually Creating

After installation, you should see:


  a Deployment running the provisioner
  a Pod connected to the NFS server
  a StorageClass pointing to this provisioner


Helm handles object creation, but you are responsible for understanding the consequences.



Verifying the Installation

Check that the pod is running:

kubectl get pods -n storage-system


Confirm the StorageClass exists:

kubectl get storageclass


You should see the managed-nfs StorageClass listed.



Validating Dynamic Provisioning

Create a simple PVC referencing the StorageClass:

kubectl apply -f - <<EOF
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: test-nfs-claim
spec:
  accessModes:
    - ReadWriteMany
  storageClassName: managed-nfs
  resources:
    requests:
      storage: 1Gi
EOF


If provisioning works:


  a PersistentVolume will be created automatically
  a new directory will appear on the NFS server
  the PVC will bind successfully


This confirms end-to-end functionality.

If the reported size appears smaller than expected, this is often a unit conversion issue rather than a provisioning failure.

See: Understanding Byte Size Units (Without Overthinking Them)



Common Failure Modes

If things don’t work, check:


  NFS server permissions
  firewall rules
  pod logs for the provisioner
  correctness of nfs.server and nfs.path
  whether the StorageClass name matches the PVC


Most failures are external to Kubernetes.



When This Is (and Isn’t) the Right Choice

This approach works well for:


  shared storage
  development clusters
  on-prem environments
  workloads needing ReadWriteMany


It may not be appropriate for:


  high-performance databases
  latency-sensitive workloads
  cloud-native block storage replacements


NFS is a tool, not a default.



How This Fits the Bigger Picture

This installation enables the architecture described in:


  How NFS-Backed Persistent Volumes Actually Work in Kubernetes


Understanding the model first makes this step predictable instead of magical.

Helm just applies the intent—you still own the system.


Fixing Kubernetes Namespaces Stuck in Terminating

2024-02-22T00:00:00-08:00

Context

Most Kubernetes resources delete cleanly.

Namespaces are the exception.

When a namespace gets stuck in Terminating, it’s usually not because Kubernetes is broken—it’s because Kubernetes is waiting for something else to finish its job.

Understanding why that happens requires understanding finalizers.



What a Namespace Deletion Actually Means

Deleting a namespace is not a single operation.

When you run:

kubectl delete namespace example-namespace


Kubernetes:


  marks the namespace for deletion
  enumerates all namespaced resources
  waits for controllers to clean up what they own
  removes finalizers
  deletes the namespace object


If any step stalls, the namespace remains in Terminating.



What Finalizers Are (Conceptually)

A finalizer is a promise.

It says:

  “Do not delete this object until I have cleaned something up.”


Finalizers are commonly added by:


  controllers
  operators
  storage provisioners
  custom resources


They exist to prevent data loss and orphaned infrastructure.

The downside: if the controller is gone or broken, the promise is never fulfilled.



Why Namespaces Get Stuck

Namespaces typically get stuck when:


  a controller was removed before cleanup finished
  a CRD was deleted before its instances
  a storage provisioner no longer exists
  a webhook or operator is failing
  finalizers reference resources that no longer respond


At that point, Kubernetes is waiting for a cleanup step that will never occur.



Confirming the Problem

First, verify the namespace state:

kubectl get namespace example-namespace


If it shows:

STATUS   Terminating


Inspect it more closely:

kubectl describe namespace example-namespace


Often, you’ll see references to remaining resources or finalizers.

For deeper inspection:

kubectl get namespace example-namespace -o json


Look specifically at:

spec.finalizers




Why Force Deletion Usually Doesn’t Work

Commands like:

kubectl delete namespace example-namespace --force --grace-period=0


are commonly tried—and commonly ineffective.

That’s because:


  finalizers live at the API level
  force deletion does not bypass finalizers
  Kubernetes is still honoring the contract


Force only skips graceful termination, not cleanup guarantees.



The Last-Resort Fix: Removing Finalizers

⚠️ This is an administrative recovery action.

You are explicitly telling Kubernetes to stop waiting.

Proceed only when:


  you understand what’s stuck
  the owning controller no longer exists
  cleanup cannot complete naturally




Step 1: Export the Namespace Definition

kubectl get namespace example-namespace -o json > namespace.json




Step 2: Remove the Finalizers

Edit namespace.json and remove the finalizers field under spec entirely. Leave everything else, including metadata, untouched.

Before:

"spec": {
  "finalizers": [
    "kubernetes"
  ]
}


After:

"spec": {}




Step 3: Submit the Finalized Object

kubectl replace --raw "/api/v1/namespaces/example-namespace/finalize" \
  -f namespace.json


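This bypasses the normal deletion workflow. Writing an empty finalizer list through the namespace's finalize subresource tells the API server:

  “Delete this namespace now.”


If successful, the namespace disappears within seconds, because it was already marked for deletion and nothing is left to wait for.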
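
For reference, Steps 1 to 3 can be collapsed into a single pipeline. This is a minimal sketch, assuming jq is installed; finalize_namespace is a hypothetical helper name. It sets spec.finalizers to an empty list rather than deleting the field, which has the same effect on the finalize endpoint.

```shell
# Hypothetical helper combining Steps 1-3; assumes jq is installed.
# Usage: finalize_namespace <namespace>
finalize_namespace() {
  # Export the namespace, empty its finalizer list, and submit the
  # result to the finalize subresource without a temporary file.
  kubectl get namespace "$1" -o json |
    jq '.spec.finalizers = []' |
    kubectl replace --raw "/api/v1/namespaces/$1/finalize" -f -
}
```

The same warning applies: call finalize_namespace example-namespace only after confirming that nothing can complete the cleanup normally.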



What You’re Skipping by Doing This

Removing finalizers means:


  controllers do not clean up external resources
  storage or cloud artifacts may remain
  audit trails may be incomplete


This is why this approach is corrective, not routine.



When This Is the Right Call

This approach is appropriate when:


  the cluster is already inconsistent
  the namespace is blocking automation
  recovery is impossible via normal controllers
  the resources are already orphaned


In practice, this is often the only viable path forward.



Preventing This in the Future

A few practices reduce the odds of hitting this:


  delete CR instances before deleting CRDs
  remove operators last, not first
  monitor namespaces during teardown
  understand which controllers add finalizers
  treat namespace deletion as a process, not a command


Finalizers are powerful—but they require discipline.
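
One way to encode that ordering is a small teardown helper. This is a sketch under assumptions: teardown_namespace is a hypothetical name, the timeouts are arbitrary, and the caller is expected to know which custom resource kinds live in the namespace. It deletes custom resource instances while their controllers are still running, then deletes the namespace and waits for it.

```shell
# Hypothetical teardown helper: custom resource instances first,
# then the namespace itself. Operators and CRDs are removed afterwards.
# Usage: teardown_namespace <namespace> <kind> [<kind> ...]
teardown_namespace() {
  ns="$1"
  shift
  for kind in "$@"; do
    # The owning controller is still running here, so it can
    # process its finalizers normally.
    kubectl delete "$kind" --all -n "$ns" --timeout=120s || return 1
  done
  # Fails with a timeout instead of hanging if something is stuck.
  kubectl delete namespace "$ns" --timeout=300s
}
```

Usage, with a made-up resource kind: teardown_namespace example-namespace widgets.example.com. A non-zero exit is the signal to inspect the namespace before removing anything else.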



Practical Takeaways


  namespaces don’t delete instantly by design
  finalizers exist to protect external state
  stuck namespaces usually mean broken cleanup
  force deletion does not bypass finalizers
  removing finalizers is a last resort, acceptable only when cleanup is impossible


This is one of those Kubernetes behaviors that feels mysterious—until it isn’t.

Once you understand the contract, the fix becomes deliberate instead of desperate.


Creating Kubernetes Secrets from the Command Line (and When Not To)
2024-02-19T00:00:00-08:00
Context

Kubernetes Secrets are often introduced early, but rarely explained clearly.

Most examples focus on how to create a Secret, not:


  why you’d choose one method over another
  what tradeoffs you’re making
  how Secrets fit into a broader operational model


This post focuses specifically on creating Secrets from the command line using kubectl create secret, and—just as importantly—when not to do that.



What kubectl create secret Actually Does

At a high level, kubectl create secret:


  takes input (literals, files, or env files)
  base64-encodes the values
  submits a Secret object to the Kubernetes API server


It does not:


  encrypt values by itself
  manage secret rotation
  track provenance
  enforce security policies


It is a creation mechanism, not a secrets management system.
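
The difference between encoding and encryption is easy to see without a cluster. A minimal sketch, assuming a base64 binary that accepts -d for decoding (GNU coreutils and BusyBox do); the value is the same placeholder used elsewhere in this post.

```shell
# Encode a value the way it appears in a Secret's data field.
encoded=$(printf '%s' 'example_password' | base64)

# Anyone who can read the Secret can reverse it. No key is involved.
decoded=$(printf '%s' "$encoded" | base64 -d)

echo "stored:    $encoded"
echo "recovered: $decoded"
```

Whatever protection a Secret has comes from RBAC and from how the cluster stores it, not from the encoding.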



Creating a Secret from Literal Values

The most direct pattern uses literals:

kubectl create secret generic example-db-creds \
  --from-literal=username=example_user \
  --from-literal=password=example_password


This is useful for:


  quick experiments
  local clusters
  validating application wiring


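It is not ideal for long-lived or production secrets: the values end up in your shell history and are visible in the process list while the command runs.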



Creating a Secret from a File

A more common pattern is file-based creation:

kubectl create secret generic example-config \
  --from-file=application.yaml


This creates a Secret where:


  the key is the filename (application.yaml here; use --from-file=<key>=<path> to choose a different key)
  the value is the file contents


This works well for:


  config blobs
  certificates
  structured files


But it still raises questions about where that file lives and how it’s protected.



Creating Secrets from Environment Files

Environment-style files can also be used:

kubectl create secret generic example-env \
  --from-env-file=.env


This is convenient, but dangerous if:


  .env files are committed accidentally
  shell history is not managed carefully
  multiple environments share similar filenames


Convenience and risk scale together here.



Namespaces Matter More Than Syntax

By default, Secrets are created in the namespace of your current kubectl context, or in default if the context does not set one.

This is one of the most common failure modes.

Always be explicit:

kubectl create secret generic example-db-creds \
  --from-literal=username=example_user \
  --from-literal=password=example_password \
  -n example-namespace


To a Pod, a Secret in the wrong namespace is indistinguishable from a missing one, because Pods can only reference Secrets in their own namespace.
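
If you are unsure where an unqualified command would land, you can ask kubectl first. A small sketch; current_namespace is a hypothetical helper name, and the fallback to default reflects what kubectl does when the context sets no namespace.

```shell
# Print the namespace kubectl will use when -n is omitted.
current_namespace() {
  ns=$(kubectl config view --minify -o jsonpath='{..namespace}')
  # An empty result means the context sets no namespace,
  # so kubectl falls back to "default".
  echo "${ns:-default}"
}
```

Running current_namespace before a create is a cheap check, but passing -n explicitly remains the more reliable habit.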



Inspecting What You Created

You can confirm a Secret exists with:

kubectl get secret example-db-creds -n example-namespace


And inspect metadata (key names and value sizes, not the values themselves) with:

kubectl describe secret example-db-creds -n example-namespace


Avoid decoding values casually unless you need to verify wiring.

If you do decode, do it intentionally and clean up afterward.
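
When decoding is warranted, pulling a single key keeps the exposure small. A minimal sketch; decode_secret_key is a hypothetical helper, and it assumes the key name contains no dots (a key such as application.yaml would need the dot escaped in the jsonpath expression).

```shell
# Hypothetical helper: print one decoded key from a Secret.
# Usage: decode_secret_key <secret> <key> <namespace>
decode_secret_key() {
  # .data holds base64-encoded values, so decode after extracting.
  kubectl get secret "$1" -n "$3" -o "jsonpath={.data.$2}" | base64 -d
}
```

Usage: decode_secret_key example-db-creds username example-namespace. The value is written to the terminal, so it can linger in scrollback; clear it or pipe the output straight into whatever check you are performing.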



When kubectl create secret Is the Right Tool

This approach works well when:


  bootstrapping a cluster
  validating application configuration
  working in ephemeral environments
  teaching or learning Kubernetes mechanics


It’s a mechanical tool, not a long-term strategy.



When kubectl create secret Becomes a Liability

Problems arise when:


  secrets are created manually and forgotten
  values live in shell history
  environments drift
  rotation becomes manual and error-prone
  auditability matters


At scale, this approach does not age well.



Better Patterns for the Long Term

As systems mature, secrets creation usually moves toward:


  GitOps workflows
  external secret managers
  sealed or encrypted manifests
  automated rotation


In those models:


  kubectl create secret is often replaced
  or used only as a bootstrap mechanism


That’s not a failure—it’s progress.
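
As a bootstrap mechanism, kubectl create secret can render a manifest locally instead of creating the object. The sketch below assumes an env file as input; render_secret_manifest is a hypothetical helper name. The output is still only base64-encoded, so it should be passed through whatever sealing or encryption tool your workflow uses before it goes anywhere near a repository.

```shell
# Hypothetical helper: render a Secret manifest locally instead of
# creating it in the cluster.
# Usage: render_secret_manifest <secret> <env-file> <namespace>
render_secret_manifest() {
  # --dry-run=client builds the object without sending it to the API server.
  kubectl create secret generic "$1" \
    --from-env-file="$2" \
    -n "$3" \
    --dry-run=client -o yaml
}
```

Usage: render_secret_manifest example-env .env example-namespace prints the YAML to stdout, ready to be piped into the next tool.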



Practical Takeaways


  kubectl create secret is about object creation, not security
  be explicit about namespaces
  understand where secret material lives
  treat manual creation as transitional
  plan for replacement as systems grow


Secrets are less about syntax and more about discipline.

Understanding the limits of your tools is part of operating Kubernetes responsibly.


Understanding Docker Containers by Using docker create Explicitly
2024-02-16T00:00:00-08:00
Context

Most people interact with Docker through docker run. It’s convenient, compact, and hides a lot of complexity.

But docker run is actually doing two things at once:


  creating a container
  starting it


When you separate those steps using docker create, Docker becomes easier to reason about—especially when debugging, experimenting, or learning how containers are really wired.

This post explains why and when that separation matters.



docker run vs docker create

At a high level:

docker run = docker create + docker start (plus attaching to the output, unless you pass -d)


docker create:


  defines a container
  stores its configuration
  does not start it


docker start:


  executes an already-defined container


That distinction is subtle, but important.
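
The split can be made concrete with a short helper. This is a rough equivalent of docker run -d, written as a sketch; run_in_two_steps is a hypothetical name, and it accepts the same arguments you would give docker create.

```shell
# Roughly what `docker run -d ...` does, in two explicit steps.
# Usage: run_in_two_steps <docker create args...>
run_in_two_steps() {
  # docker create prints the new container's ID on stdout.
  cid=$(docker create "$@") || return 1
  # Between these two lines the container exists but is not running.
  docker start "$cid"
}
```

Usage: run_in_two_steps example-image:latest. The gap between the two commands is the point: it is where inspection and correction can happen.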



Why Use docker create Explicitly?

Using docker create forces you to think in terms of:


  container identity
  configuration as state
  lifecycle boundaries


This is especially useful when:


  debugging complex flags
  iterating on volume mounts
  inspecting container configuration before execution
  understanding restart behavior


It’s also a bridge toward thinking in Compose and Kubernetes terms.



Anatomy of a docker create Command

A typical docker create command defines:


  the image
  container name
  ports
  volumes
  environment variables
  restart policies


Example pattern:

docker create \
  --name example-service \
  -p 8080:8080 \
  -v example-data:/var/lib/example \
  -e EXAMPLE_MODE=production \
  --restart unless-stopped \
  example-image:latest


Nothing runs yet. Docker pulls the image if it is not already local, prepares the container's writable layer, and records intent.



Inspecting Before Running

Once created, you can inspect the container:

docker inspect example-service


This is where docker create shines.

You can:


  verify mounts
  confirm ports
  inspect environment variables
  validate restart policies


All without starting the container.
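
The full docker inspect output is long, so a format template helps when you only want the fields listed above. A minimal sketch; inspect_summary is a hypothetical helper, while the field paths are the ones docker inspect exposes.

```shell
# Hypothetical helper: summarize the key settings of one container.
# Usage: inspect_summary <container>
inspect_summary() {
  docker inspect --format 'state:   {{.State.Status}}' "$1"
  docker inspect --format 'binds:   {{json .HostConfig.Binds}}' "$1"
  docker inspect --format 'ports:   {{json .HostConfig.PortBindings}}' "$1"
  docker inspect --format 'env:     {{json .Config.Env}}' "$1"
  docker inspect --format 'restart: {{.HostConfig.RestartPolicy.Name}}' "$1"
}
```

Usage: inspect_summary example-service. For a container that has been created but never started, the state line reads created.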



Starting and Stopping Becomes Explicit

After creation:

docker start example-service


Stopping and restarting now operate on a known container, not a transient command.

This makes container behavior:


  more predictable
  easier to debug
  less error-prone
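
One consequence of an explicit lifecycle is that most settings (ports, mounts, environment) are fixed at creation, so changing them means replacing the container. A sketch of that loop, assuming named volumes hold any state you care about; recreate_container is a hypothetical helper name.

```shell
# Hypothetical helper: replace a container with a freshly created one.
# Usage: recreate_container <name> <docker create args...>
recreate_container() {
  name="$1"
  shift
  # -f stops the container first if it is running. Errors are ignored
  # so the helper also works when no container exists yet.
  docker rm -f "$name" >/dev/null 2>&1 || true
  docker create --name "$name" "$@"
}
```

Usage: recreate_container example-service -p 8080:8080 -v example-data:/var/lib/example example-image:latest, followed by docker start example-service. Named volumes such as example-data survive the removal.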




Patterns This Enables

Using docker create works well for:


  long-running infrastructure containers
  stateful services
  local development environments
  reproducing issues reliably


It’s less useful for:


  one-off commands
  disposable CI jobs
  quick experiments


Knowing when not to use it is part of the skill.



Relationship to Docker Compose

Docker Compose formalizes what docker create makes explicit:


  declarative configuration
  repeatability
  separation of config and execution


If docker create feels helpful, Compose is usually the next step.

If Compose feels confusing, docker create is often a good way to learn why it exists.



Practical Takeaways


  docker create exposes container configuration clearly
  separating creation from execution improves debuggability
  explicit lifecycle boundaries reduce surprises
  this mental model scales toward Compose and Kubernetes


Understanding containers starts with understanding how they are defined.