Azure Fundamentals

Azure Cloud Economics: Pricing, TCO, SLAs, Service Lifecycle & Support

The cloud’s most famous promise is “pay only for what you use” — but that single sentence hides almost everything that matters about running on Azure. What exactly are you paying for? When does the meter start? Why did egress cost more than the virtual machine? What does Microsoft actually guarantee in exchange for the money, and what happens when a service falls short? And when something breaks at 2 a.m., who picks up the phone? This lesson answers all of it.

Cloud economics is the pillar that turns a hobbyist into an architect. Anyone can click “Create”; the person who can predict the bill, defend the design on cost, read an SLA correctly, and know which support plan a workload needs is the person organisations actually trust with production. It is also a heavily weighted theme on the AZ-900: Azure Fundamentals exam — the “describe cost management” and “describe SLAs and service lifecycle” domains together account for a meaningful slice of the questions. We go from zero, define every term, and stay grounded in real decisions and real numbers (illustrative INR figures included, because that is the currency you will be budgeting in).

Learning objectives

By the end of this lesson you can:

Prerequisites & where this fits

You need only basic IT literacy and a rough idea of what a virtual machine, a database, and storage are — no prior cloud billing experience is assumed, and every financial term is defined the first time it appears. A free Azure account is enough to follow the hands-on lab (the lab itself costs nothing). This is a Foundations lesson in the Azure Zero-to-Hero course and the economics counterpart to the “What is Azure?” lesson: where that one taught you how Azure is organised, this one teaches you how Azure charges you and what it guarantees. It builds the financial vocabulary that every later design lesson quietly assumes.

CapEx vs OpEx: the mental shift

Before a single price, understand the kind of spending the cloud represents, because it reshapes how a whole organisation budgets.

Azure converts most IT spending from CapEx into OpEx. The practical consequences are the heart of the cloud business case:

Dimension Traditional data centre (CapEx) Azure (OpEx)
Up-front cost Large; buy hardware before first use Near zero; provision on demand
Capacity planning Guess peak years ahead; over-provision to be safe Scale up and down as demand moves
Time to provision Weeks to months (purchase, ship, rack, cable) Minutes
Idle capacity You paid for it; it is sunk Deallocate it and stop paying
Risk of getting it wrong High — wrong hardware is a multi-year mistake Low — change the SKU tomorrow
Who carries the depreciation You Microsoft

The instinct to build early: in the cloud, elasticity has financial value. The ability to give capacity back is worth as much as the ability to add it, because every idle resource is now a line item you could simply switch off.

The three pricing models

Almost every Azure charge resolves to one of three pricing models. Knowing which one fits a workload is one of the most valuable cost skills there is.

Model How you pay Commitment Typical saving vs PAYG Best for The catch
Pay-as-you-go (consumption) Per second/hour/operation/GB actually used None Baseline (0%) Spiky, unpredictable, short-lived, or new workloads Most expensive per unit; no discount
Reserved (Reservations / Savings Plans) Commit to 1 or 3 years of usage, paid up front or monthly 1 or 3 years Up to ~72% (varies by service/term) Steady, predictable baseline running 24×7 You pay even if you stop using it
Spot Bid for Azure’s spare capacity at a deep discount None, but can be evicted Up to ~90% Interruptible, stateless, batch, dev/test, fault-tolerant work Azure can reclaim the capacity with ~30 s notice

How to think about each:

A separate, stackable discount worth naming here is the Azure Hybrid Benefit: if you already own eligible Windows Server or SQL Server licences with Software Assurance, you can apply them to Azure VMs and pay only for the compute, saving a large fraction of the licensing cost. It layers on top of reservations.

What actually drives the bill

When a bill surprises someone, the cause is almost always one of five drivers. Internalise these and you can estimate costs in your head and read an invoice with confidence.

Driver What it means Why it bites
Compute vCPU + RAM hours for VMs, App Service plans, AKS nodes, Functions executions Usually the largest line item; the wrong SKU or a VM left running overnight dominates the bill
Storage GB stored per month × the tier, plus transaction/operation counts Cheap per GB but accumulates forever; hot vs cool vs archive tiers differ by an order of magnitude
Data transfer (egress) Data leaving Azure to the internet or across regions Inbound is free; outbound is metered. Cross-region replication and chatty internet APIs quietly add up
Region The same service costs different amounts in different regions A premium region can cost noticeably more than a cheaper one for identical specs
Tier / SKU The service level you choose (Basic vs Standard vs Premium) Higher tiers add features and price; over-provisioning the tier is a classic waste

Two of these trip up beginners hardest:

A useful estimation instinct: Cost ≈ (what) × (how much) × (how long) × (where) × (which tier) — the resource, its quantity, its run-time, its region, and its service level. Get those five right and your estimate is usually within a sensible margin.

The two calculators: Pricing vs TCO

Microsoft gives you two free, login-free web calculators, and a common exam (and real-life) mistake is reaching for the wrong one. They answer different questions.

Calculator Question it answers What you put in What you get out Use it when
Pricing Calculator “What will this Azure design cost per month?” Specific Azure services, SKUs, regions, hours, tiers An itemised monthly/annual Azure estimate you can save and share Designing or sizing a workload you intend to build on Azure
Total Cost of Ownership (TCO) Calculator “How much cheaper is Azure than my current on-premises setup?” Your existing servers, storage, network, electricity, IT labour A side-by-side on-prem vs Azure cost comparison over years Building the migration business case for moving to the cloud

The distinction in one line: the Pricing Calculator prices a future Azure design; the TCO Calculator compares your existing on-premises costs against the equivalent on Azure to justify a move. If a question asks “estimate the monthly cost of a planned set of Azure VMs,” that is the Pricing Calculator. If it asks “show leadership the savings of migrating our data centre,” that is the TCO Calculator.

Governing spend: Microsoft Cost Management

Estimating cost before you build is half the job; controlling it after you build is the other half. Microsoft Cost Management is Azure’s free, built-in toolset for monitoring, allocating, and optimising spend. Its four pillars:

Tool What it does The key idea
Cost analysis Interactive view of actual + forecast spend, sliced by subscription, resource group, service, region, or tag See where the money goes and where it is heading
Budgets A spending threshold (per subscription/RG/etc.) that fires alerts at chosen percentages (e.g. 80%, 100%) Get warned before you blow the budget
Azure Advisor (Cost) Automated recommendations: right-size or shut down idle VMs, buy reservations, delete unattached disks Azure tells you how to save
Tags Key/value labels (e.g. env=prod, costcentre=marketing) attached to resources Allocate and charge back cost to teams/projects

Each tool, a little deeper:

Service-level agreements (SLAs)

An SLA is Microsoft’s formal, contractual promise about a service’s availability — the percentage of time it will be up in a given period — and the service credit (a partial refund) you can claim if it misses. SLAs are expressed as a percentage of uptime, and the difference between two percentages that look close is dramatic once you translate it into allowed downtime.

SLA Allowed downtime / month (~30 days) Allowed downtime / year Plain-English feel
99% (“two nines”) ~7 h 18 m ~3.65 days Generous; fine for non-critical/dev
99.9% (“three nines”) ~43.8 m ~8.76 h Common single-instance PaaS baseline
99.95% ~21.9 m ~4.38 h Typical for VMs in an availability set / multi-instance
99.99% (“four nines”) ~4.38 m ~52.6 m Achieved with availability zones
99.999% (“five nines”) ~26 s ~5.26 m Elite; multi-region, carefully engineered

Read this table once and the stakes are obvious: moving from 99.9% to 99.99% shrinks tolerated downtime from nearly 9 hours a year to under an hour. That extra “nine” is exactly what availability zones buy you.

Composite SLA: chaining services multiplies risk

Here is the single most important — and most commonly tested — SLA concept. When your application depends on several services in series (a request must pass through all of them to succeed), the overall SLA is not the lowest one. You multiply the individual SLAs, because any component failing breaks the request. This product is the composite (or compound) SLA.

Composite SLA = SLA₁ × SLA₂ × SLA₃ × …

A worked example. Suppose a web request flows through App Service (99.95%) → Azure SQL Database (99.99%) → Azure Storage (99.9%), each as a hard dependency:

0.9995 × 0.9999 × 0.999 = 0.99840  →  about 99.84%

The composite SLA (~99.84%) is lower than the weakest single component (99.9%), because the small risks compound. The more services you chain in series, the lower the combined promise — a sobering reason to keep critical paths short.

The opposite move is redundancy in parallel: deploy two independent copies of a component so the request succeeds if either works. Parallel redundancy raises availability (the chance both fail at once is the product of their tiny failure probabilities), which is precisely why we add zones and regions. The maths is the inverse: combined failure = failure₁ × failure₂, so two 99.9% instances in parallel approach 99.9999%.

How availability zones and region pairs raise the SLA

The ladder to remember: single instance → 99.9%; availability set / multiple instances → 99.95%; availability zones → 99.99%; multi-region → toward 99.999%. Each rung adds resilience, and cost.

The service lifecycle: preview vs GA

Not every Azure feature is production-ready, and the service lifecycle tells you how finished a thing is. There are three stages.

Stage Who can use it Maturity SLA & support Production use?
Private preview Invited customers only Early, experimental, may change drastically No SLA, limited/no support No
Public preview Anyone (often opt-in) Feature-complete-ish but still changing; “use at your own risk” No SLA, best-effort support No — test and evaluate only
General availability (GA) Everyone Stable, fully supported Backed by an SLA, full support Yes

The phrase “preview means no SLA” is exam-critical and has a precise meaning: while a service is in preview (private or public), Microsoft makes no availability guarantee and you cannot claim service credits if it goes down or misbehaves. Preview features may also change their API, change their pricing, or be withdrawn entirely before GA. The rule of thumb: explore in preview, depend in GA. Use previews to learn and to plan, but build production systems on GA services so you are covered by an SLA and full support. (Microsoft also publishes retirement notices when a service is being deprecated — the other end of the lifecycle — typically with a year or more of notice.)

Azure support plans

When something goes wrong, the support plan on your subscription decides how fast Microsoft will help and how much hand-holding you get. There are four (excluding the bespoke enterprise agreements). They are cumulative: each tier adds to the one below.

Plan Indicative monthly cost Who it is for Technical support Fastest response target
Basic Free (every subscription) Everyone No paid technical support — docs, community, Service Health, Advisor only n/a
Developer Low (~US$29) Trial / non-production / dev-test Business-hours email support for non-critical issues ~8 business hours
Standard Mid (~US$100) Production workloads 24×7 support for production-down issues, phone + email ~1 hour for critical (Sev A)
Professional Direct (ProDirect) High (~US$1000) Business-critical workloads 24×7 with faster response, plus ProDirect delivery — architecture guidance, proactive advice, escalation management ~1 hour for critical, with priority handling

What actually changes as you climb the tiers:

A clean way to remember the decision: learning → Basic; dev/test → Developer; production → Standard; business-critical → Professional Direct.

Azure Service Health

Independent of your support plan, Azure Service Health is a free, personalised dashboard of the health of the Azure services you use. It surfaces three things: Service issues (active Azure outages affecting your resources right now), Planned maintenance (upcoming work that may affect you), and Health advisories (changes and deprecations that need your attention). You can set alerts on it so you are notified the moment an issue touches a region or service you depend on. Do not confuse it with the Azure status page (the global status of all Azure, for everyone) or with Resource Health (the health of one specific resource instance). Service Health is the personalised middle layer — and your first stop when “is it me or is it Azure?”.

Azure cloud economics: pricing, TCO, SLAs & support

The diagram ties the pillars together: the pricing models and cost drivers feed the calculators and Cost Management on the spend side, while the SLA ladder, service lifecycle, and support plans form the value-and-assurance side of the same coin.

Hands-on lab

You will explore the real cost and health tooling in your own subscription — entirely within the free tier, creating no billable resources. All you need is a free Azure account and Azure Cloud Shell (the browser-based terminal in the portal). Everything below is read-only or trivially reversible.

Step 1 — Open Cloud Shell. In the Azure portal, click the terminal icon (>_) in the top bar and choose Bash. Cloud Shell is free; you are only billed for the small backing storage if you opt into it (a few rupees a month — you can decline and use an ephemeral session).

Step 2 — Inspect your subscription and current spend context.

# Confirm which subscription you are in
az account show --output table

# List the resource providers — these are the catalogues of services you *could* be billed for
az provider list --query "[?registrationState=='Registered'].namespace" --output table | head

Expected output: a table with your subscription name/ID and a list of registered providers (e.g. Microsoft.Compute, Microsoft.Storage). This is the surface area of what can generate cost.

Step 3 — Create a budget with an alert (this is free; a budget is metadata, not a billable resource). Replace <SUBSCRIPTION_ID> with the ID from Step 2.

# Set a ₹500 monthly budget that alerts the account owner at 80% and 100%
az consumption budget create \
  --budget-name "learn-monthly-cap" \
  --amount 500 \
  --category Cost \
  --time-grain Monthly \
  --start-date 2026-06-01 \
  --end-date 2027-06-01 \
  --resource-group "$(az group list --query '[0].name' -o tsv)" 2>/dev/null \
  && echo "Budget created" || echo "If this errors, create the budget in Portal → Cost Management → Budgets (the CLI surface varies by environment)"

Expected output: Budget created (or the fallback hint). Either way, now open Portal → Cost Management → Budgets to see it: this is the safety net that warns you before spend runs away. Note that it only alerts — it does not stop anything.

Step 4 — Read Azure Advisor’s cost recommendations (free, read-only).

# List any cost-saving recommendations Advisor has for your subscription
az advisor recommendation list --category Cost --output table 2>/dev/null \
  || echo "No cost recommendations yet (expected on a brand-new/empty subscription)"

Expected output: a (possibly empty) table. On an empty learner subscription there may be nothing yet; on a real subscription you would see “right-size VM”, “delete unattached disk”, etc.

Step 5 — Check Service Health from the portal. Go to Portal → Service Health → Service issues. You will see any active Azure incidents affecting your services and regions. Click Health alerts to see how you would wire up notifications. This is read-only.

Validation. You have (a) confirmed your billing scope, (b) created a budget you can see under Cost Management, © queried Advisor for savings, and (d) viewed Service Health — the four spend-and-assurance tools from this lesson, in your own account.

Cleanup. Delete the budget so nothing lingers:

az consumption budget delete --budget-name "learn-monthly-cap" \
  --resource-group "$(az group list --query '[0].name' -o tsv)" 2>/dev/null \
  || echo "If created in the portal, delete it under Cost Management → Budgets"

Cost note. This entire lab is free. Budgets, Advisor, Cost analysis, and Service Health are all no-cost features. If you opted into Cloud Shell’s persistent storage, that is roughly ₹5–₹15/month for the small file share — delete the backing storage account if you do not want even that.

Common mistakes & troubleshooting

Symptom Likely cause Fix
“My budget didn’t stop the spending!” A budget only alerts; it does not cap or shut down resources Wire the budget alert to an action group that runs automation (e.g. deallocate VMs), or use Azure Policy/quotas to prevent the spend
Surprise charges after a “free” migration Egress (outbound/cross-region data transfer) not estimated Model egress in the Pricing Calculator; minimise cross-region chatter and large internet responses
Reservation bought but bill unchanged Reservation scope wrong, or usage doesn’t match the reserved SKU/region Check the reservation’s scope and utilisation in Cost Management; use a Savings Plan if the workload SKU drifts
Picked the TCO Calculator to size a new app Confusing TCO (on-prem comparison) with Pricing (Azure estimate) Use the Pricing Calculator to estimate an Azure design; TCO is for the migration business case
Built production on a preview feature, no credit after an outage Preview = no SLA; service credits don’t apply Use GA services in production; keep previews to evaluation only
Expected the composite SLA to equal the lowest component Chained dependencies multiply, they don’t take the minimum Compute the product of all in-series SLAs; shorten the critical path or add parallel redundancy
Can’t open a technical support ticket On the Basic (free) plan, which has no paid technical support Upgrade to Developer (dev/test) or Standard (production)

Best practices

Security notes

Interview & exam questions

  1. What is the difference between CapEx and OpEx, and how does the cloud change the balance? CapEx is up-front spend on owned assets that depreciate; OpEx is ongoing pay-as-you-go consumption. The cloud converts most IT spend from CapEx to OpEx, removing up-front cost and over-provisioning risk and making elasticity financially valuable.
  2. Name Azure’s three pricing models and when you’d use each. Pay-as-you-go (consumption) for spiky/unpredictable/short-lived workloads; reservations or savings plans for steady 24×7 baselines (commit 1–3 years for up to ~72% off); spot for interruptible, fault-tolerant batch work (up to ~90% off, but can be evicted).
  3. What’s the difference between a Reservation and a Savings Plan? A reservation commits to a specific resource (e.g. a VM size in a region); a savings plan commits to a fixed hourly compute spend across services — more flexible, slightly smaller discount. Use a savings plan when the workload’s exact SKU may change.
  4. A web app calls App Service (99.95%) → SQL DB (99.99%) → Storage (99.9%) in series. What’s the composite SLA? Multiply them: 0.9995 × 0.9999 × 0.999 ≈ 99.84%lower than the weakest component, because in-series risks compound.
  5. How do availability zones and region pairs each raise availability, and how do they differ? Zones are separate datacentres within a region (lift a single instance from 99.9% to 99.99%, protecting against a datacentre failure with no cross-region latency). Region pairs are two regions for disaster recovery (protect against a whole-region outage; add latency and egress cost).
  6. Which calculator do you use to justify a data-centre migration, and which to size a new Azure app? TCO Calculator compares your existing on-prem costs to Azure (the migration business case); Pricing Calculator estimates the cost of a planned Azure design.
  7. Does an Azure budget stop you from overspending? No. A budget alerts at thresholds (and can trigger an action group); it does not cap or shut down resources by itself. To actually prevent spend, use automation, quotas, or policy.
  8. What does “preview means no SLA” mean in practice? While a service is in private or public preview, Microsoft gives no availability guarantee and no service credits; the feature may change or be withdrawn. Use previews for evaluation only; run production on GA services.
  9. Which support plan would you put on a production workload, and which is free? Basic is free on every subscription (self-service only, no paid technical support). Production typically needs Standard (24×7, ~1-hour critical response). Business-critical estates use Professional Direct.
  10. What’s the difference between Azure Service Health, the Azure status page, and Resource Health? The status page is the global status of all Azure; Service Health is personalised to the services and regions you use; Resource Health is the health of one specific resource instance.
  11. Why is data egress a notorious cost driver? Inbound data is free, but outbound data (to the internet or across regions) is metered per GB after a small allowance — multi-region replication and chatty internet APIs can run egress that exceeds the compute cost.
  12. How do tags help with cost? Tags (key/value labels) attach to resources and flow into cost records, letting you slice spend by team, project, or environment for showback/chargeback. Enforce a consistent scheme with Azure Policy.

Quick check

  1. Spreading a workload across availability zones typically raises its SLA from 99.9% to which tier?
  2. True or false: an Azure budget automatically shuts down resources when you hit 100%.
  3. You have a steady production database running 24×7 for the next three years. Which pricing model is cheapest?
  4. Which calculator builds the business case for migrating an on-premises data centre to Azure?
  5. Three services chained in series have SLAs of 99.9%, 99.95% and 99.99%. Is the composite SLA higher or lower than 99.9%?

Answers

  1. 99.99% (“four nines”) — zone redundancy is what earns that extra nine.
  2. False. A budget only alerts (and can trigger automation via an action group); it does not stop spending on its own.
  3. A reservation (1- or 3-year) — or a savings plan — because the baseline is predictable and steady; commitment buys a large discount.
  4. The Total Cost of Ownership (TCO) Calculator, which compares current on-prem costs against the Azure equivalent.
  5. Lower — in-series SLAs multiply (0.999 × 0.9995 × 0.9999 ≈ 99.84%), so the composite drops below the weakest component.

Exercise

Using only free tools, produce a one-page “cloud economics brief” for a hypothetical app — a small internet-facing web app (one App Service) plus one Azure SQL Database, expected to run 24×7 in your nearest region, with modest internet traffic:

  1. Estimate the monthly Azure cost in the Pricing Calculator (pick Basic/Standard tiers; include a rough egress figure). Note the total in INR.
  2. Compute the composite SLA for the App Service → SQL chain (multiply the two services’ SLAs) and write down the tolerated downtime per month it implies.
  3. Propose one resilience upgrade (e.g. zone redundancy) and state how it changes the SLA and roughly the cost.
  4. Pick the right support plan for this app once it’s in production, and justify it in one sentence.
  5. List two cost-control measures you’d put in place on day one (e.g. a budget with alerts; a tagging scheme).

This mirrors exactly what an architect is asked to produce before a project is approved.

Certification mapping

This lesson maps directly to AZ-900: Microsoft Azure Fundamentals, primarily the domains:

CapEx/OpEx and the cloud economic benefits also appear in the “describe cloud concepts” domain. Mastering this material covers a substantial, high-frequency portion of the AZ-900 question pool.

Glossary

Next steps

Now that you can reason about cost and SLAs, ground them in the physical platform they describe. Continue to Azure Global Infrastructure: Geographies, Regions, Availability Zones, Fault & Update Domains, which makes the availability concepts in this lesson concrete — exactly how zones, region pairs, fault domains, and update domains deliver the SLA ladder you just learned to calculate. From there, the commitment-strategy and FinOps lessons go deeper on squeezing the bill once you are running real workloads.

AzureCloud FundamentalsCloud EconomicsCost ManagementSLAAZ-900
Need this built for real?

Vinod is a Senior Cloud Architect (22+ yrs) — available for Azure / AWS / GCP architecture, landing zones, and migrations.

Work with me

Comments

Keep Reading