Azure AI Fundamentals

AI-900: AI & Machine Learning Fundamentals on Azure (incl. Responsible AI)

“Artificial intelligence” is one of the most over-used phrases in technology, and that makes it strangely hard to learn. Marketing slaps “AI-powered” on everything; films imagine conscious robots; vendors promise magic. Underneath the noise, though, AI is a concrete, learnable engineering discipline — and the foundational ideas are genuinely approachable. This lesson strips away the hype and teaches you what AI and machine learning actually are, the handful of problem shapes (“workloads”) they solve, how a model is built and evaluated, what Azure gives you to do it, and — crucially — how Microsoft expects you to build AI responsibly.

This is a foundation lesson in the Azure Zero-to-Hero course, so we assume zero prior knowledge of AI or statistics and define every term the first time it appears. We still go deeper than a typical intro: you will finish able to hold a sensible conversation about model types, recognise which Azure service fits which job, and explain all six Responsible AI principles with real examples. Everything here maps directly to the AI-900: Microsoft Azure AI Fundamentals certification, where these topics are worth a large share of the exam.

Learning objectives

By the end of this lesson you can:

Prerequisites & where this fits

You need only basic IT literacy and a little comfort with the idea that computers follow instructions — no maths beyond arithmetic, and no programming, is assumed for the concepts (the optional lab uses a no-code tool). It helps to have read the earlier Azure foundation lessons so terms like subscription, resource group and region are familiar, but it is not essential. This is the first lesson of the AI Fundamentals module in the Azure Zero-to-Hero course; it builds the vocabulary and mental model that the applied-AI lessons (Azure AI Vision, Language, Speech, Document Intelligence) and the generative-AI lesson all assume.

What is artificial intelligence?

The clearest working definition for our purposes: artificial intelligence is software that performs tasks which normally require human intelligence — recognising objects in a photo, understanding written or spoken language, spotting patterns, making predictions, or generating new content. It is not consciousness, and (today) it is not general reasoning across any task a person can do; it is a collection of techniques that each do something narrow surprisingly well.

It helps to nest three terms that are often used loosely as if they were synonyms:

Term What it means Relationship
Artificial intelligence (AI) The broad field: any technique that makes software behave “intelligently”. The outermost umbrella.
Machine learning (ML) A subset of AI in which the system learns patterns from data rather than being explicitly programmed with rules. Inside AI.
Deep learning A subset of ML that uses neural networks with many layers, especially good for images, audio and language. Inside ML.

The pivotal shift to understand is the one between traditional programming and machine learning. In traditional programming a developer writes the rules: if the email contains the word “lottery” and three exclamation marks, mark it as spam. That works until spammers change tactics, and someone must keep rewriting rules. In machine learning you do not write the rules at all — you show the system thousands of examples already labelled “spam” or “not spam”, and an algorithm discovers the patterns itself, producing a model. The model is the learned thing you then use to make predictions on new, unseen emails. This is why ML dominates modern AI: for messy, high-variety problems (images, speech, fraud, language), learning patterns from data beats hand-writing rules.

The common AI workloads

The AI-900 exam thinks in terms of workloads — the recognisable shapes of problem that AI solves. Recognising the shape is the most useful skill, because it tells you which family of services to reach for. Here are the ones you must know:

Workload What it does Everyday example Typical Azure service (covered next lesson)
Machine learning Predict a value or category from data; find patterns. Forecast tomorrow’s sales; predict which customers will churn. Azure Machine Learning
Computer vision Interpret the visual world — images and video. Detect defects on a production line; read a number plate. Azure AI Vision, Face
Natural language processing (NLP) Understand and work with written and spoken language. Gauge the sentiment of reviews; extract names from contracts; power a chatbot. Azure AI Language, Azure AI Speech
Document intelligence Extract structured information (fields, tables) from documents. Read invoices and receipts into a database automatically. Azure AI Document Intelligence
Knowledge mining Index large volumes of mixed content so it becomes searchable and queryable. Make a decade of PDFs and scanned reports instantly searchable. Azure AI Search
Generative AI Create new content — text, code, images — in response to a prompt. Draft an email, summarise a report, generate an image. Azure OpenAI (its own lesson)

A few notes that examiners like to probe. Computer vision and NLP are themselves usually powered by machine learning underneath — they are not separate from ML so much as ML applied to a particular kind of input (pixels, or words). Document intelligence is a focused blend of computer vision (reading the page) and NLP (understanding the fields). Generative AI is the newest workload and the one most people now mean when they say “AI”; it is built on very large deep-learning models and gets its own lesson because it behaves quite differently from the predictive workloads.

How machine learning learns: features, labels, and types

To talk sensibly about ML you need four words. Imagine a spreadsheet of past house sales used to predict prices:

Machine learning then splits into two big families based on whether your training data has labels:

Supervised learning (you have labelled answers)

In supervised learning every training example comes with the correct answer (the label), and the model learns to reproduce it. There are two sub-types you must distinguish for the exam:

Type Predicts Output Example Example algorithms
Regression A numeric, continuous value. A number. House price, tomorrow’s temperature, expected revenue. Linear regression, decision-tree/forest regressors
Classification A category (a class). A label from a fixed set. Spam vs not-spam (binary); cat/dog/bird (multi-class); is this transaction fraud? Logistic regression, decision trees, neural networks

The quick test: “how much / how many?” is regression; “which one / is it X?” is classification. Both are supervised because both need historic examples with known answers to learn from.

Unsupervised learning (no labels — find structure)

In unsupervised learning the data has no labels; the algorithm finds structure on its own. The headline technique for AI-900 is clustering — grouping similar items together without being told the groups in advance. For example, an online shop can cluster customers by purchasing behaviour to discover natural segments (“bargain hunters”, “premium buyers”) that nobody defined up front. Because there is no “correct answer” to check against, evaluating unsupervised models is more subjective than supervised ones.

Note: you may hear of a third family, reinforcement learning (an agent learns by trial-and-error from rewards, as in game-playing or robotics). It is good to recognise the term, but supervised and unsupervised are the two the fundamentals exam concentrates on.

Train, validate, test — and why you split the data

A model that simply memorises its training data is useless — it would score perfectly on what it has seen and fail on anything new. This failure is called overfitting. To detect and avoid it, you never evaluate a model on the same data you trained it on. You split your data into three parts:

Split Purpose Rough share
Training set The data the model learns from. ~70%
Validation set Used during development to tune the model and compare options without touching the test set. ~15%
Test set Held back untouched until the very end to give an honest estimate of how the model performs on unseen data. ~15%

The discipline is simple but vital: the test set is your stand-in for the real world, so peeking at it during training quietly inflates your confidence and undermines the whole exercise.

How do you know if a model is any good?

Different workloads use different scorecards (evaluation metrics), and AI-900 expects you to recognise a few:

You do not need to compute these by hand for the exam, but you should know that “a good model” means good on data it has never seen, measured by the right metric for the task.

The machine-learning lifecycle

Building a model is a repeatable, iterative process — not a one-shot event. Knowing the stages helps you place every Azure tool in context:

  1. Define the problem. Turn a business question (“which customers will churn?”) into an ML task (binary classification) with a measurable goal.
  2. Collect and prepare data. Gather data, clean it (fix missing values, remove duplicates), and engineer features. This is usually the largest, least glamorous part of the work, and its quality caps everything downstream — garbage in, garbage out.
  3. Train the model. Choose an algorithm, feed it the training set, and let it learn. Often you try many algorithms and settings.
  4. Evaluate. Measure performance on the validation/test data using the right metric; iterate back to data or training if it is not good enough.
  5. Deploy. Package the model behind an endpoint (a web address other software can call) so applications can get predictions in real time or in batches.
  6. Monitor and retrain. Watch the live model. The world changes and data drifts, so accuracy decays over time; you periodically retrain on fresh data. This loop is often called MLOps (machine-learning operations).

The crucial mindset: ML is circular, not linear. You will revisit data and training many times, and a deployed model is the beginning of an operational responsibility, not the end of a project.

Azure Machine Learning at a fundamentals level

Azure Machine Learning (Azure ML) is Azure’s end-to-end, managed platform for the whole lifecycle above — preparing data, training, evaluating, deploying and monitoring models. For AI-900 you do not need to operate it deeply; you need to recognise its main building blocks and what each is for.

Compute type What it is Used for
Compute instance A personal, managed cloud workstation (a VM) for one developer. Writing and running notebooks during development.
Compute cluster A pool of VMs that scales up for training and back down to zero when idle. Training jobs and Automated ML runs.
Inference cluster / endpoint Managed compute that hosts a deployed model. Serving live (real-time) or batch predictions.
Attached compute External resources you connect (e.g. a Databricks or Kubernetes cluster). Reusing compute you already run.

The way to hold this in your head: Designer is the visual/low-code path, Automated ML is the “let Azure find the best model” path, and notebooks/SDK is the full-control code path — all inside one workspace, running on compute you provision and (importantly) shut down when idle.

Microsoft’s six Responsible AI principles

AI makes consequential decisions about people — who gets a loan, which CV is shortlisted, what a self-driving car does. That power demands responsibility, and the AI-900 exam takes this seriously: expect several questions. Microsoft frames its approach as six guiding principles. Learn all six, in this order, each with a concrete example of what violating it looks like and how you uphold it.

Principle Core idea What it looks like in practice
Fairness AI systems should treat all people fairly and not discriminate. A hiring model trained on biased history must not disadvantage candidates by gender or ethnicity. Uphold it by testing performance across groups and mitigating bias in data and features.
Reliability & safety Systems must operate reliably, safely and consistently, including under unexpected conditions. A self-driving car or a medical-triage model must be rigorously tested and fail safe. Uphold it with thorough testing, monitoring, and human oversight for high-stakes decisions.
Privacy & security AI must respect privacy and keep data secure throughout its lifecycle. Personal data used to train a model must be protected, consented to, and not leak through the model’s outputs. Uphold it with encryption, access control, data minimisation and de-identification.
Inclusiveness AI should empower everyone and engage people of all abilities and backgrounds. Speech recognition should work across accents and dialects; interfaces should support assistive technology. Uphold it by designing for accessibility and diverse users from the start.
Transparency People should understand how an AI system works and its limitations. Users should know they are interacting with AI and roughly why it decided as it did (interpretability). Uphold it with clear documentation, model explainability and honest communication of limits.
Accountability People — not the AI — are responsible for the systems they build and run, under appropriate governance. A human and an organisation must answer for an AI decision; “the algorithm did it” is not an excuse. Uphold it with governance, oversight, audit trails and clear ownership.

Two of these are best thought of as the foundation that supports the other four: transparency and accountability. You can only achieve fairness, reliability, privacy and inclusiveness if a system is transparent enough to inspect and if real people are accountable for it. A simple way to remember all six is the mnemonic “FRPITA”Fairness, Reliability & safety, Privacy & security, Inclusiveness, Transparency, Accountability — or just rehearse them as three pairs: fair & safe, private & inclusive, transparent & accountable.

The Azure AI & ML landscape

The diagram above ties the lesson together: the AI → ML → deep-learning nesting, the common workloads fanning out to their Azure services, the train-evaluate-deploy lifecycle running through Azure Machine Learning, and the six Responsible AI principles framing the whole picture.

Hands-on lab

This lab is no-code and free — you will explore Azure Machine Learning’s Automated ML to see the lifecycle without writing a line of code. Creating a workspace is free; you incur cost only while compute runs, and you will delete everything at the end. To keep this lab at essentially zero cost you may simply walk through the create blades and Studio (most learning value is in seeing the options) — only start compute if you want to run an actual training job.

Steps

  1. Sign in to the Azure portal (portal.azure.com) with your free account.
  2. In the search bar type “Azure Machine Learning” and select Create. On the Basics blade choose your subscription and resource group (create one called rg-aml-lab), give the workspace a name (e.g. mlw-ai900-lab), pick a nearby region, and accept the defaults for the supporting resources (a storage account, Key Vault and Application Insights are created automatically). Select Review + create, then Create, and wait for deployment to finish.
  3. Open the resource and click Launch studio to open Azure Machine Learning Studio.
  4. In the left menu open Automated ML → New Automated ML job. Browse the steps: select or create a dataset, choose the task type (e.g. Classification), pick the target column (the label), and review the compute step where you would create a small compute cluster. Stop here if you want to avoid all cost — you have now seen the full set-up.
  5. (Optional, incurs cost): create a tiny compute cluster (one small node, minimum 0 nodes so it scales to zero), submit the job on a sample dataset, and after it completes inspect the leaderboard of models and the metrics for the best one.
  6. Explore Designer from the left menu to see the drag-and-drop pipeline canvas and its pre-built modules.

Validation

You have succeeded when you can: see your workspace in the portal, open Studio, reach the Automated ML job-creation screen and identify where you choose the task type, target column and compute, and open the Designer canvas. (If you ran the optional job, success also means seeing a ranked leaderboard of trained models.)

Cleanup

To avoid any ongoing charges, delete the lab when done. The simplest, most reliable way is to delete the whole resource group:

az group delete --name rg-aml-lab --yes --no-wait

If you created a compute cluster, deleting the resource group removes it; if you want to be doubly sure first, set its minimum and maximum nodes to 0 in Studio so it cannot run.

Cost note

Creating the workspace and clicking through the blades costs nothing. The only charge is compute time if you run the optional job. A single small/burstable node (e.g. a B-series VM) for a short demo run is a few rupees — well under ₹50 for a brief session — provided the cluster scales to zero when idle and you delete the resource group afterwards. The cardinal rule with Azure ML cost: idle compute that is not scaled to zero is the thing that quietly runs up a bill.

Common mistakes & troubleshooting

Symptom / mistake Cause Fix
“AI” and “machine learning” used as synonyms Conflating the umbrella with a subset ML is a subset of AI; deep learning is a subset of ML. Use the nested model.
Confusing regression with classification Both are supervised, so they blur Numeric output = regression; category output = classification. “How much?” vs “which one?”.
Model scores brilliantly in training, poorly in production Overfitting — evaluated on data it trained on Always hold back a test set; judge the model only on unseen data.
Compute cluster keeps billing after the demo Minimum nodes not set to 0, or resources left running Scale clusters to 0 minimum nodes and delete the resource group when finished.
Treating Responsible AI as optional paperwork Seeing it as compliance, not design The six principles are design requirements; bias and safety must be tested, not assumed.
Expecting AutoML to fix bad data Believing the tool replaces data quality AutoML picks algorithms, not facts. Garbage in, garbage out — data prep still dominates.
Mixing up the workloads in exam questions Not recognising the problem shape Map the scenario: pixels → computer vision; words → NLP; forms → document intelligence; create content → generative AI.

Best practices

Security notes

Interview & exam questions

  1. What is the difference between AI, machine learning and deep learning? AI is the broad field of “intelligent” software; ML is a subset where systems learn patterns from data instead of being explicitly programmed; deep learning is a subset of ML using multi-layer neural networks, especially for images, audio and language.
  2. How does machine learning differ from traditional programming? Traditional programming: a human writes explicit rules. ML: you provide labelled examples and an algorithm learns the rules itself, producing a model.
  3. Define features and labels. Features are the input variables the model learns from; the label is the output value the model is trained to predict.
  4. Contrast supervised and unsupervised learning. Supervised learning trains on labelled data (known answers); unsupervised learning finds structure in unlabelled data (e.g. clustering).
  5. Regression vs classification — give an example of each. Regression predicts a continuous number (e.g. house price); classification predicts a category (e.g. spam vs not-spam).
  6. Why split data into training, validation and test sets? To detect and avoid overfitting and to get an honest estimate of performance on unseen data; the test set must stay untouched until the end.
  7. Name the stages of the machine-learning lifecycle. Define the problem → collect/prepare data → train → evaluate → deploy → monitor & retrain (iterating throughout).
  8. What is Automated ML and who is it for? A capability that automatically trains and compares many algorithms/settings and ranks the best model; ideal for non-experts and for quickly establishing a strong baseline.
  9. What is the Azure Machine Learning Designer? A drag-and-drop visual canvas for building ML pipelines without code, by connecting modules for import, clean, train, evaluate and deploy.
  10. Name Microsoft’s six Responsible AI principles. Fairness; reliability & safety; privacy & security; inclusiveness; transparency; accountability.
  11. Which two Responsible AI principles are considered foundational, and why? Transparency and accountability — you can only ensure the other four if systems are understandable and real people are answerable for them.
  12. Give a concrete example of the fairness principle. A recruitment model must not disadvantage candidates based on gender or ethnicity; you uphold fairness by evaluating performance across groups and mitigating bias in data.

Quick check

  1. Deep learning is a subset of which broader field — AI, or machine learning, or both?
  2. You want to predict a customer’s lifetime spend in rupees. Is that regression or classification?
  3. Grouping customers into segments without predefined labels is an example of which learning type and which technique?
  4. Which Azure ML capability automatically tries many algorithms and ranks the best model?
  5. Which Responsible AI principle is violated if a speech model fails to recognise certain regional accents?

Answers

  1. Both — deep learning is a subset of machine learning, which is itself a subset of AI.
  2. Regression — the output is a continuous numeric value.
  3. Unsupervised learning, using clustering.
  4. Automated ML (AutoML).
  5. Inclusiveness — the system fails to work for people of all backgrounds (and arguably touches fairness too).

Exercise

Pick a real decision in your own world — for example, “which support tickets should be escalated to a senior engineer?” Write a short half-page that:

  1. States the business question and reframes it as an ML task (regression, classification, or clustering — and say which and why).
  2. Lists three plausible features and the label you would predict.
  3. Notes which AI workload it belongs to and which Azure service you would investigate.
  4. Identifies one Responsible AI risk in your scenario (e.g. could the model be unfair, or leak private data?) and one concrete step to mitigate it.

This exercise forces you to practise the most valuable fundamentals skill: turning a fuzzy problem into a well-shaped ML task while thinking about responsibility from the start.

Certification mapping

This lesson maps to the AI-900: Microsoft Azure AI Fundamentals certification, primarily the domains “Describe Artificial Intelligence workloads and considerations” (the workloads and the Responsible AI principles) and “Describe fundamental principles of machine learning on Azure” (model types, the lifecycle, features/labels, and Azure Machine Learning’s workspace, Automated ML, Designer and compute). The Responsible AI content here is examined across the AI-900 syllabus and is also valuable background for the AI-102: Azure AI Engineer Associate certification.

Glossary

Next steps

You now have the mental model for AI and machine learning, know your way around Azure Machine Learning at a fundamentals level, and can explain all six Responsible AI principles. Next, move from concepts to the ready-made building blocks Microsoft provides — pre-trained services you can call without training a model yourself — in AI-900: Azure AI Services — Vision, Language, Speech, Document Intelligence & Search. After that, the course turns to generative AI and Azure OpenAI, where these foundations and the Responsible AI principles become essential.

AzureAI FundamentalsMachine LearningResponsible AIAzure Machine LearningAI-900
Need this built for real?

Vinod is a Senior Cloud Architect (22+ yrs) — available for Azure / AWS / GCP architecture, landing zones, and migrations.

Work with me

Comments

Keep Reading