Now, AI that creates. β¨
Module 1 gave you the foundation β now the exciting part. How AI creates β foundation models, prompts, tuning, and the agents that act for you.
Module 2 β 7 Chapters πΊοΈ
This is the official Generative AI course, distilled. From the models underneath, all the way to building AI agents.
1 Β· What is Gen AI
Creates & acts Β· the stack
2 Β· Foundation Models
Gemini family & friends
3 Β· Idea to App
Vertex AI Studio Β· prompts
4 Β· Prompt Engineering
Temperature, Top-K, Top-P
5 Β· Grounding & Tuning
RAG & fine-tuning
6 Β· AI Agents
Chatbot β agent β agentic
7 Β· Build Agents on Google Cloud
Gemini Enterprise Β· Agent Builder Β· ADK
Two Things, Not One β¨
We met this in Module 1 β generative AI creates and acts. Let's open that up.
π¨ It creates content
- Text, code, images, speech, video, even 3D
- From a prompt β a question or instruction
- Summaries, reports, Q&A chatbots, images & video
π¦Ύ It takes action
- Autonomous, goal-oriented action on your behalf
- Automate workflows, book travel, schedule
- We reach agents at the end of this module
The Gen AI Stack β 3 Layers ποΈ
Just like the AI architecture in Module 1, but now for generative AI. Bottom to top:
What Is a Foundation Model? π
The backbone of every Gen AI app. Here's the idea in plain terms:
Trained on a LOT
Learns from massive existing text, images, video. That learning = training.
Huge parameters
From millions β trillions. More parameters = more capacity to learn.
General-purpose
Pre-trained for broad use, then specialised later.
Google's Models β Tap to Explore π
π Gemini Pro
β‘ Gemini Flash
πͺΆ Gemini Flash-Lite
πΌοΈ Imagen / π¬ Veo
π Embeddings
π‘ The big idea
Multimodal Β· Pre-trained vs Fine-tuned π§
π Pre-trained
- Trained on a huge general dataset
- Like 12 years of school β literate, general
- Horizontal AI β works across industries
π©Ί Fine-tuned
- Further trained on a small, field-specific dataset
- Like medical school β a specialist
- Vertical AI β retail, finance, healthcare
Match the Model to the Job π§©
Meet Bea, Ann & Ian π©βπΌπ¨βπ»
Three people at Cymbal Insurance β our guides for the rest of the module.
Bea Β· Analyst
No tech background. Wants to prototype an idea fast.
Ann Β· AI Developer
Wants to design & manage prompts.
Ian Β· ML Engineer
Wants to deploy & fine-tune at scale.
Anatomy of a Prompt β Tap Each Part π§ͺ
Design vs Engineering Β· Idea β App π οΈ
βοΈ Prompt Design
- Crafting a prompt to get the response you want
- Be direct & specific Β· use structure Β· iterate
π Prompt Engineering
- Designing, refining & optimising prompts over time
- Explore few-shot, chain-of-thought, RAG
The Knobs You Can Turn ποΈ
After the prompt, you tune how the model picks its words. Four settings:
Model
Pick the right one β Gemini Flash/Pro, or specialists. Vertex AI Studio even hosts Claude, Llama, GPT.
Temperature
Controls randomness. Low = safe & typical. High = creative & unusual.
Top-K
Pick randomly from the K most-likely words. K=2 β choose from the top 2.
Top-P
Pick from the smallest set of words whose probabilities add up to P.
Feel the Temperature π‘οΈ
Keeping Answers Accurate π
Foundation models are pre-trained β their knowledge can be outdated. Two fixes:
π Grounding
- Connect the model to trusted, current data
- Answers get verified against the latest info
- Ground with Google Search or your own data
π RAG
- Retrieval-Augmented Generation
- The method that implements grounding
- Retrieves relevant data, then generates
The Tuning Spectrum ποΈ
Want to improve the model itself? Options run from light to heavy:
LIGHTEST
Prompt Design
Guide with words. Doesn't change the model.
MIDDLE
Parameter-Efficient
(Adapter) Tuning
Updates a small subset of parameters.
HEAVIEST
Full Fine-Tuning
Updates ALL parameters. Best quality, most compute.
Supervised Fine-Tuning π·οΈ
The tuning Vertex AI supports today. You teach the model a new skill with labelled examples.
Labelled pairs
Hundreds of input β desired output examples, in a JSONL file.
Good for
Classification, summarising, extraction, chat β well-defined tasks.
Result
A new tuned model in the Model Registry, ready to deploy.
From Chatbot to Agentic AI π€
Gen AI is evolving. A chatbot answers. An agent acts. Agentic AI coordinates many agents.
ASK
Chatbot
You prompt, it answers. Conversational.
ACT
AI Agent
Connects to tools & data, takes action, observes feedback.
COORDINATE
Agentic AI
Multiple agents reasoning together on multi-step tasks.
What's Inside an Agent? π§©
Three components working together β picture a body:
Model β the brain
The reasoning centre. Thinks, plans, decides the steps to reach the goal.
Tools β hands & senses
APIs (GET, POSTβ¦) to act on the world β send an email, fetch the weather.
Orchestration β nervous system
The cyclical loop: take the decision β use a tool β feed the result back.
Match the Agent Component π§©
The Agent Tool Stack ποΈ
Same three layers as before β now for building agents. Bottom to top:
Which Tool? Ease vs Flexibility π§
Pick by how much code you want to write versus how much control you need.
| Tool | Code | Best for |
|---|---|---|
| Gemini Enterprise | No-code | Business users β ready-to-use, minimal setup |
| Agent Garden + Builder | Low-code | Analysts β start from a sample & customise |
| ADK (Agent Dev Kit) | Pro-code | Engineers β full control, deep integrations |
Lock It In π§ͺ
Q1What makes a model a "foundation model"?
Q2Which architecture (Google, 2017) underpins modern Gen AI?
Q3The iterative loop of refining & optimising prompts is:
Q4A reusable prompt with replaceable variables is a:
Q5Which tuning updates ALL the model's parameters?
Q6Which component is the agent's "brain"?
Q7A business user wants a no-code agent. Best choice?
Q8As you move from Gemini Enterprise β ADK, you gain:
Module 2 β You Can Now Explainβ¦ β
π§ͺ Step 1 Β· Lab β Gemini Multimodal with Vertex AI Studio
Build an app from a prompt, apply prompt best practices, and generate multimodal media. Open the lab β
π Step 2 Β· Official Module 2 Quiz β
After the lab, take the graded quiz on Skills Boost. skills.google βΊ course 593 βΊ quiz 617895
Next: AI Development Options. π€οΈ
You've seen how Gen AI is built. Next we compare the four ways to build any AI β pre-trained APIs, BigQuery ML, AutoML, and custom training β and how to choose.
M3 Β· AI Dev Options
Start now β
M4 Β· AI Dev Workflow
Coming soon
Then: the exam
You're getting there