Fatskills
Practice. Master. Repeat.
Study Guide: Cloud ML - Google Cloud Professional Machine Learning Engineer: Vertex AI Agent Builder and Search & Conversation Overview
Source: https://www.fatskills.com/hesi/chapter/cloud-ml-cert-gcp-ml-vertex-ai-agent-builder-and-search-conversation-overview

Cloud ML - Google Cloud Professional Machine Learning Engineer: Vertex AI Agent Builder and Search & Conversation Overview

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~7 min read

GCP_ML – Vertex AI Agent Builder and Search & Conversation Overview

Vertex AI Agent Builder & Search & Conversation: Exam-Ready Study Guide

What This Is

Vertex AI Agent Builder (formerly Vertex AI Search and Conversation) is Google Cloud’s no-code/low-code platform for building AI-powered search engines, chatbots, and virtual agents using retrieval-augmented generation (RAG). It’s critical for enterprise-grade conversational AI where you need secure, scalable, and customizable solutions without deep ML expertise.

Real-world scenario: A retail company wants to deploy a customer support chatbot that answers questions about products, orders, and returns by searching internal knowledge bases (PDFs, FAQs, product catalogs) and external websites—without hallucinating. Vertex AI Agent Builder lets them index structured and unstructured data, apply custom ranking, and deploy a chat interface in days, not months.


Key Terms & Services

  • Vertex AI Agent Builder (formerly Search & Conversation): GCP’s managed service for building search engines, chatbots, and virtual agents using RAG (Retrieval-Augmented Generation). Combines Google’s search and LLM capabilities with custom data sources.

  • Vertex AI Search: A fully managed search service that indexes structured (BigQuery, Cloud SQL) and unstructured (PDFs, HTML, Docs) data and returns semantic or keyword-based results. Supports custom ranking, filters, and faceted search.

  • Vertex AI Conversation: A no-code chatbot builder that uses LLMs (PaLM 2, Gemini) + RAG to generate responses from indexed data. Supports multi-turn conversations, intent detection, and fallback policies.

  • Data Store: A container for indexed data in Vertex AI Search. Can be structured (BigQuery, Cloud SQL) or unstructured (Google Drive, websites, PDFs). Supports real-time updates (for structured) or scheduled crawls (for unstructured).

  • Serving Config: Defines how search results are returned (e.g., semantic vs. keyword search, ranking rules, filters). Can be customized per use case (e.g., prioritize recent documents for news search).

  • Grounding (in RAG): The process of linking LLM responses to specific data sources to reduce hallucinations. Vertex AI Agent Builder automatically grounds responses in indexed data.

  • Enterprise Search: A pre-built search experience for internal knowledge bases (e.g., HR docs, IT support). Can be embedded in apps or used via API.

  • Chat App: A pre-built conversational interface that uses Vertex AI Conversation to answer questions. Can be customized with branding, intents, and fallback policies.

  • Intent Detection: The process of identifying user goals (e.g., "I want to return a product") to route queries to the right data store or LLM prompt.

  • Fallback Policy: Rules for what happens when the chatbot can’t answer a question (e.g., escalate to a human, provide a default response, or search the web).

  • Vector Search (via Vertex AI Matching Engine): Used for semantic search (finding similar items based on meaning, not keywords). Vertex AI Agent Builder automatically generates embeddings for unstructured data.

  • Citation & Attribution: Vertex AI Agent Builder links responses to source documents, improving transparency and trust (critical for enterprise use cases).


Step-by-Step / Process Flow

1. Set Up a Data Store

  • Action: Create a data store in Vertex AI Agent Builder.
  • Choose structured (BigQuery, Cloud SQL) or unstructured (Google Drive, websites, PDFs).
  • For unstructured data, configure a crawler schedule (e.g., daily updates).
  • Example:
  • For a customer support chatbot, create a data store with:
    • Structured data: Product catalog (BigQuery).
    • Unstructured data: FAQ PDFs (Google Drive) + company website.

2. Configure Search & Ranking

  • Action: Define a serving config to control how results are returned.
  • Choose search type:
    • Keyword search (exact matches, fast).
    • Semantic search (meaning-based, uses embeddings).
  • Set ranking rules (e.g., prioritize recent documents, boost certain sources).
  • Add filters (e.g., "only show products in stock").
  • Example:
  • For an e-commerce search, use semantic search + filters for price range and availability.

3. Build a Chat App (Optional)

  • Action: Create a chat app in Vertex AI Conversation.
  • Define intents (e.g., "return_product," "check_order_status").
  • Set fallback policies (e.g., "If confidence < 80%, escalate to human").
  • Customize LLM prompts (e.g., "Answer in a friendly tone, max 3 sentences").
  • Example:
  • For a banking chatbot, define intents like:
    • check_balance-Query BigQuery.
    • report_fraud-Escalate to a human agent.

4. Deploy & Test

  • Action: Deploy the search engine or chat app and test it.
  • Use the Vertex AI Console to preview responses.
  • Integrate via API (REST) or embeddable UI (for web/mobile apps).
  • Example:
  • Embed the chat app in a customer portal using the JavaScript SDK.

5. Monitor & Improve

  • Action: Use Vertex AI Monitoring to track:
  • Search quality (click-through rates, zero-results queries).
  • Chatbot performance (user satisfaction, fallback rates).
  • Grounding accuracy (are responses linked to correct sources?).
  • Example:
  • If users frequently fall back to human agents, add more training data or adjust intent thresholds.

Common Mistakes

Mistake 1: Using Vertex AI Search for Real-Time Transactional Data

  • Problem: Candidates assume Vertex AI Search can replace a database (e.g., for order status lookups).
  • Correction:
  • Vertex AI Search is for retrieval, not transactions.
  • For real-time data (e.g., order status), use BigQuery or Cloud SQL and join with search results if needed.

Mistake 2: Ignoring Data Freshness for Unstructured Data

  • Problem: Assuming PDFs or websites are automatically updated in real time.
  • Correction:
  • Unstructured data stores (e.g., Google Drive, websites) require scheduled crawls (e.g., daily).
  • For real-time updates, use structured data (BigQuery, Cloud SQL).

Mistake 3: Over-Relying on Default LLM Prompts

  • Problem: Using generic prompts (e.g., "Answer the question") without custom instructions.
  • Correction:
  • Customize prompts for tone, length, and grounding (e.g., "Answer in 2 sentences, cite sources").
  • Use few-shot examples to improve response quality.

Mistake 4: Not Setting Up Fallback Policies

  • Problem: Assuming the chatbot will always answer correctly.
  • Correction:
  • Define fallback rules (e.g., "If confidence < 70%, say 'I don’t know' and escalate").
  • Use human-in-the-loop for critical use cases (e.g., medical, legal).

Mistake 5: Confusing Vertex AI Search with Vertex AI Matching Engine

  • Problem: Thinking Vertex AI Search is the same as Vertex AI Matching Engine (vector database).
  • Correction:
  • Vertex AI Search = Managed RAG + search (for structured/unstructured data).
  • Vertex AI Matching Engine = Low-latency vector search (for custom embeddings, e.g., product recommendations).

Certification Exam Insights

1. "Which Service?" Traps

  • Exam loves testing:
  • Vertex AI Search vs. BigQuery-Use BigQuery for analytics, Vertex AI Search for semantic search.
  • Vertex AI Conversation vs. Dialogflow-Dialogflow is for rule-based chatbots, Vertex AI Conversation is for LLM-powered RAG.
  • Vertex AI Search vs. Vertex AI Matching Engine-Search = managed RAG, Matching Engine = custom vector search.

2. Key Constraints to Remember

  • Data Store Limits:
  • Unstructured data: Max 100M documents per data store.
  • Structured data: Max 100K rows per table (for BigQuery/Cloud SQL).
  • Latency:
  • Semantic search is slower than keyword search (due to embedding generation).
  • Cost Model:
  • Pay per query (not per document indexed).
  • Enterprise Search has a higher cost than custom search.

3. Tricky Scenarios

  • "We need a chatbot that answers questions from internal docs + public websites."
  • Answer: Vertex AI Agent Builder (combines internal data stores + web search).
  • "We need a search engine for product recommendations with low latency."
  • Answer: Vertex AI Matching Engine (for custom embeddings) + Vertex AI Search (for hybrid search).
  • "We need a chatbot that escalates to humans when unsure."
  • Answer: Vertex AI Conversation with fallback policies.

Quick Check Questions

1. A retail company wants to build a chatbot that answers customer questions about products, orders, and returns. The chatbot must cite sources and avoid hallucinations. Which GCP service should they use?

Answer: Vertex AI Agent Builder (Search & Conversation) ? Why? It supports RAG with grounding, citation of sources, and multi-turn conversations without requiring deep ML expertise.

2. A healthcare provider needs a search engine for internal medical guidelines (PDFs) and patient records (BigQuery). They want semantic search with real-time updates. Which GCP services should they use?

Answer: Vertex AI Search (for PDFs + BigQuery) + Vertex AI Matching Engine (for semantic search) ? Why? Vertex AI Search handles structured (BigQuery) and unstructured (PDFs) data, while Matching Engine enables low-latency semantic search.

3. A company wants to deploy a chatbot that answers HR policy questions. They need to ensure responses are always accurate and linked to official documents. What’s the best way to achieve this?

Answer: Vertex AI Conversation with a data store of HR documents + grounding enabled ? Why? Grounding ensures responses are tied to source documents, reducing hallucinations.


Last-Minute Cram Sheet

  1. Vertex AI Agent Builder = Managed RAG for search & chatbots (no-code/low-code).
  2. Data Store = Container for indexed data (structured or unstructured).
  3. Serving Config = Controls search ranking, filters, and response format.
  4. Grounding = Links LLM responses to source documents (reduces hallucinations).
  5. Fallback Policy = Rules for when the chatbot can’t answer (e.g., escalate to human).
  6. Semantic Search = Meaning-based (slower, more accurate) vs. Keyword Search = Exact matches (faster).
  7. Vertex AI Search-Vertex AI Matching Engine-Search = managed RAG, Matching Engine = custom vector DB.
  8. Unstructured data (PDFs, websites) requires scheduled crawls (not real-time).
  9. Vertex AI Search is not a database-Use BigQuery/Cloud SQL for transactions.
  10. Default LLM prompts are too generic-Customize for tone, length, and grounding.