Fatskills
Practice. Master. Repeat.
Study Guide: AI MCP and Tooling: Action safety permissions and scoped tools
Source: https://www.fatskills.com/ai-for-work/chapter/ai-mcp-and-tooling-action-safety-permissions-and-scoped-tools

AI MCP and Tooling: Action safety permissions and scoped tools

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~5 min read

Action Safety, Permissions, and Scoped Tools

What This Is

Action safety ensures AI tools and agents perform only approved, bounded tasks—preventing unauthorized data access, unintended operations, or misuse. In everyday work, this means AI can draft emails or pull sales reports but can’t delete databases or send messages without review. Example: A marketing team uses an AI assistant to generate social posts but requires human approval before publishing to avoid brand misalignment or compliance risks.


Key Facts & Principles

  • Least Privilege Principle: Grant AI tools the minimum permissions needed to complete a task. Example: A customer support AI can read ticket data but can’t modify user accounts or access payment info.
  • Scoped Actions: Define explicit, narrow boundaries for what an AI can do (e.g., "summarize documents" vs. "execute code"). Example: A legal AI is scoped to search case law but not file motions.
  • Permission Granularity: Break down access into fine-grained controls (e.g., read-only vs. write, specific datasets). Example: An HR AI can view employee names but not salaries unless explicitly granted.
  • Approval Gates: Require human review for high-risk actions (e.g., sending emails, modifying records). Example: A sales AI drafts contracts but flags them for legal review before sending.
  • Audit Trails: Log all AI actions (who triggered it, what it did, when) for accountability. Example: A finance AI’s expense report edits are timestamped and tied to the user who initiated the request.
  • Sandboxing: Isolate AI tools from sensitive systems (e.g., run in a virtual environment with no network access). Example: A code-review AI runs in a container with no write permissions to production.
  • Fallback Mechanisms: Define what happens if an AI exceeds its scope (e.g., block the action, alert a human). Example: If an AI tries to delete a file, it’s denied, and IT is notified.
  • Dynamic Scoping: Adjust permissions based on context (e.g., time of day, user role). Example: A support AI can escalate tickets to managers only during business hours.

Step-by-Step Application

  1. Map Workflows to Actions
  2. List all tasks the AI will perform (e.g., "generate reports," "send Slack messages").
  3. Categorize by risk: Low (read-only), Medium (write to non-critical systems), High (financial/data changes).

  4. Define Scopes and Permissions

  5. For each action, specify:

    • Data access (e.g., "read-only: customer emails from the last 30 days").
    • Execution limits (e.g., "can send 5 Slack messages/day").
    • Human gates (e.g., "require approval for external emails").
  6. Implement Technical Controls

  7. Use APIs with role-based access (e.g., OAuth scopes, AWS IAM policies).
  8. Deploy middleware (e.g., a "permission checker" service that validates requests before execution).
  9. Example: A Python decorator like @requires_permission("send_email") to wrap AI functions.

  10. Set Up Monitoring and Alerts

  11. Log all actions (e.g., "AI attempted to delete file X at 2:15 PM").
  12. Configure alerts for:

    • Repeated denied actions (possible misconfiguration).
    • High-risk actions (e.g., "AI modified a database record").
  13. Test and Iterate

  14. Run penetration tests (e.g., try to make the AI exceed its scope).
  15. Review logs weekly to spot over-permissioned actions.
  16. Example: If the AI keeps failing to access a dataset, either adjust permissions or clarify the workflow.

  17. Document and Train

  18. Write a 1-pager for teams: "What our AI can/can’t do."
  19. Train users on how to escalate if the AI behaves unexpectedly.

Common Mistakes

  • Mistake: Granting broad permissions "just in case." Correction: Start with no permissions and add only what’s necessary. Why: Over-permissioning is the #1 cause of AI-related breaches (e.g., an AI accidentally emailing sensitive data).

  • Mistake: Assuming "read-only" is always safe. Correction: Even read access can leak data (e.g., an AI summarizing confidential documents). Why: Combine read permissions with output controls (e.g., "don’t include raw data in summaries").

  • Mistake: Skipping audit logs for "low-risk" actions. Correction: Log everything. Why: You’ll need trails to debug issues (e.g., "Why did the AI send 100 emails at 3 AM?").

  • Mistake: Hardcoding permissions in the AI’s code. Correction: Use external permission systems (e.g., AWS IAM, Okta). Why: Centralized systems let you revoke access instantly if compromised.

  • Mistake: Ignoring "shadow AI" (unapproved tools). Correction: Scan for unauthorized AI usage (e.g., employees using unvetted chatbots). Why: These often bypass your safety controls.


Practical Tips

  • Use "Dry Run" Mode: Before enabling a new action, test it in a mode where the AI reports what it would do (e.g., "I would send this email to 500 customers").
  • Pair AI with "Guardrails": Deploy a second AI to review the first’s actions (e.g., "Check if this email contains PII before sending").
  • Rotate Permissions: Treat AI access like human access—review and rotate keys/API tokens quarterly.
  • Leverage Existing Tools: Use your cloud provider’s IAM (e.g., AWS, GCP) or enterprise SSO (e.g., Okta) to manage AI permissions, rather than building from scratch.

Quick Practice Scenario

Scenario: Your sales team wants an AI to auto-generate and send follow-up emails to leads. The AI will pull data from your CRM (Salesforce) and send via Gmail. Question: What 3 permission controls would you implement before enabling this?

Answer:
1. Read-only CRM access (only to lead contact info, not full records).
2. Gmail send limit (e.g., max 20 emails/day per user).
3. Human approval gate (emails are drafted but require a "send" click from a sales rep).

Why: Balances automation with safety—prevents spam, data leaks, and unauthorized sends.


Last-Minute Cram Sheet

  1. Least privilege = Only what’s needed, nothing more. Avoid "admin by default."
  2. Scoped actions = Define exactly what the AI can do (e.g., "summarize"-"edit").
  3. Approval gates = High-risk actions need human review. Don’t automate what you can’t undo.
  4. Audit trails = Log who triggered the AI, what it did, and when.
  5. Sandboxing = Isolate AI from sensitive systems (e.g., no production DB access).
  6. Dynamic scoping = Adjust permissions by context (e.g., time, user role).
  7. Fallbacks = Block and alert if the AI exceeds scope. Don’t let it "fail open."
  8. Dry runs = Test actions in "report-only" mode first.
  9. Shadow AI = Scan for unapproved tools. Employees will bypass controls.
  10. Rotate keys = Treat AI access like human access—revoke and rotate regularly.