# Working With Policies (cloudaware-mcp)

This guide explains how to interact with and manage the Cloudaware Compliance Engine v2 policy repository using the CloudAware MCP (Model Context Protocol) tools. Adhere strictly to these guidelines to keep the automation toolchain consistent and functional.
## 1. Task Execution Protocol

For every user request, you MUST follow this protocol to ensure transparency, reproducibility, and clear communication. This is your primary operational loop.

1. **Create a task directory:** Upon receiving a new task, immediately create a dedicated directory for it within the `tmp` directory. Name the directory descriptively (e.g., `tmp/task-add-s3-encryption-policy`).
2. **Initialize the `readme.md` log:** Inside the new task directory, create a `readme.md` file. This file serves as your plan, log, and report for the user.
3. **Formulate and document the plan:** Before taking any action, formulate a step-by-step plan to address the user's request. Write this plan at the top of the `readme.md`.
4. **Log every action:** As you execute each step, meticulously log your actions in the `readme.md` under a "Log" or "Execution Details" section. For each action, record:
   - a brief description of the action's purpose
   - the full command or MCP tool call you are executing
   - the complete output from the command or tool
   - links to any files created or modified
5. **Update the plan:** After completing a step, update the plan section of the `readme.md` to reflect your progress (e.g., by marking items as complete).
6. **Inform the user:** Keep the user informed by referencing the `readme.md` file. When you have completed the task, notify the user that a detailed log of all actions is available in the task directory. This log is crucial for the user to understand your work and potentially automate the task later.
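The directory setup in steps 1 and 2 can be sketched as a couple of shell commands; the task slug here is just an example:

```shell
# Hypothetical task slug; pick a descriptive name per task.
TASK_DIR="tmp/task-add-s3-encryption-policy"
mkdir -p "$TASK_DIR"

# Seed readme.md with the plan/log skeleton described above.
cat > "$TASK_DIR/readme.md" <<'EOF'
# Task: Add a New Policy for S3 Bucket Encryption

**Status:** In Progress

## Plan

- [ ] 1. Research similar existing policies

## Execution Log
EOF
```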
### Example `readme.md` Structure

```markdown
# Task: Add a New Policy for S3 Bucket Encryption

**Status:** In Progress

## Plan

- [ ] 0. Prepare repository (cleanup and import types)
- [ ] 1. Research similar existing policies
- [ ] 2. Read relevant guides documentation
- [ ] 3. Use MCP to discover the S3 bucket type and encryption-related fields
- [ ] 4. Analyze field values to understand correct values for conditions
- [ ] 5. Create the policy directory structure
- [ ] 6. Create the `policy.yaml` file
- [ ] 7. Create the `description.md` and `remediation.md` files
- [ ] 8. Implement the logic in `wip.logic.yaml` (WITHOUT testData section)
- [ ] 9. Run capture-test-data command to generate test-data.json
- [ ] 10. Add testData section to wip.logic.yaml
- [ ] 11. Modify captured test-data.json to cover all scenarios
- [ ] 12. Run tests and verify the logic
- [ ] 13. Rename `wip.logic.yaml` to `prod.logic.yaml`
- [ ] 14. Ask user if they want documentation generated (optional)

## Execution Log

### Step 3: Discover S3 Bucket Type and Fields

Using MCP to find the S3 bucket type...

**MCP Tool:** search_types
**Query:** ["s3 bucket"]
**Output:**

    {
      "types": [
        {
          "typeLabel": "AWS S3 Bucket",
          "typeApiName": "CA10__CaAwsS3Bucket__c",
          ...
        }
      ]
    }

Now searching for encryption-related fields...

**MCP Tool:** search_fields
**Parameters:**
- typeApiName: "CA10__CaAwsS3Bucket__c"
- query: ["encryption", "encrypted", "kms"]
**Output:**
...

---

*... (log continues for each step) ...*
```
## 2. Core Concepts

The repository stores compliance policies as declarative YAML files.

- **Policy:** A rule or idea to test against a cloud environment. It is a declarative description, not code, defined in a `policy.yaml` file.
- **Logic:** The implementation of a policy's idea. It contains the instructions for how to validate objects and is defined in a `{name}.logic.yaml` file. A policy can have multiple logic files (e.g., `prod.logic.yaml`, `wip.logic.yaml`).
- **Folder:** A logical container for organizing policies, defined by a `folder.yaml` file.
- **Section:** A component of a compliance framework (e.g., NIST, CIS), defined by a `section.yaml` file.
- **Type:** Represents a Cloudaware CMDB SObject type (e.g., `CA10__CaAwsInstance__c`). Its directory contains `type.json` (schema) and `*.extracts.yaml` files.
- **Extract:** A reusable, validated piece of information extracted from an input object. Extracts are defined in `types/{type}/{name}.extracts.yaml` and are used in logic files to abstract away direct field access. Always prefer `EXTRACT` over `FIELD` in production logic.
- **Entity ID:** The unique identifier for any entity (Policy, Folder, Section, etc.) is its absolute Linux-style path from the repository root. For example, the ID for a policy in `ce/ca/aws/ec2/my-policy/policy.yaml` is `/ce/ca/aws/ec2/my-policy`.
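To make the `FIELD` vs. `EXTRACT` distinction concrete, here is a hedged sketch of both styles inside a logic-file condition. The `FIELD` form follows the syntax shown later in this guide; the extract name `monitoring-state` and the `name` key used to reference it are assumptions for illustration only, so check a real `types/{type}/*.extracts.yaml` file for the authoritative syntax:

```yaml
# Prototype style: direct field access (acceptable while exploring).
check:
  operation: IS_EQUAL
  input:
    operation: FIELD
    path: "CA10__monitoringState__c"
  to:
    operation: VALUE
    value: "disabled"

# Production style (preferred): reference a reusable extract instead.
# The "name" key and the extract name below are hypothetical; verify
# against an existing extracts file before using.
# check:
#   operation: IS_EQUAL
#   input:
#     operation: EXTRACT
#     name: "monitoring-state"
#   to:
#     operation: VALUE
#     value: "disabled"
```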
## 3. Repository Structure

- `ce`: Root directory for all policies.
  - `ce/ca`: Cloudaware's public policies.
  - `ce/unit-test`: Unit tests for Compliance Engine operations. This is the most up-to-date reference for how operations work, including all edge cases. Consult this directory when implementing complex logic.
- `frameworks`: Compliance framework definitions. Each subdirectory is a framework.
- `types`: CMDB SObject type definitions and associated `extracts.yaml` files.
- `guides`: IMPORTANT: Contains comprehensive documentation. Read relevant guides BEFORE implementing policies.
- `schema`: JSON schemas for IDE validation.
- `lists`: Auto-generated lists of policies for maintenance (e.g., policies without tests).
## 4. Naming and File Conventions

Strict adherence is mandatory.

- **Directory names:** Use `lowercase-with-hyphens`.
- **Descriptor files:**
  - Folders require `folder.yaml`.
  - Policies require `policy.yaml`.
  - Framework sections require `section.yaml`.
- **Logic files:**
  - `prod.logic.yaml`: Production logic. This is executed by the engine.
  - `wip.logic.yaml`: Work-in-progress logic.
  - `unit-test.logic.yaml`: Unit tests.
  - `example.logic.yaml`: Examples.
- **Documentation files:**
  - `description.md`: Detailed technical specification of the policy (Rationale, Impact, Audit).
  - `remediation.md`: Step-by-step guidance to fix non-compliance.
  - `internal.md`: Developer notes.
  - `todo.md`: Implementation tasks.
- **Test data:**
  - `test-data.json`: JSON objects for testing a logic file, referenced in the `testData` property of the logic YAML.
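As a small illustration of the descriptor-file convention, a `folder.yaml` might look like the sketch below. The `names.contextual` and `names.full` properties are required per Step 5 of the policy-creation workflow; the values shown are invented:

```yaml
# Hypothetical folder.yaml for a folder such as ce/ca/aws/ec2/;
# only the property names are taken from this guide, the values
# are illustrative.
names:
  contextual: "EC2"
  full: "AWS EC2"
```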
## 5. Common Workflows

### Workflow: Creating a New Policy

IMPORTANT: Always start by preparing the repository, then use MCP tools to discover types and fields, and read the relevant guides.

#### Step 0: Prepare Repository

Before starting work on a new policy, ensure the repository is in a clean, working state:

```shell
# Clean up any problematic files that might break parsing
repo-manager cleanup

# Restore type descriptors that were removed by cleanup
repo-manager types import -p REFERENCED
```

This removes any files that could cause parsing errors and restores all type definitions referenced in the repository, leaving it in a clean working state.
#### Step 1: Research and Learn

1. **Find similar policies:** Search for existing policies similar to what you're creating.

   ```shell
   # Example: Find instance-related policies
   ls -la /ce/ca/aws/ec2/

   # Example: Find policies using specific operations
   grep -r "RELATED_LIST_COUNT" /ce/ca/ --include="*.logic.yaml"
   ```

2. **Examine a similar policy:** Read at least one similar policy completely:
   - the `policy.yaml` file for metadata structure
   - the `prod.logic.yaml` file for logic patterns
   - the `test-data.json` file for test data format
   - the `description.md` and `remediation.md` for documentation style
3. **Read relevant guides:** Based on your task, read the appropriate guides:
   - for logic operations: `guides/developer/operations/index.md`
   - for specific operations (e.g., RELATED_LIST_COUNT): `guides/developer/operations/related-list-count.md`
   - for the type system: `guides/developer/type-system/index.md`
   - for object relationships: `guides/developer/object-relationships/index.md`
4. **Check unit tests for edge cases:** If implementing complex logic, check `ce/unit-test/` for unit tests of the operations you're using. These tests show all edge cases and are the most up-to-date reference.
#### Step 2: Discover Types and Fields Using MCP

Use MCP tools to discover the schema:

1. **Find the type:** Use `search_types` to find the CMDB object type:

   ```
   Use search_types with query: ["ec2 instance"]
   # Returns: CA10__CaAwsInstance__c with tableId and disappearanceTimeField
   ```

2. **Find fields:** Use `search_fields` to discover relevant fields:

   ```
   Use search_fields with:
     typeApiName: "CA10__CaAwsInstance__c"
     query: ["monitoring", "performance", "detailed monitoring"]
   # Returns: CA10__monitoringState__c field with description
   ```

3. **Analyze field values:** Use `analyze_field` to understand what values the field contains:

   ```
   Use analyze_field with:
     typeApiName: "CA10__CaAwsInstance__c"
     fieldApiName: "CA10__monitoringState__c"
   # Returns: topValues showing "disabled" (94 instances) and "enabled" (1 instance)
   ```

   CRITICAL: This step ensures you use correct values in your conditions. Don't guess "Disabled" when the actual value is "disabled"!

4. **Find available extracts:** Check which extracts already exist:

   ```shell
   # Example: Check available extracts for EC2 instances
   cat /types/CA10__CaAwsInstance__c/object.extracts.yaml
   ```
#### Step 3: Understand Relationships Using MCP

If your policy needs to check related objects (e.g., volumes attached to instances), use MCP tools to discover the relationships between objects:

1. **Get the relationship graph:** Use `get_relationship_graph`:

   ```
   Use get_relationship_graph with:
     startTypeApiName: "CA10__CaAwsInstance__c"
   # Returns: All relationships to/from instances, including volumes
   ```

2. **Identify the relationship:** Look for the edge connecting your types. Example output:

   ```json
   {
     "fromType": "CA10__CaAwsVolume__c",
     "toType": "CA10__CaAwsInstance__c",
     "lookupFieldApiName": "CA10__attachmentInstance2__c",
     "lookupFieldLabel": "Attachment Instance"
   }
   ```

3. **Find the relationship name:** Search the instance `type.json` for the relationship name:

   ```shell
   # Find the relationship name that points to volumes
   grep -i "CA10__CaAwsVolume__c" /types/CA10__CaAwsInstance__c/type.json | grep relationshipName
   # Returns something like: "relationshipName": "CA10__AWS_EBS_Volumes__r"
   ```

4. **Read the Related List Operations guide:**

   ```shell
   cat /guides/developer/operations/related-list-count.md
   ```
#### Step 4: Determine Policy Categories

Valid categories are: PERFORMANCE, RELIABILITY, COST, SECURITY.

Do not use any other categories (e.g., "BEST_PRACTICES" is invalid).
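For orientation, a minimal `policy.yaml` might look like the sketch below. The property names `names`, `description`, `type`, and `categories` come from Step 5 of this workflow; all values and any other structure are assumptions, so copy the authoritative layout from a real neighboring policy:

```yaml
# Hypothetical policy.yaml sketch; values are invented for illustration.
names:
  contextual: "Instance Detailed Monitoring"
  full: "AWS EC2 Instance Detailed Monitoring"
description: "EC2 instances should have detailed monitoring enabled."
type: "CA10__CaAwsInstance__c"
categories:
  - PERFORMANCE   # must be one of: PERFORMANCE, RELIABILITY, COST, SECURITY
```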
#### Step 5: Create Policy Structure

1. **Identify the path:** Determine the correct path for the new policy under `ce`, for example `ce/ca/aws/ec2/new-instance-policy`.
2. **Create folders:** Ensure all parent directories exist and each contains a `folder.yaml` file with `names.contextual` and `names.full` properties.
3. **Create the policy directory:** Create the final directory for your policy.
4. **Create `policy.yaml`:** Create the main descriptor file. Populate `names`, `description`, `type`, and valid `categories`.
5. **Create documentation:** Create `description.md` and `remediation.md` with detailed content modeled on similar policies.
#### Step 6: Implement Logic

1. **Create `wip.logic.yaml` (WITHOUT a testData section):**
   - Define `inputType` (the CMDB object to check).
   - Use `importExtracts` to reference the object's extracts file.
   - Define `conditions` sequentially. Each condition has:
     - `check`: the logical test
     - `status`: one of `DISAPPEARED`, `INAPPLICABLE`, `COMPLIANT`, `INCOMPLIANT`, `UNDETERMINED`
     - `currentStateMessage`: description of the current state
     - `remediationMessage`: how to fix (for INCOMPLIANT status)
   - Define the `otherwise` block as a fallback.
   - IMPORTANT: Do NOT add a `testData` section yet; add it only after you have captured actual test data.
2. **Use analyzed field values:** When writing conditions, use the values discovered by `analyze_field`:

   ```yaml
   # Correct: based on analyze_field results
   check:
     operation: IS_EQUAL
     input:
       operation: FIELD
       path: "CA10__monitoringState__c"
     to:
       operation: VALUE
       value: "disabled"  # Exact value from analyze_field
   ```

3. **For related lists:**
   - Add a `relatedLists` section after the main conditions.
   - Specify the `relationshipName` (e.g., `CA10__AWS_EBS_Volumes__r`).
   - Import the related object's extracts.
   - Define conditions for the related objects.
   - Use `RELATED_LIST_COUNT`, `RELATED_LIST_HAS`, or `RELATED_LIST_HAS_NO` in your main conditions.
#### Step 7: Create Test Data

IMPORTANT: Always use the capture-test-data command to generate test data. Never write test-data.json manually.

Workflow:

1. **Capture test data** (no testData section needed in the logic file yet):

   ```shell
   repo-manager policies capture-test-data /path/to/wip.logic.yaml
   ```

   This command does everything in one step:
   - generates the SQL query and saves it to `.generated/capture_test_data.sql`
   - executes the query against BigQuery
   - saves the results directly to `test-data.json`
2. **Add a testData section to `wip.logic.yaml`:** Now that you have actual test data, add the testData reference to your `wip.logic.yaml` file:

   ```yaml
   testData:
     - file: "test-data.json"
   ```

   Place this after `inputType`, either before or after `importExtracts`.
3. **Examine the test data:** Open `test-data.json` to see what data was captured. The format will be obvious from the results.
4. **Modify the test data to cover all scenarios:**
   - Edit the captured data to ensure coverage of all conditions (INAPPLICABLE, INCOMPLIANT, COMPLIANT).
   - Add or modify test cases to cover edge cases (0 items, exactly at the limit, over the limit).
   - Update `expectedResult` for each test case with the correct status and conditionIndex.
   - You may need to capture more data, or create additional test cases by duplicating and modifying existing entries.
   - The captured data will already have the correct structure for all fields.
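For orientation only, a single entry in `test-data.json` might look roughly like the following once `expectedResult` has been added. Apart from the `expectedResult` keys named above (status and conditionIndex), the surrounding structure and field names are assumptions about the captured format; trust the file that capture-test-data actually produces:

```json
[
  {
    "object": {
      "Name": "i-0abc123",
      "CA10__monitoringState__c": "disabled"
    },
    "expectedResult": {
      "status": "INCOMPLIANT",
      "conditionIndex": 0
    }
  }
]
```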
#### Step 8: Test and Debug

1. **Run tests:**

   ```shell
   repo-manager policies test /path/to/wip.logic.yaml
   ```

2. **Fix common issues:**
   - Invalid category: check that categories are from the valid list (PERFORMANCE, RELIABILITY, COST, SECURITY).
   - Wrong conditionIndex: update expectedResult to match the actual result.
   - Wrong field values: use analyze_field to verify the correct values.
   - Test data issues: check that the captured data format matches what similar policies use.
3. **Review test results:** The output shows which tests passed or failed. The `.test-results.yaml` file contains detailed results.
4. **Iterate:** Fix issues and re-run the tests until all pass.
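Conceptually, the condition list behaves like a first-match-wins chain, which is what `conditionIndex` in the test results refers to and why condition order matters. The toy model below is NOT the real Compliance Engine, just a sketch of that evaluation order under that assumption:

```python
# Conceptual sketch (not the real engine): conditions are evaluated
# top-to-bottom and the first matching check determines the result,
# so a "FAIL conditionIndex" usually means either the expected index
# or the condition order is wrong.
def evaluate(conditions, otherwise, obj):
    """Return (conditionIndex, status) for the first condition whose
    check passes; fall through to the otherwise block if none match."""
    for index, cond in enumerate(conditions):
        if cond["check"](obj):
            return index, cond["status"]
    return len(conditions), otherwise["status"]

# Hypothetical conditions mirroring the monitoring example above.
conditions = [
    {"check": lambda o: o.get("CA10__monitoringState__c") is None,
     "status": "UNDETERMINED"},
    {"check": lambda o: o["CA10__monitoringState__c"] == "disabled",
     "status": "INCOMPLIANT"},
]
otherwise = {"status": "COMPLIANT"}

print(evaluate(conditions, otherwise, {"CA10__monitoringState__c": "disabled"}))
# → (1, 'INCOMPLIANT')
```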
#### Step 9: Finalize

1. **Rename the logic file:** Once all tests pass:

   ```shell
   mv /path/to/policy/wip.logic.yaml /path/to/policy/prod.logic.yaml
   mv /path/to/policy/wip.test-results.yaml /path/to/policy/prod.test-results.yaml
   ```

2. **Verify production tests:**

   ```shell
   repo-manager policies test /path/to/policy/prod.logic.yaml
   ```

3. **Ask the user about documentation generation:** Documentation generation is optional and can be time-consuming. Ask the user: "Would you like me to generate repository documentation? This updates cross-references but may take some time." If yes:

   ```shell
   repo-manager docs generate
   ```

   Note: This may fail if there are unrelated repository issues. Failure is not critical for the policy itself.
### Workflow: Testing a Policy

1. **Run tests:**
   - Execute `repo-manager policies test /path/to/your/prod.logic.yaml` for a single policy, or `repo-manager policies test all` for all policies.
   - This command validates the logic against `test-data.json` and creates a `.test-results.yaml` file.
2. **Review results:** Check the output for failures. Look for:
   - status mismatches (expected INCOMPLIANT but got COMPLIANT)
   - conditionIndex mismatches (the wrong condition was triggered)
   - conditionText mismatches (the actual logic differs from what was expected)
3. **Common test failures and fixes:**
   - "FAIL conditionIndex": your test expected a different condition to match. Update the expectedResult or fix the condition order.
   - "FAIL status": the logic returned a different status. Check your conditions and test data values.
   - "Can't deserialize" errors: an invalid category name or a wrong data type in the YAML.
   - Field value mismatches: use `analyze_field` to verify the correct field values.
   - Other errors: check similar policies' test-data.json format and consult the unit tests in `ce/unit-test/`.
### Workflow: Debugging a Policy

1. **Generate a debug query:**
   - Run `repo-manager policies generate DEBUG path/to/your/logic.yaml`.
   - This creates a `debug.sql` file in a `.generated` directory.
2. **Execute the query:** Run the generated SQL against BigQuery to inspect how the logic evaluates against real data. This helps you understand which objects match which conditions.
### Workflow: Managing Documentation

- **Before committing:** Always run `repo-manager docs generate` to create/update the `*.gen.md` files, which provide cross-references between entities.
- **Before pulling changes:** Run `repo-manager docs cleanup` to remove all generated files and avoid merge conflicts.
- **To fix parsing errors or repository issues**, prepare the repository:
  1. `repo-manager cleanup` (comprehensive cleanup)
  2. `repo-manager types import -p REFERENCED` (restore type definitions)
  3. `repo-manager policies test all` (verify everything works)
  4. `repo-manager docs generate` (regenerate documentation if needed)
## 6. repo-manager CLI Tool

`repo-manager` is the primary tool for managing this repository.

### Configuration and Behavior

- **Configuration:** The tool uses authentication profiles stored in a `profiles.json` file, typically located in a `.ca` directory in the repository root or your home directory. The agent must have a valid, active profile to communicate with the Cloudaware API.
- **Auto-update:** The tool auto-updates periodically. You can force an update with `--auto-update` or suppress it with `--no-auto-update`.

### Command Reference

The following are the most critical commands for an AI agent.

#### docs - Documentation Management

- `repo-manager docs generate`: Generates/updates all `*.gen.md` and other documentation files. These files contain crucial cross-links between entities. Run this after making changes and before committing.
  - `--watch`: keep documentation updated in real time during development sessions.
- `repo-manager docs cleanup`: Removes all generated documentation files. Run this before pulling changes from a remote repository to avoid merge conflicts.

#### policies - Policy Management

- `repo-manager policies capture-test-data <path>`: Captures test data for a policy in one step.
  - `<path>`: path to the policy or logic file (e.g., `ce/ca/aws/ec2/instance-detailed-monitoring/wip.logic.yaml`).
  - This command automatically generates the SQL query, executes it against BigQuery, and saves the results to test-data.json.
  - `-o` or `--output-file`: optional path to the output file (default: `POLICY_DIR/test-data.json`).
- `repo-manager policies generate <kind> <path>`: Generates a BigQuery script for a policy.
  - `<path>`: path to the policy or logic file (e.g., `ce/ca/aws/ec2/instance-detailed-monitoring/prod.logic.yaml`).
  - `<kind>`: the type of script to generate. Valid values are:
    - `DEBUG`: for debugging the policy against live data.
    - `CAPTURE_TEST_DATA`: generates a SQL query to capture real data from BigQuery for creating test-data.json. Note: prefer `repo-manager policies capture-test-data` instead, which executes the query automatically.
    - `TESTS`: generates a script for running tests from `test-data.json`.
- `repo-manager policies test <path>`: Runs tests for policies.
  - `<path>`: path to a specific policy/logic file, or `all` to test every policy in the repository, or `unit-tests` for all unit tests.

#### types - CMDB Type Management

- `repo-manager types import`: Imports or updates SObject definitions from Salesforce into the `types` directory and updates the master list in `types/types.json`. Use this if a policy requires a new object type or field that is not yet present.
  - `-t <api_name>` or `--type <api_name>`: import a single type. Can be used multiple times.
  - `-p <preset_name>` or `--preset <preset_name>`: import a group of types. Can be used multiple times.
    - `REFERENCED`: all types referenced in the repository (recommended for preparing the repository)
    - `CLOUDAWARE`: all Cloudaware types
- `repo-manager types delete`: Removes imported type data from the `types` directory.

#### cleanup - Repository Cleanup

- `repo-manager cleanup`: Performs a comprehensive cleanup of the repository, removing generated and potentially problematic files. More extensive than `docs cleanup`. Note: this deletes type JSON descriptors, which must be restored with `repo-manager types import -p REFERENCED`.

#### schema - Schema Management

- `repo-manager schema update`: Updates the JSON schema files in the `schema` directory so IDEs and validation tools have the latest information for autocompletion and correctness checks.

#### auth - Authentication

- The agent's environment should be pre-configured with an active authorization profile. The primary commands for this are `repo-manager auth add` and `repo-manager auth activate`. You will likely not need them, but be aware that API communication depends on a valid profile.
## 7. Answering Data-Related Questions

As an AI agent, one of your key roles is to help users retrieve data from their Cloudaware CMDB. When a user asks a question about their cloud resources, use MCP tools to discover the schema and then execute queries.

IMPORTANT: With MCP, you can execute BigQuery queries directly; no SOQL conversion is needed.

### Scenario 1: General Data Queries

For ad-hoc questions not directly related to an existing compliance policy.

1. **Analyze the user's question:** Determine the resources, properties, and conditions in the user's request.
2. **Discover types using MCP:** Use `search_types` to find the object type:

   ```
   Use search_types with query: ["vpc", "virtual private cloud"]
   # Returns: CA10__CaAwsVpc__c with tableId and disappearanceTimeField
   ```

3. **Discover fields using MCP:** Use `search_fields` to find relevant fields:

   ```
   Use search_fields with:
     typeApiName: "CA10__CaAwsVpc__c"
     query: ["default", "state", "status"]
   # Returns: CA10__isDefault__c field
   ```

4. **Analyze field values:** Use `analyze_field` to understand field values:

   ```
   Use analyze_field with:
     typeApiName: "CA10__CaAwsVpc__c"
     fieldApiName: "CA10__isDefault__c"
   # Returns: topValues showing true/false distribution
   ```

5. **Construct and save the BigQuery query:** Using the discovered information, write a BigQuery SQL query directly. Save it to a `.bqsql` file in your task directory.

   ```sql
   SELECT
     Name,
     CA10__vpcId__c,
     CA10__isDefault__c,
     CA10__cidrBlock__c
   FROM `cloudaware-cacanew.sobjects.CA10__CaAwsVpc__c`
   WHERE CA10__isDefault__c = true
     AND CA10__disappearanceTime__c IS NULL
   ```

   IMPORTANT (excluding deleted objects): Always use the `disappearanceTimeField` discovered by `search_types` to filter out deleted objects (e.g., `WHERE CA10__disappearanceTime__c IS NULL`).

6. **Execute using MCP:** Use the `execute_query` tool:

   ```
   Use execute_query with sql:
     SELECT Name, CA10__vpcId__c, CA10__isDefault__c
     FROM `cloudaware-cacanew.sobjects.CA10__CaAwsVpc__c`
     WHERE CA10__isDefault__c = true
       AND CA10__disappearanceTime__c IS NULL
   ```

7. **Format the results for the user:** The MCP tool returns results as JSON. You can:
   - present the results directly to the user
   - save them to a file in the task directory if the user requests it
   - aggregate or summarize the results as needed
### Alternative: Use bq for Large Results or Specific Formats

If the user needs results in a specific format (e.g., CSV) or the dataset is very large:

1. Save the query to a `.bqsql` file in your task directory.
2. Execute it with bq:

   ```shell
   # For CSV output
   bq query --format=csv --use_legacy_sql=false < tmp/task-name/query.bqsql > tmp/task-name/results.csv

   # For JSON output
   bq query --format=json --use_legacy_sql=false < tmp/task-name/query.bqsql > tmp/task-name/results.json
   ```
### Scenario 2: Policy-Related Data Queries

For questions about which objects are compliant or non-compliant with a specific policy.

1. **Identify the policy:** From the user's request, determine the relevant policy ID (e.g., `/ce/ca/aws/ec2/instance-detailed-monitoring`).
2. **Generate the debug script:** Use the `repo-manager policies generate DEBUG` command with the policy's logic file path:

   ```shell
   repo-manager policies generate DEBUG ce/ca/aws/ec2/instance-detailed-monitoring/prod.logic.yaml
   ```

3. **Execute the debug query:**
   - The command creates a `debug.sql` file in a `.generated` directory.
   - You can execute it with MCP `execute_query` (if the results are small) or with `bq query`:

     ```shell
     # For CSV output
     bq query --format=csv --use_legacy_sql=false < ce/ca/aws/ec2/instance-detailed-monitoring/.generated/debug.sql > tmp/task-name/debug-results.csv

     # For JSON output
     bq query --format=json --use_legacy_sql=false < ce/ca/aws/ec2/instance-detailed-monitoring/.generated/debug.sql > tmp/task-name/debug-results.json
     ```

4. **Present the results:** Inform the user of the results location or present them directly.
### MCP vs bq: When to Use Each

Use MCP `execute_query` for:

- quick exploratory queries
- small result sets (< 1000 rows)
- processing results programmatically
- interactive data discovery

Use `bq query` for:

- large result sets
- CSV output requested by the user
- saving results to files
- complex aggregations with many rows
## 8. Quick Reference: Common Tasks

### Discovering Types and Fields

Use MCP tools to find types and fields:

```
# Find type
Use search_types with query: ["vpc"]

# Find field
Use search_fields with:
  typeApiName: "CA10__CaAwsVpc__c"
  query: ["default"]
```

### Understanding Field Values

Use `analyze_field` to understand actual data values before writing conditions:

```
Use analyze_field with:
  typeApiName: "CA10__CaAwsVpc__c"
  fieldApiName: "CA10__isDefault__c"
# Returns: Statistics including topValues
```

### Finding Relationships

Use `get_relationship_graph` to discover relationships:

```
Use get_relationship_graph with:
  startTypeApiName: "CA10__CaAwsInstance__c"
# Returns: All relationship edges, including volumes
```

### Querying Data

Execute BigQuery SQL directly using MCP:

```
Use execute_query with sql:
  SELECT Name, CA10__vpcId__c
  FROM `cloudaware-cacanew.sobjects.CA10__CaAwsVpc__c`
  WHERE CA10__disappearanceTime__c IS NULL
  LIMIT 10
```

### Valid Policy Categories

Only these values are allowed in policy.yaml:

- `PERFORMANCE`
- `RELIABILITY`
- `COST`
- `SECURITY`

### Capturing Test Data

Always use the capture-test-data command to capture test data in one step:

```shell
# Step 1: Capture test data (wip.logic.yaml should NOT have a testData section yet)
repo-manager policies capture-test-data /path/to/wip.logic.yaml

# This command automatically:
# - generates the SQL query to .generated/capture_test_data.sql
# - executes it against BigQuery
# - saves the results to test-data.json

# Step 2: Add a testData section to wip.logic.yaml now that you have actual test data:
# testData:
#   - file: "test-data.json"

# Step 3: Edit test-data.json to add expectedResult for each test case
```
## 9. Reference Documentation

IMPORTANT: Always consult the guides directory BEFORE implementing policies.

Depending on the task, you must read the relevant documentation in the guides directory:

### Policy Development

- Policy Development Guide - Comprehensive guide to developing policies
- Repository Structure - Understanding the repository layout
- Naming Conventions - Required naming patterns for files and directories
- Logic Files - How to write policy logic implementations
- Extracts - Reusable data extraction rules
- Related Lists - Working with related objects
- Test Data - Creating and using test data for policies

### Operations

- Operations Reference - Complete list of available operations
- Data Access Operations - FIELD and EXTRACT operations
- Comparison Operations - IS_EQUAL, IS_EMPTY, etc.
- Logical Operations - AND, OR, NOT operations
- Numerical Comparison - GREATER_THAN, LESS_THAN, etc.
- Collection Operations - Working with lists and sets
- JSON Operations - Parsing and querying JSON data
- Related List Operations - RELATED_LIST_HAS, RELATED_LIST_COUNT, etc.

### Type System

- Type System Guide - Understanding Compliance Engine data types
- Text Type - Case-insensitive string handling
- Bytes Type - Case-sensitive string handling
- Collection Types - Lists and Sets behavior

### Object Relationships

- Object Relationships - Understanding how objects are connected
- Lookup Validation - IS_EMPTY_LOOKUP and NOT_EMPTY_LOOKUP operations

### Repository Management

- repo-manager CLI Reference - Complete command reference
- Documentation Generation - How to generate and maintain documentation
## 10. Learning Resources (in order of priority)

1. **Similar existing policies:** The repository contains hundreds of working policies; use them as your primary reference.
2. **Unit tests (`ce/unit-test/`):** The most up-to-date examples of how operations work, including edge cases.
3. **Guides (`guides/developer/`):** Comprehensive documentation for operations, the type system, and core concepts.
4. **MCP tools:** Use `search_types`, `search_fields`, and `analyze_field` to discover schema and data.
5. **Captured test data:** Use `repo-manager policies capture-test-data` to see the actual data structure.

When implementing a policy:

1. Find 2-3 similar policies and study them completely.
2. Use MCP tools to discover types, fields, and relationships.
3. Use `analyze_field` to understand field values before writing conditions.
4. Check unit tests for any complex operations you're using.
5. Use `repo-manager policies capture-test-data` to get the real test data structure.
6. Test iteratively and fix one issue at a time.
7. Log everything in the task readme for future reference.
## 11. Environment Assumptions

The agent environment is assumed to have:

- **MCP server:** The CloudAware MCP server providing schema discovery and query tools.
- **bq CLI:** Properly configured for BigQuery access.
- **repo-manager:** Installed and authenticated with the Cloudaware API.

There is no need to manually specify project IDs or configure authentication.
## 12. MCP Benefits Summary

The CloudAware MCP tools provide several key benefits:

- **Faster discovery:** No need to download and parse large type.json files.
- **Better search:** Keyword search across labels, descriptions, and help text.
- **Field value analysis:** Understand actual data values before writing conditions.
- **Relationship mapping:** A graph of relationships between objects.
- **Direct querying:** Execute BigQuery SQL directly, without conversion steps.
- **Smaller context:** MCP tools return only relevant results, keeping context size manageable.
- **Type safety:** Field data types and descriptions are available upfront.

When to use each tool:
| Task | Recommended Tool |
|---|---|
| Find a type | `search_types` |
| Find a field | `search_fields` |
| Understand field values | `analyze_field` |
| Find relationships | `get_relationship_graph` |
| Query data (small results) | `execute_query` |
| Query data (large exports) | `bq query` |
| Policy testing | `repo-manager policies test` |
| Test data capture | `repo-manager policies capture-test-data` |