Azure Resource Management Assistant (ARMA)

ARMA is a modular, multi-agent assistant for Azure resource provisioning, validation, and management, built with LangGraph and LangChain.

Overview

Azure Resource Management Assistant (ARMA) provides a robust, streaming, and user-friendly workflow for managing Azure resources. It leverages a multi-agent architecture to extract user intent, validate ARM templates, and manage Azure resources, with all agent/system progress streamed to the UI.

Features

Modular, multi-agent workflow for Azure resource management
Intent extraction, template validation, deployment, and resource action agents
Real-time streaming of agent/system progress to the UI
Human-in-the-loop support for missing or ambiguous fields
Production-grade, extensible codebase using LangGraph and LangChain

Architecture

ARMA is composed of several subgraphs/agents:

Intent Detection Agent: Extracts user intent and resource details
Template Validation Agent: Fetches and validates ARM templates
Deployment Agent: Handles Azure deployments
Resource Action Agent: Manages resource actions (create, update, delete, etc.)
Human Node: Prompts for missing/unclear fields

All state and progress are logged for real-time UI display.

Getting Started

Clone the repository
Install dependencies from requirements.txt
Run the main entry point (see usage examples in the codebase)

Usage

Interact with ARMA via the provided Streamlit or console harness
All agent progress and system messages are streamed to the UI

Architecture

The workflow is composed of three main subgraphs (agents), each responsible for a distinct phase of the Azure provisioning process:

Intent Detection: Extracts user intent, resource type, and relevant fields from natural language.
Template Validation: Validates user-provided fields against ARM template requirements.
Deployment: Deploys the validated template to Azure, handling both resource group and subscription scopes.

These subgraphs are orchestrated by a master graph (see implementation in the main app), which manages the overall workflow and state transitions.

State Management

All conversational and workflow state is managed via a single, strongly-typed state object (ARMAState in state.py). This state includes:

messages: Full conversation history (user, agent, and system/progress messages)
intent, resource_type, provided_fields, resource_group_name, subscription_id, subscription_name, location
template, scope, parameter_file_content, validation_error, deployment_status, etc.

All user and agent/system interactions are logged in the messages list, ensuring the UI can display the complete conversation and agent progress in real time.

Agent & Subgraph Design

1. Intent Detection Subgraph (`agents/intent_agent.py`)

Purpose: Extracts the user's intent, Azure resource type, and all relevant fields from the user's natural language input.

Nodes:

intent_extraction: Uses an LLM (Azure OpenAI) to extract intent, resource type, and fields.
scope_fields_check: Ensures required fields (resource group, subscription) are present; interrupts if missing.
template_fetch: Loads the correct ARM template based on resource type.
scope_determination: Determines deployment scope (resource group, subscription, etc.) from the template schema.

Flow:

flowchart TD
    START([START]) --> intent_extraction
    intent_extraction --> scope_fields_check
    scope_fields_check --> decision{intent}
    decision -- create/update --> template_fetch
    decision -- get/list/delete --> END([END])
    template_fetch --> scope_determination
    scope_determination --> END([END])

Key Features:

Uses a detailed system prompt with examples for robust extraction.
Handles edge cases (e.g., GUID vs. name for subscription).
Interrupts and prompts user if required fields are missing.

2. Template Validation Subgraph (`agents/validation_agent.py`)

Purpose: Validates that all required ARM template parameters are provided and correct, using both code and LLM-based validation.

Nodes:

check_subscription: Verifies the Azure subscription exists and is enabled.
check_resource_group: Verifies the resource group exists in the subscription.
validate: Uses an LLM to check provided fields against template parameters, types, and allowed values.
prompt_for_missing: Interrupts and prompts the user for any missing or invalid parameters.

Flow:

flowchart TD
    START([START]) --> check_subscription
    check_subscription --> check_resource_group
    check_resource_group --> validate
    validate --> decision{parameters valid?}
    decision -- yes --> END([END])
    decision -- no --> prompt_for_missing
    prompt_for_missing --> END([END])

Key Features:

LLM intelligently maps user fields to template parameters.
Handles type checking, allowed values, and extra fields.
Prompts user for missing/invalid parameters.

3. Deployment Subgraph (`agents/deployment_agent.py`)

Purpose: Deploys the validated ARM template to Azure, handling both resource group and subscription-level deployments.

Nodes:

resource_group_deployment: Deploys to a resource group (creates it if needed).
subscription_deployment: Deploys at the subscription scope.

Flow:

flowchart TD
    START([START]) --> decision{scope}
    decision -- resourceGroup --> resource_group_deployment
    decision -- subscription --> subscription_deployment
    resource_group_deployment --> END([END])
    subscription_deployment --> END([END])

Key Features:

Uses Azure SDK for Python for deployments.
Handles resource group creation if missing.
Logs deployment status and errors.

4. Resource Action Agent (`agents/resource_action_agent.py`)

Purpose: Handles Azure resource management actions such as get, list, and delete for resources, using the Azure SDK.

Nodes:

get_resource: Retrieves details of a specific Azure resource.
list_resources: Lists resources of a specified type in a resource group.
delete_resource: Deletes a specified Azure resource.

Flow:

flowchart TD
    START([START]) --> decision{intent}
    decision -- get --> get_resource
    decision -- list --> list_resources
    decision -- delete --> delete_resource
    get_resource --> END([END])
    list_resources --> END([END])
    delete_resource --> END([END])

Key Features:

Uses Azure SDK for Python for all resource actions.
Logs and stores all results in the workflow state in a consistent, JSON-formatted way.
Handles missing required fields by interrupting and prompting for user input.
Supports extensible intent-based routing for future resource actions.

Main Graph Wiring

The master graph orchestrates the full workflow by chaining the subgraphs:

flowchart TD
    START([START]) --> intent_detection
    intent_detection --> decision{intent}
    decision -- get/list/delete --> resource_action
    decision -- create/update --> template_validation
    decision -- other --> END([END])
    resource_action --> END([END])
    template_validation --> deployment
    deployment --> END([END])

Each subgraph is compiled and added as a node.
State is passed between subgraphs, with all messages and progress logged.
Interrupts (e.g., missing fields) are handled gracefully, prompting the user as needed.

Message Logging & UI Integration

All user, agent, and system/progress messages are appended to the messages list in the state.
The UI (e.g., streamlit_app.py) displays the full conversation, including agent/system progress and interruptions.
The Streamlit app formats and displays all messages, updating in real time as the workflow progresses.

Example UI flow:

User submits a request (e.g., "create a storage account...").
Each agent/subgraph appends progress/system messages (e.g., "Extracting intent...", "Validating template...").
If user input is needed, the system interrupts and prompts for missing fields.
All messages are shown in the chat interface, providing full transparency.

ARM Template Storage

ARM templates are stored in the quickstarts/ directory, organized by resource type (e.g., quickstarts/microsoft.storage/storageaccounts.json).
The intent detection agent dynamically loads the correct template based on the extracted resource type.

How to Run

Install dependencies:
```
pip install -r requirements.txt
```

Set up environment variables:

# Your AI Foundry project chat model
AZURE_OPENAI_ENDPOINT=
AZURE_OPENAI_DEPLOYMENT=gpt-4o
AZURE_OPENAI_API_VERSION=2024-12-01-preview

LANGCHAIN_TRACE_V2=False

LANGSMITH_TRACING=true
LANGSMITH_API_KEY=
LANGSMITH_ENDPOINT=https://api.smith.langchain.com

# Azure DefaultCredentials
AZURE_CLIENT_ID=
AZURE_TENANT_ID=
AZURE_CLIENT_SECRET=

Run the Streamlit app:
```
python streamlit_app.py
```
Interact with the assistant:
- Enter natural language requests (e.g., "create a storage account named test in rg demo").
- The UI will display all agent/system progress and prompt for any missing information.

Extending the System

Add new resource types: Add new ARM templates to the quickstarts/ directory.
Add new agents/subgraphs: Create new agent modules in agents/ and wire them into the main application logic.
Customize validation or deployment logic: Edit the relevant agent node functions for custom business logic or additional checks.
Prompts and Factories:
- Add or update prompt templates in the prompts/ directory.
- Use the factory/ directory for shared construction logic or utilities.

File Structure

.
├── agents/                  # All agent implementations (intent, validation, deployment, resource actions)
│   ├── __init__.py
│   ├── intent_agent.py
│   ├── validation_agent.py
│   ├── deployment_agent.py
│   └── resource_action_agent.py
├── factory/                 # Shared factory functions/utilities
├── prompts/                 # Prompt templates for LLMs
├── quickstarts/             # ARM templates organized by resource type
│   ├── microsoft.storage/
│   └── microsoft.keyvault/
├── .env                     # Environment variables
├── arma.py                  # Main application logic (entry point)
├── events_handler.py        # Event handling logic
├── state.py                 # State management and schemas
├── streamlit_app.py         # Streamlit UI
├── requirements.txt         # Python dependencies
├── README.md                # Project documentation
└── ...

agents/: Contains all agent logic for intent extraction, validation, deployment, and resource actions.
factory/: Shared construction logic, factories, or utilities for agents and other components.
prompts/: Prompt templates for LLMs, organized by use case or agent.
quickstarts/: ARM templates for supported Azure resources, organized by provider/type.
arma.py: Main entry point for the application logic.
events_handler.py: Handles event streaming and logging.
state.py: Defines the ARMAState and manages workflow state.
streamlit_app.py: Streamlit-based UI for interacting with ARMA.

Appendix: Example Conversation Flow

Example 1

User: create a storage account with the following values, name: aiteststorg01, rg: myrg, subscription: 00000000-0000-0000-0000-000000000000 and region eastus

Example 2

User: delete storage account with the following values, name: aiteststorg01, rg: myrg, subscription: 00000000-0000-0000-0000-000000000000

Example 3

User: list all storage accounts in the rg: myrg, subscription: 00000000-0000-0000-0000-000000000000

Example 4

User: get key vault named mykeyvault in resource group myrg, subscription e98a7bdd-1e97-452c-939c-4edf569d31f6

If missing fields:

We use LangGraph's Interrupt to interrupt the workflow and prompt the user for the missing fields.

All of the above messages are logged in the messages list and displayed in the UI.

Work In Progress (WIP)

1. Multi-turn conversations

Currently, ARMA is designed for single-turn conversations. It will be extended to support multi-turn conversations in the future.

2. Loading Templates from a Vector Store

Currently, ARM templates are loaded from the local file system based on the extracted resource_type (e.g., quickstarts/microsoft.storage/storageaccounts.json).

Planned Improvement:

Store ARM templates in a vector store (e.g., FAISS, ChromaDB) with metadata including resource type, description, and tags.
On intent extraction, use the resource type to query the vector store for the most relevant template, enabling fuzzy matching, semantic search, and easier extensibility.
This will allow for more flexible template retrieval, support for similar resource types, and easier management of a large template library.

Intended Approach:

Index all templates in the vector store at startup or via a management script.
On user request, extract the resource type and use it as a query to the vector store.
Retrieve the best-matching template and load it into the workflow state for validation and deployment.

3. Notification Agent

Add a notification agent that sends a notification to the user when deployment is complete. It will contain deployment details.
This will allow for more flexible notification, support for different notification types (e.g., email, SMS, Azure DevOps, Teams), and easier management of a large notification library.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.vscode		.vscode
agents		agents
assets/images		assets/images
factory		factory
prompts		prompts
quickstarts		quickstarts
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTION.md		CONTRIBUTION.md
10000 Dockerfile		Dockerfile
LICENSE.md		LICENSE.md
README.md		README.md
arma.py		arma.py
langgraph.json		langgraph.json
requirements.txt		requirements.txt
state.py		state.py
streamlit_app.py		streamlit_app.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Azure Resource Management Assistant (ARMA)

Overview

Features

Architecture

Getting Started

Usage

Table of Contents

Architecture

State Management

Agent & Subgraph Design

1. Intent Detection Subgraph (`agents/intent_agent.py`)

2. Template Validation Subgraph (`agents/validation_agent.py`)

3. Deployment Subgraph (`agents/deployment_agent.py`)

4. Resource Action Agent (`agents/resource_action_agent.py`)

Main Graph Wiring

Message Logging & UI Integration

ARM Template Storage

How to Run

Extending the System

File Structure

Appendix: Example Conversation Flow

Example 1

Example 2

Example 3

Example 4

Work In Progress (WIP)

1. Multi-turn conversations

2. Loading Templates from a Vector Store

3. Notification Agent

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

eosho/ARMA

Folders and files

Latest commit

History

Repository files navigation

Azure Resource Management Assistant (ARMA)

Overview

Features

Architecture

Getting Started

Usage

Table of Contents

Architecture

State Management

Agent & Subgraph Design

1. Intent Detection Subgraph (agents/intent_agent.py)

2. Template Validation Subgraph (agents/validation_agent.py)

3. Deployment Subgraph (agents/deployment_agent.py)

4. Resource Action Agent (agents/resource_action_agent.py)

Main Graph Wiring

Message Logging & UI Integration

ARM Template Storage

How to Run

Extending the System

File Structure

Appendix: Example Conversation Flow

Example 1

Example 2

Example 3

Example 4

Work In Progress (WIP)

1. Multi-turn conversations

2. Loading Templates from a Vector Store

3. Notification Agent

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

1. Intent Detection Subgraph (`agents/intent_agent.py`)

2. Template Validation Subgraph (`agents/validation_agent.py`)

3. Deployment Subgraph (`agents/deployment_agent.py`)

4. Resource Action Agent (`agents/resource_action_agent.py`)

Packages