Phaedrus - Automated UI Testing Framework Powered by Anthropic Computer Use

Proof-of-concept framework for UI testing powered by AI that combines test plan generation, test execution, and result tracking using Claude 3.5 Sonnet (regular model and computer use), Streamlit, Google Sheets, and Jira integration. Presented at NextGen AI Agents: Computer Use Hackathon, Nov 9-10, 2024.

Presentation link: https://drive.google.com/file/d/1xl_hLYxD_nHRGAnk0aB7dozV8AOj7ttn/view?usp=share_link

Team: Bayram Annakov Yersultan Sapar

Overview

This framework provides end-to-end automation for UI testing:

Test Plan Generation: Automatically generates test plans using Claude 3.5 Sonnet based on website description and requirements
Test Execution: Executes tests through automated browser interactions using Claude 3.5 Sonnet and Playwright
Result Tracking: Records test results in Google Sheets and creates Jira tickets for failures
Visual Verification: Captures and stores screenshots for test evidence

Key Components

streamlit.py: Web interface for test execution and monitoring
loop.py: Core test execution engine using Claude 3.5 Sonnet
planner.py: Test plan generation using Claude 3.5 Sonnet
jira.py: Jira integration for issue tracking
spreadsheet.py: Google Sheets integration for test plan and results management
record_result.py: Test result recording with screenshot capture

Features

Automated test plan generation based on website analysis
Browser automation with Chrome/Firefox/WebKit support
Real-time test execution monitoring
Automatic screenshot capture for visual verification
Google Sheets integration for test plan management
Jira ticket creation for test failures
Support for multiple LLM providers (Anthropic, Bedrock, Vertex)

Prerequisites

Python 3.12+
Google Cloud credentials
Jira API token
Anthropic API key
Streamlit
Google Sheets API access

Below is the original README from Anthropic Computer Use (for Mac)

https://x.com/deedydas/status/1849481225041559910

Anthropic Computer Use (for Mac)

Anthropic Computer Use is a beta Anthropic feature which runs a Docker image with Ubuntu and controls it. This fork allows you to run it natively on macOS, providing direct system control through native macOS commands and utilities.

Caution

This comes with obvious risks. The Anthropic agent can control everything on your Mac. Please be careful. Anthropic's new Claude 3.5 Sonnet model refuses to do unsafe things like purchase items or download illegal content.

Features

Native macOS GUI interaction (no Docker required)
Screen capture using native macOS commands
Keyboard and mouse control through cliclick
Multiple LLM provider support (Anthropic, Bedrock, Vertex)
Streamlit-based interface
Automatic screen resolution scaling
File system interaction and editing capabilities

Prerequisites

macOS Sonoma 15.7 or later
Python 3.12+
Homebrew (for installing additional dependencies)
cliclick (brew install cliclick) - Required for mouse and keyboard control

Setup Instructions

Clone the repository and navigate to it:

git clone https://github.com/deedy/mac_computer_use.git
cd mac_computer_use

Create and activate a virtual environment:

python3.12 -m venv venv
source venv/bin/activate

Run the setup script:

chmod +x setup.sh
./setup.sh

Install Python requirements:

pip install -r requirements.txt

Running the Demo

Set up your environment and Anthropic API key

In a .env file add:

API_PROVIDER=anthropic
ANTHROPIC_API_KEY=<key>
WIDTH=800
HEIGHT=600
DISPLAY_NUM=1

Set the screen dimensions (recommended: stay within XGA/WXGA resolution), and put in your key from Anthropic Console.

Start the Streamlit app:

streamlit run streamlit.py

The interface will be available at http://localhost:8501

Screen Size Considerations

We recommend using one of these resolutions for optimal performance:

XGA: 1024x768 (4:3)
WXGA: 1280x800 (16:10)
FWXGA: 1366x768 (~16:9)

Higher resolutions will be automatically scaled down to these targets to optimize model performance. You can set the resolution using environment variables:

export WIDTH=1024
export HEIGHT=768
streamlit run streamlit.py

Important

The Beta API used in this reference implementation is subject to change. Please refer to the API release notes for the most up-to-date information.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
tools		tools
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
__init__.py		__init__.py
activate.sh		activate.sh
app.pid		app.pid
loop.py		loop.py
requirements.txt		requirements.txt
setup.sh		setup.sh
streamlit.py		streamlit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Phaedrus - Automated UI Testing Framework Powered by Anthropic Computer Use

Overview

Key Components

Features

Prerequisites

Below is the original README from Anthropic Computer Use (for Mac)

Anthropic Computer Use (for Mac)

Features

Prerequisites

Setup Instructions

Running the Demo

Set up your environment and Anthropic API key

Screen Size Considerations

About

Uh oh!

Releases

Packages

Languages

yersultan-17/nextgen_qa_local

Folders and files

Latest commit

History

Repository files navigation

Phaedrus - Automated UI Testing Framework Powered by Anthropic Computer Use

Overview

Key Components

Features

Prerequisites

Below is the original README from Anthropic Computer Use (for Mac)

Anthropic Computer Use (for Mac)

Features

Prerequisites

Setup Instructions

Running the Demo

Set up your environment and Anthropic API key

Screen Size Considerations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages