
Cookbook

Step-by-step guides for common Runhuman use cases.


Issue Testing Automation

Automatically verify that issues are fixed when PRs are merged or commits that fix issues are pushed.

How It Works

  1. A PR with “Fixes #123” in the description is merged or a commit that closes an issue is pushed to main
  2. The action analyzes the linked issue to generate test instructions
  3. A human tester verifies the fix on your deployment URL
  4. Results are posted as a comment on the issue
  5. If the test fails, the issue is reopened

Setup

Add these secrets and variables to your repository:

Name                 | Type     | Description
RUNHUMAN_API_KEY     | Secret   | Your API key
RUNHUMAN_TESTING_URL | Variable | Your staging/preview URL

Create the workflow file:

# .github/workflows/test-issues.yml
name: Test Linked Issues

on:
  workflow_run:
    workflows: [CI]  # Replace with your deploy workflow name
    types: [completed]
    branches: [main]

concurrency:
  group: test-issues-${{ github.event.workflow_run.head_sha }}
  cancel-in-progress: true

jobs:
  test-issues:
    if: github.event.workflow_run.conclusion == 'success'
    runs-on: ubuntu-latest
    steps:
      - uses: volter-ai/runhuman-action@v1
        with:
          url: ${{ vars.RUNHUMAN_TESTING_URL }}
          pr-numbers: '[${{ github.event.workflow_run.pull_requests[0].number }}]'
          api-key: ${{ secrets.RUNHUMAN_API_KEY }}
          on-success-add-labels: '["qa:passed"]'
          on-failure-add-labels: '["qa:failed"]'
          fail-on-failure: true

Configuration

See the GitHub Actions documentation for full configuration options including:

  • Label management (add/remove labels on success, failure, timeout)
  • Workflow control (fail-on-error, fail-on-failure, fail-on-timeout)
  • Test configuration (target-duration-minutes, device-class, output-schema)

Writing Testable Issues

Include a test URL and clear reproduction steps:

## Bug Description
The checkout button is unresponsive on Safari.

## Test URL
https://staging.myapp.com/checkout

## Steps to Reproduce
1. Add items to cart
2. Go to checkout
3. Click "Place Order"
4. Nothing happens

## Expected Behavior
Order should be submitted and confirmation shown.

Bulk Issue Testing

Test all open issues in your repository on a schedule or on-demand.

How It Works

  1. Workflow fetches all open issues with the qa-test label
  2. Each issue is tested in parallel using matrix strategy
  3. Results are posted as comments on each issue
  4. Failed issues get reopened and labeled

Setup

# .github/workflows/test-all-issues.yml
name: Test All Open Issues

on:
  # Run daily at 9 AM UTC
  schedule:
    - cron: '0 9 * * *'
  # Also allow manual triggering (drop the schedule block for manual-only runs)
  workflow_dispatch:

jobs:
  find-issues:
    runs-on: ubuntu-latest
    outputs:
      issues: ${{ steps.get-issues.outputs.issues }}
      count: ${{ steps.get-issues.outputs.count }}
    steps:
      - name: Get open issues with qa-test label
        id: get-issues
        env:
          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
        run: |
          issues=$(gh issue list \
            --repo ${{ github.repository }} \
            --label "qa-test" \
            --state open \
            --json number \
            --jq '[.[].number]')
          echo "issues=$issues" >> $GITHUB_OUTPUT
          echo "count=$(echo $issues | jq length)" >> $GITHUB_OUTPUT

  test-issue:
    needs: find-issues
    if: needs.find-issues.outputs.count != '0'
    runs-on: ubuntu-latest
    strategy:
      fail-fast: false
      max-parallel: 3  # Limit concurrent tests to control costs
      matrix:
        issue: ${{ fromJson(needs.find-issues.outputs.issues) }}
    steps:
      - uses: volter-ai/runhuman-action@v1
        with:
          url: ${{ vars.RUNHUMAN_TESTING_URL }}
          issue-numbers: '[${{ matrix.issue }}]'
          api-key: ${{ secrets.RUNHUMAN_API_KEY }}
          on-failure-add-labels: '["qa:failed"]'

Cost Considerations

  • Each test costs ~$0.32-0.54 (3-5 minutes)
  • 10 issues = ~$3-5 per run
  • Use max-parallel to control concurrent spending
  • Consider running less frequently for large issue counts
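The per-run arithmetic above can be sketched as a small helper, using the ~$0.32-0.54 per-test range quoted earlier (the function name and the assumption of linear scaling with issue count are illustrative, not part of the Runhuman API):

```javascript
// Rough cost estimate for a bulk run, assuming cost scales linearly
// with issue count at the quoted $0.32-0.54 per-test range.
function estimateRunCost(issueCount, perTestLow = 0.32, perTestHigh = 0.54) {
  return {
    low: +(issueCount * perTestLow).toFixed(2),
    high: +(issueCount * perTestHigh).toFixed(2),
  };
}

console.log(estimateRunCost(10)); // roughly $3.20-5.40 per run for 10 issues
```

Multiply by runs per month to budget a schedule: a daily run over 10 labeled issues lands around $96-162/month, which is where `max-parallel` and a less frequent cron start to matter.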

Preview Deployment Testing

Test Vercel, Netlify, or other preview deployments automatically.

Vercel

name: Test Vercel Preview
on:
  deployment_status:

jobs:
  test:
    if: github.event.deployment_status.state == 'success'
    runs-on: ubuntu-latest
    steps:
      - uses: volter-ai/runhuman-action@v1
        with:
          api-key: ${{ secrets.RUNHUMAN_API_KEY }}
          url: ${{ github.event.deployment_status.target_url }}
          description: Test the preview deployment
          output-schema: |
            {
              "pageLoads": { "type": "boolean", "description": "Page loads correctly?" },
              "noErrors": { "type": "boolean", "description": "No console errors?" }
            }

Netlify

name: Test Netlify Preview
on:
  pull_request:

jobs:
  test:
    runs-on: ubuntu-latest
    steps:
      - name: Wait for Netlify
        uses: jakepartusch/wait-for-netlify-action@v1.4
        id: netlify
        with:
          site_name: your-site-name
          max_timeout: 300

      - uses: volter-ai/runhuman-action@v1
        with:
          api-key: ${{ secrets.RUNHUMAN_API_KEY }}
          url: ${{ steps.netlify.outputs.url }}
          description: Test the Netlify preview

Custom Preview URLs

If your preview URLs follow a pattern:

url: https://pr-${{ github.event.pull_request.number }}.preview.myapp.com
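If you need the same pattern outside the workflow file (for example in a script that posts the URL elsewhere), it is a one-line template; `preview.myapp.com` here is just the placeholder domain from the example above:

```javascript
// Build a preview URL from a PR number, mirroring the pattern above.
const previewUrl = (prNumber) => `https://pr-${prNumber}.preview.myapp.com`;

console.log(previewUrl(42)); // https://pr-42.preview.myapp.com
```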

Visual Regression Testing

Catch UI bugs before they reach production.

Basic Visual Check

const result = await fetch('https://runhuman.com/api/run', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    url: 'https://staging.myapp.com/product/123',
    description: 'Check for visual issues: broken images, layout problems, text overflow, color contrast issues',
    outputSchema: {
      imagesLoad: { type: 'boolean', description: 'All images load correctly?' },
      layoutCorrect: { type: 'boolean', description: 'Layout looks correct, no overflow?' },
      textReadable: { type: 'boolean', description: 'All text is readable?' },
      visualIssues: { type: 'string', description: 'Describe any visual problems found' }
    }
  })
});
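The fetch boilerplate is identical in every example in this guide, so it can be wrapped once. This is a sketch: the request shape matches the examples above, but the response is simply assumed to be JSON here; see the REST API docs for the actual response fields.

```javascript
// Minimal wrapper around the POST /api/run call used throughout this guide.
// Assumes a JSON response body; consult the REST API reference for its shape.
async function runhumanTest(apiKey, payload) {
  const res = await fetch('https://runhuman.com/api/run', {
    method: 'POST',
    headers: {
      'Authorization': `Bearer ${apiKey}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify(payload),
  });
  if (!res.ok) throw new Error(`Runhuman request failed: ${res.status}`);
  return res.json();
}
```

Each example below then reduces to `runhumanTest(API_KEY, { url, description, outputSchema })`.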

Mobile Responsiveness

const result = await fetch('https://runhuman.com/api/run', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    url: 'https://staging.myapp.com',
    description: 'Test on mobile: check navigation menu, forms, buttons. Look for overflow, tiny text, unreachable elements.',
    outputSchema: {
      navigationWorks: { type: 'boolean', description: 'Mobile nav opens and closes correctly?' },
      formsUsable: { type: 'boolean', description: 'Forms are usable on mobile?' },
      mobileIssues: { type: 'array', description: 'List of mobile-specific issues' }
    }
  })
});

Multi-Step Flow Testing

Test complex user journeys that span multiple pages.

Checkout Flow

const result = await fetch('https://runhuman.com/api/run', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    url: 'https://staging.myapp.com/products',
    description: `
      1. Browse products and add one to cart
      2. Go to cart and verify the item is there
      3. Proceed to checkout
      4. Fill shipping information
      5. Select payment method
      6. Verify order summary shows correct total
      7. Do not submit the final order
    `,
    targetDurationMinutes: 10,
    outputSchema: {
      addToCartWorks: { type: 'boolean', description: 'Product added to cart successfully?' },
      cartShowsItem: { type: 'boolean', description: 'Cart displays the added item?' },
      checkoutLoads: { type: 'boolean', description: 'Checkout page loads?' },
      shippingFormWorks: { type: 'boolean', description: 'Shipping form accepts input?' },
      totalCorrect: { type: 'boolean', description: 'Order total looks correct?' },
      issues: { type: 'array', description: 'Any issues encountered' }
    }
  })
});

User Onboarding

const result = await fetch('https://runhuman.com/api/run', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    url: 'https://staging.myapp.com/signup',
    description: `
      1. Create account with email test-${Date.now()}@example.com
      2. Complete the onboarding wizard
      3. Set up profile with sample data
      4. Verify you reach the dashboard
    `,
    targetDurationMinutes: 8,
    outputSchema: {
      signupWorks: { type: 'boolean', description: 'Account created successfully?' },
      onboardingCompletes: { type: 'boolean', description: 'Onboarding wizard completes?' },
      profileSaves: { type: 'boolean', description: 'Profile changes save?' },
      dashboardReached: { type: 'boolean', description: 'User reaches dashboard?' },
      confusingSteps: { type: 'array', description: 'Any confusing or unclear steps' }
    }
  })
});

Authentication Flows

const result = await fetch('https://runhuman.com/api/run', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    url: 'https://staging.myapp.com/login',
    description: `
      Test authentication:
      1. Try login with valid credentials (test@example.com / demo123)
      2. Verify redirect to dashboard
      3. Log out
      4. Try login with wrong password
      5. Verify error message is shown
      6. Try forgot password link
    `,
    targetDurationMinutes: 8,
    outputSchema: {
      loginWorks: { type: 'boolean', description: 'Valid login succeeds?' },
      logoutWorks: { type: 'boolean', description: 'Logout works?' },
      errorShown: { type: 'boolean', description: 'Error shown for wrong password?' },
      errorMessage: { type: 'string', description: 'What error message is displayed?' },
      forgotPasswordWorks: { type: 'boolean', description: 'Forgot password link works?' }
    }
  })
});

Scheduled Daily Testing with Templates

Run comprehensive daily tests at a specific time with reusable templates, detailed checklists, and full video/event review.

Use Case

You want to run the same comprehensive test every day at 5 PM (or any specific time) to catch regressions early. The test should use a detailed checklist that testers fill out, and you should be able to review video recordings and activity logs in the dashboard.

How It Works

  1. Create a reusable template with a detailed output schema (checkboxes for each verification item)
  2. Set up a GitHub Actions workflow with cron scheduling
  3. Review test results in the dashboard: watch the video recording and see all captured events

Step 1: Create a Template

The easiest approach is a repo template — a markdown file committed to your repository. Create .runhuman/templates/daily-smoke.md:

---
name: Daily Smoke Test
duration: 10
device_class: desktop
---

Comprehensive daily test of core functionality:

1. Load the home page and verify it renders without errors
2. Test all main navigation links
3. Log in with test credentials
4. Verify the dashboard displays correctly
5. Test the search feature with a few queries
6. Run through the checkout flow
7. Check the site on a mobile viewport
8. Open the browser console and check for errors

## Results

Home page loads: [ ]
Navigation works: [ ]
Login flow works: [ ]
Dashboard displays: [ ]
Search works: [ ]
Checkout completes: [ ]
Mobile responsive: [ ]
No console errors: [ ]
Issues found: ___
Additional notes: ___

Commit and push this file to your repo. See Templates for the full format reference.

Alternative: Create via CLI

runhuman templates create "Daily Smoke Test" \
  --project proj_abc123 \
  -d "Comprehensive daily test of core functionality" \
  --duration 600 \
  --schema ./daily-test-schema.json
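The `--schema` flag above points at a JSON file. A sketch of what `daily-test-schema.json` could contain, assuming the same field shape as the `outputSchema` examples in this guide (the field names mirror the results shown in Step 3):

```json
{
  "homePageLoads": { "type": "boolean", "description": "Home page loads without errors?" },
  "navigationWorks": { "type": "boolean", "description": "Main navigation links work?" },
  "loginFlowWorks": { "type": "boolean", "description": "Login with test credentials works?" },
  "dashboardDisplays": { "type": "boolean", "description": "Dashboard displays correctly?" },
  "searchFunctional": { "type": "boolean", "description": "Search returns sensible results?" },
  "checkoutWorks": { "type": "boolean", "description": "Checkout flow completes?" },
  "mobileResponsive": { "type": "boolean", "description": "Site usable on a mobile viewport?" },
  "noConsoleErrors": { "type": "boolean", "description": "No console errors?" },
  "issuesFound": { "type": "array", "description": "Issues found during the test" },
  "additionalNotes": { "type": "string", "description": "Any other observations" }
}
```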

Step 2: Schedule with GitHub Actions

Create a workflow file that runs daily at 5 PM UTC (adjust timezone as needed):

# .github/workflows/daily-qa-test.yml
name: Daily QA Test

on:
  schedule:
    # Runs at 5 PM UTC every day (cron format: minute hour day month weekday)
    # For 5 PM EST (10 PM UTC), use: '0 22 * * *'
    # For 5 PM PST (1 AM UTC next day), use: '0 1 * * *'
    - cron: '0 17 * * *'
  # Also allow manual triggering for testing
  workflow_dispatch:

jobs:
  daily-test:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - name: Run Daily QA Test
        uses: volter-ai/runhuman-action@v1
        with:
          api-key: ${{ secrets.RUNHUMAN_API_KEY }}
          url: ${{ vars.RUNHUMAN_TESTING_URL }}
          template-file: .runhuman/templates/daily-smoke.md

      - name: Comment on Failure
        if: failure()
        run: |
          echo "Daily QA test failed! Check the dashboard for details."

Cron Syntax Reference:

┌───────────── minute (0-59)
│ ┌───────────── hour (0-23)
│ │ ┌───────────── day of month (1-31)
│ │ │ ┌───────────── month (1-12)
│ │ │ │ ┌───────────── day of week (0-6, Sunday to Saturday)
│ │ │ │ │
0 17 * * *  # 5 PM UTC daily

Common Schedules:

Time              | Cron Expression | Description
5 PM UTC daily    | 0 17 * * *      | Every day at 5 PM UTC
9 AM UTC weekdays | 0 9 * * 1-5     | Monday-Friday at 9 AM UTC
Every 6 hours     | 0 */6 * * *     | 12 AM, 6 AM, 12 PM, 6 PM UTC
Twice daily       | 0 9,17 * * *    | 9 AM and 5 PM UTC

Note: GitHub Actions runs on UTC time. Convert your local time to UTC:

  • EST: Add 5 hours (5 PM EST = 10 PM UTC = 0 22 * * *)
  • PST: Add 8 hours (5 PM PST = 1 AM UTC next day = 0 1 * * *)
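The conversion above can be captured in a small helper (the function name is illustrative; `offsetHours` is the zone's offset from UTC, e.g. EST = -5, PST = -8):

```javascript
// Build a daily cron expression in UTC from a local hour and UTC offset.
// Subtracting a negative offset adds hours, wrapping past midnight.
function dailyCronUtc(localHour, offsetHours) {
  const utcHour = ((localHour - offsetHours) % 24 + 24) % 24;
  return `0 ${utcHour} * * *`;
}

console.log(dailyCronUtc(17, -5)); // 5 PM EST -> '0 22 * * *'
console.log(dailyCronUtc(17, -8)); // 5 PM PST -> '0 1 * * *'
```

Note this ignores daylight saving time; GitHub's cron always fires on UTC, so a "5 PM local" schedule drifts an hour when DST changes.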

Required Secrets and Variables:

Add these to your repository settings:

Name                 | Type     | Description
RUNHUMAN_API_KEY     | Secret   | Your API key from the dashboard
RUNHUMAN_TESTING_URL | Variable | Your staging/production URL to test

Step 3: Review Results in Dashboard

After the scheduled test runs:

  1. Open the Dashboard

  2. View Test Details

    • Click on the job to see full details
    • Watch the video: See exactly what the tester did, recorded from their screen
    • Review the checklist: See which items passed/failed based on your output schema
    • Check the events: See all browser interactions, clicks, navigation, console logs
  3. Dashboard Features

    • Video playback: Scrub through the recording to see specific moments
    • Event timeline: See timestamps for every action taken
    • Console logs: Review any JavaScript errors or warnings
    • Network activity: See API calls and their responses
    • Screenshots: View captured screenshots at key moments
  4. Results Structure

The test results will show your schema fields as checkboxes:

✅ homePageLoads: true
✅ navigationWorks: true
✅ loginFlowWorks: true
✅ dashboardDisplays: true
❌ searchFunctional: false
✅ checkoutWorks: true
✅ mobileResponsive: true
❌ noConsoleErrors: false

Issues found:
- Search returns 500 error for special characters
- Console error: "Cannot read property 'map' of undefined" in ProductList.tsx

Additional notes:
- Login is slower than usual (3-4 seconds)
- Mobile menu animation is laggy on iPhone 12

Full Workflow Example

1. Create the template (.runhuman/templates/health-check.md):

---
name: Production Health Check
duration: 10
device_class: desktop
url: https://myapp.com
---

Daily comprehensive test of the production environment.

1. Verify the homepage loads
2. Test login with test credentials
3. Check the dashboard for data accuracy
4. Run a search query
5. Complete a checkout flow

## Results

Homepage loads: [ ]
Login works: [ ]
Dashboard accurate: [ ]
Search works: [ ]
Checkout completes: [ ]
Issues found: ___

2. Add the workflow file (.github/workflows/daily-health-check.yml):

name: Production Health Check

on:
  schedule:
    - cron: '0 17 * * *'  # 5 PM UTC
  workflow_dispatch:

jobs:
  health-check:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - uses: volter-ai/runhuman-action@v1
        with:
          api-key: ${{ secrets.RUNHUMAN_API_KEY }}
          template-file: .runhuman/templates/health-check.md

3. Wait for scheduled run (or trigger manually):

# Manually trigger the workflow for testing
gh workflow run daily-health-check.yml

4. Review the results in the dashboard once the run completes.


Tips and Best Practices

Template Design:

  • Keep output schemas focused (8-12 items maximum)
  • Use boolean fields for yes/no checks
  • Include an issues array for detailed problem descriptions
  • Add an additionalNotes string field for tester observations

Scheduling:

  • Run during low-traffic periods to avoid affecting real users
  • Consider timezone differences when scheduling
  • Use workflow_dispatch to allow manual triggering for testing

Cost Management:

  • Each 10-minute test costs approximately $0.54-0.90
  • Daily tests = ~$16-27/month
  • Use shorter durations for simple smoke tests (5 minutes = $0.27-0.45)

Notifications:

  • Set up Slack/email notifications for test failures
  • Use GitHub Actions’ built-in notifications
  • Consider creating GitHub issues automatically for failed tests

Video Review:

  • Videos are essential for debugging visual issues
  • Scrub to specific timestamps using the event timeline
  • Share video links with your team for collaborative debugging

Next Steps

Topic                        | Link
Template format and options  | Templates
Full technical specification | Reference
REST API integration         | REST API
CI/CD integration            | GitHub Actions