AI Test Automation & Playwright Testing Blog

Playwright How-to

July 19, 2026

How to Generate Playwright Tests from a Website

Go from an observed browser journey to stable locators, meaningful assertions, and a test run with trace evidence.

Read the guide →

Engineering

July 19, 2026

Blocked by Our Own Reviewer: Four BLOCKs in Twenty Minutes

60 billing PRs, 25 BLOCK verdicts, and a four-layer authorization bug our AI reviewer refused to let merge. The receipts.

Read more →

QualityMax Vibe launch offer: 1,000 AI credits per month for the first 50 users.

Product

July 19, 2026

The Vibe plan: We review every PR.

Generate with Claude Code, Codex, or your own keys — and put an independent reviewer with receipts on every pull request.

Read more →

QualityMax Studio: AI included, quality assured, with 6,000 monthly hosted-AI credits for Founding 50 users.

Product

July 19, 2026

QualityMax Studio: AI Included. Quality Assured.

One hosted-AI quality platform for crawling, generation, execution, healing, performance, audits, mobile, and PR review.

Read more →

Engineering

July 16, 2026

Your AI QA Agent Has Amnesia. We Fixed It.

How evidence-backed memory lets autonomous QA agents reuse what passed before, without treating recalled text as truth.

Read more →

Engineering

July 15, 2026

Enterprise Readiness Is Built at the Edges

AI and cloud egress need more than promises. How signed Exposure Receipts make QualityMax activity inspectable—and what enterprise readiness still requires beyond them.

Read more →

Ruslan Strazhnyk presenting QualityMax at Berlin SaaS Night

Community

June 25, 2026

Who Checks the Work? QualityMax at Berlin SaaS Night

A founder’s view from an evening of live pitches, honest SaaS conversations, and an engaged Berlin community—plus the question behind the QualityMax demo.

Read more →

Engineering

June 18, 2026

Schrödinger's Eval: Your Agent Passes and Fails Until a User Opens It

Run your eval once and you get 91%. Run it again: 95%, then 97%, then 91%. An LLM agent is stochastic, so a single pass rate measures noise as much as signal. Why agent evaluation is a probabilistic question — not a win/loss count.

Read more →

Opinion

June 10, 2026

Value Per Token: The Number the Agent-Loop Hype Forgets

Loops where agents prompt agents to write code are useful in the right hands — and oversold as a universal law by the people selling the tokens. A case for value per token, treating models as commodity suppliers, and keeping one deterministic check between generation and trust.

Read more →

Open Source

June 4, 2026

23 Free QA Skills for Your Coding Agent

We open-sourced a catalogue of 23 diagnostic QA skills for Claude Code and Codex — Core Web Vitals, secret scanning, dependency audits, dead-code detection, flaky-selector hunting, and more. No signup. Copy a folder and go.

Read more →

Engineering

June 2026

You Can't Review Your Own Work

AI code generation and AI code testing are adversarial systems that should be separate products. When the same agent writes the code and its tests, they pass by construction. The thesis QualityMax is built on — and the deterministic guardrail harness that makes it real.

Read more →

Monthly Recap

July 4, 2026

June in Review: The Month We Taught Our AI to Doubt Itself

A hallucination gate that verifies every generated test against the live app, a reviewer that learns to stay quiet, pass rates with real confidence intervals, and run status that admits “partly”.

Read more →

Monthly Recap

June 2, 2026

May in Review: The Month We Made It More Trustworthy

A rebuilt AI crawl planner grounded in the live page, platform stability work, security hardening, qmax-code going open source, and the dogfooding loop behind it all.

Read more →

Faroe Islands cliffs during a phone-first QualityMax shipping week

Dogfooding

May 26, 2026

Six days of shipping QualityMax from my phone

A Faroe Islands trip May 18–23, 43 PRs landed, zero days at a desk, and one revert that proved why the gates exist.

Read more →

Free Tier

May 19, 2026

You're already paying for an AI subscription. Get the full QA loop for free.

The free tier is the product, not a trial. Bring your existing Claude Code or Codex subscription, get crawl → generate → run → fix without leaving your terminal — plus 60 free isolated cloud-sandbox minutes a month for the runs that shouldn't live on a laptop.

Read more →

Dogfooding

May 13, 2026

We built our iPhone app in 4 days — without being a mobile testing platform

QualityMax started as a web E2E testing platform. This week we shipped our own iPhone app to TestFlight. The next day, the new app caught its own production bug end-to-end in under 20 minutes. Here's how the dogfooding loop closed.

Read more →

Announcement

May 2026

qmax-code is now open source

The Go + Charm TUI agent that orchestrates Claude over the QualityMax API is now public on GitHub. Read every line, fork it, send PRs — FSL-1.1-ALv2 (converts to Apache 2.0 in 2 years).

Read more →

Engineering

May 2026

qmax-code 1.13: Claude Code and Codex on QA Steroids

v1.13 doesn't just find the bug: it patches it in your terminal while you watch. Multi-model routing, in-terminal auto-fix, instant PR security review. Works free with your existing Claude Code or Codex subscription.

Read more →

Dogfooding

April 2026

We Redesigned 14 Landing Pages — Through Our Own AI Review Gates

22 commits, 3 PRs, a 5-persona AI review, every commit gated by SAST + prompt-injection + brute-force checks. If we don’t trust our pipeline with our own brand pages, why would you?

Read more →

Engineering

April 2026

Teaching the Reviewer: How 👍/👎 on a PR Comment Rewires the Next Review

A single click on a QualityMax PR comment becomes durable, per-repo knowledge the reviewer retrieves on the next PR. Here’s the plumbing — three feedback channels, one storage layer, and the GitHub-webhook limitation that forced us to build a poller.

Read more →

Analysis

April 2026

Two Posts, Same Day: The Gap Between AI Policy and Vibe Coding

One mature engineering org writes a 27-page AI policy with the rule “if you can’t explain the code, don’t commit it.” One workshop ships 10 live websites in an afternoon with Lovable and Cursor. The gap between them is the whole QualityMax market.

Read more →

Engineering

April 2026

The Möbius Strip QA Loop: When the Tool Tests Itself

Most QA tools sit outside the code they test. QualityMax sits inside — and now monitors its own errors, generates its own regression tests, closes its own loops. A single-sided surface where tool and target merge.

Read more →

Product Update

April 2026

Your AI Reviewer Now Asks What You Care About

Interactive calibration for AI code reviews: pick which categories to check, which to skip, and get structured findings with a one-command fix for your LLM agent.

Read more →

Anthropic status page showing claude.ai partial outage and Claude Code degraded

Engineering

April 2026

When Claude Goes Down, Your Tests Shouldn't

Today's Anthropic outage left claude.ai partially down and Claude Code degraded. Every AI test platform built on a single LLM provider went down with it. Here's why QualityMax routes per-task across Claude, GPT, and Gemini, and what that costs.

Read more →

Engineering

April 2026

Building qmax-code: Why We Built Our Own AI Testing Agent

7,951 lines of Go. Charm framework TUI. 48 MCP tools. Not based on Claude Code. Two tools, one mission — here's the engineering story.

Read more →

Analysis

April 2026

AI Coding Agents Are Secured in the Wrong Direction

The Claude Code source leak reveals an industry-wide gap: AI tools invest in containing the agent but barely verify whether the code it produces is secure. 4% of GitHub commits are now AI-generated. Who's checking them?

Read more →

Real Incident

March 2026

We Got Brute-Forced on Launch Day

We posted our vibe-check page on Reddit and Hacker News. 1,145 users came. So did a brute-force attack that blew through our Resend email quota in 4 minutes.

Read more →

Engineering

February 2026

Building the Matrix Demo

Behind the scenes of our interactive demo page — boot sequences, the red pill / blue pill choice, a chat-driven AI terminal, and live Playwright execution in the browser.

Read more →

Security

February 2026

Building a Hostile Site to Test Our AI

How we created an adversary website full of prompt injections, XSS traps, and redirect loops to stress-test our AI crawl pipeline — and what we learned.

Read more →
