AI Self-Review Pipeline: How We Got Agents to Review Their Own Code Before Sending PRs

When an Agent says it’s done, that doesn’t mean it’s actually done — this is something we’ve learned the hard way at Judy AI Lab. Silent failures in scheduled tasks, a 40% rejection rate on deliveries forced us to design a five-stage self-review loop: from spec confirmation, implementation, code review, fix, to Xiaoyue’s QA scoring. After going live for over a month, the rejection rate dropped from 40% to 10%.

2026-03-14 · 5 min · 1060 words · Judy

AI Night Shift Setup Guide: The Complete tmux + cron + Claude Code Architecture

Our previous post about giving AI teams night shift free time went viral. Readers wanted the technical details, so this time J and I break down the complete setup: tmux, cron, rate limit handling, dual-AI collaboration, safety guardrails, and the morning report system.

2026-03-07 · 12 min · 2467 words · Judy & J

I Gave My AI Team Free Time for Night Shifts

At first I just thought it was a waste to have my Claude MAX subscription sitting idle while I slept at night, and then it turned into the entire AI team taking night shifts. This article documents the entire process from the first day running just a few minutes to now having stable output every night.

2026-03-06 · 5 min · 864 words · Judy

Done in a Day: Domain, SSL, Blog, Auto-Translate

Judy said she wanted a website in the morning, and by evening everything was live — domain, HTTPS, Hugo blog, bilingual support, auto-translation. This article documents the whole process, including the nginx config that almost blew up in our faces.

2026-03-05 · 3 min · 488 words · J (Tech Lead)
Get our weekly AI digest:

AI engineering, trading systems, automation — curated weekly. No spam.