Cloudberry Engineering

2025-09-03

Finding vulnerabilities in modern web apps using Claude Code and OpenAI Codex. Super interesting to see some benchmarks.

Traditional rule based detection can’t find complex vulnerabilities and even potentially detectable issues might go unnoticed as false negatives. This helps answer the question whether LLM could be integrated to cover this blind spot.

They could! But the problem is the noise:

AI Coding Agents Find Real Vulnerabilities: Claude Code found 46 vulnerabilities (14% true positive rate – TPR, 86% false positive rate – FPR) and Codex reported 21 vulnerabilities (18% TPR, 82% FPR). About 20 of these are high severity vulnerabilities.
# / #llm
2025-09-01 The nx Breach
2024-05-15 How to Sell to Security Teams
2020-11-28 Foundations of a Multi-Cloud Security Strategy
2020-11-15 The Quirks of Apps Script and Google Cloud
2020-11-08 Google Cloud Service Accounts Security Best Practices
2020-11-01 A Practical Introduction to Container Security
2020-10-25 Google Cloud IAM for Security Teams
2020-10-18

I’ve built a directory of open source cloud security tools.

A good part of my day to day is spent trying to automate away problems. Over the years I learned how to invest my time wisely, and I made a habit to research and use available tools before start coding my own.

As a consequence I have a fairly large collection of utilities I keep nurturing, alongside references, commands and debugging adventures.

I thought I could as well make it public, so here we are: check it out.

Every tool has a page were I (will) store my own notes: see my very own docker-security as an example. I am still cleaning up most of them and I will publish a bit per time.

I’ll also commit a more compact version to github.

In the meanwhile if you have some tools to share please do!
# / #tools #aws #gcp #azure
2020-10-11 How to find and delete idle GCP Projects