-
Finding vulnerabilities in modern web apps using Claude Code and OpenAI Codex. Super interesting to see some benchmarks.
Traditional rule based detection can’t find complex vulnerabilities and even potentially detectable issues might go unnoticed as false negatives. This helps answer the question whether LLM could be integrated to cover this blind spot.
They could! But the problem is the noise:
# / #llmAI Coding Agents Find Real Vulnerabilities: Claude Code found 46 vulnerabilities (14% true positive rate – TPR, 86% false positive rate – FPR) and Codex reported 21 vulnerabilities (18% TPR, 82% FPR). About 20 of these are high severity vulnerabilities.
- The nx Breach
- How to Sell to Security Teams
- Foundations of a Multi-Cloud Security Strategy
- The Quirks of Apps Script and Google Cloud
- Google Cloud Service Accounts Security Best Practices
- A Practical Introduction to Container Security
- Google Cloud IAM for Security Teams
-
I’ve built a directory of open source cloud security tools.
A good part of my day to day is spent trying to automate away problems. Over the years I learned how to invest my time wisely, and I made a habit to research and use available tools before start coding my own.
As a consequence I have a fairly large collection of utilities I keep nurturing, alongside references, commands and debugging adventures.
I thought I could as well make it public, so here we are: check it out.
Every tool has a page were I (will) store my own notes: see my very own docker-security as an example. I am still cleaning up most of them and I will publish a bit per time.
I’ll also commit a more compact version to github.
In the meanwhile if you have some tools to share please do!
# / #tools #aws #gcp #azure - How to find and delete idle GCP Projects