Welcome to Cloudberry Engineering
a notebook on building, breaking and securing systems by Gianluca Brindisi.
  • We are moving towards a place where ticketing systems will become an important component to protect, akin to CI/CD.

    Tickets are a new source of untrusted input we need to account for when threat modeling against prompt injections.

    Ghostty only allows maintainers to create issues, seems to me they figured out a cheap and pragmatic security policy by accident.

    # / #agents
  • Claude Code Sandbox
  • How I Think About Agentic Risks
  • Coding Agents Security Theater
  • Finding vulnerabilities in modern web apps using Claude Code and OpenAI Codex. Super interesting to see some benchmarks.

    Traditional rule based detection can’t find complex vulnerabilities and even potentially detectable issues might go unnoticed as false negatives. This helps answer the question whether LLM could be integrated to cover this blind spot.

    They could! But the problem is the noise:

    AI Coding Agents Find Real Vulnerabilities: Claude Code found 46 vulnerabilities (14% true positive rate – TPR, 86% false positive rate – FPR) and Codex reported 21 vulnerabilities (18% TPR, 82% FPR). About 20 of these are high severity vulnerabilities.

    # / #llm