Static Application Security Testing (SAST): The Good, the Bad, and the Ugly
Static Application Security Testing (SAST) promises early vulnerability detection directly from source code. But how effective is it in practice? This guide explores where SAST tools excel, where modeling and rule-based detection break down, and what security and engineering teams should realistically expect from static analysis.
Static Application Security Testing (SAST) refers to tools that analyze source code to identify potential security vulnerabilities without executing the application. In simple terms, one application analyzes another and flags suspicious patterns for review. SAST promises early detection, automation, and scalability. In practice, it delivers real value but also real frustration.
Let’s take a sober look at what SAST does well, where it struggles, and what engineering teams should realistically expect from it inside a modern SDLC.
What SAST Is (And How It Actually Works)
At its core, a SAST tool performs two major functions:
- Modeling the software
- Applying rules to detect vulnerability patterns
Modern SAST engines invest heavily in modeling code. That modeling may include:
- Lexical analysis
- Abstract Syntax Trees (AST)
- Data flow graphs
- Interprocedural call graphs
- Taint tracking across execution paths
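As a concrete illustration of the last item, taint tracking propagates an "untrusted" label from sources to sensitive sinks. Below is a minimal sketch over Python's own `ast` module; the `SOURCES`/`SINKS` sets are invented for illustration, and a real engine walks a control flow graph interprocedurally rather than a flat AST:

```python
import ast

# Hypothetical source/sink names, for illustration only.
SOURCES = {"input"}        # calls that return untrusted data
SINKS = {"eval", "exec"}   # calls that must never receive it

def find_tainted_sinks(code: str) -> list[int]:
    """Return line numbers where a taint source flows directly into a sink."""
    tree = ast.parse(code)
    tainted: set[str] = set()  # variables currently holding untrusted data
    findings = []
    for node in ast.walk(tree):
        # x = input(...)  ->  x becomes tainted
        if isinstance(node, ast.Assign) and isinstance(node.value, ast.Call):
            fn = node.value.func
            if isinstance(fn, ast.Name) and fn.id in SOURCES:
                for target in node.targets:
                    if isinstance(target, ast.Name):
                        tainted.add(target.id)
        # eval(x) where x is tainted -> report the sink's line
        if isinstance(node, ast.Call):
            fn = node.func
            if isinstance(fn, ast.Name) and fn.id in SINKS:
                for arg in node.args:
                    if isinstance(arg, ast.Name) and arg.id in tainted:
                        findings.append(node.lineno)
    return findings
```

Real engines extend this idea across procedure boundaries and also track sanitizers that clear the taint label along the way.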
On top of that model, vendors create rule sets designed to detect common vulnerability classes, typically aligned to taxonomies such as the OWASP Top 10 and the CWE/SANS Top 25.
In theory, this gives us structured, repeatable vulnerability detection directly from source code before deployment.
The Good
There’s a reason SAST remains a staple in secure SDLC pipelines.
Good SAST tools run entirely offline against source code, which makes them extremely efficient: no deployed environment, no test traffic, no running application. They slot naturally into CI pipelines, scale across repositories, and produce repeatable results on every commit.

The industry has invested significant engineering effort in modeling software, from lexical analysis and Abstract Syntax Trees (ASTs) through data flow graphs to full-blown program call graphs. It has invested even more in rules that pattern-match those models against common vulnerability types (e.g., the OWASP Top 10 and CWE/SANS Top 25). Where a vulnerability class maps cleanly onto a code pattern, detection is fast, cheap, and consistent.
The Bad
Anyone who has used a modern static code analysis tool will tell you that these rules are not perfect. The industry has essentially two measures of a SAST tool's effectiveness: the precision (notice how it doesn't say accuracy) and the recall of that tool, measured against industry benchmarks such as the OWASP Benchmark suite and against your own code base. Informally:

In pattern recognition, information retrieval, and binary classification, precision (also called positive predictive value) is the fraction of relevant instances among the retrieved instances. At the same time, recall (also known as sensitivity) is the fraction of relevant instances retrieved over the total number of relevant instances. Both precision and recall are therefore based on an understanding and measure of relevance.
In practical terms, if an application has 100 actual bugs, recall is the fraction of those 100 bugs that were reported. Precision is the number of true positives divided by the total number of reported findings (true positives plus false positives). The problem with SAST tools today is that an initial scan of a previously unscanned, medium-sized software project can return thousands of reported issues. This brings us to the first natural set of questions:
- Are there really that many bugs within my program?
- What is the precision of these findings reported?
- What are the assurances that recall (the software’s coverage of my application) is high?
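Both measures fall straight out of true/false positive counts. A quick sketch using the 100-bug scenario above (the specific counts are hypothetical):

```python
def precision(true_pos: int, false_pos: int) -> float:
    """Fraction of reported findings that are real bugs."""
    return true_pos / (true_pos + false_pos)

def recall(true_pos: int, false_neg: int) -> float:
    """Fraction of real bugs that were reported."""
    return true_pos / (true_pos + false_neg)

# An application with 100 real bugs; the scanner reports 400 findings,
# of which 60 are genuine.
tp, fp, fn = 60, 340, 40
print(precision(tp, fp))  # 0.15 -> 85% of the report is noise
print(recall(tp, fn))     # 0.6  -> 40% of the real bugs were missed
```

Note that a tool can trivially maximize recall by reporting everything, at the cost of precision, which is exactly the thousands-of-findings problem described above.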
The Ugly
The ugly answer to the questions above is that there is no assurance. Why? It comes down to the same two dimensions of how a SAST tool works: modeling and pattern-matching rules. Consider modeling first. SAST modeling is limited because it sees only the application's source code, and any security professional will tell you that vulnerabilities are exposed not only by the application but also by the system, the application's configuration, and how that system is deployed in a production environment.
Technologies that sit in the middle of a data flow make it difficult to model complete paths. Take a user posting on Twitter: the user enters text into a form and submits it via an HTTP POST to the Twitter server, where the tweet is processed and inserted into a database of user/tweet records. Elsewhere in the code base, those tweets are read back out of the database, processed, and pushed to any client requesting that user's tweets. These two call flows are disconnected by the database technology: from the analyzer's perspective, the write path and the read path are unrelated.

That disconnect raises questions the tool cannot answer. Was the tweet sanitized on insert, and if so, for which consumer's technology: HTML, JSON, or XML? Is the data sanitized again after retrieval, before being sent to the remote client? Does the serving layer escape the data for its display context? There are only a limited number of ways to insert data into a database and to fetch and sanitize it for a client, so vendors can partially model this, but the context is easily lost.

Server setup and configuration are harder still to model: source code gives no clear visibility into how a server is hardened or what its policies are for different use cases. Access control is harder again, because the way a system authenticates and authorizes a user can be implemented in an effectively unlimited number of ways, depending on the product's business requirements. From a vulnerability rules standpoint, detecting business-rule violations is at best a best-effort exercise.
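The tweet example can be boiled down to two functions that a static analyzer treats as unrelated, because the taint label is lost at the database boundary. A contrived sketch, where a dict stands in for the real datastore:

```python
import html

db: dict[str, str] = {}  # stands in for a real database table

# Path 1: the write side. The tweet text is untrusted, but once it is
# stored, static analysis loses track of its taint.
def store_tweet(user: str, text: str) -> None:
    db[user] = text  # no sanitization at insert time

# Path 2: the read side, often in a different module entirely. A scanner
# analyzing this function alone sees db.get(...) as clean data.
def render_tweet(user: str) -> str:
    text = db.get(user, "")
    # Correct fix: escape for the *output* context (HTML here).
    return "<p>" + html.escape(text) + "</p>"
```

A scanner analyzing `render_tweet` in isolation has no way to know the stored text is attacker-controlled; only the output-context escaping via `html.escape` makes the read path safe.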
Several other factors shape the results you will actually see:
- How the vendor's benchmark accuracy translates to your specific software project
- How often your software crosses into third-party tools and frameworks, which degrades data flow modeling
- Vulnerabilities that third-party dependencies themselves add to your code base
- How well your developer teams are educated in secure coding
Listed below are vulnerability categories where SAST tools tend to perform well, and categories where they do not.

Good at:
- Cross-site scripting
- SQL Injection
- Cross-Site Request Forgery
- Insecure logging
- 3rd Party Dependencies
- Information Leakage
- XML External Entity Injection
- Insecure deserialization
Not so good at:
- Insecure Access Control
- Cross-Origin Resource Sharing (CORS) misconfiguration
- Insecure direct object reference
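Insecure direct object reference shows why the second list is hard: the vulnerable and safe versions below are nearly identical syntactically, and the missing ownership check is a business rule no pattern can encode. The `Order` model and user names here are invented for illustration:

```python
from dataclasses import dataclass

@dataclass
class Order:
    id: int
    owner: str

ORDERS = {1: Order(1, "alice"), 2: Order(2, "bob")}

# Vulnerable: any authenticated user can fetch any order by id.
# Nothing in the syntax distinguishes this from safe code.
def get_order_insecure(current_user: str, order_id: int) -> Order:
    return ORDERS[order_id]

# Safe: the ownership check is a business rule that a SAST tool
# cannot infer from code patterns alone.
def get_order_secure(current_user: str, order_id: int) -> Order:
    order = ORDERS[order_id]
    if order.owner != current_user:
        raise PermissionError("not your order")
    return order
```

Whether the check is required at all depends on the product's authorization model, which is exactly the context a static pattern matcher lacks.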
How Teams Can Make SAST More Effective
Instead of asking “Is SAST good or bad?”, the better question is:
How do we reduce friction and increase signal?
1. Calibrate Rulesets
Disable noisy categories.
Tune thresholds.
Focus on high-confidence findings first.
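Calibration usually starts as a mechanical triage of the raw findings before a human ever looks at them. A toy sketch; the finding fields and rule names are illustrative, not any vendor's export format:

```python
# Findings as a SAST tool might export them (fields are illustrative).
findings = [
    {"rule": "sql-injection", "severity": "high", "confidence": "high"},
    {"rule": "weak-hash", "severity": "medium", "confidence": "high"},
    {"rule": "possible-xss", "severity": "high", "confidence": "low"},
    {"rule": "debug-log", "severity": "low", "confidence": "low"},
]

NOISY_RULES = {"debug-log"}  # categories the team has chosen to disable

def triage(findings):
    """Keep high-confidence findings outside disabled categories, worst first."""
    rank = {"high": 0, "medium": 1, "low": 2}
    kept = [f for f in findings
            if f["rule"] not in NOISY_RULES and f["confidence"] == "high"]
    return sorted(kept, key=lambda f: rank[f["severity"]])
```

The point of the design is ordering the backlog: developers burn out on low-confidence noise long before they run out of real findings.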
2. Treat It as Hygiene, Not Assurance
SAST improves baseline hygiene.
It does not prove your system is secure.
3. Invest in Developer Education
Many repeated findings stem from:
- Misunderstood frameworks
- Improper input handling
- Unsafe dependency use
Better education reduces findings upstream.
4. Combine With Other Controls
SAST works best as part of a layered approach:
- SAST
- Dependency scanning
- DAST
- IaC scanning
- Code review
- Runtime monitoring
Security is multi-dimensional. SAST covers one dimension.
Final Thoughts
SAST excels at detecting repeatable, pattern-based vulnerabilities in source code. It struggles with context-heavy, configuration-driven, and business-logic flaws. Its effectiveness depends heavily on precision, recall, and how closely your code aligns with vendor benchmarks.
Used thoughtfully, SAST reduces risk and improves hygiene.
Used blindly, it creates noise and false confidence.
The real value of SAST isn’t that it makes your application secure. It’s that it makes insecure patterns harder to ignore early on.
And in modern software development, early matters.