What is Static Code Analysis?

OWASP Definition:

Static Code Analysis (also known as Source Code Analysis) is usually performed as part of a Code Review (also known as white-box testing) and is carried out at the Implementation phase of a Security Development Lifecycle (SDL).

Static Code Analysis commonly refers to the running of Static Code Analysis tools that attempt to highlight possible vulnerabilities within 'static' (non-running) source code by using techniques such as Taint Analysis and Data Flow Analysis.

Automating Source Code Review

Applications come in all shapes and sizes

From 20 lines to over 100 Million lines!!

Security Engineers / Researchers only can do some much

Generally sandbox and limited time
How much code can they review in the time they have?
Covering all security patterns?

Need to automate the discovery of security issues

This is where Static Code Analysis comes into play

Security Patterns

Two general types

Using something insecure

Configurations / Setting / Uses Dangerous Functionality
"Is debugging set to True?"

Data flows into somewhere insecure

User Input => [some other stuff] => sql.execute(input)
"Are request parameters get used to build a SQL query?"

In Static Analysis these are called Rules or Queries

x = input() y = "Hello " z = y + x print(x) # <- Data Flow Analysis (data comes directly from input ) print(y) # Static string print(z) # <- Taint Flow Analysis (data is propagated through by x being concatenated)

from flask import Flask, render_template app = Flask("MyApp") @app.route("/") def index(): return render_template("index.html") if __name__ == "__main__": app.run("0.0.0.0", 80, debug=True)

from sca import ast, cfg, dfg, results sources = ast.getType("bool").getValue("True") flask = ast.findImport("flask").getExpr("Flask") sinks = flask.getCall("run").getParameters("debug") or flask.getCall("run").getParameters(2) results = dfg.taint(sources, sinks)

from flask import Flask, request, render_template, make_response # ... @app.route("/search") def search(): query = request.args.get("s") results = lookup(query) if len(results) > 0: return render_template("search.html", results=results) else: return make_response("No results found for: " + query, 404)

name: "Cross Site Scripting" sources: flask: - "flask.request.args[]" - "flask.request.args.get()" sinks: flask: - "flask.make_response([0])" - "flask.Response([0]){}" - "flask.render_template_string([0])" - "flask.abort([2])"

from sca import ast, cfg, dfg, results flask = ast.findImport("flask") # `args.get('x')` or `args['x']` sources = flask.getMember("request").getMember("args").getUses() routes = flask.findDecorator("route").getCall() # def search(...): sinks = ast.getType("str").getExpr() & routes.getReturns() results = dfg.taint(sources, sinks)

Modeling

Models != Modeling

Researching a framework, library, or module
- Flask, Django, etc.
Creating reuseable models for the Static Analyser
- "User Inputs"
  - flask.request.args[], etc.
- "XSS Sinks"
  - flask.make_response([0]), etc.

from sca import dfg, results from sca.flask import flask_sources, flask_sinks_xss, flask_sanitizers_xss # Or even easier... from sca.web import web_sources, web_sinks_xss, sanitizers_xss # XSS Query in a couple of lines results = dfg.taint(web_sources, web_sinks_xss, sanitizers_xss)

from flask import Flask, request, render_template, escape # ... @app.route("/search") def search(): query = request.args.get("s") results = lookup(query) if len(results) > 0: return render_template("search.html", results=results) else: return "No results found for: " + escape(query)

from flask import Flask, request, render_template from pymysql.converters import escape_string # ... @app.route("/search") def search(): query = request.args.get("s") results = lookup(query) if len(results) > 0: return render_template("search.html", results=results) else: return "No results found for: " + escape_string(query)

# > inline output = escape(input()) # > direct (secure function just checks for non-alpha/non-int chars) if secure(output): output = input() else: output = "Error, insecure value passed in" # > indirect (validator) if not secure(output): output = "Error, insecure value passed in" return output output = input()

Introduction to Static Code Analysis

# Whoami

Today's Talk

What is Static Code Analysis?

What is Static Code Analysis?

Automating Source Code Review

Static Analysis Pipeline / Workflow

Code & Models

Security Patterns

Results Produced

Warning: Here be Dragons

Before we begin: Glossary

Types of Static Analysis Tools

How is Static Code Analysis done?

Static Analysis Pipeline / Workflow

Static Code Analysis Parsing

Compiler and Interpreter Pipelines

So how do Static Code Analysis tools do it?

Abstract Syntax Tree (AST)

Example - Abstract Syntax Tree

Example - Abstract Syntax Tree

Example - Abstract Syntax Tree (web app)

Control Flow Graph (CFG)

Example - Control Flow Graph

Showcase - Radare2 CFG

Data Flow Graph (DFG)

Example - Simple Application + DFG

Static Analysis Pipeline / Workflow

Data Flow & Taint Flow Analysis

Example - Data Flow and Taint Flow

Taint Analysis

Patterns - Rules & Queries

Configuration Rules or Dynamic Queries

Just use Regex!?

Example - Detecting Simple Configuration Problems

Configuration Rules - Basic Configuration Queries

Dynamic Queries - Language for Querying

Example - Simple Taint Flow

Configuration Rules - Data Flow Queries

Dynamic Queries - Language for Querying

Modeling

Example - Modeling

Sanitizers

Example - XSS but using Sanitizer

Example - Using another Sanitizer...

Context is so important!

Example - Hashing

Answer: It Depends on context

Example - Hashing attempt 2

Answer: It Depends on context

Sanitizers - Inline, Direct, and Indirect

Passthroughs / Taintstep's

Example - Passthroughs / Taintstep's

Results - Static Analysis Final Step

Congratulations:

Here not be Dragons , Here be Security

Conclusion

The Pros

The Cons

Thanks to...

Thanks you, any questions?