The Optimizer

The Solidity compiler involves optimizations at three different levels (in order of execution):

  • Optimizations during code generation based on a direct analysis of Solidity code.

  • Optimizing transformations on the Yul IR code.

  • Optimizations at the opcode level.

The opcode-based optimizer applies a set of simplification rules to opcodes. It also combines equal code sets and removes unused code.

The Yul-based optimizer is much more powerful, because it can work across function calls. For example, arbitrary jumps are not possible in Yul, so it is possible to compute the side-effects of each function. Consider two function calls, where the first does not modify storage and the second does. If their arguments and return values do not depend on each other, we can reorder the function calls. Similarly, if a function is side-effect free and its result is multiplied by zero, the function call can be removed completely.
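For illustration, with hypothetical functions f, g and h:

let a := f() // assume f reads but does not modify storage
let b := g() // assume g modifies storage
// if the arguments and return values of f and g are independent,
// the two calls can be reordered

and, assuming h is side-effect free,

let x := mul(h(), 0)

can be simplified to

let x := 0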

The codegen-based optimizer affects the initial low-level code produced from the Solidity input. In the legacy pipeline, the bytecode is generated immediately and most of the optimizations of this kind are implicit and not configurable, the only exception being an optimization which changes the order of literals in binary operations. The IR-based pipeline takes a different approach and produces Yul IR closely matching the structure of the Solidity code, with nearly all optimizations deferred to the Yul optimizer module. In that case, codegen-level optimization is done only in very limited cases which are difficult to handle in Yul IR, but are straightforward with the high-level information from the analysis phase at hand. An example of such an optimization is the bypass of checked arithmetic when incrementing the counter in certain idiomatic for loops.

Currently, the parameter --optimize activates the opcode-based optimizer for the generated bytecode and the Yul optimizer for the Yul code generated internally, for example for ABI coder v2. One can use solc --ir-optimized --optimize to produce an optimized Yul IR for a Solidity source. Similarly, one can use solc --strict-assembly --optimize for a stand-alone Yul mode.

Note

Some optimizer steps, such as the peephole optimizer and the unchecked loop increment optimizer, are always enabled by default and can only be turned off via the Standard JSON.

Note

An empty optimizer sequence, i.e. :, is accepted even without --optimize in order to fully disable the user-supplied portion of the Yul optimizer sequence, since by default, even when the optimizer is not turned on, the unused pruner step will be run.

You can find more details on both optimizer modules and their optimization steps below.

Benefits of Optimizing Solidity Code

Overall, the optimizer tries to simplify complicated expressions, which reduces both code size and execution cost, i.e., it can reduce the gas needed for contract deployment as well as for external calls made to the contract. It also specializes or inlines functions. Function inlining in particular can produce much bigger code, but it is often done because it creates opportunities for further simplifications.

Differences between Optimized and Non-Optimized Code

Generally, the most visible difference is that constant expressions are evaluated at compile time. When it comes to the ASM output, one can also notice a reduction of equivalent or duplicate code blocks (compare the output of the flags --asm and --asm --optimize). However, when it comes to the Yul/intermediate-representation, there can be significant differences, for example, functions may be inlined, combined, or rewritten to eliminate redundancies, etc. (compare the output between the flags --ir and --optimize --ir-optimized).

Optimizer Parameter Runs

The number of runs (--optimize-runs) specifies roughly how often each opcode of the deployed code will be executed across the life-time of the contract. This means it is a trade-off parameter between code size (deploy cost) and code execution cost (cost after deployment). A “runs” parameter of “1” will produce short but expensive code. In contrast, a larger “runs” parameter will produce longer but more gas efficient code. The maximum value of the parameter is 2**32-1.
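For example (MyContract.sol being a placeholder file name), a contract that is deployed once and rarely called could be compiled with a low value, while a frequently called contract could use a higher one:

solc --optimize --optimize-runs 1 MyContract.sol
solc --optimize --optimize-runs 10000 MyContract.sol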

Note

A common misconception is that this parameter specifies the number of iterations of the optimizer. This is not true: The optimizer will always run as many times as it can still improve the code.

Opcode-Based Optimizer Module

The opcode-based optimizer module operates on assembly code. It splits the sequence of instructions into basic blocks at JUMPs and JUMPDESTs. Inside these blocks, the optimizer analyzes the instructions and records every modification to the stack, memory, or storage as an expression which consists of an instruction and a list of arguments which are pointers to other expressions.

Additionally, the opcode-based optimizer uses a component called “CommonSubexpressionEliminator” that, amongst other tasks, finds expressions that are always equal (on every input) and combines them into an expression class. It first tries to find each new expression in a list of already known expressions. If no such matches are found, it simplifies the expression according to rules like constant + constant = sum_of_constants or X * 1 = X. Since this is a recursive process, we can also apply the latter rule if the second factor is a more complex expression which we know always evaluates to one.

Certain optimizer steps symbolically track the storage and memory locations. For example, this information is used to compute Keccak-256 hashes that can be evaluated during compile time. Consider the sequence:

PUSH 32
PUSH 0
CALLDATALOAD
PUSH 100
DUP2
MSTORE
KECCAK256

or the equivalent Yul

let x := calldataload(0)
mstore(x, 100)
let value := keccak256(x, 32)

In this case, the optimizer tracks the value at the memory location calldataload(0) and then realizes that the Keccak-256 hash can be evaluated at compile time. This only works if there is no other instruction that modifies memory between the mstore and keccak256. So if there is an instruction that writes to memory (or storage), then we need to erase the knowledge of the current memory (or storage). There is, however, an exception to this erasure when we can easily see that the instruction does not write to a certain location.

For example,

let x := calldataload(0)
mstore(x, 100)
// Current knowledge memory location x -> 100
let y := add(x, 32)
// Does not clear the knowledge that x -> 100, since y does not write to [x, x + 32)
mstore(y, 200)
// This Keccak-256 can now be evaluated
let value := keccak256(x, 32)

Therefore, modifications to storage and memory locations, of, say, location l, must erase knowledge about storage or memory locations which may be equal to l. More specifically, for storage, the optimizer has to erase all knowledge of symbolic locations that may be equal to l, and for memory, the optimizer has to erase all knowledge of symbolic locations that may not be at least 32 bytes away. If m denotes an arbitrary location, then this decision on erasure is done by computing the value sub(l, m). For storage, if this value evaluates to a literal that is non-zero, then the knowledge about m will be kept. For memory, if the value evaluates to a literal that is between 32 and 2**256 - 32, then the knowledge about m will be kept. In all other cases, the knowledge about m will be erased.
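A sketch of this rule in Yul, building on the example above:

let x := calldataload(0)
mstore(x, 100)
// knowledge: memory location x -> 100
mstore(add(x, 64), 200)
// sub(add(x, 64), x) evaluates to the literal 64, which lies between
// 32 and 2**256 - 32, so the knowledge that x -> 100 is kept
mstore(calldataload(0x20), 42)
// sub(calldataload(0x20), x) does not evaluate to a literal,
// so the knowledge that x -> 100 is erased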

After this process, we know which expressions have to be on the stack at the end, and have a list of modifications to memory and storage. This information is stored together with the basic blocks and is used to link them. Furthermore, knowledge about the stack, storage and memory configuration is forwarded to the next block(s).

If we know the targets of all JUMP and JUMPI instructions, we can build a complete control flow graph of the program. If there is only one target we do not know (this can happen as in principle, jump targets can be computed from inputs), we have to erase all knowledge about the input state of a block as it can be the target of the unknown JUMP. If the opcode-based optimizer module finds a JUMPI whose condition evaluates to a constant, it transforms it to an unconditional jump.

As the last step, the code in each block is re-generated. The optimizer creates a dependency graph from the expressions on the stack at the end of the block, and it drops every operation that is not part of this graph. It generates code that applies the modifications to memory and storage in the order they were made in the original code (dropping modifications which were found not to be needed). Finally, it generates all values that are required to be on the stack in the correct place.

These steps are applied to each basic block and the newly generated code is used as replacement if it is smaller. If a basic block is split at a JUMPI and during the analysis, the condition evaluates to a constant, the JUMPI is replaced based on the value of the constant. Thus code like

uint x = 7;
data[7] = 9;
if (data[x] != x + 2) // this condition is never true
  return 2;
else
  return 1;

simplifies to this:

data[7] = 9;
return 1;

Simple Inlining

Since Solidity version 0.8.2, there is another optimizer step that replaces certain jumps to blocks containing “simple” instructions ending with a “jump” by a copy of these instructions. This corresponds to inlining of simple, small Solidity or Yul functions. In particular, the sequence PUSHTAG(tag) JUMP may be replaced whenever the JUMP is marked as a jump “into” a function and behind tag there is a basic block (as described above for the “CommonSubexpressionEliminator”) that ends in another JUMP which is marked as a jump “out of” a function.

Consider the following prototypical example of assembly generated for a call to an internal Solidity function:

  tag_return
  tag_f
  jump      // in
tag_return:
  ...opcodes after call to f...

tag_f:
  ...body of function f...
  jump      // out

As long as the body of the function is a continuous basic block, the “Inliner” can replace tag_f jump by the block at tag_f resulting in:

  tag_return
  ...body of function f...
  jump
tag_return:
  ...opcodes after call to f...

tag_f:
  ...body of function f...
  jump      // out

Now ideally, the other optimizer steps described above will result in the return tag push being moved towards the remaining jump resulting in:

  ...body of function f...
  tag_return
  jump
tag_return:
  ...opcodes after call to f...

tag_f:
  ...body of function f...
  jump      // out

In this situation the “PeepholeOptimizer” will remove the return jump. Ideally, all of this can be done for all references to tag_f, leaving it unused so that it can be removed, yielding:

...body of function f...
...opcodes after call to f...

So the call to function f is inlined and the original definition of f can be removed.

Inlining like this is attempted whenever a heuristic suggests that inlining is cheaper over the lifetime of a contract than not inlining. This heuristic depends on the size of the function body, the number of other references to its tag (approximating the number of calls to the function) and the expected number of executions of the contract (the global optimizer parameter “runs”).

Yul-Based Optimizer Module

The Yul-based optimizer consists of several stages and components that all transform the AST in a semantically equivalent way. The goal is to end up either with code that is shorter or at least only marginally longer but will allow further optimization steps.

Warning

Since the optimizer is under heavy development, the information here might be outdated. If you rely on a certain functionality, please reach out to the team directly.

The optimizer currently follows a purely greedy strategy and does not do any backtracking.

All components of the Yul-based optimizer module are explained below. The following transformation steps are the main components:

  • SSA Transform

  • Common Subexpression Eliminator

  • Expression Simplifier

  • Unused Assign Eliminator

  • Full Inliner

Optimizer Steps

This is a list of all steps of the Yul-based optimizer, sorted alphabetically by full name. You can find more information on the individual steps and their sequence below.

Abbreviation   Full name
f              BlockFlattener
l              CircularReferencesPruner
c              CommonSubexpressionEliminator
C              ConditionalSimplifier
U              ConditionalUnsimplifier
n              ControlFlowSimplifier
D              DeadCodeEliminator
E              EqualStoreEliminator
v              EquivalentFunctionCombiner
e              ExpressionInliner
j              ExpressionJoiner
s              ExpressionSimplifier
x              ExpressionSplitter
I              ForLoopConditionIntoBody
O              ForLoopConditionOutOfBody
o              ForLoopInitRewriter
i              FullInliner
g              FunctionGrouper
h              FunctionHoister
F              FunctionSpecializer
T              LiteralRematerialiser
L              LoadResolver
M              LoopInvariantCodeMotion
m              Rematerialiser
V              SSAReverser
a              SSATransform
t              StructuralSimplifier
r              UnusedAssignEliminator
p              UnusedFunctionParameterPruner
S              UnusedStoreEliminator
u              UnusedPruner
d              VarDeclInitializer

Some steps depend on properties ensured by BlockFlattener, FunctionGrouper, and ForLoopInitRewriter. For this reason, the Yul optimizer always applies them before applying any steps supplied by the user.

Selecting Optimizations

By default the optimizer applies its predefined sequence of optimization steps to the generated assembly. You can override this sequence and supply your own using the --yul-optimizations option:

solc --optimize --ir-optimized --yul-optimizations 'dhfoD[xarrscLMcCTU]uljmul:fDnTOcmu'

The order of steps is significant and affects the quality of the output. Moreover, applying a step may uncover new optimization opportunities for others that were already applied, so repeating steps is often beneficial.

The sequence inside [...] will be applied multiple times in a loop until the Yul code remains unchanged or until the maximum number of rounds (currently 12) has been reached. Brackets ([]) may be used multiple times in a sequence, but can not be nested.

An important thing to note is that there are some hardcoded steps that are always run before and after the user-supplied sequence, or the default sequence if one was not supplied by the user.

The cleanup sequence delimiter : is optional, and is used to supply a custom cleanup sequence in order to replace the default one. If omitted, the optimizer will simply apply the default cleanup sequence. In addition, the delimiter may be placed at the beginning of the user-supplied sequence, which will result in the optimization sequence being empty, whereas conversely, if placed at the end of the sequence, will be treated as an empty cleanup sequence.
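As illustrative invocations (the step sequences here are arbitrary examples):

# custom optimization sequence and custom cleanup sequence
solc --optimize --ir-optimized --yul-optimizations 'dhfo:fD'
# empty optimization sequence, custom cleanup sequence
solc --optimize --ir-optimized --yul-optimizations ':fD'
# custom optimization sequence, empty cleanup sequence
solc --optimize --ir-optimized --yul-optimizations 'dhfo:'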

Preprocessing

The preprocessing components perform transformations to get the program into a certain normal form that is easier to work with. This normal form is kept during the rest of the optimization process.

Disambiguator

The disambiguator takes an AST and returns a fresh copy in which all identifiers have names that are unique within the AST. This is a prerequisite for all other optimizer stages. One of the benefits is that identifier lookup does not need to take scopes into account, which simplifies the analysis needed for other steps.

All subsequent stages have the property that all names stay unique. This means if a new identifier needs to be introduced, a new unique name is generated.
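A minimal sketch (the exact names generated by the compiler may differ):

{
    { let x := 1 mstore(0, x) }
    { let x := 2 mstore(0x20, x) }
}

is transformed to

{
    { let x := 1 mstore(0, x) }
    { let x_1 := 2 mstore(0x20, x_1) }
}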

FunctionHoister

The function hoister moves all function definitions to the end of the topmost block. This is a semantically equivalent transformation as long as it is performed after the disambiguation stage. The reason is that moving a definition to a higher-level block cannot decrease its visibility and it is impossible to reference variables defined in a different function.

The benefit of this stage is that function definitions can be looked up more easily and functions can be optimized in isolation without having to traverse the AST completely.
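A minimal sketch:

{
    {
        function f(a) -> b { b := add(a, 1) }
        sstore(0, f(7))
    }
}

is transformed to

{
    {
        sstore(0, f(7))
    }
    function f(a) -> b { b := add(a, 1) }
}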

FunctionGrouper

The function grouper has to be applied after the disambiguator and the function hoister. Its effect is that all topmost elements that are not function definitions are moved into a single block which is the first statement of the root block.

After this step, a program has the following normal form:

{ I F... }

Where I is a (potentially empty) block that does not contain any function definitions (not even recursively) and F is a list of function definitions such that no function contains a function definition.

The benefit of this stage is that we always know where the list of functions begins.
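A minimal sketch, assuming the function hoister has already run:

{
    mstore(0, 1)
    sstore(0, f())
    function f() -> x { x := 2 }
}

is transformed to

{
    {
        mstore(0, 1)
        sstore(0, f())
    }
    function f() -> x { x := 2 }
}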

ForLoopConditionIntoBody

This transformation moves the loop-iteration condition of a for-loop into the loop body. We need this transformation because the ExpressionSplitter will not be applied to iteration condition expressions (the C in the following example).

for { Init... } C { Post... } {
    Body...
}

is transformed to

for { Init... } 1 { Post... } {
    if iszero(C) { break }
    Body...
}

This transformation can also be useful when paired with LoopInvariantCodeMotion, since invariants in the loop-invariant conditions can then be taken outside the loop.

ForLoopInitRewriter

This transformation moves the initialization part of a for-loop to before the loop:

for { Init... } C { Post... } {
    Body...
}

is transformed to

Init...
for {} C { Post... } {
    Body...
}

This eases the rest of the optimization process because we can ignore the complicated scoping rules of the for loop initialization block.

VarDeclInitializer

This step rewrites variable declarations so that all of them are initialized. Declarations like let x, y are split into multiple declaration statements.

For now, this step only supports initializing with the zero literal.
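For example:

let x, y

is transformed to

let x := 0
let y := 0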

Pseudo-SSA Transformation

The purpose of these components is to get the program into a longer form, so that other components can more easily work with it. The final representation will be similar to a static-single-assignment (SSA) form, with the difference that it does not make use of explicit “phi” functions which combine the values from different branches of control flow, because such a feature does not exist in the Yul language. Instead, when control flow merges, if a variable is re-assigned in one of the branches, a new SSA variable is declared to hold its current value, so that the following expressions still only need to reference SSA variables.

An example transformation is the following:

{
    let a := calldataload(0)
    let b := calldataload(0x20)
    if gt(a, 0) {
        b := mul(b, 0x20)
    }
    a := add(a, 1)
    sstore(a, add(b, 0x20))
}

When all the following transformation steps are applied, the program will look as follows:

{
    let _1 := 0
    let a_9 := calldataload(_1)
    let a := a_9
    let _2 := 0x20
    let b_10 := calldataload(_2)
    let b := b_10
    let _3 := 0
    let _4 := gt(a_9, _3)
    if _4
    {
        let _5 := 0x20
        let b_11 := mul(b_10, _5)
        b := b_11
    }
    let b_12 := b
    let _6 := 1
    let a_13 := add(a_9, _6)
    let _7 := 0x20
    let _8 := add(b_12, _7)
    sstore(a_13, _8)
}

Note that the only variable that is re-assigned in this snippet is b. This re-assignment cannot be avoided because b has different values depending on the control flow. All other variables never change their value once they are defined. The advantage of this property is that variables can be freely moved around and references to them can be exchanged by their initial value (and vice-versa), as long as these values are still valid in the new context.

Of course, the code here is far from being optimized. On the contrary, it is much longer. The hope is that this code will be easier to work with and, furthermore, there are optimizer steps that undo these changes and make the code more compact again at the end.

ExpressionSplitter

The expression splitter turns expressions like add(mload(0x123), mul(mload(0x456), 0x20)) into a sequence of declarations of unique variables that are assigned sub-expressions of that expression so that each function call has only variables as arguments.

The above would be transformed into

{
    let _1 := 0x20
    let _2 := 0x456
    let _3 := mload(_2)
    let _4 := mul(_3, _1)
    let _5 := 0x123
    let _6 := mload(_5)
    let z := add(_6, _4)
}

Note that this transformation does not change the order of opcodes or function calls.

It is not applied to the loop iteration condition, because the loop control flow does not allow this “outlining” of the inner expressions in all cases. We can sidestep this limitation by applying ForLoopConditionIntoBody to move the iteration condition into the loop body.

The final program should be in an expression-split form, where (with the exception of loop conditions) function calls cannot appear nested inside expressions and all function call arguments have to be variables.

The benefits of this form are that it is much easier to re-order the sequence of opcodes and it is also easier to perform function call inlining. Furthermore, it is simpler to replace individual parts of expressions or re-organize the “expression tree”. The drawback is that such code is much harder to read for humans.

SSATransform

This stage tries to replace repeated assignments to existing variables by declarations of new variables as much as possible. The reassignments are still there, but all references to the reassigned variables are replaced by the newly declared variables.

Example:

{
    let a := 1
    mstore(a, 2)
    a := 3
}

is transformed to

{
    let a_1 := 1
    let a := a_1
    mstore(a_1, 2)
    let a_3 := 3
    a := a_3
}

Exact semantics:

For any variable a that is assigned to somewhere in the code (variables that are declared with value and never re-assigned are not modified) perform the following transforms:

  • replace let a := v by let a_i := v   let a := a_i

  • replace a := v by let a_i := v   a := a_i where i is a number such that a_i is yet unused.

Furthermore, always record the current value of i used for a and replace each reference to a by a_i. The current value mapping is cleared for a variable a at the end of each block in which it was assigned to and at the end of the for loop init block if it is assigned inside the for loop body or post block. If a variable’s value is cleared according to the rule above and the variable is declared outside the block, a new SSA variable will be created at the location where control flow joins, this includes the beginning of loop post/body block and the location right after If/Switch/ForLoop/Block statement.

After this stage, the Unused Assign Eliminator is recommended to remove the unnecessary intermediate assignments.

This stage provides best results if the Expression Splitter and the Common Subexpression Eliminator are run right before it, because then it does not generate excessive amounts of variables. On the other hand, the Common Subexpression Eliminator could be more efficient if run after the SSA transform.

UnusedAssignEliminator

The SSA transform always generates an assignment of the form a := a_i, even though these might be unnecessary in many cases, like the following example:

{
    let a := 1
    a := mload(a)
    a := sload(a)
    sstore(a, 1)
}

The SSA transform converts this snippet to the following:

{
    let a_1 := 1
    let a := a_1
    let a_2 := mload(a_1)
    a := a_2
    let a_3 := sload(a_2)
    a := a_3
    sstore(a_3, 1)
}

The Unused Assign Eliminator removes all three assignments to a, because the value of a is never used, and thus turns this snippet into strict SSA form:

{
    let a_1 := 1
    let a_2 := mload(a_1)
    let a_3 := sload(a_2)
    sstore(a_3, 1)
}

Of course the intricate parts of determining whether an assignment is unused or not are connected to joining control flow.

The component works as follows in detail:

The AST is traversed twice: in an information gathering step and in the actual removal step. During information gathering, we maintain a mapping from assignment statements to the three states “unused”, “undecided” and “used” which signifies whether the assigned value will be used later by a reference to the variable.

When an assignment is visited, it is added to the mapping in the “undecided” state (see remark about for loops below) and every other assignment to the same variable that is still in the “undecided” state is changed to “unused”. When a variable is referenced, the state of any assignment to that variable still in the “undecided” state is changed to “used”.

At points where control flow splits, a copy of the mapping is handed over to each branch. At points where control flow joins, the two mappings coming from the two branches are combined in the following way: Statements that are only in one mapping or have the same state are used unchanged. Conflicting values are resolved in the following way:

  • “unused”, “undecided” -> “undecided”

  • “unused”, “used” -> “used”

  • “undecided”, “used” -> “used”

For for-loops, the condition, body and post-part are visited twice, taking the joining control-flow at the condition into account. In other words, we create three control flow paths: Zero runs of the loop, one run and two runs and then combine them at the end.

Simulating a third run or even more is unnecessary, which can be seen as follows:

A state of an assignment at the beginning of the iteration will deterministically result in a state of that assignment at the end of the iteration. Let this state mapping function be called f. The combination of the three different states unused, undecided and used as explained above is the max operation where unused = 0, undecided = 1 and used = 2.

The proper way would be to compute

max(s, f(s), f(f(s)), f(f(f(s))), ...)

as state after the loop. Since f just has a range of three different values, iterating it has to reach a cycle after at most three iterations, and thus f(f(f(s))) has to equal one of s, f(s), or f(f(s)) and thus

max(s, f(s), f(f(s))) = max(s, f(s), f(f(s)), f(f(f(s))), ...).

In summary, running the loop at most twice is enough because there are only three different states.

For switch statements that have a “default”-case, there is no control-flow part that skips the switch.

When a variable goes out of scope, all statements still in the “undecided” state are changed to “unused”, unless the variable is the return parameter of a function - there, the state changes to “used”.

In the second traversal, all assignments that are in the “unused” state are removed.

This step is usually run right after the SSA transform to complete the generation of the pseudo-SSA.

Tools

Movability

Movability is a property of an expression. It roughly means that the expression is side-effect free and its evaluation only depends on the values of variables and the call-constant state of the environment. Most expressions are movable. The following parts make an expression non-movable:

  • function calls (might be relaxed in the future if all statements in the function are movable)

  • opcodes that (can) have side-effects (like call or selfdestruct)

  • opcodes that read or write memory, storage or external state information

  • opcodes that depend on the current PC, memory size or returndata size
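Some Yul fragments for illustration (assuming x is a variable in scope):

let a := add(x, 1)           // movable: depends only on variable values
let b := and(caller(), 0xff) // movable: the caller is constant during the call
let c := mload(0x40)         // not movable: reads memory
let d := sload(0)            // not movable: reads storage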

DataflowAnalyzer

The Dataflow Analyzer is not an optimizer step itself but is used as a tool by other components. While traversing the AST, it tracks the current value of each variable, as long as that value is a movable expression. For each variable, it also records which variables occur in the expression currently assigned to it. Upon each assignment to a variable a, the current stored value of a is updated, and the stored values of all variables b are cleared whenever a occurs in the expression currently stored for b.

At control-flow joins, knowledge about variables is cleared if they have or would be assigned in any of the control-flow paths. For instance, upon entering a for loop, all variables are cleared that will be assigned during the body or the post block.
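A minimal sketch of the tracked knowledge:

let x := calldataload(0)
let a := add(x, 1)
// tracked: a -> add(x, 1)
let b := mul(a, 2)
// tracked: b -> mul(a, 2)
x := 7
// the assignment to x clears the tracked value of a,
// since x occurs in the expression stored for a;
// the value of b is kept, because a itself did not change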

Expression-Scale Simplifications

These simplification passes change expressions and replace them by equivalent and hopefully simpler expressions.

CommonSubexpressionEliminator

This step uses the Dataflow Analyzer and replaces subexpressions that syntactically match the current value of a variable by a reference to that variable. This is an equivalence transform because such subexpressions have to be movable.

All subexpressions that are identifiers themselves are replaced by their current value if the value is an identifier.

The combination of the two rules above allows the computation of a local value numbering, which means that if two variables have the same value, one of them will always be unused. The Unused Pruner or the Unused Assign Eliminator will then be able to fully eliminate such variables.
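A minimal sketch:

let x := calldataload(0)
let a := add(x, 1)
let b := add(x, 1)
sstore(a, b)

is transformed to

let x := calldataload(0)
let a := add(x, 1)
let b := a
sstore(a, a)

The now-unused variable b can then be fully eliminated by the Unused Pruner.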

This step is especially efficient if the expression splitter is run before. If the code is in pseudo-SSA form, the values of variables are available for a longer time and thus there is a higher chance that expressions can be replaced.

The expression simplifier will be able to perform better replacements if the common subexpression eliminator was run right before it.

ExpressionSimplifier

The ExpressionSimplifier uses the Dataflow Analyzer and makes use of a list of equivalence transforms on expressions like X + 0 -> X to simplify the code.

It tries to match patterns like X + 0 on each subexpression. During the matching procedure, it resolves variables to their currently assigned expressions to be able to match more deeply nested patterns even when the code is in pseudo-SSA form.

Some of the patterns like X - X -> 0 can only be applied as long as the expression X is movable, because otherwise it would remove its potential side-effects. Since variable references are always movable, even if their current value might not be, the Expression Simplifier is again more powerful in split or pseudo-SSA form.
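A minimal sketch, where x and y are variables (and thus movable references):

let a := add(x, 0)
let b := sub(y, y)

is transformed to

let a := x
let b := 0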

LiteralRematerialiser

To be documented.

LoadResolver

Optimization stage that replaces expressions of type sload(x) and mload(x) by the value currently stored in storage or memory, respectively, if that value is known.

Works best if the code is in SSA form.

Prerequisite: Disambiguator, ForLoopInitRewriter.
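A minimal sketch:

mstore(0x20, 7)
let x := mload(0x20)

is transformed to

mstore(0x20, 7)
let x := 7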

Statement-Scale Simplifications

CircularReferencesPruner

This stage removes functions that call each other but are neither externally referenced nor referenced from the outermost context.
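A minimal sketch:

function ping() { pong() }
function pong() { ping() }

If neither function is referenced from outside this cycle, both definitions are removed.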

ConditionalSimplifier

The Conditional Simplifier inserts assignments to condition variables if the value can be determined from the control-flow.

Destroys SSA form.

Currently, this tool is very limited, mostly because we do not yet have support for boolean types. Since conditions only check for expressions being nonzero, we cannot assign a specific value.

Current features:

  • switch cases: insert “<condition> := <caseLabel>”

  • after if statement with terminating control-flow, insert “<condition> := 0”

Future features:

  • allow replacements by “1”

  • take termination of user-defined functions into account

Works best with SSA form and if dead code removal has run before.

Prerequisite: Disambiguator.

ConditionalUnsimplifier

Reverse of Conditional Simplifier.

ControlFlowSimplifier

Simplifies several control-flow structures:

  • replace if with empty body with pop(condition)

  • remove empty default switch case

  • remove empty switch case if no default case exists

  • replace switch with no cases with pop(expression)

  • turn switch with single case into if

  • replace switch with only default case with pop(expression) and body

  • replace switch with const expr with matching case body

  • replace for with terminating control flow and without other break/continue by if

  • remove leave at the end of a function.

None of these operations depend on the data flow. The StructuralSimplifier performs similar tasks that do depend on data flow.

The ControlFlowSimplifier does record the presence or absence of break and continue statements during its traversal.

Prerequisite: Disambiguator, FunctionHoister, ForLoopInitRewriter. Important: Introduces EVM opcodes and thus can only be used on EVM code for now.
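As an illustration of one of the rules above, a switch consisting only of a default case:

switch calldataload(0)
default { sstore(0, 1) }

is transformed to

pop(calldataload(0))
sstore(0, 1)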

DeadCodeEliminator

This optimization stage removes unreachable code.

Unreachable code is any code within a block which is preceded by a leave, return, invalid, break, continue, selfdestruct, revert or by a call to a user-defined function that recurses infinitely.

Function definitions are retained as they might be called by earlier code and thus are considered reachable.
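A minimal sketch:

{
    sstore(0, 1)
    revert(0, 0)
    sstore(0, 2)
}

is transformed to

{
    sstore(0, 1)
    revert(0, 0)
}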

Because variables declared in a for loop’s init block have their scope extended to the loop body, we require ForLoopInitRewriter to run before this step.

Prerequisite: ForLoopInitRewriter, Function Hoister, Function Grouper

EqualStoreEliminator

This step removes mstore(k, v) and sstore(k, v) calls if there was a previous call to mstore(k, v) / sstore(k, v), no other store in between, and the values of k and v did not change.

This simple step is effective if run after the SSA transform and the Common Subexpression Eliminator, because SSA will make sure that the variables will not change and the Common Subexpression Eliminator re-uses exactly the same variable if the value is known to be the same.

Prerequisites: Disambiguator, ForLoopInitRewriter
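A minimal sketch:

let k := calldataload(0)
let v := calldataload(0x20)
sstore(k, v)
sstore(k, v)

is transformed to

let k := calldataload(0)
let v := calldataload(0x20)
sstore(k, v)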

UnusedPruner

This step removes the definitions of all functions that are never referenced.

It also removes the declaration of variables that are never referenced. If the declaration assigns a value that is not movable, the expression is retained, but its value is discarded.

All movable expression statements (expressions whose results are not assigned to anything) are removed.
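A minimal sketch (assuming x is in scope; the discarded non-movable value is shown here as a pop):

{
    let a := add(x, 1) // never referenced and movable: removed entirely
    let b := mload(0)  // never referenced but not movable: the expression is kept
    sstore(0, x)
}

is transformed to

{
    pop(mload(0))
    sstore(0, x)
}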

StructuralSimplifier

This is a general step that performs various kinds of simplifications on a structural level:

  • replace if statement with empty body by pop(condition)

  • replace if statement with true condition by its body

  • remove if statement with false condition

  • turn switch with single case into if

  • replace switch with only default case by pop(expression) and body

  • replace switch with literal expression by matching case body

  • replace for loop with false condition by its initialization part

This component uses the Dataflow Analyzer.

BlockFlattener

This stage eliminates nested blocks by inserting the statements of the inner block at the appropriate place in the outer block. It depends on the FunctionGrouper and does not flatten the outermost block, to keep the form produced by the FunctionGrouper.

{
    {
        let x := 2
        {
            let y := 3
            mstore(x, y)
        }
    }
}

is transformed to

{
    {
        let x := 2
        let y := 3
        mstore(x, y)
    }
}

As long as the code is disambiguated, this does not cause a problem because the scopes of variables can only grow.

LoopInvariantCodeMotion

This optimization moves movable SSA variable declarations outside the loop.

Only statements at the top level in a loop’s body or post block are considered, i.e., variable declarations inside conditional branches will not be moved out of the loop.

Requirements:

  • The Disambiguator, ForLoopInitRewriter and FunctionHoister must be run upfront.

  • Expression splitter and SSA transform should be run upfront to obtain better results.
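A minimal sketch, assuming ForLoopInitRewriter has already moved the loop initialization out:

let x := calldataload(0)
let y := calldataload(0x20)
let i := 0
for { } lt(i, 10) { i := add(i, 1) } {
    let t := add(x, y)
    sstore(i, t)
}

is transformed to

let x := calldataload(0)
let y := calldataload(0x20)
let i := 0
let t := add(x, y)
for { } lt(i, 10) { i := add(i, 1) } {
    sstore(i, t)
}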

Function-Level Optimizations

FunctionSpecializer

This step specializes the function with its literal arguments.

If a function, say, function f(a, b) { sstore (a, b) }, is called with literal arguments, for example, f(x, 5), where x is an identifier, it could be specialized by creating a new function f_1 that takes only one argument, i.e.,

function f_1(a_1) {
    let b_1 := 5
    sstore(a_1, b_1)
}

Other optimization steps will be able to make more simplifications to the function. The optimization step is mainly useful for functions that would not be inlined.

Prerequisites: Disambiguator, FunctionHoister

LiteralRematerialiser is recommended as a prerequisite, even though it’s not required for correctness.

UnusedFunctionParameterPruner

This step removes unused parameters in a function.

If a parameter is unused, like c and y in function f(a,b,c) -> x, y { x := div(a,b) }, we remove the parameter and create a new “linking” function as follows:

function f(a,b) -> x { x := div(a,b) }
function f2(a,b,c) -> x, y { x := f(a,b) }

and replace all references to f by f2. The inliner should be run afterwards to make sure that all references to f2 are replaced by f.

Prerequisites: Disambiguator, FunctionHoister, LiteralRematerialiser.

The step LiteralRematerialiser is not required for correctness. It helps deal with cases such as: function f(x) -> y { revert(y, y) } where the literal y will be replaced by its value 0, allowing us to rewrite the function.

UnusedStoreEliminator

Optimizer component that removes redundant sstore and memory store statements. In case of an sstore, if all outgoing code paths revert (due to an explicit revert(), invalid(), or infinite recursion) or lead to another sstore for which the optimizer can tell that it will overwrite the first store, the statement will be removed. However, if there is a read operation between the initial sstore and the revert, or the overwriting sstore, the statement will not be removed. Such read operations include: external calls, user-defined functions with any storage access, and sload of a slot that cannot be proven to differ from the slot written by the initial sstore.

For example, the following code

{
    let c := calldataload(0)
    sstore(c, 1)
    if c {
        sstore(c, 2)
    }
    sstore(c, 3)
}

will be transformed into the code below after the Unused Store Eliminator step is run

{
    let c := calldataload(0)
    if c { }
    sstore(c, 3)
}

For memory store operations, things are generally simpler, at least in the outermost Yul block, as all such statements will be removed if they are never read from in any code path. At the function level, however, the approach is similar to sstore: since we do not know whether the memory location will be read once we leave the function’s scope, the statement will be removed only if all code paths lead to a memory overwrite.

Best run in SSA form.

Prerequisites: Disambiguator, ForLoopInitRewriter.

EquivalentFunctionCombiner

If two functions are syntactically equivalent, while allowing variable renaming but not any re-ordering, then any reference to one of the functions is replaced by the other.

The actual removal of the function is performed by the Unused Pruner.
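A minimal sketch:

function f(a) -> b { b := add(a, 1) }
function g(x) -> y { y := add(x, 1) }

The two functions are syntactically equivalent up to variable renaming, so every reference to one of them can be redirected to the other; the now-unused definition is then removed by the Unused Pruner.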

Function Inlining

ExpressionInliner

This component of the optimizer performs restricted function inlining by inlining functions that can be inlined inside functional expressions, i.e. functions that:

  • return a single value.

  • have a body like r := <functional expression>.

  • neither reference themselves nor r in the right-hand side.

Furthermore, for all parameters, all of the following need to be true:

  • The argument is movable.

  • The parameter is either referenced less than twice in the function body, or the argument is rather cheap (“cost” of at most 1, like a constant up to 0xff).

Example: The function to be inlined has the form of function f(...) -> r { r := E } where E is an expression that does not reference r and all arguments in the function call are movable expressions.

The result of this inlining is always a single expression.

This component can only be used on sources with unique names.
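A minimal sketch that satisfies all the conditions above (the constant argument 3 is cheap, so the doubly-referenced parameter is allowed):

function double(a) -> r { r := add(a, a) }
let x := double(3)

The call is replaced, turning the declaration into

let x := add(3, 3)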

FullInliner

The FullInliner replaces certain calls of certain functions by the function’s body. This is not very helpful in most cases, because it just increases the code size but does not have a benefit. Furthermore, code is usually very expensive and we would often rather have shorter code than more efficient code. In some cases, though, inlining a function can have positive effects on subsequent optimizer steps. This is the case if one of the function arguments is a constant, for example.
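Roughly (the exact shape and variable names generated by the compiler may differ), a call such as

function f(a) -> b { b := sload(a) }
let r := f(0)

could be replaced by a copy of the body with renamed variables:

let a_1 := 0
let b_1
b_1 := sload(a_1)
let r := b_1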

During inlining, a heuristic is used to tell if the function call should be inlined or not. The current heuristic does not inline into “large” functions unless the called function is tiny. Functions that are only used once are inlined, as well as medium-sized functions, while function calls with constant arguments allow slightly larger functions.

In the future, we may include a backtracking component that, instead of inlining a function right away, only specializes it, which means that a copy of the function is generated where a certain parameter is always replaced by a constant. After that, we can run the optimizer on this specialized function. If it results in heavy gains, the specialized function is kept, otherwise the original function is used instead.

FunctionHoister and ExpressionSplitter are recommended as prerequisites since they make the step more efficient, but are not required for correctness. In particular, function calls with other function calls as arguments are not inlined, but running ExpressionSplitter beforehand ensures that there are no such calls in the input.

Cleanup

The cleanup is performed at the end of the optimizer run. It tries to combine split expressions into deeply nested ones again and also improves the “compilability” for stack machines by eliminating variables as much as possible.

ExpressionJoiner

This is the opposite operation of the expression splitter. It turns a sequence of variable declarations that have exactly one reference into a complex expression. This stage fully preserves the order of function calls and opcode executions. It does not make use of any information concerning the commutativity of the opcodes; if moving the value of a variable to its place of use would change the order of any function call or opcode execution, the transformation is not performed.

Note that the component will not move the assigned value of a variable assignment or a variable that is referenced more than once.

The snippet let x := add(0, 2) let y := mul(x, mload(2)) is not transformed, because it would cause the order of the calls to the opcodes add and mload to be swapped - even though this would not make a difference because add is movable.

When reordering opcodes like that, variable references and literals are ignored. Because of that, the snippet let x := add(0, 2) let y := mul(x, 3) is transformed to let y := mul(add(0, 2), 3), even though the add opcode would be executed after the evaluation of the literal 3.

SSAReverser

This is a tiny step that helps in reversing the effects of the SSA transform if it is combined with the Common Subexpression Eliminator and the Unused Pruner.

The SSA form we generate is detrimental to code generation because it produces many local variables. It would be better to just re-use existing variables with assignments instead of fresh variable declarations.

The SSA transform rewrites

let a := calldataload(0)
mstore(a, 1)
a := calldataload(0x20)

to

let a_1 := calldataload(0)
let a := a_1
mstore(a_1, 1)
let a_2 := calldataload(0x20)
a := a_2

The problem is that instead of a, the variable a_1 is used whenever a was referenced. The SSA reverser changes statements of this form by just swapping out the declaration and the assignment. The above snippet is turned into

let a := calldataload(0)
let a_1 := a
mstore(a_1, 1)
a := calldataload(0x20)
let a_2 := a

This is a very simple equivalence transform, but when we now run the Common Subexpression Eliminator, it will replace all occurrences of a_1 by a (until a is re-assigned). The Unused Pruner will then eliminate the variable a_1 altogether and thus fully reverse the SSA transform.

StackCompressor

One problem that makes code generation for the Ethereum Virtual Machine hard is the fact that there is a hard limit of 16 slots for reaching down the expression stack. This more or less translates to a limit of 16 local variables. The stack compressor takes Yul code and compiles it to EVM bytecode. Whenever the stack difference is too large, it records the function in which this happened.

For each function that caused such a problem, the Rematerialiser is called with a special request to aggressively eliminate specific variables sorted by the cost of their values.

On failure, this procedure is repeated multiple times.

Rematerialiser

The rematerialisation stage tries to replace variable references by the expression that was last assigned to the variable. This is of course only beneficial if this expression is comparatively cheap to evaluate. Furthermore, it is only semantically equivalent if the value of the expression did not change between the point of assignment and the point of use. The main benefit of this stage is that it can save stack slots if it leads to a variable being eliminated completely (see below), but it can also save a DUP opcode on the EVM if the expression is very cheap.

The Rematerialiser uses the Dataflow Analyzer to track the current values of variables, which are always movable. If the value is very cheap or the variable was explicitly requested to be eliminated, the variable reference is replaced by its current value.
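A minimal sketch, where the value is a cheap literal:

let x := 1
mstore(x, 2)

is transformed to

let x := 1
mstore(1, 2)

and the now-unused variable x can then be removed by the Unused Pruner.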

ForLoopConditionOutOfBody

Reverses the transformation of ForLoopConditionIntoBody.

For any movable c, it turns

for { ... } 1 { ... } {
    if iszero(c) { break }
    ...
}

into

for { ... } c { ... } {
    ...
}

and it turns

for { ... } 1 { ... } {
    if c { break }
    ...
}

into

for { ... } iszero(c) { ... } {
    ...
}

The LiteralRematerialiser should be run before this step.

Codegen-Based Optimizer Module

Currently, the codegen-based optimizer module provides two optimizations.

The first one, available in the legacy code generator, moves literals to the right side of commutative binary operators, which helps exploit their associativity.

The other one, available in the IR-based code generator, enables the use of unchecked arithmetic when generating code for incrementing the counter variable of certain idiomatic for loops. This avoids wasting gas by identifying some conditions that guarantee that the counter variable cannot overflow. This eliminates the need to use a verbose unchecked arithmetic block inside the loop body to increment the counter variable.

Unchecked Loop Increment

Introduced in Solidity 0.8.22, the overflow check optimization step is concerned with identifying the conditions under which the for loop counter can be safely incremented without overflow checks.

This optimization is only applied to for loops of the general form:

for (uint i = X; i < Y; ++i) {
    // variable i is not modified in the loop body
}

The condition and the fact that the counter variable is only ever incremented guarantee that it never overflows. The precise requirements for the loop to be eligible for the optimization are as follows:

  • The loop condition is a comparison of the form i < Y, for a local counter variable i (called the “loop counter” hereon) and an expression Y.

  • The built-in operator < is necessarily used in the loop condition and is the only operator that triggers the optimization. <= and the like are intentionally excluded. Additionally, user-defined operators are not eligible.

  • The loop expression is a prefix or postfix increment of the counter variable, i.e., i++ or ++i.

  • The loop counter is a local variable of a built-in integer type.

  • The loop counter is not modified by the loop body or by the expression used as the loop condition.

  • The comparison is performed on the same type as the loop counter, meaning that the type of the right-hand-side expression is implicitly convertible to the type of the counter, such that the latter is not implicitly widened before the comparison.

To clarify the last condition, consider the following example:

for (uint8 i = 0; i < uint16(1000); i++) {
    // ...
}

In this case, the counter i has its type implicitly converted from uint8 to uint16 before the comparison and the condition is in fact never false, so the overflow check for the increment cannot be removed.