Introduction
Maven Dependency
Quick Start
- Defining the Lexical Analyzer
Defining Grammar
Semantic Action

Introduction

The compilation process generally includes the following stages

Lexical Analysis: The process of converting the input stream into a Token stream
Syntax Analysis: Determine whether the Token stream satisfies the given grammar definition
Semantic Analysis: During the syntax analysis process, some side effects will be executed, these side effects are semantic analysis
Intermediate Code Generation: The result of semantic analysis may be intermediate code
Machine Independent Code Optimization: Intermediate code can undergo some machine-independent code optimizations, such as
- Deleting common sub-expressions
- Removing useless code
- Constant merging
- Code movement
- Deleting induction variables
Machine-Dependent Target Code: Translating intermediate code into machine code for a specific architecture
Machine Dependent Code Optimization

The compilation engine provides a complete framework for compilation, abstracting the following steps

Lexical Analysis: With the help of the compilation engine, we can create a lexical analyzer with simple construction steps
Syntax Analysis: We only need to define the specific grammar, the compilation engine will translate the grammar definition into a state automaton
Semantic Analysis: With the help of the compilation engine, we can easily expand the semantic analysis

The compilation engine supports various different grammar analysis methods, including

LL1: Worst analysis capability
SLR: Analysis capability slightly stronger than LL1
LR0: Analysis capability slightly stronger than SLR
LR1: Strongest analysis capability
LALR (recommended): The analysis capability is the same as LR1, but the size of the state machine is smaller than LR1

Maven Dependency

        <dependency>
            <groupId>com.github.liuyehcf</groupId>
            <artifactId>compile-engine</artifactId>
            <version>1.0.3</version>
        </dependency>

Quick Start

We use the compilation engine to complete a simple calculator, with the following functions

Supports +, -, *, /
*, / have higher priority than +, -
Supports ()
Only supports integers

Detailed Example Code

Defining the Lexical Analyzer

We can use DefaultLexicalAnalyzer to build a lexical analyzer. The lexemes of the calculator include

Operators, +, -, *, /
Brackets ()
Numbers

There are three types of lexemes

normal: Normal lexeme, such as +, -, *, / here
keyword: Keywords, which have higher priority than normal lexemes
operator: Custom parsing process, such as parsing integers here

    static LexicalAnalyzer LEXICAL_ANALYZER = DefaultLexicalAnalyzer.Builder.builder()
            .addTokenOperator(Symbol.createIdentifierTerminator(IDENTIFIER_INTEGER_LITERAL), new IntegerIdentifier())
            .addNormalMorpheme(Symbol.createTerminator(NORMAL_SMALL_LEFT_PARENTHESES), "(")
            .addNormalMorpheme(Symbol.createTerminator(NORMAL_SMALL_RIGHT_PARENTHESES), ")")
            .addNormalMorpheme(Symbol.createTerminator(NORMAL_MUL), "*")
            .addNormalMorpheme(Symbol.createTerminator(NORMAL_DIV), "/")
            .addNormalMorpheme(Symbol.createTerminator(NORMAL_ADD), "+")
            .addNormalMorpheme(Symbol.createTerminator(NORMAL_SUB), "-")
            .build();

Defining Grammar

The grammar definition of the calculator is as follows

<program> → <expression>
<expression> → <additive expression>
<additive expression> → <additive expression> + <multiplicative expression>
    | <additive expression> - <multiplicative expression>
    | <multiplicative expression>
<multiplicative expression> → <multiplicative expression> * <primary>
    | <multiplicative expression> * <primary> 
    | <multiplicative expression> / <primary>
    | <primary>
<primary> → #integerLiteral 
    | ( <expression> )

Using the grammar definition tool provided by the compilation engine, we translate the above grammar:

Symbol: Grammar symbols, including terminators and non-terminators
SymbolString: Grammar symbol string
PrimaryProduction: Productions, can contain semantic actions
Production: A collection of productions with the same left-hand side
Grammar: Grammar

For example code, refer to com.github.liuyehcf.framework.compile.engine.test.example.calculator.CalculatorGrammar

Semantic Actions

The semantic actions of the calculator are simple:

When reducing <additive expression> → <additive expression> + <multiplicative expression>, add Add opcode
When reducing <additive expression> → <additive expression> - <multiplicative expression>, add Sub opcode
When reducing <multiplicative expression> → <multiplicative expression> * <primary>, add Mul opcode
When reducing <multiplicative expression> → <multiplicative expression> / <primary>, add Div opcode
When reducing <primary> → #integerLiteral, add Load opcode

The generated product after compilation is a collection of opcodes containing Add, Sub, Mul, Div, Load. The result calculation can be done by performing calculations in the order of the opcode arrangement.

For example code, refer to com.github.liuyehcf.framework.compile.engine.test.example.calculator.CalculatorCode

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Introduction

Maven Dependency

Quick Start

Defining the Lexical Analyzer

Defining Grammar

Semantic Actions

Files

README.md

Latest commit

History

README.md

File metadata and controls

Introduction

Maven Dependency

Quick Start

Defining the Lexical Analyzer

Defining Grammar

Semantic Actions