shifu

This is a personal project intended to write my own interpreter in go lang.

Introduction:

Any input that is written in the language goes first through the Lexer where the input is divided and assigned respective Tokens.
Once tokens are ready the Parser kicks in and parses the input tokens, it also checks the grammer of the input and raises parsing errors if any.
The parser creates an AST (Abstract Syntax Tree) which becomes the input for the evaluator which reads this tree.
The Evaluator converts the tokens within the AST into the Objects which are stored in Environment.
The Evaluator has it's own Environment where it stores any temporary variables and evaluated.
Once the evaluator is done it outputs the result of the function if there is any return statement in the input.

Language Featuers:

Dynamically typed
Supports higher order functions, or in other words functions are first class citizens
Supports closures
Supports integer arithmetic
Supports strings, integers, arrays and hashs
Supports builtin functions
Completely written in golang
Hashs can have Strings, Integers or Booleans as keys.
Also anything that evaluates to Strings, Integers or Booleans can be used as Keys in Hashs.

Getting Started:

go run main.go

Integer Arithmetic :

3 + 4 * 5 == 3 * 1 + 4 * 5

true

3 + 4; -5 * 5

-25

Using Strings and String concatenation:

"Inner " + " " + " Peace!"


Inner Peace!

Using Functions :

let double = func(x) { x * 2; };
double(5);

10

Higher order Functions :

let add = func(x, y) { x + y; }; 
add(5 + 5, add(5,5));

20

Closures :

let newAdder = func(x){
  func(y) { x + y };
  };

let addTwo = newAdder(2);
addTwo(2);


4

Enclosed Environments :

let first = 10;
let second = 10;
let third = 10;

let ourFunction = func(first) {
  let second = 20;

  first + second + third;
};

ourFunction(20) + first + second;


70

IF Else Statements:

if(10 > 1){
  if(10 > 1){
    return 10;
  }
return 1;
}


10

Arrays:

let myArray = [1, 2, 3]; 
myArray[2];

3

Array Builtin Functions:

len

len([1, 2, 3])

3

push

push([3,6,8,9], 1)

[3,6,8,9,1]

pop

pop([2,4,5,6], 1)

[2,5,6]

first

first([3, 1, 9])

3

last

last([3, 1, 9])

9

Hashes:

{"foo": 5}["foo"]


5

Strings as keys:

{"foo": 5};["foo"]


5

Integers as Keys:

{300: 5}[300]


5

Booleans as Keys:

{true: 5}[true]


5

Expressions as Keys:

{"th"+"ree": 23421}["three"]


23421

{100 < 3 : 4455}[false]


4455

puts Builtin Function:

prints the given arguments on new lines to STDOUT.

puts("Hello !")


Hello !
null

Language Parts:

Lexer:

Lexer is the lexeical analyser for the language it converts the raw input into Tokens.
The list of valid tokens is within the the token.go file.
If the input has anything apart from valid tokens then lexer assigns it an Illegal token.
This is the first stage of understanding/interpreting the input.

Parser:

Parser just like anything else parses the input into a meaningful datastructure.
Our Parser takes the input from Lexer and converts the tokens into an AST (Abstract Syntax Tree).
This is the second stage to understanding/interpreting the input.
The type of Parser we use here is Recursive Descent Parser that works from top down.
This is also called top down operator precedence parser or a Pratt Parser.
The main idea behind pratt parser is to associate parsing functions with token types, whenever a token type is encountered the appropriate parsing function is called which returns an AST node that represents the expression.
Each token type can have upto two parsing functions associated with it, depending on whether the otken is found in a prefix position or infix position.
The parser here won't be fastest or have a formal proof of its correctness and its error recovey process and detection of errorneous syntax won't be always right as it's just the begining for me.
Supports prefix and infix operators. work for supporting postfix operators in progress.
Supports let statements, return statements and expressions.

Evaluator:

Evaluator/Interpreter is the part of the language that takes the parsed code from the parser as an input and then executes it.
This gives the meaning to the language and makes it come to life.
This is the final stage to understand/interpret the input.
There are several different ways to build an Interpreter, some of them listed in terms of increasing complexity and performance:
- Tree Walking interpreter that interprets the AST on the fly. (The one implemented here.)
- Compiling the AST to an Intermediate byteCode that is compact and then use a virtual machine (something like JVM) to interpret this byteCode.
- Convert the byteCode compiled in the above step to highly optimized machine code right before interpreting that machine code. This is also called JIT or Just In Time Interpreter. Although here would need to support different machine architectures.
- Others skip the conversion to byteCode and directly convert the AST to machine code and then interpret it.

Ast:

AST also known as Abstract Syntax Tree is a datastructure that is used to store the langugage tokens to make sense.

Repl:

Repl stands for "Read Evaluate Print Loop".
This loop takes any input and performs the said steps.
Reads the input, evaluates the input, prints the output and loops again.
This is a standard in many languages such as Python and Javascript that comes with inbuilt REPL.
Usually REPL starts with a prompt ">>" thus in shifu the repl starts with the same.

Credits:

For this project I have been following through a great book written by Thorsten Ball. The Book name is : Writing an Interpreter in GO, Also have been adding along a few changes as per my understanding.

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github/workflows		.github/workflows
ast		ast
evaluator		evaluator
imgs		imgs
lexer		lexer
object		object
parser		parser
repl		repl
token		token
README.md		README.md
go.mod		go.mod
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

shifu

Introduction:

Language Featuers:

Getting Started:

Integer Arithmetic :

Using Strings and String concatenation:

Using Functions :

Higher order Functions :

Closures :

Enclosed Environments :

IF Else Statements:

Arrays:

Array Builtin Functions:

len

push

pop

first

last

Hashes:

puts Builtin Function:

Language Parts:

Lexer:

Parser:

Evaluator:

Ast:

Repl:

About

Releases

Packages

Languages

Neeraj-Natu/shifu

Folders and files

Latest commit

History

Repository files navigation

shifu

Introduction:

Language Featuers:

Getting Started:

Integer Arithmetic :

Using Strings and String concatenation:

Using Functions :

Higher order Functions :

Closures :

Enclosed Environments :

IF Else Statements:

Arrays:

Array Builtin Functions:

len

push

pop

first

last

Hashes:

puts Builtin Function:

Language Parts:

Lexer:

Parser:

Evaluator:

Ast:

Repl:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages