Learning RNN

Overview

The goal is to learn about RNN, by building it from scratch.

Goals:

Build a RNN specific to MNIST.
Both training and inferencing

Non-goals:

Configurable. Just to start with a fixed network.
Performance. A single-threaded implementation is enough.
GPU.

The network structure

The MNIST in RNN

This example is using MNIST handwritten digits. The dataset contains 60,000 examples for training and 10,000 examples for testing. The digits have been size-normalized and centered in a fixed-size image (28x28 pixels) with values from 0 to 1. For simplicity, each image has been flattened and converted to a 1-D numpy array of 784 features 28x28.

To classify images using a recurrent neural network, we consider every image row as a sequence of pixels. Because MNIST image shape is 28x28px, we will then handle 28 sequences of 28 timesteps for every sample.

source

The RNN cell

Just to start with the simplest ones, for example, Elman network with ReLU:

h_t = ReLU(W_h x_t + U_h h_{t-1} + b_h)
y_t = ReLU(W_y h_t + b_y)

source

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
keras		keras
README.md		README.md
elman.py		elman.py
lstm.py		lstm.py
rnn.py		rnn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning RNN

Overview

The network structure

The MNIST in RNN

The RNN cell

About

Releases

Packages

Languages

shijin1984/learn_rnn

Folders and files

Latest commit

History

Repository files navigation

Learning RNN

Overview

The network structure

The MNIST in RNN

The RNN cell

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages