Language modelling using PyTorch

Folders

char_rnn

It contains the character level language modelling in tinyshakespheare dataset using Recurrent Neural Networks implemented with PyTorch. The data set and a well commented jupyter notebook is added in this folder. The jupyter notebook is orignally a Google Colab notebook. If you find it difficult to reproduce the results locally, try it on Google Colaboratory. Anyway you can find the original work at here.

A bunch of sample folders are added in this folder where you can find the performance of the network at different hyper parameter settings. Each folder will contain a generated text file, a loss vs iterations graph and the saved trained model which can be reused using PyTorch. This is planned as a programming session for this blog post in my personal blog.

bible database

This folder contains the different bible version data in JSON format. The versions are,

American Standard-ASV1901 (ASV)
Bible in Basic English (BBE)
Darby English Bible (DARBY)
King James Version (KJV)
Webster's Bible (WBT)
Young's Literal Translation (YLT)

Bible data is originally dowloaded from this github repo.

raw_gospel_data

This folder includes the raw gospel text extracted from the bible database and stored as JSON files.

training_data

This folder contains the data for training the LSTMs which are generated from the files in raw_data folder.

notebooks

This folder contains the Jupyter Notebooks which shows the demo of data fetching, cleaning EDA of bible stats. Bible stats is available in JSON format at root folder.

scripts

The scripts for cleaning and converting raw data to training data is included in the folder. The demos in notebook folder comes in action here.

best_model

The folder contains the best trained model, loss vs epochs graph and stats of training.

issues_of_mark

The folder contains the training results on different validation sets. Using gospel of Mark as validation shows less convergence.

generated_gospels

Some generated samples using trained model using warmup context.

Important Links

Medium Post: https://medium.com/@sleebapaul/gospel-of-lstms-how-i-wrote-5th-gospel-of-bible-using-lstms-4cffa70e5f1a

Google Colab Link: https://colab.research.google.com/drive/1euakjbNiZgCfbmCWzT6pIZB2MYZbHjk-

The trained model can be reproduced in Colab Notebook. Though, I've added the copy of Colab Notebook as pytorch_LSTM.ipynb for the local reference, I recommend the reader to make efforts for reproducing the results at Google Colaboratory.

I've added the Gospel of LSTMs epub version in the repo. Please use it for better reading experience.

Happy Learning !

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
best_model		best_model
bible_database		bible_database
char_rnn		char_rnn
generated_gospels		generated_gospels
issue_of_mark		issue_of_mark
notebooks		notebooks
raw_gospel_data		raw_gospel_data
scripts		scripts
training_data		training_data
various_models_trained		various_models_trained
.gitignore		.gitignore
Gospel of LSTMs.epub		Gospel of LSTMs.epub
README.md		README.md
bible_stats.json		bible_stats.json
pytorch_LSTM.ipynb		pytorch_LSTM.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Language modelling using PyTorch

Folders

char_rnn

bible database

raw_gospel_data

training_data

notebooks

scripts

best_model

issues_of_mark

generated_gospels

Important Links

About

Releases

Packages

Languages

luizcz/gospel_of_rnn

Folders and files

Latest commit

History

Repository files navigation

Language modelling using PyTorch

Folders

char_rnn

bible database

raw_gospel_data

training_data

notebooks

scripts

best_model

issues_of_mark

generated_gospels

Important Links

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages