Transfer Learning for Text Classification with Tensorflow

Tensorflow implementation of Semi-supervised Sequence Learning(https://arxiv.org/abs/1511.01432).

Auto-encoder or language model is used as a pre-trained model to initialize LSTM text classification model.

SA-LSTM: Use auto-encoder as a pre-trained model.
LM-LSTM: Use language model as a pre-trained model.

Requirements

Python 3
Tensorflow
pip install -r requirements.txt

Usage

DBpedia dataset is used for pre-training and training.

Pre-train auto encoder or language model

$ python pre_train.py --model="<MODEL>"

(<Model>: auto_encoder | language_model)

Train LSTM text classification model

$ python train.py --pre_trained="<MODEL>"

(<Model>: none | auto_encoder | language_model)

Experimental Results

Orange lines: LSTM
Blue lines: SA-LSTM
Red lines: LM-LSTM

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
model		model
README.md		README.md
data_utils.py		data_utils.py
pre_train.py		pre_train.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transfer Learning for Text Classification with Tensorflow

Requirements

Usage

Pre-train auto encoder or language model

Train LSTM text classification model

Experimental Results

Loss

Accuracy

About

Releases

Packages

Languages

dongjun-Lee/transfer-learning-text-tf

Folders and files

Latest commit

History

Repository files navigation

Transfer Learning for Text Classification with Tensorflow

Requirements

Usage

Pre-train auto encoder or language model

Train LSTM text classification model

Experimental Results

Loss

Accuracy

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages