Make a directory and clone the repository
$ git clone https://github.com/EECS486/Bezos.git
virtualenv creates an isolated Python environment so that we can install Python packages for this project without overwriting the ones on our system
$ pip install virtualenv
Go to the directory where you cloned the repository
$ cd Bezos
$ virtualenv -p python3 env
$ source env/bin/activate
- Make sure you are in the repository root (where requirements.txt is)
$ pip install -r requirements.txt
- To reactivate the environment later
$ source env/bin/activate
- To add a new package and record it in requirements.txt
$ pip install package
$ pip freeze > requirements.txt
- To leave the virtual environment
$ deactivate
- data_analytics_erneh.py - displays analytics for the review and metadata data
- jsonReviewRead.py - review parser for the classification models
- jsonReviewRead_vBlackfyre.py - review parser for the classification models
- metadata.py - metadata parser for the classification models
- naivebayes.py - naive Bayes model classifier
- porter.py - Porter stemmer
- reviewdata.py - review parser for the naive Bayes model
- reviewdataNB.py - review parser for the naive Bayes model
- linking_and_metrics.py - links the metadata and review data and calculates analytics for each review
- modelGeneration.py - generates each classification model and predicts the helpfulness of Amazon reviews
- output/ - output for the naive Bayes and classification models
- plots/ - plots for feature importance
- Download and extract the contents of this folder into the repository: https://drive.google.com/file/d/1QCZXLE9F9BqI2k2y3APi4tPItdREnhMS/view?usp=sharing
- It contains the JSON review and metadata files used to generate the review data, along with pickle files that make model generation faster
- Open naivebayes.py
- Set the line params = {"stem": False, "stop": True, "condProb": True, "bigram": True} to the parameters you want to run naive Bayes with
- stem: stems words
- stop: removes stop words
- condProb: make sure this stays True
- bigram: builds a bigram model instead of a unigram model
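The sketch below illustrates how flags like these typically gate a preprocessing pipeline; the function and stop-word list are hypothetical stand-ins, not the repo's actual naivebayes.py code (in particular, the real stemmer is the Porter stemmer in porter.py, not the crude suffix-strip shown here).

```python
# Hypothetical illustration of how the params dict could gate preprocessing.
params = {"stem": False, "stop": True, "condProb": True, "bigram": True}

STOP_WORDS = {"the", "a", "is", "and", "of"}  # tiny example list

def preprocess(text, params):
    tokens = text.lower().split()
    if params["stop"]:
        # Drop common function words before counting features.
        tokens = [t for t in tokens if t not in STOP_WORDS]
    if params["stem"]:
        # Stand-in for porter.py's Porter stemmer.
        tokens = [t.rstrip("s") for t in tokens]
    if params["bigram"]:
        # Replace unigram features with adjacent word pairs.
        tokens = [" ".join(p) for p in zip(tokens, tokens[1:])]
    return tokens

print(preprocess("the product is great and tasty", params))
# → ['product great', 'great tasty']
```

With "bigram": False the same call would return the filtered unigrams instead.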
- Install Stanford CoreNLP: https://stanfordnlp.github.io/CoreNLP/index.html
- Go to the directory where you installed Stanford CoreNLP
- Follow the instructions to run the Stanford CoreNLP server on port 9000: https://stanfordnlp.github.io/CoreNLP/corenlp-server.html#getting-started
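Once the server is up, it accepts POST requests with the raw text as the body and the annotator settings JSON-encoded in a properties URL parameter (per the CoreNLP server docs). The helper below only builds such a request with the standard library; it assumes the server is running locally on port 9000 and is not code from this repo.

```python
import json
import urllib.parse
import urllib.request

def corenlp_request(text, annotators="tokenize,ssplit,pos", port=9000):
    # The text goes in the POST body; the annotator settings go in the
    # "properties" URL parameter as JSON.
    props = json.dumps({"annotators": annotators, "outputFormat": "json"})
    url = "http://localhost:{}/?properties={}".format(port, urllib.parse.quote(props))
    return urllib.request.Request(url, data=text.encode("utf-8"))

req = corenlp_request("This snack tastes great.")
# To actually send it (requires the server to be running):
# response = json.load(urllib.request.urlopen(req))
```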
- In linking_and_metrics.py
- Set category to the category you want to process, e.g. 'Grocery_and_Gourmet_Food'
- Run data_analytics_erneh.py to generate statistics on the whole review set
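As a rough illustration of the kind of per-review statistics involved, the sketch below computes an average rating and helpfulness ratios from line-delimited review JSON. The field names ("overall" for the star rating, "helpful" as [upvotes, total votes]) follow the common Amazon review JSON layout and are an assumption about this dataset; this is not the repo's actual analytics code.

```python
import json

# Two made-up reviews in the assumed Amazon line-delimited JSON layout.
sample = [
    '{"overall": 5.0, "helpful": [8, 10]}',
    '{"overall": 2.0, "helpful": [0, 3]}',
]

reviews = [json.loads(line) for line in sample]
avg_rating = sum(r["overall"] for r in reviews) / len(reviews)
# Helpfulness ratio = upvotes / total votes, skipping reviews with no votes.
ratios = [up / total for up, total in (r["helpful"] for r in reviews) if total > 0]
print(avg_rating, ratios)
# → 3.5 [0.8, 0.0]
```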
- In modelGeneration.py
- Set category to the category you want to process, e.g. 'Grocery_and_Gourmet_Food'
- Note: the .pkl files for that category must already have been generated
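The note above describes a pickle-caching pattern: parsed data for a category is written to a .pkl once, and later runs load it instead of reparsing. The function below is a hypothetical sketch of that pattern (file layout and names are illustrative, not the repo's actual modelGeneration.py code).

```python
import os
import pickle
import tempfile

def load_category(category, build_fn, cache_dir):
    # Load cached data for a category if its .pkl exists; otherwise build
    # the data with build_fn and cache it for the next run.
    path = os.path.join(cache_dir, category + ".pkl")
    if os.path.exists(path):
        with open(path, "rb") as f:
            return pickle.load(f)
    data = build_fn()
    with open(path, "wb") as f:
        pickle.dump(data, f)
    return data

with tempfile.TemporaryDirectory() as d:
    first = load_category("Grocery_and_Gourmet_Food", lambda: [1, 2, 3], d)
    # Second call is served from the cache, so its build_fn never runs.
    second = load_category("Grocery_and_Gourmet_Food", lambda: [], d)
    print(first, second)
    # → [1, 2, 3] [1, 2, 3]
```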