This repository has been archived by the owner on Jul 15, 2023. It is now read-only.

Preprocess: u'geoNetwork.city', u'geoNetwork.cityId', u'geoNetwork.continent', #73

Open
wesleytian opened this issue Oct 7, 2018 · 1 comment
Labels: easy, preprocessing (Preprocessing for data matrix)

Comments


wesleytian commented Oct 7, 2018

Preprocess the following features:

u'geoNetwork.city',
u'geoNetwork.cityId',
u'geoNetwork.continent',

  1. Standardization: http://scikit-learn.org/stable/modules/preprocessing.html#standardization-or-mean-removal-and-variance-scaling

  2. Impute missing values: http://scikit-learn.org/stable/modules/impute.html

  3. Normalization: http://scikit-learn.org/stable/modules/preprocessing.html#normalization

  4. Encode categorical features (optional): http://scikit-learn.org/stable/modules/preprocessing.html#encoding-categorical-features

  5. Discretization (optional): http://scikit-learn.org/stable/modules/preprocessing.html#discretization

http://scikit-learn.org/stable/modules/preprocessing.html
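
A minimal sketch of how steps 2 and 4 might be combined for the two string-valued features, assuming they are loaded into a pandas DataFrame with missing entries stored as NaN. The toy values, the `"missing"` fill value, and the variable names are illustrative assumptions, not taken from the project. Standardization and normalization operate on numeric data, so they would only apply after encoding, if at all.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Toy stand-in for the real data matrix (values are made up for illustration).
df = pd.DataFrame({
    "geoNetwork.city": ["Pittsburgh", np.nan, "London", "Pittsburgh"],
    "geoNetwork.continent": ["Americas", "Americas", "Europe", np.nan],
})

categorical = ["geoNetwork.city", "geoNetwork.continent"]

# Step 2 (impute): replace NaN with an explicit "missing" category.
# Step 4 (encode): one-hot encode, ignoring categories unseen at fit time.
categorical_steps = Pipeline([
    ("impute", SimpleImputer(strategy="constant", fill_value="missing")),
    ("encode", OneHotEncoder(handle_unknown="ignore")),
])

preprocess = ColumnTransformer([("geo", categorical_steps, categorical)])
X = preprocess.fit_transform(df)

# One row per input row, one column per (feature, category) pair.
print(X.shape)
```

Treating "missing" as its own category keeps every row, which may or may not suit the downstream model; imputing the most frequent value is another option.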

@wesleytian wesleytian added easy preprocessing Preprocessing for data matrix labels Oct 7, 2018
@htcao htcao self-assigned this Oct 8, 2018

htcao commented Oct 9, 2018

It seems that all values of the feature geoNetwork.cityId in the dataset are missing. How should we deal with that?
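
The thread does not record an answer. One common option, sketched here under the assumption that the data sits in a pandas DataFrame with missing entries stored as NaN, is to confirm that the column is entirely empty and then drop it, since a column with no observed values gives an imputer nothing to estimate from. The toy values and variable names are illustrative.

```python
import numpy as np
import pandas as pd

# Toy stand-in for the real data (values are made up for illustration).
df = pd.DataFrame({
    "geoNetwork.city": ["Pittsburgh", np.nan, "London"],
    "geoNetwork.cityId": [np.nan, np.nan, np.nan],
})

# Fraction of missing entries per column; 1.0 means nothing was observed.
missing_fraction = df.isna().mean()
all_missing = missing_fraction[missing_fraction == 1.0].index.tolist()

# Drop columns that carry no information at all.
df_clean = df.drop(columns=all_missing)

print(all_missing)
print(list(df_clean.columns))
```

If the real dataset marks missing values with a placeholder string rather than NaN, those placeholders would need to be converted to NaN first for this check to work.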
