Skip to content
This repository has been archived by the owner on Mar 16, 2024. It is now read-only.

A tool to clean various datasets from Turing AI

License

Notifications You must be signed in to change notification settings

TuringAI-Team/data-cleaning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data-cleaning

A tool for cleaning datasets from turing ai

Usage guide

  1. Clone the repository
git clone https://github.com/TuringAI-Team/Data-cleaning.git
  1. Install the requirements
npm install
  1. Create folders, where you will store your datasets and results Create a folder in root called data Create a folder in root called steps
  2. Export your dataset from turing ai to the data folder and name it data.csv
  3. Run the CLI
npm run dev
  1. Follow the instructions in the CLI
  2. The results will be stored in the steps folder

To do

  • Fix bug where sometimes it answer with invalid json
  • Improve cleaning to not remove so much data