for dataset #2

Open
DAOUDISARA2 opened this issue Dec 6, 2019 · 4 comments

Comments

@DAOUDISARA2

Is there a way to run k-means clustering on data stored in a plain text file (file.txt)? Please help.

@LutzCle
Member

LutzCle commented Dec 9, 2019

Hi,

I assume you're trying to load a plain text file? Clustering requires numeric data, e.g., a matrix of floating-point numbers.
As this is a research prototype, CL-kmeans only supports a very simple binary format. Writing a converter, e.g., CSV-to-binary, shouldn't be too hard. If you're interested in contributing one, take a look at binary_format.cpp and file_info.py.

@DAOUDISARA2
Author

Hi,
I'm using the data set from the Rodinia benchmark: https://github.com/JuliaParallel/rodinia/tree/master/data/kmeans

@LutzCle
Member

LutzCle commented Dec 9, 2019

Ok, in that case there are two options. You can either convert the file, or tweak the generator to generate the binary format. You can take a look at my data generator for an example of how to generate the binary format.
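
For the first option, a converter could look roughly like the sketch below. It rests on two assumptions that are not confirmed anywhere in this thread: that each line of the Rodinia text file holds a point id followed by whitespace-separated feature values, and that the output is a header of two little-endian 64-bit integers followed by 32-bit floats stored feature by feature. That output layout is hypothetical, as are the names parse_rodinia_text and write_binary; check binary_format.cpp for the layout CL-kmeans really expects and adjust write_binary to match.

```python
import struct


def parse_rodinia_text(lines):
    """Parse rows of whitespace-separated numbers into a list of points.

    ASSUMPTION: the first column of each row is a point id and is dropped.
    """
    points = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        points.append([float(x) for x in fields[1:]])
    if not points:
        raise ValueError("no data rows found")
    width = len(points[0])
    if any(len(p) != width for p in points):
        raise ValueError("rows have differing numbers of features")
    return points


def write_binary(points, path):
    """Write points to a binary file.

    HYPOTHETICAL layout, not the confirmed CL-kmeans format:
    two little-endian uint64 values (num_features, num_points),
    then float32 values stored one feature (column) at a time.
    """
    num_points = len(points)
    num_features = len(points[0])
    with open(path, "wb") as f:
        f.write(struct.pack("<QQ", num_features, num_points))
        for j in range(num_features):
            column = [p[j] for p in points]
            f.write(struct.pack(f"<{num_points}f", *column))
```

Usage would be along the lines of `write_binary(parse_rodinia_text(open("file.txt")), "file.bin")`. Keeping the parser and the writer separate means only write_binary has to change once the real layout is known.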

@DAOUDISARA2
Author

Thank you. In your article "Efficient and Scalable k-Means on GPUs", how did you use your binary dataset with the Rodinia k-means benchmark? I need your help: I tried running Rodinia k-means on your binary dataset and it does not work.
