Skip to content

Harvester - data acquisition software. Scalable modular system which supports any data type (structured/semi-structured/not-structured, regular/stream), protocols and storages (SQL/NOSQL)

Notifications You must be signed in to change notification settings

pnuzhdin/Harvester

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Harvester

Harvester - data acquisition software. Inspired by Yahoo! Harvester http://www.drdobbs.com/parallel/using-erlang-to-build-reliable-fault-tol/220600332

Scalable modular system which supports any data type (structured/semi-structured/not-structured, regular/stream), protocols and storages (SQL/NOSQL).

Main goal is to create software infrastructure for Data collection from various sources and transfering data (with some preprocessing maybe) to persistent storage for the following processing (Data Mining).

May be used for Web indexing for example.

About

Harvester - data acquisition software. Scalable modular system which supports any data type (structured/semi-structured/not-structured, regular/stream), protocols and storages (SQL/NOSQL)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages