Skip to content

Latest commit

 

History

History
14 lines (10 loc) · 574 Bytes

README.md

File metadata and controls

14 lines (10 loc) · 574 Bytes

Harvester

Harvester - data acquisition software. Inspired by Yahoo! Harvester http://www.drdobbs.com/parallel/using-erlang-to-build-reliable-fault-tol/220600332

Scalable modular system which supports any data type (structured/semi-structured/not-structured, regular/stream), protocols and storages (SQL/NOSQL).

Main goal is to create software infrastructure for Data collection from various sources and transfering data (with some preprocessing maybe) to persistent storage for the following processing (Data Mining).

May be used for Web indexing for example.