Sometimes you need to transfer data that lives in an RDBMS to Elasticsearch. Elasticsearch Data Collector can help you with that: you can easily transfer your data to Elasticsearch with a few definitions and a SQL query.
- Create a datasource to retrieve your data
- Write a SQL query and check its results with the data preview feature
- Create an Elasticsearch index from the SQL query
- Schedule a synchronization, and that's it.
After that, your data will be in Elasticsearch.
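For example, once a synchronization has run, the documents can be queried straight from Elasticsearch. A minimal sketch, assuming an index named "customers" was created through the UI:

```sh
# Search the synchronized documents in Elasticsearch
# ("customers" is a hypothetical index name created through the UI)
curl 'http://localhost:9200/customers/_search?pretty'
```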
Elasticsearch Data Collector is highly scalable: when you launch new instances, it automatically coordinates the processes to avoid conflicts.
Supported databases:
Name |
---|
MySQL |
Environment variables:
Variable | Description | Default Value |
---|---|---|
ES_DATA_COLLECTOR_ELASTICSEARH_ADDRESS | Elasticsearch address | http://localhost:9200 |
ES_DATA_COLLECTOR_CHIPHER_KEY | Cipher key used to encrypt confidential data | es-data-collector-key-0123456789 |
ES_DATA_COLLECTOR_APP_DB_CONNECTION_STRING | Application database connection string. | root:root@tcp(127.0.0.1:3306)/es-data-collector?parseTime=true |
You can build a Docker image using the commands below.
```sh
docker build --tag es-data-collector .
docker rm -f es-data-collector-app
docker run --name es-data-collector-app -p 3000:3000 -e ES_DATA_COLLECTOR_ELASTICSEARH_ADDRESS='http://192.168.1.52:9200' -e ES_DATA_COLLECTOR_APP_DB_CONNECTION_STRING='root:root@tcp(192.168.1.52:3306)/es-data-collector?parseTime=true' es-data-collector
```
After building the Docker image and running it, you can access the Elasticsearch Data Collector app at http://localhost:3000
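To verify the app is up before moving on, you can hit the UI port (assuming the container from the command above is running locally):

```sh
# The UI should respond on port 3000
curl -I http://localhost:3000
```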
Go to the Datasources menu.
Click "Add New" button and fill the form with your database information and save.
You have done. Let's to the next step...
Now let's create a new index.
Click the "Add New" button and fill in the form. Enter a suitable name: it should be lowercase, with words concatenated by underscores. You can also enter a description for your index. You must enter a valid SQL query to retrieve your data from the database, and then select your datasource. At this point you can preview the results of your SQL query by clicking the "Preview" button.
If everything is OK, save the index.
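For example, assuming a hypothetical customers table, a query such as "select id, name, email, created_at from customers" retrieves the rows that will be indexed into Elasticsearch.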
Go to the Indices menu and click the edit button in the list.
Click the "Schedule Data Sync" link.
Enter a valid cron expression that defines your synchronization period.
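For example, assuming the standard five-field cron format, "*/30 * * * *" would run the synchronization every 30 minutes.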
Enter the "Document Id Field". Document Id field must specify a unique row key in your data.
You must select a "Sync Type". There are the synchronization types: "Reload All" and "Iterative". If you select the reload all type, your exist data that is on the Elasticsearch will be deleted after synchronization. First, all the data in RDBMS will be transfered to Elasticsearch again. Don't worry. Until the synchronization is completed, your exist data will be reachable. When the synchronization is completed, you can reach the new data. And after that the old data that is on the Elasticsearch will be deleted.
If you select the Iterative type, only the data that has changed since the last synchronization is retrieved. To do that, use the special keyword ":#sql_last_value" to modify your query dynamically. For example: "select * from customers where created_at >= :#sql_last_value".
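Each time the synchronization runs, ":#sql_last_value" stands for the last synchronization time, so filtering on a timestamp column such as created_at (or a hypothetical updated_at column, if rows can change after creation) picks up only the rows added or modified since the previous run.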
You can also run the published image from Docker Hub (replace tagname with the tag you want to use):

```sh
docker run --name es-data-collector-app -p 8080:8080 -p 3000:3000 -e ES_DATA_COLLECTOR_ELASTICSEARH_ADDRESS='http://192.168.1.52:9200' -e ES_DATA_COLLECTOR_APP_DB_CONNECTION_STRING='root:root@tcp(192.168.1.52:3306)/es-data-collector?parseTime=true' ddemirel/elasticsearch-data-collector:tagname
```
You can access the UI at http://localhost:3000 and the API at http://localhost:8080.
- Docker image optimization