-
Notifications
You must be signed in to change notification settings - Fork 23
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
chore: lint data-process code with pylint and optimize the dockerfile
Signed-off-by: bjwswang <[email protected]>
- Loading branch information
Showing
23 changed files
with
804 additions
and
523 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -252,4 +252,13 @@ arctl: fmt vet ## Build manager binary. | |
gql-gen: | ||
@go run github.com/99designs/[email protected] generate | ||
build-graphql-server: gql-gen | ||
@CGO_ENABLED=0 GOOS=linux go build -o bin/graphql-server graphql-server/go-server/main.go | ||
@CGO_ENABLED=0 GOOS=linux go build -o bin/graphql-server graphql-server/go-server/main.go | ||
|
||
|
||
# Commands for Data-Processing | ||
DATA_PROCESSING_IMAGE ?= kubebb/dp-base | ||
|
||
.PHONY: docker-build-dp-base | ||
docker-build-dp-base: | ||
docker build -f ./data-process/Dockerfile.base -t $(DATA_PROCESSING_IMAGE):$(VERSION) ./data-process/ | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
FROM python:3.10.13-slim | ||
|
||
ENV TZ=Asia/Shanghai | ||
|
||
RUN sed -i 's/deb.debian.org/mirrors.tuna.tsinghua.edu.cn/g' /etc/apt/sources.list.d/debian.sources | ||
|
||
RUN export DEBIAN_FRONTEND=noninteractive \ | ||
&& apt-get update \ | ||
&& apt-get install -y tzdata \ | ||
&& ln -fs /usr/share/zoneinfo/Asia/Shanghai /etc/localtime \ | ||
&& dpkg-reconfigure --frontend noninteractive tzdata \ | ||
&& apt-get install -y python3-distutils curl python3-pip | ||
|
||
WORKDIR /happy_work_space |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,37 @@ | ||
# Data Process | ||
The current documentation is only available in Chinese. Please refer to the content in .zh.md for specific details. | ||
# Data Processing | ||
|
||
## Current Version Main Features | ||
|
||
Data Processing is used for data processing through MinIO, databases, Web APIs, etc. The data types handled include: | ||
- txt | ||
- json | ||
- doc | ||
- html | ||
- excel | ||
- csv | ||
- markdown | ||
- ppt | ||
|
||
### Current Text Type Processing | ||
|
||
The data processing process includes: cleaning abnormal data, filtering, de-duplication, and anonymization. | ||
|
||
## Design | ||
|
||
![Design](../assets/data_process.drawio.png) | ||
|
||
## Local Development | ||
### Software Requirements | ||
|
||
Before setting up the local data-process environment, please make sure the following software is installed: | ||
|
||
- Python 3.10.x | ||
|
||
### Environment Setup | ||
|
||
Install the Python dependencies in the requirements.txt file | ||
|
||
### Running | ||
|
||
Run the server.py file in the data_manipulation directory |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.