Pinned
-
Data-Ingestion-with-Kafka-and-NiFi (Public)
This project demonstrates the integration of Apache Kafka, Apache NiFi, and a Python producer/consumer using confluent_kafka.
Python
-
learning-projects (Public)
Repository for my learning projects on data engineering and machine learning.
Python
-
parkinson-predictive-analysis (Public)
This script processes the combined clinical, peptide, and protein data to train a machine learning model for predicting the severity of Parkinson's disease as measured by UPDRS scores. The script i…
Jupyter Notebook
-
Distributed_Data_Storage (Public)
Distributed data storage with Hadoop HDFS and Amazon S3.
Shell
-
Data_Processing_using_Spark_Flink (Public)
This project demonstrates data cleaning and processing with Apache Spark and Apache Flink, both locally and on AWS EMR.
Python
-
Data_Pipeline_Spark_Azure_DBT (Public)
In this project, I tried implementing a data engineering pipeline using the Medallion Architecture with a set of specific technologies: Apache Spark, Azure Databricks, Data Build Tool (DBT), and Az…