Skip to content
View Elsayed91's full-sized avatar
  • Cairo

Block or report Elsayed91

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Elsayed91/README.md

Hello there! I'm Islam ✨✨

Islam's LinkedIn Outlook

🔧 Technologies & Tools

Python Apache Airflow DBT Great Expectations Kubernetes Terraform Spark Kafka Google Cloud AWS Azure PowerBi Looker Data Studio Streamlit Grafana Prometheus

🌱 About Me

Data Engineer, but I mainly work with/as Data/Dev/MLOps, so am I really a data engineer? Idk.

🚀 Featured Projects

  • NY Taxi Data & MLOps Pipeline: Automated data & MLOps pipeline leveraging Kubernetes and Apache Airflow. Integrates Spark, Kafka, and DBT with a focus on data quality. Tailors solutions for diverse user needs.
  • Xbox Data Scraping & Analysis Pipeline: Automated data-driven project leveraging Python, Airflow, and GKE. Scrapes diverse data sources, providing insights into Xbox hardware and game data.

📦 Packages

  • Easy Expectations: A python package that abstracts away the complexity of Great Expectations and allow for easy no-knowledge-required implementation for basic use cases.
  • SchemaDiff: A python package that efficiently detects files with inconsistent schemas amidst thousands of files by reading the parquet files metadata.
  • Order of The Template: A Python toolkit for parsing and processing YAML templates, capable of resolving Bash syntax environment variables and Jinja templating. It also offers schema validation functionality.

Pinned Loading

  1. xbox_de_project xbox_de_project Public

    Automated data-driven project leveraging Python, Airflow, and GKE. Scrapes diverse data sources, providing insights into Xbox hardware and software data.

    Python

  2. taxi-data-pipeline taxi-data-pipeline Public

    Automated data & MLOps pipeline leveraging Kubernetes and Apache Airflow. Integrates Spark, Kafka, and DBT with a focus on data quality. Tailors solutions for diverse user needs.

    Python 4

  3. easy_ge easy_ge Public archive

    Simplified Data Validation and Quality Testing with Great Expectations

    Python 2

  4. schemadiff schemadiff Public

    Efficiently groups Parquet files based on their schema, allowing for the detection of schema inconsistencies between files on different filesystems. Achieves high efficiency by reading metadata wit…

    Python

  5. oot oot Public

    Order Of The Template - a Python toolkit to parse YAML files and Jinja templates, resolve environment variables, and validate JSON schemas

    Python