Skip to content
View nikitgoku's full-sized avatar

Block or report nikitgoku

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nikitgoku/README.md

Hi 👋, I'm Nikit

About Me:


Welcome to my GitHub profile! I'm Nikit Gokhale, a passionate data engineer with a knack for problem-solving and a love for all things data. With a background in computer science and a focus on data engineering and machine learning, I thrive on turning data into actionable insights and building scalable solutions that make a difference. Here on GitHub, you'll find a collection of projects reflecting my interests and expertise. From ETL data pipelines and machine learning models to software applications and automation scripts, each project represents a learning experience and a step forward in my journey as a data engineer. Whether you're a fellow developer, a data enthusiast, or simply curious about what I'm working on, I invite you to explore my repositories, contribute your ideas, and join me on this exciting adventure of continuous learning and growth.

Connect with me:

Tech Stack:

python cplusplus sql aws postgresql mysql mssql spark spark airflow docker git kubernetes linux pandas pytorch scikit_learn seaborn tensorflow

Certified:

Pinned Loading

  1. aws_data_engineering_e2e aws_data_engineering_e2e Public

    This is an end-to-end data engineering ETL project on my personal spotify streaming data for 2023 done on AWS console

    Jupyter Notebook

  2. spotify_data_investigation spotify_data_investigation Public

    Conducted an extensive SQL investigation into the top songs on Spotify in August 2019, using Kaggle's external dataset. Additionally, integrated personal Spotify streaming history data from 2023 to…

    Jupyter Notebook

  3. spotify_etl_airflow_postgress spotify_etl_airflow_postgress Public

    This repository contains code where an ETL pipeline was developed to extract Spotify data using Web API orchestrated using Airflow using Docker and stored in PostgresSQl and S3.

    Python

  4. wiki_data_streaming wiki_data_streaming Public

    This is a small end-to-end streaming data pipeline which extracts the recent changes data from Media Wiki API and loads these data into a database for further visualisation and dashboard creation.

    Python