mobashshir005/README.md

Mohammad Mobashshir

Data Engineer | Azure Databricks | PySpark | US Healthcare Domain

LinkedIn GitHub


About Me

I am a highly motivated and results-driven Data Engineer with over 3.5 years of experience specializing in data engineering within the US healthcare domain. I have a proven track record in leveraging PySpark, Python, and SQL for complex data transformation tasks, with advanced proficiency in Azure Databricks, Azure Synapse, Azure Data Lake, Azure Data Factory, and Airflow.

Certified as both a Microsoft Azure Data Engineer Associate and a Databricks Data Engineer Associate, I excel in designing and implementing scalable, efficient, and robust ETL solutions that drive business value.

Key Skills

  • Programming Languages: Python, SQL, PySpark, Scala, Shell Scripting
  • Big Data Technologies: Apache Spark, Hadoop, Hive
  • Cloud Platforms: Azure Databricks, Azure Synapse, Azure Data Lake, Azure Data Factory, Snowflake
  • Orchestration Tools: Airflow, Control-M, ASG Zena
  • DevOps Tools: Git, Jenkins, UrbanCode Deploy, Azure DevOps (ADO)
  • Project Management: Jira, Agile Methodologies
  • Domain Expertise: US Healthcare

Certifications

  • Microsoft Certified:
    • Azure Data Engineer Associate (DP-203)
    • Azure Data Fundamentals (DP-900)
  • Databricks Certified:
    • Data Engineer Associate
    • Lakehouse Fundamentals
  • Infosys Certifications:
    • MySQL Associate
    • Spark Professional

Experience

EY GDS | Data Engineer (Jun 2023 – Present)

  • Project: Nexus for Health (Healthcare Domain)
  • Developed and optimized ETL pipelines using Azure Databricks and Azure Data Lake.
  • Utilized PySpark and SQL for complex data transformations.
  • Implemented Delta Lake solutions for efficient data storage and retrieval.
  • Collaborated with cross-functional teams to align deliverables with business objectives.
  • Contributed to design documentation and participated in Agile sprints.

Infosys Limited | Big Data Engineer (Mar 2021 – May 2023)

  • Project: Data Analytics Platform Migration (Healthcare Domain)
  • Developed and optimized PySpark scripts for data migration to Azure cloud.
  • Migrated 100+ TB of healthcare data ensuring data integrity and consistency.
  • Reduced query processing time by 30% using Azure Synapse.
  • Implemented monitoring mechanisms with Azure Monitor and Log Analytics.
  • Supported advanced data modeling and analytics by collaborating with data scientists.

Awards & Recognition

  • Insta Awards: Award of Appreciation by Infosys DNA (Sep 2021, Mar 2022)
  • EY GDS User Recognition Award

Education

Bachelor of Engineering in Computer Science
Shri Ram Group of Institutions, Jabalpur, MP - GPA: 8.0

Contact

Popular repositories

  1. Face-Recognition (Public)

    Forked from jamshedakhtar804/Face-Recognition

    Face recognition using the LBPH algorithm.

    Python

  2. The-Tech-Factory (Public)

    A Flask blog for an organization.

    JavaScript

  3. Inventory-Management-App (Public)

    Frappe Flask Inventory Management App

    HTML

  4. ims (Public)

  5. mobashshir005 (Public)

    Config files for my GitHub profile.

  6. RockScissorPaperGame (Public)

    Java