Skip to content

shivam2502/MLinPython

 
 

Repository files navigation

MLinPython

This repository has Jupyter Notebooks showcasing code examples for the purpose of Data Analysis and Machine Learning in Python. The notebooks contain code blocks that can be used as reference for required tasks.

Notebooks:

Python

  • 1_Data_operations.ipynb: Covers major data operations required before getting into any analysis or model building
  • 2_Pandas_apply_optimization.ipynb: Shows the comparison between various ways of applying functions to a pandas df. Helps in optimizing pandas codes
  • 3_Clustering_kmeans.ipynb: Showcases the flow of a clustering exercise using customer sales data

PySpark

  • pyspark/1_Clustering_kmeans.ipynb: Showcases the flow of a clustering exercise using customer sales data

Planned for immediate future:

  • Linear regression
  • Logistic regression

Planned for later (list is WIP as well; suggestions welcome):

  • Decision Trees and Random forests
  • More nbs for PySpark

I plan to keep updating existing notebooks as well along the way.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 53.2%
  • HTML 46.2%
  • Python 0.6%