Add save and load variables functionality to data pre-processing

PauloHFS · Oct 1, 2024 · 382c534 · 382c534
1 parent 94fc939
commit 382c534
Show file tree

Hide file tree

Showing 2 changed files with 15 additions and 1 deletion.
diff --git a/my-brain/machine-learning-and-data-science-course/data-pre-processing/index.md b/my-brain/machine-learning-and-data-science-course/data-pre-processing/index.md
@@ -10,3 +10,17 @@ Data pre-processing is a crucial step in the data analysis process. It involves
 3. Data standardization: Normalize the data to a common scale to make it easier to compare.
 4. Data transformation: Convert categorical data into numerical values using techniques like one-hot encoding.
 5. Introduction of models validation: Split the data into training and testing sets to evaluate the model's performance.
+
+How to save the vars in a file and load them later:
+
+```python
+import pickle
+
+# Save the variables to a file
+with open('vars.pkl', 'wb') as f:
+  pickle.dump([X_train, X_test, y_train, y_test], f)
+
+# Load the variables from a file
+with open('vars.pkl', 'rb') as f:
+  X_train, X_test, y_train, y_test = pickle.load(f)
+```
diff --git a/...rain/machine-learning-and-data-science-course/data-pre-processing/split-data.md b/...rain/machine-learning-and-data-science-course/data-pre-processing/split-data.md
@@ -25,4 +25,4 @@ from sklearn.model_selection import train_test_split
 
 # Split the data into training and testing sets
 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
-```
+```