To get started with the project, follow these steps:
- Clone the repository:
  $ git clone https://github.com/open-data-kazakhstan/city-percentage-of-population-with-higher-education-degrees.git
- Create and activate a virtual environment (venv is part of the Python 3 standard library, so it does not need to be installed with pip):
  $ python -m venv /path/to/localrepo
  $ cd /path/to/localrepo
  $ Scripts/activate  # For Windows users
  $ source bin/activate  # For Linux/macOS users
- Install dependencies:
  $ pip install -r requirements.txt
- Run the scripts:
  $ python scripts/transform.py
  $ python scripts/package.py
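The two script invocations above can also be chained from a single Python entry point. The following is a minimal sketch and is not part of the repository: the helper names `pipeline_commands` and `run_pipeline` are hypothetical, and it assumes the scripts live under `scripts/` as the commands above indicate.

```python
import subprocess
import sys
from pathlib import Path

# Script order taken from the steps above: transform first, then package.
SCRIPTS = ("scripts/transform.py", "scripts/package.py")


def pipeline_commands(python_exe=sys.executable, scripts=SCRIPTS):
    """Return one argument list per script (hypothetical helper)."""
    return [[python_exe, script] for script in scripts]


def run_pipeline(repo_root="."):
    """Run each script from the repository root, stopping at the first failure."""
    for cmd in pipeline_commands():
        # check=True raises CalledProcessError on a non-zero exit code.
        subprocess.run(cmd, cwd=Path(repo_root), check=True)
```

Calling `run_pipeline()` from the repository root with the virtual environment active runs both scripts with the same interpreter that is executing the sketch, because `sys.executable` points at the currently running Python.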
Education data is sourced from stat.gov.kz.
- document.pdf: The original document containing the data about education levels in Kazakhstan.
- source.csv: Raw data on the population aged 10 and older with higher education degrees, by region, for 2021. Data for 2022 was not available from credible sources, so the dataset focuses on 2021.
- higher_education.csv: English version containing the percentage of the population with higher education degrees in 2021, for urban areas only.
All percentage values are given relative to the total population aged 10 and older.
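As a worked illustration of what "relative to the total population aged 10 and older" means, the sketch below computes such a percentage. The function name `higher_education_share` and the numbers are hypothetical and are not taken from the dataset.

```python
def higher_education_share(degree_holders, population_10_plus):
    """Percentage of the population aged 10+ holding a higher education degree."""
    if population_10_plus <= 0:
        raise ValueError("population_10_plus must be positive")
    if not 0 <= degree_holders <= population_10_plus:
        raise ValueError("degree_holders must be between 0 and population_10_plus")
    return 100.0 * degree_holders / population_10_plus


# Illustrative numbers only, not taken from the dataset.
share = higher_education_share(degree_holders=250_000, population_10_plus=1_000_000)
print(f"{share:.1f}%")  # 25.0%
```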
Note: Separate data is unavailable for the Abai, Jetisu, and Ulytau regions, so their values are those of the larger regions they are part of. For example, Jetisu's figures are included with those of the Almaty region.
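The fallback described in the note above can be sketched as a simple lookup. This is an illustration under stated assumptions, not the repository's code: only the Jetisu to Almaty pairing comes from the note, the parent regions listed for Abai and Ulytau are assumptions, and the percentages are made up.

```python
# Parent region for each newer region. Only the Jetisu entry is stated in
# the note above; the Abai and Ulytau entries are assumptions.
PARENT_REGION = {
    "Jetisu": "Almaty region",
    "Abai": "East Kazakhstan region",  # assumption
    "Ulytau": "Karaganda region",      # assumption
}


def value_for_region(region, values):
    """Return the region's own value, or its parent's value if it has none."""
    if region in values:
        return values[region]
    parent = PARENT_REGION.get(region)
    if parent is not None and parent in values:
        return values[parent]
    raise KeyError(f"no value available for {region!r}")


# Made-up percentages for illustration; not real dataset values.
example_values = {"Almaty region": 20.0, "Karaganda region": 22.5}
print(value_for_region("Jetisu", example_values))            # 20.0
print(value_for_region("Karaganda region", example_values))  # 22.5
```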
- package.py: Used to create or update the datapackage.json file containing metadata about the dataset.
- transform.py: Used to convert the source.csv file for easier processing.
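To give a sense of what a datapackage.json file holds, here is a minimal sketch that builds and writes a Data Package descriptor using only the standard library. It is not the actual package.py: the function names, the package name, the CSV path passed in, and the column names are all hypothetical.

```python
import json
from pathlib import Path


def build_descriptor(csv_path):
    """Build a minimal Data Package descriptor for one CSV resource."""
    return {
        "name": "higher-education",  # hypothetical package name
        "licenses": [{"name": "ODC-PDDL-1.0"}],
        "resources": [
            {
                "name": "higher-education",
                "path": csv_path,
                "format": "csv",
                "schema": {
                    "fields": [
                        # Hypothetical columns; the real header may differ.
                        {"name": "region", "type": "string"},
                        {"name": "percentage", "type": "number"},
                    ]
                },
            }
        ],
    }


def write_descriptor(descriptor, out_path="datapackage.json"):
    """Create or overwrite the descriptor file as UTF-8 JSON."""
    Path(out_path).write_text(
        json.dumps(descriptor, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
```

For example, `write_descriptor(build_descriptor("higher_education.csv"))` would write the descriptor to datapackage.json in the current directory, replacing any existing file of that name.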
This dataset is licensed under the Open Data Commons Public Domain Dedication and License (PDDL).