Dataset of Indian aviation traffic. Sourced from DGCA and Ministry of Civil Aviation.
Explore the dataset using the links below:
- Daily Summary (Ministry of Civil Aviation)
- Carrier-wise: Domestic or International (DGCA)
- City-wise: Domestic or International (DGCA)
- Country-wise (DGCA)
Visualizations and related R code are available under the viz/ folder in this repository.
The complete dataset is available as CSV files under the aggregated/ folder in this repository.
Data: daily.csv
- Sourced from Ministry of Civil Aviation
- Daily reports are fetched from historical site snapshots available on the Wayback Machine
- Domestic, international, cargo, on-time performance, passenger load factor, grievances and other data points reported on the Ministry of Civil Aviation site
- Mid-2022 to 2024
- Reports on the Ministry of Civil Aviation site are updated irregularly, not daily. Many days between the start and end dates have no data points.
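Because of these gaps, it can help to list the missing days before plotting or averaging daily.csv. The following is a minimal sketch, not part of the repository: it assumes the file has a column named `date` holding ISO dates (YYYY-MM-DD), which may not match the actual header, and it uses a small made-up sample in place of the real file.

```python
import csv
import io
from datetime import date, timedelta


def missing_dates(rows, date_column="date"):
    """Return the calendar days between the first and last row that have no report.

    `date_column` is an assumed column name; adjust it to the real header.
    """
    seen = {date.fromisoformat(row[date_column]) for row in rows}
    if not seen:
        return []
    start, end = min(seen), max(seen)
    all_days = (start + timedelta(days=i) for i in range((end - start).days + 1))
    return [day for day in all_days if day not in seen]


# Made-up sample standing in for aggregated/daily.csv (column names are assumptions)
sample = io.StringIO(
    "date,passengers\n"
    "2023-01-01,100\n"
    "2023-01-02,110\n"
    "2023-01-05,120\n"
)
gaps = missing_dates(list(csv.DictReader(sample)))
print([d.isoformat() for d in gaps])  # ['2023-01-03', '2023-01-04']
```

To run this against the real file, replace `sample` with `open("aggregated/daily.csv", newline="")` and set `date_column` to whatever the date column is actually called.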
Data: domestic/city.csv
- Sourced from the Monthly Statistics (Domestic Air Transport) page in the DGCA site
- Monthly city-pair wise passenger, freight and mail traffic
- Mid-2015 to 2024
Data: domestic/carrier.csv
- Sourced from the Monthly Statistics (Domestic Air Transport) page in the DGCA site
- Monthly carrier-wise passenger, freight and mail traffic
- Mid-2015 to 2024
Data: international/city.csv
- Sourced from the Table 4 reports under the Quarterly Statistics (International Air Transport) page in the DGCA site
- Quarterly city-pair wise passenger and freight traffic
- 2015 to 2024
Data: international/country.csv
- Sourced from the Table 3 reports under the Quarterly Statistics (International Air Transport) page in the DGCA site
- Quarterly country-pair wise passenger and freight traffic
- 2015 to 2024
Data: international/carrier.csv
- Sourced from DGCA
- Monthly carrier-wise passenger and freight traffic. M1, M2 and M3 correspond to the 1st, 2nd and 3rd month of the quarter.
- 2015 to 2024
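The M1, M2 and M3 labels in international/carrier.csv can be converted to calendar months with a little arithmetic. The sketch below is illustrative only: the function name is hypothetical, and it assumes calendar quarters (Q1 is January to March). If the source reports use a different quarter convention, the offset needs adjusting.

```python
def calendar_month(quarter, month_in_quarter):
    """Map a quarter (1-4) and a month slot (1-3, i.e. M1-M3) to a calendar month (1-12).

    Assumes calendar quarters, so Q1 covers January to March.
    """
    if quarter not in (1, 2, 3, 4):
        raise ValueError(f"quarter must be 1-4, got {quarter!r}")
    if month_in_quarter not in (1, 2, 3):
        raise ValueError(f"month_in_quarter must be 1-3, got {month_in_quarter!r}")
    return (quarter - 1) * 3 + month_in_quarter


print(calendar_month(1, 1))  # 1  (M1 of Q1 is January)
print(calendar_month(3, 2))  # 8  (M2 of Q3 is August)
print(calendar_month(4, 3))  # 12 (M3 of Q4 is December)
```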
- initialize.sh: Initializes the list of XLSX URLs to be fetched
- fetch.sh: Fetches the raw XLSX files from DGCA
- parse.sh: Parses the raw XLSX files, and saves them as equivalent CSV files
- aggregate.py: Parses the individual CSV files, and aggregates them into combined CSV files
- fetch.sh: Fetches historical HTML files of the Ministry of Civil Aviation site from Wayback Machine
- parse.py: Parses the HTML files, and aggregates the reports into a CSV file
This india-aviation-traffic dataset is made available under the Open Database License: http://opendatacommons.org/licenses/odbl/1.0/. Users of this data should attribute DGCA (https://www.dgca.gov.in/digigov-portal/) and Ministry of Civil Aviation (https://www.civilaviation.gov.in/).
You are free:
- To share: To copy, distribute and use the database.
- To create: To produce works from the database.
- To adapt: To modify, transform and build upon the database.
As long as you:
- Attribute: You must attribute any public use of the database, or works produced from the database, in the manner specified in the ODbL. For any use or redistribution of the database, or works produced from it, you must make clear to others the license of the database and keep intact any notices on the original database.
- Share-Alike: If you publicly use any adapted version of this database, or works produced from an adapted database, you must also offer that adapted database under the ODbL.
- Keep open: If you redistribute the database, or an adapted version of it, then you may use technological measures that restrict the work (such as DRM) as long as you also redistribute a version without such measures.
Ensure you have bash, curl, python and ssconvert installed.
# Initialize list of URLs to scrape
bash initialize.sh
# Fetch the data
bash fetch.sh
# Generate the CSVs
bash parse.sh
# Generate the aggregated CSVs
python aggregate.py
The fetch script sources data from DGCA (https://www.dgca.gov.in/)
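The final aggregation step combines the per-report CSV files into a single file. As a rough illustration of that kind of step, and not the repository's actual aggregate.py, the sketch below concatenates CSV sources that share a header and tags each row with where it came from. The function name, the `source` column, and the sample column names and values are all made up for the example.

```python
import csv
import io


def combine_csv(sources, extra_column="source"):
    """Concatenate CSV sources with the same header, tagging each row with its origin.

    `sources` is an iterable of (name, file-like) pairs; `extra_column` is a
    hypothetical column added to record which source a row came from.
    """
    combined = []
    for name, handle in sources:
        for row in csv.DictReader(handle):
            row[extra_column] = name
            combined.append(row)
    return combined


# Made-up in-memory files standing in for the per-report CSVs
sources = [
    ("2023-01", io.StringIO("city1,city2,passengers\nDELHI,MUMBAI,10\n")),
    ("2023-02", io.StringIO("city1,city2,passengers\nDELHI,MUMBAI,12\n")),
]
rows = combine_csv(sources)
print(len(rows), rows[1]["source"], rows[1]["passengers"])  # 2 2023-02 12
```

Tagging each row with its source keeps the report period recoverable after the files are merged, which matters when the individual files do not carry a date column of their own.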
Ensure you have bash, python and waybackpack installed.
# Fetch the HTML
bash fetch.sh
# Generate the CSV
python parse.py
The fetch script sources data from Wayback Machine (https://archive.org/)
Found an error in the data processing, have a question, or looking for data aggregated differently? Create an issue with the details.
The information in this repository is intended to be updated regularly. If the data has not been updated for multiple months, create an issue.