Skip to content

Using Python 3 to extract tables from URLs and HTML files and automatically saving them as excel files and csv files.

Notifications You must be signed in to change notification settings

Tejasweee/Website-tables-to-Excel-files

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Site-to-Excel

This script allows to pass urls or html files and it extracts tables from html contents and saves them as Excel files and CSV files in .xlsx and .csv format.

Using the Script:

  • From Command Line Argument:

python site2excel.py, url

python site2excel.py, url1, url2, url3....

python site2excel.py, afile.html

python site2excel.py, afile1.html, afile2.html, afile3.html...

  • Passing argument after running the script:

python site2excel.py

-- Now arguments can be passed in the input prompt

This script needs 'pandas' library to run. If pandas library is not installed this script will automatically try to install pandas first using pip. The secondary requirements like openpyxl library and lxml library can be easily fulfilled by running requirements.py

About

Using Python 3 to extract tables from URLs and HTML files and automatically saving them as excel files and csv files.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages