An app to scrape data from news websites and display the articles in web GUI.
Explore the docs »
View Demo
·
Report Bug
·
Request Feature
Table of Contents
This is an app that scrapes news article details such as title, date, auther etc. from different news websites, processes the data and shows on a single page.
This app is inspired by inshorts
Before starting, please make sure that following dependencies are installed on your machine
After the above dependecies are installed, follow below instructions:
- Clone the repo
Clone the repo
git clone https://github.com/NiravJoshi33/news_crunch.git
- Navigate to the project folder using CLI
- Install other dependecies with following command
Wait for the packages to be installed.
pip install -r requirements.txt
Follow the below instructions to run the project
- Run following script
main.py
- After the script has run, browser should open and display a GUI. In case, it doesn't open, open it manually and open following url
http://localhost:8501
- By default, a side bar will load with the page. From there, you can deselect any website you don't want to see news from and use the slider to select the number of articles to show.
- Resolve Major Bugs with the current Basic Version
- Run the app with the single script
- OpenSSL Error occuring sometimes
- Clean Data before showing in GUI
- Dates from all websites in same format
- Inconsistent card size due to different size of thumbs and excerpts
- Test app on macOS
- Data storage and access from an online database
See the open issues for a full list of proposed features (and known issues).
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
) - Commit your Changes (
git commit -m 'Add some AmazingFeature'
) - Push to the Branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Nirav Joshi
Email - [email protected]
Project Link: https://github.com/NiravJoshi33/news_crunch
- Scrapy Course - Python Web Scraping for Beginners by freecodecamp.org
- Python Streamlit Full Course
- Best-README-Template by Othneil Drew
- Awesome community on stackoverflow
- ChatGPT by OpenAI for some Debugging