Skip to content
This repository has been archived by the owner on Sep 11, 2022. It is now read-only.

Commit

Permalink
Merge branch 'master' of https://github.com/danthespal/webScraper
Browse files Browse the repository at this point in the history
  • Loading branch information
danthespal committed Feb 20, 2020
2 parents 3c482c6 + f828dfa commit 3663550
Showing 1 changed file with 4 additions and 1 deletion.
5 changes: 4 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,10 @@ foo@bar:~$ scrapy crawl crawler -o items.json
- ~~base structure for downloading items~~
- ~~user agent~~
- ~~download_delay and auto_throttle for better scraping~~
- database to store the scraped data
- ~~download from multiple pages~~
- ~~database to store the scraped data~~
- establish list of sites
- remove unicode from product names
- check for duplicates product_name
- customization for product items
- ... more will come

0 comments on commit 3663550

Please sign in to comment.