Commit

Merge pull request #20 from zytedata/dud-upgrade
use new duplicate-url-discarder==0.2.0 item pipeline
BurnzZ authored Jul 23, 2024
2 parents 358342a + 3e0cf2c commit 74d81c8
Showing 2 changed files with 17 additions and 1 deletion.
2 changes: 1 addition & 1 deletion requirements.txt
@@ -1,4 +1,4 @@
Scrapy==2.11.2
scrapy-zyte-api[provider]>=0.17.3
zyte-spider-templates==0.7.2
-duplicate-url-discarder[rules]>=0.1.0
+duplicate-url-discarder[rules]>=0.2.0
16 changes: 16 additions & 0 deletions zyte_spider_templates_project/settings.py
@@ -30,3 +30,19 @@
"zyte_spider_templates.pages",
"zyte_spider_templates_project.pages",
]

# duplicate-url-discarder
DUD_ATTRIBUTES_PER_ITEM = {
"zyte_common_items.Product": [
"canonicalUrl",
"brand",
"name",
"gtin",
"mpn",
"productId",
"sku",
"color",
"size",
"style"
],
}
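
The setting above lists, for each item class, the attributes used to decide whether two items are duplicates. As a rough illustration of that idea only, the sketch below hashes a chosen set of attribute values and keeps the first item seen for each hash. It is not the duplicate-url-discarder implementation: `item_signature`, `drop_duplicates`, and the shortened `ATTRIBUTES` list are hypothetical names introduced here, and plain dicts stand in for `zyte_common_items.Product`.

```python
import hashlib
import json
from typing import Any, Iterable, List, Mapping

# Hypothetical subset of the attributes configured for Product above.
ATTRIBUTES = ["canonicalUrl", "brand", "name", "sku"]


def item_signature(item: Mapping[str, Any], attributes: Iterable[str]) -> str:
    """Hash the selected attribute values into a stable hex digest."""
    selected = {name: item.get(name) for name in attributes}
    # sort_keys makes the serialization independent of attribute order;
    # default=str covers values that are not JSON-serializable.
    payload = json.dumps(selected, sort_keys=True, default=str)
    return hashlib.sha1(payload.encode("utf-8")).hexdigest()


def drop_duplicates(
    items: Iterable[Mapping[str, Any]], attributes: Iterable[str]
) -> List[Mapping[str, Any]]:
    """Keep only the first item seen for each attribute signature."""
    attributes = list(attributes)
    seen = set()
    unique = []
    for item in items:
        signature = item_signature(item, attributes)
        if signature in seen:
            continue  # same attribute values as an earlier item
        seen.add(signature)
        unique.append(item)
    return unique


if __name__ == "__main__":
    products = [
        {"url": "https://example.com/p/1?ref=a", "canonicalUrl": "https://example.com/p/1",
         "brand": "Acme", "name": "Widget", "sku": "W-1"},
        {"url": "https://example.com/p/1?ref=b", "canonicalUrl": "https://example.com/p/1",
         "brand": "Acme", "name": "Widget", "sku": "W-1"},
        {"url": "https://example.com/p/2", "canonicalUrl": "https://example.com/p/2",
         "brand": "Acme", "name": "Gadget", "sku": "G-1"},
    ]
    for product in drop_duplicates(products, ATTRIBUTES):
        print(product["url"])
```

In this sketch the first two products differ only in `url`, which is not among the compared attributes, so the second one is dropped. Listing more attributes makes the comparison stricter; listing fewer makes it more aggressive.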
