-
-
Notifications
You must be signed in to change notification settings - Fork 166
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RFC] Metascraper for e-commerce #412
Comments
Thanks for helping to build easier e-commerce data extraction. Overall, e-commerce sites that I've tested that use ld+json tend to consistently contain brand, product name, and sku information in a predictable manner. Sites that opt for structured microdata without ld+json tend to be more inconsistent in how they represent brand information; with some using an element with As of today, critical e-commerce data I'm seeking include product name, product brand, and product sku. In the near future, I may have a need for product pricing, variants, and accessories as defined in https://schema.org/Product. Some data-gathering strategies I intend to use for products include:
Based on current Microlink features, I am able to extra product data using the Product pages I have tested:
|
Has this moved anywhere in the past last years? or are you using addons like https://github.com/samirrayani/metascraper-shopping? very keen to know more about this. |
https://github.com/zbicin/metascraper-shopping might have some of the goods that you are looking for. |
The idea behind this issue is to determine what kind of data can be extracted and normalized across e-commerce URLs.
examples of e-commerces
(no exhausted list, we need a lot more!)
The text was updated successfully, but these errors were encountered: