Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Vercel WebApp #85

Closed
Sitevity opened this issue May 18, 2024 · 5 comments
Closed

Vercel WebApp #85

Sitevity opened this issue May 18, 2024 · 5 comments

Comments

@Sitevity
Copy link

Is your feature request related to a problem? Please describe.

I'm currently working on improving Scrape-ML's ability to handle websites with dynamically loaded content. This is a common challenge because websites often use JavaScript to fetch and display content after the initial page load. Scrape-ML's current static parsing approach often misses this dynamically generated content, leading to incomplete data extraction.

Describe the solution you'd like

I propose implementing a feature that utilizes browser automation to handle dynamic content. This could be achieved by integrating with a library like Selenium or Puppeteer. These libraries allow Scrape-ML to simulate a real browser, execute JavaScript code, and wait for the dynamically loaded content to appear before parsing the page.

Describe alternatives you've considered

I've explored using Scrape-ML's existing features like custom selectors and regular expressions to target specific elements within the source code. However, this approach becomes cumbersome and unreliable for complex websites with intricate JavaScript interactions. Additionally, it requires a deep understanding of the website's underlying code, making it difficult for users who are not familiar with web development.

Additional context

Several popular web scraping frameworks utilize browser automation for handling dynamic content. This functionality has become a critical aspect of modern web scraping due to the prevalence of dynamic websites.

Copy link

Thank you for raising a issue, Hope you enjoing the open source. we try to reply or assign as soon possibe. Connect with mentor.

@Sitevity
Copy link
Author

I Request You to Assign me This Feature Request under GSSOC'24 (Level 3)

@AashishKumar-3002
Copy link

Hey @sanjay-kv @Sitevity if this issue is available, I would like to work on it

@sanjay-kv
Copy link
Member

Its already assigned if you want to collaborate reach out to assigned person

@github-actions github-actions bot closed this as completed Jul 5, 2024
Copy link

github-actions bot commented Jul 5, 2024

This issue has been automatically closed because it has been inactive for more than 30 days. If you believe this is still relevant, feel free to reopen it or create a new one. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Development

No branches or pull requests

3 participants