Skip to content
Naibo Wang edited this page Jul 3, 2023 · 47 revisions

教程导航

EasySpider介绍:中国地震台网采集案例

https://www.bilibili.com/video/BV1th411A7ey/

自动/手动同类型元素匹配功能说明

https://www.bilibili.com/video/BV1pu411a7pK/

如何爬取需要登录才能爬的网站

https://www.bilibili.com/video/BV1HV4y1r7v8

实战采集汽车网文章内容并下载文章内图片

https://www.bilibili.com/video/BV1RW4y197ih/

Ebay样例博客

https://blog.csdn.net/ihero/article/details/130805504

多种选中子元素模式区别展示+将最后一次提取到的数据作为变量进行输入+定时执行任务展示

https://www.bilibili.com/video/BV14V4y1m7iL/

【重要】自定义条件判断之使用循环项内的JS命令返回值

https://www.bilibili.com/video/BV1mu411x7Nn/

如何执行自己写的JS代码和系统代码 (自定义操作)

https://www.bilibili.com/video/BV1qs4y1z7Hc/

如何自定义循环和判断条件

https://www.bilibili.com/video/BV1Ys4y1z777/

如何对元素和网页截图及命令行执行指南

https://www.bilibili.com/video/BV1dV4y1z764/

OCR识别元素内容功能

https://www.bilibili.com/video/BV1xz4y1b72D/

如何爬需要输入验证码的网站

https://www.bilibili.com/video/BV18c411K7FH

如何切换IP池和使用隧道IP - 打开详情页采集案

https://www.bilibili.com/video/BV1KT411t79n

流程图执行逻辑解析 - 58同城房源描述采集案例

https://www.bilibili.com/video/BV1YL411z7uW

MacOS系统设计和执行eBay网站爬虫任务教程

https://www.bilibili.com/video/BV1WL411h71r

如何定时执行任务

https://github.com/NaiboWang/EasySpider/issues/22

如何通过API调用来获得任务执行ID

https://github.com/NaiboWang/EasySpider/wiki/API-Invoke-Example

如何使用命令行执行任务

https://github.com/NaiboWang/EasySpider/issues/28 https://github.com/NaiboWang/EasySpider/wiki/Argument-Instruction

任务流程有多个提取数据操作时,软件的执行和输出逻辑

https://github.com/NaiboWang/EasySpider/issues/35

并行多开,同时执行多个任务教程

https://github.com/NaiboWang/EasySpider/wiki/Run-multiple-tasks-in-parallel

当前循环项包含元素的XPATH的写法

https://github.com/NaiboWang/EasySpider/wiki/%E5%BD%93%E5%89%8D%E5%BE%AA%E7%8E%AF%E9%A1%B9%E5%8C%85%E5%90%AB%E5%85%83%E7%B4%A0%E7%9A%84XPATH%E7%9A%84%E5%86%99%E6%B3%95

What is EasySpider

EasySpider is a no-code visual web crawler for no-coders and also experts to crawl data from the World Wide Web. It can design and execute web crawling tasks without coding via GUI. The workflow of a new web crawling task can be visually programmed by following EasySpider's visual wizard on the target webpages using an intuitive point-and-click interface. The generated crawler task can then be easily invoked locally or as a web service. Our EasySpider is cross-platform and flexible to adapt to different web-resources. It also supports advanced configuration for complicated tasks and extension. The whole system is open-sourced and transparent for free-access, which avoids possible privacy leakage.

Features of EasySpider

  • Interactive and user-friendly: it provides a user-friendly point-to-click GUI for quick and easy task design for non- programmers. Only a little basic knowledge, such as XPath, is needed to modify the task flow.

  • Flexible: it is able to adapt to various web resources and platforms, dynamic content support, such as collect web content generated by JavaScript (AJAX).

  • Configurable: it supports easy design of complicated crawling tasks with unlimited loops, if-conditions, and fields.

  • Cross-platform: supports on Windows, Linux, and MacOS.

  • Open-Source & Trustworthy: the source code is publicly available, making it a transparent and white-box software, with no privacy leakage risk.

  • Free: all features listed in this documentation are free of charge.

UI of EasySpider

  • UI of Crawler Task design

image

  • UI of Task Information

service_information

  • UI of Task Invocation

service_invoke

  • Example Collected Results

results

Clone this wiki locally