This list contains Java libraries related to web scraping and data processing.
- Java Web Scraping
- Network
- Web-scraping Frameworks
- HTML/XML Parsing
- Text processing
- Specific Formats Processing
- Natural Language Processing
- Browser automation and emulation
- Multiprocessing
- Queue
- URL and Network Address Manipulation
- Web Content Extracting
- Asynchronous
- WebSocket
- DNS Resolving
- Computer Vision
- Proxy Server
- Other Java Lists
- General
- Asynchronous
- Full Featured Crawlers
- Other
Libraries for parsing and manipulating plain text.
- General
Libraries for parsing and manipulating specific text formats.
- General
- Something
- TODO
Libraries for working with human languages.
- TODO
Libraries for asynchronous network programming.
- TODO
- TODO
Libraries for parsing email.
- TODO
Libraries for parsing/modifying URLs and network addresses.
- URL
- TODO
- Network Address
- TODO
Libraries for extracting web content.
- Text and metadata from HTML pages
Libraries for working with WebSocket.
- TODO
- TODO
- TODO
- TODO