A PHP library of tools designed to handle all of your web scraping needs under an MIT or LGPL license. This toolkit easily makes RFC-compliant web requests that are indistinguishable from a real web browser, has a web browser-like state engine for handling cookies and redirects, and includes a full cURL emulation layer for web hosts without the PHP cURL extension installed. The powerful tag filtering library TagFilter is included to easily extract the desired content from each retrieved document or to process HTML documents that are offline. This toolkit also comes with classes for creating custom web servers and WebSocket servers. That custom API you want the average person to install on their home computer or deploy to devices in the enterprise just became easier to deploy.

- Carefully follows the IETF RFC Standards surrounding the HTTP protocol.
- Supports file transfers, SSL/TLS, and HTTP/HTTPS/CONNECT proxies.
- Easy to emulate various web browser headers.
- A web browser-like state engine that emulates redirection (e.g. 301) and automatic cookie handling for managing multiple requests.
- HTML form extraction and manipulation support.
- Asynchronous/Non-blocking socket support. For when you need to scrape lots of content simultaneously.
- A full cURL emulation layer for drop-in use on web hosts that are missing cURL.
- An impressive CSS3 selector tokenizer (TagFilter::ParseSelector()) that carefully follows the W3C Specification and passes the official W3C CSS3 static test suite.
- Includes a fast and powerful tag filtering library (TagFilter) for correctly parsing really difficult HTML content (e.g. Microsoft Word HTML) and can easily extract desired content from HTML and XHTML using CSS3 compatible selectors.
- TagFilter::HTMLPurify() produces XSS defense results on par with HTML Purifier.
- Includes the legacy Simple HTML DOM library to parse and extract desired content from HTML. NOTE: Simple HTML DOM is only included for legacy reasons. TagFilter is much faster and more accurate as well as more powerful and flexible.
- International domain name (IDNA/Punycode) support.
- An unnecessarily feature-laden web server class with optional SSL/TLS support.
- A decent WebSocket server class is included too.
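To make the idea of a "web browser-like state engine" concrete, here is a minimal, self-contained sketch of the concept: follow redirects while remembering cookies between requests. It is written in Python with only the standard library so that it runs on its own, and it does not use or reproduce the toolkit's PHP classes. The `BrowserState` class, the `fake_fetch` transport, and the `example.test` URLs are all hypothetical names invented for this illustration.

```python
from urllib.parse import urljoin

REDIRECT_CODES = {301, 302, 303, 307, 308}

# Hypothetical canned responses standing in for a real server:
# url -> (status code, response headers).
FAKE_SITE = {
    "http://example.test/old": (301, {"Location": "/login", "Set-Cookie": "session=abc123; Path=/"}),
    "http://example.test/login": (302, {"Location": "home"}),
    "http://example.test/home": (200, {}),
}

sent_cookies = []  # records which cookies accompanied each request


def fake_fetch(url, cookies):
    """Assumed transport: returns a canned response instead of using the network."""
    sent_cookies.append((url, cookies))
    return FAKE_SITE[url]


class BrowserState:
    """Illustrative state engine: keeps cookies and follows redirects."""

    def __init__(self, fetch, max_redirects=10):
        self.fetch = fetch
        self.max_redirects = max_redirects
        self.cookies = {}   # simplified: no domain, path, or expiry scoping
        self.history = []   # (status, url) for every hop

    def process(self, url):
        for _hop in range(self.max_redirects + 1):
            status, headers = self.fetch(url, dict(self.cookies))
            self.history.append((status, url))

            # Remember the cookie's name=value pair; attributes are ignored here.
            if "Set-Cookie" in headers:
                pair = headers["Set-Cookie"].split(";", 1)[0]
                name, _sep, value = pair.partition("=")
                self.cookies[name.strip()] = value.strip()

            # Resolve a relative Location against the current URL and continue.
            if status in REDIRECT_CODES and "Location" in headers:
                url = urljoin(url, headers["Location"])
                continue

            return status, url

        raise RuntimeError("too many redirects")


browser = BrowserState(fake_fetch)
status, final_url = browser.process("http://example.test/old")
print(status, final_url, browser.cookies)
```

A real engine also has to scope cookies by domain and path, honor expiry, and adjust the request method on some redirects; the sketch leaves all of that out to keep the two core behaviors visible.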