GO language efficient crawler software – Pholcus

GO language efficient crawler software – Pholcus

2022-08-02 1 1,267
Resource Number 36554 Last Updated 2025-02-24
¥ 0USD Upgrade VIP
Download Now Matters needing attention
Can't download? Please contact customer service to submit a link error!
Value-added Service: Installation Guide Environment Configuration Secondary Development Template Modification Source Code Installation
Pholcus is a highly concurrent, distributed, and heavyweight crawler software written in pure Go language, which supports three operation modes: stand-alone, server, and client, and has three operation interfaces: Web, GUI, and command line. The rules are simple and flexible, batch tasks are concurrent, the output methods are abundant (mysql, mongodb, csv, excel, etc.), and a large number of demos are shared. At the same time, it also supports two grasping modes in both horizontal and vertical directions, and supports a series of advanced functions such as simulated login and task pause and cancellation.

GO language efficient crawler software – Pholcus插图

Frame features

Provide users with a certain Go or JS programming foundation to provide heavyweight crawler tools that only need to pay attention to rule customization and complete functions;
It supports three operation modes: stand-alone, server-side and client-side;
GUI (Windows), Web, Cmd three kinds of operation interfaces, can be opened by parameter control;
Support state control, such as pause, resume, stop, etc.;
The amount of collection can be controlled;
The number of concurrent coroutines can be controlled;
Support concurrent execution of multiple collection tasks;
Support proxy IP list, which can control the frequency of replacement;
Support random pause in the collection process to simulate manual behavior;
Custom configuration input interfaces are provided based on rule requirements
There are five output methods: mysql, mongodb, kafka, csv, excel, and original file download;
Support batch output, and the quantity of each batch is controllable;
It supports static Go and dynamic JS collection rules, horizontal and vertical capture modes, and has a large number of demos.
Persistent success record for automatic deduplication;
Serialization failure requests, support deserialization automatic overload processing;
It adopts surfer high-concurrency downloader, supports GET/POST/HEAD method and http/https protocol, and supports two modes of fixed UserAgent automatic saving cookies and random large number of UserAgent disabling cookies, which highly simulates browser behavior and can realize functions such as simulated login.
The server/client mode adopts the Teleport high-concurrency SocketAPI framework, full-duplex connection communication, and the internal data transmission format is JSON.

 

GO language efficient crawler software – Pholcus插图1

The above is about some content about Pholcus (ghost spider), there are detailed installation and use tutorials on the official website, and students who want to learn about crawlers can pay attention to it.

资源下载此资源为免费资源立即下载
Telegram:@John_Software

Disclaimer: This article is published by a third party and represents the views of the author only and has nothing to do with this website. This site does not make any guarantee or commitment to the authenticity, completeness and timeliness of this article and all or part of its content, please readers for reference only, and please verify the relevant content. The publication or republication of articles by this website for the purpose of conveying more information does not mean that it endorses its views or confirms its description, nor does it mean that this website is responsible for its authenticity.

Ictcoder Free source code GO language efficient crawler software – Pholcus https://ictcoder.com/kyym/go-language-efficient-crawler-software-pholcus.html

Share free open-source source code

Q&A
  • 1, automatic: after taking the photo, click the (download) link to download; 2. Manual: After taking the photo, contact the seller to issue it or contact the official to find the developer to ship.
View details
  • 1, the default transaction cycle of the source code: manual delivery of goods for 1-3 days, and the user payment amount will enter the platform guarantee until the completion of the transaction or 3-7 days can be issued, in case of disputes indefinitely extend the collection amount until the dispute is resolved or refunded!
View details
  • 1. Heptalon will permanently archive the process of trading between the two parties and the snapshots of the traded goods to ensure that the transaction is true, effective and safe! 2, Seven PAWS can not guarantee such as "permanent package update", "permanent technical support" and other similar transactions after the merchant commitment, please identify the buyer; 3, in the source code at the same time there is a website demonstration and picture demonstration, and the site is inconsistent with the diagram, the default according to the diagram as the dispute evaluation basis (except for special statements or agreement); 4, in the absence of "no legitimate basis for refund", the commodity written "once sold, no support for refund" and other similar statements, shall be deemed invalid; 5, before the shooting, the transaction content agreed by the two parties on QQ can also be the basis for dispute judgment (agreement and description of the conflict, the agreement shall prevail); 6, because the chat record can be used as the basis for dispute judgment, so when the two sides contact, only communicate with the other party on the QQ and mobile phone number left on the systemhere, in case the other party does not recognize self-commitment. 7, although the probability of disputes is very small, but be sure to retain such important information as chat records, mobile phone messages, etc., in case of disputes, it is convenient for seven PAWS to intervene in rapid processing.
View details
  • 1. As a third-party intermediary platform, Qichou protects the security of the transaction and the rights and interests of both buyers and sellers according to the transaction contract (commodity description, content agreed before the transaction); 2, non-platform online trading projects, any consequences have nothing to do with mutual site; No matter the seller for any reason to require offline transactions, please contact the management report.
View details

Related Article

make a comment
No comments available at the moment
Official customer service team

To solve your worries - 24 hours online professional service