SeaTunnel is a high-performance distributed data integration platform

SeaTunnel is a high-performance distributed data integration platform

2022-09-14 0 824
Resource Number 38254 Last Updated 2025-02-24
¥ 0HKD Upgrade VIP
Download Now Matters needing attention
Can't download? Please contact customer service to submit a link error!
Value-added Service: Installation Guide Environment Configuration Secondary Development Template Modification Source Code Installation

SeaTunnel is an easy-to-use ultra-high-performance distributed data integration platform that supports real-time synchronization of massive data, which can stably and efficiently synchronize tens of billions of data every day, and has been used by nearly 100 companies in production.

 

SeaTunnel is a high-performance distributed data integration platform插图

Introduction to SeaTunnel

SeaTunnel does its best to solve the problems you may encounter when syncing massive amounts of data:

Data loss and duplication
Task piling up and delays
Low throughput
It takes a long time to apply to the production environment
Lack of application health monitoring

SeaTunnel use cases

Massive data synchronization
Massive data integration
ETL for massive amounts of data
Massive data aggregation
Multi-source data processing

Features of SeaTunnel

Easy to use, flexible configuration, no development required
Real-time streaming
Offline multi-source data analysis
High-performance, massive data processing capabilities
Modular and plug-in, easy to expand
SQL can be used for data processing and aggregation
Spark Structured Streaming is supported
Spark 2.x is supported
Environmental dependencies
Java runtime environment, Java > = 8

If you want to run seatunnel in a clustered environment, you’ll need one of the following Spark cluster environments:

Spark on Yarn
Spark Standalone

If you have a small amount of data or just do functional verification, you can also start it in local mode without the need for a cluster environment, SeaTunnel supports stand-alone operation. Note: seatunnel 2.0 runs on Spark and Flink

Production use cases

Weibo

Data Platform of the Value-added Business Department A Weibo business has hundreds of real-time stream computing tasks that use the internal customized version of SeaTunnel, and its sub-project Guardian does task monitoring of seatunnel On Yarn.

Sina

Big Data O&M Analysis Platform Sina O&M Data Analysis Platform uses SeaTunnel to perform real-time and offline analysis of O&M big data for Sina News, CDN and other services, and writes it to Clickhouse.

Sogou

Sogou Singularity System Sogou Singularity System uses SeaTunnel as an ETL tool to help establish a real-time data warehouse system.
Get started quickly

1.seatunnel relies on JDK1.8 runtime environment.

2. seatunnel depends on Spark, you need to prepare Spark before installing seatunnel. Please download Spark first, and select >= 2.x.x for the Spark version. After downloading and decompressing, you do not need to configure any configuration to submit the Spark deploy-mode = local task. If you want the task to run on a standalone cluster or a Yarn or Mesos cluster, please refer to the Spark configuration documentation.

3. Download the seatunnel installation package and unzip it, here is the community version as an example:

wget https://github.com/InterestingLab/seatunnel/releases/download/v<version>/seatunnel-<version>.zip -O seatunnel-<version>.zip
unzip seatunnel-<version>.zip
ln -s seatunnel-<version> seatunnel
Deploy and run

Run Seatunnel locally in local mode

./bin/start-seatunnel.sh –master local[4] –deploy-mode client –config ./config/application.conf

Run seatunnel on a Spark Standalone cluster

# client mode
./bin/start-seatunnel.sh –master spark://207.184.161.138:7077 –deploy-mode client –config ./config/application.conf

# cluster mode
./bin/start-seatunnel.sh –master spark://207.184.161.138:7077 –deploy-mode cluster –config ./config/application.conf

Run seatunnel on the Yarn cluster

# client mode
./bin/start-seatunnel.sh –master yarn –deploy-mode client –config ./config/application.conf

# cluster mode
./bin/start-seatunnel.sh –master yarn –deploy-mode cluster –config ./config/application.conf

Run seatunnel on Mesos

# cluster mode
./bin/start-seatunnel.sh –master mesos://207.184.161.138:7077 –deploy-mode cluster –config ./config/application.conf

If you want to specify the size of resources occupied by the seatunnel runtime, or other Spark parameters, you can specify the configuration file specified in –config:

spark {
spark.executor.instances = 2
spark.executor.cores = 1
spark.executor.memory = “1g”

}

资源下载此资源为免费资源立即下载
Telegram:@John_Software

Disclaimer: This article is published by a third party and represents the views of the author only and has nothing to do with this website. This site does not make any guarantee or commitment to the authenticity, completeness and timeliness of this article and all or part of its content, please readers for reference only, and please verify the relevant content. The publication or republication of articles by this website for the purpose of conveying more information does not mean that it endorses its views or confirms its description, nor does it mean that this website is responsible for its authenticity.

Ictcoder Free Source Code SeaTunnel is a high-performance distributed data integration platform https://ictcoder.com/seatunnel-is-a-high-performance-distributed-data-integration-platform/

Share free open-source source code

Q&A
  • 1. Automatic: After making an online payment, click the (Download) link to download the source code; 2. Manual: Contact the seller or the official to check if the template is consistent. Then, place an order and make payment online. The seller ships the goods, and both parties inspect and confirm that there are no issues. ICTcoder will then settle the payment for the seller. Note: Please ensure to place your order and make payment through ICTcoder. If you do not place your order and make payment through ICTcoder, and the seller sends fake source code or encounters any issues, ICTcoder will not assist in resolving them, nor can we guarantee your funds!
View details
  • 1. Default transaction cycle for source code: The seller manually ships the goods within 1-3 days. The amount paid by the user will be held in escrow by ICTcoder until 7 days after the transaction is completed and both parties confirm that there are no issues. ICTcoder will then settle with the seller. In case of any disputes, ICTcoder will have staff to assist in handling until the dispute is resolved or a refund is made! If the buyer places an order and makes payment not through ICTcoder, any issues and disputes have nothing to do with ICTcoder, and ICTcoder will not be responsible for any liabilities!
View details
  • 1. ICTcoder will permanently archive the transaction process between both parties and snapshots of the traded goods to ensure the authenticity, validity, and security of the transaction! 2. ICTcoder cannot guarantee services such as "permanent package updates" and "permanent technical support" after the merchant's commitment. Buyers are advised to identify these services on their own. If necessary, they can contact ICTcoder for assistance; 3. When both website demonstration and image demonstration exist in the source code, and the text descriptions of the website and images are inconsistent, the text description of the image shall prevail as the basis for dispute resolution (excluding special statements or agreements); 4. If there is no statement such as "no legal basis for refund" or similar content, any indication on the product that "once sold, no refunds will be supported" or other similar declarations shall be deemed invalid; 5. Before the buyer places an order and makes payment, the transaction details agreed upon by both parties via WhatsApp or email can also serve as the basis for dispute resolution (in case of any inconsistency between the agreement and the description of the conflict, the agreement shall prevail); 6. Since chat records and email records can serve as the basis for dispute resolution, both parties should only communicate with each other through the contact information left on the system when contacting each other, in order to prevent the other party from denying their own commitments. 7. Although the probability of disputes is low, it is essential to retain important information such as chat records, text messages, and email records, in case a dispute arises, so that ICTcoder can intervene quickly.
View details
  • 1. As a third-party intermediary platform, ICTcoder solely protects transaction security and the rights and interests of both buyers and sellers based on the transaction contract (product description, agreed content before the transaction); 2. For online trading projects not on the ICTcoder platform, any consequences are unrelated to this platform; regardless of the reason why the seller requests an offline transaction, please contact the administrator to report.
View details

Related Source code

ICTcoder Customer Service

24-hour online professional services