A Git extension tool for scanning, cleaning, and rewriting the commit history of large files in Git repositories

A Git extension tool for scanning, cleaning, and rewriting the commit history of large files in Git repositories

2022-09-05 0 1,365
Resource Number 38079 Last Updated 2025-02-24
¥ 0HKD Upgrade VIP
Download Now Matters needing attention
Can't download? Please contact customer service to submit a link error!
Value-added Service: Installation Guide Environment Configuration Secondary Development Template Modification Source Code Installation

This edition recommends a Git extension tool for scanning, cleaning, and rewriting the commit history of large files in Git repositories.

introduce

git repo-clean is a Git extension tool developed with Golang that provides the ability to scan, clean, and rewrite commit records for large Git repositories.

The general flow of Git repository data filtering

Git itself provides two commands: git-fast-export and git-fast-import, which respectively export git repository data (.git/objects) as metadata in a specific format, stream read the metadata in this specific format, and generate a completed git repository. Any file that fits this format and is typed into git-fast-import will create a Git repository.

So the general process of git-clean-repo is as follows:

fast-export
    |
    | output stream
    |
    ---> parser(blob, commit, reset, tag...)
            |
            |
            |
            ---> filter(blob size, blob oid)
                    |
                    | input stream
                    |
                    ---> fast-import

Dependent environment:

  • Git >= 2.24.0 (必须)
  • Golang >= 1.15 (可选)

Install

1 Install binary packages

Download link:https://gitee.com/oschina/git-repo-clean/releases/

Decompress the file to enter the pressurized directory, which contains the following files:

-rwxrwxr-x 1 git git 6.3M Dec  1 17:31 git-repo-clean.exe   # Package (This is a package under Windows, similar to other platforms)
-rw-rw-r-- 1 git git 5.1K Dec  1 17:31 README.md            # Usage document (this document)
drwxrwxr-x 3 git git 4.0K Dec  1 17:31 docs                 # Appendix document
-rw-rw-r-- 1 git git 9.6K Dec  1 17:31 LICENSE              # licence

Instead of just clicking git-repo-clean.exe, you need to go through the following installation steps to use it.

2 Source code compilation installation package

This method requires your computer to have a basic make compilation environment, as well as a Golang environment

$ git clone https://gitee.com/oschina/git-repo-clean
# Go to the source directory and compile
$ cd git-repo-clean
$ make
# In the bin/ directory is the compiled package
  • Linux environment
sudo cp git-repo-clean $(git --exec-path)
  • Windows environment

Method 1: Put the PATH of the executable file git-repo-clean.exe into the $PATH path of the system. The general procedure is as follows: Click Windows [to] — > input path — – > select edit system environment variables — > select environment variable < N > – > select the path in the system variable (S) – > select new (N) — — > Copy the path that contains the git-repo-clean.exe file you just extracted into the newly created environment variable.

Method 2: Also copy git-repo-clean.exe to the git execution directory: cp git-repo-clean.exe $(git –exec-path). (Git may be installed in a directory on drive C and requires special permission to copy.)

Method 3: You can also copy the executable file git-repo-clean.exe directly to the C:\Windows\system32 directory. (This method is not recommended because it may cause damage to system files.)

  • Mac OS environment

Similar to the operation on Linux. However, note that the configuration may not be executed on Mac OS and requires authorization by following the following methods: System Preferences -> Security & Privacy click Allow Anyway to allow:

A Git extension tool for scanning, cleaning, and rewriting the commit history of large files in Git repositories插图

After the installation, run the following command to check whether the installation is successful:

git repo-clean --version

Use

  • Interactive usage

Enter git repo-clean to enter the interactive mode directly. This way, you can only use the default options because no parameters are added. In this mode, the default options are –scan, –delete, and –verbose. If you want to use other options, such as –branch, you can use the following options:

git repo-clean -i[--interactive]

Enter interactive mode with the -i option, which can append other options, such as git repo-clean -i –branch=topic

A Git extension tool for scanning, cleaning, and rewriting the commit history of large files in Git repositories插图1

  • Command line usage
git repo-clean --scan --limit=1G --type=tar.gz --number=1

In the warehouse, run the command line to scan the files in the current branch of the warehouse. The file size is at least 1G and the type is tar.gz

git repo-clean --scan --limit=1G --type=tar.gz --number=1 --delete

Add the –delete option to batch delete the files scanned by the current branch and rewrite the relevant commit history (including the HEAD).

If you want to clear the data of other branches or all branches, you can use the –branch option. For example, –branch=all can perform a full scan and clear the filtered data of all branches.

git repo-clean --scan --limit=1G --type=tar.gz --number=1 --delete --branch=all

With the –branch option, the files for all branches are scanned for deletion and the relevant commit history is rewritten.

A Git extension tool for scanning, cleaning, and rewriting the commit history of large files in Git repositories插图2
资源下载此资源为免费资源立即下载
Telegram:@John_Software

Disclaimer: This article is published by a third party and represents the views of the author only and has nothing to do with this website. This site does not make any guarantee or commitment to the authenticity, completeness and timeliness of this article and all or part of its content, please readers for reference only, and please verify the relevant content. The publication or republication of articles by this website for the purpose of conveying more information does not mean that it endorses its views or confirms its description, nor does it mean that this website is responsible for its authenticity.

Ictcoder Free Source Code A Git extension tool for scanning, cleaning, and rewriting the commit history of large files in Git repositories https://ictcoder.com/scan-and-clean-large-files-in-the-git-repository-and-rewrite-the-submission-history/

Share free open-source source code

Q&A
  • 1. Automatic: After making an online payment, click the (Download) link to download the source code; 2. Manual: Contact the seller or the official to check if the template is consistent. Then, place an order and make payment online. The seller ships the goods, and both parties inspect and confirm that there are no issues. ICTcoder will then settle the payment for the seller. Note: Please ensure to place your order and make payment through ICTcoder. If you do not place your order and make payment through ICTcoder, and the seller sends fake source code or encounters any issues, ICTcoder will not assist in resolving them, nor can we guarantee your funds!
View details
  • 1. Default transaction cycle for source code: The seller manually ships the goods within 1-3 days. The amount paid by the user will be held in escrow by ICTcoder until 7 days after the transaction is completed and both parties confirm that there are no issues. ICTcoder will then settle with the seller. In case of any disputes, ICTcoder will have staff to assist in handling until the dispute is resolved or a refund is made! If the buyer places an order and makes payment not through ICTcoder, any issues and disputes have nothing to do with ICTcoder, and ICTcoder will not be responsible for any liabilities!
View details
  • 1. ICTcoder will permanently archive the transaction process between both parties and snapshots of the traded goods to ensure the authenticity, validity, and security of the transaction! 2. ICTcoder cannot guarantee services such as "permanent package updates" and "permanent technical support" after the merchant's commitment. Buyers are advised to identify these services on their own. If necessary, they can contact ICTcoder for assistance; 3. When both website demonstration and image demonstration exist in the source code, and the text descriptions of the website and images are inconsistent, the text description of the image shall prevail as the basis for dispute resolution (excluding special statements or agreements); 4. If there is no statement such as "no legal basis for refund" or similar content, any indication on the product that "once sold, no refunds will be supported" or other similar declarations shall be deemed invalid; 5. Before the buyer places an order and makes payment, the transaction details agreed upon by both parties via WhatsApp or email can also serve as the basis for dispute resolution (in case of any inconsistency between the agreement and the description of the conflict, the agreement shall prevail); 6. Since chat records and email records can serve as the basis for dispute resolution, both parties should only communicate with each other through the contact information left on the system when contacting each other, in order to prevent the other party from denying their own commitments. 7. Although the probability of disputes is low, it is essential to retain important information such as chat records, text messages, and email records, in case a dispute arises, so that ICTcoder can intervene quickly.
View details
  • 1. As a third-party intermediary platform, ICTcoder solely protects transaction security and the rights and interests of both buyers and sellers based on the transaction contract (product description, agreed content before the transaction); 2. For online trading projects not on the ICTcoder platform, any consequences are unrelated to this platform; regardless of the reason why the seller requests an offline transaction, please contact the administrator to report.
View details

Related Source code

ICTcoder Customer Service

24-hour online professional services