AI Voice Cloning: Clone a voice in 5 seconds and generate any voice content – MockingBird


2022-09-02 (last updated 2025-02-24)

This issue recommends a Python-based AI voice-cloning project: MockingBird.

MockingBird can clone a voice from 5 seconds of audio. The output timbre is very close to the original voice, it can synthesize sounds and consonants that do not appear in the original audio sample, and it supports generating arbitrary speech content.


MockingBird Features:

Chinese: supports Mandarin and has been tested with multiple Chinese datasets: aidatatang_200zh, magicdata, aishell3, biaobei, MozillaCommonVoice, data_aishell, etc.
PyTorch: works with PyTorch and has been tested with version 1.9.0 (the latest as of August 2021) on Tesla T4 and GTX 2060 GPUs.
Windows + Linux: runs on both Windows and Linux (the community has also reported success on macOS with M1).
Easy & Awesome: good results with only a downloaded or newly trained synthesizer, reusing the pretrained encoder/vocoder or using real-time HiFi-GAN as the vocoder.
Webserver Ready: your training results can be served for remote invocation.

How to use:

1. Installation

Install PyTorch.
Install ffmpeg.
Run pip install -r requirements.txt to install the necessary packages.
Install webrtcvad: run pip install webrtcvad-wheels.
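
Before moving on, it can help to confirm that the prerequisites are actually visible to Python. The following is a minimal sketch, not part of MockingBird: the function name check_environment is hypothetical, and it assumes that PyTorch is importable as torch, that webrtcvad-wheels provides a module named webrtcvad, and that the ffmpeg executable is on your PATH.

```python
import importlib.util
import shutil


def check_environment(modules=("torch", "webrtcvad"), tools=("ffmpeg",)):
    """Return a dict mapping each prerequisite name to True (found) or False."""
    status = {}
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        status[name] = importlib.util.find_spec(name) is not None
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        status[tool] = shutil.which(tool) is not None
    return status


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

Anything reported as MISSING points back to the corresponding installation step above.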

2. Prepare the pretrained models

2.1 Train the synthesizer model yourself using a dataset (alternative to 2.2)

Download the dataset and extract it. Make sure you can access all the audio files (such as .wav files) in the train folder.
Preprocess the audio and mel spectrograms: python pre.py <datasets_root> -d {dataset} -n {number}. The following parameters can be passed in:
-d {dataset} specifies the dataset. aidatatang_200zh, magicdata, aishell3, and data_aishell are supported. The default is aidatatang_200zh.
-n {number} specifies the number of parallel processes. On an 11770k CPU with 32 GB of RAM, a value of 10 was tested without problems.

For example, if you download aidatatang_200zh to drive D and the train folder path is D:\data\aidatatang_200zh\corpus\train, then your datasets_root is D:\data\
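
The relationship between the train folder and datasets_root can be sketched in a few lines of Python. This is an illustration only: the helper names datasets_root_for and preprocess_command are hypothetical, the script name pre.py is assumed from the command above, and the code only builds the argument list rather than running anything.

```python
from pathlib import PureWindowsPath

# Dataset names listed as supported by the -d option above.
SUPPORTED_DATASETS = ("aidatatang_200zh", "magicdata", "aishell3", "data_aishell")


def datasets_root_for(train_dir, dataset="aidatatang_200zh"):
    """Return the directory that contains the dataset folder."""
    path = PureWindowsPath(train_dir)
    if dataset not in path.parts:
        raise ValueError(f"{dataset!r} is not a component of {train_dir!r}")
    index = path.parts.index(dataset)
    # Everything before the dataset folder is the datasets root.
    return PureWindowsPath(*path.parts[:index])


def preprocess_command(root, dataset="aidatatang_200zh", workers=10):
    """Build the preprocessing command as an argument list (not executed)."""
    if dataset not in SUPPORTED_DATASETS:
        raise ValueError(f"unsupported dataset: {dataset!r}")
    return ["python", "pre.py", str(root), "-d", dataset, "-n", str(workers)]


if __name__ == "__main__":
    root = datasets_root_for(r"D:\data\aidatatang_200zh\corpus\train")
    print(root)  # D:\data
    print(preprocess_command(root))
```

PureWindowsPath is used so that the D:\ example parses the same way on any operating system; on Linux you would likely use PurePosixPath or Path instead.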

Train the synthesizer: python synthesizer_train.py mandarin <datasets_root>/SV2TTS/synthesizer
When the output in the training folder synthesizer/saved_models/ shows a clear attention line and the loss meets your needs, go on to the startup step (section 3).

2.2 Use a community pretrained synthesizer (alternative to 2.1)

Please refer to the link at the end of this article

2.3 Train the vocoder (optional)

Preprocess the data: python vocoder_preprocess.py <datasets_root> -m <synthesizer_model_path>

Replace <datasets_root> with your dataset directory and <synthesizer_model_path> with the directory of your best synthesizer model, such as synthesizer\saved_models\xxx

Train the wavernn vocoder: python vocoder_train.py <trainid> <datasets_root>

Replace <trainid> with an identifier of your choice. Training again with the same identifier continues from the existing model.

Train the hifigan vocoder: python vocoder_train.py <trainid> <datasets_root> hifigan

Replace <trainid> with an identifier of your choice. Training again with the same identifier continues from the existing model.
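
The two training commands differ only in the trailing hifigan argument, which a small helper can make explicit. This is a hedged sketch: vocoder_train_command is a hypothetical name, and it only assembles the commands shown above without running them or checking that the paths exist.

```python
import shlex


def vocoder_train_command(train_id, datasets_root, vocoder="wavernn"):
    """Build the vocoder training command as an argument list (not executed)."""
    if vocoder not in ("wavernn", "hifigan"):
        raise ValueError(f"unknown vocoder: {vocoder!r}")
    command = ["python", "vocoder_train.py", train_id, str(datasets_root)]
    if vocoder == "hifigan":
        # Per the commands above, hifigan is selected by a trailing argument;
        # wavernn is what you get when it is left off.
        command.append("hifigan")
    return command


if __name__ == "__main__":
    # "my_run" and "/data" are placeholder values for illustration.
    print(shlex.join(vocoder_train_command("my_run", "/data")))
    print(shlex.join(vocoder_train_command("my_run", "/data", "hifigan")))
```

Passing the same train_id on a later run is what resumes the earlier model, so it is worth keeping the identifier in one place.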

3. Start the program or toolbox

3.1 Start the Web program:

Run python web.py. Once it starts successfully, open the address in your browser; the default is http://localhost:8080
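
If you start the web program from a script, you may want to wait until the server accepts connections before opening the browser. The sketch below assumes the default host and port mentioned above; wait_for_port is a hypothetical helper using only the standard library, and it checks only that the TCP port is open, not that the page itself loads.

```python
import socket
import time


def wait_for_port(host="localhost", port=8080, timeout=30.0, interval=0.5):
    """Return True once a TCP connection succeeds, False if timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # create_connection raises OSError while nothing is listening.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)


if __name__ == "__main__":
    if wait_for_port():
        print("Server is up: http://localhost:8080")
    else:
        print("Server did not respond in time")
```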


3.2 Starting the Toolbox

python demo_toolbox.py -d <datasets_root>

Specify the path to an available dataset. If supported datasets are found there, they are loaded automatically for debugging, and the directory also serves as storage for manually recorded audio.


You can read more on your own.
