Auto burp detection and compilation (Open source code)

Artem · 21 November 2024 15:40

I would like to share my script that helps me create compilations.
Here’s how the script works:

I write YouTube video links in the data.txt file in the required format and run the code.
The script downloads the videos, detects timestamps, saves a compilation of the video, and then deletes the auxiliary files before moving on to the next video.
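
The loop described above can be sketched roughly as follows. This is only an outline under assumptions, not the actual script: the one-link-per-line layout of data.txt is a guess (the required format is described in the notebook), and `download`, `detect` and `compile_clips` are hypothetical callables standing in for the real download, analysis and compilation steps.

```python
import tempfile
from pathlib import Path
from typing import Callable, List, Sequence


def read_links(text: str) -> List[str]:
    """Return links from data.txt-style text.

    Assumption: one link per line; blank lines and '#' lines are skipped.
    """
    lines = (ln.strip() for ln in text.splitlines())
    return [ln for ln in lines if ln and not ln.startswith("#")]


def process_links(
    links: Sequence[str],
    download: Callable[[str, Path], Path],               # hypothetical step
    detect: Callable[[Path], List[float]],               # hypothetical step
    compile_clips: Callable[[Path, List[float], Path], None],  # hypothetical step
    out_dir: Path,
) -> List[Path]:
    """Process links one at a time, discarding auxiliary files after each."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    results = []
    for n, link in enumerate(links, start=1):
        # Everything written to `work` is deleted when the block exits,
        # so only the finished compilation survives.
        with tempfile.TemporaryDirectory() as work:
            video = download(link, Path(work))
            timestamps = detect(video)
            target = out_dir / f"compilation_{n:03d}.mp4"
            compile_clips(video, timestamps, target)
            results.append(target)
    return results
```

Keeping each video's temporary files in its own directory means a crash on one link does not leave leftovers that confuse the next one.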

The idea to create my own script came to me after I saw a couple of similar projects on a forum and faced some difficulties with them.
I’m aware that there is already a website that performs such tasks, but I haven’t tried it due to payment issues.
I also saw a GUI-based program, but my script turned out to be more precise and faster. Moreover, I haven’t come across any open-source programs for this purpose on the forum.

For an overview of the script, I created a Google Colab notebook (it includes an example of how it works). You can check it out via this link: Google Colab Notebook.
Please read the description carefully and follow the steps in order.

I personally run the script in Jupyter Notebook installed on my computer, because the analysis and compilation steps run faster there than in Google Colab, though this depends on your computer’s specifications.

To run the script locally like I do, you need to:

Install Python.
Install the required libraries specified in the script.
Install Jupyter Notebook.
Add the necessary files (the same ones you would upload to Google Colab).
Copy and run the code.

For installing Jupyter Notebook, I recommend looking up instructions online; that’s how I managed to do it.
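
Before running the notebook locally, a quick check like the one below can show which libraries are still missing. The module list here is an assumption for illustration: only yt_dlp is named in this post, and tensorflow and numpy are guesses based on YAMNet being a TensorFlow model. The authoritative list is the import section at the top of the script.

```python
import importlib.util
from typing import Iterable, List

# Assumed list; replace it with the imports actually used in the script.
ASSUMED_MODULES = ["yt_dlp", "tensorflow", "numpy"]


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be imported."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    missing = missing_modules(ASSUMED_MODULES)
    if missing:
        # Note: import names and pip names can differ
        # (the yt_dlp module is installed with "pip install yt-dlp").
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are installed.")
```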

My script is based on the pre-trained YAMNet model (you can read more about it on GitHub; it’s open-source) and a few additional modules. This allows the algorithm to detect timestamps quite accurately (the sensitivity is adjustable through a threshold), sometimes finding things I wouldn’t have noticed myself.

I use a threshold setting of 0.02, which works perfectly for me, along with 2 seconds before and 3 seconds after the detected timestamp. This also suits my needs. Of course, I further refine the resulting compilations in Sony Vegas, as software tools aren’t yet perfect.
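
To illustrate how these three settings interact, here is a minimal sketch that turns per-frame scores for one sound class into clip intervals. It is not the script's actual code. It assumes one score per YAMNet frame with a 0.48-second hop between frames, and it merges overlapping clips, which the real script may or may not do; the function and parameter names are made up for the example.

```python
from typing import List, Optional, Sequence, Tuple


def clips_from_scores(
    scores: Sequence[float],
    hop_s: float = 0.48,       # assumed spacing between score frames
    threshold: float = 0.02,   # value from the post
    before_s: float = 2.0,     # padding before each detection
    after_s: float = 3.0,      # padding after each detection
    duration_s: Optional[float] = None,
) -> List[Tuple[float, float]]:
    """Convert per-frame scores into merged (start, end) clips in seconds."""
    clips: List[Tuple[float, float]] = []
    for i, score in enumerate(scores):
        if score < threshold:
            continue
        t = i * hop_s
        start = max(0.0, t - before_s)
        end = t + after_s
        if duration_s is not None:
            end = min(end, duration_s)
        if clips and start <= clips[-1][1]:
            # Overlaps the previous clip: extend it instead of adding a new one.
            clips[-1] = (clips[-1][0], max(clips[-1][1], end))
        else:
            clips.append((start, end))
    return clips
```

A lower threshold catches quieter events at the cost of more false positives, while the padding values only change how much context surrounds each detection.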

Downsides of the script:

With a large volume of videos, after some time (for me, around 1–2 hours), the cookies file has to be refreshed to confirm you’re not a bot. After that, I remove the already processed links and the algorithm resumes.
A rare error occurs if the video ID in a link starts with a - (minus) sign. This happens approximately once every 20 hours of processing.
The algorithm sometimes reacts to unrelated sounds, such as the clatter of a bottle (though not always) or a loud, bassy voice (possibly because it resembles a burp). However, such false positives are not frequent enough to bother me. Besides, I don’t have access to a better model than YAMNet, which is trained on the large AudioSet dataset.
For videos longer than 20 minutes, the algorithm struggles. For shorter videos it works perfectly: for example, analyzing a 10-minute video takes less than a minute. I suspect the limit is due to my PC’s specs (GTX 1060, Ryzen 1500X, and 16GB RAM). I split videos over 20 minutes into smaller files, although I rarely process such long videos.
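
One plausible cause of the minus-sign error is that a bare video ID beginning with "-" gets mistaken for a command-line option by the downloader. That is an assumption, not a confirmed diagnosis. If it is the cause, expanding bare IDs into full watch URLs before downloading should sidestep it. The helper name and the sample ID below are made up for illustration.

```python
import re

# YouTube video IDs are 11 characters from this alphabet.
_ID_RE = re.compile(r"^[A-Za-z0-9_-]{11}$")


def to_watch_url(link_or_id: str) -> str:
    """Expand a bare video ID into a full URL; leave real URLs untouched."""
    value = link_or_id.strip()
    if _ID_RE.match(value):
        return "https://www.youtube.com/watch?v=" + value
    return value


# "-abcDEF1234" is an invented ID used only as an example.
print(to_watch_url("-abcDEF1234"))
```

If the downloader is called as a command-line program instead, putting `--` before the ID is the usual way to mark the end of options.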

Additional features of the script:

Since downloads are handled by the yt_dlp module, it can download videos not only from YouTube but also from similar platforms (I’ve tested it with one messenger, and it worked).
If the videos are already stored on your PC, you can modify the code slightly to process them. You’ll just need to extract the audio from the videos and reformat it as required by the script (single-channel WAV format, among other specifics listed in the audio loader function).
The YAMNet model is quite versatile. If you’re interested in detecting farts, for instance, you can simply change two numbers in the code. I recommend checking the .csv table file for supported sounds.
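
To find which numbers to change for another sound, the class table can be searched by name instead of read by eye. The sketch below assumes the .csv file is YAMNet's class map with the columns index, mid and display_name. The three sample rows use invented index and mid values purely for demonstration, so look up the real values in the actual file.

```python
import csv
import io
from typing import List, Tuple


def find_class_indices(csv_text: str, keyword: str) -> List[Tuple[int, str]]:
    """Return (index, display_name) for classes whose name contains keyword."""
    reader = csv.DictReader(io.StringIO(csv_text))
    needle = keyword.lower()
    return [
        (int(row["index"]), row["display_name"])
        for row in reader
        if needle in row["display_name"].lower()
    ]


# Toy data: the index and mid values here are NOT the real ones.
SAMPLE = '''index,mid,display_name
0,/m/aaa,Speech
1,/m/bbb,"Burping, eructation"
2,/m/ccc,Fart
'''

print(find_class_indices(SAMPLE, "burp"))
```

With the real file, the same call would be `find_class_indices(open(path, encoding="utf-8").read(), "fart")`, where `path` points to the class map.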

Suggestions for improving the script:

While I’m not a professional programmer, I’m satisfied with the current script. However, I’d be happy to hear suggestions from experienced developers on how to improve it.
I think the process of video creation could be optimized by utilizing NVIDIA hardware acceleration.
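
As a starting point for that idea, here is a hedged sketch of how an ffmpeg encode could be switched to NVIDIA's hardware encoder. It assumes the compilation step can be done by calling ffmpeg and that the installed ffmpeg build includes the h264_nvenc encoder; the current script may create videos in a different way, so this is an illustration rather than a drop-in change.

```python
from typing import List


def build_encode_command(src: str, dst: str, use_nvenc: bool = True) -> List[str]:
    """Build an ffmpeg command that re-encodes video on the GPU or the CPU."""
    # h264_nvenc is ffmpeg's NVIDIA hardware H.264 encoder;
    # libx264 is the common software fallback.
    codec = "h264_nvenc" if use_nvenc else "libx264"
    return ["ffmpeg", "-y", "-i", src, "-c:v", codec, "-c:a", "copy", dst]


cmd = build_encode_command("input.mp4", "output.mp4")
print(" ".join(cmd))
# To actually run it: subprocess.run(cmd, check=True)
```

Whether this is faster in practice depends on the GPU and on how much of the total time is spent encoding rather than analyzing audio.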

I hope I’ve explained my script clearly enough. Feel free to ask any questions! (I also hope Google Colab works fine, as transferring the code there was a bit challenging for me.)

Maycon · 1 December 2024 05:08

I’m not a programmer, but I’m interested in trying your code. However, I don’t understand how to start: I don’t know which libraries the project requires or other basic things. Can you write a simpler but complete explanation for people who don’t understand much? Thank you for your abilities!

Artem · 24 December 2024 16:08

I created a program based on this code, and it is 700 MB in size because the libraries were included during compilation.

When starting, you need to press Enter once and wait (it takes about a minute for me). After that, you can enter the parameters (it’s pretty easy to understand if you read the Google Colab documentation).

Important: The yamnet.h5 file should be in the same folder as the program (unfortunately, I couldn’t include it in the .exe file).

The program has the same limitations as the code:

There will be issues if the first character of the unique video ID is a minus sign.
The program processes videos up to 10 minutes long quickly (at least for me). For longer videos, the processing time increases significantly (on 20-minute videos, my computer froze, and I had to restart it).

If anyone is interested, I can recompile the program with these issues fixed (the minus-sign error removed and long videos split into 10-minute parts). I can also write more detailed instructions.
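
The splitting of long videos into 10-minute parts could look something like this. It is only a sketch of the arithmetic, with an invented function name, and it computes the time ranges without touching any files.

```python
from typing import List, Tuple


def split_ranges(duration_s: float, part_s: float = 600.0) -> List[Tuple[float, float]]:
    """Split [0, duration_s] into consecutive parts of at most part_s seconds."""
    if duration_s <= 0 or part_s <= 0:
        raise ValueError("duration_s and part_s must be positive")
    ranges = []
    start = 0.0
    while start < duration_s:
        end = min(start + part_s, duration_s)
        ranges.append((start, end))
        start = end
    return ranges


# A 25-minute video becomes two full parts and one 5-minute remainder.
print(split_ranges(25 * 60))
```

A sound that falls exactly on a boundary could be cut in half, so a real implementation might overlap neighbouring parts by a few seconds.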

I hope I did everything correctly. I’ve tested it, but if anything goes wrong, let me know.

Artem · 23 February 2025 02:13

I’ve optimized the code: it now uses less RAM and downloads only the necessary segments instead of the entire video. If anyone needs it, I can share the Google Colab.
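
For readers curious how segment-only downloading can work, one possible approach is yt-dlp's `--download-sections` option, sketched below as a command builder. This is an assumption about the technique, not a description of the updated code. It also assumes the clip times are already known, for example from analyzing the audio track first.

```python
import math
from typing import List, Sequence, Tuple


def fmt_ts(seconds: int) -> str:
    """Format whole seconds as H:MM:SS."""
    hours, rem = divmod(seconds, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours}:{minutes:02d}:{secs:02d}"


def build_section_command(url: str, clips: Sequence[Tuple[float, float]]) -> List[str]:
    """Build a yt-dlp command that fetches only the given (start, end) clips."""
    cmd = ["yt-dlp"]
    for start, end in clips:
        # Round outward so the clip never loses part of the detected sound.
        section = f"*{fmt_ts(math.floor(start))}-{fmt_ts(math.ceil(end))}"
        cmd += ["--download-sections", section]
    # "--" marks the end of options, so the URL is never parsed as a flag.
    cmd += ["--force-keyframes-at-cuts", "--", url]
    return cmd


print(build_section_command("https://example.com/v", [(3.2, 8.5)]))
```

Cutting at exact times requires re-encoding around the cut points, so this trades download size for some extra processing.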

Artem · 20 May 2025 23:34

I have updated and improved the code.
Link: https://colab.research.google.com/drive/1b3vvDFLUqf6UZa54yefiK0gN1C9xpBQ5?usp=sharing

TC · 21 May 2025 03:52

Nice work! This is really well documented!

Alpo · 21 May 2025 15:16

As someone who is going to try to learn to use this, I think more detailed instructions would be great, since I’m not super experienced.