TranslateProject/sources/news/20240108 AI Tools Are Here for Open-Source Audio Editor -Audacity.md
2024-01-12 09:09:39 +08:00

5.4 KiB

AI Tools Are Here for Open-Source Audio Editor 'Audacity'

Audacity is now equipped with new AI tools. Over the past couple of years, we have seen notable developments in AI, including AI-powered functionalities being added to various popular tools. And, more AI is something we already mentioned in our predictions for Linux and open source for 2024.

This time it is Audacity, one of the best audio editors for Linux, that has welcomed AI. Users can now enhance their audio editing experience by leveraging the power of AI.

Let's take a look at what these fascinating features bring to the table.

Intel Makes AI Tools 🛠️

The folks over at Intel have introduced new features for working with spoken audio and music, all powered by AI.

These features are a part of the OpenVINO plugin suite. For those unaware, Intel's OpenVINO is an AI framework that helps accelerate deep-learning models.

One cool, yet important, aspect to note is that all these features run on your PC locally! More specifically, the PC's CPU or GPU will be utilized for the processing part.

Podcast Editing

Ever wanted to quickly transcribe or even translate the audio? Transcription , which is powered by OpenAI's Whisper, allows users to do exactly this.

Transcription | Source: GitHub Intel

  • It produces a new label track that includes transcribed/translated text based on a given audio sample.
  • Translation will always produce the output in English, while transcribing will produce the output in the same language as the source audio.
  • For now, only the Whisper Base model is supported by default. Support for more models may arrive later.

Similar to the existing Noise Removal effect, there's Noise Suppression which helps remove unwanted background noise from a spoken audio sample.

  • Since it's powered by AI, you can expect this feature to perform better than the Noise Removal effect.
  • For now, only the denseunet model is supported by default. Support for more models may arrive later.

Music Generation

Yes, you read it right: You can now generate new music right inside your machine!

Music Generation allows users to generate a snippet of music. This is possible thanks to an open-source project called Riffusion , which is a music generation model based on Stable Diffusion (a popular open-source model used to generate images.)

Music Generation | Source: GitHub Intel repo

  • You generate music by giving a text prompt and adjusting some other parameters like the duration.
  • In the text prompt, you specify the genre of music to be generated and the AI will try its best to maintain the genre when generating music.
  • There exists an option to switch between the default Simple mode and an optional Advance mode for more control like specifying a start prompt and end prompt.

Furthermore, you can even remix music with Music Style Remix by selecting a portion of a music track!

Music Separation

What if you wished to create your own instrumental track of your favourite song? Maybe even have a fun karaoke session with your friends?

Music Separation allows you to split a song into either a vocal or instrumental version. There are two separation modes offered:

Music Separation. Source: github.com/intel/openvino-plugins-ai-audacity

  • 2-Stem produces two new tracks for vocals and instrumental.
  • 4-Stem produces four new tracks for vocals, drums, bass and the last one of other instruments combined.

How to Use The AI Tools? 📥

If you're a Linux user, hold on to your horses.

While the Windows version of the plugin is available for download, its Linux counterpart isn't available... just yet.

If you still want to try it out, you will need to compile the project yourself. You can refer to the official instructions for this.

Hopefully, a Linux build arrives soon!🤞

💬 What do you think about these new AI goodies for Audacity? Let us know your thoughts in the comments!



via: https://news.itsfoss.com/audacity-ai-tools/

作者:Rishabh Moharir 选题:lujun9972 译者:译者ID 校对:校对者ID

本文由 LCTT 原创编译,Linux中国 荣誉推出