YouTube working on an AI music tool that’ll let you use the voices of famous musicians

YouTube is apparently working on a new AI tool that could give content creators the ability to produce songs using the voices of famous singers and musicians.

According to a recent Bloomberg report, the platform has approached several record labels with this technology with negotiations still ongoing. YouTube is trying to obtain rights to use certain songs to train the AI while also trying not to step on any land mines that would lead to them getting sued to high heaven. We’re already seeing a similar situation happen with OpenAI as it’s currently being sued by 17 authors, including A Song of Ice and Fire creator George R.R. Martin, who all allege ChatGPT is illegally using their work. Bloomberg states musicians and labels want to maintain control over their work so developers aren’t using it “to train models without permission or compensation.”

Originally, a beta of this tech was supposed to be shown off during the Made On YouTube event last month. Billboard states in their report the beta would have had a “select pool of artists [give] permission to” certain creators to use their likeness on the platform. Eventually, it would officially launch as a feature where everybody can try using the voices of consenting artists. 

Mixed response

The response from the music industry at large has been mixed. Bloomberg claims “companies have been receptive” agreeing to work with YouTube on this project. However, Billboard states record executives have had a tough time finding artists willing to participate. Some acts feel anxious about putting their voices into “the hands of unknown creators who could use them to make statements or sing lyrics” that they don’t agree with.

YouTube is trying to position itself as everybody’s best friend – as a partner to help the music industry figure this whole thing out. However, the air is gloomy. The industry sees generative AI as an unstoppable force, but it’s not an immovable object. The technology is an inevitability that they’ll have to deal with or they risk getting left behind. 

Ray of positivity

There’s another snag in all this regarding publishing. Making music isn’t a one-person show as there are entire teams involved in production. To solve this, a Billboard source says YouTube will probably give labels one big licensing fee that they have to “figure out how to divide among” songwriters.

Despite the dour attitude, there is some positivity. Billboard claims rights holders are engaging in “good faith to get a deal done” amicably. A few artists do “recognize these models could open new avenues for creative expression.” Record executives may be less keen as another Billboard source states AI can put “companies at a disadvantage”.

We’ll just have to wait and see what comes from all this. Again, YouTube’s new model could help people explore their creative side assuming deals are made fairly.

While we're on the topic of production, be sure to check out TechRadar's list of the best free music-making software for 2023.

You might also like

TechRadar – All the latest technology news

Read More

ChatGPT can now look at pictures and tell you a bedtime story in five different voices

ChatGPT can now hear, see and speak, opening up a whole new world of possibilities for how we interact with AI chatbots. The new capabilities unlock the ability to have a voice conversation with ChatGPT, or physically show the bot what you’re talking about. 

According to the official OpenAI blog post, you’ll soon be able to show the bot pictures of a landmark while on holiday and have a conversation about the history behind the structure. You could also send the bot a photo of your fridge contents and have it whip up a potential recipe.  

The new features will be rolling out to ChatGPT Plus and Enterprise users first over the next few weeks. Voice is coming to iOS and Android apps, and images will be available across platforms. As with most ChatGPT features, users who aren’t subscribed to the Plus platform will likely see the features a little later. 

ChatGPT talks back

The blog post notes that you’ll now be able to engage in back-and-forth conversations with your AI assistant on the go via the phone app. From what we can tell it would be a similar experience to how you’d speak to Siri or Amazon Alexa

The video example on the blog post shows off a stylish user interface with a voice asking ChatGPT to tell a bedtime story, with the user interrupting every so often to ask questions. 

Regardless of how you might feel about the technology it’s still very impressive. We’ll have to wait to see if real conversations match up with the seamless example in the video, but if they do, Siri and Amazon Alexa have a lot to be worried about. If I can access a talkative, intelligent chatbot like ChatGPT, which looks at pictures and can go into depth about topics without pause, why would I ever use any other virtual assistants? 

If you’re a Plus subscriber, head over to Settings, click ‘New Features’ on the mobile app and opt into voice conversations. You’ll be able to choose your favorite voice out of five different options: Sky, Cove, Ember, Breeze and Juniper, and you can listen to each one over on the official site.

Sight for sore eyes

ChatGPT can also now look at more than one image as well. You can show graphs that need analyzing, get help with homework or just show a rough draft of work you’d like feedback on, but can’t be bothered to type out. 

If you want it to focus on something specific in the photo, you can use the new drawing tool within the ChatGPT app and circle exactly what you want the bot to concentrate on. 

While this is scarily impressive for a generative AI chatbot, there are concerns that immediately spring to mind upon hearing about the new features. 

OpenAI does acknowledge these concerns at the bottom of the announcement, stating that with new features come new challenges, including hallucinations – basically an incorrect response given by an AI bot but delivered with confidence – and the possibility of the voice capabilities that impersonate public figures or commit fraud. 

In order to combat this, OpenAI states that Voice Chat was created with real voice actors, and the image input feature was tested with rosh domains in extremism and scientific proficiency, to “align key features for responsible usage”.  

We’re so incredibly buzzed to try out the new features, especially the ability to chat directly to ChatGPT and probe its mind. We’re also keen to see how this will ripple down to other products like Bing AI, Google Bard and even Meta’s budding AI project. As ChatGPT is an AI trailblazer, introducing new features like this will mean everyone else will have to catch up.

You might also like…

TechRadar – All the latest technology news

Read More

Windows 11 gets smart interface changes and new voices

Windows 11 has a new preview out in the Dev Channel which comes with some smart tweaks for the interface, and some better, more natural, voices for Narrator.

Narrator – the built-in tool which reads out the contents of the screen for you, such as a web page, for example – now has two new natural voices in English US (female), which are called ‘Jenny’ and ‘Aria’. Users can select whichever they prefer, and once the voices are downloaded and installed, they work without an internet connection.

Microsoft has also introduced some new keyboard shortcuts for Narrator in order to more easily facilitate switching between different voices (and more besides).

The new preview build 22543 further applies some small, but nifty, tweaks to the desktop interface, including for resizing snapped windows. When you’re doing this, the snapped windows (aside from the main one) are blurred out and overlaid with their relevant app icon. It’s a pretty cool effect that makes it slightly easier to see exactly how much space you’re granting these snapped windows.

Furthermore, the media control fly-out panel on the lock screen has now been changed to match the controls in Quick Settings. This particular tweak is only rolling out to a limited number of testers at the moment, and feedback will be evaluated before a wider rollout commences. In other words, don’t be surprised if you aren’t getting this yet.

As ever, there are a bunch of fixes for Windows 11 delivered in this preview, and that includes the solution to a crashing issue with File Explorer that happens when dragging a file out of a ZIP. All the work done is summed up in Microsoft’s blog post on the new build (along with the inevitable known issues with an early preview – expect some unknown ones, too).


Analysis: Pacey progress with accessibility features

Continued progress on the accessibility front is good to see, in terms of the more natural-sounding voices for Narrator, which have already been welcomed by testers who use the feature. Presumably we will see more options for different voices rolling out before long.

Accessibility is something Microsoft has rightly been prioritizing in Windows, with the most recent major move being the introduction of full voice control capabilities (built using Nuance’s Dragon speech recognition tech), and a virtual keyboard you can type with using your voice. Work on accessibility has been going on for years, of course, and bringing in very useful features like eye tracking which debuted almost five years ago with Windows 10.

TechRadar – All the latest technology news

Read More