Forget ChatGPT – NExT-GPT can read and generate audio and video prompts, taking generative AI to the next level

2023 has felt like a year dedicated to artificial intelligence and its ever-expanding capabilities, but the era of pure text output is already losing steam. The AI scene might be dominated by giants like ChatGPT and Google Bard, but a new large language model (LLM), NExT-GPT, is here to shake things up – offering the full bounty of text, image, audio, and video output. 

NExT-GPT is the brainchild of researchers from the National University of Singapore and Tsinghua University. Pitched as an ‘any-to-any’ system, NExT-GPT can accept inputs in different formats and deliver responses according to the desired output in video, audio, image, and text responses. This means that you can put in a text prompt and NExT-GPT can process that prompt into a video, or you can give it an image and have that converted to an audio output. 

ChatGPT has only just announced the capability to ‘see, hear and speak’ which is similar to what NExT-GPT is offering – but ChatGPT is going for a more mobile-friendly version of this kind of feature, and is yet to introduce video capabilities. 

We’ve seen a lot of ChatGPT alternatives and rivals pop up over the past year, but NExT-GPT is one of the few LLMs we’ve seen so far that can match the text-based output of ChatGPT but also provide outputs beyond what OpenAI’s popular chatbot can currently do. You can head over to the GitHub page or the demo page to try it out for yourself. 

So, what is it like?

I’ve fiddled around with NExT-GPT on the demo site and I have to say I’m impressed, but not blown away. Of course, this is not a polished product that has the advantages of public feedback, multiple updates, and so on – but it is still very good. 

I asked it to turn a photo of my cat Miso into an image of him as a librarian, and I was pretty happy with the result. It may not be at the same level of quality as established image generators like Midjourney or Stable Diffusion, but it was still an undeniably very cute picture.

Cat in a library wearing glasses

This is probably one of the least cursed images I’ve personally generated using AI. (Image credit: Future VIA NExT-GPT)

I also tested out the video and audio features, but that didn't go quite as well as the image generation. The videos that were generated were again not awful, but did have the very obvious ‘made by AI’ look that comes with a lot of generated images and videos, with everything looking a little distorted and wonky. It was uncanny. 

Overall, there’s a lot of potential for this LLM to fill the audio and video gaps within big AI names like OpenAI and Google. I do hope that as NExT-GPT gets better and better, we’ll be able to see a higher quality of outputs and make some excellent home movies out of our cats seamlessly in no time. 

You might also like…

TechRadar – All the latest technology news

Read More

Microsoft unveils Turing Bletchley v3: The AI model taking Bing to the next level

Microsoft is working hard towards proving the 'intelligence' part in artificial intelligence, and has just revealed the latest version of its Turing Bletchley series of machine intelligence models, Turing Bletchley v3.

As explained in an official blog post, Turing Bletchley v3 is a multilingual vision-language foundation model, and will be integrated into many existing Microsoft products. If the name of this model sounds scary, don’t worry – let’s break it down. 

The ‘multilingual' part is self-explanatory – the model helps Microsoft products function better in a range of languages, currently standing at more than ninety. The ‘vision-language' part means that the model has image processing and language capabilities simultaneously, which is why this kind of model is known as ‘multimodal’. Finally, the ‘foundation model’ part refers to the conceptual and technical structure of the actual model. 

The first version of this multimodal model was launched in November 2021, and in 2022, Microsoft started testing the latest version – v3. Turing Bletchley v3 is pretty impressive because making a model that can “understand” one type of input (say, text or images) is already a big undertaking. This model combines both text and image processing to, in the case of Bing, improve search results. 

Incorporating neural networks 

The Turing Bletchley v3 model makes use of the concept of neural networks, which is a way of programming a machine that mimics a human brain. These neural networks allow it to make connections in the following manner, as described by Microsoft itself: 

“Given an image and a caption describing the image, some words in the caption are masked. A neural network is then trained to predict the hidden words conditioned on both the image and the text. The task can also be flipped to mask out pixels instead of words.”

The model is trained over and over in this way, not unlike how we learn. The model is also continuously monitored and improved by Microsoft developers. 

Where else the new model is being used

Bing Search isn’t the only product that’s been revamped with Turing Bletchley v3. It’s also being used for content moderation in Microsoft’s Xbox Live game service. The model helps the Xbox moderation team to identify inappropriate and harmful content uploaded by Xbox users to their profiles. 

Content moderation is a massive job scale-wise and often mentally exhausting, so any assistance that helps moderators actually have to see less upsetting content is a big win in my eyes. I can see Turing Bletchley v3 being deployed in content moderation for Bing Search in a similar manner.

This sounds like a significant improvement for Bing Search. The AI-aided heat is on, especially between Microsoft and Google. Recently, Microsoft brought Bing AI to Google Chrome, and now it’s coming for image search. I don’t see how Google doesn’t see this as direct competition in the most direct manner. Google still enjoys the greatest popularity both in terms of browser and search volume, but nothing is set in stone. Your move, Google. 

You might also like …

TechRadar – All the latest technology news

Read More

Adobe Express adds Firefly AI to its free plan for next level creativity

The all-in-one creative suite Adobe Express is getting a wave of new features; chief among them is the introduction of the Adobe Firefly generative AI.

With Firefly being added, you will be able to create “custom image and text effects” using nothing more than a simple text prompt. The official trailer displays these tools in action as it showing the steps of how to create a poster for a neighborhood event. Firefly is used to change the basic lettering of a short phrase into a “purple gloss balloon” font. It can also be used to generate decorative backgrounds for posters. 

So it's nothing groundbreaking or anything that will blow your mind, but it is a nice addition to the Express toolbox. The best part is it’s available on the free version of Adobe Express, meaning anybody can take the AI feature out for a spin. 

We do want to warn you to not expect too much from this rendition of Firefly. Like a lot of other free image generators, the results can look rather nightmarish, especially when they involve people. It’s nowhere on the same level as Generative Fill on Photoshop. We recommend keeping things simple, like throwing in graphical flourishes, if you ever decide to try out the Express AI.

The company states the prompts support over 100 languages including French, German, Japanese, Spanish, as well as Brazilian Portuguese. Something we found a little funny is how Adobe clarifies that the content Firefly generates is “designed to be safe for commercial use.” Given how several companies with AIs are currently being sued over copyright issues, it looks like the Photoshop-developer felt the need to offer some reassurance to its customers. 

Notable non-AI features

The update introduces a lot of other non-AI tools. For the sake of brevity, we’re just going to focus on the more notable ones. 

For instance, you have Quick Actions for faster editing. These actions can remove the background in images, immediately convert a video into a GIF, edit PDFs, and “animate a character using just audio”. That last one is fittingly called Animate from Audio which will have “characters come to life” as their bodies automatically sync up to recorded dialogue. It takes some of the busy work out of animating the finer details.

Adobe is also introducing an all-in-one editor consisting of various design elements and pre-made templates for social media platforms. So if you want to make videos for TikTok or Instagram but don’t know how to start, the editor can help you out tremendously. 

Availability

Everything you see here is currently available on Adobe Express for desktop. A full list detailing each feature can be found on the official website. The company says it has plans to bring the update to the mobile app soon, but declined to give an exact date for the future patch in its announcement. 

It is great to see Adobe offer some of its latest tech for free. Photoshop can be very expensive. If you’re looking for other options, check out TechRadar’s list of the best Photoshop alternatives for 2023

TechRadar – All the latest technology news

Read More

Apple thinks it has the tools to take your SMB to the next level

After launching in beta last year, Apple has announced that Apple Business Essentials is now available to all small businesses in the US.

The iPhone maker’s new service brings mobile device management, 24/7 Apple support and cloud storage from iCloud together into flexible subscription plans.

Apple Business Essentials is designed to support SMBs throughout the entire device management life cycle from device setup to device upgrades while also providing strong security, prioritized support, data storage and cloud backup. It begins with simple employee onboarding which allows a small business to easily configure, deploy and manage the company’s products from anywhere.

VP of enterprise and education marketing at Apple, Susan Prescott provided further insight on the company’s complete solution for SMBs in a press release, saying

“Apple has a deep and decades-long commitment to helping small businesses thrive. From dedicated business teams in our stores to the App Store Small Business Program, our goal is to help each company grow, compete, and succeed. We look forward to bringing Apple Business Essentials to even more small businesses to simplify device management, storage, support, and repairs. Using this new service leads to invaluable time savings for customers — including those without dedicated IT staff — that they can invest back into their business.”

Apple Business Essentials

One of the most useful features in Apple Business Essentials is Collections which allows groups of apps to be delivered to employees or teams while settings such as VPN configurations, Wi-Fi passwords and more can be automatically pushed to devices.

To get started, employees simply need to sign in to their work account on their iPhone, iPad or Mac using a Managed Apple ID. Once this is done, they will have access to everything they need to be productive including the new Apple Business Essentials app from where they can download their organization’s work apps.

Managed Apple IDs for employees can be created by federating with Microsoft Azure, Azure Director and later this spring with Google Workspace identity services. This allows employees to log into their business laptops using a single business username and passwords.

Apple Business Essentials also works with both company-provided and personal devices and with Apple’s User Enrollment feature, employees’ personal information stays private and cryptographically separated from work data.

In addition to Apple Business Essentials, Apple has announced the launch of AppleCare+ for Business Essentials which provides organizations with 24/7 access to phone support and up to two device repairs per plan per year by individual, group or device. Employees can initiate repairs directly from the Apple Business Essentials app and an Apple-trained technician will come onsite in as little as four hours to get their devices back up and running.

Apple Business Essentials with up to 2TB of iCloud cloud storage starts at $ 2.99 per month after a two-month free trial while plans for AppleCare+ for Apple Business Essentials start at $ 9.99 per month.

TechRadar – All the latest technology news

Read More

Microsoft Teams update will level the playing field for all users

Microsoft will soon roll out updates for Teams that will benefit users running the collaboration software in a virtual machine (VM).

As per three new entries to the company’s product roadmap, Microsoft Teams will soon allow users of Azure, Citrix and VMware virtual desktop services to utilize give and take controls during video meetings.

Give controls allow Teams users to recruit fellow attendees to help them present, make changes to a file and perform other actions. With take controls, meanwhile, people can request they be given these kinds of administrative privileges.

Virtualization and Microsoft Teams

As many organizations migrate to a hybrid working model, whereby workers split their time between the home and office, video meetings and virtual presentations will continue to play a major role in professional life.

It’s also common for companies to use virtual desktop infrastructure to enable secure remote work. But so far, people running Microsoft Teams in a virtual machine have not had access to the full breadth of functionality, including give and take controls.

The effect of this upcoming round of updates will be to create greater consistency across Microsoft Teams environments, and open up access to core presentation functionality to those required to use virtual desktop services by their IT teams.

Support for Azure Window Desktop and Citrix services is due to arrive in March, with support for VMware’s hypervisor set to follow one month later.

TechRadar Pro has asked Microsoft whether users of other popular virtualization services (Amazon WorkSpaces, Nutanix XI Frame etc.) can expect to benefit from similar updates in future.

TechRadar – All the latest technology news

Read More