Google provides a tool for making objects and places within video searchable

Why it matters to you

Google’s new tool for developers will enable applications to use cloud-based machine learning to detect and label objects and locations within video, speeding up searches.

During the Google Cloud Next Conference in San Francisco, Google revealed a new machine learning application programming interface (API) called Cloud Video Intelligence. With this API, developers can create applications capable of detecting objects within video and making them searchable and discoverable. The labels applied to a video can be nouns or verbs, such as “dog” and “run.”

An API is essentially a bridge between a service and an application. In this case, the API connects to the Google Cloud Machine Learning platform for the compute work and stores annotated videos on Google Cloud Storage. Through this bridge, an application built on Google’s new API gains access to that functionality and can give end users a better way of searching through videos.

“You can now search every moment of every video file in your catalog and find every occurrence as well as its significance,” Google states. “It helps you identify key noun entities of your video, and when they occur within the video. Separate signal from noise, by retrieving relevant information at the video, shot or per frame.”

In a demo, users can search for animals in an MP4 video file lasting just over a minute and a half. The labels generated by Cloud Video Intelligence, each with a confidence score, are Animal (99 percent), Wildlife (94 percent), Zoo (91 percent), Terrestrial Animal (54 percent), Nature (51 percent), Tourism (47 percent), and Tourist Destination (43 percent). The sample video features the Los Angeles Zoo and is presented by Disney’s CGI-animated movie Zootopia.
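
To illustrate how an application might use those confidence scores, here is a minimal Python sketch that keeps only the labels above a chosen cutoff. The label names and percentages come from the demo described above; the `Label` class, the `confident_labels` function, and the 0.9 default threshold are assumptions made for illustration and are not part of Google’s API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Label:
    """One video-level label and its confidence (0.0 to 1.0)."""
    name: str
    confidence: float


# Labels and confidence scores as reported for the demo clip.
DEMO_LABELS = [
    Label("Animal", 0.99),
    Label("Wildlife", 0.94),
    Label("Zoo", 0.91),
    Label("Terrestrial Animal", 0.54),
    Label("Nature", 0.51),
    Label("Tourism", 0.47),
    Label("Tourist Destination", 0.43),
]


def confident_labels(labels, threshold=0.9):
    """Return names of labels at or above the threshold, highest first.

    The threshold is a hypothetical application setting, not an API default.
    """
    kept = [label for label in labels if label.confidence >= threshold]
    kept.sort(key=lambda label: label.confidence, reverse=True)
    return [label.name for label in kept]


if __name__ == "__main__":
    print(confident_labels(DEMO_LABELS))       # ['Animal', 'Wildlife', 'Zoo']
    print(confident_labels(DEMO_LABELS, 0.5))  # adds the two mid-confidence labels
```

With a high cutoff, a search for “zoo” would match this clip while weaker guesses such as Tourism would be ignored; lowering the cutoff trades precision for broader matches.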

However, what’s really neat about the new API is how it can detect a scene in a video. In the same clip, Cloud Video Intelligence detects 48 scene changes, and it detects and labels objects in real time as the scenes change. For instance, in one scene that displays just Nick the fox, the API will generate seven labels. In another scene focusing on the zoo’s sign, the system only generates two labels … again, all in real time.

What Google has done is create a tool that enables users to search through a video catalog just like they would with text documents. According to the company, this will be highly useful for businesses to separate signals that are buried under noise. It can also “detect features of a signal providing only relevant entities at video, shot or frame level.”
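
One way to picture searching video like text is an inverted index that maps each label to the shots where it appears. The Python sketch below is only an illustration of that idea and does not describe how Google implements the service. The shot time ranges and label lists in `SHOTS` are hypothetical sample data, and `build_index` and `find_shots` are made-up helper names.

```python
from collections import defaultdict

# Hypothetical shot-level annotations: (start_seconds, end_seconds, labels).
# These values are invented for illustration, not taken from the demo.
SHOTS = [
    (0.0, 4.5, ["zoo", "sign"]),
    (4.5, 9.0, ["fox", "animal", "wildlife"]),
    (9.0, 15.0, ["animal", "zoo"]),
]


def build_index(shots):
    """Map each lowercased label to the list of (start, end) shots containing it."""
    index = defaultdict(list)
    for start, end, labels in shots:
        for label in labels:
            index[label.lower()].append((start, end))
    return dict(index)


def find_shots(index, query):
    """Return the time ranges whose labels match the query, or an empty list."""
    return index.get(query.strip().lower(), [])


if __name__ == "__main__":
    index = build_index(SHOTS)
    print(find_shots(index, "Zoo"))  # [(0.0, 4.5), (9.0, 15.0)]
    print(find_shots(index, "dog"))  # []
```

Building the index once and looking labels up by key keeps each search fast even as a catalog grows, which is the same trade-off text search engines make.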

“Google has a long history working with the largest media companies in the world, and we help them find value from unstructured data like video,” said Fei-Fei Li, Chief Scientist of Google Cloud AI and Machine Learning. “This API is for large media organizations and consumer technology companies, who want to build their media catalogs or find easy ways to manage crowd-sourced content.”

The new API is now in a private beta and will also be offered to Google’s partners such as Cantemo, which will use the API to connect its video management software to the Google Cloud Machine Learning platform.
