Video Metadata: Describing the Details that Matter

Modern surveillance systems generate an overwhelming (and mostly unused) amount of data. This is especially true when recording video in 24/7 operations, which is essential to capturing evidence, incidents and events. It is not only hard to pick out what really matters in a scene, but also extremely time consuming. Making data more identifiable and actionable is a key problem to solve. Applying metadata to describe key details in a scene allows data to be more identifiable and actionable.

This is why metadata is the foundation for gathering intelligence from surveillance video and/or audio streams. Metadata provides a fast way to find, evaluate, and act on the singular details that matter the most through one, hundreds or thousands of video and audio footage streams. Metadata is now an essential part of effective security and business operations.

What is Metadata?
Typically, Metadata is referred to as ‘data about other data.’ In the context of video surveillance, that translates to ‘data about video data’. Video metadata accurately describes the details that matter in a scene. For instance, attributes for metadata can describe all sort of details about moving objects of interest, e.g. location, time, colors, sizes, shapes, coordinates, volume decibels, speed, direction, etc.
Additionally, foundational details can be added, such as video stream description, codec, time stamps and device identity.

The aforementioned are ‘meta’ descriptions of details in, or related to, a scene. Based on AI machine and deep learning, Meta descriptions can be more (or less) granular. This allows for classifying a group of pixels as a person, animal, vehicle or other pre-defined object classes. Being more precise with more refined descriptions of people or objects e.g. vehicle type, make model, color, speed, direction, etc.

The Value of Metadata
Metadata not only provides details about people, objects and events in a scene. It also allows large amounts of video and recorded footage to quickly group, sort, search, recover and use. As a result, the overall use cases for metadata fit into three areas.

1. Real-time alarm triggering and notifications
2. Post event forensic searching
3. Statistical analysis and reporting

Adding Intelligence to Scenes
Metadata essentially assigns digital meaning to each video frame about the objects and events within it. In other words, it adds interpretation or intelligence about the scene rather than just the raw video footage, which needs to be processed manually by an operator.

Once software can interpret scenes in this way, it can understand the scene details and enable the scene to be acted upon in real-time via events, after events (post-event), via manual search or simply analyzed for statistical analysis. This enables the use of metadata to design baselines that define what is ‘normal’ for any scene feed from any individual camera. In turn, this allows software to recognize any degree of deviation, anomaly or specific behavior or activities, etc. as well as predict what will happen in that scene to a specific probability.

Harnessing the Full Potential of Metadata
Video metadata adds immense value to a video management system. In fact, its true potential is realized when applied to multiple inputs spanning visual, audio, activity, and process-related inputs. In the management of any site, things like RFID tracking, GPS coordinates, tampering alerts, noise detection, and point of sale transactional data, are all high value data sources. Unifying this metadata generated from many different sources means gaining much more insights than one can ever get from each system alone. Interoperability is key, and open-protocols and industry standards are essential to this effort. Ultimately seamless metadata integration will allow us to harness massive amounts of data from all kids of systems and gain a greater understanding of everything around us.

This article originally appeared in the November / December 2022 issue of Security Today.

About the Author

Joe Danielson, Global Enterprise Solutions, Axis Communications, AB.

Featured

  • The Evolution of IP Camera Intelligence

    As the 30th anniversary of the IP camera approaches in 2026, it is worth reflecting on how far we have come. The first network camera, launched in 1996, delivered one frame every 17 seconds—not impressive by today’s standards, but groundbreaking at the time. It did something that no analog system could: transmit video over a standard IP network. Read Now

  • From Surveillance to Intelligence

    Years ago, it would have been significantly more expensive to run an analytic like that — requiring a custom-built solution with burdensome infrastructure demands — but modern edge devices have made it accessible to everyone. It also saves time, which is a critical factor if a missing child is involved. Video compression technology has played a critical role as well. Over the years, significant advancements have been made in video coding standards — including H.263, MPEG formats, and H.264—alongside compression optimization technologies developed by IP video manufacturers to improve efficiency without sacrificing quality. The open-source AV1 codec developed by the Alliance for Open Media—a consortium including Google, Netflix, Microsoft, Amazon and others — is already the preferred decoder for cloud-based applications, and is quickly becoming the standard for video compression of all types. Read Now

  • Cost: Reactive vs. Proactive Security

    Security breaches often happen despite the availability of tools to prevent them. To combat this problem, the industry is shifting from reactive correction to proactive protection. This article will examine why so many security leaders have realized they must “lead before the breach” – not after. Read Now

  • Achieving Clear Audio

    In today’s ever-changing world of security and risk management, effective communication via an intercom and door entry communication system is a critical communication tool to keep a facility’s staff, visitors and vendors safe. Read Now

  • Beyond Apps: Access Control for Today’s Residents

    The modern resident lives in an app-saturated world. From banking to grocery delivery, fitness tracking to ridesharing, nearly every service demands another download. But when it comes to accessing the place you live, most people do not want to clutter their phone with yet another app, especially if its only purpose is to open a door. Read Now

New Products

  • Luma x20

    Luma x20

    Snap One has announced its popular Luma x20 family of surveillance products now offers even greater security and privacy for home and business owners across the globe by giving them full control over integrators’ system access to view live and recorded video. According to Snap One Product Manager Derek Webb, the new “customer handoff” feature provides enhanced user control after initial installation, allowing the owners to have total privacy while also making it easy to reinstate integrator access when maintenance or assistance is required. This new feature is now available to all Luma x20 users globally. “The Luma x20 family of surveillance solutions provides excellent image and audio capture, and with the new customer handoff feature, it now offers absolute privacy for camera feeds and recordings,” Webb said. “With notifications and integrator access controlled through the powerful OvrC remote system management platform, it’s easy for integrators to give their clients full control of their footage and then to get temporary access from the client for any troubleshooting needs.”

  • Compact IP Video Intercom

    Viking’s X-205 Series of intercoms provide HD IP video and two-way voice communication - all wrapped up in an attractive compact chassis.

  • Camden CV-7600 High Security Card Readers

    Camden CV-7600 High Security Card Readers

    Camden Door Controls has relaunched its CV-7600 card readers in response to growing market demand for a more secure alternative to standard proximity credentials that can be easily cloned. CV-7600 readers support MIFARE DESFire EV1 & EV2 encryption technology credentials, making them virtually clone-proof and highly secure.