Understanding AI in Video Surveillance

Applying human intelligence to computer programs

Many video surveillance professionals have come across the terms Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL). But what do those terms mean, and how do they affect Video Surveillance?

AI, MACHINE LEARNING AND DEEP LEARNING

AI is a term that loosely refers to applying human intelligence to computer programs or allowing programs to learn over time with the goal of producing better results as they learn. Machine Learning is a technique used to achieve a level of AI, and Deep Learning is an evolution of Machine Learning. In short, Deep Learning is an advanced, more sophisticated Machine Learning technique, and both are methods of achieving a level of AI.

Application in video surveillance. In video surveillance, video analytics uses Machine Learning and Deep Learning methods to identify objects, classify them, and determine their properties.

Whenever people receive new information, our brains attempt to compare the data to similar items in order to make sense of it. This comparative approach is the same concept that Machine and Deep Learning algorithms employ.

Machine and Deep Learning algorithms differ in how they are programmed to determine what constitutes a known object. Machine Learning requires more human intervention from a programmer to establish desired parameters in order to achieve the desired outcome. Deep Learning identifies object attributes independently and may consider characteristics the programmers would not.

Machine learning versus deep learning. What do Machine Learning and Deep Learning mean for Video Analytics? Both approaches describe programming methods where a system learns based on a data set. With Machine Learning, the attributes of the data a system looks for are usually preset, or corrected for, by human programmers. For instance, the system may be programmed to delineate an object that is taller than it is wide, with limbs moving in specified ways, and so on, and label this object a “person.”

Deep Learning is considered superior to Machine Learning, in part because the programmers may not recognize the most relevant criteria. Using the previous algorithm to identify a person, a seated and stationary person may not trigger an accurate detection.

With Deep Learning, the video analytic algorithms are fed an extensive data set representing an object. This step is called training, where the algorithm trains itself to recognize a type of object. For example, the system is fed thousands of images of people of varying genders, styles of clothing, ethnic backgrounds, images taken at different angles, and more.

The algorithm figures out attributes that are similar as well as dissimilar, and also determines how to weigh the relevance of those characteristics. After analyzing thousands of images, the algorithm may calculate the majority of images include a triangular- shaped object near the upper part of the image, with two darkened oval spots near its bottom, which we would think of as a nose on someone’s face. In fact, the algorithm may have identified many other such characteristics we wouldn’t think of.

Training the system is done by the developers of the software before it is used by a consumer. The process takes a substantial amount of computing power; much more than what is required to detect and classify objects when used in the field. The result is a file that is referenced by the system to determine if a detected object matches the classification.

Because the Deep Learning process uses the machine to determine object characteristics, it has led to analytics which can provide much more granular classification. For instance, older approaches may be able to detect a person, but Deep Learning based analytics can detect whether the person is a man, woman, or child. It may also be able to detect associated characteristics of an individual as well as vehicle type or make.

Learning over time. Typically, AI in video surveillance is trained at design time and, in some cases, does not get progressively “smarter” when used in the field. Deep Learning and Machine Learning do have this capability, however, and if used, can employ analytics which can learn over time.

Typical applications may include systems that determine what is normal in a scene. For instance, a school hallway experiences a rush of traffic about every 45 minutes between class periods. During that high traffic time, the traffic is dispersed and not concentrated in any particular area.

Furthermore, it is unusual for all the people to be moving at a very high speed. If the system detects an unusual concentration of objects, it could indicate a fight broke out. If all the people are running in the same direction outside of the usual inter-class period, it could indicate an emergency situation.

SMARTER SYSTEMS, BETTER RESULTS

Video surveillance systems produce huge volumes of data. Monitoring and filtering through such vast quantities of information makes the task of quickly identifying security incidents and finding evidence more difficult than ever.

Intelligent systems using Deep Learning can help us identify evidence much more promptly and analyze video in real-time to alert system operators of suspected events, providing better results for your security program.

This article originally appeared in the May/June 2020 issue of Security Today.

Featured

  • 2025 Gun Violence Statistics Show Signs of Progress

    Omnilert, a national leader in AI-powered safety and emergency communications, has released its 2025 Gun Violence Statistics, along with a new interactive infographic examining national and school-related gun violence trends. In 2025, the U.S. recorded 38,762 gun-violence deaths, highlighting the continued importance of prevention, early detection, and coordinated response. Read Now

  • Big Brand Tire & Service Rolls Out Interface Virtual Perimeter Guard

    Interface Systems, a managed service provider delivering remote video monitoring, commercial security systems, business intelligence, and network services for multi-location enterprises, today announced that Big Brand Tire & Service, one of the nation’s fastest-growing independent tire and automotive service providers, has eliminated costly overnight break-ins and significantly reduced trespassing and vandalism at a high-risk location. The company achieved these results by deploying Interface Virtual Perimeter Guard, an AI-powered perimeter security solution designed to deter incidents before they occur. Read Now

  • The Evolution of ID Card Printing: Customer Challenges and Solutions

    The landscape of ID card printing is evolving to meet changing customer needs, transitioning from slow, manual processes to smart, on-demand printing solutions that address increasingly complex enrollment workflows. Read Now

  • TSA Awards Rohde & Schwarz Contract for Advanced Airport Screening Ahead of Soccer World Cup 2026

    Rohde & Schwarz, a provider of AI-based millimeter wave screening technology, announced today it has won a multi-million dollar award from TSA to supply its QPS201 AIT security scanners to passenger security screening checkpoints at selected Soccer World Cup 2026 host city airports. Read Now

  • Brivo, Eagle Eye Networks Merge

    Dean Drako, Chairman of Brivo, the leading global provider of cloud-native access control and smart space technologies, and Founder of Eagle Eye Networks, the global leader in cloud AI video surveillance, today announced the two companies will merge, creating the world’s largest AI cloud-native physical security company. The merged company will operate under the Brivo name and deliver a truly unified cloud-native security platform. Read Now

New Products

  • Mobile Safe Shield

    Mobile Safe Shield

    SafeWood Designs, Inc., a manufacturer of patented bullet resistant products, is excited to announce the launch of the Mobile Safe Shield. The Mobile Safe Shield is a moveable bullet resistant shield that provides protection in the event of an assailant and supplies cover in the event of an active shooter. With a heavy-duty steel frame, quality castor wheels, and bullet resistant core, the Mobile Safe Shield is a perfect addition to any guard station, security desks, courthouses, police stations, schools, office spaces and more. The Mobile Safe Shield is incredibly customizable. Bullet resistant materials are available in UL 752 Levels 1 through 8 and include glass, white board, tack board, veneer, and plastic laminate. Flexibility in bullet resistant materials allows for the Mobile Safe Shield to blend more with current interior décor for a seamless design aesthetic. Optional custom paint colors are also available for the steel frame.

  • A8V MIND

    A8V MIND

    Hexagon’s Geosystems presents a portable version of its Accur8vision detection system. A rugged all-in-one solution, the A8V MIND (Mobile Intrusion Detection) is designed to provide flexible protection of critical outdoor infrastructure and objects. Hexagon’s Accur8vision is a volumetric detection system that employs LiDAR technology to safeguard entire areas. Whenever it detects movement in a specified zone, it automatically differentiates a threat from a nonthreat, and immediately notifies security staff if necessary. Person detection is carried out within a radius of 80 meters from this device. Connected remotely via a portable computer device, it enables remote surveillance and does not depend on security staff patrolling the area.

  • FEP GameChanger

    FEP GameChanger

    Paige Datacom Solutions Introduces Important and Innovative Cabling Products GameChanger Cable, a proven and patented solution that significantly exceeds the reach of traditional category cable will now have a FEP/FEP construction.