Understanding AI in Video Surveillance

Applying human intelligence to computer programs

Many video surveillance professionals have come across the terms Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL). But what do those terms mean, and how do they affect Video Surveillance?

AI, MACHINE LEARNING AND DEEP LEARNING

AI is a term that loosely refers to applying human intelligence to computer programs or allowing programs to learn over time with the goal of producing better results as they learn. Machine Learning is a technique used to achieve a level of AI, and Deep Learning is an evolution of Machine Learning. In short, Deep Learning is an advanced, more sophisticated Machine Learning technique, and both are methods of achieving a level of AI.

Application in video surveillance. In video surveillance, video analytics uses Machine Learning and Deep Learning methods to identify objects, classify them, and determine their properties.

Whenever people receive new information, our brains attempt to compare the data to similar items in order to make sense of it. This comparative approach is the same concept that Machine and Deep Learning algorithms employ.

Machine and Deep Learning algorithms differ in how they are programmed to determine what constitutes a known object. Machine Learning requires more human intervention from a programmer to establish desired parameters in order to achieve the desired outcome. Deep Learning identifies object attributes independently and may consider characteristics the programmers would not.

Machine learning versus deep learning. What do Machine Learning and Deep Learning mean for Video Analytics? Both approaches describe programming methods where a system learns based on a data set. With Machine Learning, the attributes of the data a system looks for are usually preset, or corrected for, by human programmers. For instance, the system may be programmed to delineate an object that is taller than it is wide, with limbs moving in specified ways, and so on, and label this object a “person.”

Deep Learning is considered superior to Machine Learning, in part because the programmers may not recognize the most relevant criteria. Using the previous algorithm to identify a person, a seated and stationary person may not trigger an accurate detection.

With Deep Learning, the video analytic algorithms are fed an extensive data set representing an object. This step is called training, where the algorithm trains itself to recognize a type of object. For example, the system is fed thousands of images of people of varying genders, styles of clothing, ethnic backgrounds, images taken at different angles, and more.

The algorithm figures out attributes that are similar as well as dissimilar, and also determines how to weigh the relevance of those characteristics. After analyzing thousands of images, the algorithm may calculate the majority of images include a triangular- shaped object near the upper part of the image, with two darkened oval spots near its bottom, which we would think of as a nose on someone’s face. In fact, the algorithm may have identified many other such characteristics we wouldn’t think of.

Training the system is done by the developers of the software before it is used by a consumer. The process takes a substantial amount of computing power; much more than what is required to detect and classify objects when used in the field. The result is a file that is referenced by the system to determine if a detected object matches the classification.

Because the Deep Learning process uses the machine to determine object characteristics, it has led to analytics which can provide much more granular classification. For instance, older approaches may be able to detect a person, but Deep Learning based analytics can detect whether the person is a man, woman, or child. It may also be able to detect associated characteristics of an individual as well as vehicle type or make.

Learning over time. Typically, AI in video surveillance is trained at design time and, in some cases, does not get progressively “smarter” when used in the field. Deep Learning and Machine Learning do have this capability, however, and if used, can employ analytics which can learn over time.

Typical applications may include systems that determine what is normal in a scene. For instance, a school hallway experiences a rush of traffic about every 45 minutes between class periods. During that high traffic time, the traffic is dispersed and not concentrated in any particular area.

Furthermore, it is unusual for all the people to be moving at a very high speed. If the system detects an unusual concentration of objects, it could indicate a fight broke out. If all the people are running in the same direction outside of the usual inter-class period, it could indicate an emergency situation.

SMARTER SYSTEMS, BETTER RESULTS

Video surveillance systems produce huge volumes of data. Monitoring and filtering through such vast quantities of information makes the task of quickly identifying security incidents and finding evidence more difficult than ever.

Intelligent systems using Deep Learning can help us identify evidence much more promptly and analyze video in real-time to alert system operators of suspected events, providing better results for your security program.

This article originally appeared in the May/June 2020 issue of Security Today.

Featured

  • The Impact of Convergence Between IT and Physical Security

    For years, the worlds of physical security and information technology (IT) remained separate. While they shared common goals and interests, they often worked in silos. Read Now

  • Unlocking Trustworthy AI: Building Transparency in Security Governance

    In situations where AI supports important security tasks like leading investigations and detecting threats and anomalies, transparency is essential. When an incident occurs, investigators must trace the logic behind each automated response to confirm its validity or spot errors. Demanding interpretable AI turns opaque “black boxes” into accountable partners that enhance, rather than compromise, organizational defense. Read Now

  • Seeking Innovative Solutions

    Denial, Anger, Bargaining, Depression and Acceptance. You may recognize these terms as the “5 Phases” of a grieving process, but they could easily describe the phases one goes through before adopting any new or emerging innovation or technology, especially in a highly risk-averse industry like security. However, the desire for convenience in all aspects of modern life is finally beginning to turn the tide from old school hardware as the go-to towards more user-friendly, yet still secure, door solutions. Read Now

  • Where AI Meets Human Judgment

    Artificial intelligence is everywhere these days. It is driving business growth, shaping consumer experiences, and showing up in places most of us never imagined just a few years ago. Read Now

  • Report: Only 44 Percent of Organizations are Fully Equipped to Support Secure AI

    Delinea recently published new research on the impact of artificial intelligence in reshaping identity security. According to the report, “AI in Identity Security Demands a New Playbook,” only 44% of organizations say their security architecture is fully equipped to support secure AI, despite widespread confidence in their current capabilities. Read Now

New Products

  • Camden CV-7600 High Security Card Readers

    Camden CV-7600 High Security Card Readers

    Camden Door Controls has relaunched its CV-7600 card readers in response to growing market demand for a more secure alternative to standard proximity credentials that can be easily cloned. CV-7600 readers support MIFARE DESFire EV1 & EV2 encryption technology credentials, making them virtually clone-proof and highly secure.

  • Camden CM-221 Series Switches

    Camden CM-221 Series Switches

    Camden Door Controls is pleased to announce that, in response to soaring customer demand, it has expanded its range of ValueWave™ no-touch switches to include a narrow (slimline) version with manual override. This override button is designed to provide additional assurance that the request to exit switch will open a door, even if the no-touch sensor fails to operate. This new slimline switch also features a heavy gauge stainless steel faceplate, a red/green illuminated light ring, and is IP65 rated, making it ideal for indoor or outdoor use as part of an automatic door or access control system. ValueWave™ no-touch switches are designed for easy installation and trouble-free service in high traffic applications. In addition to this narrow version, the CM-221 & CM-222 Series switches are available in a range of other models with single and double gang heavy-gauge stainless steel faceplates and include illuminated light rings.

  • A8V MIND

    A8V MIND

    Hexagon’s Geosystems presents a portable version of its Accur8vision detection system. A rugged all-in-one solution, the A8V MIND (Mobile Intrusion Detection) is designed to provide flexible protection of critical outdoor infrastructure and objects. Hexagon’s Accur8vision is a volumetric detection system that employs LiDAR technology to safeguard entire areas. Whenever it detects movement in a specified zone, it automatically differentiates a threat from a nonthreat, and immediately notifies security staff if necessary. Person detection is carried out within a radius of 80 meters from this device. Connected remotely via a portable computer device, it enables remote surveillance and does not depend on security staff patrolling the area.