Computer Scientists Developing Technology To Improve Data Mining For Homeland Security

From online news articles to blogs, a massive amount of information is voluntarily being put before the public every day.

Some of this information may be valuable to protecting homeland security. However, to sift through this readily available content and summarize it for agencies like the Department of Homeland Security, analysts need to do more than sit at a computer, entering words like "al-Quaida" into Internet search engines.

That's why Kansas State University's William Hsu and other computer scientists who research data mining are part of a project to develop technology that makes automated Internet searches more useful and productive.

"We're helping to develop the next generation of Web search and crawling," Hsu said. "Our goal is to develop a research program that will help with homeland security. The Department of Homeland Security wants to pull information that's available to anyone in the public domain, like millions of articles from sources like CNN and Al-Jazeera, and monitor them for security."

Hsu is an associate professor of computer and information sciences, head of K-State's Laboratory for Knowledge Discovery in Databases, and co-principal investigator of a Department of Homeland Security-funded summer institute aimed at training future researchers in data sciences. The $2.4 million Data Sciences Summer Institute, headed by the University of Illinois along with K-State and the University of Texas San Antonio, is titled "Multimodal Information Access and Synthesis." The Illinois-led cooperative is one of four such University Affiliate Centers nationwide.

Data mining is a way of processing vast amounts of information and putting it in multiple, useful formats. Hsu's data mining research at K-State includes applications in fields like genome analysis, nanoscale materials modeling and diagnostic medicine. The work at K-State that will benefit homeland security strives to resolve ambiguity in Internet searches. For instance, this would allow a search engine to differentiate between homeland security as a concept and Homeland Security as a government agency. Hsu said that one of the institute's projects aims to improve name recognition, a heavily studied problem in information extraction.

"The goal is to develop an automated system that can pick out al-Quaida as an organization, Kandahar as a place and Osama bin Laden as a person, based upon rules developed from previously-seen documents," Hsu said. "Subcategories are a problem," he said. "'People' is a big tag. Is this a head of state? A celebrity? Someone who was interviewed?"

Data mining research at K-State and collaborating institutions is helping solve another problem with getting information off the Internet -- inefficient crawling. Hsu said search engines provide up-to-date results by first looking through vast numbers of Web pages and archiving them in a process called crawling. Hsu said the project leader, Kevin Chang at the University of Illinois, describes the problem with this process as "crawling in the dark -- you start somewhere and grab everything." Hsu said research in this area will lead to better searches whereby search engines can anticipate keywords, for instance. Search engines also could create virtual neighborhoods of information in which connections are made among bits of information based on the results of similar searches.

Although text-based searches have their complications, Hsu said searching for images is even harder because searches rely on the words people use to describe the images, such as a photo caption. Data mining research at K-State and its partner institutions is leading to technology that will allow search engines to "look" through images from the Web. Hsu said search engines would sift through images that are automatically annotated, or marked up, to describe their contents. This would be done using tools that analyze the shape, border, color and orientation of objects, among many other features, to pick out, for instance, an image of George W. Bush in a press conference photo.

"Computers will figure out an image identity by 'seeing' a feature that all such images have in common," Hsu said.

The next generation of data mining research, Hsu said, will involve computer scientists working with social scientists. By scouring news articles and other public data, researchers can work on something called sentiment analysis.

"Sometimes Homeland Security just needs to know, for instance, what the local reaction is to a particular event such as a bomb threat or recent explosion," Hsu said.

Featured

  • Video Surveillance Trends to Watch

    With more organizations adding newer capabilities to their surveillance systems, it’s always important to remember the “basics” of system configuration and deployment, as well as the topline benefits of continually emerging technologies like AI and the cloud. Read Now

  • New Report Reveals Top Trends Transforming Access Controller Technology

    Mercury Security, a provider in access control hardware and open platform solutions, has published its Trends in Access Controllers Report, based on a survey of over 450 security professionals across North America and Europe. The findings highlight the controller’s vital role in a physical access control system (PACS), where the device not only enforces access policies but also connects with readers to verify user credentials—ranging from ID badges to biometrics and mobile identities. With 72% of respondents identifying the controller as a critical or important factor in PACS design, the report underscores how the choice of controller platform has become a strategic decision for today’s security leaders. Read Now

  • Overwhelming Majority of CISOs Anticipate Surge in Cyber Attacks Over the Next Three Years

    An overwhelming 98% of chief information security officers (CISOs) expect a surge in cyber attacks over the next three years as organizations face an increasingly complex and artificial intelligence (AI)-driven digital threat landscape. This is according to new research conducted among 300 CISOs, chief information officers (CIOs), and senior IT professionals by CSC1, the leading provider of enterprise-class domain and domain name system (DNS) security. Read Now

  • ASIS International Introduces New ANSI-Approved Investigations Standard

    • Guard Services
  • Cloud Security Alliance Brings AI-Assisted Auditing to Cloud Computing

    The Cloud Security Alliance (CSA), the world’s leading organization dedicated to defining standards, certifications, and best practices to help ensure a secure cloud computing environment, today introduced an innovative addition to its suite of Security, Trust, Assurance and Risk (STAR) Registry assessments with the launch of Valid-AI-ted, an AI-powered, automated validation system. The new tool provides an automated quality check of assurance information of STAR Level 1 self-assessments using state-of-the-art LLM technology. Read Now

New Products

  • PE80 Series

    PE80 Series by SARGENT / ED4000/PED5000 Series by Corbin Russwin

    ASSA ABLOY, a global leader in access solutions, has announced the launch of two next generation exit devices from long-standing leaders in the premium exit device market: the PE80 Series by SARGENT and the PED4000/PED5000 Series by Corbin Russwin. These new exit devices boast industry-first features that are specifically designed to provide enhanced safety, security and convenience, setting new standards for exit solutions. The SARGENT PE80 and Corbin Russwin PED4000/PED5000 Series exit devices are engineered to meet the ever-evolving needs of modern buildings. Featuring the high strength, security and durability that ASSA ABLOY is known for, the new exit devices deliver several innovative, industry-first features in addition to elegant design finishes for every opening.

  • Compact IP Video Intercom

    Viking’s X-205 Series of intercoms provide HD IP video and two-way voice communication - all wrapped up in an attractive compact chassis.

  • AC Nio

    AC Nio

    Aiphone, a leading international manufacturer of intercom, access control, and emergency communication products, has introduced the AC Nio, its access control management software, an important addition to its new line of access control solutions.