On The Road to Recovery

Negative impact on data center in case of calamity

Earthquake. Fire. Hurricane. Blackout. Virus. Terror attack. Any of these natural or manmade events can cause the obliteration of business data. And if that isn’t frightening enough, Sarbanes-Oxley, HIPAA and Securities and Exchange Commission requirements will really scare you. Face it, data needs to be kept alive and accounted for to survive and thrive in the testing conditions of modern-day business.

The challenge is to provide solutions that allow a business to continue to operate in the event of any number of calamities that can have a negative impact on a data center. The problem is not knowing which one it will be or how bad it will get—but each scenario involves the data center being wiped out or, if luck prevails, just being offline for a few days.

The two biggest challenges for disaster recovery in IT are the movement of data to the recovery site and the actual process of recovery. Now that virtualization technology is mainstream, and thanks to the emergence of new IT business models, such as Infrastructure as a Service (IaaS), some compelling new ways to approach the problem are becoming available.

Getting Warmer
The ideal scenario for most organizations is a hot site. A hot site is a near replica of the entire production environment at another data center, ideally several hundred miles away and in a different environment. Unfortunately, hot sites are extremely expensive and can increase production cost by as much as 250 percent. This large cost increase generally drives most companies away from implementing a true hot site.

The majority of companies end up with a cold site—a physical location held in reserve with a promise from the supplier that there will be an adequate number of computers waiting when they arrive. A cold site is a form of insurance. Businesses share the costs with other companies and hope that not everyone has a disaster at the same time. This makes cold sites a much more affordable option.

Clearly, neither of these options—the hot site or cold site—offers sufficient protection for today’s data-dependent businesses. The best available option, however, may be a combination of the two.

Odds are that you’ve already used virtualization technology in the data center to consolidate old servers or perform testing. Using the same concepts, it’s possible to create a warm site—a hybrid between a hot site and a cold site—at a secondary location. Using replication software like Doubletake, one normally needs a one-to-one server ratio in the data center and at the recovery site.

It All Adds Up

Caught on Tape

Many companies have a 36-hour recovery time objective (RTO) because they have determined that’s how much time it will take to get the backup tapes, fly to the recovery site, pull everything off of tape, test the systems and be back online. That is, if it works.

Oftentimes the hardware isn’t exactly the same as you use in production. Tape-based recovery isn’t perfect, either. It might contain a good copy of your data, but all of the information about the network -- IP addresses, registry configurations, patch levels — is frequently not on the tape. It takes a huge amount of time to reconstruct this information, even provided you have great (and up-to-date) documentation.

With IaaS, instead of purchasing a stock of extra servers and a SAN, it’s possible to rent 60 processor cores, 2 Terabytes of storage and 64 GB of memory, and pay on a monthly or quarterly basis. Most IaaS vendors run VMware or a similar operating system that enables virtualization.

This operating system approach is the key to putting a shim between the hardware and your environment, allowing the hardware to scale, move around and be replicated. It also is what makes an IaaS provider different from a traditional service provider or hosting center

Suppose a company has approximately 100 physical servers in its environment. Forty of these are designated as Tier 1 mission critical. If these servers failed, so would the business.

In the old model, choices would be to go with either a hot, cold or warm site. Assuming the cost per server is $6,000 fully loaded, it would be approximately $240,000 worth of server equipment at the disaster recovery site to do a true hot site approach. Add to this the cost for co-location at $4,000 a month (including 5 kw of power/cooling load per cabinet) for two cabinets. This figure doesn’t include switches, routers, operating systems, replication software, human resources, bandwidth and recovery testing.

Given that the remote site is not using resources on the servers, it’s a perfect opportunity for consolidation. Conservatively, consolidate virtual servers to physical servers at a rate of 10-to-one because this is for disaster recovery, not production. That means you can use four physical servers instead of 40. It also means you are using 90 percent less of the co-location space. If the co-location center doesn’t charge in less than half-cabinet increments, this puts the cost at $1,000 per month. Now beef up the memory, adding 16 GB to each host. Keep in mind that the VMware also costs money. But even with these additional costs, savings will net more than $200,000 per year.

However, there are other costs involved, such as replication software. This can range dramatically in price, depending on features and vendors. Use an agent-based approach, and figure the cost per agent to be $2,500. Even though there are now only four servers at the recovery site, it’s still 40 virtual machines. That means there are 40 agents in production and 40 agents at the recovery site, for a total of 80 agents, amounting to $200,000 in software licenses.

Virtual Reality
SAN vendors and customers already know the secret here. They boot all of the servers from the SAN and use SAN replication software to transport data to the recovery site. However, there are two problems with this method. Depending on the SAN vendor, that replication software might be an additional replication license. It also might mean the purchase of one or more additional SANs for the recovery site. Not only does this amount to more of an expense, but the recovery picture gets complicated. With both approaches, recovering the actual data is possible, but the configuration information that’s so important is lost. Although better than tapes, it is still not a perfect system.

Here is where virtualization comes to the rescue once again. Whether using SAN-to-SAN replication or an agent-based approach, virtualize the production servers. This makes it possible to replicate at the VMware level instead of within Windows®.

Take the existing scenario of 40 servers. Say these are spread across four very robust physical servers. If a software replication product that runs on ESX is employed, you only need four licenses at the production site and a similar number at the remote site. This means even more significant cost savings, as you are replicating all of the patches, configuration data and permissions by sending fully bootable virtual machines instead of SQL data or an Exchange store.

Even though companies are virtualizing more production servers than ever, it still may not be ready to virtualize certain servers. A new class of replication agents is coming. These will enable the user to take a traditional, non-virtualized server and convert it into a virtual machine during the replication process. This way, the source production server stays the same, but users can gain all of the efficiencies of using virtual machines instead of physical servers at the recovery site

In the new model, recovery and testing work the same way. Since there are fully-replicated, bootable virtual servers at the recovery site, one simply needs to access them remotely and power them up.

In most cases, leave virtual machines in a heavily consolidated mode if they’re just being tested. During testing, users have opportunity to make sure they can view tables, access files or even mount a mail store and open a replicated e-mail inbox.

During an actual or simulated recovery, it might be necessary to spread virtual servers around on enough physical servers to provide the performance the production environment normally requires.

This is where IaaS can be a huge help. An IaaS provider already has many racks of servers and network capacity available on demand. The only thing that’s required is a small foothold in that environment that can replicate data. Expensive resources, such as processing power and memory, aren’t necessary. Since the IaaS vendor already has the capacity available, this process can generally be completed in a few hours or less. In a declaration or test, the provider will take one or two replication servers onto 20, 30 or 100 real physical servers, whatever is appropriate, to equal the resources that were there in production.

Bonus Features
Virtualization software features like VMware’s DRS will even allow users to move already booted (but slow) servers from the consolidated hardware onto the new hardware resources as they become available without any downtime.

As a bonus, push the development, staging or other tertiary applications out to run in production at the DR site on the DR equipment. That way, IaaS is can actually be considered a production investment. This makes the CFO happy.

An even bigger challenge is to recover people and business processes. Suppose a prolonged power outage in one facility forces a company to shift data to computers in another state. What about its employees? Instead of moving people into a workplace recovery center, with IaaS, all an employee needs is remote Internet access from a home office until the issue is resolved.

Welcome to the brave new world of disaster recovery. These new approaches go a long way toward making business continuity simpler, more affordable and more reliable than ever.

Featured

  • Security Industry Association Announces the 2026 Security Megatrends

    The Security Industry Association (SIA) has identified and forecasted the 2026 Security Megatrends, which form the basis of SIA’s signature annual Security Megatrends report defining the top 10 factors influencing both near- and long-term change in the global security industry. Read Now

  • The Future of Access Control: Cloud-Based Solutions for Safer Workplaces

    Access controls have revolutionized the way we protect our people, assets and operations. Gone are the days of cumbersome keychains and the security liabilities they introduced, but it’s a mistake to think that their evolution has reached its peak. Read Now

  • A Look at AI

    Large language models (LLMs) have taken the world by storm. Within months of OpenAI launching its AI chatbot, ChatGPT, it amassed more than 100 million users, making it the fastest-growing consumer application in history. Read Now

  • First, Do No Harm: Responsibly Applying Artificial Intelligence

    It was 2022 when early LLMs (Large Language Models) brought the term “AI” into mainstream public consciousness and since then, we’ve seen security corporations and integrators attempt to develop their solutions and sales pitches around the biggest tech boom of the 21st century. However, not all “artificial intelligence” is equally suitable for security applications, and it’s essential for end users to remain vigilant in understanding how their solutions are utilizing AI. Read Now

  • Improve Incident Response With Intelligent Cloud Video Surveillance

    Video surveillance is a vital part of business security, helping institutions protect against everyday threats for increased employee, customer, and student safety. However, many outdated surveillance solutions lack the ability to offer immediate insights into critical incidents. This slows down investigations and limits how effectively teams can respond to situations, creating greater risks for the organization. Read Now

New Products

  • Automatic Systems V07

    Automatic Systems V07

    Automatic Systems, an industry-leading manufacturer of pedestrian and vehicle secure entrance control access systems, is pleased to announce the release of its groundbreaking V07 software. The V07 software update is designed specifically to address cybersecurity concerns and will ensure the integrity and confidentiality of Automatic Systems applications. With the new V07 software, updates will be delivered by means of an encrypted file.

  • EasyGate SPT and SPD

    EasyGate SPT SPD

    Security solutions do not have to be ordinary, let alone unattractive. Having renewed their best-selling speed gates, Cominfo has once again demonstrated their Art of Security philosophy in practice — and confirmed their position as an industry-leading manufacturers of premium speed gates and turnstiles.

  • A8V MIND

    A8V MIND

    Hexagon’s Geosystems presents a portable version of its Accur8vision detection system. A rugged all-in-one solution, the A8V MIND (Mobile Intrusion Detection) is designed to provide flexible protection of critical outdoor infrastructure and objects. Hexagon’s Accur8vision is a volumetric detection system that employs LiDAR technology to safeguard entire areas. Whenever it detects movement in a specified zone, it automatically differentiates a threat from a nonthreat, and immediately notifies security staff if necessary. Person detection is carried out within a radius of 80 meters from this device. Connected remotely via a portable computer device, it enables remote surveillance and does not depend on security staff patrolling the area.