Glossary / H / High availability (HA)

High availability (HA)

High availability (HA) for file transfer protocol (FTP) or managed file transfer (MFT) environments means building a service that will continue to function even when a server, switch, disk or other hardware or software component fails. HA design relies on a number of tools to function, such as load balancers, shared or replicated storage, duplicated servers, backup hardware and other elements. 

IT administrators will typically pool these resources in clusters and use distributed lock managers and health checks to detect faults and move traffic, configuration and user state information between clusters. In the event of a disruption, the handoff will happen in seconds without dropping sessions or corrupting files. 

Cluster redundancy can be achieved through active or passive means:

  • Active redundancy is a health-based approach with control modules that constantly monitor system health and shift demand to the healthiest modules
  • Passive redundancy is a capacity-based approach where administrators build in enough excess hardware and software capacity to handle their most likely failure scenarios (e.g., running three servers where a typical network load only requires one)

These clusters can be grouped into active-active (meaning all components are operational at all times and actively monitored) or active-passive (meaning only certain resources are in use at any one time).

Properly configured, a HA file transfer environment keeps all FTPS, SFTP and HTTPS sessions active in the event of a failure to meet service-level targets and prevent data loss.

Understanding HA components

Redundancy

Redundancy means resource duplication. For example, two identical file transfer servers could be installed on different servers in separate racks with dual power feeds and switches. Mirrored storage could copy transfer queues and transaction journals, with dual network paths preventing link failures from halting traffic.

Failover mechanisms

Failover mechanisms describe the tools a system uses to monitor network health and shift resources when needed. A number of failover mechanisms exist, from reverse proxy servers that can detect and shift network overload to heartbeat agents that continually ping all system resources and shift usage to standby solutions when they detect a loss. 

Error handling

Error handling describes the processes that will occur in the event of a failure. For example, when a fault is detected, your file transfer system could be configured to roll over to a backup system that has a seconds-old copy of the user session and transfer logs, or it could require the user to log out and log back in again to restart their session. 

Critical infrastructure

The software and hardware that ensure your business operations can function are known as critical infrastructure. File transfer servers, along with their storage endpoints, the network infrastructure they use, directory services and all supporting hardware, would typically rise to the level of critical infrastructure in any organization that would be materially affected by a slowdown or stoppage of data transfer. This could include order management systems, financial and payment systems, reporting tools, content delivery networks and more.

Active-active vs. active-passive HA

HA clusters run in two main modes: active-active or active-passive. Each method has different risks and budget profiles. 

  • Active-active: All components are operational at all times and actively monitored.
  • Active-passive: Only certain resources are in use at any one time

Consider the following core factors when deciding which HA mode to use:

  • Cost and complexity: Active-active approaches will typically require higher license counts and advanced monitoring and routing configurations, whereas active-passive may not need these expenses but would add hardware costs for unused machines.
  • Data sync: Active-active needs real-time bidirectional replication of state and counters, but active-passive can use shared storage with lock control.
  • Failover time: Active-active shifts traffic in milliseconds through load balancing, whereas active-passive can take longer to recognize a failure and bring resources online.
  • Maintenance: Active-active environments can support rolling upgrades with no outages, but active-passive setups often require short switchovers to various clusters for updates.
  • Performance: Active-active splits live sessions across nodes for linear scalability, while active-passive traditionally keeps backup nodes idle.

Your organization should choose the mode that aligns with its uptime targets, growth plans and operational skillsets.

Key high availability features

HA in file transfer and MFT platforms relies on features and controls that absorb faults without breaking client workflows. 

Here are some essential components your organization should look for when building high-availability file transfer:

  • Automatic setting, directory and authentication replication across file transfer servers
  • For active-active configurations, monitoring tools, such as heartbeat loops with quorum and fencing stop, split-brain and trigger role changes within seconds
  • Load balancing tools, such as a reverse proxy server, can spread sessions across nodes and keep idle links alive when a path fails
  • Policy-driven alerts via SNMP, syslog, webhook or chat ops to warn administrators of an issue before users notice
  • Real-time session replication capabilities
  • Replicated storage backed by block sync or a distributed file system to keep partial uploads, audit logs and keys consistent

Using these features together can help your organization sustain aggressive uptime targets and protect compliance records without manual action.

Benefits of high availability for file transfers

HA adds business and technical value by keeping workflows operational during faults. 

Some benefits of using HA for file transfers include:

  • Continual transfer throughput with little impact on internal or external users, which improves operational reliability
  • Cost savings by avoiding outages for critical revenue-related infrastructure
  • Instant failover reduces retransmit traffic, which cuts network usage and storage I/O spikes
  • Scalable clusters handle peak loads without reconfiguring firewalls or client scripts

Examples of HA file transfer solutions

The following file transfer tools can support high-availability use cases:

MFT platforms

MFT platforms typically embed clustering tools that replicate server configurations, user data and job schedules across nodes. When a node stops, another picks up the workflow, so large batch jobs and automated triggers finish on time.

SFTP servers

SFTP servers like Cerberus FTP Server by Redwood can share session state and mirrored key stores with clustering features while taking advantage of SFTP’s more granular packet confirmation. This allows them to reconnect without retransmitting completed blocks, which saves bandwidth and avoids corrupted payloads.

Load balancers and gateways

File transfer systems with load balancers, such as a reverse proxy server, strengthen an HA file transfer design by spreading network load. They monitor node health, steer new sessions to the fastest target and hold persistent mappings so ongoing transfers stay anchored while control paths shift.

High availability FAQs

What is the difference between active-active and high availability?

Active-active is a type of high-availability approach. 

High availability is the goal of keeping the file transfer service reachable through component redundancy, rapid failover and replicated states. Targets such as 99.99 percent uptime are met by removing single points of failure across compute, network and storage and by shifting live sessions to a healthy node within seconds of a fault.

Active-active is one way to reach that goal. Two or more nodes process traffic at the same time behind load balancers, so the cluster keeps pace with demand, and any node can take over if another stops. A site may also run active-passive, where a standby node stays idle until needed, but both designs fall under the broader high-availability umbrella.

What is HA and non-HA?

High availability (HA) is an architecture that keeps file transfer services online in the event of a failure through configurations like redundant servers, shared storage and automated failover tools. When a node or link fails, another node takes over within seconds. Targets such as 99.99 percent uptime are common.

Non-HA refers to a configuration that halts operation if hardware, power or network components fail. Organizations may accept this simpler setup for test environments or workloads that can tolerate longer outages.

What is the difference between HA and a cluster?

High availability (HA) is a service objective that keeps an FTP or MFT system operational in the event of a failure. 

A cluster is a group of linked components that can be configured to deliver HA (or for other reasons). For example, an HA file transfer setup may have two identical clusters, each consisting of a dedicated server that houses the file transfer application, the storage endpoint, copies of user directories and authentication credentials and network security and configuration tools. In the event of a failure, operations would be shifted from the failed cluster to the operational cluster.  

In other words, HA describes the outcome, while the cluster is the infrastructure used to reach it.

What is the difference between high availability and continuous availability?

IT administrators design high-availability (HA) configurations to minimize downtime, typically striving for availability ratings measured in “nines” (e.g., 99.9% uptime). In HA environments, brief service pauses are tolerated as long as the system meets this required uptime target.

Continuous-availability (CA) setups strive for zero downtime, such that maintenance and faults create no visible break in sessions. CA typically demands a higher budget and tighter operational discipline than HA because every component must stay online without interruption.

Choose the best protocol for your secure file transfers

Compare FTPS to other file transfer protocols that balance speed and security for your workloads.
Blog

What is managed file transfer (MFT)?

Blog

The next step in secure, scalable file transfers

Blog

Common SFTP server issues: Solved

Blog

What are the differences between file servers?

Start a 25-day free trial

Try Cerberus Enterprise Edition for free

  • 25-Day Free Trial
  • Installs in minutes
  • No credit card required