Given architecture is an example of a client-server based system. In this, there is a client who sends requests to the server and then the client receives a response from the server accordingly but when the number of users/clients increases, the load on the server increases enormously which makes it difficult for the server to perform efficiently and hence becomes slow. Therefore, it is important to make the server scalable in a way such that the server capacity increases according to the increasing traffic without any sort of failure.
It can be defined as a process to expand the existing configuration (servers/computers) to handle a large number of user requests or to manage the amount of load on the server. This process is called scalability. This can be done either by increasing the current system configuration (increasing RAM, number of servers) or adding more power to the configuration. Scalability plays a vital role in the designing of a system as it helps in responding to a large number of user requests more effectively and quickly.
There are two ways to do this :
- Vertical Scaling
- Horizontal Scaling
It is defined as the process of increasing the capacity of a single machine by adding more resources such as memory, storage, etc. to increase the throughput of the system. No new resource is added, rather the capability of the existing resources is made more efficient. This is called Vertical scaling. Vertical Scaling is also called the Scale-up approach.
Advantages of Vertical Scaling
- It is easy to implement
- Reduced software costs as no new resources are added
- Fewer efforts required to maintain this single system
Disadvantages of Vertical Scaling
- Single-point failure
- Since when the system (server) fails, the downtime is high because we only have a single server
- High risk of hardware failures
A Real-time Example of Vertical Scaling
When traffic increases, the server degrades in performance. The first possible solution that everyone has is to increase the power of their system. For instance, if earlier they used 8 GB RAM and 128 GB hard drive now with increasing traffic, the power of the system is affected. So a possible solution is to increase the existing RAM or hard drive storage, for e.g. the resources could be increased to 16 GB of RAM and 500 GB of a hard drive but this is not an ultimate solution as after a point of time, these capacities will reach a saturation point.
It is defined as the process of adding more instances of the same type to the existing pool of resources and not increasing the capacity of existing resources like in vertical scaling. This kind of scaling also helps in decreasing the load on the server. This is called Horizontal Scaling
Horizontal Scaling is also called the Scale-out approach.
In this process, the number of servers is increased and not the individual capacity of the server. This is done with the help of a Load Balancer which basically routes the user requests to different servers according to the availability of the server. Thereby, increasing the overall performance of the system. In this way, the entire process is distributed among all servers rather than just depending on a single server.
Example: NoSQL, Cassandra, and MongoDB
Advantages of Horizontal Scaling
- Fault Tolerance means that there is no single point of failure in this kind of scale because there are 5 servers here instead of 1 powerful server. So if anyone of the servers fails then there will be other servers for backup. Whereas, in Vertical Scaling there is single point failure i.e: if a server fails then the whole service is stopped.
- Low Latency: Latency refers to how late or delayed our request is being processed.
- Built-in backup
Disadvantages of Horizontal Scaling
- Not easy to implement as there are a number of components in this kind of scale
- Cost is high
- Networking components like, router, load balancer are required
A Real-time Example of Horizontal Scaling
For example, if there exists a system of the capacity of 8 GB of RAM and in future, there is a requirement of 16 GB of RAM then, rather than the increasing capacity of 8 GB RAM to 16 GB of RAM, similar instances of 8 GB RAM could be used to meet the requirements.
Attention reader! Don’t stop learning now. Get hold of all the important CS Theory concepts for SDE interviews with the CS Theory Course at a student-friendly price and become industry ready.
- Python | How and where to apply Feature Scaling?
- ML | Feature Scaling - Part 1
- ML | Feature Scaling – Part 2
- OpenCV - Overview
- Overview of Style Transfer (Deep Harmonization)
- A Quick Overview to Computer Vision
- Overview to the Amazon Virtual Private Cloud
- Overview of Personality Prediction Project using ML
- MongoDB AND operator ( $and )
- Command Injection Vulnerability and Mitigation
- Basic SQL Injection and Mitigation with Example
- Denial of Service and Prevention
- Format String Vulnerability and Prevention with Example
- Code Injection and Mitigation with Example
- Path Traversal Attack and Prevention
- Passwords and Cryptographic hash function
- Supervised and Unsupervised learning
- Underfitting and Overfitting in Machine Learning
- Difference between Machine learning and Artificial Intelligence
- Understanding Data Attribute Types | Qualitative and Quantitative
If you like GeeksforGeeks and would like to contribute, you can also write an article using contribute.geeksforgeeks.org or mail your article to email@example.com. See your article appearing on the GeeksforGeeks main page and help other Geeks.
Please Improve this article if you find anything incorrect by clicking on the "Improve Article" button below.