Web infrastructures

https://github.com/heig-vd-dai-course

Web · PDF

L. Delafontaine and H. Louis, with the help of GitHub Copilot.

Based on the original course by O. Liechti and J. Ehrensberger.

This work is licensed under the CC BY-SA 4.0 license.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Objectives

  • Understand the concepts of web infrastructures
  • Understand how HTTP features can help to build web infrastructures
  • Understand the concepts of a reverse proxy
  • Understand the concepts of load balancing
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Prepare and setup your environment

More details for this section in the course material. You can find other resources and alternatives as well.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Prepare and setup your environment

Take 20 minutes to set up your environment using the course material:

  • Clone the code examples repository on the virtual machine
  • Validate you have access to the virtual machine from the Internet on ports 80 and 443
  • Obtain a domain name
  • Add the required DNS records to the DNS zone

This will allow to run the examples along the theory in this chapter.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Functional and non-functional requirements

More details for this section in the course material. You can find other resources and alternatives as well.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Functional and non-functional requirements

  • Requirements used to define the scope of a system.
  • An abstract representation of the system that will be implemented.
  • Can help to define the architecture of the system.
  • Functional requirements: features that a system must have to satisfy the needs of its users. It is the "what" of a system.
  • Non-functional requirements: constraints on the system. It is the "how" of a system.
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Examples of functional requirements:

  • User management: Users must be able to register, login, logout, etc.
  • Product management: Users must be able to create, read, update, delete products, etc.
  • Order management: Users must be able to create, read, update, delete orders, etc.
  • Payment management: Users must be able to pay for their orders, etc.
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Examples of non-functional requirements:

  • Response time: Time between a request and a response (end user)
  • Throughput: Number of requests/interval (service provider)
  • Scalability: Property of a system to handle a varying amount of work
  • Availability: Percentage of time that the system provides a satisfactory service
  • Maintainability: How easily the system can be managed
  • Security: Confidentiality, authentication, authorization, etc.
  • ...and many, many more
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Web infrastructure definition

More details for this section in the course material. You can find other resources and alternatives as well.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Web infrastructure definition

Software and hardware components that are necessary to support:

  • the development
  • the deployment
  • the management

of web applications.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

The Host header

More details for this section in the course material. You can find other resources and alternatives as well.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

The Host header

  • Part of the HTTP request
  • Used to specify the domain name of the server
  • Can be used to host multiple websites on the same server using a reverse proxy
  • The reverse proxy will route the request to the correct website based on the Host header
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Forward proxy and reverse proxy

More details for this section in the course material. You can find other resources and alternatives as well.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Forward proxy and reverse proxy

  • Proxies are components that intercept requests and responses and filter/forward/change them to another component.
  • Forward proxy: used by a client to access external servers
  • Reverse proxy: used by an external server to access internal servers
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Forward proxy

  • Operates between clients and external systems
  • Can be used to:
    • Restrict access to external systems
    • Regulate traffic
    • Mask the identity of the client
    • Enforce security policies
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Reverse proxy

  • Operates between external systems and internal systems
  • Can be used to:
    • Restrict access
    • Load balance requests to internal systems
    • Cache responses from internal systems
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Traefik as a reverse proxy

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Experiment with Traefik

Take 20 minutes to experiment with Traefik using the course material and the code examples:

  • Run the "whoami with host-based routing" example
  • Run the "whoami with host-based and path-based routing" example
  • Run the "whoami with host-based, path-based routing and StripPrefix middleware" example

You should have a better understanding of how Traefik works.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

System scalability

More details for this section in the course material. You can find other resources and alternatives as well.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

System scalability

  • Capacity of a system to handle a varying amount of work
  • Can be achieved by:
    • Vertical scaling (adding more hardware resources to a server, also known as scale up)
    • Horizontal scaling (adding more servers to share the load, also known as scale out)
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Vertical scaling

Adding more resources to a server:

  • More/better RAM
  • More/better CPU
  • More/better storage
  • etc.

Limited by the hardware: at a certain point, you cannot add more/better resources to a server.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Horizontal scaling

Add more servers to a system and distribute the load between them.

Limited by the software - your software must be able to run on multiple servers:

  • Backends/APIs accessing the same database(s)
  • Frontends accessing backends/API
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

When to use scale up or scale out?

  • Determined by the non-functional requirements of the system
  • You need metrics to determine when to scale up or scale out to identify bottlenecks
  • Once the bottleneck is identified (from monitoring), you can decide to scale up or scale out
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

How to monitor a system?

Out of scope for this course, but here are some tools you can use:

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Load balancing

More details for this section in the course material. You can find other resources and alternatives as well.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Load balancing

Process of distributing the load between multiple servers.

This can work thanks to the stateless nature of HTTP and the Host header.

The load balancer must know the pool of servers it can forward the requests to.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Multiple strategies can be used to distribute the load:

  • Round-robin: each server in the pool in turn (covered in this course)
  • Least connections: least number of active connections
  • Least response time: least response time
  • Hashing: based on a hash of the request (e.g. the IP address of the client, the URL of the request, etc.)
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

An issue with load balancing is session management: the load balancer could forward requests from the same client to different servers, loosing their session.

As HTTP is stateless, the load balancer must know how to forward requests from the same client to the same server. A solution is sticky sessions with the help of a cookie.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Take 20 minutes to experiment with Traefik using the course material and the code examples:

  • Run the "whoami with host-based routing and sticky sessions" example

You should have a better understanding of how sticky sessions work.

For those who have time, you can check the "Go further" section in the course material.

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Questions

Do you have any questions?

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Finished? Was it easy? Was it hard?

Can you let us know what was easy and what was difficult for you during this chapter?

This will help us to improve the course and adapt the content to your needs. If we notice some difficulties, we will come back to you to help you.

➡️ GitHub Discussions

You can use reactions to express your opinion on a comment!

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

What will you do next?

You will start the practical work!

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0

Sources

HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0
HEIG-VD - DAI Course 2024-2025 - CC BY-SA 4.0