Realtime Architecture
Realtime is a globally distributed Elixir cluster. Clients can connect to any node in the cluster via WebSockets and send messages to any other client connected to the cluster.
Realtime is written in Elixir, which compiles to Erlang, and utilizes many tools the Phoenix Framework provides out of the box.
Elixir & Phoenix
Phoenix is fast and able to handle millions of concurrent connections.
Phoenix can handle many concurrent connections because Elixir provides lightweight processes (not OS processes) to work with.
Client-facing WebSocket servers need to handle many concurrent connections. Elixir & Phoenix let the Supabase Realtime cluster do this easily.
Channels
Channels are implemented using Phoenix Channels which uses Phoenix.PubSub with the default Phoenix.PubSub.PG2 adapter.
The PG2 adapter utilizes Erlang process groups to implement the PubSub model where a publisher can send messages to many subscribers.
Global cluster
Presence is an in-memory key-value store backed by a CRDT. When a user is connected to the cluster the state of that user is sent to all connected Realtime nodes.
Broadcast lets you send a message from any connected client to a Channel. Any other client connected to that same Channel will receive that message.
This works globally. A client connected to a Realtime node in the United States can send a message to another client connected to a node in Singapore. Connect two clients to the same Realtime Channel and they'll all receive the same messages.
Broadcast is useful for getting messages to users in the same location very quickly. If a group of clients are connected to a node in Singapore, the message only needs to go to that Realtime node in Singapore and back down. If users are close to a Realtime node they'll get Broadcast messages in the time it takes to ping the cluster.
Thanks to the Realtime cluster, you (an amazing Supabase user) don't have to think about which regions your clients are connected to.
If you're using Broadcast, Presence, or streaming database changes, messages will always get to your users via the shortest path possible.
Connecting to a database
Realtime allows you to listen to changes from your Postgres database. When a new client connects to Realtime and initializes the postgres_changes Realtime Extension the cluster will connect to your Postgres database and start streaming changes from a replication slot.
Realtime knows the region your database is in, and connects to it from the closest region possible.
Every Realtime region has at least two nodes so if one node goes offline the other node should reconnect and start streaming changes again.
Broadcast from Postgres
Realtime Broadcast sends messages when changes happen in your database. Behind the scenes, Realtime creates a publication on the realtime.messages table. It then reads the Write-Ahead Log (WAL) file for this table, and sends a message whenever an insert happens. Messages are sent as JSON packages over WebSockets.
The realtime.messages table is partitioned by day. This allows old messages to be deleted performantly, by dropping old partitions. Partitions are retained for 3 days before being deleted.
Broadcast uses Realtime Authorization by default to protect your data.
Streaming the Write-Ahead Log
A Postgres logical replication slot is acquired when connecting to your database.
Realtime delivers changes by polling the replication slot and appending channel subscription IDs to each wal record.
Subscription IDs are Erlang processes representing underlying sockets on the cluster. These IDs are globally unique and messages to processes are routed automatically by the Erlang virtual machine.
After receiving results from the polling query, with subscription IDs appended, Realtime delivers records to those clients.