Disallow starting server with insufficient wal_level for existing slot.
author    Andres Freund <[email protected]>
          Wed, 31 Oct 2018 21:47:41 +0000 (14:47 -0700)
committer Andres Freund <[email protected]>
          Wed, 31 Oct 2018 22:46:40 +0000 (15:46 -0700)
Previously it was possible to create a slot, change wal_level, and
restart, even if the new wal_level was insufficient for the
slot. That's a problem for both logical and physical slots, because
the necessary WAL records are not generated.
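
As a rough illustration of the rule this patch enforces at startup, here is a
minimal, self-contained sketch. It is not the server code: the type and
function names (WalLevel, SlotOnDisk, restore_slot) are hypothetical
stand-ins, a database value of 0 plays the role of InvalidOid, and the FATAL
report is modelled as a return value.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Simplified stand-in for the wal_level setting, ordered from low to high. */
typedef enum { LEVEL_MINIMAL, LEVEL_ARCHIVE, LEVEL_HOT_STANDBY, LEVEL_LOGICAL } WalLevel;

/* Simplified stand-in for a slot's on-disk state (not the real struct). */
typedef struct
{
    const char  *name;
    unsigned int database;      /* 0 means "no database", i.e. a physical slot */
    bool         persistent;    /* false for a leftover ephemeral slot */
} SlotOnDisk;

typedef enum { RESTORE_OK, RESTORE_DROPPED, RESTORE_REFUSED } RestoreResult;

/* Logical slots need wal_level >= logical, physical slots >= archive. */
WalLevel
required_wal_level(const SlotOnDisk *slot)
{
    assert(slot != NULL);
    return slot->database != 0 ? LEVEL_LOGICAL : LEVEL_ARCHIVE;
}

RestoreResult
restore_slot(const SlotOnDisk *slot, WalLevel wal_level)
{
    /*
     * Ephemeral slots are discarded before the wal_level check, so that a
     * slot which should no longer exist cannot prevent a restart.
     */
    if (!slot->persistent)
        return RESTORE_DROPPED;

    /* The patch raises FATAL at this point; the sketch returns a code. */
    if (wal_level < required_wal_level(slot))
        return RESTORE_REFUSED;

    return RESTORE_OK;
}
```

With this model, a persistent physical slot is refused at LEVEL_MINIMAL and
accepted at LEVEL_ARCHIVE, while a persistent logical slot is refused at any
level below LEVEL_LOGICAL.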

This removes a few tests in newer versions that, somewhat
inexplicably, tested whether restarting with a too-low wal_level
worked (a buggy behaviour!).

Reported-By: Joshua D. Drake
Author: Andres Freund
Discussion: https://postgr.es/m/20181029191304[email protected]
Backpatch: 9.4-, where replication slots were introduced

src/backend/replication/logical/logical.c
src/backend/replication/slot.c

index 3745edb445d4598266d5ae9d48dcba5f6ef5d009..29b8cbaaf9cbd41af94f36a1704600e9a8a126ab 100644 (file)
@@ -72,6 +72,11 @@ CheckLogicalDecodingRequirements(void)
 {
    CheckSlotRequirements();
 
+   /*
+    * NB: Adding a new requirement likely means that RestoreSlotFromDisk()
+    * needs the same check.
+    */
+
    if (wal_level < WAL_LEVEL_LOGICAL)
        ereport(ERROR,
                (errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
index a538ae17541597b9c88b2fbc6833098568200680..909d9f3df4c2a3d7d3cdcf590b38faf92d467e5b 100644 (file)
@@ -795,6 +795,11 @@ ReplicationSlotsCountDBSlots(Oid dboid, int *nslots, int *nactive)
 void
 CheckSlotRequirements(void)
 {
+   /*
+    * NB: Adding a new requirement likely means that RestoreSlotFromDisk()
+    * needs the same check.
+    */
+
    if (max_replication_slots == 0)
        ereport(ERROR,
                (errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
@@ -1236,6 +1241,31 @@ RestoreSlotFromDisk(const char *name)
        return;
    }
 
+   /*
+    * Verify that requirements for the specific slot type are met. That's
+    * important because if these aren't met we're not guaranteed to retain
+    * all the necessary resources for the slot.
+    *
+    * NB: We have to do so *after* the above checks for ephemeral slots,
+    * because otherwise a slot that shouldn't exist anymore could prevent
+    * restarts.
+    *
+    * NB: Changing the requirements here also requires adapting
+    * CheckSlotRequirements() and CheckLogicalDecodingRequirements().
+    */
+   if (cp.slotdata.database != InvalidOid && wal_level < WAL_LEVEL_LOGICAL)
+       ereport(FATAL,
+               (errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
+                errmsg("logical replication slot \"%s\" exists, but wal_level < logical",
+                       NameStr(cp.slotdata.name)),
+                errhint("Change wal_level to be logical or higher.")));
+   else if (wal_level < WAL_LEVEL_ARCHIVE)
+       ereport(FATAL,
+               (errcode(ERRCODE_OBJECT_NOT_IN_PREREQUISITE_STATE),
+                errmsg("physical replication slot \"%s\" exists, but wal_level < archive",
+                       NameStr(cp.slotdata.name)),
+                errhint("Change wal_level to be archive or higher.")));
+
    /* nothing can be active yet, don't lock anything */
    for (i = 0; i < max_replication_slots; i++)
    {