流复制失败,“ WAL段已被移动”

问题描述 投票:0回答:1

我正在尝试在Postgres 11.5上实现主/从流复制。我执行了以下步骤-

在大师上

select pg_start_backup('replication-setup',true);

从属停止了postgres 11数据库并运行

rsync -aHAXxv --numeric-ids --progress -e "ssh -T -o Compression=no -x" --exclude pg_wal --exclude postgresql.pid --exclude pg_log MASTER:/var/lib/postgresql/11/main/* /var/lib/postgresql/11/main

在大师上

select pg_stop_backup();

从属

rsync -aHAXxv --numeric-ids --progress -e "ssh -T -o Compression=no -x"  MASTER:/var/lib/postgresql/11/main/pg_wal/* /var/lib/postgresql/11/main/pg_wal

我在从属〜/ 11 /主文件夹中创建了recovery.conf文件

standby_mode = 'on'
primary_conninfo = 'user=postgres host=MASTER port=5432 sslmode=prefer sslcompression=1 krbsrvname=postgres'
primary_slot_name='my_repl_slot'

[当我在Slave上启动Postgres时,在MASTER和SLAVE日志上都收到错误-

019-11-08 09:03:51.205 CST [27633] LOG:  00000: database system was interrupted; last known up at 2019-11-08 02:53:04 CST
2019-11-08 09:03:51.205 CST [27633] LOCATION:  StartupXLOG, xlog.c:6388
2019-11-08 09:03:51.252 CST [27633] LOG:  00000: entering standby mode
2019-11-08 09:03:51.252 CST [27633] LOCATION:  StartupXLOG, xlog.c:6443
2019-11-08 09:03:51.384 CST [27634] LOG:  00000: started streaming WAL from primary at 12DB/C000000 on timeline 1
2019-11-08 09:03:51.384 CST [27634] LOCATION:  WalReceiverMain, walreceiver.c:383
2019-11-08 09:03:51.384 CST [27634] FATAL:  XX000: could not receive data from WAL stream: ERROR:  requested WAL segment 00000001000012DB0000000C has already been removed
2019-11-08 09:03:51.384 CST [27634] LOCATION:  libpqrcv_receive, libpqwalreceiver.c:772
2019-11-08 09:03:51.408 CST [27635] LOG:  00000: started streaming WAL from primary at 12DB/C000000 on timeline 1
2019-11-08 09:03:51.408 CST [27635] LOCATION:  WalReceiverMain, walreceiver.c:383

问题是START WAL-在执行00000001000012DB0000000C之前,pg_stop_backup()一直可用,直到我运行pg_stop_backup()并被存档且不再可用。因此,这不是由于WAL_KEEP_SEGMENTS低而将WAL存档的问题。

postgres@SLAVE:~/11/main/pg_wal$ cat 00000001000012DB0000000C.00000718.backup
START WAL LOCATION: 12DB/C000718 (file 00000001000012DB0000000C)
STOP WAL LOCATION: 12DB/F4C30720 (file 00000001000012DB000000F4)
CHECKPOINT LOCATION: 12DB/C000750
BACKUP METHOD: pg_start_backup
BACKUP FROM: master
START TIME: 2019-11-07 15:47:26 CST
LABEL: replication-setup-mdurbha
START TIMELINE: 1
STOP TIME: 2019-11-08 08:48:35 CST
STOP TIMELINE: 1

我的主设置为archive_command,并且我有可用的丢失的WAL。我将它们复制到SLAVE上的还原目录中,并尝试了下面的recovery.conf,但是由于MASTER报告相同的WAL segment has already been moved错误,它仍然失败。知道如何解决这个问题吗?过去我在Postgres 9.6上使用rsync设置复制时没有任何问题,但在Postgres 11上一直遇到此问题。

standby_mode = 'on'
primary_conninfo = 'user=postgres host=MASTER port=5432 sslmode=prefer sslcompression=1 krbsrvname=postgres'
restore_command='cp /var/lib/postgresql/restore/%f %p'
postgresql replication
1个回答
0
投票

restore_command放入可以还原已存档的WAL文件的recovery.conf中,就可以了。

© www.soinside.com 2019 - 2024. All rights reserved.