PostgreSQL 10.0 preview 功能增强 - 逻辑复制支持并行COPY初始化数据

4 minute read

背景

PostgreSQL 已支持逻辑复制，同时对逻辑复制增加了一个初始同步的增强功能，支持通过wal receiver协议跑COPY命令（已封装在逻辑复制的内核代码中），支持多表并行。

也就是说，你可以使用PostgreSQL的逻辑复制，快速的（流式、并行）将一个实例迁移到另一个实例。

Logical replication support for initial data copy  
  
Add functionality for a new subscription to copy the initial data in the  
tables and then sync with the ongoing apply process.  
  
For the copying, add a new internal COPY option to have the COPY source  
data provided by a callback function.  The initial data copy works on  
the subscriber by receiving COPY data from the publisher and then  
providing it locally into a COPY that writes to the destination table.  
  
A WAL receiver can now execute full SQL commands.  This is used here to  
obtain information about tables and publications.  
  
Several new options were added to CREATE and ALTER SUBSCRIPTION to  
control whether and when initial table syncing happens.  
  
Change pg_dump option --no-create-subscription-slots to  
--no-subscription-connect and use the new CREATE SUBSCRIPTION  
... NOCONNECT option for that.  
  
Author: Petr Jelinek <petr.jelinek@2ndquadrant.com>  
Tested-by: Erik Rijkers <er@xs4all.nl>  

逻辑复制包含的初始化COPY的流程如下

主库开启事务快照(快照支持在多个会话间共享, 这也是PostgreSQL的独门秘籍之一), COPY数据, COPY结束后释放快照, 从快照对应的WAL LSN开始接收增量.

/*-------------------------------------------------------------------------  
* tablesync.c  
*    PostgreSQL logical replication  
*  
* Copyright (c) 2012-2016, PostgreSQL Global Development Group  
*  
* IDENTIFICATION  
*    src/backend/replication/logical/tablesync.c  
*  
* NOTES  
*    This file contains code for initial table data synchronization for  
*    logical replication.  
*  
*    The initial data synchronization is done separately for each table,  
*    in separate apply worker that only fetches the initial snapshot data  
*    from the publisher and then synchronizes the position in stream with  
*    the main apply worker.  
*  
*    The are several reasons for doing the synchronization this way:  
*     - It allows us to parallelize the initial data synchronization  
*       which lowers the time needed for it to happen.  
*     - The initial synchronization does not have to hold the xid and LSN  
*       for the time it takes to copy data of all tables, causing less  
*       bloat and lower disk consumption compared to doing the  
*       synchronization in single process for whole database.  
*     - It allows us to synchronize the tables added after the initial  
*       synchronization has finished.  
*  
*    The stream position synchronization works in multiple steps.  
*     - Sync finishes copy and sets table state as SYNCWAIT and waits  
*       for state to change in a loop.  
*     - Apply periodically checks tables that are synchronizing for SYNCWAIT.  
*       When the desired state appears it will compare its position in the  
*       stream with the SYNCWAIT position and based on that changes the  
*       state to based on following rules:  
*        - if the apply is in front of the sync in the wal stream the new  
*          state is set to CATCHUP and apply loops until the sync process  
*          catches up to the same LSN as apply  
*        - if the sync is in front of the apply in the wal stream the new  
*          state is set to SYNCDONE  
*        - if both apply and sync are at the same position in the wal stream  
*          the state of the table is set to READY  
*     - If the state was set to CATCHUP sync will read the stream and  
*       apply changes until it catches up to the specified stream  
*       position and then sets state to READY and signals apply that it  
*       can stop waiting and exits, if the state was set to something  
*       else than CATCHUP the sync process will simply end.  
*     - If the state was set to SYNCDONE by apply, the apply will  
*       continue tracking the table until it reaches the SYNCDONE stream  
*       position at which point it sets state to READY and stops tracking.  
*  
*    The catalog pg_subscription_rel is used to keep information about  
*    subscribed tables and their state and some transient state during  
*    data synchronization is kept in shared memory.  
*  
*    Example flows look like this:  
*     - Apply is in front:  
*        sync:8  
*          -> set SYNCWAIT  
*        apply:10  
*          -> set CATCHUP  
*          -> enter wait-loop  
*        sync:10  
*          -> set READY  
*          -> exit  
*        apply:10  
*          -> exit wait-loop  
*          -> continue rep  
*     - Sync in front:  
*        sync:10  
*          -> set SYNCWAIT  
*        apply:8  
*          -> set SYNCDONE  
*          -> continue per-table filtering  
*        sync:10  
*          -> exit  
*        apply:10  
*          -> set READY  
*          -> stop per-table filtering  
*          -> continue rep  
*-------------------------------------------------------------------------  
*/  
 

这个patch的讨论，详见邮件组，本文末尾URL。

PostgreSQL社区的作风非常严谨，一个patch可能在邮件组中讨论几个月甚至几年，根据大家的意见反复的修正，patch合并到master已经非常成熟，所以PostgreSQL的稳定性也是远近闻名的。

参考

https://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=7c4f52409a8c7d85ed169bbbc1f6092274d03920

digoal’s 大量PostgreSQL文章入口

Twitter Facebook Google+ LinkedIn

Digoal.zhou

PostgreSQL 10.0 preview 功能增强 - 逻辑复制支持并行COPY初始化数据

背景

参考

digoal’s 大量PostgreSQL文章入口

You May Also Enjoy

PostgreSQL(PPAS 兼容Oracle) 从零开始入门手册 - 珍藏版

PostgreSQL pipelinedb 流计算插件 - IoT应用 - 实时轨迹聚合

PostgreSQL plpgsql 存储过程、函数 - 状态、异常变量打印、异常捕获… - GET [STACKED] DIAGNOSTICS

PostgreSQL datediff 日期间隔（单位转换）兼容SQL用法