HOME/Articles/

[筆記] 用zfs的snapshot 快照功能來做 postgresql 的備份還原 / Postgresql Backup Restore Using Zfs Snapshot

Article Outline

前面測試了用pgbarman / pgbackrest 來備份 postgresql

這次改從system file level 來下手

採用zfs 的快照來備份、還原postgresql 資料庫

<!--more-->

建立測試資料庫、TABLE、snapshot

資料庫現況

只有系統預設的DB,沒有其他多的東西

postgres@hqdc034:~$ psql -c '\l'
                                  List of databases
   Name    |  Owner   | Encoding |   Collate   |    Ctype    |   Access privileges   
-----------+----------+----------+-------------+-------------+-----------------------
 postgres  | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 | 
 template0 | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 | =c/postgres          +
           |          |          |             |             | postgres=CTc/postgres
 template1 | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 | =c/postgres          +
           |          |          |             |             | postgres=CTc/postgres
(3 rows)

postgres@hqdc034:~$ du -sh /zp/database/10/main/
232M    /zp/database/10/main/

建立第一次的快照

2019-09-06 09:03:46 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
no datasets available
2019-09-06 09:03:53 [changch@hqdc034 ~]$ sudo zfs snapshot zp/database@init_db_no_demo
2019-09-06 09:04:09 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                          USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo      0      -   231M  -
2019-09-06 09:04:15 [changch@hqdc034 ~]$

建立、倒回測試資料庫 demo

postgres@hqdc034:~$ createdb demo
postgres@hqdc034:~$ psql demo < /home/changch/Downloads/demo.sql 
SET
SET
略...

再檢查一次資料庫的狀況,看到 demo DB出現了,資料庫目錄也變大了

postgres@hqdc034:~$ psql -c '\l'
                                  List of databases
   Name    |  Owner   | Encoding |   Collate   |    Ctype    |   Access privileges   
-----------+----------+----------+-------------+-------------+-----------------------
 demo      | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 | 
 postgres  | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 | 
 template0 | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 | =c/postgres          +
           |          |          |             |             | postgres=CTc/postgres
 template1 | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 | =c/postgres          +
           |          |          |             |             | postgres=CTc/postgres
(4 rows)

postgres@hqdc034:~$ du -sh /zp/database/10/main/
2.1G    /zp/database/10/main/
postgres@hqdc034:~$ 

建立第二次快照

這次的快照,將包含剛剛倒回的 demo DB,但是不包含等下才要建立的測試 table

2019-09-06 09:16:01 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                               USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo        250K      -   231M  -
zp/database@demo_db_just_restore      0      -  2.08G  -
2019-09-06 09:16:04 [changch@hqdc034 ~]$

建立測試 table

postgres@hqdc034:~$ psql -c 'create table test ( a int, b varchar(50) );'
CREATE TABLE
postgres@hqdc034:~$

建立第三次快照

這次快照,只有建立 test table ,但是裡面沒有資料

2019-09-06 09:18:34 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo                         250K      -   231M  -
zp/database@demo_db_just_restore                    138K      -  2.08G  -
zp/database@demo_db_create_test_table_but_no_data      0      -  2.08G  -
2019-09-06 09:18:36 [changch@hqdc034 ~]$

在test table 插入100萬筆資料

postgres@hqdc034:~$ psql -c 'with aa as ( select * from generate_series (1,1000000) a ) insert into test select aa.a, md5(aa.a::varchar) from aa;'
INSERT 0 1000000
postgres@hqdc034:~$ psql -c 'select count(*) from test;'
  count  
---------
 1000000
(1 row)

postgres@hqdc034:~$

建立第四次快照

test table 內有 1000000 筆資料

2019-09-06 09:18:36 [changch@hqdc034 ~]$ sudo zfs snapshot zp/database@demo_db_test_table_with_1M_rows
2019-09-06 09:21:08 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo                         250K      -   231M  -
zp/database@demo_db_just_restore                    138K      -  2.08G  -
zp/database@demo_db_create_test_table_but_no_data   116K      -  2.08G  -
zp/database@demo_db_test_table_with_1M_rows            0      -  2.15G  -
2019-09-06 09:21:09 [changch@hqdc034 ~]$

再次插入 100萬筆資料

postgres@hqdc034:~$ time psql -c 'with aa as ( select * from generate_series (1,1000000) a ) insert into test select aa.a, md5(aa.a::varchar) from aa;'
INSERT 0 1000000

real    0m4.276s
user    0m0.020s
sys    0m0.012s
postgres@hqdc034:~$ psql -c 'select count(*) from test;'
  count  
---------
 2000000
(1 row)

postgres@hqdc034:~$

建立第五次快照

現在 test table 有 200萬筆資料了

2019-09-06 09:21:09 [changch@hqdc034 ~]$ sudo zfs snapshot zp/database@demo_db_test_table_with_2M_rows
2019-09-06 09:22:29 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo                         250K      -   231M  -
zp/database@demo_db_just_restore                    138K      -  2.08G  -
zp/database@demo_db_create_test_table_but_no_data   116K      -  2.08G  -
zp/database@demo_db_test_table_with_1M_rows         218K      -  2.15G  -
zp/database@demo_db_test_table_with_2M_rows            0      -  2.23G  -
2019-09-06 09:22:30 [changch@hqdc034 ~]$

玩大點,直接湊滿1000萬筆資料好了

postgres@hqdc034:~$ time psql -c 'with aa as ( select * from generate_series (1,8000000) a ) insert into test select aa.a, md5(aa.a::varchar) from aa;'
INSERT 0 8000000

real    0m32.172s
user    0m0.024s
sys    0m0.008s
postgres@hqdc034:~$ psql -c 'select count(*) from test;'
  count   
----------
 10000000
(1 row)
postgres@hqdc034:~$

建立第六次快照

10M rows in test table

2019-09-06 09:22:30 [changch@hqdc034 ~]$ sudo zfs snapshot zp/database@demo_db_test_table_with_10M_rows
2019-09-06 09:25:18 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo                         250K      -   231M  -
zp/database@demo_db_just_restore                    138K      -  2.08G  -
zp/database@demo_db_create_test_table_but_no_data   116K      -  2.08G  -
zp/database@demo_db_test_table_with_1M_rows         218K      -  2.15G  -
zp/database@demo_db_test_table_with_2M_rows         530K      -  2.23G  -
zp/database@demo_db_test_table_with_10M_rows        163K      -  2.97G  -
2019-09-06 09:25:21 [changch@hqdc034 ~]$

到1000萬筆資料為止,現在資料庫大小是這樣

postgres@hqdc034:~$ du -sh /zp/database/10/main/
3.0G    /zp/database/10/main/
postgres@hqdc034:~$

還原測試

最後一次做快照的時候,demo DB 裡面有一千萬筆資料,現在來砍掉500萬筆

postgres@hqdc034:~$ time psql -c 'delete from test where a > 5000000;'
DELETE 3000000

real    0m7.844s
user    0m0.024s
sys    0m0.004s

postgres@hqdc034:~$ time psql -c 'select count(*) from test;'
  count  
---------
 7000000
(1 row)


real    0m0.268s
user    0m0.024s
sys    0m0.004s
postgres@hqdc034:~$

咦,怪怪的,為什麼只有砍掉300萬筆? 好,這邊先不管,等等正好來驗證restore的狀況

假設剛剛這個刪除是錯誤的動作,我要回到1000萬資料的狀態,就可以用zfs rollback 來達成

第一次還原

目標是還原到包含1000萬筆資料的狀態(現在是700萬筆)

2019-09-06 09:25:21 [changch@hqdc034 ~]$ sudo service postgresql stop
 * Stopping PostgreSQL 10 database server      [ OK ] 
2019-09-06 10:14:12 [changch@hqdc034 ~]$ sudo zfs rollback -r zp/database@demo_db_test_table_with_10M_rows
2019-09-06 10:14:28 [changch@hqdc034 ~]$ sudo service postgresql start
 * Starting PostgreSQL 10 database server      [ OK ]
2019-09-06 10:14:57 [changch@hqdc034 ~]$ 

檢查一下

postgres@hqdc034:~$ time psql -c 'select count(*) from test;'
  count   
----------
 10000000
(1 row)


real    0m5.019s
user    0m0.040s
sys    0m0.008s
postgres@hqdc034:~$

沒錯,又回到1000萬筆資料的狀態了

要注意的是,如果回到更之前的狀態,在該狀態之後的快照將會被清除,除非你先做clone 比如我現在要回到 200萬筆的狀態,那1000萬筆資料的快照就會被刪除

2019-09-06 10:17:32 [changch@hqdc034 ~]$ sudo service postgresql stop
 * Stopping PostgreSQL 10 database server         [ OK ] 
2019-09-06 10:18:50 [changch@hqdc034 ~]$ sudo zfs rollback -r zp/database@demo_db_test_table_with_2M_rows
2019-09-06 10:18:57 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo                         250K      -   231M  -
zp/database@demo_db_just_restore                    138K      -  2.08G  -
zp/database@demo_db_create_test_table_but_no_data   116K      -  2.08G  -
zp/database@demo_db_test_table_with_1M_rows         218K      -  2.15G  -
zp/database@demo_db_test_table_with_2M_rows            0      -  2.23G  -
2019-09-06 10:19:04 [changch@hqdc034 ~]$ sudo service postgresql start
 * Starting PostgreSQL 10 database server        [ OK ] 
2019-09-06 10:19:17 [changch@hqdc034 ~]$

postgres@hqdc034:~$ time psql -c 'select count(*) from test;'
  count  
---------
 2000000
(1 row)

real    0m0.175s
user    0m0.024s
sys    0m0.008s
postgres@hqdc034:~$ 

啊,我剛剛應該先clone的.... 沒關係,我們再做一次,新增800萬筆資料,湊齊1000萬筆,然後快照

postgres@hqdc034:~$ time psql -c 'with aa as ( select * from generate_series (1,8000000) a ) insert into test select aa.a, md5(aa.a::varchar) from aa;'
INSERT 0 8000000

real    0m35.662s
user    0m0.048s
sys    0m0.004s
postgres@hqdc034:~$ time psql -c 'select count(*) from test;'
  count   
----------
 10000000
(1 row)

real    0m5.259s
user    0m0.024s
sys    0m0.008s
postgres@hqdc034:~$ 

做快照

2019-09-06 10:19:17 [changch@hqdc034 ~]$ sudo zfs snapshot zp/database@demo_db_test_table_with_10M_rows
2019-09-06 10:22:59 [changch@hqdc034 ~]$ sudo zfs list -t snapshot
NAME                                                USED  AVAIL  REFER  MOUNTPOINT
zp/database@init_db_no_demo                         250K      -   231M  -
zp/database@demo_db_just_restore                    138K      -  2.08G  -
zp/database@demo_db_create_test_table_but_no_data   116K      -  2.08G  -
zp/database@demo_db_test_table_with_1M_rows         218K      -  2.15G  -
zp/database@demo_db_test_table_with_2M_rows        56.4M      -  2.23G  -
zp/database@demo_db_test_table_with_10M_rows           0      -  1.81G  -
2019-09-06 10:23:02 [changch@hqdc034 ~]$

接著來測試看看 clone snapshot,這是基本的說明

Clones can only be created from a snapshot and a snapshot can not 
be deleted until you delete the clone that is based on this snapshot. 
To create a clone, use the zfs clone command.

clone 會做出一份跟clone來源一模一樣的資料,在快照模式下,資料是唯讀的,clone出來後,就可以做異動。但是不能刪除clone來源的快照,會提示錯誤。

2019-09-06 10:28:31 [changch@hqdc034 ~]$ sudo zfs clone zp/database@demo_db_test_table_with_10M_rows zp/database/clone_with_10M_rows
2019-09-06 10:29:21 [changch@hqdc034 ~]$ sudo zfs list
NAME                              USED  AVAIL  REFER  MOUNTPOINT
zp                               3.08G   231G    22K  /zp
zp/database                      3.08G   231G  1.88G  /zp/database
zp/database/clone_with_10M_rows      0   231G  1.81G  /zp/database/clone_with_10M_rows
2019-09-06 10:29:26 [changch@hqdc034 ~]$

可以看到做了clone之後,多了一個 zfs dataset 試試看把資料庫路徑直接改到這個新做的dataset 看看能不能啟動資料庫 修改 /etc/postgresql/10/main/postgresql.conf,然後重起postgresql

#data_directory = '/var/lib/postgresql/10/main'     # use data in another directory
#data_directory = '/zp/database/10/main'
data_directory = '/zp/database/clone_with_10M_rows/10/main'

啟動有比較久一點 而且好像沒成功啟動

2019-09-06 10:32:27 [changch@hqdc034 ~]$ sudo service postgresql restart
 * Restarting PostgreSQL 10 database server       [ OK ]
2019-09-06 10:33:37 [changch@hqdc034 ~]$ 
2019-09-06 10:33:37 [changch@hqdc034 ~]$ sudo netstat -antlp |grep 5432

而且在 syslog & postgresql log 中看不到什麼異常,怪了! 而且再啟動一次就好了?

再來測試一次看看

2019-09-06 10:37:22 [changch@hqdc034 ~]$ sudo service postgresql stop
 * Stopping PostgreSQL 10 database server        [ OK ] 
2019-09-06 10:38:03 [changch@hqdc034 ~]$ sudo zfs list
NAME                              USED  AVAIL  REFER  MOUNTPOINT
zp                               3.24G   231G    22K  /zp
zp/database                      3.24G   231G  1.88G  /zp/database
zp/database/clone_with_10M_rows   165M   231G  1.88G  /zp/database/clone_with_10M_rows
2019-09-06 10:38:13 [changch@hqdc034 ~]$ sudo zfs destroy zp/database/clone_with_10M_rows
2019-09-06 10:38:21 [changch@hqdc034 ~]$ sudo zfs clone zp/database@demo_db_test_table_with_10M_rows zp/database/clone_with_10M_rows
2019-09-06 10:38:32 [changch@hqdc034 ~]$ sudo zfs list
NAME                              USED  AVAIL  REFER  MOUNTPOINT
zp                               3.08G   231G    22K  /zp
zp/database                      3.08G   231G  1.88G  /zp/database
zp/database/clone_with_10M_rows      0   231G  1.81G  /zp/database/clone_with_10M_rows
2019-09-06 10:38:35 [changch@hqdc034 ~]$ sudo service postgresql start^C
2019-09-06 10:38:44 [changch@hqdc034 ~]$ sudo zfs list
NAME                              USED  AVAIL  REFER  MOUNTPOINT
zp                               3.08G   231G    22K  /zp
zp/database                      3.08G   231G  1.88G  /zp/database
zp/database/clone_with_10M_rows      0   231G  1.81G  /zp/database/clone_with_10M_rows
2019-09-06 10:38:45 [changch@hqdc034 ~]$ sudo service postgresql start
 * Starting PostgreSQL 10 database server               [ OK ] 
2019-09-06 10:39:04 [changch@hqdc034 ~]$ netstat -antlp |grep 5432
(Not all processes could be identified, non-owned process info
 will not be shown, you would have to be root to see it all.)
tcp        0      0 0.0.0.0:5432            0.0.0.0:*               LISTEN      -               
tcp6       0      0 :::5432                 :::*                    LISTEN      -               
2019-09-06 10:39:13 [changch@hqdc034 ~]$ 

這次就沒問題?看來是我第一次下指令的時候,不該用sudo netstat -antlp 去檢查? anyway ,回到psql 來看看內容

postgres@hqdc034:~$ time psql -c 'select count(*) from test;'
  count   
----------
 10000000
(1 row)


real    0m4.716s
user    0m0.028s
sys    0m0.004s
postgres@hqdc034:~$ 

Good ,clone 出來的果然是1000萬筆資料時的狀態


這次測試就先到此為止,後面再來測試zfs的 replication and send/recv