PostgreSQL DBA(4) - PG 11 New Features#1

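PG 11 is about to be officially released. This section briefly introduces some of its new features, including the performance improvements to parallel query and the enhancements to table partitioning.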

1. Parallel Query

Parallel Hash
When executing a Hash Join, PG 11 can both build the hash table and perform the join itself in parallel.
Test script:

testdb=# create table t1 (c1 int,c2 varchar(40),c3 varchar(40));
CREATE TABLE
testdb=# insert into t1 select generate_series(1,5000000),'TEST'||generate_series(1,1000000),generate_series(1,1000000)||'TEST';
INSERT 0 5000000
testdb=# drop table if exists t2;
DROP TABLE
testdb=# create table t2 (c1 int,c2 varchar(40),c3 varchar(40));
CREATE TABLE
testdb=# insert into t2 select generate_series(1,1000000),'T2'||generate_series(1,1000000),generate_series(1,1000000)||'T2';
INSERT 0 1000000
testdb=# explain verbose
testdb-# select t1.c1,t2.c1 
testdb-# from t1 inner join t2 on t1.c1 = t2.c1;
                                         QUERY PLAN                                          
---------------------------------------------------------------------------------------------
 Gather  (cost=18372.00..107975.86 rows=101100 width=8)
   Output: t1.c1, t2.c1
   Workers Planned: 2 -- 2 Workers
   ->  Parallel Hash Join  (cost=17372.00..96865.86 rows=42125 width=8) -- Parallel Hash Join
         Output: t1.c1, t2.c1
         Hash Cond: (t1.c1 = t2.c1)
         ->  Parallel Seq Scan on public.t1  (cost=0.00..45787.33 rows=2083333 width=4)
               Output: t1.c1
         ->  Parallel Hash  (cost=10535.67..10535.67 rows=416667 width=4) -- Parallel Hash
               Output: t2.c1
               ->  Parallel Seq Scan on public.t2  (cost=0.00..10535.67 rows=416667 width=4)
                     Output: t2.c1

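Besides Parallel Hash, PG 11 can also run the following in parallel: Parallel Append (used for set operations such as UNION ALL), the query part of CREATE TABLE AS SELECT, CREATE MATERIALIZED VIEW, and SELECT INTO, and B-tree index builds in CREATE INDEX.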
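Whether these parallel paths are actually chosen depends on a few planner settings, which can be inspected and toggled per session. The following is a minimal sketch that assumes the t1 and t2 tables from the test script above; the values are illustrative rather than recommendations, and idx_t1_c1 is a hypothetical index name.

```sql
-- Workers available to a single Gather node (the plan above planned 2).
show max_parallel_workers_per_gather;

-- Disable Parallel Hash for this session and compare the plan with the
-- one above: the planner can no longer choose a Parallel Hash node.
set enable_parallel_hash = off;
explain select t1.c1, t2.c1 from t1 inner join t2 on t1.c1 = t2.c1;
reset enable_parallel_hash;

-- Parallel B-tree index builds are governed by a separate setting.
set max_parallel_maintenance_workers = 2;
create index idx_t1_c1 on t1(c1);   -- hypothetical index name
```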

2. Table Partitioning

Hash Partition
PG 11 introduces hash partitioning. The official documentation describes it as follows:

The table is partitioned by specifying a modulus and a remainder for each partition. Each partition will hold the rows for which the hash value of the partition key divided by the specified modulus will produce the specified remainder.

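Each hash partition must specify a modulus and a remainder. A row is stored in the partition whose declared remainder satisfies the following (the hash value is treated as an unsigned integer):
remainder = hashfunc(key) % modulus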

drop table if exists t_hash1;
create table t_hash1 (c1 int,c2  varchar(40),c3 varchar(40)) partition by hash(c1);
create table t_hash1_1 partition of t_hash1 for values with (modulus 6,remainder 0);
create table t_hash1_2 partition of t_hash1 for values with (modulus 6,remainder 1);
create table t_hash1_3 partition of t_hash1 for values with (modulus 6,remainder 2);
create table t_hash1_4 partition of t_hash1 for values with (modulus 6,remainder 3);
create table t_hash1_5 partition of t_hash1 for values with (modulus 6,remainder 4);
create table t_hash1_6 partition of t_hash1 for values with (modulus 6,remainder 5);

testdb=# insert into t_hash1 
testdb-# select generate_series(1,1000000),'HASH'||generate_series(1,1000000),generate_series(1,1000000)||'HASH';
INSERT 0 1000000

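The data is distributed roughly evenly across the partitions.
Note (2018-9-19): because of an error in the insert statement, the result published yesterday was wrong (it showed an uneven distribution, with t_hash1_1 holding far more rows than the other partitions). Please disregard it.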

testdb=# select count(*) from only t_hash1;
 count 
-------
     0
(1 row)

testdb=# select count(*) from only t_hash1_1;
 count  
--------
 166480
(1 row)

testdb=# select count(*) from only t_hash1_2;
 count  
--------
 166904
(1 row)

testdb=# select count(*) from only t_hash1_3;
 count  
--------
 166302
(1 row)

testdb=# select count(*) from only t_hash1_4;
 count  
--------
 166783
(1 row)

testdb=# select count(*) from only t_hash1_5;
 count  
--------
 166593
(1 row)

testdb=# select count(*) from only t_hash1_6;
 count  
--------
 166938
(1 row)
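
The per-partition counts above can also be collected in a single query. The sketch below assumes the t_hash1 table populated earlier; it relies on the tableoid system column, plus the satisfies_hash_partition function that PG 11 uses internally in hash partition constraints (its use here is only an illustration of the modulus/remainder rule).

```sql
-- Row count per partition: tableoid identifies the partition that
-- physically stores each row.
select tableoid::regclass as part, count(*)
from t_hash1
group by tableoid
order by tableoid;

-- Count the rows that satisfy the constraint of the partition declared
-- with (modulus 6, remainder 0); this should match the count of t_hash1_1.
select count(*)
from t_hash1
where satisfies_hash_partition('t_hash1'::regclass, 6, 0, c1);
```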

A hash partition key can also be defined on a character column.

testdb=# drop table if exists t_hash3;
DROP TABLE
testdb=# create table t_hash3 (c1 int,c2  varchar(40),c3 varchar(40)) partition by hash(c2);
CREATE TABLE

-- Partitions must be created before any data can be stored
testdb=# insert into t_hash3 
testdb-# select generate_series(1,100000),'HASH'||generate_series(1,1000000),generate_series(1,1000000)||'HASH';
ERROR:  no partition of relation "t_hash3" found for row
DETAIL:  Partition key of the failing row contains (c2) = (HASH1).

-- Modulus 6 but only 3 partitions (remainders 0 to 2): the insert fails for rows whose remainder is 3, 4 or 5
testdb=# create table t_hash3_1 partition of t_hash3 for values with (modulus 6,remainder 0);
CREATE TABLE
testdb=# create table t_hash3_2 partition of t_hash3 for values with (modulus 6,remainder 1);
CREATE TABLE
testdb=# create table t_hash3_3 partition of t_hash3 for values with (modulus 6,remainder 2);
CREATE TABLE
testdb=# insert into t_hash3 
testdb-# select generate_series(1,10000),'HASH'||generate_series(1,10000),generate_series(1,10000)||'HASH';
ERROR:  no partition of relation "t_hash3" found for row
DETAIL:  Partition key of the failing row contains (c2) = (HASH1).

-- Modulus 3 with 3 partitions (remainders 0 to 2): the insert succeeds
testdb=# drop table if exists t_hash3;
DROP TABLE
testdb=# create table t_hash3 (c1 int,c2  varchar(40),c3 varchar(40)) partition by hash(c2);
CREATE TABLE
testdb=# create table t_hash3_1 partition of t_hash3 for values with (modulus 3,remainder 0);
CREATE TABLE
testdb=# create table t_hash3_2 partition of t_hash3 for values with (modulus 3,remainder 1);
CREATE TABLE
testdb=# create table t_hash3_3 partition of t_hash3 for values with (modulus 3,remainder 2);
CREATE TABLE
testdb=# insert into t_hash3 
testdb-# select generate_series(1,10000),'HASH'||generate_series(1,10000),generate_series(1,10000)||'HASH';
INSERT 0 10000

The data in t_hash3 is also distributed fairly evenly across its partitions:

testdb=# select count(*) from only t_hash3;
 count 
-------
     0
(1 row)

testdb=# select count(*) from only t_hash3_1;
 count 
-------
  3378
(1 row)

testdb=# select count(*) from only t_hash3_2;
 count 
-------
  3288
(1 row)

testdb=# select count(*) from only t_hash3_3;
 count 
-------
  3334
(1 row)
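
The failed insert with modulus 6 shows that the declared remainders must together cover every possible hash value. The partitions of one table do not all have to use the same modulus, though: according to the CREATE TABLE documentation, each modulus must be a factor of the next larger one. The sketch below illustrates this with a hypothetical table t_hash4, in which the rows for remainder 2 of modulus 3 are split across two modulus-6 partitions.

```sql
-- t_hash4 is a hypothetical table used only for this illustration.
drop table if exists t_hash4;
create table t_hash4 (c1 int, c2 varchar(40)) partition by hash (c2);
create table t_hash4_1 partition of t_hash4 for values with (modulus 3, remainder 0);
create table t_hash4_2 partition of t_hash4 for values with (modulus 3, remainder 1);
-- hash % 3 = 2 is split in two: hash % 6 = 2 and hash % 6 = 5.
create table t_hash4_3 partition of t_hash4 for values with (modulus 6, remainder 2);
create table t_hash4_4 partition of t_hash4 for values with (modulus 6, remainder 5);

-- Every hash value now maps to exactly one partition, so the insert succeeds.
insert into t_hash4 select i, 'HASH' || i from generate_series(1, 10000) as i;
```

A layout like this is one way to split a single partition later without redistributing the rows of the others.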

Default Partition
List- and range-partitioned tables can define a Default Partition (hash partitioning does not support one).
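
As a minimal sketch of a default partition, using a hypothetical list-partitioned table t_list that is not part of the original test script:

```sql
-- t_list is a hypothetical table used only for this illustration.
drop table if exists t_list;
create table t_list (c1 int, c2 varchar(40)) partition by list (c1);
create table t_list_1 partition of t_list for values in (1, 2, 3);
create table t_list_default partition of t_list default;

-- 1 goes to t_list_1; 99 matches no list and lands in the default partition.
insert into t_list values (1, 'LIST'), (99, 'OTHER');
select tableoid::regclass as part, c1 from t_list order by c1;
```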

Update partition key
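PG 11 allows the partition key to be updated. When the new value belongs to a different partition, the row is moved to that partition.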
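
The row movement can be observed with a small range-partitioned table. This is a sketch under the assumption of a hypothetical table t_range:

```sql
-- t_range is a hypothetical table used only for this illustration.
drop table if exists t_range;
create table t_range (c1 int, c2 varchar(40)) partition by range (c1);
create table t_range_1 partition of t_range for values from (1) to (100);
create table t_range_2 partition of t_range for values from (100) to (200);

insert into t_range values (50, 'RANGE');
select tableoid::regclass as part, c1 from t_range;   -- row is in t_range_1

-- Changing the key to a value owned by another partition moves the row.
update t_range set c1 = 150 where c1 = 50;
select tableoid::regclass as part, c1 from t_range;   -- row is now in t_range_2
```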

Create unique constraint
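PG 11 supports creating primary keys and unique indexes on partitioned tables (note: Oracle has supported this since very early versions).
B-tree indexes can also be created on ordinary columns.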

testdb=# alter table t_hash1 add primary key(c1);
ALTER TABLE
testdb=# create index idx_t_hash1_c2 on t_hash1(c2);
CREATE INDEX
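
One restriction worth keeping in mind: a primary key or unique constraint on a partitioned table has to include all partition key columns. The primary key on c1 above is accepted because t_hash1 is partitioned by c1. A short sketch, with hypothetical index names:

```sql
-- Accepted: the unique index contains the partition key column c1.
create unique index idx_t_hash1_c1_c2 on t_hash1(c1, c2);

-- Expected to be rejected in PG 11, because c2 alone does not include
-- the partition key c1:
-- create unique index idx_t_hash1_c2_uq on t_hash1(c2);
```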

FOREIGN KEY support
PG 11 supports creating foreign keys on partitioned tables.
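
In PG 11 the partitioned table can be the referencing side of a foreign key; referencing a partitioned table from another table is not supported in this release. A minimal sketch, using hypothetical tables t_parent and t_child:

```sql
-- t_parent and t_child are hypothetical tables for illustration.
drop table if exists t_child;
drop table if exists t_parent;
create table t_parent (id int primary key);
create table t_child (
    c1 int,
    parent_id int references t_parent(id)
) partition by hash (c1);
create table t_child_1 partition of t_child for values with (modulus 2, remainder 0);
create table t_child_2 partition of t_child for values with (modulus 2, remainder 1);

insert into t_parent values (1);
insert into t_child values (10, 1);    -- accepted: parent row 1 exists
-- insert into t_child values (11, 2); -- would violate the foreign key
```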

Beyond the features above, PG 11 also improves partitioning in the following areas: automatic index creation, INSERT ON CONFLICT, partition-wise join, partition-wise aggregate, FOR EACH ROW triggers, dynamic partition elimination, and control over partition pruning.
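
Several of these enhancements are controlled by planner settings. The sketch below assumes the t_hash1 table from earlier and only shows how to toggle and observe them, not how to tune them:

```sql
-- Partition-wise join and aggregate are off by default in PG 11.
set enable_partitionwise_join = on;
set enable_partitionwise_aggregate = on;

-- Partition pruning is on by default; shown here for completeness.
show enable_partition_pruning;

-- With pruning enabled, the plan is expected to scan only one partition,
-- since c1 is the hash partition key of t_hash1.
explain select * from t_hash1 where c1 = 1;
```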

Author: EthanHe
Link: https://www.jianshu.com/p/e2ea0354179c
