WHY prepared Statement running slower in some situation CASE

3 minute read

背景

在某些情况下，使用PREPARED STATEMENT或函数，可能会比直接执行SQL更慢。为什么呢？

这个要从执行计划说起，

下面来看一个测试表

digoal=> \d tbl_user    
            Table "digoal.tbl_user"    
  Column   |         Type          | Modifiers     
-----------+-----------------------+-----------    
 id        | integer               | not null    
 firstname | character varying(32) |     
 lastname  | character varying(32) |     
 corp      | character varying(32) |     
 age       | integer               |     
Indexes:    
    "tbl_user_pkey" PRIMARY KEY, btree (id)    
    "idx_user_age" btree (age)    
digoal=> insert into tbl_user select generate_series(1,100000),'zhou','digoal','sky-mobi',27 ;    
INSERT 0 100000    
digoal=> insert into tbl_user select generate_series(100001,100100),'zhou','digoal','sky-mobi',generate_series(1,100) ;    
INSERT 0 100    
digoal=> analyze tbl_user;    
ANALYZE    
digoal=> select age,count(*) from tbl_user group by age order by count(*);    
 age | count      
-----+--------    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
|      1    
| 100001    
(100 rows)    

如果按照AGE为条件查询，可能全表扫描（如age=27），或走索引(如age=1)。

digoal=> explain analyze select * from tbl_user where age=1;    
                                                       QUERY PLAN                                                           
------------------------------------------------------------------------------------------------------------------------    
 Index Scan using idx_user_age on tbl_user  (cost=0.00..4.32 rows=3 width=29) (actual time=0.018..0.019 rows=1 loops=1)    
   Index Cond: (age = 1)    
 Total runtime: 0.042 ms    
(3 rows)    
    
Time: 0.516 ms    
digoal=> explain analyze select * from tbl_user where age=27;    
                                                   QUERY PLAN                                                        
-----------------------------------------------------------------------------------------------------------------    
 Seq Scan on tbl_user  (cost=0.00..1989.25 rows=100000 width=29) (actual time=0.010..21.112 rows=100001 loops=1)    
   Filter: (age = 27)    
 Total runtime: 27.784 ms    
(3 rows)    
    
Time: 28.114 ms    

使用prepared statement看看情况如何:

digoal=> prepare p_user (int) as select * from tbl_user where age=$1;    
PREPARE    
Time: 12.408 ms    
digoal=> explain analyze execute p_user(1);    
                                                         QUERY PLAN                                                              
-----------------------------------------------------------------------------------------------------------------------------    
 Index Scan using idx_user_age on tbl_user  (cost=0.00..105.55 rows=3229 width=29) (actual time=0.012..0.012 rows=1 loops=1)    
   Index Cond: (age = $1)    
 Total runtime: 0.038 ms    
(3 rows)    
    
Time: 0.191 ms    
digoal=> explain analyze execute p_user(27);    
                                                            QUERY PLAN                                                                 
-----------------------------------------------------------------------------------------------------------------------------------    
 Index Scan using idx_user_age on tbl_user  (cost=0.00..105.55 rows=3229 width=29) (actual time=0.016..23.100 rows=100001 loops=1)    
   Index Cond: (age = $1)    
 Total runtime: 30.069 ms    
(3 rows)    
    
Time: 30.403 ms    

10W左右数据量的情况下用索引比全表扫描相差3ms左右

插入1000W左右数据再看看情况,

digoal=> insert into tbl_user select generate_series(100101,9999999),'zhou','digoal','sky-mobi',27 ;    
INSERT 0 9899899    
digoal=> explain analyze execute p_user(1);    
                                                            QUERY PLAN                                                                 
-----------------------------------------------------------------------------------------------------------------------------------    
 Index Scan using idx_user_age on tbl_user  (cost=0.00..159018.74 rows=5000094 width=29) (actual time=0.015..0.015 rows=1 loops=1)    
   Index Cond: (age = $1)    
 Total runtime: 0.036 ms    
(3 rows)    
    
Time: 0.535 ms    
digoal=> explain analyze execute p_user(27);    
                                                                 QUERY PLAN                                                             
            
------------------------------------------------------------------------------------------------------------------------------------    
--------    
 Index Scan using idx_user_age on tbl_user  (cost=0.00..159018.74 rows=5000094 width=29) (actual time=0.014..2352.636 rows=9999900 l    
oops=1)    
   Index Cond: (age = $1)    
 Total runtime: 3042.357 ms    
(3 rows)    
    
Time: 3042.689 ms    
    
digoal=> explain analyze select * from tbl_user where age=27;    
                                                      QUERY PLAN                                                           
-----------------------------------------------------------------------------------------------------------------------    
 Seq Scan on tbl_user  (cost=0.00..198533.34 rows=9999854 width=29) (actual time=0.011..2062.048 rows=9999900 loops=1)    
   Filter: (age = 27)    
 Total runtime: 2737.911 ms    
(3 rows)    
    
Time: 2738.430 ms    
digoal=> explain analyze select * from tbl_user where age=1;    
                                                        QUERY PLAN                                                             
---------------------------------------------------------------------------------------------------------------------------    
 Index Scan using idx_user_age on tbl_user  (cost=0.00..15.03 rows=333 width=29) (actual time=0.012..0.013 rows=1 loops=1)    
   Index Cond: (age = 1)    
 Total runtime: 0.031 ms    
(3 rows)    
    
Time: 0.403 ms    

相差约300mS

并不是说这样就不建议使用prepared statement了，prepared statement的使用对于降低CPU开销和服务端代码重用来说是非常有效的。

好在PostgreSQL有算法，可以优化plan cache，即使使用了prepared statement，也可以对于不同的输入值，选择不同的执行计划。

请参考

[《执行计划选择算法与绑定变量 - PostgreSQL prepared statement: SPI_prepare, prepare

execute COMMAND, PL/pgsql STYLE: custom & generic plan cache》](../201212/20121224_01.md)

digoal’s 大量PostgreSQL文章入口

Twitter Facebook Google+ LinkedIn

Digoal.zhou

WHY prepared Statement running slower in some situation CASE

背景

digoal’s 大量PostgreSQL文章入口

You May Also Enjoy

PostgreSQL(PPAS 兼容Oracle) 从零开始入门手册 - 珍藏版

PostgreSQL pipelinedb 流计算插件 - IoT应用 - 实时轨迹聚合

PostgreSQL plpgsql 存储过程、函数 - 状态、异常变量打印、异常捕获… - GET [STACKED] DIAGNOSTICS

PostgreSQL datediff 日期间隔（单位转换）兼容SQL用法