PostgreSQL-rako ASH analogo bat sortzeko saiakera bat
Arazoaren formulazioa
PostgreSQL kontsultak optimizatzeko, oso beharrezkoa da jardueren historia aztertzeko gaitasuna, bereziki itxaronaldiak, blokeoak eta taula estatistikak.
pgsentinel luzapena :
Β«Metatutako informazio guztia RAMan soilik gordetzen da, eta kontsumitutako memoria kopurua gordetako azken erregistroen arabera arautzen da.
Queryid eremua gehitzen da - pg_stat_statements luzapeneko queryid bera (aurretik instalatu behar da).Β«
Horrek, noski, asko lagunduko luke, baina trabarik handiena lehen puntua daΒ».Metatutako informazio guztia RAM memorian soilik gordetzen da ", alegia. helburua oinarrian eragina dago. Horrez gain, ez dago blokeo-historiarik eta taula-estatistikorik. Horiek. irtenbidea, oro har, osatugabea da: "Oraindik ez dago instalatzeko prest dagoen paketerik. Iturriak deskargatzea eta liburutegia zuk zeuk muntatzea gomendatzen da. Lehenik eta behin zure zerbitzarirako "devel" paketea instalatu eta bidea ezarri behar duzu pg_config PATH aldagaian.".
Oro har, zalaparta handia dago, eta ekoizpen datu-base serioen kasuan, baliteke zerbitzariarekin ezer egin ezin izatea. Berriro ere gure zerbait asmatu behar dugu.
Abisua.
Bolumen handi samarra dela eta eta azterketa-aldia osatugabea dela eta, artikuluak, batez ere, informazio izaera du, tesi eta tarteko emaitzen multzo gisa baizik.
Material zehatzagoa geroago prestatuko da, zatika
Konponbiderako eskakizunen zirriborroa
Beharrezkoa da gordetzeko aukera ematen duen tresna bat garatzea:
pg_stat_activity ikusi historia Saioen blokeoaren historia pg_locks ikuspegia erabiliz
Irtenbide-eskakizunaβhelburuko datu-basean eragina gutxitzea.
Ideia orokorraβ Datuak biltzeko agentea ez da helburuko datu-basean abiarazten, baizik eta jarraipenaren datu-basean systemd zerbitzu gisa. Bai, datu-galera batzuk posible dira, baina hori ez da kritikoa txostenak egiteko, baina ez dago helburuko datu-basean eraginik memoriari eta diskoko espazioari dagokionez. Eta konexio-pool bat erabiltzearen kasuan, erabiltzaileen prozesuetan eragina gutxienekoa da.
Ezarpen-etapak
1.Zerbitzu taulak
Taulak gordetzeko eskema bereizi bat erabiltzen da, erabilitako taula nagusien azterketa ez zailtzeko.
DROP SCHEMA IF EXISTS activity_hist ;
CREATE SCHEMA activity_hist AUTHORIZATION monitor ;
Garrantzitsua: eskema ez da helburuko datu-basean sortzen, jarraipenaren datu-basean baizik.
pg_stat_activity ikusi historia
Taula bat erabiltzen da pg_stat_activity ikuspegiaren uneko argazkiak gordetzeko
activity_hist.history_pg_stat_activity :
--ACTIVITY_HIST.HISTORY_PG_STAT_ACTIVITY
DROP TABLE IF EXISTS activity_hist.history_pg_stat_activity;
CREATE TABLE activity_hist.history_pg_stat_activity
(
timepoint timestamp without time zone ,
datid oid ,
datname name ,
pid integer,
usesysid oid ,
usename name ,
application_name text ,
client_addr inet ,
client_hostname text ,
client_port integer,
backend_start timestamp with time zone ,
xact_start timestamp with time zone ,
query_start timestamp with time zone ,
state_change timestamp with time zone ,
wait_event_type text ,
wait_event text ,
state text ,
backend_xid xid ,
backend_xmin xid ,
query text ,
backend_type text ,
queryid bigint
);
Txertazioa bizkortzeko - indizerik edo murrizketarik gabe.
Historia bera gordetzeko, zatitutako taula bat erabiltzen da:
activity_hist.archive_pg_stat_activity :
DROP TABLE IF EXISTS activity_hist.archive_pg_stat_activity;
CREATE TABLE activity_hist.archive_pg_stat_activity
(
timepoint timestamp without time zone ,
datid oid ,
datname name ,
pid integer,
usesysid oid ,
usename name ,
application_name text ,
client_addr inet ,
client_hostname text ,
client_port integer,
backend_start timestamp with time zone ,
xact_start timestamp with time zone ,
query_start timestamp with time zone ,
state_change timestamp with time zone ,
wait_event_type text ,
wait_event text ,
state text ,
backend_xid xid ,
backend_xmin xid ,
query text ,
backend_type text ,
queryid bigint
)
PARTITION BY RANGE (timepoint);
Kasu honetan txertatze-abiadurarako baldintzarik ez dagoenez, indize batzuk sortu dira txostenak sortzea azkartzeko.
Saioa blokeatzeko historia
Taula bat erabiltzen da saioen blokeoen uneko argazkiak gordetzeko:
activity_hist.history_locking:
--ACTIVITY_HIST.HISTORY_LOCKING
DROP TABLE IF EXISTS activity_hist.history_locking;
CREATE TABLE activity_hist.history_locking
(
timepoint timestamp without time zone ,
locktype text ,
relation oid ,
mode text ,
tid xid ,
vtid text ,
pid integer ,
blocking_pids integer[] ,
granted boolean
);
Gainera, txertaketa bizkortzeko, ez dago indizerik edo murrizketarik.
Historia bera gordetzeko, zatitutako taula bat erabiltzen da:
activity_hist.archive_locking:
DROP TABLE IF EXISTS activity_hist.archive_locking;
CREATE TABLE activity_hist.archive_locking
(
timepoint timestamp without time zone ,
locktype text ,
relation oid ,
mode text ,
tid xid ,
vtid text ,
pid integer ,
blocking_pids integer[] ,
granted boolean
)
PARTITION BY RANGE (timepoint);
Kasu honetan txertatze-abiadurarako baldintzarik ez dagoenez, indize batzuk sortu dira txostenak sortzea azkartzeko.
2.Gaur egungo historia betetzea
Ikuspegiaren argazkiak zuzenean biltzeko, plpgsql funtzioa exekutatzen duen bash script bat erabiltzen da.
plpgsql dblink funtzioak helburuko datu-baseko bistara sartzen ditu eta errenkadak txertatzen ditu jarraipenaren datu-baseko zerbitzu-tauletan.
get_current_activity.sql
CREATE OR REPLACE FUNCTION activity_hist.get_current_activity( current_host text , current_s_name text , current_s_pass text ) RETURNS BOOLEAN AS $$
DECLARE
database_rec record;
dblink_str text ;
BEGIN
EXECUTE 'SELECT dblink_connect(''LINK1'',''host='||current_host||' port=5432 dbname=postgres'||
' user='||current_s_name||' password='||current_s_pass|| ' '')';
--------------------------------------------------------------------
--GET pg_stat_activity stats
INSERT INTO activity_hist.history_pg_stat_activity
(
SELECT * FROM dblink('LINK1',
'SELECT
now() ,
datid ,
datname ,
pid ,
usesysid ,
usename ,
application_name ,
client_addr ,
client_hostname ,
client_port ,
backend_start ,
xact_start ,
query_start ,
state_change ,
wait_event_type ,
wait_event ,
state ,
backend_xid ,
backend_xmin ,
query ,
backend_type
FROM pg_stat_activity
')
AS t (
timepoint timestamp without time zone ,
datid oid ,
datname name ,
pid integer,
usesysid oid ,
usename name ,
application_name text ,
client_addr inet ,
client_hostname text ,
client_port integer,
backend_start timestamp with time zone ,
xact_start timestamp with time zone ,
query_start timestamp with time zone ,
state_change timestamp with time zone ,
wait_event_type text ,
wait_event text ,
state text ,
backend_xid xid ,
backend_xmin xid ,
query text ,
backend_type text
)
);
---------------------------------------
--ACTIVITY_HIST.HISTORY_LOCKING
INSERT INTO activity_hist.history_locking
(
SELECT * FROM dblink('LINK1',
'SELECT
now() ,
lock.locktype,
lock.relation,
lock.mode,
lock.transactionid as tid,
lock.virtualtransaction as vtid,
lock.pid,
pg_blocking_pids(lock.pid),
lock.granted
FROM pg_catalog.pg_locks lock LEFT JOIN pg_catalog.pg_database db ON db.oid = lock.database
WHERE NOT lock.pid = pg_backend_pid()
')
AS t (
timepoint timestamp without time zone ,
locktype text ,
relation oid ,
mode text ,
tid xid ,
vtid text ,
pid integer ,
blocking_pids integer[] ,
granted boolean
)
);
PERFORM dblink_disconnect('LINK1');
RETURN TRUE ;
END
$$ LANGUAGE plpgsql;
Ikuspegiak biltzeko, systemd zerbitzua eta bi script erabiltzen dira:
pg_current_activity.service
# /etc/systemd/system/pg_current_activity.service
[Unit]
Description=Collect history of pg_stat_activity , pg_locks
Wants=pg_current_activity.timer
[Service]
Type=forking
StartLimitIntervalSec=0
ExecStart=/home/postgres/pgutils/demon/get_current_activity.sh 10.124.70.40 postgres postgres
[Install]
WantedBy=multi-user.target
pg_current_activity.timer
# /etc/systemd/system/pg_current_activity.timer
[Unit]
Description=Run pg_current_activity.sh every 1 second
Requires=pg_current_activity.service
[Timer]
Unit=pg_current_activity.service
OnCalendar=*:*:0/1
AccuracySec=1
[Install]
WantedBy=timers.target
Hasi gaitezen zerbitzua:
# systemctl daemon-reload
# systemctl start pg_current_activity.service
Horrela, bisten historia segundoz segundoko argazkien moduan biltzen da. Noski, dena bere horretan utziz gero, mahaiak oso azkar handituko dira tamaina eta lan produktiboa edo gutxiago ezinezkoa izango da.
Beharrezkoa da datuen artxiboa antolatzea.
3. Historia artxibatzea
Artxibatzeko, partizionatutako taulen artxiboa* erabiltzen da.
Partizio berriak sortzen dira orduro, datu zaharrak historia* tauletatik kentzen diren bitartean, beraz, historia* taulen tamaina ez da asko aldatzen eta txertatze-abiadura ez da hondatzen denborarekin.
Atal berrien sorrera plpgsql funtzioak egiten du activity_hist.archive_current_activity. Lanaren algoritmoa oso erraza da (archive_pg_stat_activity taularako ataleko adibidea erabiliz).
Sortu eta bete atal berri bat
EXECUTE format(
'CREATE TABLE ' || partition_name ||
' PARTITION OF activity_hist.archive_pg_stat_activity FOR VALUES FROM ( %L ) TO ( %L ) ' ,
to_char(date_trunc('year', partition_min_range ),'YYYY')||'-'||
to_char(date_trunc('month', partition_min_range ),'MM')||'-'||
to_char(date_trunc('day', partition_min_range ),'DD')||' '||
to_char(date_trunc('hour', partition_min_range ),'HH24')||':00',
to_char(date_trunc('year', partition_max_range ),'YYYY')||'-'||
to_char(date_trunc('month', partition_max_range ),'MM')||'-'||
to_char(date_trunc('day', partition_max_range ),'DD')||' '||
to_char(date_trunc('hour', partition_max_range ),'HH24')||':00'
);
INSERT INTO activity_hist.archive_pg_stat_activity
(
SELECT *
FROM activity_hist.history_pg_stat_activity
WHERE timepoint BETWEEN partition_min_range AND partition_max_range
);
Indizeak sortzea
EXECUTE format (
'CREATE INDEX '||index_name||
' ON '||partition_name||' ( wait_event_type , backend_type , timepoint )'
);
EXECUTE format ('CREATE INDEX '||index_name||
' ON '||partition_name||' ( wait_event_type , backend_type , timepoint , queryid )'
);
Historia_pg_stat_activity taulatik datu zaharrak kentzen
DELETE
FROM activity_hist.history_pg_stat_activity
WHERE timepoint < partition_max_range;
Noski, noizean behin, atal zaharrak ezabatzen dira alferrikako moduan.
Oinarrizko txostenak
Egia esan, zergatik egiten da hau guztia? Oracle-ren AWR gogorarazten duten txostenak lortzeko.
Garrantzitsua da gehitzea txostenak jasotzeko, pg_stat_activity eta pg_stat_statements ikuspegien arteko konexioa sortu behar duzula. Taulak 'queryid' zutabe bat gehituz lotzen dira 'history_pg_stat_activity', 'archive_pg_stat_activity' tauletan. Zutabe-balioa gehitzeko metodoa artikulu honen esparrutik kanpo dago eta hemen deskribatzen da - pg_stat_statements + pg_stat_activity + loq_query = pg_ash? .
PUZAREN DENBORA GUZTIRA KONTSULTETAN
Eskaera:
WITH hist AS
(
SELECT
aa.query ,aa.queryid ,
count(*) * interval '1 second' AS duration
FROM activity_hist.archive_pg_stat_activity aa
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND backend_type = 'client backend' AND datname != 'postgres' AND ( aa.wait_event_type IS NULL ) ANDaa.state = 'active'
GROUP BY aa.wait_event_type , aa.wait_event , aa.query ,aa.queryid
UNION
SELECT
ha.query ,ha.queryid,
count(*) * interval '1 second' AS duration
FROM activity_hist.history_pg_stat_activity_for_reports ha
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND backend_type = 'client backend' AND datname != 'postgres' AND ( ha.wait_event_type IS NULL )AND ha.state = 'active'
GROUP BY ha.wait_event_type , ha.wait_event , ha.query ,ha.queryid
)
SELECT query , queryid , SUM( duration ) as duration
FROM hist
GROUP BY query , queryid
ORDER BY 3 DESC
WITH hist AS
(
SELECT
aa.query ,aa.queryid ,
count(*) * interval '1 second' AS duration
FROM activity_hist.archive_pg_stat_activity aa
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
backend_type = 'client backend' AND datname != 'postgres' AND
( aa.wait_event_type IS NOT NULL )
GROUP BY aa.wait_event_type , aa.wait_event , aa.query ,aa.queryid
UNION
SELECT
ha.query ,ha.queryid,
count(*) * interval '1 second' AS duration
FROM activity_hist.history_pg_stat_activity_for_reports ha
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
backend_type = 'client backend' AND datname != 'postgres' AND
( ha.wait_event_type IS NOT NULL )
GROUP BY ha.wait_event_type , ha.wait_event , ha.query ,ha.queryid
)
SELECT query , queryid , SUM( duration ) as duration
FROM hist
GROUP BY query , queryid
ORDER BY 3 DESC
WITH hist AS
(
SELECT
aa.wait_event_type , aa.wait_event
FROM activity_hist.archive_pg_stat_activity aa
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
backend_type = 'client backend' AND datname != 'postgres' AND
aa.wait_event IS NOT NULL
GROUP BY aa.wait_event_type , aa.wait_event
UNION
SELECT
ha.wait_event_type , ha.wait_event
FROM activity_hist.history_pg_stat_activity_for_reports ha
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
backend_type = 'client backend' AND datname != 'postgres' AND
ha.wait_event IS NOT NULL
GROUP BY ha.wait_event_type , ha.wait_event
)
SELECT wait_event_type , wait_event
FROM hist
GROUP BY wait_event_type , wait_event
ORDER BY 1 ASC,2 ASC
----------------------------------------------------------------------
WITH hist AS
(
SELECT
aa.wait_event_type , aa.wait_event , aa.query ,aa.queryid ,
count(*) * interval '1 second' AS duration
FROM activity_hist.archive_pg_stat_activity aa
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
backend_type = 'client backend' AND datname != 'postgres' AND
( aa.wait_event_type = waitings_stat_rec.wait_event_type AND aa.wait_event = waitings_stat_rec.wait_event )
GROUP BY aa.wait_event_type , aa.wait_event , aa.query ,aa.queryid
UNION
SELECT
ha.wait_event_type , ha.wait_event , ha.query ,ha.queryid,
count(*) * interval '1 second' AS duration
FROM activity_hist.history_pg_stat_activity_for_reports ha
WHERE timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
backend_type = 'client backend' AND datname != 'postgres' AND
( ha.wait_event_type = waitings_stat_rec.wait_event_type AND ha.wait_event = waitings_stat_rec.wait_event )
GROUP BY ha.wait_event_type , ha.wait_event , ha.query ,ha.queryid
)
SELECT query , queryid , SUM( duration ) as duration
FROM hist
GROUP BY query , queryid
ORDER BY 3 DESC
SELECT
MIN(date_trunc('second',timepoint)) AS started ,
count(*) * interval '1 second' as duration ,
pid , blocking_pids , relation , mode , locktype
FROM
activity_hist.archive_locking al
WHERE
timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
NOT granted AND
locktype = 'relation'
GROUP BY pid , blocking_pids , relation , mode , locktype
UNION
SELECT
MIN(date_trunc('second',timepoint)) AS started ,
count(*) * interval '1 second' as duration ,
pid , blocking_pids , relation , mode , locktype
FROM
activity_hist.history_locking
WHERE
timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
NOT granted AND
locktype = 'relation'
GROUP BY pid , blocking_pids , relation , mode , locktype
ORDER BY 1
SELECT
blocking_pids
FROM
activity_hist.archive_locking al
WHERE
timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
NOT granted AND
locktype = 'relation'
GROUP BY blocking_pids
UNION
SELECT
blocking_pids
FROM
activity_hist.history_locking
WHERE
timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour') AND
NOT granted AND
locktype = 'relation'
GROUP BY blocking_pids
ORDER BY 1
---------------------------------------------------------------
SELECT
pid , usename , application_name , datname ,
MIN(date_trunc('second',timepoint)) as started ,
count(*) * interval '1 second' as duration ,
state ,
query
FROM activity_hist.archive_pg_stat_activity
WHERE pid= current_pid AND
timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour')
GROUP BY pid , usename , application_name ,
datname ,
state_change,
state ,
query
UNION
SELECT
pid , usename , application_name , datname ,
MIN(date_trunc('second',timepoint)) as started ,
count(*) * interval '1 second' as duration ,
state ,
query
FROM activity_hist.history_pg_stat_activity_for_reports
WHERE pid= current_pid AND
timepoint BETWEEN pg_stat_history_begin+(current_hour_diff * interval '1 hour') AND pg_stat_history_end+(current_hour_diff * interval '1 hour')
GROUP BY pid , usename , application_name ,
datname ,
state_change,
state ,
query
ORDER BY 5 , 1
Erakutsitako oinarrizko kontsultek eta ondoriozko txostenek dagoeneko bizitza askoz errazten dute errendimendu-intzidurrak aztertzean.
Oinarrizko kontsultetan oinarrituta, Oracle-ren AWRren antza lausoa duen txostena lor dezakezu. Laburpen txostenaren adibidea
+------------------------------------------------- ----------------------------------- | JARDUERAREN ETA ITXOEN TXOSTENA BATERATUA.
Jarraituko du. Hurrengo lerroan blokeo-historia bat sortzea da (pg_stat_locks), taulak betetzeko prozesuaren deskribapen zehatzagoa.