A little applied tetrisology.
Everything new is well-forgotten old.
Epigraphs.
Problem statement
The current PostgreSQL log file has to be downloaded from the AWS cloud to a local Linux host. Not in real time but, let us say, with a small delay.
The log file download period is 5 minutes.
In AWS, the log file is rotated every hour.
Tools used
To download the log file to the host, a bash script is used that calls the AWS API "aws rds download-db-log-file-portion".
Parameters:
- --db-instance-identifier: the instance name in AWS;
- --log-file-name: the name of the currently generated log file;
- --max-items: the total number of items returned in the command's output, i.e. the size of the downloaded portion;
- --starting-token: the starting token.
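As a rough sketch of how these parameters fit together, the helper below only prints the command line instead of running it. The function name `print_portion_cmd`, the instance name `MYDB`, the log file timestamp, and the token value `TOKEN` are hypothetical and serve only as an illustration.

```shell
#!/bin/bash
# Hypothetical helper: prints (does not run) the AWS CLI call for one portion.
#   $1 = instance name, $2 = log file name, $3 = portion size,
#   $4 = starting token (optional, only when continuing a download)
print_portion_cmd() {
    local cmd=(aws rds download-db-log-file-portion
               --db-instance-identifier "$1"
               --log-file-name "$2"
               --max-items "$3")
    # The starting token is passed only when a previous portion exists.
    if [[ -n "${4:-}" ]]; then
        cmd+=(--starting-token "$4")
    fi
    echo "${cmd[*]}"
}

print_portion_cmd MYDB error/postgresql.log.2020-01-01-10 1024
print_portion_cmd MYDB error/postgresql.log.2020-01-01-10 1024 TOKEN
```

Printing the command first is a cheap way to check the arguments before any traffic is generated.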
And, frankly, it is simply an interesting task for practice and variety during working hours.
I assume the problem has already been solved, given how routine it is. But a quick Google search did not turn up any solutions, and I had no particular desire to dig deeper. In any case, it is a good exercise.
Formalizing the task
The final log file is a set of lines of variable length. Graphically, the log file can be represented like this:
Does it remind you of something already? What does Tetris have to do with it? Here is what.
If we draw the possible cases that arise when downloading the next portion (for simplicity, let all lines have the same length), we get the standard Tetris pieces:
1) The file is downloaded in full and is final. The portion size is larger than the size of the final file:
2) The file has a continuation. The portion size is smaller than the size of the final file:
3) The portion continues the previous one and has a continuation itself. The portion size is smaller than the size of the remainder of the final file:
4) The portion continues the previous one and is final. The portion size is larger than the size of the remainder of the final file:
The task is to assemble a rectangle, or to play Tetris at a new level.
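The four cases can be told apart by two facts: whether the portion was started from a token, and whether a next token came back with it. The sketch below encodes that reading of the list; the function name `portion_case` and the token values are hypothetical.

```shell
#!/bin/bash
# Classify a downloaded portion into one of the four cases listed above.
#   $1 = token the portion was started from (empty at the start of the file)
#   $2 = next token returned with the portion (empty when the file has ended)
portion_case() {
    local start_token="$1" next_token="$2"
    if [[ -z "$start_token" && -z "$next_token" ]]; then
        echo 1   # whole file in one portion, final
    elif [[ -z "$start_token" ]]; then
        echo 2   # start of the file, more to come
    elif [[ -n "$next_token" ]]; then
        echo 3   # continues the previous portion, more to come
    else
        echo 4   # continues the previous portion, final
    fi
}

portion_case "" ""       # case 1
portion_case "abc" "def" # case 3
```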
Problems that arise while solving the task
1) Gluing a line together from 2 portions
In general, there were no particular problems. It is a standard task from a first programming course.
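A minimal sketch of the gluing step, assuming the last line of every portion may be cut and the first line of the next portion completes it. The helper name `append_portion` and its argument order are hypothetical; this is not the code from the script itself.

```shell
#!/bin/bash
# Hypothetical helper: append one portion to the result file, gluing the line
# that was cut at the seam between the previous portion and this one.
#   $1 = result file, $2 = portion file, $3 = fragment carried over from the previous portion
# Prints the new carried-over fragment (the possibly cut last line) on stdout.
append_portion() {
    local result="$1" portion="$2" carry="$3" total
    total=$(awk 'END { print NR }' "$portion")
    if (( total <= 1 )); then
        # A lone line both completes the old fragment and may itself be cut.
        printf '%s%s' "$carry" "$(cat "$portion")"
        return
    fi
    # The first line completes the fragment carried over from the previous portion.
    printf '%s%s\n' "$carry" "$(head -n 1 "$portion")" >> "$result"
    # Lines between the first and the last are complete; copy them unchanged.
    if (( total > 2 )); then
        sed -n "2,$((total - 1))p" "$portion" >> "$result"
    fi
    # The last line may be cut; hand it back to the caller.
    tail -n 1 "$portion"
}
```

The caller keeps the returned fragment and passes it in with the next portion. One limitation of this approach: if a portion happens to end exactly on a line boundary, the next line is still glued onto it.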
2) The optimal portion size
This, however, is a bit more interesting.
Unfortunately, there is no way to use an offset after the starting token:
As you are already aware the option --starting-token is used to specify where to start paginating. This option takes String values which would mean that if you try to add an offset value in front of the Next Token string, the option will not be taken into consideration as an offset.
So the file has to be read in portions.
If you read in large portions, the number of reads is minimal, but the volume is maximal.
If you read in small portions, on the contrary, the number of reads is maximal, but the volume is minimal.
So, to reduce traffic and for the overall elegance of the solution, I had to come up with a workaround that, unfortunately, looks a bit like a crutch.
To illustrate, consider the process of downloading a log file in 2 heavily simplified variants. In both cases the number of reads depends on the portion size.
1) Download in small portions:
2) Download in large portions:
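The dependence of the number of reads on the portion size is a ceiling division. The sketch below uses made-up numbers (10000 items in the file) and a hypothetical function name `reads_needed`, with sizes in whatever unit --max-items counts.

```shell
#!/bin/bash
# Number of reads needed to fetch total_items with a fixed portion size
# (ceiling division in integer arithmetic).
reads_needed() {
    local total_items="$1" portion="$2"
    echo $(( (total_items + portion - 1) / portion ))
}

reads_needed 10000 100    # small portions: prints 100
reads_needed 10000 5000   # large portions: prints 2
```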
As usual, the best solution lies in the middle.
The portion size starts out minimal, but during reading it can be increased to reduce the number of reads.
Note that the problem of choosing the optimal size of the read portion has not been fully solved yet and requires deeper study and analysis. Perhaps a little later.
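A simplified model of this growth rule: after every few reads the portion size is multiplied by a factor and capped at a maximum. The constants mirror the names used in the script; starting from `min_item_size` and the function name `portion_schedule` are assumptions made for this sketch (the script itself starts from a value stored in the database).

```shell
#!/bin/bash
# Simplified model: after every `growth_counter_max` reads the portion size is
# multiplied by `growth_factor`, but never exceeds `max_item_size`.
min_item_size=1024
max_item_size=1048576
growth_factor=3
growth_counter_max=3

# Prints the portion size used for each of the first $1 reads, one per line.
portion_schedule() {
    local reads="$1" size=$min_item_size counter=1 i
    for (( i = 1; i <= reads; i++ )); do
        echo "$size"
        counter=$(( counter + 1 ))
        if (( counter > growth_counter_max )); then
            size=$(( size * growth_factor ))
            counter=1
        fi
        if (( size > max_item_size )); then
            size=$max_item_size
        fi
    done
}

# prints: 1024 1024 1024 3072 3072 3072 9216
portion_schedule 7 | tr '\n' ' '; echo
```

Small first portions keep short downloads cheap, while the geometric growth limits the number of reads for a long file.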
General description of the implementation
Service tables used
CREATE TABLE endpoint
(
id SERIAL ,
host text
);
TABLE database
(
id SERIAL ,
…
last_aws_log_time text ,
last_aws_nexttoken text ,
aws_max_item_size integer
);
last_aws_log_time: the timestamp of the last downloaded log file, in YYYY-MM-DD-HH24 format.
last_aws_nexttoken: the text token of the last downloaded portion.
aws_max_item_size: the initial portion size, chosen empirically.
Full text of the script
download_aws_piece.sh
#!/bin/bash
#########################################################
# download_aws_piece.sh
# download piece of log from AWS
# version HABR
let min_item_size=1024
let max_item_size=1048576
let growth_factor=3
let growth_counter=1
let growth_counter_max=3
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh:''STARTED'
AWS_LOG_TIME=$1
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh:AWS_LOG_TIME='$AWS_LOG_TIME
database_id=$2
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh:database_id='$database_id
RESULT_FILE=$3
endpoint=`psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -A -t -c "select e.host from endpoint e join database d on e.id = d.endpoint_id where d.id = $database_id "`
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh:endpoint='$endpoint
db_instance=`echo $endpoint | awk -F"." '{print toupper($1)}'`
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh:db_instance='$db_instance
LOG_FILE=$RESULT_FILE'.tmp_log'
TMP_FILE=$LOG_FILE'.tmp'
TMP_MIDDLE=$LOG_FILE'.tmp_mid'
TMP_MIDDLE2=$LOG_FILE'.tmp_mid2'
current_aws_log_time=`psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -A -t -c "select last_aws_log_time from database where id = $database_id "`
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh:current_aws_log_time='$current_aws_log_time
if [[ $current_aws_log_time != $AWS_LOG_TIME ]];
then
is_new_log='1'
if ! psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -v ON_ERROR_STOP=1 -A -t -q -c "update database set last_aws_log_time = '$AWS_LOG_TIME' where id = $database_id "
then
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: FATAL_ERROR - update database set last_aws_log_time .'
exit 1
fi
else
is_new_log='0'
fi
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh:is_new_log='$is_new_log
let last_aws_max_item_size=`psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -A -t -c "select aws_max_item_size from database where id = $database_id "`
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: last_aws_max_item_size='$last_aws_max_item_size
let count=1
if [[ $is_new_log == '1' ]];
then
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: START DOWNLOADING OF NEW AWS LOG'
if ! aws rds download-db-log-file-portion \
--max-items $last_aws_max_item_size \
--region REGION \
--db-instance-identifier $db_instance \
--log-file-name error/postgresql.log.$AWS_LOG_TIME > $LOG_FILE
then
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: FATAL_ERROR - Could not get log from AWS .'
exit 2
fi
else
next_token=`psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -v ON_ERROR_STOP=1 -A -t -c "select last_aws_nexttoken from database where id = $database_id "`
if [[ $next_token == '' ]];
then
next_token='0'
fi
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: CONTINUE DOWNLOADING OF AWS LOG'
if ! aws rds download-db-log-file-portion \
--max-items $last_aws_max_item_size \
--starting-token $next_token \
--region REGION \
--db-instance-identifier $db_instance \
--log-file-name error/postgresql.log.$AWS_LOG_TIME > $LOG_FILE
then
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: FATAL_ERROR - Could not get log from AWS .'
exit 3
fi
line_count=`cat $LOG_FILE | wc -l`
let lines=$line_count-1
tail -$lines $LOG_FILE > $TMP_MIDDLE
mv -f $TMP_MIDDLE $LOG_FILE
fi
next_token_str=`cat $LOG_FILE | grep NEXTTOKEN`
next_token=`echo $next_token_str | awk -F" " '{ print $2}' `
grep -v NEXTTOKEN $LOG_FILE > $TMP_FILE
if [[ $next_token == '' ]];
then
cp $TMP_FILE $RESULT_FILE
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: NEXTTOKEN NOT FOUND - FINISH '
# -f: the intermediate files may not exist when the log fits into one portion
rm -f $LOG_FILE
rm -f $TMP_FILE
rm -f $TMP_MIDDLE
rm -f $TMP_MIDDLE2
exit 0
else
psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -v ON_ERROR_STOP=1 -A -t -q -c "update database set last_aws_nexttoken = '$next_token' where id = $database_id "
fi
first_str=`tail -1 $TMP_FILE`
line_count=`cat $TMP_FILE | wc -l`
let lines=$line_count-1
head -$lines $TMP_FILE > $RESULT_FILE
###############################################
# MAIN CIRCLE
let count=2
while [[ $next_token != '' ]];
do
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: count='$count
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: START DOWNLOADING OF AWS LOG'
if ! aws rds download-db-log-file-portion \
--max-items $last_aws_max_item_size \
--starting-token $next_token \
--region REGION \
--db-instance-identifier $db_instance \
--log-file-name error/postgresql.log.$AWS_LOG_TIME > $LOG_FILE
then
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: FATAL_ERROR - Could not get log from AWS .'
exit 4
fi
next_token_str=`cat $LOG_FILE | grep NEXTTOKEN`
next_token=`echo $next_token_str | awk -F" " '{ print $2}' `
TMP_FILE=$LOG_FILE'.tmp'
grep -v NEXTTOKEN $LOG_FILE > $TMP_FILE
last_str=`head -1 $TMP_FILE`
if [[ $next_token == '' ]];
then
concat_str=$first_str$last_str
echo $concat_str >> $RESULT_FILE
line_count=`cat $TMP_FILE | wc -l`
let lines=$line_count-1
tail -$lines $TMP_FILE >> $RESULT_FILE
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: NEXTTOKEN NOT FOUND - FINISH '
# -f: the intermediate files may not exist if the loop ends on its first pass
rm -f $LOG_FILE
rm -f $TMP_FILE
rm -f $TMP_MIDDLE
rm -f $TMP_MIDDLE2
exit 0
fi
if [[ $next_token != '' ]];
then
let growth_counter=$growth_counter+1
if [[ $growth_counter -gt $growth_counter_max ]];
then
let last_aws_max_item_size=$last_aws_max_item_size*$growth_factor
let growth_counter=1
fi
if [[ $last_aws_max_item_size -gt $max_item_size ]];
then
let last_aws_max_item_size=$max_item_size
fi
psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -A -t -q -c "update database set last_aws_nexttoken = '$next_token' where id = $database_id "
concat_str=$first_str$last_str
echo $concat_str >> $RESULT_FILE
line_count=`cat $TMP_FILE | wc -l`
let lines=$line_count-1
#############################
#Get middle of file
head -$lines $TMP_FILE > $TMP_MIDDLE
line_count=`cat $TMP_MIDDLE | wc -l`
let lines=$line_count-1
tail -$lines $TMP_MIDDLE > $TMP_MIDDLE2
cat $TMP_MIDDLE2 >> $RESULT_FILE
first_str=`tail -1 $TMP_FILE`
fi
let count=$count+1
done
#
#################################################################
exit 0
Script fragments with some explanations:
Script input parameters:
- Timestamp of the log file name in YYYY-MM-DD-HH24 format: AWS_LOG_TIME=$1
- Database ID: database_id=$2
- Name of the assembled log file: RESULT_FILE=$3
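As an illustration of the three parameters, the sketch below builds the timestamp for the current hour and prints a possible invocation without running it. The database ID `1` and the result path under `/tmp` are hypothetical, and using UTC hours is an assumption about how the log files are named.

```shell
#!/bin/bash
# Build the YYYY-MM-DD-HH24 timestamp for the current hour.
# Assumption: the log file names use UTC hours; adjust if yours differ.
aws_log_time=$(date -u +%Y-%m-%d-%H)

# Hypothetical database ID and result file path; the command is only printed.
echo "./download_aws_piece.sh $aws_log_time 1 /tmp/postgresql.log.$aws_log_time"
```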
Get the timestamp of the last downloaded log file:
current_aws_log_time=`psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -A -t -c "select last_aws_log_time from database where id = $database_id "`
If the timestamp of the last downloaded log file does not match the input parameter, a new log file is downloaded:
if [[ $current_aws_log_time != $AWS_LOG_TIME ]];
then
is_new_log='1'
if ! psql -h ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -v ON_ERROR_STOP=1 -A -t -c "update database set last_aws_log_time = '$AWS_LOG_TIME' where id = $database_id "
then
echo '***download_aws_piece.sh -FATAL_ERROR - update database set last_aws_log_time .'
exit 1
fi
else
is_new_log='0'
fi
Get the value of the nexttoken label from the downloaded file:
next_token_str=`cat $LOG_FILE | grep NEXTTOKEN`
next_token=`echo $next_token_str | awk -F" " '{ print $2}' `
An empty nexttoken value signals the end of the download.
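The same extraction can be sketched as a single awk call, assuming the marker line has the form `NEXTTOKEN <value>` with the marker as the first field. The function name `extract_next_token` is hypothetical.

```shell
#!/bin/bash
# Print the token from a "NEXTTOKEN <value>" line of a downloaded portion,
# or nothing when the marker is absent (end of the log file).
#   $1 = portion file
extract_next_token() {
    awk '$1 == "NEXTTOKEN" { print $2; exit }' "$1"
}
```

Unlike a plain grep, this does not react to a log line that merely contains the word NEXTTOKEN somewhere in its text.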
In a loop, we read portions of the file, gluing lines together along the way and increasing the portion size:
Main loop
# MAIN CIRCLE
let count=2
while [[ $next_token != '' ]];
do
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: count='$count
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: START DOWNLOADING OF AWS LOG'
if ! aws rds download-db-log-file-portion \
--max-items $last_aws_max_item_size \
--starting-token $next_token \
--region REGION \
--db-instance-identifier $db_instance \
--log-file-name error/postgresql.log.$AWS_LOG_TIME > $LOG_FILE
then
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: FATAL_ERROR - Could not get log from AWS .'
exit 4
fi
next_token_str=`cat $LOG_FILE | grep NEXTTOKEN`
next_token=`echo $next_token_str | awk -F" " '{ print $2}' `
TMP_FILE=$LOG_FILE'.tmp'
grep -v NEXTTOKEN $LOG_FILE > $TMP_FILE
last_str=`head -1 $TMP_FILE`
if [[ $next_token == '' ]];
then
concat_str=$first_str$last_str
echo $concat_str >> $RESULT_FILE
line_count=`cat $TMP_FILE | wc -l`
let lines=$line_count-1
tail -$lines $TMP_FILE >> $RESULT_FILE
echo $(date +%Y%m%d%H%M)': download_aws_piece.sh: NEXTTOKEN NOT FOUND - FINISH '
# -f: the intermediate files may not exist if the loop ends on its first pass
rm -f $LOG_FILE
rm -f $TMP_FILE
rm -f $TMP_MIDDLE
rm -f $TMP_MIDDLE2
exit 0
fi
if [[ $next_token != '' ]];
then
let growth_counter=$growth_counter+1
if [[ $growth_counter -gt $growth_counter_max ]];
then
let last_aws_max_item_size=$last_aws_max_item_size*$growth_factor
let growth_counter=1
fi
if [[ $last_aws_max_item_size -gt $max_item_size ]];
then
let last_aws_max_item_size=$max_item_size
fi
psql -h MONITOR_ENDPOINT.rds.amazonaws.com -U USER -d MONITOR_DATABASE -A -t -q -c "update database set last_aws_nexttoken = '$next_token' where id = $database_id "
concat_str=$first_str$last_str
echo $concat_str >> $RESULT_FILE
line_count=`cat $TMP_FILE | wc -l`
let lines=$line_count-1
#############################
#Get middle of file
head -$lines $TMP_FILE > $TMP_MIDDLE
line_count=`cat $TMP_MIDDLE | wc -l`
let lines=$line_count-1
tail -$lines $TMP_MIDDLE > $TMP_MIDDLE2
cat $TMP_MIDDLE2 >> $RESULT_FILE
first_str=`tail -1 $TMP_FILE`
fi
let count=$count+1
done
What's next?
So, the first intermediate task, "download the log file from the cloud", is solved. What do we do with the downloaded log?
First, the log file has to be parsed and the actual queries extracted from it.
The task is not very difficult. The simplest bash script copes with it quite well.
upload_log_query.sh
#!/bin/bash
#########################################################
# upload_log_query.sh
# Upload log_query table from downloaded aws file
# version HABR
###########################################################
echo 'TIMESTAMP:'$(date +%c)' Upload log_query table '
source_file=$1
echo 'source_file='$source_file
database_id=$2
echo 'database_id='$database_id
beginer=' '
first_line='1'
let "line_count=0"
sql_line=' '
sql_flag=' '
space=' '
cat $source_file | while read line
do
line="$space$line"
if [[ $first_line == "1" ]]; then
beginer=`echo $line | awk -F" " '{ print $1}' `
first_line='0'
fi
current_beginer=`echo $line | awk -F" " '{ print $1}' `
if [[ $current_beginer == $beginer ]]; then
if [[ $sql_flag == '1' ]]; then
sql_flag='0'
log_date=`echo $sql_line | awk -F" " '{ print $1}' `
log_time=`echo $sql_line | awk -F" " '{ print $2}' `
duration=`echo $sql_line | awk -F" " '{ print $5}' `
#replace ' to ''
sql_modline=`echo "$sql_line" | sed "s/'/''/g"`
sql_line=' '
################
#PROCESSING OF THE SQL-SELECT IS HERE
if ! psql -h ENDPOINT.rds.amazonaws.com -U USER -d DATABASE -v ON_ERROR_STOP=1 -A -t -c "select log_query('$ip_port',$database_id , '$log_date' , '$log_time' , '$duration' , '$sql_modline' )"
then
echo 'FATAL_ERROR - log_query '
exit 1
fi
################
fi #if [[ $sql_flag == '1' ]]; then
let "line_count=line_count+1"
check=`echo $line | awk -F" " '{ print $8}' `
check_sql=${check^^}
#echo 'check_sql='$check_sql
if [[ $check_sql == 'SELECT' ]]; then
sql_flag='1'
sql_line="$sql_line$line"
ip_port=`echo $sql_line | awk -F":" '{ print $4}' `
fi
else
if [[ $sql_flag == '1' ]]; then
sql_line="$sql_line$line"
fi
fi #if [[ $current_beginer == $beginer ]]; then
done
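The quote-doubling step (replacing ' with '') can also be done without an external process. The sketch below is one way to write it with bash parameter expansion; the function name `sql_escape` is hypothetical.

```shell
#!/bin/bash
# Double every single quote so the text can sit inside a SQL string literal.
sql_escape() {
    local q="'"
    # Holding the quote in a variable avoids quoting pitfalls inside ${...//...}.
    printf '%s' "${1//$q/$q$q}"
}

sql_escape "select 'a' from t"; echo   # prints: select ''a'' from t
```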
Now you can work with the query extracted from the log file.
And several useful possibilities open up.
The parsed queries need to be stored somewhere. The service table log_query is used for this.
CREATE TABLE log_query
(
id SERIAL ,
queryid bigint ,
query_md5hash text not null ,
database_id integer not null ,
timepoint timestamp without time zone not null,
duration double precision not null ,
query text not null ,
explained_plan text[],
plan_md5hash text ,
explained_plan_wo_costs text[],
plan_hash_value text ,
baseline_id integer ,
ip text ,
port text
);
ALTER TABLE log_query ADD PRIMARY KEY (id);
ALTER TABLE log_query ADD CONSTRAINT queryid_timepoint_unique_key UNIQUE (queryid, timepoint );
ALTER TABLE log_query ADD CONSTRAINT query_md5hash_timepoint_unique_key UNIQUE (query_md5hash, timepoint );
CREATE INDEX log_query_timepoint_idx ON log_query (timepoint);
CREATE INDEX log_query_queryid_idx ON log_query (queryid);
ALTER TABLE log_query ADD CONSTRAINT database_id_fk FOREIGN KEY (database_id) REFERENCES database (id) ON DELETE CASCADE ;
The parsed query is processed in the plpgsql function "log_query".
log_query.sql
--log_query.sql
--version HABR
CREATE OR REPLACE FUNCTION log_query( ip_port text ,log_database_id integer , log_date text , log_time text , duration text , sql_line text ) RETURNS boolean AS $$
DECLARE
result boolean ;
log_timepoint timestamp without time zone ;
log_duration double precision ;
pos integer ;
log_query text ;
activity_string text ;
log_md5hash text ;
log_explain_plan text[] ;
log_planhash text ;
log_plan_wo_costs text[] ;
database_rec record ;
pg_stat_query text ;
test_log_query text ;
log_query_rec record;
found_flag boolean;
pg_stat_history_rec record ;
port_start integer ;
port_end integer ;
client_ip text ;
client_port text ;
log_queryid bigint ;
log_query_text text ;
pg_stat_query_text text ;
BEGIN
result = TRUE ;
RAISE NOTICE '***log_query';
port_start = position('(' in ip_port);
port_end = position(')' in ip_port);
client_ip = substring( ip_port from 1 for port_start-1 );
client_port = substring( ip_port from port_start+1 for port_end-port_start-1 );
SELECT e.host , d.name , d.owner_pwd
INTO database_rec
FROM database d JOIN endpoint e ON e.id = d.endpoint_id
WHERE d.id = log_database_id ;
log_timepoint = to_timestamp(log_date||' '||log_time,'YYYY-MM-DD HH24-MI-SS');
log_duration = duration:: double precision;
pos = position ('SELECT' in UPPER(sql_line) );
log_query = substring( sql_line from pos for LENGTH(sql_line));
log_query = regexp_replace(log_query,' +',' ','g');
log_query = regexp_replace(log_query,';+','','g');
log_query = trim(trailing ' ' from log_query);
log_md5hash = md5( log_query::text );
--Explain execution plan--
EXECUTE 'SELECT dblink_connect(''LINK1'',''host='||database_rec.host||' dbname='||database_rec.name||' user=DATABASE password='||database_rec.owner_pwd||' '')';
log_explain_plan = ARRAY ( SELECT * FROM dblink('LINK1', 'EXPLAIN '||log_query ) AS t (plan text) );
log_plan_wo_costs = ARRAY ( SELECT * FROM dblink('LINK1', 'EXPLAIN ( COSTS FALSE ) '||log_query ) AS t (plan text) );
PERFORM dblink_disconnect('LINK1');
--------------------------
BEGIN
INSERT INTO log_query
(
query_md5hash ,
database_id ,
timepoint ,
duration ,
query ,
explained_plan ,
plan_md5hash ,
explained_plan_wo_costs ,
plan_hash_value ,
ip ,
port
)
VALUES
(
log_md5hash ,
log_database_id ,
log_timepoint ,
log_duration ,
log_query ,
log_explain_plan ,
md5(log_explain_plan::text) ,
log_plan_wo_costs ,
md5(log_plan_wo_costs::text),
client_ip ,
client_port
);
activity_string = 'New query has logged '||
' database_id = '|| log_database_id ||
' query_md5hash='||log_md5hash||
' , timepoint = '||to_char(log_timepoint,'YYYYMMDD HH24:MI:SS');
RAISE NOTICE '%',activity_string;
PERFORM pg_log( log_database_id , 'log_query' , activity_string);
EXCEPTION
WHEN unique_violation THEN
RAISE NOTICE '*** unique_violation *** query already has logged';
END;
SELECT queryid
INTO log_queryid
FROM log_query
WHERE query_md5hash = log_md5hash AND
timepoint = log_timepoint;
IF log_queryid IS NOT NULL
THEN
RAISE NOTICE 'log_query with query_md5hash = % and timepoint = % has already has a QUERYID = %',log_md5hash,log_timepoint , log_queryid ;
RETURN result;
END IF;
------------------------------------------------
RAISE NOTICE 'Update queryid';
SELECT *
INTO log_query_rec
FROM log_query
WHERE query_md5hash = log_md5hash AND timepoint = log_timepoint ;
log_query_rec.query=regexp_replace(log_query_rec.query,';+','','g');
FOR pg_stat_history_rec IN
SELECT
queryid ,
query
FROM
pg_stat_db_queries
WHERE
database_id = log_database_id AND
queryid is not null
LOOP
pg_stat_query = pg_stat_history_rec.query ;
pg_stat_query=regexp_replace(pg_stat_query,'\n+',' ','g');
pg_stat_query=regexp_replace(pg_stat_query,'\t+',' ','g');
pg_stat_query=regexp_replace(pg_stat_query,' +',' ','g');
pg_stat_query=regexp_replace(pg_stat_query,'\$.','%','g');
log_query_text = trim(trailing ' ' from log_query_rec.query);
pg_stat_query_text = pg_stat_query;
--SELECT log_query_rec.query like pg_stat_query INTO found_flag ;
IF (log_query_text LIKE pg_stat_query_text) THEN
found_flag = TRUE ;
ELSE
found_flag = FALSE ;
END IF;
IF found_flag THEN
UPDATE log_query SET queryid = pg_stat_history_rec.queryid WHERE query_md5hash = log_md5hash AND timepoint = log_timepoint ;
activity_string = ' updated queryid = '||pg_stat_history_rec.queryid||
' for log_query with id = '||log_query_rec.id
;
RAISE NOTICE '%',activity_string;
EXIT ;
END IF ;
END LOOP ;
RETURN result ;
END
$$ LANGUAGE plpgsql;
During processing, the service table pg_stat_db_queries is used; it contains a snapshot of the current queries from the pg_stat_history table (the use of this table is described here -
TABLE pg_stat_db_queries
(
database_id integer,
queryid bigint ,
query text ,
max_time double precision
);
TABLE pg_stat_history
(
…
database_id integer ,
…
queryid bigint ,
…
max_time double precision ,
…
);
The function makes it possible to implement a number of useful features for processing queries from the log file. Namely:
Feature #1 - Query execution history
Very useful when starting to investigate a performance incident. First, get to know the history: when did the slowdown begin?
Then, following the classics, look for external causes. Perhaps the database load simply rose sharply and the specific query has nothing to do with it.
Adding a new entry to the log_query table
port_start = position('(' in ip_port);
port_end = position(')' in ip_port);
client_ip = substring( ip_port from 1 for port_start-1 );
client_port = substring( ip_port from port_start+1 for port_end-port_start-1 );
SELECT e.host , d.name , d.owner_pwd
INTO database_rec
FROM database d JOIN endpoint e ON e.id = d.endpoint_id
WHERE d.id = log_database_id ;
log_timepoint = to_timestamp(log_date||' '||log_time,'YYYY-MM-DD HH24-MI-SS');
log_duration = to_number(duration,'99999999999999999999D9999999999');
pos = position ('SELECT' in UPPER(sql_line) );
log_query = substring( sql_line from pos for LENGTH(sql_line));
log_query = regexp_replace(log_query,' +',' ','g');
log_query = regexp_replace(log_query,';+','','g');
log_query = trim(trailing ' ' from log_query);
RAISE NOTICE 'log_query=%',log_query ;
log_md5hash = md5( log_query::text );
--Explain execution plan--
EXECUTE 'SELECT dblink_connect(''LINK1'',''host='||database_rec.host||' dbname='||database_rec.name||' user=DATABASE password='||database_rec.owner_pwd||' '')';
log_explain_plan = ARRAY ( SELECT * FROM dblink('LINK1', 'EXPLAIN '||log_query ) AS t (plan text) );
log_plan_wo_costs = ARRAY ( SELECT * FROM dblink('LINK1', 'EXPLAIN ( COSTS FALSE ) '||log_query ) AS t (plan text) );
PERFORM dblink_disconnect('LINK1');
--------------------------
BEGIN
INSERT INTO log_query
(
query_md5hash ,
database_id ,
timepoint ,
duration ,
query ,
explained_plan ,
plan_md5hash ,
explained_plan_wo_costs ,
plan_hash_value ,
ip ,
port
)
VALUES
(
log_md5hash ,
log_database_id ,
log_timepoint ,
log_duration ,
log_query ,
log_explain_plan ,
md5(log_explain_plan::text) ,
log_plan_wo_costs ,
md5(log_plan_wo_costs::text),
client_ip ,
client_port
);
Feature #2 - Saving query execution plans
At this point an objection, clarification, or comment may arise: "But auto_explain already exists". Yes, it does, but what is the point if the execution plan is stored in the same log file and, to keep it for further analysis, you have to parse the log file anyway?
What I needed, however, was:
first: to store the execution plan in a service table of the monitoring database;
second: to be able to compare execution plans with each other in order to see at once that a query's execution plan has changed.
A query with specific execution parameters is available. Getting and storing its execution plan with EXPLAIN is an elementary task.
Moreover, using the EXPLAIN (COSTS FALSE) expression, you can get the skeleton of the plan, which is used to compute the plan's hash value; this helps in the subsequent analysis of the history of execution plan changes.
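Why the cost-free skeleton is the right thing to hash can be shown with a small shell analogue. The three plan lines below are made up for illustration and only formatted like EXPLAIN output; stripping the cost part with sed stands in for EXPLAIN (COSTS FALSE), and `plan_skeleton_hash` is a hypothetical name.

```shell
#!/bin/bash
# Hash of a plan "skeleton": strip the (cost=... rows=... width=...) part of
# each line, then take md5. A shell stand-in for hashing COSTS FALSE output.
plan_skeleton_hash() {
    sed -E 's/[[:space:]]*\(cost=[^)]*\)//' | md5sum | cut -d' ' -f1
}

# Made-up plan lines: a and b differ only in estimates, c is a different plan.
plan_a='Seq Scan on t  (cost=0.00..35.50 rows=2550 width=4)'
plan_b='Seq Scan on t  (cost=0.00..71.00 rows=5100 width=4)'
plan_c='Index Scan using t_pkey on t  (cost=0.15..8.17 rows=1 width=4)'

hash_a=$(echo "$plan_a" | plan_skeleton_hash)
hash_b=$(echo "$plan_b" | plan_skeleton_hash)
hash_c=$(echo "$plan_c" | plan_skeleton_hash)

if [[ "$hash_a" == "$hash_b" ]]; then echo "a and b: same plan skeleton"; fi
if [[ "$hash_a" != "$hash_c" ]]; then echo "a and c: plan changed"; fi
```

With costs included, a and b would hash differently even though the plan shape is the same, which is exactly the noise the skeleton hash avoids.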
Getting the execution plan template
--Explain execution plan--
EXECUTE 'SELECT dblink_connect(''LINK1'',''host='||database_rec.host||' dbname='||database_rec.name||' user=DATABASE password='||database_rec.owner_pwd||' '')';
log_explain_plan = ARRAY ( SELECT * FROM dblink('LINK1', 'EXPLAIN '||log_query ) AS t (plan text) );
log_plan_wo_costs = ARRAY ( SELECT * FROM dblink('LINK1', 'EXPLAIN ( COSTS FALSE ) '||log_query ) AS t (plan text) );
PERFORM dblink_disconnect('LINK1');
Feature #3 - Using the query log for monitoring
Since performance metrics are configured not for the query text but for its ID, queries from the log file have to be linked to the queries for which performance metrics are configured.
Well, at least in order to have the exact time at which a performance incident occurred.
Thus, when a performance incident occurs for a query ID, there will be a reference to a specific query with specific parameter values, the exact execution time, and the query duration. Getting this information using only the pg_stat_statements view is impossible.
Finding the queryid of the query and updating the entry in the log_query table
SELECT *
INTO log_query_rec
FROM log_query
WHERE query_md5hash = log_md5hash AND timepoint = log_timepoint ;
log_query_rec.query=regexp_replace(log_query_rec.query,';+','','g');
FOR pg_stat_history_rec IN
SELECT
queryid ,
query
FROM
pg_stat_db_queries
WHERE
database_id = log_database_id AND
queryid is not null
LOOP
pg_stat_query = pg_stat_history_rec.query ;
pg_stat_query=regexp_replace(pg_stat_query,'\n+',' ','g');
pg_stat_query=regexp_replace(pg_stat_query,'\t+',' ','g');
pg_stat_query=regexp_replace(pg_stat_query,' +',' ','g');
pg_stat_query=regexp_replace(pg_stat_query,'\$.','%','g');
log_query_text = trim(trailing ' ' from log_query_rec.query);
pg_stat_query_text = pg_stat_query;
--SELECT log_query_rec.query like pg_stat_query INTO found_flag ;
IF (log_query_text LIKE pg_stat_query_text) THEN
found_flag = TRUE ;
ELSE
found_flag = FALSE ;
END IF;
IF found_flag THEN
UPDATE log_query SET queryid = pg_stat_history_rec.queryid WHERE query_md5hash = log_md5hash AND timepoint = log_timepoint ;
activity_string = ' updated queryid = '||pg_stat_history_rec.queryid||
' for log_query with id = '||log_query_rec.id
;
RAISE NOTICE '%',activity_string;
EXIT ;
END IF ;
END LOOP ;
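The normalization that turns a statement with placeholders into a LIKE pattern can be tried out in the shell. A pattern of the form `\$.` consumes only one character after the dollar sign, so `$10` would become `%0`; the sketch below uses `\$[0-9]+` instead, which is a suggested variation and not the article's code. The function name `to_like_pattern` is hypothetical.

```shell
#!/bin/bash
# Turn a normalized statement (placeholders $1, $2, ...) into a LIKE pattern:
# newlines and tabs become spaces, runs of spaces collapse, placeholders become %.
to_like_pattern() {
    printf '%s\n' "$1" | tr '\n\t' '  ' | sed -E 's/ +/ /g; s/ $//; s/\$[0-9]+/%/g'
}

# prints: SELECT a FROM t WHERE id = % AND x = %
to_like_pattern $'SELECT a\nFROM t\tWHERE id = $10 AND x = $2'; echo
```

Note that any % or _ already present in the statement text also acts as a wildcard in LIKE, so a match found this way is a heuristic and not a proof of identity.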
Afterword
As a result, the described technique has found application
Although, of course, in my personal opinion, the algorithm for choosing and changing the size of the downloaded portion still needs work. The problem has not yet been solved in the general case. It will probably be interesting.
But that is a completely different story...
Source: www.habr.com