Nā helu helu pūnaewele hou aku i kāu waihona liʻiliʻi

Ma ka nānā ʻana i nā helu helu pūnaewele, loaʻa iā mākou kahi manaʻo o ka mea e hana nei me ia. Hoʻohālikelike mākou i nā hopena me nā ʻike ʻē aʻe e pili ana i ka huahana a i ʻole lawelawe a ma laila e hoʻomaikaʻi ai i kā mākou ʻike.

Ke hoʻopau ʻia ka loiloi o nā hopena mua, ua hoʻomaopopo ʻia ka ʻike a ua huki ʻia nā hopena, hoʻomaka ka pae aʻe. Kū mai nā manaʻo: he aha ka hopena inā ʻoe e nānā i ka ʻikepili mai kēlā ʻaoʻao?

Aia nā palena o nā mea hana loiloi ma kēia pae. ʻO kēia kekahi o nā kumu i lawa ʻole ai ʻo Google Analytics iaʻu, ʻo ia hoʻi no ka palena o ka hiki ke ʻike a hoʻoponopono i kaʻu ʻikepili.

Makemake mau au e hoʻouka wikiwiki i ka ʻikepili kumu (master data), hoʻohui i kahi pae ʻē aʻe o ka hōʻuluʻulu ʻana, a i ʻole wehewehe ʻokoʻa i nā waiwai i loaʻa.

He mea maʻalahi kēia i loko i loko o kāu lumi hoʻopaʻa liʻiliʻi ma muli o ka waihona access.log a ua lawa ka ʻōlelo SQL no kēia.

No laila, he aha nā nīnau aʻu i makemake ai e pane?

He aha a me ka wā i hoʻololi ʻia ma ka pūnaewele

ʻO ka mōʻaukala o nā hoʻololi i ka ʻikepili kumu (master data) he hoihoi mau.

Nā helu helu pūnaewele hou aku i kāu waihona liʻiliʻi

Nīnau hōʻike SQL

SELECT
	1 as 'SideStackedBar: Content Updates by Months',
	strftime('%m/%Y', datetime(UPDATE_DT, 'unixepoch')) AS 'Day',
	COUNT(CASE WHEN PAGE_TITLE != 'n.a.' THEN DIM_REQUEST_ID END) AS 'Web page updates',
	COUNT(CASE WHEN PAGE_DESCR = 'IMAGES' THEN DIM_REQUEST_ID END) AS 'Image uploads',
	COUNT(CASE WHEN PAGE_DESCR = 'VIDEO' THEN DIM_REQUEST_ID END) AS 'Video uploads',
	COUNT(CASE WHEN PAGE_DESCR = 'AUDIO' THEN DIM_REQUEST_ID END) AS 'Audio uploads'
FROM DIM_REQUEST
WHERE PAGE_TITLE != 'n.a.' OR PAGE_DESCR != 'n.a.'
GROUP BY strftime('%m/%Y', datetime(UPDATE_DT, 'unixepoch'))
ORDER BY UPDATE_DT

No ka laʻana, i kekahi manawa, ua hoʻokō ʻia ka ʻenekini hulina a i ʻole ua hoʻohui ʻia nā ʻike hou i ka pūnaewele, a no laila ke manaʻo ʻia nei ka piʻi ʻana o ke kaʻa.

Pūʻulu o nā mea hoʻohana

ʻO ka hiʻohiʻona maʻalahi o kahi hui ka mea hoʻohana a i ʻole ka inoa o ka ʻōnaehana hana.

Ua hōʻiliʻili ka nui o ka mea hoʻohana ma kahi o hoʻokahi kaukani mau moʻolelo a makemake wau e ʻike i ka dynamics o ka hāʻawi ʻana i nā ʻelele i loko o ka hui.

Nā helu helu pūnaewele hou aku i kāu waihona liʻiliʻi

Nīnau hōʻike SQL

SELECT
	1 AS 'SideStackedBar: User Agents',
	AGENT_OS AS 'OS',
	SUM(CASE WHEN AGENT_BOT = 'n.a.' THEN 1 ELSE 0 END ) AS 'User Agent of Users',
	SUM(CASE WHEN AGENT_BOT != 'n.a.' THEN 1 ELSE 0 END ) AS 'User Agent of Bots'
FROM DIM_USER_AGENT
WHERE DIM_USER_AGENT_ID != -1
GROUP BY AGENT_OS
ORDER BY 3 DESC

Hele mai ka hapa nui o nā hui like ʻole i ka pūnaewele mai ka honua Windows. Ma waena o nā mea i ʻike ʻole ʻia ʻo WhatsApp, PocketImageCache, PlayStation, SmartTV, etc.

Hana pūʻulu mea hoʻohana ma ka pule

Ma ka hoʻohui ʻana i kekahi mau pūʻulu, hiki ke nānā i ka puʻunaue o kā lākou hana.

No ka laʻana, ʻoi aku ka nui o nā mea hoʻohana Linux cluster ma mua o nā mea ʻē aʻe.

Nā helu helu pūnaewele hou aku i kāu waihona liʻiliʻi

Nīnau hōʻike SQL

SELECT
1 as 'StackedBar: Traffic Volume by User OS and by Week',
strftime('%W week', datetime(FCT.EVENT_DT, 'unixepoch')) AS 'Week',
SUM(CASE WHEN USG.AGENT_OS IN ('Android', 'Linux') THEN FCT.BYTES ELSE 0 END)/1000 AS 'Android/Linux Users',
SUM(CASE WHEN USG.AGENT_OS IN ('Windows') THEN FCT.BYTES ELSE 0 END)/1000 AS 'Windows Users',
SUM(CASE WHEN USG.AGENT_OS IN ('Macintosh', 'iOS') THEN FCT.BYTES ELSE 0 END)/1000 AS 'Mac/iOS Users',
SUM(CASE WHEN USG.AGENT_OS IN ('n.a.', 'BlackBerry') THEN FCT.BYTES ELSE 0 END)/1000 AS 'Other'
FROM
  FCT_ACCESS_USER_AGENT_DD FCT,
  DIM_USER_AGENT USG,
  DIM_HTTP_STATUS HST
WHERE FCT.DIM_USER_AGENT_ID=USG.DIM_USER_AGENT_ID
  AND FCT.DIM_HTTP_STATUS_ID = HST.DIM_HTTP_STATUS_ID
  AND USG.AGENT_BOT = 'n.a.' /* users only */
  AND HST.STATUS_GROUP IN ('Successful') /* good pages */
  AND datetime(FCT.EVENT_DT, 'unixepoch') > date('now', '-3 month')
GROUP BY strftime('%W week', datetime(FCT.EVENT_DT, 'unixepoch'))
ORDER BY FCT.EVENT_DT

Hoʻohana nui kaʻa

Hōʻike ka papaʻaina i nā pūʻulu mea hoʻohana ikaika loa a me ka lā o kā lākou hana.
ʻO nā mea ikaika loa no ka hui Linux.

Nā helu helu pūnaewele hou aku i kāu waihona liʻiliʻi

Nīnau hōʻike SQL

SELECT
1 AS 'Table: User Agent with Havy Usage',
strftime('%d.%m.%Y', datetime(FCT.EVENT_DT, 'unixepoch')) AS 'Day',
ROUND(1.0*SUM(FCT.BYTES)/1000000, 1) AS 'Traffic MB',
ROUND(1.0*SUM(FCT.IP_CNT)/SUM(1), 1) AS 'IPs',
ROUND(1.0*SUM(FCT.REQUEST_CNT)/SUM(1), 1) AS 'Requests',
USA.DIM_USER_AGENT_ID AS 'ID',
MAX(USA.USER_AGENT_NK) AS 'User Agent',
MAX(USA.AGENT_BOT) AS 'Bot'
FROM
FCT_ACCESS_USER_AGENT_DD FCT,
DIM_USER_AGENT USA
WHERE FCT.DIM_USER_AGENT_ID = USA.DIM_USER_AGENT_ID
  AND datetime(FCT.EVENT_DT, 'unixepoch') >= date('now', '-30 day')
GROUP BY USA.DIM_USER_AGENT_ID, strftime('%d.%m.%Y', datetime(FCT.EVENT_DT, 'unixepoch')) 
ORDER BY SUM(FCT.BYTES) DESC, FCT.EVENT_DT
LIMIT 10

Ke hoʻohana nei i ka lā a me nā ʻano ID ID, hiki ke loaʻa a nānā i nā helu helu i nā lā o nā hui mea hoʻohana. Inā pono, hiki iā ʻoe ke loaʻa koke ka ʻike kikoʻī ma ka papaʻaina.

Pehea e loaʻa ai ka ʻike?

ʻIke mai ka waihona access.log hiki ke hana maikaʻi ʻia ma ka hoʻohui ʻana i nā kumu ʻikepili hou aʻe a me ka hoʻokomo ʻana i nā pae hou o ka hōʻuluʻulu a me ka hui pū ʻana.

ʻIke kumu a me nā hui

Loaʻa ka ʻikepili kumu i ka ʻike e pili ana i nā hui: nā ʻaoʻao pūnaewele, nā kiʻi, nā wikiō a me nā ʻike leo, i ka hihia o kahi hale kūʻai - nā huahana.

ʻO nā hui ponoʻī e hana ma ke ʻano he ana, a ʻo ke kaʻina o ka mālama ʻana i nā loli i nā ʻano i kapa ʻia ʻo historicization. I loko o kahi waihona, hoʻokō pinepine ʻia kēia kaʻina hana ma ke ʻano o nā ana hoʻololi mālie (SCD).

Hiki ke loaʻa mai ka ʻikepili kumu mai nā ʻōnaehana like ʻole, no laila pono lākou e hoʻohui mau.

Hoʻololi lohi i ke ana

Loaʻa i ka DIM_REQUEST ka ʻike e pili ana i nā noi ma ka pūnaewele ma ke ʻano mōʻaukala.

Papa SCD2

CREATE TABLE DIM_REQUEST ( /* scd table for user requests */
  DIM_REQUEST_ID      INTEGER NOT NULL PRIMARY KEY AUTOINCREMENT,
  DIM_REQUEST_ID_HIST INTEGER NOT NULL DEFAULT -1,
  REQUEST_NK          TEXT NOT NULL DEFAULT 'n.a.', /* request without ?parameters */
  PAGE_TITLE          TEXT NOT NULL DEFAULT 'n.a.',
  PAGE_DESCR          TEXT NOT NULL DEFAULT 'n.a.',
  PAGE_KEYWORDS       TEXT NOT NULL DEFAULT 'n.a.',
  DELETE_FLAG         INTEGER NOT NULL DEFAULT 0,
  UPDATE_DT           INTEGER NOT NULL DEFAULT 0,
  UNIQUE (REQUEST_NK, DIM_REQUEST_ID_HIST)
);
INSERT INTO DIM_REQUEST (DIM_REQUEST_ID) VALUES (-1);

Ma waho aʻe o ia mea, e hana mākou i hoʻokahi ʻike e hōʻike mau i nā moʻolelo āpau i ka mokuʻāina hou loa. Pono no ka hoʻouka ʻana i ke ana ponoʻī.

Nā helu helu pūnaewele hou aku i kāu waihona liʻiliʻi

Hōʻike i kēia manawa o SCD2

/* Content: actual view on scd table */
SELECT HI.DIM_REQUEST_ID,
  HI.DIM_REQUEST_ID_HIST,
  HI.REQUEST_NK,
  HI.PAGE_TITLE,
  HI.PAGE_DESCR,
  HI.PAGE_KEYWORDS,
  NK.CNT AS HIST_CNT,
  HI.DELETE_FLAG,
  strftime('%d.%m.%Y %H:%M', datetime(HI.UPDATE_DT, 'unixepoch')) AS UPDATE_DT
FROM
  ( SELECT REQUEST_NK, MAX(DIM_REQUEST_ID) AS DIM_REQUEST_ID, SUM(1) AS CNT
    FROM DIM_REQUEST
    GROUP BY REQUEST_NK
  ) NK,
  DIM_REQUEST HI
WHERE 1 = 1
  AND NK.REQUEST_NK = HI.REQUEST_NK
  AND NK.DIM_REQUEST_ID = HI.DIM_REQUEST_ID;

A he ʻike kahi e hōʻiliʻili ai ka ʻike mōʻaukala no kēlā me kēia komo. Pono e kūkulu i kahi pilina kūpono o ka mōʻaukala me nā ʻoiaʻiʻo.

Nā helu helu pūnaewele hou aku i kāu waihona liʻiliʻi

Hōʻike mōʻaukala o SCD2

/* Content: actual view on scd table */
SELECT SCD.DIM_REQUEST_ID,
  SCD.DIM_REQUEST_ID_HIST,
  SCD.REQUEST_NK,
  SCD.PAGE_TITLE,
  SCD.PAGE_DESCR,
  SCD.PAGE_KEYWORDS,
  SCD.DELETE_FLAG,
  CASE
    WHEN HIS.UPDATE_DT IS NULL
    THEN 1
    ELSE 0 END ACTIVE_FLAG,
  SCD.DIM_REQUEST_ID_HIST AS ID_FROM,
  SCD.DIM_REQUEST_ID AS ID_TO,
  CASE
    WHEN SCD.DIM_REQUEST_ID_HIST=-1
    THEN 3600
    ELSE IFNULL(SCD.UPDATE_DT,3600)
  END AS TIME_FROM,
  CASE
    WHEN HIS.UPDATE_DT IS NULL
    THEN 253370764800
    ELSE HIS.UPDATE_DT
  END AS TIME_TO,
  CASE
    WHEN SCD.DIM_REQUEST_ID_HIST=-1
    THEN STRFTIME('%d.%m.%Y %H:%M', DATETIME(3600, 'unixepoch'))
    ELSE STRFTIME('%d.%m.%Y %H:%M', DATETIME(IFNULL(SCD.UPDATE_DT,3600), 'unixepoch'))
  END AS ACTIVE_FROM,
  CASE
    WHEN HIS.UPDATE_DT IS NULL
    THEN STRFTIME('%d.%m.%Y %H:%M', DATETIME(253370764800, 'unixepoch'))
    ELSE STRFTIME('%d.%m.%Y %H:%M', DATETIME(HIS.UPDATE_DT, 'unixepoch'))
  END AS ACTIVE_TO
FROM
  DIM_REQUEST SCD
  LEFT OUTER JOIN DIM_REQUEST HIS
  ON SCD.REQUEST_NK = HIS.REQUEST_NK AND SCD.DIM_REQUEST_ID = HIS.DIM_REQUEST_ID_HIST;

Hoʻohui ʻikepili

Hiki iā ʻoe ke loiloi i ka ʻikepili i kahi pae kiʻekiʻe a ʻike i nā anomalies a me nā ʻano i ʻike ʻole ʻia i nā hōʻike kikoʻī.

No ka laʻana, hoʻohui i kahi pūʻulu i ke ana me nā code kūlana noi DIM_HTTP_STATUS:

KULA/PUU
0xx/na
1xx/ʻike
2xx/Polopono
3xx/Hōʻano hou
4xx/Hapa mea kūʻai
5xx/Hāpa Pūnaewele

E loaʻa i ka nui mea hoʻohana DIM_USER_AGENT nā ʻano AGENT_OS a me AGENT_BOT i kuleana no nā hui. Hiki ke hoʻonui ʻia kēia mau mea i ke kaʻina hana ETL:

Ke hoʻouka nei ʻo DIM_USER_AGENT

/* Propagate the user agent from access log */
INSERT INTO DIM_USER_AGENT (USER_AGENT_NK, AGENT_OS, AGENT_ENGINE, AGENT_DEVICE, AGENT_BOT, UPDATE_DT)
WITH CLS AS (
	SELECT BROWSER
	FROM STG_ACCESS_LOG WHERE LENGTH(BROWSER)>1
	GROUP BY BROWSER
)
SELECT
	CLS.BROWSER AS USER_AGENT_NK,
	CASE
	WHEN INSTR(CLS.BROWSER,'Macintosh')>0
		THEN 'Macintosh'
	WHEN INSTR(CLS.BROWSER,'iPhone')>0
			 OR INSTR(CLS.BROWSER,'iPad')>0
			 OR INSTR(CLS.BROWSER,'iPod')>0
			 OR INSTR(CLS.BROWSER,'Apple TV')>0
			 OR INSTR(CLS.BROWSER,'Darwin')>0
		THEN 'iOS'
	WHEN INSTR(CLS.BROWSER,'Android')>0
		THEN 'Android'
	WHEN INSTR(CLS.BROWSER,'X11;')>0 OR INSTR(CLS.BROWSER,'Wayland;')>0 OR INSTR(CLS.BROWSER,'linux-gnu')>0
		THEN 'Linux'
	WHEN INSTR(CLS.BROWSER,'BB10;')>0 OR INSTR(CLS.BROWSER,'BlackBerry')>0
		THEN 'BlackBerry'
	WHEN INSTR(CLS.BROWSER,'Windows')>0
		THEN 'Windows'
	ELSE 'n.a.' END AS AGENT_OS, -- OS
	CASE
	WHEN INSTR(CLS.BROWSER,'AppleCoreMedia')>0
		THEN 'AppleWebKit'
	WHEN INSTR(CLS.BROWSER,') ')>1 AND LENGTH(CLS.BROWSER)>INSTR(CLS.BROWSER,') ')
		THEN COALESCE(SUBSTR(CLS.BROWSER, INSTR(CLS.BROWSER,') ')+2, LENGTH(CLS.BROWSER) - INSTR(CLS.BROWSER,') ')-1), 'N/A')
	ELSE 'n.a.' END AS AGENT_ENGINE, -- Engine
	CASE
	WHEN INSTR(CLS.BROWSER,'iPhone')>0
		THEN 'iPhone'
	WHEN INSTR(CLS.BROWSER,'iPad')>0
		THEN 'iPad'
	WHEN INSTR(CLS.BROWSER,'iPod')>0
		THEN 'iPod'
	WHEN INSTR(CLS.BROWSER,'Apple TV')>0
		THEN 'Apple TV'
	WHEN INSTR(CLS.BROWSER,'Android ')>0 AND INSTR(CLS.BROWSER,'Build')>0
		THEN COALESCE(SUBSTR(CLS.BROWSER, INSTR(CLS.BROWSER,'Android '), INSTR(CLS.BROWSER,'Build')-INSTR(CLS.BROWSER,'Android ')), 'n.a.')
	WHEN INSTR(CLS.BROWSER,'Android ')>0 AND INSTR(CLS.BROWSER,'MIUI')>0
		THEN COALESCE(SUBSTR(CLS.BROWSER, INSTR(CLS.BROWSER,'Android '), INSTR(CLS.BROWSER,'MIUI')-INSTR(CLS.BROWSER,'Android ')), 'n.a.')
	ELSE 'n.a.' END AS AGENT_DEVICE, -- Device
	CASE
	WHEN INSTR(LOWER(CLS.BROWSER),'yandex.com')>0
		THEN 'yandex'
	WHEN INSTR(LOWER(CLS.BROWSER),'googlebot')>0
		THEN 'google'
	WHEN INSTR(LOWER(CLS.BROWSER),'bingbot')>0
		THEN 'microsoft'
	WHEN INSTR(LOWER(CLS.BROWSER),'ahrefsbot')>0
		THEN 'ahrefs'
	WHEN INSTR(LOWER(CLS.BROWSER),'jobboersebot')>0 OR INSTR(LOWER(CLS.BROWSER),'jobkicks')>0
		THEN 'job.de'
	WHEN INSTR(LOWER(CLS.BROWSER),'mail.ru')>0
		THEN 'mail.ru'
	WHEN INSTR(LOWER(CLS.BROWSER),'baiduspider')>0
		THEN 'baidu'
	WHEN INSTR(LOWER(CLS.BROWSER),'mj12bot')>0
		THEN 'majestic-12'
	WHEN INSTR(LOWER(CLS.BROWSER),'duckduckgo')>0
		THEN 'duckduckgo'
	WHEN INSTR(LOWER(CLS.BROWSER),'bytespider')>0
		THEN 'bytespider'
	WHEN INSTR(LOWER(CLS.BROWSER),'360spider')>0
		THEN 'so.360.cn'
	WHEN INSTR(LOWER(CLS.BROWSER),'compatible')>0 OR INSTR(LOWER(CLS.BROWSER),'http')>0
		OR INSTR(LOWER(CLS.BROWSER),'libwww')>0 OR INSTR(LOWER(CLS.BROWSER),'spider')>0
		OR INSTR(LOWER(CLS.BROWSER),'java')>0 OR INSTR(LOWER(CLS.BROWSER),'python')>0
		OR INSTR(LOWER(CLS.BROWSER),'robot')>0 OR INSTR(LOWER(CLS.BROWSER),'curl')>0 OR INSTR(LOWER(CLS.BROWSER),'wget')>0
		THEN 'other'
	ELSE 'n.a.' END AS AGENT_BOT, -- Bot
	STRFTIME('%s','now') AS UPDATE_DT
FROM CLS
LEFT OUTER JOIN DIM_USER_AGENT TRG
ON CLS.BROWSER = TRG.USER_AGENT_NK
WHERE TRG.DIM_USER_AGENT_ID IS NULL

Hoʻohui ʻikepili

Loaʻa i ka hoʻonohonoho ʻana i ka hoʻoili ʻana o ka ʻikepili mai ka ʻōnaehana hana i ka ʻōnaehana hōʻike. No ka hana ʻana i kēia, pono ʻoe e hana i kahi papaʻaina me kahi ʻano like me ke kumu.

Loaʻa ka ʻike e pili ana i nā ʻaoʻao pūnaewele i ke kahua mai ka waihona CMS ma ke ʻano o nā noi hoʻokomo.

Ke hoʻouka nei i ka papa mōʻaukala DIM_REQUEST me ka ʻikepili kumu i ʻekolu mau ʻanuʻu: hoʻouka ʻana i nā kī hou a me nā ʻano, ka hoʻonui ʻana i nā mea i loaʻa, a me ka hana ʻana i nā moʻolelo i holoi ʻia.

Ke hoʻouka nei i nā hoʻopaʻa leo SCD2 hou

/* Load request table SCD from master data */
INSERT INTO DIM_REQUEST (DIM_REQUEST_ID_HIST, REQUEST_NK, PAGE_TITLE, PAGE_DESCR, PAGE_KEYWORDS, DELETE_FLAG, UPDATE_DT)
WITH CLS  AS ( -- prepare keys
	SELECT
	'/' || NAME AS REQUEST_NK,
	TITLE       AS PAGE_TITLE,
	CASE WHEN DESCRIPTION = '' OR DESCRIPTION IS NULL
	     THEN 'n.a.' ELSE DESCRIPTION
	END AS PAGE_DESCR,
	CASE WHEN KEYWORDS = '' OR KEYWORDS IS NULL
	     THEN 'n.a.' ELSE KEYWORDS
	END AS PAGE_KEYWORDS
	FROM STG_CMS_MENU
	WHERE CONTENT_TYPE != 'folder' -- only web pages
	  AND PAGE_TITLE != 'n.a.' -- master data which make sense
)
/* new records from stage: CLS */
SELECT
	-1 AS DIM_REQUEST_ID_HIST,
	CLS.REQUEST_NK,
	CLS.PAGE_TITLE,
	CLS.PAGE_DESCR,
	CLS.PAGE_KEYWORDS,
	0 AS DELETE_FLAG,
	STRFTIME('%s','now') AS UPDATE_DT
FROM CLS
LEFT OUTER JOIN
 (
	SELECT
	DIM_REQUEST_ID,
	REQUEST_NK,
	PAGE_TITLE,
	PAGE_DESCR,
	PAGE_KEYWORDS
	FROM DIM_REQUEST_V_ACT
) TRG ON CLS.REQUEST_NK = TRG.REQUEST_NK
WHERE TRG.REQUEST_NK IS NULL -- no such record in data mart

Hoʻohou i nā ʻano SCD2

/* Load request table SCD from master data */
INSERT INTO DIM_REQUEST (DIM_REQUEST_ID_HIST, REQUEST_NK, PAGE_TITLE, PAGE_DESCR, PAGE_KEYWORDS, DELETE_FLAG, UPDATE_DT)
WITH CLS  AS ( -- prepare keys
	SELECT
	'/' || NAME AS REQUEST_NK,
	TITLE       AS PAGE_TITLE,
	CASE WHEN DESCRIPTION = '' OR DESCRIPTION IS NULL
	     THEN 'n.a.' ELSE DESCRIPTION
	END AS PAGE_DESCR,
	CASE WHEN KEYWORDS = '' OR KEYWORDS IS NULL
	     THEN 'n.a.' ELSE KEYWORDS
	END AS PAGE_KEYWORDS
	FROM STG_CMS_MENU
	WHERE CONTENT_TYPE != 'folder' -- only web pages
	  AND PAGE_TITLE != 'n.a.' -- master data which make sense
)
/* updated records from stage: CLS and build reference to history: HIST */
SELECT
	HIST.DIM_REQUEST_ID AS DIM_REQUEST_ID_HIST,
	HIST.REQUEST_NK,
	CLS.PAGE_TITLE,
	CLS.PAGE_DESCR,
	CLS.PAGE_KEYWORDS,
	0 AS DELETE_FLAG,
	STRFTIME('%s','now') AS UPDATE_DT
FROM CLS,
     DIM_REQUEST_V_ACT TRG,
     DIM_REQUEST HIST
WHERE CLS.REQUEST_NK = TRG.REQUEST_NK
  AND TRG.DIM_REQUEST_ID = HIST.DIM_REQUEST_ID
  AND ( CLS.PAGE_TITLE != HIST.PAGE_TITLE /* changes only */
     OR CLS.PAGE_DESCR != HIST.PAGE_DESCR
     OR CLS.PAGE_KEYWORDS != HIST.PAGE_KEYWORDS )

Holoi ʻia nā hoʻopaʻa leo SCD2

/* Load request table SCD from master data */
INSERT INTO DIM_REQUEST (DIM_REQUEST_ID_HIST, REQUEST_NK, PAGE_TITLE, PAGE_DESCR, PAGE_KEYWORDS, DELETE_FLAG, UPDATE_DT)
WITH CLS  AS ( -- prepare keys
	SELECT
	'/' || NAME AS REQUEST_NK,
	TITLE       AS PAGE_TITLE
	FROM STG_CMS_MENU
	WHERE CONTENT_TYPE != 'folder' -- only web pages
	  AND PAGE_TITLE != 'n.a.' -- master data which make sense
)
/*  deleted records in data mart: TRG */
SELECT
	TRG.DIM_REQUEST_ID AS DIM_REQUEST_ID_HIST,
	TRG.REQUEST_NK,
	TRG.PAGE_TITLE,
	TRG.PAGE_DESCR,
	TRG.PAGE_KEYWORDS,
	1 AS DELETE_FLAG,
	STRFTIME('%s','now') AS UPDATE_DT
FROM (
	SELECT
	DIM_REQUEST_ID,
	REQUEST_NK,
	PAGE_TITLE,
	PAGE_DESCR,
	PAGE_KEYWORDS
	FROM DIM_REQUEST_V_ACT
	WHERE PAGE_TITLE != 'n.a.' -- track master data only
	  AND DELETE_FLAG = 0 -- not already deleted
) TRG
LEFT OUTER JOIN CLS ON TRG.REQUEST_NK = CLS.REQUEST_NK
WHERE CLS.REQUEST_NK IS NULL -- no such record in stage

Pono e hui pū ʻia kēlā me kēia kumu ʻikepili me kahi wehewehe kikoʻī, no ka laʻana, ma ka faila readme.txt:

Ka mea i loaʻa i ka ʻikepili ma ke ʻano maʻamau: inoa, helu leka uila
Mea hoʻolako ʻikepili ma ke ʻano maʻamau: inoa, helu leka uila
Punaʻikepili: ala waihona, inoa lawelawe
ʻIkepili ʻikepili: mea hoʻohana a me nā ʻōlelo huna

E kōkua ke kiʻikuhi kahe o ka ʻikepili i ke kaʻina hana o ka mālama ʻana a me ka hoʻonui ʻana, no ka laʻana, ma ke ʻano kikokikona:

Ke neʻe nei i kahi faila. Puna: ftp.domain.net: /logs/access.log Target: /var/www/access.log
Heluhelu ma ke kahua. Pahuhopu: STG_ACCESS_LOG
Hoʻouka a hoʻololi. Pahuhopu: FCT_ACCESS_REQUEST_REF_HH
Hoʻouka a hoʻololi. Pahuhopu: FCT_ACCESS_USER_AGENT_DD
Hōʻike. Pahuhopu: /var/www/report.html

hopena

No laila, wehewehe ka ʻatikala i nā mīkini e like me ka hoʻohui ʻana o ka ʻikepili i lalo a me ka hoʻokomo ʻana i nā pae hou o ka hōʻuluʻulu. Pono lākou i ke kūkulu ʻana i nā waihona ʻikepili i mea e loaʻa ai ka ʻike hou aʻe a hoʻomaikaʻi i ka maikaʻi o ka ʻike.

Source: www.habr.com

Pākuʻi i ka manaʻo hoʻopuka