Prometheus: Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ํ†ตํ•œ HTTP ๋ชจ๋‹ˆํ„ฐ๋ง

์•ˆ๋…•ํ•˜์„ธ์š” ์—ฌ๋Ÿฌ๋ถ„. XNUMX์›” OTUS ์ถœ์‹œ ๋ชจ๋‹ˆํ„ฐ๋ง ๋ฐ ๋กœ๊น…์— ๊ด€ํ•œ ์›Œํฌ์ˆ, Zabbix, Prometheus, Grafana ๋ฐ ELK๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ์ธํ”„๋ผ์™€ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜ ๋ชจ๋‘. ์ด์™€ ๊ด€๋ จํ•˜์—ฌ ์šฐ๋ฆฌ๋Š” ์ „ํ†ต์ ์œผ๋กœ ํ•ด๋‹น ์ฃผ์ œ์— ๋Œ€ํ•œ ์œ ์šฉํ•œ ์ž๋ฃŒ๋ฅผ ๊ณต์œ ํ•ฉ๋‹ˆ๋‹ค.

๋ธ”๋ž™๋ฐ•์Šค ์ˆ˜์ถœ์—…์ฒด Prometheus์˜ ๊ฒฝ์šฐ HTTP, HTTPS, DNS, TCP, ICMP๋ฅผ ํ†ตํ•ด ์™ธ๋ถ€ ์„œ๋น„์Šค ๋ชจ๋‹ˆํ„ฐ๋ง์„ ๊ตฌํ˜„ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ด ๊ธ€์—์„œ๋Š” Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ HTTP/HTTPS ๋ชจ๋‹ˆํ„ฐ๋ง์„ ์„ค์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๋ณด์—ฌ๋“œ๋ฆฌ๊ฒ ์Šต๋‹ˆ๋‹ค. Kubernetes์—์„œ Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ ๊ธฐ๋Šฅ์„ ์ถœ์‹œํ•  ์˜ˆ์ •์ž…๋‹ˆ๋‹ค.

ํ™˜๊ฒฝ

๋‹ค์Œ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.

  • Kubernetes
  • ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค ์˜คํผ๋ ˆ์ดํ„ฐ

๋‚ด๋ณด๋‚ด๊ธฐ ๋ธ”๋ž™๋ฐ•์Šค ๊ตฌ์„ฑ

๋ธ”๋ž™๋ฐ•์Šค ๊ตฌ์„ฑ์„ ํ†ตํ•ด ConfigMap ์„ค์ •์šฉ http ์›น ์„œ๋น„์Šค ๋ชจ๋‹ˆํ„ฐ๋ง ๋ชจ๋“ˆ.

apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
data:
  blackbox.yaml: |
    modules:
      http_2xx:
        http:
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

๊ธฐ์ค€ ์น˜์ˆ˜ http_2xx ์›น ์„œ๋น„์Šค๊ฐ€ HTTP 2xx ์ƒํƒœ ์ฝ”๋“œ๋ฅผ ๋ฐ˜ํ™˜ํ•˜๋Š”์ง€ ํ™•์ธํ•˜๋Š” ๋ฐ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. ๋ธ”๋ž™๋ฐ•์Šค ๋‚ด๋ณด๋‚ด๊ธฐ ๊ตฌ์„ฑ์— ๋Œ€ํ•œ ์ž์„ธํ•œ ๋‚ด์šฉ์€ ์„ ์  ์„œ๋ฅ˜ ๋น„์น˜.

Kubernetes ํด๋Ÿฌ์Šคํ„ฐ์— ๋ธ”๋ž™๋ฐ•์Šค ๋‚ด๋ณด๋‚ด๊ธฐ ๋„๊ตฌ ๋ฐฐํฌ

์„ค๋ช…ํ•˜๋‹ค Deployment ะธ Service Kubernetes์— ๋ฐฐํฌํ•˜๊ธฐ ์œ„ํ•ด.

---
kind: Service
apiVersion: v1
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  type: ClusterIP
  ports:
    - name: http
      port: 9115
      protocol: TCP
  selector:
    app: prometheus-blackbox-exporter

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  replicas: 1
  selector:
    matchLabels:
      app: prometheus-blackbox-exporter
  template:
    metadata:
      labels:
        app: prometheus-blackbox-exporter
    spec:
      restartPolicy: Always
      containers:
        - name: blackbox-exporter
          image: "prom/blackbox-exporter:v0.15.1"
          imagePullPolicy: IfNotPresent
          securityContext:
            readOnlyRootFilesystem: true
            runAsNonRoot: true
            runAsUser: 1000
          args:
            - "--config.file=/config/blackbox.yaml"
          resources:
            {}
          ports:
            - containerPort: 9115
              name: http
          livenessProbe:
            httpGet:
              path: /health
              port: http
          readinessProbe:
            httpGet:
              path: /health
              port: http
          volumeMounts:
            - mountPath: /config
              name: config
        - name: configmap-reload
          image: "jimmidyson/configmap-reload:v0.2.2"
          imagePullPolicy: "IfNotPresent"
          securityContext:
            runAsNonRoot: true
            runAsUser: 65534
          args:
            - --volume-dir=/etc/config
            - --webhook-url=http://localhost:9115/-/reload
          resources:
            {}
          volumeMounts:
            - mountPath: /etc/config
              name: config
              readOnly: true
      volumes:
        - name: config
          configMap:
            name: prometheus-blackbox-exporter

Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋Š” ๋‹ค์Œ ๋ช…๋ น์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ฐฐํฌํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋„ค์ž„์ŠคํŽ˜์ด์Šค monitoring ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค ์˜คํผ๋ ˆ์ดํ„ฐ๋ฅผ ๋งํ•ฉ๋‹ˆ๋‹ค.

kubectl --namespace=monitoring apply -f blackbox-exporter.yaml

๋‹ค์Œ ๋ช…๋ น์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ชจ๋“  ์„œ๋น„์Šค๊ฐ€ ์‹คํ–‰๋˜๊ณ  ์žˆ๋Š”์ง€ ํ™•์ธํ•˜์‹ญ์‹œ์˜ค.

kubectl --namespace=monitoring get all --selector=app=prometheus-blackbox-exporter

๋ธ”๋ž™๋ฐ•์Šค ํ™•์ธ

๋‹ค์Œ์„ ์‚ฌ์šฉํ•˜์—ฌ Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ ์›น ์ธํ„ฐํŽ˜์ด์Šค์— ์•ก์„ธ์Šคํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. port-forward:

kubectl --namespace=monitoring port-forward svc/prometheus-blackbox-exporter 9115:9115

์›น ๋ธŒ๋ผ์šฐ์ €๋ฅผ ํ†ตํ•ด Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ ์›น ์ธํ„ฐํŽ˜์ด์Šค์— ์—ฐ๊ฒฐํ•˜์‹ญ์‹œ์˜ค. ๋กœ์ปฌ ํ˜ธ์ŠคํŠธ: 9115.

Prometheus: Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ํ†ตํ•œ HTTP ๋ชจ๋‹ˆํ„ฐ๋ง

ํ•ด๋‹น ์ฃผ์†Œ๋กœ ๊ฐ€๋ณด์‹œ๋ฉด http://localhost:9115/probe?module=http_2xx&target=https://www.google.com, ์ง€์ •๋œ URL์„ ํ™•์ธํ•œ ๊ฒฐ๊ณผ๋ฅผ ๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค(https://www.google.com).

Prometheus: Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ํ†ตํ•œ HTTP ๋ชจ๋‹ˆํ„ฐ๋ง

์ธก์ •ํ•ญ๋ชฉ ๊ฐ’ probe_success 1๊ณผ ๊ฐ™์œผ๋ฉด ์„ฑ๊ณต์ ์ธ ํ™•์ธ์„ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค. ๊ฐ’ 0์€ ์˜ค๋ฅ˜๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค.

ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค ์„ค์ •

BlackBox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ๋ฐฐํฌํ•œ ํ›„ Prometheus๋ฅผ ๊ตฌ์„ฑํ•ฉ๋‹ˆ๋‹ค. prometheus-additional.yaml.

- job_name: 'kube-api-blackbox'
  scrape_interval: 1w
  metrics_path: /probe
  params:
    module: [http_2xx]
  static_configs:
   - targets:
      - https://www.google.com
      - http://www.example.com
      - https://prometheus.io
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

์šฐ๋ฆฌ๋Š” ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค Secret๋‹ค์Œ ๋ช…๋ น์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

์ง€์ • additional-scrape-configs Prometheus Operator์˜ ๊ฒฝ์šฐ ๋‹ค์Œ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. additionalScrapeConfigs.

kubectl --namespace=monitoring edit prometheuses k8s
...
spec:
  additionalScrapeConfigs:
    key: prometheus-additional.yaml
    name: additional-scrape-configs

Prometheus ์›น ์ธํ„ฐํŽ˜์ด์Šค๋กœ ์ด๋™ํ•˜์—ฌ ์ธก์ •ํ•ญ๋ชฉ๊ณผ ๋ชฉํ‘œ๋ฅผ ํ™•์ธํ•ฉ๋‹ˆ๋‹ค.

kubectl --namespace=monitoring port-forward svc/prometheus-k8s 9090:9090

Prometheus: Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ํ†ตํ•œ HTTP ๋ชจ๋‹ˆํ„ฐ๋ง

Prometheus: Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ํ†ตํ•œ HTTP ๋ชจ๋‹ˆํ„ฐ๋ง

Blackbox์˜ ์ง€ํ‘œ์™€ ๋ชฉํ‘œ๋ฅผ ํ™•์ธํ•ฉ๋‹ˆ๋‹ค.

์•Œ๋ฆผ(๊ฒฝ๊ณ )์— ๋Œ€ํ•œ ๊ทœ์น™ ์ถ”๊ฐ€

Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋กœ๋ถ€ํ„ฐ ์•Œ๋ฆผ์„ ๋ฐ›๊ธฐ ์œ„ํ•ด Prometheus Operator์— ๊ทœ์น™์„ ์ถ”๊ฐ€ํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค.

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: blackbox-exporter
    rules:
    - alert: ProbeFailed
      expr: probe_success == 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "Probe failed (instance {{ $labels.instance }})"
        description: "Probe failedn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowProbe
      expr: avg_over_time(probe_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow probe (instance {{ $labels.instance }})"
        description: "Blackbox probe took more than 1s to completen  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpStatusCode
      expr: probe_http_status_code <= 199 OR probe_http_status_code >= 400
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "HTTP Status Code (instance {{ $labels.instance }})"
        description: "HTTP status code is not 200-399n  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateWillExpireSoon
      expr: probe_ssl_earliest_cert_expiry - time() < 86400 * 30
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "SSL certificate will expire soon (instance {{ $labels.instance }})"
        description: "SSL certificate expires in 30 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateHasExpired
      expr: probe_ssl_earliest_cert_expiry - time()  <= 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "SSL certificate has expired (instance {{ $labels.instance }})"
        description: "SSL certificate has expired alreadyn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpSlowRequests
      expr: avg_over_time(probe_http_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "HTTP slow requests (instance {{ $labels.instance }})"
        description: "HTTP request took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowPing
      expr: avg_over_time(probe_icmp_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow ping (instance {{ $labels.instance }})"
        description: "Blackbox ping took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

Prometheus ์›น ์ธํ„ฐํŽ˜์ด์Šค์—์„œ ์ƒํƒœ => ๊ทœ์น™์œผ๋กœ ์ด๋™ํ•˜์—ฌ blackbox-exporter์— ๋Œ€ํ•œ ๊ฒฝ๊ณ  ๊ทœ์น™์„ ์ฐพ์œผ์„ธ์š”.

Prometheus: Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ๋ฅผ ํ†ตํ•œ HTTP ๋ชจ๋‹ˆํ„ฐ๋ง

Kubernetes API ์„œ๋ฒ„ SSL ์ธ์ฆ์„œ ๋งŒ๋ฃŒ ์•Œ๋ฆผ ๊ตฌ์„ฑ

Kubernetes API ์„œ๋ฒ„ SSL ์ธ์ฆ์„œ ๋งŒ๋ฃŒ ๋ชจ๋‹ˆํ„ฐ๋ง์„ ๊ตฌ์„ฑํ•ด ๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค. ์ผ์ฃผ์ผ์— ํ•œ ๋ฒˆ์”ฉ ์•Œ๋ฆผ์„ ๋ณด๋‚ด๋“œ๋ฆฝ๋‹ˆ๋‹ค.

Kubernetes API ์„œ๋ฒ„ ์ธ์ฆ์„ ์œ„ํ•œ Blackbox ๋‚ด๋ณด๋‚ด๊ธฐ ๋ชจ๋“ˆ์„ ์ถ”๊ฐ€ํ•ฉ๋‹ˆ๋‹ค.

kubectl --namespace=monitoring edit configmap prometheus-blackbox-exporter
...
      kube-api:
        http:
          method: GET
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          tls_config:
            insecure_skip_verify: false
            ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
          bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Prometheus ์Šคํฌ๋žฉ ๊ตฌ์„ฑ ์ถ”๊ฐ€

- job_name: 'kube-api-blackbox'
  metrics_path: /probe
  params:
    module: [kube-api]
  static_configs:
   - targets:
      - https://kubernetes.default.svc/api
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

ํ”„๋กœ๋ฉ”ํ…Œ์šฐ์Šค ๋น„๋ฐ€ ์‚ฌ์šฉ

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

๊ฒฝ๊ณ  ๊ทœ์น™ ์ถ”๊ฐ€

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: k8s-api-server-cert-expiry
    rules:
    - alert: K8sAPIServerSSLCertExpiringAfterThreeMonths
      expr: probe_ssl_earliest_cert_expiry{job="kube-api-blackbox"} - time() < 86400 * 90 
      for: 1w
      labels:
        severity: warning
      annotations:
        summary: "Kubernetes API Server SSL certificate will expire after three months (instance {{ $labels.instance }})"
        description: "Kubernetes API Server SSL certificate expires in 90 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

์œ ์šฉํ•œ ๋งํฌ

Docker์—์„œ ๋ชจ๋‹ˆํ„ฐ๋ง ๋ฐ ๋กœ๊ทธ์ธ

์ถœ์ฒ˜ : habr.com