Prometheus: Abojuto HTTP nipasẹ olutaja Blackbox

Bawo ni gbogbo eniyan. Ni May OTUS ifilọlẹ idanileko lori ibojuwo ati gedu, mejeeji amayederun ati awọn ohun elo lilo Zabbix, Prometheus, Grafana ati ELK. Ni iyi yii, a ṣe pinpin awọn ohun elo ti o wulo lori koko-ọrọ ni aṣa ni aṣa.

Blackbox atajasita fun Prometheus gba ọ laaye lati ṣe ibojuwo ti awọn iṣẹ ita nipasẹ HTTP, HTTPS, DNS, TCP, ICMP. Ninu nkan yii, Emi yoo fihan ọ bi o ṣe le ṣeto ibojuwo HTTP/HTTPS nipa lilo olutaja Blackbox. A yoo ṣe ifilọlẹ Blackbox atajasita ni Kubernetes.

Ayika

A yoo nilo awọn wọnyi:

  • Kubernetes
  • Prometheus onišẹ

Atajasita blackbox iṣeto ni

Tito leto Blackbox nipasẹ ConfigMap fun eto http awọn iṣẹ wẹẹbu ibojuwo module.

apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
data:
  blackbox.yaml: |
    modules:
      http_2xx:
        http:
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Module http_2xx ti a lo lati ṣayẹwo pe iṣẹ wẹẹbu da koodu ipo HTTP 2xx pada. Atunto atajasita blackbox jẹ apejuwe ni awọn alaye diẹ sii ni iwe.

Gbigbe olutaja apoti dudu si iṣupọ Kubernetes kan

Apejuwe Deployment и Service fun imuṣiṣẹ ni Kubernetes.

---
kind: Service
apiVersion: v1
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  type: ClusterIP
  ports:
    - name: http
      port: 9115
      protocol: TCP
  selector:
    app: prometheus-blackbox-exporter

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  replicas: 1
  selector:
    matchLabels:
      app: prometheus-blackbox-exporter
  template:
    metadata:
      labels:
        app: prometheus-blackbox-exporter
    spec:
      restartPolicy: Always
      containers:
        - name: blackbox-exporter
          image: "prom/blackbox-exporter:v0.15.1"
          imagePullPolicy: IfNotPresent
          securityContext:
            readOnlyRootFilesystem: true
            runAsNonRoot: true
            runAsUser: 1000
          args:
            - "--config.file=/config/blackbox.yaml"
          resources:
            {}
          ports:
            - containerPort: 9115
              name: http
          livenessProbe:
            httpGet:
              path: /health
              port: http
          readinessProbe:
            httpGet:
              path: /health
              port: http
          volumeMounts:
            - mountPath: /config
              name: config
        - name: configmap-reload
          image: "jimmidyson/configmap-reload:v0.2.2"
          imagePullPolicy: "IfNotPresent"
          securityContext:
            runAsNonRoot: true
            runAsUser: 65534
          args:
            - --volume-dir=/etc/config
            - --webhook-url=http://localhost:9115/-/reload
          resources:
            {}
          volumeMounts:
            - mountPath: /etc/config
              name: config
              readOnly: true
      volumes:
        - name: config
          configMap:
            name: prometheus-blackbox-exporter

Blackbox atajasita le ti wa ni ransogun lilo pipaṣẹ wọnyi. Ààyè orúkọ monitoring ntokasi si Prometheus onišẹ.

kubectl --namespace=monitoring apply -f blackbox-exporter.yaml

Rii daju pe gbogbo awọn iṣẹ nṣiṣẹ nipa lilo aṣẹ atẹle:

kubectl --namespace=monitoring get all --selector=app=prometheus-blackbox-exporter

Blackbox ayẹwo

O le wọle si wiwo oju opo wẹẹbu atajasita Blackbox nipa lilo port-forward:

kubectl --namespace=monitoring port-forward svc/prometheus-blackbox-exporter 9115:9115

Sopọ si oju opo wẹẹbu atajasita Blackbox nipasẹ ẹrọ aṣawakiri wẹẹbu kan ni localhost: 9115.

Prometheus: Abojuto HTTP nipasẹ olutaja Blackbox

Ti o ba lọ si adirẹsi http://localhost:9115/probe?module=http_2xx&target=https://www.google.com, iwọ yoo rii abajade ti ṣiṣe ayẹwo URL ti o pato (https://www.google.com).

Prometheus: Abojuto HTTP nipasẹ olutaja Blackbox

Iwọn metiriki probe_success dogba si 1 tumo si aseyori ayẹwo. Iye kan ti 0 tọkasi aṣiṣe kan.

Ṣiṣeto Prometheus

Lẹhin gbigbe olutaja BlackBox, a tunto Prometheus sinu prometheus-additional.yaml.

- job_name: 'kube-api-blackbox'
  scrape_interval: 1w
  metrics_path: /probe
  params:
    module: [http_2xx]
  static_configs:
   - targets:
      - https://www.google.com
      - http://www.example.com
      - https://prometheus.io
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

A ṣe ipilẹṣẹ Secretlilo awọn wọnyi pipaṣẹ.

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

Pato additional-scrape-configs fun Prometheus oniṣẹ lilo additionalScrapeConfigs.

kubectl --namespace=monitoring edit prometheuses k8s
...
spec:
  additionalScrapeConfigs:
    key: prometheus-additional.yaml
    name: additional-scrape-configs

A lọ si oju opo wẹẹbu Prometheus ati ṣayẹwo awọn metiriki ati awọn ibi-afẹde.

kubectl --namespace=monitoring port-forward svc/prometheus-k8s 9090:9090

Prometheus: Abojuto HTTP nipasẹ olutaja Blackbox

Prometheus: Abojuto HTTP nipasẹ olutaja Blackbox

A rii awọn metiriki ati awọn ibi-afẹde ti Blackbox.

Ṣafikun awọn ofin fun awọn iwifunni (titaniji)

Lati gba awọn iwifunni lati atajasita Blackbox, a yoo ṣafikun awọn ofin si Oluṣeto Prometheus.

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: blackbox-exporter
    rules:
    - alert: ProbeFailed
      expr: probe_success == 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "Probe failed (instance {{ $labels.instance }})"
        description: "Probe failedn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowProbe
      expr: avg_over_time(probe_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow probe (instance {{ $labels.instance }})"
        description: "Blackbox probe took more than 1s to completen  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpStatusCode
      expr: probe_http_status_code <= 199 OR probe_http_status_code >= 400
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "HTTP Status Code (instance {{ $labels.instance }})"
        description: "HTTP status code is not 200-399n  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateWillExpireSoon
      expr: probe_ssl_earliest_cert_expiry - time() < 86400 * 30
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "SSL certificate will expire soon (instance {{ $labels.instance }})"
        description: "SSL certificate expires in 30 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateHasExpired
      expr: probe_ssl_earliest_cert_expiry - time()  <= 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "SSL certificate has expired (instance {{ $labels.instance }})"
        description: "SSL certificate has expired alreadyn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpSlowRequests
      expr: avg_over_time(probe_http_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "HTTP slow requests (instance {{ $labels.instance }})"
        description: "HTTP request took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowPing
      expr: avg_over_time(probe_icmp_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow ping (instance {{ $labels.instance }})"
        description: "Blackbox ping took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

Ni wiwo oju opo wẹẹbu Prometheus, lọ si Ipo => Awọn ofin ati wa awọn ofin itaniji fun blackbox-exporter.

Prometheus: Abojuto HTTP nipasẹ olutaja Blackbox

Ṣiṣeto Kubernetes API Server Awọn iwifunni Ipari Iwe-ẹri SSL

Jẹ ki a tunto Kubernetes API Server Abojuto ipari ijẹrisi SSL. Yoo firanṣẹ awọn iwifunni lẹẹkan ni ọsẹ kan.

Ṣafikun module atajasita Blackbox fun Ijeri olupin API Kubernetes.

kubectl --namespace=monitoring edit configmap prometheus-blackbox-exporter
...
      kube-api:
        http:
          method: GET
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          tls_config:
            insecure_skip_verify: false
            ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
          bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Fifi Prometheus scrape iṣeto ni

- job_name: 'kube-api-blackbox'
  metrics_path: /probe
  params:
    module: [kube-api]
  static_configs:
   - targets:
      - https://kubernetes.default.svc/api
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

Lilo Prometheus Secret

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

Fifi awọn ofin gbigbọn

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: k8s-api-server-cert-expiry
    rules:
    - alert: K8sAPIServerSSLCertExpiringAfterThreeMonths
      expr: probe_ssl_earliest_cert_expiry{job="kube-api-blackbox"} - time() < 86400 * 90 
      for: 1w
      labels:
        severity: warning
      annotations:
        summary: "Kubernetes API Server SSL certificate will expire after three months (instance {{ $labels.instance }})"
        description: "Kubernetes API Server SSL certificate expires in 90 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

wulo awọn ọna asopọ

Abojuto ati wíwọlé ni Docker

orisun: www.habr.com