Prometheus: Pemantauan HTTP melalui pengeksport Blackbox

Hai semua. Pada bulan Mei OTUS dilancarkan bengkel pemantauan dan pembalakan, kedua-dua infrastruktur dan aplikasi menggunakan Zabbix, Prometheus, Grafana dan ELK. Dalam hal ini, kami secara tradisinya berkongsi bahan berguna mengenai topik tersebut.

Pengeksport kotak hitam untuk Prometheus membolehkan anda melaksanakan pemantauan perkhidmatan luaran melalui HTTP, HTTPS, DNS, TCP, ICMP. Dalam artikel ini, saya akan menunjukkan kepada anda cara menyediakan pemantauan HTTP/HTTPS menggunakan pengeksport Blackbox. Kami akan melancarkan pengeksport Blackbox di Kubernetes.

Persekitaran

Kami akan memerlukan yang berikut:

  • Kubernetes
  • Operator Prometheus

Konfigurasi kotak hitam pengeksport

Mengkonfigurasi Kotak Hitam melalui ConfigMap untuk tetapan http modul pemantauan perkhidmatan web.

apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
data:
  blackbox.yaml: |
    modules:
      http_2xx:
        http:
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Modul http_2xx digunakan untuk menyemak bahawa perkhidmatan web mengembalikan kod status HTTP 2xx. Konfigurasi pengeksport kotak hitam diterangkan dengan lebih terperinci dalam dokumentasi.

Menggunakan pengeksport kotak hitam ke gugusan Kubernetes

Huraikan Deployment ΠΈ Service untuk penempatan dalam Kubernetes.

---
kind: Service
apiVersion: v1
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  type: ClusterIP
  ports:
    - name: http
      port: 9115
      protocol: TCP
  selector:
    app: prometheus-blackbox-exporter

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  replicas: 1
  selector:
    matchLabels:
      app: prometheus-blackbox-exporter
  template:
    metadata:
      labels:
        app: prometheus-blackbox-exporter
    spec:
      restartPolicy: Always
      containers:
        - name: blackbox-exporter
          image: "prom/blackbox-exporter:v0.15.1"
          imagePullPolicy: IfNotPresent
          securityContext:
            readOnlyRootFilesystem: true
            runAsNonRoot: true
            runAsUser: 1000
          args:
            - "--config.file=/config/blackbox.yaml"
          resources:
            {}
          ports:
            - containerPort: 9115
              name: http
          livenessProbe:
            httpGet:
              path: /health
              port: http
          readinessProbe:
            httpGet:
              path: /health
              port: http
          volumeMounts:
            - mountPath: /config
              name: config
        - name: configmap-reload
          image: "jimmidyson/configmap-reload:v0.2.2"
          imagePullPolicy: "IfNotPresent"
          securityContext:
            runAsNonRoot: true
            runAsUser: 65534
          args:
            - --volume-dir=/etc/config
            - --webhook-url=http://localhost:9115/-/reload
          resources:
            {}
          volumeMounts:
            - mountPath: /etc/config
              name: config
              readOnly: true
      volumes:
        - name: config
          configMap:
            name: prometheus-blackbox-exporter

Pengeksport kotak hitam boleh digunakan menggunakan arahan berikut. Ruang nama monitoring merujuk kepada Prometheus Operator.

kubectl --namespace=monitoring apply -f blackbox-exporter.yaml

Pastikan semua perkhidmatan berjalan menggunakan arahan berikut:

kubectl --namespace=monitoring get all --selector=app=prometheus-blackbox-exporter

Semak kotak hitam

Anda boleh mengakses antara muka web pengeksport Blackbox menggunakan port-forward:

kubectl --namespace=monitoring port-forward svc/prometheus-blackbox-exporter 9115:9115

Sambung ke antara muka web pengeksport Blackbox melalui pelayar web di localhost: 9115.

Prometheus: Pemantauan HTTP melalui pengeksport Blackbox

Jika anda pergi ke alamat http://localhost:9115/probe?module=http_2xx&target=https://www.google.com, anda akan melihat hasil semakan URL yang ditentukan (https://www.google.com).

Prometheus: Pemantauan HTTP melalui pengeksport Blackbox

Nilai metrik probe_success sama dengan 1 bermakna semakan berjaya. Nilai 0 menunjukkan ralat.

Menyediakan Prometheus

Selepas menggunakan pengeksport BlackBox, kami mengkonfigurasi Prometheus masuk prometheus-additional.yaml.

- job_name: 'kube-api-blackbox'
  scrape_interval: 1w
  metrics_path: /probe
  params:
    module: [http_2xx]
  static_configs:
   - targets:
      - https://www.google.com
      - http://www.example.com
      - https://prometheus.io
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

Kami menjana Secretmenggunakan arahan berikut.

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

Kami menunjukkan additional-scrape-configs untuk Prometheus Operator menggunakan additionalScrapeConfigs.

kubectl --namespace=monitoring edit prometheuses k8s
...
spec:
  additionalScrapeConfigs:
    key: prometheus-additional.yaml
    name: additional-scrape-configs

Kami pergi ke antara muka web Prometheus dan menyemak metrik dan matlamat.

kubectl --namespace=monitoring port-forward svc/prometheus-k8s 9090:9090

Prometheus: Pemantauan HTTP melalui pengeksport Blackbox

Prometheus: Pemantauan HTTP melalui pengeksport Blackbox

Kami melihat metrik dan matlamat Blackbox.

Menambah peraturan untuk pemberitahuan (makluman)

Untuk menerima pemberitahuan daripada pengeksport Blackbox, kami akan menambah peraturan pada Prometheus Operator.

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: blackbox-exporter
    rules:
    - alert: ProbeFailed
      expr: probe_success == 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "Probe failed (instance {{ $labels.instance }})"
        description: "Probe failedn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowProbe
      expr: avg_over_time(probe_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow probe (instance {{ $labels.instance }})"
        description: "Blackbox probe took more than 1s to completen  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpStatusCode
      expr: probe_http_status_code <= 199 OR probe_http_status_code >= 400
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "HTTP Status Code (instance {{ $labels.instance }})"
        description: "HTTP status code is not 200-399n  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateWillExpireSoon
      expr: probe_ssl_earliest_cert_expiry - time() < 86400 * 30
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "SSL certificate will expire soon (instance {{ $labels.instance }})"
        description: "SSL certificate expires in 30 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateHasExpired
      expr: probe_ssl_earliest_cert_expiry - time()  <= 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "SSL certificate has expired (instance {{ $labels.instance }})"
        description: "SSL certificate has expired alreadyn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpSlowRequests
      expr: avg_over_time(probe_http_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "HTTP slow requests (instance {{ $labels.instance }})"
        description: "HTTP request took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowPing
      expr: avg_over_time(probe_icmp_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow ping (instance {{ $labels.instance }})"
        description: "Blackbox ping took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

Dalam antara muka web Prometheus, pergi ke Status => Peraturan dan cari peraturan amaran untuk pengeksport kotak hitam.

Prometheus: Pemantauan HTTP melalui pengeksport Blackbox

Mengkonfigurasi Pemberitahuan Tamat Tempoh Sijil SSL Pelayan API Kubernetes

Mari konfigurasikan pemantauan tamat tempoh sijil SSL Pelayan API Kubernetes. Ia akan menghantar pemberitahuan seminggu sekali.

Menambah modul pengeksport Blackbox untuk Pengesahan Pelayan API Kubernetes.

kubectl --namespace=monitoring edit configmap prometheus-blackbox-exporter
...
      kube-api:
        http:
          method: GET
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          tls_config:
            insecure_skip_verify: false
            ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
          bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Menambah konfigurasi pengikisan Prometheus

- job_name: 'kube-api-blackbox'
  metrics_path: /probe
  params:
    module: [kube-api]
  static_configs:
   - targets:
      - https://kubernetes.default.svc/api
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

Menggunakan Prometheus Secret

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

Menambah peraturan amaran

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: k8s-api-server-cert-expiry
    rules:
    - alert: K8sAPIServerSSLCertExpiringAfterThreeMonths
      expr: probe_ssl_earliest_cert_expiry{job="kube-api-blackbox"} - time() < 86400 * 90 
      for: 1w
      labels:
        severity: warning
      annotations:
        summary: "Kubernetes API Server SSL certificate will expire after three months (instance {{ $labels.instance }})"
        description: "Kubernetes API Server SSL certificate expires in 90 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

Pautan berguna

Memantau dan log masuk Docker

Sumber: www.habr.com