Prometheus: HTTP monitoring via the Blackbox exporter

Hello, everyone. In May, OTUS is launching a workshop on monitoring and logging for both infrastructure and applications using Zabbix, Prometheus, Grafana and ELK. To mark the occasion, we are sharing useful material on the topic, as we traditionally do.

The Blackbox exporter for Prometheus lets you monitor external services over HTTP, HTTPS, DNS, TCP and ICMP. In this article I will show how to set up HTTP/HTTPS monitoring with the Blackbox exporter. We will run the Blackbox exporter in Kubernetes.

Environment

We will need the following:

  • Kubernetes
  • Prometheus Operator

Blackbox exporter configuration

We configure Blackbox through a ConfigMap that defines the http module used to monitor web services.

apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
data:
  blackbox.yaml: |
    modules:
      http_2xx:
        http:
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

The http_2xx module checks that a web service returns an HTTP 2xx status code. Leaving valid_status_codes empty means that any 2xx code is accepted. The Blackbox exporter configuration is described in more detail in the documentation.

Deploying the Blackbox exporter to a Kubernetes cluster

Describe the Deployment and the Service to deploy to Kubernetes.
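To make the role of the empty valid_status_codes list concrete, here is a minimal Python sketch of the decision the http prober makes about a response status. It assumes the documented default (an empty list means "any 2xx"). The function name status_is_valid is hypothetical, and this is an illustration rather than the exporter's actual code.

```python
def status_is_valid(status_code, valid_status_codes=()):
    """Return True if the status code would count as a successful probe.

    Assumption: an empty valid_status_codes list falls back to "any 2xx",
    which is the documented default of the http prober.
    """
    if valid_status_codes:
        # An explicit list replaces the default: only listed codes pass.
        return status_code in valid_status_codes
    return 200 <= status_code <= 299


if __name__ == "__main__":
    for code in (200, 204, 301, 404, 503):
        print(code, status_is_valid(code))
```

With an explicit list such as [200, 401], only those codes pass, which can be handy for endpoints that are expected to demand authentication.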

---
kind: Service
apiVersion: v1
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  type: ClusterIP
  ports:
    - name: http
      port: 9115
      protocol: TCP
  selector:
    app: prometheus-blackbox-exporter

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  replicas: 1
  selector:
    matchLabels:
      app: prometheus-blackbox-exporter
  template:
    metadata:
      labels:
        app: prometheus-blackbox-exporter
    spec:
      restartPolicy: Always
      containers:
        - name: blackbox-exporter
          image: "prom/blackbox-exporter:v0.15.1"
          imagePullPolicy: IfNotPresent
          securityContext:
            readOnlyRootFilesystem: true
            runAsNonRoot: true
            runAsUser: 1000
          args:
            - "--config.file=/config/blackbox.yaml"
          resources:
            {}
          ports:
            - containerPort: 9115
              name: http
          livenessProbe:
            httpGet:
              path: /health
              port: http
          readinessProbe:
            httpGet:
              path: /health
              port: http
          volumeMounts:
            - mountPath: /config
              name: config
        - name: configmap-reload
          image: "jimmidyson/configmap-reload:v0.2.2"
          imagePullPolicy: "IfNotPresent"
          securityContext:
            runAsNonRoot: true
            runAsUser: 65534
          args:
            - --volume-dir=/etc/config
            - --webhook-url=http://localhost:9115/-/reload
          resources:
            {}
          volumeMounts:
            - mountPath: /etc/config
              name: config
              readOnly: true
      volumes:
        - name: config
          configMap:
            name: prometheus-blackbox-exporter

The Blackbox exporter can be deployed with the following command. The monitoring namespace is the one used by the Prometheus Operator.

kubectl --namespace=monitoring apply -f blackbox-exporter.yaml

Check that all the resources are running with the following command:

kubectl --namespace=monitoring get all --selector=app=prometheus-blackbox-exporter

Blackbox check

You can reach the Blackbox exporter web interface using port-forward:

kubectl --namespace=monitoring port-forward svc/prometheus-blackbox-exporter 9115:9115

Open the Blackbox exporter web interface in your web browser at localhost:9115.

If you go to http://localhost:9115/probe?module=http_2xx&target=https://www.google.com, you will see the result of probing the specified URL (https://www.google.com).

A probe_success value of 1 means the check succeeded. A value of 0 indicates a failure.
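The /probe endpoint answers in the Prometheus text format, so the result can also be read by a script. The sketch below parses such a response and checks probe_success. The sample text is illustrative and was not captured from a real probe. The helper name parse_probe_metrics is hypothetical, and the parser assumes lines without timestamps.

```python
def parse_probe_metrics(text):
    """Parse Prometheus text exposition lines into {metric_name: value}.

    Simplifications: comment lines (# HELP / # TYPE) are skipped, a metric
    with labels keeps its label part in the key, and no timestamps are
    expected after the value.
    """
    metrics = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # The value is the last whitespace-separated token on the line.
        name, _, value = line.rpartition(" ")
        metrics[name] = float(value)
    return metrics


# Illustrative sample only, not real probe output.
SAMPLE = """\
# HELP probe_success Displays whether or not the probe was a success
# TYPE probe_success gauge
probe_success 1
probe_http_status_code 200
probe_duration_seconds 0.25
"""

if __name__ == "__main__":
    metrics = parse_probe_metrics(SAMPLE)
    print("probe succeeded:", metrics["probe_success"] == 1)
```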

Configuring Prometheus

After deploying the Blackbox exporter, we configure Prometheus in prometheus-additional.yaml.

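- job_name: 'blackbox-http'  # job names must be unique; 'kube-api-blackbox' is used for the API server job later
  scrape_interval: 1m        # probe often enough for the 5m alert windows to see fresh samples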
  metrics_path: /probe
  params:
    module: [http_2xx]
  static_configs:
   - targets:
      - https://www.google.com
      - http://www.example.com
      - https://prometheus.io
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.
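The relabel_configs block is what turns each listed URL into a request to the exporter rather than a direct scrape of the site. The Python sketch below walks through the three rules on a plain dictionary of labels. It is a simplified model of relabeling, not Prometheus code, and the function names are hypothetical.

```python
from urllib.parse import urlencode


def apply_blackbox_relabeling(address, module,
                              exporter="prometheus-blackbox-exporter:9115"):
    """Apply the three relabel rules to the labels of one static target."""
    # Before relabeling: the target URL is the address, and the "module"
    # entry under params is exposed as the __param_module label.
    labels = {"__address__": address, "__param_module": module}
    # Rule 1: copy the original target URL into the ?target= query parameter.
    labels["__param_target"] = labels["__address__"]
    # Rule 2: keep the probed URL as the visible "instance" label.
    labels["instance"] = labels["__param_target"]
    # Rule 3: send the actual scrape to the Blackbox exporter instead.
    labels["__address__"] = exporter
    return labels


def scrape_url(labels, metrics_path="/probe"):
    """Build the URL that would be requested for the relabeled target."""
    params = {key[len("__param_"):]: value
              for key, value in labels.items()
              if key.startswith("__param_")}
    return "http://" + labels["__address__"] + metrics_path + "?" + urlencode(params)


if __name__ == "__main__":
    relabeled = apply_blackbox_relabeling("https://www.google.com", "http_2xx")
    print(relabeled["instance"])
    print(scrape_url(relabeled))
```

The printed URL has the same shape as the one opened manually in the browser earlier, only addressed to the Service name instead of localhost.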

We generate a Secret using the following command.

PROMETHEUS_ADD_CONFIG=$(base64 < prometheus-additional.yaml | tr -d '\n')
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF
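The value under data must be a single base64 string, and some base64 tools wrap long output across several lines, which breaks the manifest. As a hedged alternative to the shell pipeline, the sketch below builds the same Secret manifest in Python, where b64encode never inserts line breaks. The helper names encode_secret_value and render_secret are hypothetical.

```python
import base64


def encode_secret_value(config_text):
    """Base64-encode a config for the data field of a Secret (one line)."""
    return base64.b64encode(config_text.encode("utf-8")).decode("ascii")


def render_secret(name, key, config_text):
    """Render a minimal Opaque Secret manifest as YAML text."""
    return (
        "apiVersion: v1\n"
        "kind: Secret\n"
        "metadata:\n"
        f"  name: {name}\n"
        "type: Opaque\n"
        "data:\n"
        f"  {key}: {encode_secret_value(config_text)}\n"
    )


if __name__ == "__main__":
    # Illustrative config text; in practice read prometheus-additional.yaml.
    sample = "- job_name: 'example'\n  metrics_path: /probe\n"
    print(render_secret("additional-scrape-configs",
                        "prometheus-additional.yaml", sample))
```

The rendered text can be piped to kubectl apply -f - in the same way as the heredoc above.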

We point the Prometheus Operator at the additional-scrape-configs Secret using additionalScrapeConfigs.

kubectl --namespace=monitoring edit prometheuses k8s
...
spec:
  additionalScrapeConfigs:
    key: prometheus-additional.yaml
    name: additional-scrape-configs

We open the Prometheus web interface and check the metrics and targets.

kubectl --namespace=monitoring port-forward svc/prometheus-k8s 9090:9090

We can see the Blackbox metrics and targets.

Adding alerting rules

To receive notifications from the Blackbox exporter, we add rules to the Prometheus Operator.

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: blackbox-exporter
    rules:
    - alert: ProbeFailed
      expr: probe_success == 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "Probe failed (instance {{ $labels.instance }})"
        description: "Probe failed\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
    - alert: SlowProbe
      expr: avg_over_time(probe_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow probe (instance {{ $labels.instance }})"
        description: "Blackbox probe took more than 1s to complete\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
    - alert: HttpStatusCode
      expr: probe_http_status_code <= 199 OR probe_http_status_code >= 400
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "HTTP Status Code (instance {{ $labels.instance }})"
        description: "HTTP status code is not 200-399\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
    - alert: SslCertificateWillExpireSoon
      expr: probe_ssl_earliest_cert_expiry - time() < 86400 * 30
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "SSL certificate will expire soon (instance {{ $labels.instance }})"
        description: "SSL certificate expires in 30 days\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
    - alert: SslCertificateHasExpired
      expr: probe_ssl_earliest_cert_expiry - time()  <= 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "SSL certificate has expired (instance {{ $labels.instance }})"
        description: "SSL certificate has expired already\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
    - alert: HttpSlowRequests
      expr: avg_over_time(probe_http_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "HTTP slow requests (instance {{ $labels.instance }})"
        description: "HTTP request took more than 1s\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
    - alert: SlowPing
      expr: avg_over_time(probe_icmp_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow ping (instance {{ $labels.instance }})"
        description: "Blackbox ping took more than 1s\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"
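The certificate and status code rules are plain arithmetic on the probe metrics, which is easy to check by hand. The Python sketch below mirrors those expressions for a single sample. It does not model the for: 5m hold period or PromQL evaluation, and the function names are hypothetical.

```python
SECONDS_PER_DAY = 86400


def cert_alerts(cert_expiry_ts, now_ts, warn_days=30):
    """Return the names of the certificate alerts whose expression matches.

    cert_expiry_ts stands in for probe_ssl_earliest_cert_expiry and now_ts
    for time(); both are Unix timestamps in seconds.
    """
    remaining = cert_expiry_ts - now_ts
    alerts = []
    if remaining < SECONDS_PER_DAY * warn_days:
        alerts.append("SslCertificateWillExpireSoon")
    if remaining <= 0:
        alerts.append("SslCertificateHasExpired")
    return alerts


def http_status_alert(status_code):
    """Mirror of: probe_http_status_code <= 199 OR probe_http_status_code >= 400."""
    return status_code <= 199 or status_code >= 400


if __name__ == "__main__":
    now = 1_700_000_000  # arbitrary fixed timestamp for the example
    print(cert_alerts(now + 10 * SECONDS_PER_DAY, now))
    print(http_status_alert(404))
```

Note that an expired certificate satisfies both expressions, so both alerts would be raised for it.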

In the Prometheus web interface, go to Status => Rules and find the alerting rules for blackbox-exporter.

Configuring Kubernetes API Server SSL certificate expiry notifications

Let's set up monitoring of the Kubernetes API Server SSL certificate expiry. It will send notifications once a week.

Adding a Blackbox exporter module for Kubernetes API Server authentication

kubectl --namespace=monitoring edit configmap prometheus-blackbox-exporter
...
      kube-api:
        http:
          method: GET
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          tls_config:
            insecure_skip_verify: false
            ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
          bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Adding the Prometheus scrape configuration

- job_name: 'kube-api-blackbox'
  metrics_path: /probe
  params:
    module: [kube-api]
  static_configs:
   - targets:
      - https://kubernetes.default.svc/api
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

Applying the Prometheus Secret

PROMETHEUS_ADD_CONFIG=$(base64 < prometheus-additional.yaml | tr -d '\n')
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

Adding alerting rules

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: k8s-api-server-cert-expiry
    rules:
    - alert: K8sAPIServerSSLCertExpiringAfterThreeMonths
      expr: probe_ssl_earliest_cert_expiry{job="kube-api-blackbox"} - time() < 86400 * 90 
      for: 1w
      labels:
        severity: warning
      annotations:
        summary: "Kubernetes API Server SSL certificate will expire after three months (instance {{ $labels.instance }})"
        description: "Kubernetes API Server SSL certificate expires in 90 days\n  VALUE = {{ $value }}\n  LABELS: {{ $labels }}"

Useful links

Monitoring and logging in Docker

Source: www.habr.com