Prometheus: HTTP saib xyuas ntawm Blackbox exporter

Nyob zoo sawv daws. Thaum lub Tsib Hlis OTUS launches rhiav ntawm kev saib xyuas thiab txiav, ob qho tib si infrastructure thiab daim ntawv thov siv Zabbix, Prometheus, Grafana thiab ELK. Hauv qhov no, peb ib txwm muab cov ntaub ntawv tseem ceeb ntawm lub ncauj lus.

Blackbox exporter rau Prometheus tso cai rau koj los saib xyuas cov kev pabcuam sab nraud ntawm HTTP, HTTPS, DNS, TCP, ICMP. Hauv tsab xov xwm no, kuv yuav qhia koj yuav ua li cas teeb tsa HTTP / HTTPS saib xyuas siv Blackbox exporter. Peb yuav tso tawm Blackbox exporter hauv Kubernetes.

Ib puag ncig

Peb yuav xav tau cov hauv qab no:

  • Kubernetes
  • Prometheus Operator

Exporter blackbox configuration

Configuring Blackbox ntawm ConfigMap rau qhov chaw http web services xyuas module.

apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
data:
  blackbox.yaml: |
    modules:
      http_2xx:
        http:
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Module http_2xx siv los xyuas tias lub vev xaib kev pabcuam xa rov qab HTTP 2xx txoj cai code. Lub blackbox exporter configuration yog piav nyob rau hauv kom meej ntxiv nyob rau hauv cov ntaub ntawv.

Siv lub blackbox exporter rau Kubernetes pawg

Piav Deployment ΠΈ Service rau kev xa tawm hauv Kubernetes.

---
kind: Service
apiVersion: v1
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  type: ClusterIP
  ports:
    - name: http
      port: 9115
      protocol: TCP
  selector:
    app: prometheus-blackbox-exporter

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: prometheus-blackbox-exporter
  labels:
    app: prometheus-blackbox-exporter
spec:
  replicas: 1
  selector:
    matchLabels:
      app: prometheus-blackbox-exporter
  template:
    metadata:
      labels:
        app: prometheus-blackbox-exporter
    spec:
      restartPolicy: Always
      containers:
        - name: blackbox-exporter
          image: "prom/blackbox-exporter:v0.15.1"
          imagePullPolicy: IfNotPresent
          securityContext:
            readOnlyRootFilesystem: true
            runAsNonRoot: true
            runAsUser: 1000
          args:
            - "--config.file=/config/blackbox.yaml"
          resources:
            {}
          ports:
            - containerPort: 9115
              name: http
          livenessProbe:
            httpGet:
              path: /health
              port: http
          readinessProbe:
            httpGet:
              path: /health
              port: http
          volumeMounts:
            - mountPath: /config
              name: config
        - name: configmap-reload
          image: "jimmidyson/configmap-reload:v0.2.2"
          imagePullPolicy: "IfNotPresent"
          securityContext:
            runAsNonRoot: true
            runAsUser: 65534
          args:
            - --volume-dir=/etc/config
            - --webhook-url=http://localhost:9115/-/reload
          resources:
            {}
          volumeMounts:
            - mountPath: /etc/config
              name: config
              readOnly: true
      volumes:
        - name: config
          configMap:
            name: prometheus-blackbox-exporter

Blackbox exporter tuaj yeem siv tau siv cov lus txib hauv qab no. Namespace monitoring hais txog Prometheus Operator.

kubectl --namespace=monitoring apply -f blackbox-exporter.yaml

Xyuas kom tseeb tias txhua qhov kev pabcuam tau ua haujlwm siv cov lus txib hauv qab no:

kubectl --namespace=monitoring get all --selector=app=prometheus-blackbox-exporter

Blackbox check

Koj tuaj yeem nkag mus rau Blackbox exporter web interface siv port-forward:

kubectl --namespace=monitoring port-forward svc/prometheus-blackbox-exporter 9115:9115

Txuas mus rau Blackbox exporter web interface ntawm lub web browser ntawm localhost: 9115.

Prometheus: HTTP saib xyuas ntawm Blackbox exporter

Yog koj mus rau qhov chaw nyob http://localhost:9115/probe?module=http_2xx&target=https://www.google.com, koj yuav pom qhov tshwm sim ntawm kev txheeb xyuas qhov URL teev (https://www.google.com).

Prometheus: HTTP saib xyuas ntawm Blackbox exporter

Metric nqi probe_success sib npaug li 1 txhais tau tias kev kuaj xyuas tiav. Tus nqi ntawm 0 qhia qhov yuam kev.

Teeb tsa Prometheus

Tom qab xa tawm BlackBox exporter, peb teeb tsa Prometheus hauv prometheus-additional.yaml.

- job_name: 'kube-api-blackbox'
  scrape_interval: 1w
  metrics_path: /probe
  params:
    module: [http_2xx]
  static_configs:
   - targets:
      - https://www.google.com
      - http://www.example.com
      - https://prometheus.io
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

Peb tsim Secretsiv cov lus txib hauv qab no.

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

Peb qhia tau additional-scrape-configs rau Prometheus Operator siv additionalScrapeConfigs.

kubectl --namespace=monitoring edit prometheuses k8s
...
spec:
  additionalScrapeConfigs:
    key: prometheus-additional.yaml
    name: additional-scrape-configs

Peb mus rau Prometheus web interface thiab xyuas cov ntsuas thiab cov hom phiaj.

kubectl --namespace=monitoring port-forward svc/prometheus-k8s 9090:9090

Prometheus: HTTP saib xyuas ntawm Blackbox exporter

Prometheus: HTTP saib xyuas ntawm Blackbox exporter

Peb pom cov kev ntsuas thiab cov hom phiaj ntawm Blackbox.

Ntxiv cov cai rau kev ceeb toom (alert)

Yuav kom tau txais cov ntawv ceeb toom los ntawm Blackbox exporter, peb yuav ntxiv cov cai rau Prometheus Operator.

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: blackbox-exporter
    rules:
    - alert: ProbeFailed
      expr: probe_success == 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "Probe failed (instance {{ $labels.instance }})"
        description: "Probe failedn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowProbe
      expr: avg_over_time(probe_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow probe (instance {{ $labels.instance }})"
        description: "Blackbox probe took more than 1s to completen  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpStatusCode
      expr: probe_http_status_code <= 199 OR probe_http_status_code >= 400
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "HTTP Status Code (instance {{ $labels.instance }})"
        description: "HTTP status code is not 200-399n  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateWillExpireSoon
      expr: probe_ssl_earliest_cert_expiry - time() < 86400 * 30
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "SSL certificate will expire soon (instance {{ $labels.instance }})"
        description: "SSL certificate expires in 30 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SslCertificateHasExpired
      expr: probe_ssl_earliest_cert_expiry - time()  <= 0
      for: 5m
      labels:
        severity: error
      annotations:
        summary: "SSL certificate has expired (instance {{ $labels.instance }})"
        description: "SSL certificate has expired alreadyn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: HttpSlowRequests
      expr: avg_over_time(probe_http_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "HTTP slow requests (instance {{ $labels.instance }})"
        description: "HTTP request took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"
    - alert: SlowPing
      expr: avg_over_time(probe_icmp_duration_seconds[1m]) > 1
      for: 5m
      labels:
        severity: warning
      annotations:
        summary: "Slow ping (instance {{ $labels.instance }})"
        description: "Blackbox ping took more than 1sn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

Hauv Prometheus web interface, mus rau Status => Cov Cai thiab nrhiav cov cai ceeb toom rau blackbox-exporter.

Prometheus: HTTP saib xyuas ntawm Blackbox exporter

Configuring Kubernetes API Server SSL Certificate Expiration Notifications

Cia peb teeb tsa Kubernetes API Server SSL daim ntawv pov thawj tas sij hawm saib xyuas. Nws yuav xa cov ntawv ceeb toom ib zaug ib lub lim tiam.

Ntxiv rau Blackbox exporter module rau Kubernetes API Server Authentication.

kubectl --namespace=monitoring edit configmap prometheus-blackbox-exporter
...
      kube-api:
        http:
          method: GET
          no_follow_redirects: false
          preferred_ip_protocol: ip4
          tls_config:
            insecure_skip_verify: false
            ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
          bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
          valid_http_versions:
          - HTTP/1.1
          - HTTP/2
          valid_status_codes: []
        prober: http
        timeout: 5s

Ntxiv Prometheus scrape configuration

- job_name: 'kube-api-blackbox'
  metrics_path: /probe
  params:
    module: [kube-api]
  static_configs:
   - targets:
      - https://kubernetes.default.svc/api
  relabel_configs:
   - source_labels: [__address__]
     target_label: __param_target
   - source_labels: [__param_target]
     target_label: instance
   - target_label: __address__
     replacement: prometheus-blackbox-exporter:9115 # The blackbox exporter.

Siv Prometheus Secret

PROMETHEUS_ADD_CONFIG=$(cat prometheus-additional.yaml | base64)
cat << EOF | kubectl --namespace=monitoring apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: additional-scrape-configs
type: Opaque
data:
  prometheus-additional.yaml: $PROMETHEUS_ADD_CONFIG
EOF

Ntxiv cov cai ceeb toom

kubectl --namespace=monitoring edit prometheusrules prometheus-k8s-rules
...
  - name: k8s-api-server-cert-expiry
    rules:
    - alert: K8sAPIServerSSLCertExpiringAfterThreeMonths
      expr: probe_ssl_earliest_cert_expiry{job="kube-api-blackbox"} - time() < 86400 * 90 
      for: 1w
      labels:
        severity: warning
      annotations:
        summary: "Kubernetes API Server SSL certificate will expire after three months (instance {{ $labels.instance }})"
        description: "Kubernetes API Server SSL certificate expires in 90 daysn  VALUE = {{ $value }}n  LABELS: {{ $labels }}"

Pab kev sib txuas lus

Saib xyuas thiab nkag rau hauv Docker

Tau qhov twg los: www.hab.com