In lagu qoro luqadda Rust, waxaa lagu gartaa waxqabadka sare iyo isticmaalka RAM oo hooseeya marka la barbar dhigo kuwa lamid ah. Intaa waxaa dheer, fiiro gaar ah ayaa la siiyaa hawlaha la xiriira saxnaanta, gaar ahaan, awoodda lagu badbaadinayo dhacdooyinka aan la soo dirin ee kaydinta diskka iyo wareejinta faylasha.
Qaab dhismeed ahaan, Vector waa router dhacdo oo ka hela fariimaha hal ama ka badan ilo, si ikhtiyaari ah u codsanaya farriimahan isbeddellada, una dirto mid ama ka badan biyo-mareennada.
Vector waa bedelka filebeat iyo logstash, waxay u dhaqmi kartaa labada door (heli oo soo dir logs), faahfaahin dheeraad ah iyaga goobta.
Haddii Logstash ku taal silsiladda waxaa loo dhisay sida gelida → filter → wax soo saar ka dibna Vector waa ilo → beddelasho → waaskada
Tusaalooyinka waxaa laga heli karaa dukumeentiyada.
Tilmaantan waa tilmaan dib loo eegay Vyacheslav Rakhinsky. Tilmaamaha asalka ah waxaa ku jira habaynta geoip. Markii la tijaabiyay geoip ee shabakada gudaha, vector wuxuu bixiyay qalad.
Aug 05 06:25:31.889 DEBUG transform{name=nginx_parse_rename_fields type=rename_fields}: vector::transforms::rename_fields: Field did not exist field=«geoip.country_name» rate_limit_secs=30
Haddii qof uu u baahan yahay inuu farsameeyo geoip, ka dibna tixraac tilmaamaha asalka ah ee ka yimid Vyacheslav Rakhinsky.
Waxaan u habeyn doonaa isku-darka Nginx (Galitaanka galitaanka) → Vector (macmiil | Filebeat) → Vector (Server | Logstash) → si gaar ah Clickhouse iyo si gooni ah Elasticsearch. Waxaan ku rakibi doonaa 4 server. Inkasta oo aad ku dhaafi karto 3 server.
Nidaamku waa wax sidan oo kale ah.
Dami Selinux dhammaan server-yadaada
sed -i 's/^SELINUX=.*/SELINUX=disabled/g' /etc/selinux/config
reboot
Waxaan ku rakibnaa emulator server HTTP ah + utilities dhammaan server-yada
ClickHouse waxay isticmaashaa SSE 4.2 tilmaame, markaa haddii aan si kale loo cayimin, taageerada processor-ka la isticmaalay waxay noqonaysaa shuruudo dheeraad ah. Waa kan amarka lagu hubinayo haddii processor-ka hadda uu taageersan yahay SSE 4.2:
Habaynta Elasticsearch ee qaabka hal-noodka ah 1 shard, 0 nuqul ah. Waxay u badan tahay inaad haysato koox tiro badan oo adeegayaal ah uma baahnid inaad tan samayso.
Marka hore, waxaan u baahanahay inaan ku habeyno qaabka log ee Nginx ee faylka /etc/nginx/nginx.conf
user nginx;
# you must set worker processes based on your CPU cores, nginx does not benefit from setting more than that
worker_processes auto; #some last versions calculate it automatically
# number of file descriptors used for nginx
# the limit for the maximum FDs on the server is usually set by the OS.
# if you don't set FD's then OS settings will be used which is by default 2000
worker_rlimit_nofile 100000;
error_log /var/log/nginx/error.log warn;
pid /var/run/nginx.pid;
# provides the configuration file context in which the directives that affect connection processing are specified.
events {
# determines how much clients will be served per worker
# max clients = worker_connections * worker_processes
# max clients is also limited by the number of socket connections available on the system (~64k)
worker_connections 4000;
# optimized to serve many clients with each thread, essential for linux -- for testing environment
use epoll;
# accept as many connections as possible, may flood worker connections if set too low -- for testing environment
multi_accept on;
}
http {
include /etc/nginx/mime.types;
default_type application/octet-stream;
log_format main '$remote_addr - $remote_user [$time_local] "$request" '
'$status $body_bytes_sent "$http_referer" '
'"$http_user_agent" "$http_x_forwarded_for"';
log_format vector escape=json
'{'
'"node_name":"nginx-vector",'
'"timestamp":"$time_iso8601",'
'"server_name":"$server_name",'
'"request_full": "$request",'
'"request_user_agent":"$http_user_agent",'
'"request_http_host":"$http_host",'
'"request_uri":"$request_uri",'
'"request_scheme": "$scheme",'
'"request_method":"$request_method",'
'"request_length":"$request_length",'
'"request_time": "$request_time",'
'"request_referrer":"$http_referer",'
'"response_status": "$status",'
'"response_body_bytes_sent":"$body_bytes_sent",'
'"response_content_type":"$sent_http_content_type",'
'"remote_addr": "$remote_addr",'
'"remote_port": "$remote_port",'
'"remote_user": "$remote_user",'
'"upstream_addr": "$upstream_addr",'
'"upstream_bytes_received": "$upstream_bytes_received",'
'"upstream_bytes_sent": "$upstream_bytes_sent",'
'"upstream_cache_status":"$upstream_cache_status",'
'"upstream_connect_time":"$upstream_connect_time",'
'"upstream_header_time":"$upstream_header_time",'
'"upstream_response_length":"$upstream_response_length",'
'"upstream_response_time":"$upstream_response_time",'
'"upstream_status": "$upstream_status",'
'"upstream_content_type":"$upstream_http_content_type"'
'}';
access_log /var/log/nginx/access.log main;
access_log /var/log/nginx/access.json.log vector; # Новый лог в формате json
sendfile on;
#tcp_nopush on;
keepalive_timeout 65;
#gzip on;
include /etc/nginx/conf.d/*.conf;
}
Si aanad u jabin qaabayntaada hadda, Nginx waxa ay kuu ogolaanaysaa inaad haysato dhawr dardaaran oo gelis_log ah
access_log /var/log/nginx/access.log main; # Стандартный лог
access_log /var/log/nginx/access.json.log vector; # Новый лог в формате json
Ha iloobin inaad ku darto qaanuun aad ku qorto diiwaanka cusub (haddii faylka logu aanu ku dhammaanayn .log)
Ka saar default.conf /etc/nginx/conf.d/
rm -f /etc/nginx/conf.d/default.conf
Ku dar martigeliyaha casriga ah /etc/nginx/conf.d/vhost1.conf
Oo habee beddelka Filebeat ee /etc/vector/vector.toml config. Cinwaanka IP 172.26.10.108 waa cinwaanka IP-ga ee server-ka log (Vector-Server)
data_dir = "/var/lib/vector"
[sources.nginx_file]
type = "file"
include = [ "/var/log/nginx/access.json.log" ]
start_at_beginning = false
fingerprinting.strategy = "device_and_inode"
[sinks.nginx_output_vector]
type = "vector"
inputs = [ "nginx_file" ]
address = "172.26.10.108:9876"
Ha iloobin inaad ku darto isticmaale vector kooxda loo baahan yahay si uu u akhriyo galalka log. Tusaale ahaan, nginx in centos waxay abuurtaa qoraallo leh xuquuqaha kooxda.
usermod -a -G adm vector
Aan bilowno adeega vector
systemctl enable vector
systemctl start vector
Vector logs waxaa loo arki karaa sidan:
journalctl -f -u vector
Waa in sidan oo kale loo soo galaa
INFO vector::topology::builder: Healthcheck: Passed.
select concat(database, '.', table) as table,
formatReadableSize(sum(bytes)) as size,
sum(rows) as rows,
max(modification_time) as latest_modification,
sum(bytes) as bytes_size,
any(engine) as engine,
formatReadableSize(sum(primary_key_bytes_in_memory)) as primary_keys_size
from system.parts
where active
group by database, table
order by bytes_size desc;
Aynu ogaano inta ay le'eg tahay diiwaannada laga qaaday Clickhouse.
Cabbirka miiska loggu waa 857.19 MB.
Cabbirka isla xogta ku jirta tusaha Elasticsearch waa 4,5GB.
Haddii aadan ku qeexin xogta ku jirta halbeegyada, Clickhouse waxay qaadataa 4500/857.19 = 5.24 jeer ka yar kan Elasticsearch.
Xagga vector-ka, goobta isku-buufinta ayaa si caadi ah loo isticmaalaa.