Imeandikwa kwa lugha ya Rust, ina sifa ya utendaji wa juu na matumizi ya chini ya RAM ikilinganishwa na analogues zake. Kwa kuongeza, tahadhari nyingi hulipwa kwa kazi zinazohusiana na usahihi, hasa, uwezo wa kuokoa matukio yasiyotumwa kwenye buffer kwenye diski na kuzunguka faili.
Kwa usanifu, Vector ni kipanga njia cha tukio ambacho hupokea ujumbe kutoka kwa moja au zaidi vyanzo, ikituma kwa hiari juu ya barua pepe hizi mabadiliko, na kuzituma kwa moja au zaidi mifereji ya maji.
Vekta ni mbadala wa mpigo wa faili na logstash, inaweza kuchukua hatua katika majukumu yote mawili (kupokea na kutuma kumbukumbu), maelezo zaidi juu yao. Online.
Ikiwa kwenye Logstash mnyororo umejengwa kama pembejeo → kichungi → pato basi iko kwenye Vector vyanzo → hubadili → kuzama
Mifano inaweza kupatikana katika nyaraka.
Maagizo haya ni maagizo yaliyorekebishwa kutoka Vyacheslav Rakhinsky. Maagizo asilia yana usindikaji wa geoip. Wakati wa kujaribu geoip kutoka kwa mtandao wa ndani, vekta ilitoa hitilafu.
Aug 05 06:25:31.889 DEBUG transform{name=nginx_parse_rename_fields type=rename_fields}: vector::transforms::rename_fields: Field did not exist field=«geoip.country_name» rate_limit_secs=30
Ikiwa mtu yeyote anahitaji kuchakata geoip, basi rejelea maagizo asili kutoka Vyacheslav Rakhinsky.
Tutasanidi mchanganyiko wa Nginx (Kumbukumbu za Ufikiaji) → Vekta (Mteja | Mdundo wa faili) → Vekta (Seva | Logstash) → kando katika Clickhouse na kando katika Elasticsearch. Tutaweka seva 4. Ingawa unaweza kuipitisha na seva 3.
Mpango ni kitu kama hiki.
Lemaza Selinux kwenye seva zako zote
sed -i 's/^SELINUX=.*/SELINUX=disabled/g' /etc/selinux/config
reboot
Tunasakinisha emulator ya seva ya HTTP + huduma kwenye seva zote
ClickHouse hutumia seti ya maagizo ya SSE 4.2, kwa hivyo isipokuwa ikiwa imeainishwa vinginevyo, msaada wake katika kichakataji kinachotumiwa huwa hitaji la ziada la mfumo. Hapa kuna amri ya kuangalia ikiwa processor ya sasa inasaidia SSE 4.2:
Inasanidi Elasticsearch kwa modi ya nodi moja shard 1, nakala 0. Uwezekano mkubwa zaidi utakuwa na nguzo ya idadi kubwa ya seva na huna haja ya kufanya hivyo.
Kwa faharasa za siku zijazo, sasisha kiolezo chaguo-msingi:
INFO vector::topology::builder: Healthcheck: Passed.
INFO vector::topology::builder: Healthcheck: Passed.
Kwenye mteja (Seva ya Wavuti) - seva ya 1
Kwenye seva iliyo na nginx, unahitaji kuzima ipv6, kwani jedwali la kumbukumbu kwenye bofya hutumia uwanja. upstream_addr IPv4, kwani situmii ipv6 ndani ya mtandao. Ikiwa ipv6 haijazimwa, kutakuwa na makosa:
DB::Exception: Invalid IPv4 value.: (while read the value of key upstream_addr)
Kwanza, tunahitaji kusanidi umbizo la logi katika Nginx kwenye faili /etc/nginx/nginx.conf
user nginx;
# you must set worker processes based on your CPU cores, nginx does not benefit from setting more than that
worker_processes auto; #some last versions calculate it automatically
# number of file descriptors used for nginx
# the limit for the maximum FDs on the server is usually set by the OS.
# if you don't set FD's then OS settings will be used which is by default 2000
worker_rlimit_nofile 100000;
error_log /var/log/nginx/error.log warn;
pid /var/run/nginx.pid;
# provides the configuration file context in which the directives that affect connection processing are specified.
events {
# determines how much clients will be served per worker
# max clients = worker_connections * worker_processes
# max clients is also limited by the number of socket connections available on the system (~64k)
worker_connections 4000;
# optimized to serve many clients with each thread, essential for linux -- for testing environment
use epoll;
# accept as many connections as possible, may flood worker connections if set too low -- for testing environment
multi_accept on;
}
http {
include /etc/nginx/mime.types;
default_type application/octet-stream;
log_format main '$remote_addr - $remote_user [$time_local] "$request" '
'$status $body_bytes_sent "$http_referer" '
'"$http_user_agent" "$http_x_forwarded_for"';
log_format vector escape=json
'{'
'"node_name":"nginx-vector",'
'"timestamp":"$time_iso8601",'
'"server_name":"$server_name",'
'"request_full": "$request",'
'"request_user_agent":"$http_user_agent",'
'"request_http_host":"$http_host",'
'"request_uri":"$request_uri",'
'"request_scheme": "$scheme",'
'"request_method":"$request_method",'
'"request_length":"$request_length",'
'"request_time": "$request_time",'
'"request_referrer":"$http_referer",'
'"response_status": "$status",'
'"response_body_bytes_sent":"$body_bytes_sent",'
'"response_content_type":"$sent_http_content_type",'
'"remote_addr": "$remote_addr",'
'"remote_port": "$remote_port",'
'"remote_user": "$remote_user",'
'"upstream_addr": "$upstream_addr",'
'"upstream_bytes_received": "$upstream_bytes_received",'
'"upstream_bytes_sent": "$upstream_bytes_sent",'
'"upstream_cache_status":"$upstream_cache_status",'
'"upstream_connect_time":"$upstream_connect_time",'
'"upstream_header_time":"$upstream_header_time",'
'"upstream_response_length":"$upstream_response_length",'
'"upstream_response_time":"$upstream_response_time",'
'"upstream_status": "$upstream_status",'
'"upstream_content_type":"$upstream_http_content_type"'
'}';
access_log /var/log/nginx/access.log main;
access_log /var/log/nginx/access.json.log vector; # Новый лог в формате json
sendfile on;
#tcp_nopush on;
keepalive_timeout 65;
#gzip on;
include /etc/nginx/conf.d/*.conf;
}
Ili usivunje usanidi wako wa sasa, Nginx hukuruhusu kuwa na maagizo kadhaa ya ufikiaji_logi
access_log /var/log/nginx/access.log main; # Стандартный лог
access_log /var/log/nginx/access.json.log vector; # Новый лог в формате json
Usisahau kuongeza sheria ya kusajili kumbukumbu mpya (ikiwa faili ya kumbukumbu haiishii kwa .log)
Na usanidi uingizwaji wa Filebeat katika usanidi /etc/vector/vector.toml. Anwani ya IP 172.26.10.108 ni anwani ya IP ya seva ya kumbukumbu (Vector-Server)
data_dir = "/var/lib/vector"
[sources.nginx_file]
type = "file"
include = [ "/var/log/nginx/access.json.log" ]
start_at_beginning = false
fingerprinting.strategy = "device_and_inode"
[sinks.nginx_output_vector]
type = "vector"
inputs = [ "nginx_file" ]
address = "172.26.10.108:9876"
Usisahau kuongeza mtumiaji wa vekta kwenye kikundi kinachohitajika ili aweze kusoma faili za kumbukumbu. Kwa mfano, nginx katika centos huunda kumbukumbu na haki za kikundi cha adm.
usermod -a -G adm vector
Wacha tuanze huduma ya vector
systemctl enable vector
systemctl start vector
Kumbukumbu za Vector zinaweza kutazamwa kama hii:
journalctl -f -u vector
Lazima kuwe na kiingilio kama hiki kwenye magogo
INFO vector::topology::builder: Healthcheck: Passed.
Mtihani wa Stress
Upimaji unafanywa kwa kutumia alama ya Apache.
Kifurushi cha zana za httpd kilisakinishwa kwenye seva zote
Tunaanza kujaribu kwa kutumia alama ya Apache kutoka kwa seva 4 tofauti kwenye skrini. Kwanza, tunazindua multiplexer ya terminal ya skrini, na kisha tunaanza kupima kwa kutumia alama ya Apache. Jinsi ya kufanya kazi na skrini unaweza kupata ndani Ibara ya.
Kutoka kwa seva ya 1
while true; do ab -H "User-Agent: 1server" -c 100 -n 10 -t 10 http://vhost1/; sleep 1; done
Kutoka kwa seva ya 2
while true; do ab -H "User-Agent: 2server" -c 100 -n 10 -t 10 http://vhost2/; sleep 1; done
Kutoka kwa seva ya 3
while true; do ab -H "User-Agent: 3server" -c 100 -n 10 -t 10 http://vhost3/; sleep 1; done
Kutoka kwa seva ya 4
while true; do ab -H "User-Agent: 4server" -c 100 -n 10 -t 10 http://vhost4/; sleep 1; done
select concat(database, '.', table) as table,
formatReadableSize(sum(bytes)) as size,
sum(rows) as rows,
max(modification_time) as latest_modification,
sum(bytes) as bytes_size,
any(engine) as engine,
formatReadableSize(sum(primary_key_bytes_in_memory)) as primary_keys_size
from system.parts
where active
group by database, table
order by bytes_size desc;
Wacha tujue ni kumbukumbu ngapi zilichukua kwenye Clickhouse.
Ukubwa wa jedwali la kumbukumbu ni 857.19 MB.
Saizi ya data sawa katika faharasa katika Elasticsearch ni 4,5GB.
Ikiwa hutabainisha data katika vector katika vigezo, Clickhouse inachukua 4500/857.19 = mara 5.24 chini kuliko katika Elasticsearch.
Katika vector, uwanja wa compression hutumiwa kwa chaguo-msingi.