0
0
Fork 0
mirror of https://github.com/netdata/netdata.git synced 2025-04-14 17:48:37 +00:00
netdata_netdata/docs
Costa Tsaousis 5f72d4279b
Streaming improvements No 3 ()
* ML uses synchronous queries

* do not call malloc_trim() to free memory, since to locks everything

* Reschedule dimensions for training from worker threads.

* when we collect or read from the database, it is SAMPLES. When we generate points for a chart is POINTS

* keep the receiver send buffer 10x the default

* support autoscaling stream circular buffers

* nd_poll() prefers sending data vs receiving data - in an attempt to dequeue as soon as possible

* fix last commit

* allow removing receiver and senders inline, if the stream thread is not working on them

* fix logs

* Revert "nd_poll() prefers sending data vs receiving data - in an attempt to dequeue as soon as possible"

This reverts commit 51539a97da.

* do not access receiver or sender after it has been removed

* open cache hot2clean

* open cache hot2clean does not need flushing

* use aral for extent pages up to 65k

* track aral malloc and mmap allocations separately; add 8192 as a possible value to PGD

* do not evict too frequently if not needed

* fix aral metrics

* fix aral metrics again

* accurate accounting of memory for dictionaries, strings, labels and MRG

* log during shutdown the progress of dbengine flushing

* move metasync shutfown after dbengine

* max iterations per I/O events

* max iterations per I/O events - break the loop

* max iterations per I/O events - break the loop - again

* disable inline evictions for all caches

* when writing to sockets, send everything that can be sent

* cleanup code to trigger evictions

* fix calculation of eviction size

* fix calculation of eviction size once more

* fix calculation of eviction size once more - again

* ml and replication stop while backfilling is running

* process opcodes while draining the sockets; log with limit when asking to disconnect a node

* fix log

* ml stops when replication queries are running

* report pgd_padding to pulse

* aral precise memory accounting

* removed all alignas() and fix the 2 issues that resulted in unaligned memory accesses (one in mqtt and another in streaming)

* remove the bigger sizes from PGD, but keep multiples of gorilla buffers

* exclude judy from sanitizers

* use 16 bytes alignment on 32 bit machines

* internal check about memory alignment

* experiment: do not allow more children to connect while there is backfilling or replication queries running

* when the node is initializing, retry in 30 seconds

* connector cleanup and isolation of control logic about enabling/disabling various parts

* stop also health queries while backfilling is running

* tuning

* drain the input

* improve interactivity when suspending

* more interactive stream_control

* debug logs to find the connection issue

* abstracted everything about stream control

* Add ml_host_{start,stop} again.

* Do not create/update anomaly-detection charts when ML is not running for a host.

* rrdhost flag RECEIVER_DISCONNECTED has been reversed to COLLECTOR_ONLINE and has been used for localhost and virtual hosts too, to have a single point of truth about the availability of collected data or not

* ml_host_start() and ml_host_stop() are used by streaming receivers; ml_host_start() is used for localhost and virtual hosts

* fixed typo

* allow up to 3 backfills at a time

* add throttling based on user queries

* restore cache line paddings

* unify streaming logs to make it easier to grep logs

* tuning of stream_control

* more logs unification

* use mallocz_release_as_much_memory_to_the_system() under extreme conditions

* do not rely on the response code of evict_pages()

* log the gap of the database every time a node is connected

* updated ram requirements

---------

Co-authored-by: vkalintiris <vasilis@netdata.cloud>
2024-12-11 18:02:17 +02:00
..
.templates Docs fixes () 2024-10-04 12:23:00 +03:00
alerts-and-notifications docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
category-overview-pages docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
dashboards-and-charts docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
deployment-guides docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
developer-and-contributor-corner Balance streaming parents () 2024-12-05 18:17:36 +02:00
diagrams Balance streaming parents () 2024-12-05 18:17:36 +02:00
exporting-metrics docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
netdata-agent Streaming improvements No 3 () 2024-12-11 18:02:17 +02:00
netdata-cloud docs: leftover links + changes on api-tokens.md () 2024-12-09 11:37:21 +00:00
observability-centralization-points docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
security-and-privacy-design docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
Demo-Sites.md Docs fixes () 2024-10-04 12:23:00 +03:00
DICTIONARY.md Docs edits on Cloud versions and On Prem () 2024-11-29 16:05:17 +02:00
glossary.md docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
guidelines.md docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
metric-correlations.md docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
netdata-assistant.md docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00
top-monitoring-netdata-functions.md docs: format, typos, and some simplifications in docs/ () 2024-11-30 21:14:36 +02:00