mirror of
https://github.com/netdata/netdata.git
synced 2025-04-28 14:42:31 +00:00
files movearound (#17653)
This commit is contained in:
parent
9597f023f6
commit
effe8e6f26
677 changed files with 882 additions and 2292 deletions
docs
alerts-and-notifications
category-overview-pages
install-netdata-on-embedded-systems.mdinstall-with-a-cicd-provisioning-system.mdinstallation-overview.mdintegrations-overview.mdmachine-learning-and-assisted-troubleshooting.mdmetrics-streaming-and-replication.mdmisc-overview.mdnetdata-apis.mdnetdata-architecture.mdtroubleshooting-overview.mdvisualizations-overview.mdworking-with-logs.md
collecting-metrics
configure
dashboards-and-charts
README.mdalerts-tab.mdanomaly-advisor-tab.mddashboards-tab.mdevents-feed.mdhome-tab.mdimport-export-print-snapshot.mdkubernetes-tab.mdlogs-tab.mdmetrics-tab-and-single-node-tabs.mdnetdata-charts.mdnode-filter.mdnodes-tab.mdthemes.mdtop-tab.mdvisualization-date-and-time-controls.md
deployment-guides
developer-and-contributor-corner
README.mdbuild-the-netdata-agent-yourself.mdcollect-apache-nginx-web-logs.mdcollect-unbound-metrics.mdcustomize.mdkubernetes-k8s-netdata.mdlamp-stack.mdmonitor-cockroachdb.mdmonitor-debug-applications-ebpf.mdmonitor-hadoop-cluster.mdpi-hole-raspberry-pi.mdprocess.mdpython-collector.mdraspberry-pi-anomaly-detection.mdrunning-through-cf-tunnels.mdstyle-guide.md
exporting-metrics
getting-started
glossary.mdguidelines.mdguides
metric-correlations.mdmetrics-storage-management
netdata-agent
README.mdbackup-and-restore-an-agent.md
netdata-assistant.mdconfiguration
README.mdanonymous-telemetry-events.mdcheatsheet.mdcommon-configuration-changes.mdoptimize-the-netdata-agents-performance.md
securing-netdata-agents.mdstart-stop-restart.mdoptimizing-metrics-database
organize-systems-metrics-and-alerts.mdrunning-the-netdata-agent-behind-a-reverse-proxy
netdata-cloud-onprem
netdata-cloud
README.md
netdata-for-IoT.mdauthentication-and-authorization
monitor-your-infrastructure.mdnetdata-subscription-plans.mdorganize-your-infrastructure-invite-your-team.mdview-plan-and-billing.mdobservability-centralization-points/metrics-centralization-points
top-monitoring-netdata-functions.mdintegrations/cloud-authentication/integrations
|
@ -5,16 +5,16 @@ Netdata Cloud sends alert notifications for nodes in warning, critical, or unrea
|
|||
## Benefits of Centralized Notifications
|
||||
|
||||
- Consolidate health status views across all infrastructure in one place.
|
||||
- Set up and [manage your alert notifications easily](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/manage-notification-methods.md).
|
||||
- Expedite troubleshooting with tools like [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md) and the [Anomaly Advisor](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md).
|
||||
- Set up and [manage your alert notifications easily](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/manage-notification-methods.md).
|
||||
- Expedite troubleshooting with tools like [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md) and the [Anomaly Advisor](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/anomaly-advisor-tab.md).
|
||||
|
||||
To avoid notification overload, **flood protection** is triggered when a node frequently disconnects or sends excessive alerts, highlighting potential issues.
|
||||
|
||||
Admins must [enable alert notifications](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/manage-notification-methods.md#manage-space-notification-settings) for their Space(s). All users can then customize their notification preferences through their [account menu](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/#manage-user-notification-settings).
|
||||
Admins must [enable alert notifications](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/manage-notification-methods.md#manage-space-notification-settings) for their Space(s). All users can then customize their notification preferences through their [account menu](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/manage-notification-methods.md#manage-user-notification-settings).
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> Centralized alerts in Netdata Cloud are separate from the [Netdata Agent](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md) notifications. Alerts must be [configured individually](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md) on each node.
|
||||
> Centralized alerts in Netdata Cloud are separate from the [Netdata Agent](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/README.md) notifications. Alerts must be [configured individually](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md) on each node.
|
||||
|
||||
## Alert Notifications
|
||||
|
||||
|
@ -38,11 +38,11 @@ Available to all plans, includes basic methods like Email and Discord.
|
|||
|
||||
#### Business
|
||||
|
||||
Exclusive to [paid plans](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/plans.md), includes advanced services like PagerDuty and Slack.
|
||||
Exclusive to [paid plans](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/netdata-subscription-plans.md), includes advanced services like PagerDuty and Slack.
|
||||
|
||||
## Alert Notification Silencing Rules
|
||||
|
||||
Netdata Cloud offers a silencing rule engine to mute alert notifications based on specific conditions related to nodes or alert types. Learn how to manage these settings [here](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/manage-alert-notification-silencing-rules.md).
|
||||
Netdata Cloud offers a silencing rule engine to mute alert notifications based on specific conditions related to nodes or alert types. Learn how to manage these settings [here](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/manage-alert-notification-silencing-rules.md).
|
||||
|
||||
## Flood Protection
|
||||
|
|
@ -33,7 +33,7 @@ To manage your **personal alert notification silencing rule settings**, you will
|
|||
- Define when it will be applied:
|
||||
- Immediately, from now till until it is turned off or until a specific duration (start and end date automatically set)
|
||||
- Scheduled, you specify the start and end time for when the rule becomes active and then inactive (time is set according to your browser local timezone)
|
||||
Note: You are only able to add a rule if your space is on a [paid plan](https://github.com/netdata/netdata/edit/master/docs/cloud/manage/plans.md).
|
||||
Note: You are only able to add a rule if your space is on a [paid plan](https://github.com/netdata/netdata/edit/master/docs/netdata-cloud/netdata-subscription-plans.md).
|
||||
1. **Edit an existing** alert notification silencing rule configurations. You will be able to change:
|
||||
- The name provided for it
|
||||
- Who it applies to
|
|
@ -22,7 +22,7 @@ To manage space notification settings, you will need the following:
|
|||
|
||||
Notes:
|
||||
* For Netadata provided ones you can't delete the existing notification method configuration.
|
||||
* Enable, Edit and Add actions over specific notification methods will only be allowed if your plan has access to those ([service classification](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md#service-classification))
|
||||
* Enable, Edit and Add actions over specific notification methods will only be allowed if your plan has access to those ([service classification](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md#service-classification))
|
||||
|
||||
### Steps
|
||||
|
||||
|
@ -55,13 +55,13 @@ To manage user specific notification settings, you will need the following:
|
|||
- A Cloud account
|
||||
- Have access to, at least, a space
|
||||
|
||||
Note: If an administrator has disabled a Personal [service level](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md#service-level) notification method this will override any user specific setting.
|
||||
Note: If an administrator has disabled a Personal [service level](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md#service-level) notification method this will override any user specific setting.
|
||||
|
||||
### Steps
|
||||
|
||||
1. Click on the **User notification settings** shortcut on top of the help button
|
||||
1. You are presented with:
|
||||
- The Personal [service level](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md#service-level) notification methods you can manage
|
||||
- The Personal [service level](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md#service-level) notification methods you can manage
|
||||
- The list spaces and rooms inside those where you have access to
|
||||
- If you're an administrator, Manager or Troubleshooter you'll also see the Rooms from a space you don't have access to on **All Rooms** tab and you can activate notifications for them by joining the room
|
||||
1. On this modal you will be able to:
|
|
@ -1,3 +0,0 @@
|
|||
# Install Netdata on Embedded Systems Overview
|
||||
|
||||
This section contains documentation for installation methods when it comes to Embedded Systems.
|
|
@ -1,3 +0,0 @@
|
|||
# Install with a CI/CD Provisioning System Overview
|
||||
|
||||
This section contains documentation on all the installation methods through a CI/CD system.
|
|
@ -1,10 +0,0 @@
|
|||
# Installation
|
||||
|
||||
In this category you can find instructions on all the possible ways you can install Netdata on the
|
||||
[supported platforms](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/versions-and-platforms.md).
|
||||
|
||||
If this is your first time using Netdata, we recommend that you first start with the
|
||||
[quick installation guide](https://github.com/netdata/netdata/edit/master/packaging/installer/README.md) and then
|
||||
go into the more advanced options available to you.
|
||||
|
||||
|
|
@ -1,31 +0,0 @@
|
|||
<!--
|
||||
title: "Integrations"
|
||||
sidebar_label: "Integrations"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/category-overview-pages/integrations-overview.md"
|
||||
description: "Available integrations in Netdata"
|
||||
learn_status: "Published"
|
||||
learn_rel_path: "Integrations"
|
||||
sidebar_position: 60
|
||||
-->
|
||||
|
||||
# Integrations
|
||||
|
||||
Netdata's ability to monitor out of the box every potentially useful aspect of a node's operation is unparalleled.
|
||||
But Netdata also provides out of the box, meaningful charts and alerts for hundreds of applications, with the ability
|
||||
to be easily extended to monitor anything. See the full list of Netdata's capabilities and how you can extend them in the
|
||||
[supported collectors list](https://github.com/netdata/netdata/blob/master/src/collectors/COLLECTORS.md).
|
||||
|
||||
Our out of the box alerts were created by expert professionals and have been validated on the field, countless times.
|
||||
Use them to trigger [alert notifications](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md)
|
||||
either centrally, via the
|
||||
[Cloud alert notifications](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md)
|
||||
, or by configuring individual
|
||||
[agent notifications](https://github.com/netdata/netdata/blob/master/src/health/notifications/README.md).
|
||||
|
||||
We designed Netdata with interoperability in mind. The Agent collects thousands of metrics every second, and then what
|
||||
you do with them is up to you. You can
|
||||
[store metrics in the database engine](https://github.com/netdata/netdata/blob/master/src/database/README.md),
|
||||
or send them to another time series database for long-term storage or further analysis using
|
||||
Netdata's [exporting engine](https://github.com/netdata/netdata/edit/master/src/exporting/README.md).
|
||||
|
||||
|
|
@ -2,6 +2,6 @@
|
|||
|
||||
Netdata provides a variety of Machine Learning features to help you troubleshoot certain scenarios that might come up.
|
||||
|
||||
- K-means clustering [Machine Learning models](https://github.com/netdata/netdata/blob/master/src/ml/README.md) are trained to power the [Anomaly Advisor](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md) on the dashboard, which allows you to identify anomalies in your infrastructure
|
||||
- [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md) are possible through the dashboard using the [Two-sample Kolmogorov Smirnov](https://en.wikipedia.org/wiki/Kolmogorov%E2%80%93Smirnov_test#Two-sample_Kolmogorov%E2%80%93Smirnov_test) statistical test and Volume heuristic measures
|
||||
- The [Netdata Assistant](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-assistant.md) is able to answer your prompts when it comes to troubleshooting alerts and anomalies.
|
||||
- K-means clustering [Machine Learning models](https://github.com/netdata/netdata/blob/master/src/ml/README.md) are trained to power the [Anomaly Advisor](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/anomaly-advisor-tab.md) on the dashboard, which allows you to identify anomalies in your infrastructure
|
||||
- [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md) are possible through the dashboard using the [Two-sample Kolmogorov Smirnov](https://en.wikipedia.org/wiki/Kolmogorov%E2%80%93Smirnov_test#Two-sample_Kolmogorov%E2%80%93Smirnov_test) statistical test and Volume heuristic measures
|
||||
- The [Netdata Assistant](https://github.com/netdata/netdata/blob/master/docs/netdata-assistant.md) is able to answer your prompts when it comes to troubleshooting alerts and anomalies.
|
||||
|
|
|
@ -1,175 +0,0 @@
|
|||
# Netdata Parents (Streaming and Replication)
|
||||
|
||||
## What are they and why do we need them?
|
||||
|
||||
A “Parent” is a Netdata Agent, like the ones we install on all our systems, but is configured as a central node that receives, stores and processes metrics data from other Netdata “Child” nodes in our infrastructure.
|
||||
|
||||
Netdata Parents are flexible. You can have one big active-active cluster of Netdata Parents, or you can spread a lot of independent Parents across the infrastructure.
|
||||
|
||||
This “distributed still centralized” setup provides a lot of benefits. Let’s see them:
|
||||
|
||||
## Infrastructure-Level Dashboards: All Nodes in One Dashboard
|
||||
|
||||
A Parent node receives and aggregates metrics data from all child nodes that push metrics to it, presenting all of them on a single, centralized dashboard.
|
||||
|
||||
Metrics streaming between Netdata nodes is real-time and low-latency, so that the Parent can provide the same resolution and detail its children provide.
|
||||
|
||||
Each chart on the Parent’s dashboard is automatically turned into a multi-node chart, allowing instant aggregation of the data across the entire dashboard. This is transparent and automatic for all kinds of charts, even application-specific ones. For example, when you have 2 PostgreSQL servers in your infrastructure, the parent will present one set of charts for PostgreSQL and these charts will include data from both servers.
|
||||
|
||||
## Increased Data Retention: Store More, Learn More
|
||||
|
||||
Netdata’s database (`dbengine`), supports multiple tiers of variable resolution for storing metrics’ samples. Tier 0 is the high-resolution one and usually stores per second data. Tier 1 is the middle resolution one, downsampling data to per minute. Tier 2 is the low-resolution one, downsampling data to per hour. With this setup, a default Netdata setup is usually able to maintain 2-3 days of high resolution and up to a year of low-resolution data, all in less than 1 GB of disk space.
|
||||
|
||||
In many cases, however, organizations require a lot more retention than this. A Netdata Parent can be configured to have weeks or even months of high-resolution data and several years of low-resolution data for all its Child nodes, by allowing the Netdata database to grow to hundreds of GiBs or even several TiBs.
|
||||
|
||||
## Monitoring Ephemeral Nodes: No Node Left Behind
|
||||
|
||||
Production systems are often ephemeral by nature. In containerized and orchestrated environments, like Kubernetes, nodes may come and go due to scaling policies, maintenance tasks, or as part of regular operations.
|
||||
|
||||
Netdata Parents come to the rescue in such scenarios. They can continuously receive metrics from ephemeral nodes during their lifecycle. As these nodes are removed or replaced, the Parent retains their performance history, essentially archiving the life of each node.
|
||||
|
||||
The Netdata dashboards on the Parents automatically bring into the charts data from archived nodes when users pan the dashboard to the time-window these nodes were alive. This means that no data is lost and visibility is maintained across the entire lifespan of every node, regardless of its ephemeral nature.
|
||||
|
||||
## Unified Alerts Management: Silence the Noise
|
||||
|
||||
Each Netdata Agent is able to run health checks, trigger alerts and send notifications on its own. However, in a large-scale infrastructure with numerous nodes, each capable of generating alerts, managing these notifications can quickly become a challenge. Duplicate alerts and non-centralized management can lead to unnecessary noise, causing alert fatigue and possibly overlooking critical warnings.
|
||||
|
||||
Netdata Parents provide a solution to this problem. By configuring a Parent node to handle all alerts and health checks, and disabling health monitoring on the Child nodes, you centralize your alerts management, meaning that all alerts are now generated from a single place, reducing noise and ensuring that each unique issue only triggers a single notification.
|
||||
|
||||
In addition to making alert management more straightforward, this setup also allows for more refined control over your alert configurations. Instead of managing alert settings across multiple nodes, you can handle all configurations in one place, ensuring consistency and ease of management.
|
||||
|
||||
## Offloading Production Systems: Prioritize Performance
|
||||
|
||||
In a production environment, every bit of system resources is crucial. Minimizing the overhead due to monitoring and observability is vital to ensure optimal system performance. Although the Netdata Agent is designed to be lightweight and efficient, using a Netdata Parent can allow the Netdata Agents on your production systems to focus on the absolutely necessary for collecting metrics and pushing them to their Parent.
|
||||
|
||||
On your production systems, by configuring the Netdata Agents to use the `alloc` database mode with 5-10 minutes of retention time and disabling health monitoring and Machine Learning (ML) processing, you significantly reduce the system resources consumed by the monitoring system.
|
||||
|
||||
Netdata, with the `alloc` database mode, doesn't touch the disk at all (apart from logging - which can also be disabled). This approach eliminates any potential disk I/O impact from Netdata on your production applications, which could be particularly beneficial in I/O-sensitive environments.
|
||||
|
||||
## Fault Tolerance and Redundancy: Ensure Continuous Monitoring
|
||||
|
||||
Netdata Agents stream metrics to one Netdata Parent at a time. But more than one Parent can be configured on each child. The first available at any given time is used.
|
||||
|
||||
Similarly, Netdata Parents can be configured to stream/proxy the data they receive to another Netdata Parent. And they can support multiple Parents too, one of which will be used at any given time.
|
||||
|
||||
Configuration allows setting up a circular streaming setup. Parent A streams to Parent B and Parent B streams to Parent A. Child nodes are configured to stream to any of Parents A and B and they will automatically fall back and switch parents as necessary.
|
||||
|
||||
With the replication feature (enabled by default), all nodes replicate missing data on their Parent, before streaming live metrics, filling up any gap the Parent may have.
|
||||
|
||||
The same setup can work for 2 or even more parents, to form an active-active multi-node cluster. Child nodes can connect to any of the parent nodes available and the parent nodes will automatically replicate and stream metrics to each other.
|
||||
|
||||
The setup is optimized even for wide-area connections between child nodes and parents, or for cases where the bandwidth between child nodes and parents has a cost associated with it. At any given time each child node sends its data only once. The parents then replicate and stream this data to each other.
|
||||
|
||||
## Security and Isolation: Protect Your Production Systems
|
||||
|
||||
Parent nodes can be set up in your organization's Demilitarized Zone (DMZ), acting as a protective barrier or application firewall, shielding your production Netdata agents from the outside world.
|
||||
|
||||
With Netdata Parents configured, the Netdata Agents running on your production systems need only one connection to these parents. They don’t need to run data queries, they will never send alert notifications, or even connect to Netdata Cloud.
|
||||
|
||||
Especially for Netdata Cloud, when the Parent node is connected to Netdata Cloud, it registers its Child nodes to it and can serve all functions required by the Cloud on behalf of the Child nodes. So, although only the parent is connected to Netdata Cloud, there is no difference in the user features you enjoy on Netdata Cloud in regard to your production systems. They will all be there.
|
||||
|
||||
## FAQ about Netdata Parents
|
||||
|
||||
### How much can a Parent node scale?
|
||||
|
||||
For about 1 million real-time metrics, with a default configuration:
|
||||
|
||||
- collected and streamed to it per second,
|
||||
- stored in 3 database tiers (high, mid, low resolution),
|
||||
- with ML training and anomaly detection running,
|
||||
- health for alerts and notifications
|
||||
|
||||
And about 2 TiB of storage for metrics, you will need about 5-8 CPU cores and 32GiB of RAM. On such a setup you can have:
|
||||
|
||||
- 15 days of high resolution metrics
|
||||
- 3 months of mid resolution metrics
|
||||
- 1 year of low resolution metrics
|
||||
|
||||
For such a setup, we recommend a 16 CPU cores system so that there is spare capacity for queries. More RAM and faster disks will give faster queries.
|
||||
|
||||
So, depending on the number of metrics per node you have and the size of your Parents, you may be able to aggregate 200 to 500 nodes per Parent.
|
||||
|
||||
### If I set up 2 active-active parents, will I be able to have more Child nodes stream to them?
|
||||
|
||||
No. When you set up an active-active cluster, even if child nodes connect randomly to one or the other, all the parent nodes receive all the metrics of all the child nodes. So, all of them do all the work.
|
||||
|
||||
There is a feature we currently work on, to allow Parent nodes to detect that they receive ML information with the streamed metric data (they receive it already but they ignore it), to prevent them from training their own ML models and running anomaly detection again for the child node. But this is not ready yet.
|
||||
|
||||
### How much retention do the child nodes need?
|
||||
|
||||
Child nodes need to have only the retention required in order to connect to another Parent if one fails or stops for maintenance.
|
||||
|
||||
- If you have an active-active cluster of parents, 5 to 10 minutes in `alloc` mode is enough.
|
||||
- If you have only 1 parent, it would be better to run the child nodes with `dbengine` so that they will have enough retention to backfill the parent nodes if it stops for a few hours for maintenance.
|
||||
|
||||
### Does streaming between child nodes and parents support encryption?
|
||||
|
||||
Yes. You can configure your parent nodes to enable TLS and configure the child nodes to connect with TLS to it. The streaming connection is also compressed with LZ4 and this works even on top of TLS.
|
||||
|
||||
### Can I have an HTTP proxy between parent and child nodes?
|
||||
|
||||
No. The streaming protocol works on the same port as the internal web server of Netdata Agents, but the protocol is not HTTP-friendly and cannot be understood by HTTP proxy servers.
|
||||
|
||||
### Should I load balance the parents with a TCP load balancer?
|
||||
|
||||
Although this can be done and for streaming between child and parent nodes it could work, we recommend not doing it. It can lead to several kinds of problems.
|
||||
|
||||
It is better to configure all the parent nodes directly in the child nodes `stream.conf`. The child nodes will do everything in their power to find a parent node to connect and they will never give up.
|
||||
|
||||
### When I have an active-active cluster of parents, will I receive alert notifications from both of them?
|
||||
|
||||
If both are configured to run health checks and trigger alerts, yes.
|
||||
|
||||
We recommend using Netdata Cloud to avoid receiving duplicate alert notifications. Netdata Cloud deduplicates alert notifications so that you will receive them only once. On top of that, you can control silencing and routing directly from the Netdata Cloud UI.
|
||||
|
||||
### When I have only Parents connected to Netdata Cloud, will I be able to use the Functions feature on my child nodes?
|
||||
|
||||
Yes.
|
||||
|
||||
Functions is a feature of data collection plugins to expose functions that can be run from the dashboard to view more detailed information about a data collection. For example, apps.plugin exposes the processes function that returns a list of all the processes running, together with information about their CPU utilization, memory consumption, disk I/O operations, bandwidth, and a lot more.
|
||||
|
||||
When a parent receives a Function request, it forwards it to the plugin that exposes it. If the plugin is available over a streaming connection, the parent will forward the request to the socket it receives metrics from. This process will be repeated even if many parents are chained in order to reach the child.
|
||||
|
||||
### If I have a set of 2 active-active parents and get one out for maintenance for a few hours, will it have missing data when it returns back online?
|
||||
|
||||
There are 2 reasons you may have gaps in your data after you bring it back online:
|
||||
|
||||
1. Replication does not replicate metrics that are not actively collected. So, when the parent comes back, if there are samples that this parent does not have, for metrics that are not currently being collected, these samples will not be propagated to that parent. [We are working to fix this issue](https://github.com/netdata/netdata/issues/15198).
|
||||
2. If the parent has been offline for a long time and the child nodes run in db mode `alloc`, you need to plan how you will bring this parent back online. Child nodes in this mode do not have enough retention to backfill the parent and if they connect to it before the other parent, you will end up with missing information on that parent.
|
||||
|
||||
The simplest way to solve this is to block at the firewall all connections to port 19999 from child nodes, but allow connections from the other parent nodes. Once replication finishes for all nodes, you can unblock the connections from child nodes to it.
|
||||
|
||||
### I got a parent out of maintenance but it replicates (backfills) missing data slowly. Can I speed it up?
|
||||
|
||||
Yes, there is a setting on `netdata.conf` under section `[db]` called `replication threads`. The default value is 1.
|
||||
|
||||
Usually, each thread is able to replicate about 2-5 million samples per second. We suggest setting this to 5 threads for all parents. Generally do not use too many threads because you are risking congesting the disks and/or the CPU cores available. Keep in mind that the sending parent needs this setting.
|
||||
|
||||
There is no need to increase this number on child nodes. Each node has one replication sender, so when hundreds of nodes are replicating to a parent, there are already a lot of senders pushing metrics to it.
|
||||
|
||||
### I have multiple active-active parents. Which one is used by Netdata Cloud for queries?
|
||||
|
||||
When you have multiple parents available, the one that is further away from the child node is used by Netdata Cloud, unless it does not have the data required.
|
||||
|
||||
This works like this: The child has `hops = 0`. Each parent receiving metrics for this child increases the `hops` by 1. So the first parent will have `hops = 1`, the second parent will have `hops = 2` and so on.
|
||||
|
||||
Netdata Cloud knows the retention of each parent. So, when it needs data from this child, it first checks the available retention each parent has for it and then it uses the parent with the higher `hops`. If no parent is available and the child node is directly connected to Netdata Cloud, it uses the child.
|
||||
|
||||
### Is there a way to balance child nodes to the parent nodes of an active-active cluster?
|
||||
|
||||
If you have 2 parent nodes A and B, you can configure them on half the child nodes as A, B, and the other half as B, A. The child nodes will connect to the first available (left to right). If both A and B are online, half of the child nodes will connect to A and the other half to B.
|
||||
|
||||
Keep in mind, however, that if you restart a parent, all the child nodes that were connected to it will automatically reconnect to the other parent. Once this happens, the child nodes will stay connected to it.
|
||||
|
||||
### Is there a way to get notified when a child gets disconnected?
|
||||
|
||||
There are 2 kinds of production nodes:
|
||||
1. **Permanent nodes**
|
||||
These are nodes that should be available permanently and if they disconnect an alert should be triggered to notify you.
|
||||
By default, all nodes are considered permanent (not ephemeral).
|
||||
2. **Ephemeral nodes**
|
||||
These are nodes that are ephemeral by nature and they may shutdown at any point in time without any impact on the services you run.
|
||||
|
||||
To set the ephemeral flag on a node, edit its `netdata.conf` and in the `[health]` section set is `ephemeral = yes`. This setting is propagated to parent nodes and Netdata Cloud.
|
||||
|
||||
When using Netdata Cloud (via a parent or directly) and a permanent node gets disconnected, Netdata Cloud sends node disconnection notifications.
|
|
@ -1,3 +0,0 @@
|
|||
# Miscellaneous material
|
||||
|
||||
This section contains material that will be moved to new locations as we see fit. We keep it here to make it accessible while we make these changes.
|
|
@ -1,5 +0,0 @@
|
|||
# Netdata APIs Overview
|
||||
|
||||
This section contains information about Netdata's APIs.
|
||||
|
||||
You can access the Netdata Agent's API through swagger UI [here](/api).
|
|
@ -1,3 +0,0 @@
|
|||
# Netdata Architecture Overview
|
||||
|
||||
This section's purpose is to explain the architecture of Netdata, the role of the Agent and the Cloud, and more.
|
|
@ -1,5 +0,0 @@
|
|||
# Troubleshooting and machine learning
|
||||
|
||||
In this section you can learn about Netdata's advanced tools that can assist you in troubleshooting issues with
|
||||
your infrastructure, to facilitate the identification of a root cause.
|
||||
|
|
@ -1,4 +0,0 @@
|
|||
# Visualizations, charts and dashboards
|
||||
|
||||
In this section you can learn about the various ways Netdata visualizes the collected metrics at an infrastructure level with Netdata Cloud
|
||||
and at a single node level, with the Netdata Agent Dashboard.
|
|
@ -1,3 +1,3 @@
|
|||
# Logs
|
||||
# Working with Logs
|
||||
|
||||
This section talks about ways Netdata collects and visualizes logs, while also providing useful guides on log centralization setups that can be used with Netdata.
|
|
@ -2,7 +2,7 @@
|
|||
title: "Collect application metrics with Netdata"
|
||||
sidebar_label: "Application metrics"
|
||||
description: "Monitor and troubleshoot every application on your infrastructure with per-second metrics, zero configuration, and meaningful charts."
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/collect/application-metrics.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/collecting-metrics/application-metrics.md"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Concepts"
|
||||
learn_rel_path: "Concepts"
|
||||
|
@ -78,6 +78,6 @@ collector](https://github.com/netdata/netdata/blob/master/src/go/collectors/go.d
|
|||
|
||||
Collecting all the available metrics on your nodes, and across your entire infrastructure, is just one piece of the
|
||||
puzzle. Next, learn more about Netdata's famous real-time visualizations by [seeing an overview of your
|
||||
infrastructure](https://github.com/netdata/netdata/blob/master/docs/dashboard/home-tab.md) using Netdata Cloud.
|
||||
infrastructure](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/home-tab.md) using Netdata Cloud.
|
||||
|
||||
|
|
@ -2,7 +2,7 @@
|
|||
title: "Collect container metrics with Netdata"
|
||||
sidebar_label: "Container metrics"
|
||||
description: "Use Netdata to collect per-second utilization and application-level metrics from Linux/Docker containers and Kubernetes clusters."
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/collect/container-metrics.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/collecting-metrics/container-metrics.md"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Concepts"
|
||||
learn_rel_path: "Concepts"
|
||||
|
@ -61,7 +61,7 @@ a container. You may need to configure the target endpoint if it's not the appli
|
|||
Netdata can even [run in a Docker container](https://github.com/netdata/netdata/blob/master/packaging/docker/README.md) itself, and then collect metrics about the
|
||||
host system, its own container with cgroups, and any applications you want to monitor.
|
||||
|
||||
See our [application metrics doc](https://github.com/netdata/netdata/blob/master/docs/collect/application-metrics.md) for details about Netdata's application metrics
|
||||
See our [application metrics doc](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/application-metrics.md) for details about Netdata's application metrics
|
||||
collection capabilities.
|
||||
|
||||
## Collect Kubernetes metrics
|
||||
|
@ -88,14 +88,14 @@ your k8s infrastructure.
|
|||
each container running on your k8s cluster.
|
||||
|
||||
For a holistic view of Netdata's Kubernetes monitoring capabilities, see our guide: [_Monitor a Kubernetes (k8s) cluster
|
||||
with Netdata_](https://github.com/netdata/netdata/blob/master/docs/guides/monitor/kubernetes-k8s-netdata.md).
|
||||
with Netdata_](https://github.com/netdata/netdata/blob/master/docs/developer-and-contributor-corner/kubernetes-k8s-netdata.md).
|
||||
|
||||
## What's next?
|
||||
|
||||
Netdata is capable of collecting metrics from hundreds of applications, such as web servers, databases, messaging
|
||||
brokers, and more. See more in the [application metrics doc](https://github.com/netdata/netdata/blob/master/docs/collect/application-metrics.md).
|
||||
brokers, and more. See more in the [application metrics doc](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/application-metrics.md).
|
||||
|
||||
If you already have all the information you need about collecting metrics, move into Netdata's meaningful visualizations
|
||||
with [seeing an overview of your infrastructure](https://github.com/netdata/netdata/blob/master/docs/dashboard/home-tab.md) using Netdata Cloud.
|
||||
with [seeing an overview of your infrastructure](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/home-tab.md) using Netdata Cloud.
|
||||
|
||||
|
|
@ -2,7 +2,7 @@
|
|||
title: "Collect system metrics with Netdata"
|
||||
sidebar_label: "System metrics"
|
||||
description: "Netdata collects thousands of metrics from physical and virtual systems, IoT/edge devices, and containers with zero configuration."
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/collect/system-metrics.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/collecting-metrics/system-metrics.md"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Concepts"
|
||||
learn_rel_path: "Concepts"
|
||||
|
@ -11,7 +11,7 @@ learn_rel_path: "Concepts"
|
|||
# Collect system metrics with Netdata
|
||||
|
||||
Netdata collects thousands of metrics directly from the operating systems of physical and virtual systems, IoT/edge
|
||||
devices, and [containers](https://github.com/netdata/netdata/blob/master/docs/collect/container-metrics.md) with zero configuration.
|
||||
devices, and [containers](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/container-metrics.md) with zero configuration.
|
||||
|
||||
To gather system metrics, Netdata uses roughly a dozen plugins, each of which has one or more collectors for very
|
||||
specific metrics exposed by the host. The system metrics Netdata users interact with most for health monitoring and
|
||||
|
@ -25,7 +25,7 @@ It even allows Netdata to monitor its own resource utilization!
|
|||
[**cgroups.plugin**](https://github.com/netdata/netdata/blob/master/src/collectors/cgroups.plugin/README.md) collects rich metrics about containers and virtual machines
|
||||
using the virtual files under `/sys/fs/cgroup`. By reading cgroups, Netdata can instantly collect resource utilization
|
||||
metrics for systemd services, all containers (Docker, LXC, LXD, Libvirt, systemd-nspawn), and more. Learn more in the
|
||||
[collecting container metrics](https://github.com/netdata/netdata/blob/master/docs/collect/container-metrics.md) doc.
|
||||
[collecting container metrics](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/container-metrics.md) doc.
|
||||
|
||||
[**ebpf.plugin**](https://github.com/netdata/netdata/blob/master/src/collectors/ebpf.plugin/README.md): Netdata's extended Berkeley Packet Filter (eBPF) collector
|
||||
monitors Linux kernel-level metrics for file descriptors, virtual filesystem IO, and process management. You can use our
|
||||
|
@ -49,14 +49,14 @@ method](https://github.com/netdata/netdata/blob/master/packaging/installer/READM
|
|||
utilization, memory, bandwidth per NIC, number of processes, and much more.
|
||||
|
||||
For information about collecting metrics from applications _running on Windows systems_, see the [application metrics
|
||||
doc](https://github.com/netdata/netdata/blob/master/docs/collect/application-metrics.md#collect-metrics-from-applications-running-on-windows).
|
||||
doc](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/application-metrics.md#collect-metrics-from-applications-running-on-windows).
|
||||
|
||||
## What's next?
|
||||
|
||||
Because there's some overlap between system metrics and [container metrics](https://github.com/netdata/netdata/blob/master/docs/collect/container-metrics.md), you
|
||||
Because there's some overlap between system metrics and [container metrics](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/container-metrics.md), you
|
||||
should investigate Netdata's container compatibility if you use them heavily in your infrastructure.
|
||||
|
||||
If you don't use containers, skip ahead to collecting [application metrics](https://github.com/netdata/netdata/blob/master/docs/collect/application-metrics.md) with
|
||||
If you don't use containers, skip ahead to collecting [application metrics](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/application-metrics.md) with
|
||||
Netdata.
|
||||
|
||||
|
|
@ -1,139 +0,0 @@
|
|||
# Configure the Netdata Agent
|
||||
|
||||
Netdata's zero-configuration collection, storage, and visualization features work for many users, infrastructures, and
|
||||
use cases, but there are some situations where you might want to configure the Netdata Agent running on your node(s),
|
||||
which can be a physical or virtual machine (VM), container, cloud deployment, or edge/IoT device.
|
||||
|
||||
For example, you might want to increase metrics retention, configure a collector based on your infrastructure's unique
|
||||
setup, or secure the local dashboard by restricting it to only connections from `localhost`.
|
||||
|
||||
Whatever the reason, Netdata users should know how to configure individual nodes to act decisively if an incident,
|
||||
anomaly, or change in infrastructure affects how their Agents should perform.
|
||||
|
||||
## The Netdata config directory
|
||||
|
||||
On most Linux systems, using our [recommended one-line
|
||||
installation](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md#install-on-linux-with-one-line-installer), the **Netdata config
|
||||
directory** is `/etc/netdata/`. The config directory contains several configuration files with the `.conf` extension, a
|
||||
few directories, and a shell script named `edit-config`.
|
||||
|
||||
> Some operating systems will use `/opt/netdata/etc/netdata/` as the config directory. If you're not sure where yours
|
||||
> is, navigate to `http://NODE:19999/netdata.conf` in your browser, replacing `NODE` with the IP address or hostname of
|
||||
> your node, and find the `# config directory = ` setting. The value listed is the config directory for your system.
|
||||
|
||||
All of Netdata's documentation assumes that your config directory is at `/etc/netdata`, and that you're running any
|
||||
scripts from inside that directory.
|
||||
|
||||
## Netdata's configuration files
|
||||
|
||||
Upon installation, the Netdata config directory contains a few files and directories. It's okay if you don't see all
|
||||
these files in your own Netdata config directory, as the next section describes how to edit any that might not already
|
||||
exist.
|
||||
|
||||
- `netdata.conf` is the main configuration file. This is where you'll find most configuration options. Read descriptions
|
||||
for each in the [daemon config](https://github.com/netdata/netdata/blob/master/src/daemon/config/README.md) doc.
|
||||
- `edit-config` is a shell script used for [editing configuration files](#use-edit-config-to-edit-configuration-files).
|
||||
- Various configuration files ending in `.conf` for [configuring plugins or
|
||||
collectors](https://github.com/netdata/netdata/blob/master/src/collectors/REFERENCE.md) behave. Examples: `go.d.conf`,
|
||||
`python.d.conf`, and `ebpf.d.conf`.
|
||||
- Various directories ending in `.d`, which contain other configuration files, each ending in `.conf`, for [configuring
|
||||
specific collectors](https://github.com/netdata/netdata/blob/master/src/collectors/REFERENCE.md).
|
||||
- `apps_groups.conf` is a configuration file for changing how applications/processes are grouped when viewing the
|
||||
**Application** charts from [`apps.plugin`](https://github.com/netdata/netdata/blob/master/src/collectors/apps.plugin/README.md) or
|
||||
[`ebpf.plugin`](https://github.com/netdata/netdata/blob/master/src/collectors/ebpf.plugin/README.md).
|
||||
- `health.d/` is a directory that contains [health configuration files](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md).
|
||||
- `health_alarm_notify.conf` enables and configures [alert notifications](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md).
|
||||
- `statsd.d/` is a directory for configuring Netdata's [statsd collector](https://github.com/netdata/netdata/blob/master/src/collectors/statsd.plugin/README.md).
|
||||
- `stream.conf` configures [parent-child streaming](https://github.com/netdata/netdata/blob/master/src/streaming/README.md) between separate nodes running the Agent.
|
||||
- `.environment` is a hidden file that describes the environment in which the Netdata Agent is installed, including the
|
||||
`PATH` and any installation options. Useful for [reinstalling](https://github.com/netdata/netdata/blob/master/packaging/installer/REINSTALL.md) or
|
||||
[uninstalling](https://github.com/netdata/netdata/blob/master/packaging/installer/UNINSTALL.md) the Agent.
|
||||
|
||||
The Netdata config directory also contains one symlink:
|
||||
|
||||
- `orig` is a symbolic link to the directory `/usr/lib/netdata/conf.d`, which contains stock configuration files. Stock
|
||||
versions are copied into the config directory when opened with `edit-config`. _Do not edit the files in
|
||||
`/usr/lib/netdata/conf.d`, as they are overwritten by updates to the Netdata Agent._
|
||||
|
||||
## Configure a Netdata docker container
|
||||
|
||||
See [configure agent containers](https://github.com/netdata/netdata/blob/master/packaging/docker/README.md#configure-agent-containers).
|
||||
|
||||
## Use `edit-config` to edit configuration files
|
||||
|
||||
The **recommended way to easily and safely edit Netdata's configuration** is with the `edit-config` script. This script
|
||||
opens existing Netdata configuration files using your system's `$EDITOR`. If the file doesn't yet exist in your config
|
||||
directory, the script copies the stock version from `/usr/lib/netdata/conf.d` (or wherever the symlink `orig` under the config directory leads to)
|
||||
to the proper place in the config directory and opens the copy for editing.
|
||||
|
||||
If you have trouble running the script, you can manually copy the file and edit the copy.
|
||||
|
||||
e.g. `cp /usr/lib/netdata/conf.d/go.d/bind.conf /etc/netdata/go.d/bind.conf; vi /etc/netdata/go.d/bind.conf`
|
||||
|
||||
Run `edit-config` without options, to see details on its usage, or `edit-config --list` to see a list of all the configuration
|
||||
files you can edit.
|
||||
|
||||
```bash
|
||||
USAGE:
|
||||
./edit-config [options] FILENAME
|
||||
|
||||
Copy and edit the stock config file named: FILENAME
|
||||
if FILENAME is already copied, it will be edited as-is.
|
||||
|
||||
Stock config files at: '/etc/netdata/../../usr/lib/netdata/conf.d'
|
||||
User config files at: '/etc/netdata'
|
||||
|
||||
The editor to use can be specified either by setting the EDITOR
|
||||
environment variable, or by using the --editor option.
|
||||
|
||||
The file to edit can also be specified using the --file option.
|
||||
|
||||
For a list of known config files, run './edit-config --list'
|
||||
```
|
||||
|
||||
To edit `netdata.conf`, run `./edit-config netdata.conf`. You may need to elevate your privileges with `sudo` or another
|
||||
method for `edit-config` to write into the config directory. Use your `$EDITOR`, make your changes, and save the file.
|
||||
|
||||
> `edit-config` uses the `EDITOR` environment variable on your system to edit the file. On many systems, that is
|
||||
> defaulted to `vim` or `nano`. Use `export EDITOR=` to change this temporarily, or edit your shell configuration file
|
||||
> to change to permanently.
|
||||
|
||||
After you make your changes, you need to [restart the Agent](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md#maintaining-a-netdata-agent-installation) with `sudo systemctl
|
||||
restart netdata` or the appropriate method for your system.
|
||||
|
||||
Here's an example of editing the node's hostname, which appears in both the local dashboard and in Netdata Cloud.
|
||||
|
||||

|
||||
|
||||
### Other configuration files
|
||||
|
||||
You can edit any Netdata configuration file using `edit-config`. A few examples:
|
||||
|
||||
```bash
|
||||
./edit-config apps_groups.conf
|
||||
./edit-config ebpf.d.conf
|
||||
./edit-config health.d/load.conf
|
||||
./edit-config go.d/prometheus.conf
|
||||
```
|
||||
|
||||
The documentation for each of Netdata's components explains which file(s) to edit to achieve the desired behavior.
|
||||
|
||||
## See an Agent's running configuration
|
||||
|
||||
On start, the Netdata Agent daemon attempts to load `netdata.conf`. If that file is missing, incomplete, or contains
|
||||
invalid settings, the daemon attempts to run sane defaults instead. In other words, the state of `netdata.conf` on your
|
||||
filesystem may be different from the state of the Netdata Agent itself.
|
||||
|
||||
To see the _running configuration_, navigate to `http://NODE:19999/netdata.conf` in your browser, replacing `NODE` with
|
||||
the IP address or hostname of your node. The file displayed here is exactly the settings running live in the Netdata
|
||||
Agent.
|
||||
|
||||
If you're having issues with configuring the Agent, apply the running configuration to `netdata.conf` by downloading the
|
||||
file to the Netdata config directory. Use `sudo` to elevate privileges.
|
||||
|
||||
```bash
|
||||
wget -O /etc/netdata/netdata.conf http://localhost:19999/netdata.conf
|
||||
# or
|
||||
curl -o /etc/netdata/netdata.conf http://NODE:19999/netdata.conf
|
||||
```
|
|
@ -13,15 +13,15 @@ By default, the Agent dashboard shows the latest version (matching Netdata Cloud
|
|||
|
||||
The Netdata dashboard consists of the following main sections:
|
||||
|
||||
- [Home tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/home-tab.md)
|
||||
- [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/nodes-tab.md)
|
||||
- [Netdata charts](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md)
|
||||
- [Metrics tab and single node tabs](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md)
|
||||
- [Top tab](https://github.com/netdata/netdata/blob/master/docs/cloud/runtime-troubleshooting-with-functions.md)
|
||||
- [Logs tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/logs-tab.md)
|
||||
- [Dashboards tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/dashboards-tab.md)
|
||||
- [Alerts tab](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md)
|
||||
- [Events tab](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/events-feed.md)
|
||||
- [Home tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/home-tab.md)
|
||||
- [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/nodes-tab.md)
|
||||
- [Netdata charts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md)
|
||||
- [Metrics tab and single node tabs](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md)
|
||||
- [Top tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/top-tab.md)
|
||||
- [Logs tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/logs-tab.md)
|
||||
- [Dashboards tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md)
|
||||
- [Alerts tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/alerts-tab.md)
|
||||
- [Events tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/events-feed.md)
|
||||
|
||||
> **Note**
|
||||
>
|
||||
|
@ -31,7 +31,7 @@ The Netdata dashboard consists of the following main sections:
|
|||
|
||||
### Netdata Cloud
|
||||
|
||||
You can access the dashboard at <https://app.netdata.cloud/> and [sign-in with an account or sign-up](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/authentication-and-authorization.md) if you don't have an account yet.
|
||||
You can access the dashboard at <https://app.netdata.cloud/> and [sign-in with an account or sign-up](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/README.md) if you don't have an account yet.
|
||||
|
||||
### Netdata Agent
|
||||
|
|
@ -41,11 +41,11 @@ By clicking on the name of an entry of the table you can access that alert's det
|
|||
- Configuration section
|
||||
- Instance values - Node Instances
|
||||
|
||||
At the bottom of the panel you can click the green button "View alert page" to open a [dynamic tab](https://github.com/netdata/netdata/blob/master/docs/quickstart/infrastructure.md#dynamic-tabs) containing all the info for this alert in a tab format, where you can also run correlations and go to the node's chart that raised the particular alert.
|
||||
At the bottom of the panel you can click the green button "View alert page" to open a [dynamic tab](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/monitor-your-infrastructure.md#dynamic-tabs) containing all the info for this alert in a tab format, where you can also run correlations and go to the node's chart that raised the particular alert.
|
||||
|
||||
### Silence an alert
|
||||
|
||||
From this tab, the "Silencing" column shows if there is any rule present for each alert, and from the "Actions" column you can create a new [silencing rule](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md#alert-notifications-silencing-rules) for this alert, or get help and information about this alert from the [Netdata Assistant](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-assistant.md).
|
||||
From this tab, the "Silencing" column shows if there is any rule present for each alert, and from the "Actions" column you can create a new [silencing rule](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md#alert-notifications-silencing-rules) for this alert, or get help and information about this alert from the [Netdata Assistant](https://github.com/netdata/netdata/blob/master/docs/netdata-assistant.md).
|
||||
|
||||
## Alert Configurations tab
|
||||
|
||||
|
@ -55,7 +55,7 @@ By running alerts we mean alerts that are related to some metric that is or was
|
|||
|
||||
You can control which columns are visible by using the gear icon on the right-hand side.
|
||||
|
||||
Similarly to the previous tab, you can see the silencing status of an alert, while also being able to dig deeper and show the configuration for the alert and ask the [Netdata Assistant](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-assistant.md) for help.
|
||||
Similarly to the previous tab, you can see the silencing status of an alert, while also being able to dig deeper and show the configuration for the alert and ask the [Netdata Assistant](https://github.com/netdata/netdata/blob/master/docs/netdata-assistant.md) for help.
|
||||
|
||||
### See the configuration for an alert
|
||||
|
|
@ -1,6 +1,6 @@
|
|||
# Anomaly Advisor tab
|
||||
|
||||
The Anomaly Advisor tab lets you focus on potentially anomalous metrics and charts related to a particular highlighted window of interest. In addition to this tab, each chart in the [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md) also has an [Anomaly Rate ribbon](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#anomaly-rate-ribbon).
|
||||
The Anomaly Advisor tab lets you focus on potentially anomalous metrics and charts related to a particular highlighted window of interest. In addition to this tab, each chart in the [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md) also has an [Anomaly Rate ribbon](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#anomaly-rate-ribbon).
|
||||
|
||||
|
||||
More details about configuration can be found in the [ML documentation](https://github.com/netdata/netdata/blob/master/src/ml/README.md#configuration).
|
||||
|
@ -17,7 +17,7 @@ Once you have highlighted a window of interest, you should see an ordered list o
|
|||
|
||||
> **Tip**
|
||||
>
|
||||
> You can also use the [node filter](https://github.com/netdata/netdata/blob/master/docs/dashboard/node-filter.md) to select which nodes you want to include or exclude.
|
||||
> You can also use the [node filter](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/node-filter.md) to select which nodes you want to include or exclude.
|
||||
|
||||
The right side of the page displays an anomaly index for the highlighted timeline of interest. The index is sorted from most anomalous (highest level of anomaly) to least (lowest level of anomaly). Clicking on an entry in the index will get you to the corresponding chart for the anomalous metric.
|
||||
|
|
@ -12,7 +12,7 @@ In the modal, give your custom dashboard a name, and click **+ Add**.
|
|||
|
||||
- The **Add Chart** button on the top right of the interface adds your first chart card. From the dropdown, select either **All Nodes** or a specific node.
|
||||
|
||||
Next, select the context. You'll see a preview of the chart before you finish adding it. In this modal you can also [interact with the chart](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md), meaning you can configure all the aspects of the [NIDL framework](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#nidl-framework) of the chart and more in detail, you can:
|
||||
Next, select the context. You'll see a preview of the chart before you finish adding it. In this modal you can also [interact with the chart](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md), meaning you can configure all the aspects of the [NIDL framework](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#nidl-framework) of the chart and more in detail, you can:
|
||||
- define which `group by` method to use
|
||||
- select the aggregation function over the data source
|
||||
- select nodes
|
||||
|
@ -21,7 +21,7 @@ In the modal, give your custom dashboard a name, and click **+ Add**.
|
|||
- select labels
|
||||
- select the aggregation function over time
|
||||
|
||||
After you are done configuring the chart, you can also change the type of the chart from the right hand side of the [Title bar](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#title-bar), and select which of the final dimensions you want to be visible and in what order, from the [Dimensions bar](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#dimensions-bar).
|
||||
After you are done configuring the chart, you can also change the type of the chart from the right hand side of the [Title bar](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#title-bar), and select which of the final dimensions you want to be visible and in what order, from the [Dimensions bar](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#dimensions-bar).
|
||||
|
||||
- The **Add Text** button on the top right of the interface creates a new card with user-defined text, which you can use to describe or document a particular dashboard's meaning and purpose.
|
||||
|
||||
|
@ -35,13 +35,13 @@ Dashboards are designed to be interactive and flexible so you can design them to
|
|||
|
||||
### Charts
|
||||
|
||||
The charts you add to any dashboard are [fully interactive](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md), just like any other Netdata chart. You can zoom in and out, highlight timeframes, and more.
|
||||
The charts you add to any dashboard are [fully interactive](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md), just like any other Netdata chart. You can zoom in and out, highlight timeframes, and more.
|
||||
|
||||
Charts also synchronize as you interact with them, even across contexts _or_ nodes.
|
||||
|
||||
### Text cards
|
||||
|
||||
You can use text cards as notes to explain to other members of the [War Room](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-war-rooms) the purpose of the dashboard's arrangement.
|
||||
You can use text cards as notes to explain to other members of the [War Room](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-war-rooms) the purpose of the dashboard's arrangement.
|
||||
|
||||
By clicking the `T` icon on the text box, you can switch between font sizes.
|
||||
|
||||
|
@ -56,7 +56,7 @@ To resize any element on a dashboard, click on the bottom-right corner and drag
|
|||
|
||||
### Go to chart
|
||||
|
||||
Quickly jump to the location of the chart in either the [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md) or if the chart refers to a single node, its single node dashboard by clicking the 3-dot icon in the corner of any chart to open a menu. Hit the **Go to Chart** item.
|
||||
Quickly jump to the location of the chart in either the [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md) or if the chart refers to a single node, its single node dashboard by clicking the 3-dot icon in the corner of any chart to open a menu. Hit the **Go to Chart** item.
|
||||
|
||||
You'll land directly on that chart of interest, but you can now scroll up and down to correlate your findings with other
|
||||
charts. Of course, you can continue to zoom, highlight, and pan through time just as you're used to with Netdata Charts.
|
||||
|
@ -93,4 +93,4 @@ Because of the visual complexity of individual charts, dashboards require a mini
|
|||
|
||||
## What's next?
|
||||
|
||||
Once you've designed a dashboard or two, make sure to [invite your team](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#invite-your-team) if you haven't already. You can add these new users to the same War Room to let them see the same dashboards without any effort.
|
||||
Once you've designed a dashboard or two, make sure to [invite your team](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#invite-your-team) if you haven't already. You can add these new users to the same War Room to let them see the same dashboards without any effort.
|
|
@ -61,12 +61,12 @@ At a high-level view, these are the domains from which the Events feed will prov
|
|||
|
||||
## Who can access the events?
|
||||
|
||||
All users will be able to see events from the Topology and Alerts domain but Auditing events, once these are added, will only be accessible to administrators. For more details check the [Netdata Role-Based Access model](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/role-based-access.md).
|
||||
All users will be able to see events from the Topology and Alerts domain but Auditing events, once these are added, will only be accessible to administrators. For more details check the [Netdata Role-Based Access model](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/role-based-access-model.md).
|
||||
|
||||
## How to use the events feed
|
||||
|
||||
1. Click on the **Events** tab (located near the top of your screen)
|
||||
1. You will be presented with a table listing the events that occurred from the timeframe defined on the [date time picker](https://github.com/netdata/netdata/blob/master/docs/dashboard/visualization-date-and-time-controls.md#date-and-time-selector)
|
||||
1. You will be presented with a table listing the events that occurred from the timeframe defined on the [date time picker](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/visualization-date-and-time-controls.md#date-and-time-selector)
|
||||
1. You can use the filtering capabilities available on right-hand bar to slice through the results provided. See more details on [event types and filters](#event-types-and-filters)
|
||||
|
||||
> **Note**
|
|
@ -49,7 +49,7 @@ The second table contains the top alerts in the last 24 hours, along with their
|
|||
|
||||
## Netdata Assistant shortcut
|
||||
|
||||
In the Home tab there is a shortcut button in order to start an instant conversation with the [Netdata Assistant](https://github.com/netdata/netdata/edit/master/docs/cloud/netdata-assistant.md).
|
||||
In the Home tab there is a shortcut button in order to start an instant conversation with the [Netdata Assistant](https://github.com/netdata/netdata/edit/master/docs/netdata-assistant.md).
|
||||
|
||||
## Space metrics
|
||||
|
|
@ -5,7 +5,7 @@ description: >-
|
|||
they've already happened, and are interoperable with any other node
|
||||
running Netdata."
|
||||
type: "how-to"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/blob/master/docs/dashboard/import-export-print-snapshot.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/import-export-print-snapshot.md"
|
||||
sidebar_label: "Import, export, and print a snapshot"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Tasks"
|
||||
|
@ -15,7 +15,7 @@ learn_rel_path: "Operations"
|
|||
# Import, export, and print a snapshot
|
||||
|
||||
>❗This feature is only available on v1 dashboards, it hasn't been port-forwarded to v2.
|
||||
> For more information on accessing dashboards check [this documentation](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/accessing-netdata-dashboards.md).
|
||||
> For more information on accessing dashboards check [this documentation](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/README.md).
|
||||
|
||||
|
||||
Netdata can export snapshots of the contents of your dashboard at a given time, which you can then import into any other
|
||||
|
@ -23,7 +23,7 @@ node running Netdata. Or, you can create a print-ready version of your dashboard
|
|||
paper.
|
||||
|
||||
Snapshots can be incredibly useful for diagnosing anomalies after they've already happened. Let's say Netdata triggered a warning alert while you were asleep. In the morning, you can [select the
|
||||
timeframe](https://github.com/netdata/netdata/blob/master/docs/dashboard/visualization-date-and-time-controls.md) when the alert triggered, export a snapshot, and send it to a
|
||||
timeframe](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/visualization-date-and-time-controls.md) when the alert triggered, export a snapshot, and send it to a
|
||||
|
||||
colleague for further analysis.
|
||||
|
|
@ -1,8 +1,8 @@
|
|||
# Kubernetes tab
|
||||
|
||||
The Netdata dashboards feature enhanced visualizations for the resource utilization of Kubernetes (k8s) clusters, embedded in the default [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md) dashboard.
|
||||
The Netdata dashboards feature enhanced visualizations for the resource utilization of Kubernetes (k8s) clusters, embedded in the default [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md) dashboard.
|
||||
|
||||
These visualizations include a health map for viewing the status of k8s pods/containers, in addition to [Netdata charts](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md) for viewing per-second CPU, memory, disk, and networking metrics from k8s nodes.
|
||||
These visualizations include a health map for viewing the status of k8s pods/containers, in addition to [Netdata charts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md) for viewing per-second CPU, memory, disk, and networking metrics from k8s nodes.
|
||||
|
||||
See our [Kubernetes deployment instructions](https://github.com/netdata/netdata/blob/master/packaging/installer/methods/kubernetes.md) for details on deploying Netdata on your Kubernetes cluster.
|
||||
|
||||
|
@ -35,7 +35,7 @@ their associated pods.
|
|||
|
||||
At the top of the Kubernetes containers section there is a map, that with a given context colorizes the containers in terms of their utilization.
|
||||
|
||||
The filtering of this map is controlled by using the [NIDL framework](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#nidl-framework) from the definition bar of the chart.
|
||||
The filtering of this map is controlled by using the [NIDL framework](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#nidl-framework) from the definition bar of the chart.
|
||||
|
||||
### Detailed information
|
||||
|
|
@ -2,7 +2,7 @@
|
|||
|
||||
The Logs tab is using the [`systemd` journal plugin](https://github.com/netdata/netdata/blob/master/src/collectors/systemd-journal.plugin/README.md), to present a structured view into your infrastructure's `systemd` logs.
|
||||
|
||||
We have a thorough section explaining how you can [work with logs](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/logs.md), detailing how the plugin works, and what other utilities are used under the hood to provide you with the visualizations and the log entries.
|
||||
We have a thorough section explaining how you can [work with logs](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/working-with-logs.md), detailing how the plugin works, and what other utilities are used under the hood to provide you with the visualizations and the log entries.
|
||||
|
||||
The [`systemd` journal plugin](https://github.com/netdata/netdata/blob/master/src/collectors/systemd-journal.plugin/README.md) documentation has information about:
|
||||
|
|
@ -1,14 +1,14 @@
|
|||
# Metrics tab and single node tabs
|
||||
|
||||
The Metrics tab is where all the time series [charts](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md) for all the nodes of a War Room are located.
|
||||
The Metrics tab is where all the time series [charts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md) for all the nodes of a War Room are located.
|
||||
|
||||
You can also see single-node dashboards, essentially the same dashboard the Metrics tab offers but only for one node. They are reached from most places in the UI, often by clicking the name of a node.
|
||||
|
||||
From this tab, a user can also reach the Integrations tab and run [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md)
|
||||
From this tab, a user can also reach the Integrations tab and run [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md)
|
||||
|
||||
## Dashboard structure
|
||||
|
||||
The dashboard consists of various charts presented in different chart types. They are categorized based on their [context](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#contexts) and at the beginning of each section, there is a predefined arrangement of charts helping you to get an overview for that particular section.
|
||||
The dashboard consists of various charts presented in different chart types. They are categorized based on their [context](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#contexts) and at the beginning of each section, there is a predefined arrangement of charts helping you to get an overview for that particular section.
|
||||
|
||||
## Chart navigation Menu
|
||||
|
|
@ -13,14 +13,14 @@ These charts provide a lot of useful information, so that you can:
|
|||
- View the combined anomaly rate of all underlying data with the [Anomaly Rate ribbon](#anomaly-rate-ribbon)
|
||||
- Explore even more details about a chart's metrics through [hovering over certain elements of it](#hover-over-the-chart)
|
||||
- Use intuitive tooling and shortcuts to pan, zoom or highlight areas of interest in your charts
|
||||
- On highlight, get easy access to [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md) to see other metrics with similar patterns
|
||||
- On highlight, get easy access to [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md) to see other metrics with similar patterns
|
||||
- Have the dimensions sorted based on name or value
|
||||
- View information about the chart, its plugin, context, and type
|
||||
- View individual metric collection status about a chart
|
||||
|
||||
These charts are available on Netdata Cloud's
|
||||
[Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md), [single sode tabs](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md) and
|
||||
on your [Custom Dashboards](https://github.com/netdata/netdata/blob/master/docs/cloud/dashboard/dashboards-tab.md).
|
||||
[Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md), [single sode tabs](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md) and
|
||||
on your [Custom Dashboards](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md).
|
||||
|
||||
## Overview
|
||||
|
||||
|
@ -104,7 +104,7 @@ When you start interacting with a chart, you'll notice valuable information on t
|
|||
|
||||
The elements that you can find on this top bar are:
|
||||
|
||||
- **Netdata icon**: this indicates that data is continuously being updated, this happens if [Time controls](https://github.com/netdata/netdata/blob/master/docs/dashboard/visualization-date-and-time-controls.md#time-controls) are in Play or Force Play mode.
|
||||
- **Netdata icon**: this indicates that data is continuously being updated, this happens if [Time controls](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/visualization-date-and-time-controls.md#time-controls) are in Play or Force Play mode.
|
||||
- **Chart title**: on the chart title you can see the title together with the metric being displayed, as well as the unit of measurement.
|
||||
- **Chart status icon**: possible values are: Loading, Timeout, Error or No data, otherwise this icon is not shown.
|
||||
|
||||
|
@ -318,7 +318,7 @@ All these indicators are also visualized per dimension, in the pop-over that app
|
|||
|
||||
## Play, Pause and Reset
|
||||
|
||||
Your charts are controlled using the available [Time controls](https://github.com/netdata/netdata/blob/master/docs/dashboard/visualization-date-and-time-controls.md#time-controls).
|
||||
Your charts are controlled using the available [Time controls](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/visualization-date-and-time-controls.md#time-controls).
|
||||
Besides these, when interacting with the chart you can also activate these controls by:
|
||||
|
||||
- Hovering over any chart to temporarily pause it - this momentarily switches time control to Pause, so that you can
|
||||
|
@ -363,7 +363,7 @@ it like pushing the current timeframe off the screen to see what came before or
|
|||
Selecting timeframes is useful when you see an interesting spike or change in a chart and want to investigate further by:
|
||||
|
||||
- Looking at the same period of time on other charts/sections
|
||||
- Running [metric correlations](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md) to filter metrics that also show something different in the selected period, vs the previous one
|
||||
- Running [metric correlations](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md) to filter metrics that also show something different in the selected period, vs the previous one
|
||||
|
||||
| Interaction | Keyboard/mouse | Touchpad/touchscreen |
|
||||
|:-----------------------------------|:---------------------------------------------------------|:---------------------|
|
|
@ -6,7 +6,7 @@ Inside the filter, the nodes get categorized into three groups:
|
|||
|
||||
| Group | Description |
|
||||
|---------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
| Live | Nodes that are currently online, collecting and streaming metrics to Cloud. Live nodes display raised [Alert](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md) counters, [Machine Learning](https://github.com/netdata/netdata/blob/master/src/ml/README.md) availability, and [Functions](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md) availability |
|
||||
| Live | Nodes that are currently online, collecting and streaming metrics to Cloud. Live nodes display raised [Alert](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/alerts-tab.md) counters, [Machine Learning](https://github.com/netdata/netdata/blob/master/src/ml/README.md) availability, and [Functions](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md) availability |
|
||||
| Stale | Nodes that are offline and not streaming metrics to Cloud. Only historical data can be presented from a parent node. For these nodes you can only see their ML status, as they are not online to provide more information |
|
||||
| Offline | Nodes that are offline, not streaming metrics to Cloud and not available in any parent node. Offline nodes are automatically deleted after 30 days and can also be deleted manually. |
|
||||
|
|
@ -1,6 +1,6 @@
|
|||
# Nodes tab
|
||||
|
||||
The nodes tab provides a summarized view of your [War Room](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-war-rooms), allowing you to view quick information per node.
|
||||
The nodes tab provides a summarized view of your [War Room](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-war-rooms), allowing you to view quick information per node.
|
||||
|
||||
> **Tip**
|
||||
>
|
||||
|
@ -28,7 +28,7 @@ Each node row allows you to:
|
|||
- View Machine Learning status
|
||||
- View Functions capability status
|
||||
- Add configuration (beta)
|
||||
- [Add alert silencing rules](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/manage-alert-notification-silencing-rules.md)
|
||||
- [Add alert silencing rules](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/manage-alert-notification-silencing-rules.md)
|
||||
- View a set of key attributes collected on your node
|
||||
|
||||
## Right bar
|
|
@ -1,15 +1,15 @@
|
|||
# Top tab
|
||||
|
||||
The Top tab allows you to run [Netdata Functions](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md) on a node where a Netdata Agent is running. These routines are exposed by a given collector.
|
||||
The Top tab allows you to run [Netdata Functions](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md) on a node where a Netdata Agent is running. These routines are exposed by a given collector.
|
||||
They can be used to retrieve additional information to help you troubleshoot or to trigger some action to happen on the node itself.
|
||||
|
||||
> **Tip**
|
||||
>
|
||||
> You can also execute a Function from the [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/nodes-tab.md), by pressing the `f(x)` button.
|
||||
> You can also execute a Function from the [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/nodes-tab.md), by pressing the `f(x)` button.
|
||||
|
||||
> **Note**
|
||||
>
|
||||
> If you get an error saying that your node can't execute Functions please check the [prerequisites](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md#prerequisites).
|
||||
> If you get an error saying that your node can't execute Functions please check the [prerequisites](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md#prerequisites).
|
||||
|
||||
The main view of this tab provides you with (depending on the Function) two elements: a visualization on the top and a table on the bottom.
|
||||
|
|
@ -27,7 +27,7 @@ The date and time selector allows you to change the visible timeframe and change
|
|||
|
||||
### Pick timeframes to visualize
|
||||
|
||||
While [panning through time and zooming in/out](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md) from charts it is helpful when you're looking a recent history, or want to do granular troubleshooting, what if you want to see metrics from 6 hours ago? Or 6 days?
|
||||
While [panning through time and zooming in/out](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md) from charts it is helpful when you're looking a recent history, or want to do granular troubleshooting, what if you want to see metrics from 6 hours ago? Or 6 days?
|
||||
|
||||
Netdata's dashboard features a **timeframe selector** to help you visualize specific timeframes in a few helpful ways.
|
||||
By default, it shows a certain number of minutes of historical metrics based on the your browser's viewport to ensure it's always showing per-second granularity.
|
||||
|
@ -81,7 +81,7 @@ beyond stored historical metrics, you'll see this message:
|
|||
|
||||

|
||||
|
||||
At any time, [configure the internal TSDB's storage capacity](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md) to expand your
|
||||
At any time, [configure the internal TSDB's storage capacity](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md) to expand your
|
||||
depth of historical metrics.
|
||||
|
||||
### Timezone selector
|
|
@ -38,7 +38,7 @@ With both Parent Agents connected to Cloud, Cloud will route queries to either P
|
|||
|
||||
### Stand-alone Deployment
|
||||
|
||||
The stand-alone setup is configured out of the box with reasonable defaults, but please consult our [configuration documentation](https://github.com/netdata/netdata/blob/master/docs/cloud/cheatsheet.md) for details, including the overview of [common configuration changes](https://github.com/netdata/netdata/blob/master/docs/configure/common-changes.md).
|
||||
The stand-alone setup is configured out of the box with reasonable defaults, but please consult our [configuration documentation](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/cheatsheet.md) for details, including the overview of [common configuration changes](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/common-configuration-changes.md).
|
||||
|
||||
### Parent – Child Deployment
|
||||
|
||||
|
@ -101,7 +101,7 @@ Set the following parameters:
|
|||
|
||||
#### Parent config
|
||||
|
||||
For the Parent, besides setting up streaming, the example will also provide an example configuration of multiple [tiers](https://github.com/netdata/netdata/blob/master/src/database/engine/README.md#tiering) of metrics [storage](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md), for 10 children, with about 2k metrics each.
|
||||
For the Parent, besides setting up streaming, the example will also provide an example configuration of multiple [tiers](https://github.com/netdata/netdata/blob/master/src/database/engine/README.md#tiering) of metrics [storage](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md), for 10 children, with about 2k metrics each.
|
||||
|
||||
- 1s granularity at tier 0 for 1 week
|
||||
- 1m granularity at tier 1 for 1 month
|
||||
|
@ -209,7 +209,7 @@ On both Netdata Parent and all Child Agents, edit `stream.conf` with `/etc/netda
|
|||
We strongly recommend the following configuration changes for production deployments:
|
||||
|
||||
1. Understand Netdata's [security and privacy design](https://github.com/netdata/netdata/blob/master/docs/security-and-privacy-design/README.md) and
|
||||
[secure your nodes](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/secure-nodes.md)
|
||||
[secure your nodes](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/securing-netdata-agents.md)
|
||||
|
||||
To safeguard your infrastructure and comply with your organization's security policies.
|
||||
|
||||
|
@ -220,20 +220,20 @@ We strongly recommend the following configuration changes for production deploym
|
|||
- Increase data retention.
|
||||
- Make your data highly available.
|
||||
|
||||
3. [Optimize the Netdata Agents system utilization and performance](https://github.com/netdata/netdata/blob/master/docs/guides/configure/performance.md)
|
||||
3. [Optimize the Netdata Agents system utilization and performance](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimize-the-netdata-agents-performance.md)
|
||||
|
||||
To save valuable system resources, especially when running on weak IoT devices.
|
||||
|
||||
We also suggest that you:
|
||||
|
||||
1. [Use Netdata Cloud to access the dashboards](https://github.com/netdata/netdata/blob/master/docs/quickstart/infrastructure.md)
|
||||
1. [Use Netdata Cloud to access the dashboards](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/monitor-your-infrastructure.md)
|
||||
|
||||
For increased security, user management and access to our latest tools for advanced dashboarding and troubleshooting.
|
||||
|
||||
2. [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md)
|
||||
2. [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md)
|
||||
|
||||
To control Netdata's memory use, when you have a lot of ephemeral metrics.
|
||||
|
||||
3. [Use host labels](https://github.com/netdata/netdata/blob/master/docs/guides/using-host-labels.md)
|
||||
3. [Use host labels](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/organize-systems-metrics-and-alerts.md)
|
||||
|
||||
To organize systems, metrics, and alerts.
|
||||
|
|
|
@ -2,7 +2,7 @@
|
|||
|
||||
> ### Disclaimer
|
||||
>
|
||||
> This document is only applicable to the v1 version of the dashboard and doesn't affect the [Netdata Dashboard](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/accessing-netdata-dashboards.md).
|
||||
> This document is only applicable to the v1 version of the dashboard and doesn't affect the [Netdata Dashboard](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/README.md).
|
||||
|
||||
While the [Netdata dashboard](https://github.com/netdata/netdata/blob/master/src/web/gui/README.md) comes preconfigured with hundreds of charts and
|
||||
thousands of metrics, you may want to alter your experience based on a particular use case or preferences.
|
||||
|
@ -21,8 +21,7 @@ Here are a few popular settings:
|
|||
|
||||
### Change chart legend position
|
||||
|
||||
Find this setting under the **Visual** tab. By default, Netdata places the
|
||||
[legend of dimensions](https://github.com/netdata/netdata/blob/master/docs/dashboard/dimensions-contexts-families.md#dimension) _below_ charts.
|
||||
Find this setting under the **Visual** tab. By default, Netdata places the legend of dimensions _below_ charts.
|
||||
Click this toggle to move the legend to the _right_ of charts.
|
||||
|
||||
|
||||
|
@ -65,7 +64,7 @@ Edit the file with customizations to the `title`, `icon`, and `info` fields. Rep
|
|||
icon from [Font Awesome](https://fontawesome.com/cheatsheet) to customize the icons that appear throughout the
|
||||
dashboard.
|
||||
|
||||
Save the file, then navigate to your [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md) to edit `netdata.conf`. Add
|
||||
Save the file, then navigate to your [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) to edit `netdata.conf`. Add
|
||||
the following line to the `[web]` section to tell Netdata where to find your custom configuration.
|
||||
|
||||
```conf
|
|
@ -123,7 +123,7 @@ visualizations](https://user-images.githubusercontent.com/1153921/109049195-349f
|
|||
|
||||
### Health map
|
||||
|
||||
The first visualization is the [health map](https://github.com/netdata/netdata/blob/master/docs/dashboard/kubernetes-tab.md#health-map),
|
||||
The first visualization is the [health map](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/kubernetes-tab.md#health-map),
|
||||
which places each container into its own box, then varies the intensity of their color to visualize the resource
|
||||
utilization. By default, the health map shows the **average CPU utilization as a percentage of the configured limit**
|
||||
for every container in your cluster.
|
||||
|
@ -156,7 +156,7 @@ different namespaces.
|
|||

|
||||
|
||||
Each composite chart has a [definition bar](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#definition-bar)
|
||||
Each composite chart has a [definition bar](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#definition-bar)
|
||||
for complete customization. For example, grouping the top chart by `k8s_container_name` reveals new information.
|
||||
|
||||

|
|
@ -61,7 +61,7 @@ replacing `NODE` with the hostname or IP address of your system.
|
|||
|
||||
## Enable hardware and Linux system monitoring
|
||||
|
||||
There's nothing you need to do to enable [system monitoring](https://github.com/netdata/netdata/blob/master/docs/collect/system-metrics.md) and Linux monitoring with
|
||||
There's nothing you need to do to enable [system monitoring](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/system-metrics.md) and Linux monitoring with
|
||||
the Netdata Agent, which autodetects metrics from CPUs, memory, disks, networking devices, and Linux processes like
|
||||
systemd without any configuration. If you're using containers, Netdata automatically collects resource utilization
|
||||
metrics from each using the [cgroups data collector](https://github.com/netdata/netdata/blob/master/src/collectors/cgroups.plugin/README.md).
|
||||
|
@ -164,7 +164,7 @@ If the Netdata Agent isn't already open in your browser, open a new tab and navi
|
|||
Netdata automatically organizes all metrics and charts onto a single page for easy navigation. Peek at gauges to see
|
||||
overall system performance, then scroll down to see more. Click-and-drag with your mouse to pan _all_ charts back and
|
||||
forth through different time intervals, or hold `SHIFT` and use the scrollwheel (or two-finger scroll) to zoom in and
|
||||
out. Check out our doc on [interacting with charts](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md) for all the details.
|
||||
out. Check out our doc on [interacting with charts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md) for all the details.
|
||||
|
||||

|
||||
|
||||
|
@ -197,7 +197,7 @@ Here's a quick reference for what charts you might want to focus on after settin
|
|||
The Netdata Agent comes with hundreds of pre-configured alerts to help you keep tabs on your system, including 19 alerts
|
||||
designed for smarter LAMP stack monitoring.
|
||||
|
||||
Click the 🔔 icon in the top navigation to [see active alerts](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md). The **Active** tabs
|
||||
Click the 🔔 icon in the top navigation to [see active alerts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/alerts-tab.md). The **Active** tabs
|
||||
shows any alerts currently triggered, while the **All** tab displays a list of _every_ pre-configured alert. The
|
||||
|
||||
 based on your infrastructure monitoring needs, and to see these alerts
|
||||
in other places, like your inbox or a Slack channel, [enable a notification
|
||||
method](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md).
|
||||
method](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/README.md).
|
||||
|
||||
## What's next?
|
||||
|
||||
|
@ -225,7 +225,7 @@ we're not covering it here, but it _does_ work in a single-node setup. Just don'
|
|||
node crashed.
|
||||
|
||||
If you're planning on managing more than one node, or want to take advantage of advanced features, like finding the
|
||||
source of issues faster with [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md),
|
||||
source of issues faster with [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md),
|
||||
[sign up](https://app.netdata.cloud/sign-up?cloudRoute=/spaces) for a free Netdata Cloud account.
|
||||
|
||||
### Related reference documentation
|
|
@ -239,16 +239,16 @@ same application on multiple systems and want to correlate how it performs on ea
|
|||
findings with someone else on your team.
|
||||
|
||||
If you don't already have a Netdata Cloud account, go [sign in](https://app.netdata.cloud) and get started for free.
|
||||
You can also read how to [monitor your infrastructure with Netdata Cloud](https://github.com/netdata/netdata/blob/master/docs/quickstart/infrastructure.md) to understand the key features that it has to offer.
|
||||
You can also read how to [monitor your infrastructure with Netdata Cloud](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/monitor-your-infrastructure.md) to understand the key features that it has to offer.
|
||||
|
||||
Once you've added one or more nodes to a Space in Netdata Cloud, you can see aggregated eBPF metrics in the Overview
|
||||
dashboard under the same **Applications** or **eBPF** sections that you
|
||||
find on the local Agent dashboard. Or, [create new dashboards](https://github.com/netdata/netdata/blob/master/docs/dashboard/dashboards-tab.md) using eBPF metrics
|
||||
find on the local Agent dashboard. Or, [create new dashboards](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md) using eBPF metrics
|
||||
from any number of distributed nodes to see how your application interacts with multiple Linux kernels on multiple Linux
|
||||
systems.
|
||||
|
||||
Now that you can see eBPF metrics in Netdata Cloud, you can [invite your
|
||||
team](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#invite-your-team) and share your findings with others.
|
||||
team](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#invite-your-team) and share your findings with others.
|
||||
|
||||
|
||||
|
|
@ -100,8 +100,6 @@ part of your system might affect another.
|
|||

|
||||
|
||||
If you're completely new to Netdata, look at the [Introduction](https://github.com/netdata/netdata/blob/master/docs/getting-started/introduction.md) section for a walkthrough of all its features. For a more expedited tour, see the [get started documentation](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md).
|
||||
|
||||
### Enable temperature sensor monitoring
|
||||
|
||||
You need to manually enable Netdata's built-in [temperature sensor
|
||||
|
@ -137,6 +135,6 @@ more than 256.
|
|||
```
|
||||
|
||||
Use our [database sizing
|
||||
calculator](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md#calculate-the-system-resources-ram-disk-space-needed-to-store-metrics)
|
||||
calculator](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md#calculate-the-system-resources-ram-disk-space-needed-to-store-metrics)
|
||||
and the [Database configuration documentation](https://github.com/netdata/netdata/blob/master/src/database/README.md) to help you determine the right
|
||||
setting for your Raspberry Pi.
|
|
@ -39,7 +39,7 @@ With Netdata's process monitoring, you can:
|
|||
|
||||
- One or more Linux nodes running [Netdata](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md)
|
||||
- A general understanding of how
|
||||
to [configure the Netdata Agent](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md)
|
||||
to [configure the Netdata Agent](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md)
|
||||
using `edit-config`.
|
||||
- A Netdata Cloud account. [Sign up](https://app.netdata.cloud) if you don't have one already.
|
||||
|
||||
|
@ -130,7 +130,7 @@ aware of hundreds of processes, and collects metrics from them automatically.
|
|||
But, if you want to change the grouping behavior, add an application that isn't yet supported in the Netdata Agent, or
|
||||
monitor a custom application, you need to edit the `apps_groups.conf` configuration file.
|
||||
|
||||
Navigate to your [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md) and
|
||||
Navigate to your [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) and
|
||||
use `edit-config` to edit the file.
|
||||
|
||||
```bash
|
|
@ -23,8 +23,8 @@ Read on to learn all the steps and enable unsupervised anomaly detection on your
|
|||
|
||||
First make sure Netdata is using Python 3 when it runs Python-based data collectors.
|
||||
|
||||
Next, open `netdata.conf` using [`edit-config`](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md#use-edit-config-to-edit-configuration-files)
|
||||
from within the [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration.md#the-netdata-config-directory). Scroll down to the
|
||||
Next, open `netdata.conf` using [`edit-config`](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md#edit-netdataconf)
|
||||
from within the [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md#the-netdata-config-directory). Scroll down to the
|
||||
`[plugin:python.d]` section to pass in the `-ppython3` command option.
|
||||
|
||||
```conf
|
|
@ -86,7 +86,7 @@ RestartSec=5s
|
|||
WantedBy=multi-user.target
|
||||
```
|
||||
|
||||
You can edit the configuration file using the `edit-config` script from the Netdata [config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration.md#the-netdata-config-directory).
|
||||
You can edit the configuration file using the `edit-config` script from the Netdata [config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md#the-netdata-config-directory).
|
||||
|
||||
- Edit `netdata.conf` and input:
|
||||
|
|
@ -295,7 +295,7 @@ Netdata Agent installation will have commands under the same paths. When applica
|
|||
path, providing a recommendation or instructions on how to view the running configuration, which includes the correct
|
||||
paths.
|
||||
|
||||
For example, the [configuration](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md) doc first
|
||||
For example, the [configuration](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) doc first
|
||||
teaches users how to find the Netdata config
|
||||
directory and navigate to it, then runs commands from the `/etc/netdata` path so that the instructions are more
|
||||
universal.
|
||||
|
@ -305,7 +305,7 @@ Don't include full paths, beginning from the system's root (`/`), as these might
|
|||
| | |
|
||||
|-----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
| Not recommended | Use `edit-config` to edit Netdata's configuration: `sudo /etc/netdata/edit-config netdata.conf`. |
|
||||
| **Recommended** | Use `edit-config` to edit Netdata's configuration by first navigating to your [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration.md#the-netdata-config-directory), which is typically at `/etc/netdata`, then running `sudo edit-config netdata.conf`. |
|
||||
| **Recommended** | Use `edit-config` to edit Netdata's configuration by first navigating to your [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md#the-netdata-config-directory), which is typically at `/etc/netdata`, then running `sudo edit-config netdata.conf`. |
|
||||
|
||||
### `sudo`
|
||||
|
|
@ -1,13 +1,3 @@
|
|||
<!--
|
||||
title: "Export metrics to external time-series databases"
|
||||
description: "Use the exporting engine to send Netdata metrics to popular external time series databases for long-term storage or further analysis."
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/export/external-databases.md"
|
||||
sidebar_label: "Export metrics to external time-series databases"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Concepts"
|
||||
learn_rel_path: "Concepts"
|
||||
-->
|
||||
|
||||
# Export metrics to external time-series databases
|
||||
|
||||
Netdata allows you to export metrics to external time-series databases with the [exporting
|
||||
|
@ -30,7 +20,7 @@ analysis, or correlation with other tools, such as application tracing.
|
|||
|
||||
Netdata supports exporting metrics to the following databases through several
|
||||
[connectors](https://github.com/netdata/netdata/blob/master/src/exporting/README.md#features). Once you find the connector that works for your database, open its
|
||||
documentation and the [enabling a connector](https://github.com/netdata/netdata/blob/master/docs/export/enable-connector.md) doc for details on enabling it.
|
||||
documentation and the [enabling a connector](https://github.com/netdata/netdata/blob/master/docs/exporting-metrics/enable-an-exporting-connector.md) doc for details on enabling it.
|
||||
|
||||
- **AppOptics**: [Prometheus remote write](https://github.com/netdata/netdata/blob/master/src/exporting/prometheus/remote_write/README.md)
|
||||
- **AWS Kinesis**: [AWS Kinesis Data Streams](https://github.com/netdata/netdata/blob/master/src/exporting/aws_kinesis/README.md)
|
|
@ -1,30 +1,20 @@
|
|||
<!--
|
||||
title: "Enable an exporting connector"
|
||||
description: "Learn how to enable and configure any connector using examples to start exporting metrics to external time-series databases in minutes."
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/export/enable-connector.md"
|
||||
sidebar_label: "Enable an exporting connector"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Tasks"
|
||||
learn_rel_path: "Configuration"
|
||||
-->
|
||||
|
||||
# Enable an exporting connector
|
||||
|
||||
Now that you found the right connector for your [external time-series
|
||||
database](https://github.com/netdata/netdata/blob/master/docs/export/external-databases.md#supported-databases), you can now enable the exporting engine and the
|
||||
database](https://github.com/netdata/netdata/blob/master/docs/exporting-metrics/README.md#supported-databases), you can now enable the exporting engine and the
|
||||
connector itself. We'll walk through the process of enabling the exporting engine itself, followed by two examples using
|
||||
the OpenTSDB and Graphite connectors.
|
||||
|
||||
> When you enable the exporting engine and a connector, the Netdata Agent exports metrics _beginning from the time you
|
||||
> restart its process_, not the entire
|
||||
> [database of long-term metrics](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md).
|
||||
> [database of long-term metrics](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md).
|
||||
|
||||
Once you understand the process of enabling a connector, you can translate that knowledge to any other connector.
|
||||
|
||||
## Enable the exporting engine
|
||||
|
||||
Use `edit-config` from your
|
||||
[Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration.md#the-netdata-config-directory)
|
||||
[Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md#the-netdata-config-directory)
|
||||
to open `exporting.conf`:
|
||||
|
||||
```bash
|
||||
|
@ -94,7 +84,7 @@ the [exporting engine reference](https://github.com/netdata/netdata/blob/master/
|
|||
|
||||
For a comprehensive example of using the Graphite connector, read our documentation on
|
||||
[exporting metrics to Graphite providers](https://github.com/netdata/netdata/blob/master/src/exporting/graphite/README.md). Or, start
|
||||
[using host labels](https://github.com/netdata/netdata/blob/master/docs/guides/using-host-labels.md) on exported metrics.
|
||||
[using host labels](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/organize-systems-metrics-and-alerts.md) on exported metrics.
|
||||
|
||||
### Related reference documentation
|
||||
|
|
@ -1,192 +0,0 @@
|
|||
# Getting started with Netdata
|
||||
|
||||
Learn how Netdata can get you monitoring your infrastructure in minutes.
|
||||
|
||||
## What is Netdata ?
|
||||
|
||||
Netdata is designed by system administrators, DevOps engineers, and developers to collect everything, help you visualize
|
||||
metrics, troubleshoot complex performance problems, and make data interoperable with the rest of your monitoring stack.
|
||||
|
||||
You can install Netdata on most Linux distributions (Ubuntu, Debian, CentOS, and more), container platforms (Kubernetes
|
||||
clusters, Docker), and many other operating systems (FreeBSD).
|
||||
|
||||
Netdata is:
|
||||
|
||||
### Simple to deploy
|
||||
|
||||
- **One-line deployment** for Linux distributions, plus support for Kubernetes/Docker infrastructures.
|
||||
- **Zero configuration and maintenance** required to collect thousands of metrics, every second, from the underlying
|
||||
OS and running applications.
|
||||
- **Prebuilt charts and alerts** alert you to common anomalies and performance issues without manual configuration.
|
||||
- **Distributed storage** to simplify the cost and complexity of storing metrics data from any number of nodes.
|
||||
|
||||
### Powerful and scalable
|
||||
|
||||
- **1% CPU utilization, a few MB of RAM, and minimal disk I/O** to run the monitoring Agent on bare metal, virtual
|
||||
machines, containers, and even IoT devices.
|
||||
- **Per-second granularity** for an unlimited number of metrics based on the hardware and applications you're running
|
||||
on your nodes.
|
||||
- **Interoperable exporters** let you connect Netdata's per-second metrics with an existing monitoring stack and other
|
||||
time-series databases.
|
||||
|
||||
### Optimized for troubleshooting
|
||||
|
||||
- **Visual anomaly detection** with a UI/UX that emphasizes the relationships between charts.
|
||||
- **Customizable dashboards** to pinpoint correlated metrics, respond to incidents, and help you streamline your
|
||||
workflows.
|
||||
- **Distributed metrics in a centralized interface** to assist users or teams trace complex issues between distributed
|
||||
nodes.
|
||||
|
||||
### Secure by design
|
||||
|
||||
- **Distributed data architecture** so fast and efficient, there’s no limit to the number of metrics you can follow.
|
||||
- Because your data is **stored at the edge**, security is ensured.
|
||||
|
||||
### Comparison with other monitoring solutions
|
||||
|
||||
Netdata offers many benefits over the existing monitoring landscape, whether they're expensive SaaS products or other
|
||||
open-source tools.
|
||||
|
||||
| Netdata | Others (open-source and commercial) |
|
||||
|:----------------------------------------------------------------|:-----------------------------------------------------------------|
|
||||
| **High resolution metrics** (1s granularity) | Low resolution metrics (10s granularity at best) |
|
||||
| Collects **thousands of metrics per node** | Collects just a few metrics |
|
||||
| Fast UI optimized for **anomaly detection** | UI is good for just an abstract view |
|
||||
| **Long-term, autonomous storage** at one-second granularity | Centralized metrics in an expensive data lake at 10s granularity |
|
||||
| **Meaningful presentation**, to help you understand the metrics | You have to know the metrics before you start |
|
||||
| Install and get results **immediately** | Long sales process and complex installation process |
|
||||
| Use it for **troubleshooting** performance problems | Only gathers _statistics of past performance_ |
|
||||
| **Kills the console** for tracing performance issues | The console is always required for troubleshooting |
|
||||
| Requires **zero dedicated resources** | Require large dedicated resources |
|
||||
|
||||
Netdata works with tons of applications, notifications platforms, and other time-series databases:
|
||||
|
||||
- **300+ system, container, and application endpoints**: Collectors autodetect metrics from default endpoints and
|
||||
immediately visualize them into meaningful charts designed for troubleshooting. See [everything we
|
||||
support](https://github.com/netdata/netdata/blob/master/src/collectors/COLLECTORS.md).
|
||||
- **20+ notification platforms**: Netdata's health watchdog sends warning and critical alerts to your [favorite
|
||||
platform](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md) to inform you of anomalies just seconds
|
||||
after they affect your node.
|
||||
- **30+ external time-series databases**: Export resampled metrics as they're collected to other [local- and
|
||||
Cloud-based databases](https://github.com/netdata/netdata/blob/master/docs/export/external-databases.md) for best-in-class
|
||||
interoperability.
|
||||
|
||||
## How it works
|
||||
|
||||
Netdata is a highly efficient, highly modular, metrics management engine. Its lockless design makes it ideal for concurrent operations on the metrics.
|
||||
|
||||
You can see a high level representation in the following diagram.
|
||||
|
||||

|
||||
|
||||
And a higher level diagram in this one.
|
||||
|
||||

|
||||
|
||||
You can even visit this slightly dated [interactive infographic](https://my-netdata.io/infographic.html) and get lost in a rabbit hole.
|
||||
|
||||
But the best way to get under the hood or in the steering wheel of our highly efficient, low-latency system (supporting multiple readers and one writer on each metric) is to read the rest of our docs, or just to jump in and [get started](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md). But here's a good breakdown:
|
||||
|
||||
### Netdata Agent
|
||||
|
||||
Netdata's distributed monitoring Agent collects thousands of metrics from systems, hardware, and applications with zero configuration. It runs permanently on all your physical/virtual servers, containers, cloud deployments, and edge/IoT devices.
|
||||
|
||||
You can install Netdata on most Linux distributions (Ubuntu, Debian, CentOS, and more), container/microservice platforms (Kubernetes clusters, Docker), and many other operating systems (FreeBSD, macOS), with no sudo required.
|
||||
|
||||
### Netdata Cloud
|
||||
|
||||
Netdata Cloud is a web application that gives you real-time visibility for your entire infrastructure. With Netdata Cloud, you can view key metrics, insightful charts, and active alerts from all your nodes in a single web interface. When an anomaly strikes, seamlessly navigate to any node to troubleshoot and discover the root cause with the familiar Netdata dashboard.
|
||||
|
||||
Netdata Cloud is free! You can add an entire infrastructure of nodes, invite all your colleagues, and visualize any number of metrics, charts, and alerts entirely for free.
|
||||
|
||||
While Netdata Cloud offers a centralized method of monitoring your Agents, your metrics data is not stored or centralized in any way. Metrics data remains with your nodes and is only streamed to your browser, through Cloud, when you're viewing the Netdata Cloud interface.
|
||||
|
||||
## Use Netdata standalone or as part of your monitoring stack
|
||||
|
||||
Netdata is an extremely powerful monitoring, visualization, and troubleshooting platform. While you can use it as an
|
||||
effective standalone tool, we also designed it to be open and interoperable with other tools you might already be using.
|
||||
|
||||
Netdata helps you collect everything and scales to infrastructure of any size, but it doesn't lock-in data or force you
|
||||
to use specific tools or methodologies. Each feature is extensible and interoperable so they can work in parallel with
|
||||
other tools. For example, you can use Netdata to collect metrics, visualize metrics with a second open-source program,
|
||||
and centralize your metrics in a cloud-based time-series database solution for long-term storage or further analysis.
|
||||
|
||||
You can build a new monitoring stack, including Netdata, or integrate Netdata's metrics with your existing monitoring
|
||||
stack. No matter which route you take, Netdata helps you monitor infrastructure of any size.
|
||||
|
||||
Here are a few ways to enrich your existing monitoring and troubleshooting stack with Netdata:
|
||||
|
||||
### Collect metrics from Prometheus endpoints
|
||||
|
||||
Netdata automatically detects 600 popular endpoints and collects per-second metrics from them via the [generic
|
||||
Prometheus collector](https://github.com/netdata/netdata/blob/master/src/go/collectors/go.d.plugin/modules/prometheus/README.md). This even
|
||||
includes support for Windows 10 via [`windows_exporter`](https://github.com/prometheus-community/windows_exporter).
|
||||
|
||||
This collector is installed and enabled on all Agent installations by default, so you don't need to waste time
|
||||
configuring Netdata. Netdata will detect these Prometheus metrics endpoints and collect even more granular metrics than
|
||||
your existing solutions. You can now use all of Netdata's meaningfully-visualized charts to diagnose issues and
|
||||
troubleshoot anomalies.
|
||||
|
||||
### Export metrics to external time-series databases
|
||||
|
||||
Netdata can send its per-second metrics to external time-series databases, such as InfluxDB, Prometheus, Graphite,
|
||||
TimescaleDB, ElasticSearch, AWS Kinesis Data Streams, Google Cloud Pub/Sub Service, and many others.
|
||||
|
||||
Once you have Netdata's metrics in a secondary time-series database, you can use them however you'd like, such as
|
||||
additional visualization/dashboarding tools or aggregation of data from multiple sources.
|
||||
|
||||
### Visualize metrics with Grafana
|
||||
|
||||
One popular monitoring stack is Netdata, Prometheus, and Grafana. Netdata acts as the stack's metrics collection
|
||||
powerhouse, Prometheus as the time-series database, and Grafana as the visualization platform. You can also use Grafite instead of Prometheus,
|
||||
or directly use the [Netdata source plugin for Grafana](https://blog.netdata.cloud/introducing-netdata-source-plugin-for-grafana/)
|
||||
|
||||
Of course, just because you export or visualize metrics elsewhere, it doesn't mean Netdata's equivalent features
|
||||
disappear. You can always build new dashboards in Netdata Cloud, drill down into per-second metrics using Netdata's
|
||||
charts, or use Netdata's health watchdog to send notifications whenever an anomaly strikes.
|
||||
|
||||
## Community
|
||||
|
||||
Netdata is an inclusive open-source project and community. Please read our [Code of Conduct](https://github.com/netdata/.github/blob/main/CODE_OF_CONDUCT.md).
|
||||
|
||||
Find most of the Netdata team in our [community forums](https://community.netdata.cloud). It's the best place to
|
||||
ask questions, find resources, and engage with passionate professionals. The team is also available and active in our [Discord](https://discord.com/invite/2mEmfW735j) too.
|
||||
|
||||
You can also find Netdata on:
|
||||
|
||||
- [Twitter](https://twitter.com/netdatahq)
|
||||
- [YouTube](https://www.youtube.com/c/Netdata)
|
||||
- [Reddit](https://www.reddit.com/r/netdata/)
|
||||
- [LinkedIn](https://www.linkedin.com/company/netdata-cloud/)
|
||||
- [StackShare](https://stackshare.io/netdata)
|
||||
- [Product Hunt](https://www.producthunt.com/posts/netdata-monitoring-agent/)
|
||||
- [Repology](https://repology.org/metapackage/netdata/versions)
|
||||
- [Facebook](https://www.facebook.com/linuxnetdata/)
|
||||
|
||||
## Contribute
|
||||
|
||||
Contributions are the lifeblood of open-source projects. While we continue to invest in and improve Netdata, we need help to democratize monitoring!
|
||||
|
||||
- Check our [Security Policy](https://github.com/netdata/netdata/security/policy).
|
||||
- Found a bug? Open a [GitHub issue](https://github.com/netdata/netdata/issues/new?assignees=&labels=bug%2Cneeds+triage&template=BUG_REPORT.yml&title=%5BBug%5D%3A+).
|
||||
- Read our [Contributing Guide](https://github.com/netdata/.github/blob/main/CONTRIBUTING.md), which contains all the information you need to contribute to Netdata, such as improving our documentation, engaging in the community, and developing new features. We've made it as frictionless as possible, but if you need help, just ping us on our community forums!
|
||||
|
||||
Package maintainers should read the guide on [building Netdata from source](https://github.com/netdata/netdata/blob/master/packaging/installer/methods/source.md) for
|
||||
instructions on building each Netdata component from source and preparing a package.
|
||||
|
||||
## License
|
||||
|
||||
The Netdata Agent is an open source project distributed under [GPLv3+](https://github.com/netdata/netdata/blob/master/LICENSE). Netdata re-distributes other open-source tools and libraries. Please check the
|
||||
[third party licenses](https://github.com/netdata/netdata/blob/master/REDISTRIBUTED.md).
|
||||
|
||||
## Is it any good?
|
||||
|
||||
Yes.
|
||||
|
||||
_When people first hear about a new product, they frequently ask if it is any good. A Hacker News user
|
||||
[remarked](https://news.ycombinator.com/item?id=3067434):_
|
||||
|
||||
> Note to self: Starting immediately, all raganwald projects will have a “Is it any good?” section in the readme, and
|
||||
> the answer shall be "yes.".
|
||||
*******************************************************************************
|
|
@ -17,21 +17,21 @@ Use the alphabatized list below to find the answer to your single-term questions
|
|||
|
||||
- [**Agent-cloud link** or **ACLK**](https://github.com/netdata/netdata/blob/master/src/aclk/README.md): The Agent-Cloud link (ACLK) is the mechanism responsible for securely connecting a Netdata Agent to your web browser through Netdata Cloud.
|
||||
|
||||
- [**Aggregate Function**](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#aggregate-functions-over-time): A function applied When the granularity of the data collected is higher than the plotted points on the chart.
|
||||
- [**Aggregate Function**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#aggregate-functions-over-time): A function applied When the granularity of the data collected is higher than the plotted points on the chart.
|
||||
|
||||
- [**Alerts** (formerly **Alarms**)](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md): With the information that appears on Netdata Cloud and the local dashboard about active alerts, you can configure alerts to match your infrastructure's needs or your team's goals.
|
||||
- [**Alerts** (formerly **Alarms**)](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md): With the information that appears on Netdata Cloud and the local dashboard about active alerts, you can configure alerts to match your infrastructure's needs or your team's goals.
|
||||
|
||||
- [**Alarm Entity Type**](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md#health-entity-reference): Entity types that are attached to specific charts and use the `alarm` label.
|
||||
|
||||
- [**Anomaly Advisor**](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md): A Netdata feature that lets you focus on potentially anomalous metrics and charts related to a particular highlight window of interest.
|
||||
- [**Anomaly Advisor**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/anomaly-advisor-tab.md): A Netdata feature that lets you focus on potentially anomalous metrics and charts related to a particular highlight window of interest.
|
||||
|
||||
## B
|
||||
|
||||
- [**Bookmarks**](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#manage-spaces): Netdata Cloud's bookmarks put your tools in one accessible place. Bookmarks are shared between all War Rooms in a Space, so any users in your Space will be able to see and use them.
|
||||
- [**Bookmarks**](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#manage-spaces): Netdata Cloud's bookmarks put your tools in one accessible place. Bookmarks are shared between all War Rooms in a Space, so any users in your Space will be able to see and use them.
|
||||
|
||||
## C
|
||||
|
||||
- [**Child**](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md#streaming-basics): A node, running Netdata, that streams metric data to one or more parent.
|
||||
- [**Child**](https://github.com/netdata/netdata/blob/master/docs/observability-centralization-points/metrics-centralization-points/README.md): A node, running Netdata, that streams metric data to one or more parent.
|
||||
|
||||
- [**Cloud** or **Netdata Cloud**](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/README.md): Netdata Cloud is a web application that gives you real-time visibility for your entire infrastructure. With Netdata Cloud, you can view key metrics, insightful charts, and active alerts from all your nodes in a single web interface.
|
||||
|
||||
|
@ -39,17 +39,17 @@ Use the alphabatized list below to find the answer to your single-term questions
|
|||
|
||||
- [**Community**](https://community.netdata.cloud/): As a company with a passion and genesis in open-source, we are not just very proud of our community, but we consider our users, fans, and chatters to be an imperative part of the Netdata experience and culture.
|
||||
|
||||
- [**Context**](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#contexts): A way of grouping charts by the types of metrics collected and dimensions displayed. It's kind of like a machine-readable naming and organization scheme.
|
||||
- [**Context**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#contexts): A way of grouping charts by the types of metrics collected and dimensions displayed. It's kind of like a machine-readable naming and organization scheme.
|
||||
|
||||
- [**Custom dashboards**](https://github.com/netdata/netdata/blob/master/src/web/gui/custom/README.md) A dashboard that you can create using simple HTML (no javascript is required for basic dashboards).
|
||||
|
||||
## D
|
||||
|
||||
- [**Dashboard**](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/accessing-netdata-dashboards.md): Out-of-the-box visual representation of metrics to provide insight into your infrastructure, its health and performance.
|
||||
- [**Dashboard**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/README.md): Out-of-the-box visual representation of metrics to provide insight into your infrastructure, its health and performance.
|
||||
|
||||
- [**Definition Bar**](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md): Bar within a composite chart that provides important information and options about the metrics within the chart.
|
||||
- [**Definition Bar**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md): Bar within a composite chart that provides important information and options about the metrics within the chart.
|
||||
|
||||
- [**Dimension**](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#dimensions): A dimension is a value that gets shown on a chart.
|
||||
- [**Dimension**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#dimensions): A dimension is a value that gets shown on a chart.
|
||||
|
||||
## E
|
||||
|
||||
|
@ -57,27 +57,21 @@ Use the alphabatized list below to find the answer to your single-term questions
|
|||
|
||||
## F
|
||||
|
||||
- [**Family**](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#families): 1. What we consider our Netdata community of users and engineers. 2. A single instance of a hardware or software resource that needs to be displayed separately from similar instances.
|
||||
- [**Family**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#families): 1. What we consider our Netdata community of users and engineers. 2. A single instance of a hardware or software resource that needs to be displayed separately from similar instances.
|
||||
|
||||
- [**Flood Protection**](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md#flood-protection): If a node has too many state changes like firing too many alerts or going from reachable to unreachable, Netdata Cloud enables flood protection. As long as a node is in flood protection mode, Netdata Cloud does not send notifications about this node
|
||||
- [**Flood Protection**](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md#flood-protection): If a node has too many state changes like firing too many alerts or going from reachable to unreachable, Netdata Cloud enables flood protection. As long as a node is in flood protection mode, Netdata Cloud does not send notifications about this node
|
||||
|
||||
- [**Functions** or **Netdata Functions**](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md): Routines exposed by a collector on the Netdata Agent that can bring additional information to support troubleshooting or trigger some action to happen on the node itself.
|
||||
- [**Functions** or **Netdata Functions**](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md): Routines exposed by a collector on the Netdata Agent that can bring additional information to support troubleshooting or trigger some action to happen on the node itself.
|
||||
|
||||
## G
|
||||
|
||||
- [**Guided Troubleshooting**](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/troubleshooting-overview.md): Troubleshooting with our Machine-Learning-powered tools designed to give you a cutting edge advantage in your troubleshooting battles.
|
||||
|
||||
- [**Group by**](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md#group-by-dimension-node-or-chart): The drop-down on the dimension bar of a composite chart that allows you to group metrics by dimension, node, or chart.
|
||||
|
||||
## H
|
||||
|
||||
- [**Headless Collector Streaming**](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md#supported-streaming-configurations): Streaming configuration where child `A`, _without_ a database or web dashboard, streams metrics to parent `B`.
|
||||
- [**Group by**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md#group-by-dimension-node-or-chart): The drop-down on the dimension bar of a composite chart that allows you to group metrics by dimension, node, or chart.
|
||||
|
||||
- [**Health Configuration Files**](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md#edit-health-configuration-files): Files that you can edit to configure your Agent's health watchdog service.
|
||||
|
||||
- [**Health Entity Reference**](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md#health-entity-reference):
|
||||
|
||||
- [**Home** tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/home-tab.md): Tab in Netdata Cloud that provides a predefined dashboard of relevant information about entities in the War Room.
|
||||
- [**Home** tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/home-tab.md): Tab in Netdata Cloud that provides a predefined dashboard of relevant information about entities in the War Room.
|
||||
|
||||
## I
|
||||
|
||||
|
@ -87,32 +81,32 @@ Use the alphabatized list below to find the answer to your single-term questions
|
|||
|
||||
- [**Kickstart** or **Kickstart Script**](https://github.com/netdata/netdata/blob/master/packaging/installer/methods/kickstart.md): An automatic one-line installation script named 'kickstart.sh' that works on all Linux distributions and macOS.
|
||||
|
||||
- [**Kubernetes Dashboard** or **Kubernetes Tab**](https://github.com/netdata/netdata/blob/master/docs/dashboard/kubernetes-tab.md): Netdata Cloud features enhanced visualizations for the resource utilization of Kubernetes (k8s) clusters, embedded in the default Overview dashboard.
|
||||
- [**Kubernetes Dashboard** or **Kubernetes Tab**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/kubernetes-tab.md): Netdata Cloud features enhanced visualizations for the resource utilization of Kubernetes (k8s) clusters, embedded in the default Overview dashboard.
|
||||
|
||||
## M
|
||||
|
||||
- [**Metrics Collection**](https://github.com/netdata/netdata/blob/master/src/collectors/README.md): With zero configuration, Netdata auto-detects thousands of data sources upon starting and immediately collects per-second metrics. Netdata can immediately collect metrics from these endpoints thanks to 300+ collectors, which all come pre-installed when you install Netdata.
|
||||
|
||||
- [**Metric Correlations**](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md): A Netdata feature that lets you quickly find metrics and charts related to a particular window of interest that you want to explore further.
|
||||
- [**Metric Correlations**](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md): A Netdata feature that lets you quickly find metrics and charts related to a particular window of interest that you want to explore further.
|
||||
|
||||
- [**Metrics Exporting**](https://github.com/netdata/netdata/blob/master/docs/export/external-databases.md): Netdata allows you to export metrics to external time-series databases with the exporting engine. This system uses a number of connectors to initiate connections to more than thirty supported databases, including InfluxDB, Prometheus, Graphite, ElasticSearch, and much more.
|
||||
- [**Metrics Exporting**](https://github.com/netdata/netdata/blob/master/docs/exporting-metrics/README.md): Netdata allows you to export metrics to external time-series databases with the exporting engine. This system uses a number of connectors to initiate connections to more than thirty supported databases, including InfluxDB, Prometheus, Graphite, ElasticSearch, and much more.
|
||||
|
||||
- [**Metrics Storage**](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md): Upon collection the collected metrics need to be either forwarded, exported or just stored for further treatment. The Agent is capable to store metrics both short and long-term, with or without the usage of non-volatile storage.
|
||||
- [**Metrics Storage**](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md): Upon collection the collected metrics need to be either forwarded, exported or just stored for further treatment. The Agent is capable to store metrics both short and long-term, with or without the usage of non-volatile storage.
|
||||
|
||||
- [**Metrics Streaming Replication**](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md): Each node running Netdata can stream the metrics it collects, in real time, to another node. Metric streaming allows you to replicate metrics data across multiple nodes, or centralize all your metrics data into a single time-series database (TSDB).
|
||||
- [**Metrics Streaming Replication**](https://github.com/netdata/netdata/blob/master/docs/observability-centralization-points/README.md): Each node running Netdata can stream the metrics it collects, in real time, to another node. Metric streaming allows you to replicate metrics data across multiple nodes, or centralize all your metrics data into a single time-series database (TSDB).
|
||||
|
||||
- [**Module**](https://github.com/netdata/netdata/blob/master/src/collectors/REFERENCE.md#enable-and-disable-a-specific-collection-module): A type of collector.
|
||||
|
||||
## N
|
||||
|
||||
- [**Netdata**](https://github.com/netdata/netdata/blob/master/docs/getting-started/introduction.md): Netdata is a monitoring tool designed by system administrators, DevOps engineers, and developers to collect everything, help you visualize
|
||||
- [**Netdata**](https://www.netdata.cloud/): Netdata is a monitoring tool designed by system administrators, DevOps engineers, and developers to collect everything, help you visualize
|
||||
metrics, troubleshoot complex performance problems, and make data interoperable with the rest of your monitoring stack.
|
||||
|
||||
- [**Netdata Agent** or **Agent**](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md): Netdata's distributed monitoring Agent collects thousands of metrics from systems, hardware, and applications with zero configuration. It runs permanently on all your physical/virtual servers, containers, cloud deployments, and edge/IoT devices.
|
||||
|
||||
- [**Netdata Cloud** or **Cloud**](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/README.md): Netdata Cloud is a web application that gives you real-time visibility for your entire infrastructure. With Netdata Cloud, you can view key metrics, insightful charts, and active alerts from all your nodes in a single web interface.
|
||||
|
||||
- [**Netdata Functions** or **Functions**](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md): Routines exposed by a collector on the Netdata Agent that can bring additional information to support troubleshooting or trigger some action to happen on the node itself.
|
||||
- [**Netdata Functions** or **Functions**](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md): Routines exposed by a collector on the Netdata Agent that can bring additional information to support troubleshooting or trigger some action to happen on the node itself.
|
||||
|
||||
<!-- No link for this keyword - **Netdata Logs** https://github.com/netdata/netdata/blob/master/docs/tasks/miscellaneous/check-netdata-logs.md: The three log files - `error.log`, `access.log` and `debug.log` - used by Netdata -->
|
||||
|
||||
|
@ -122,17 +116,13 @@ metrics, troubleshoot complex performance problems, and make data interoperable
|
|||
|
||||
## O
|
||||
|
||||
- [**Obsoletion**(of nodes)](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#obsoleting-offline-nodes-from-a-space): Removing nodes from a space.
|
||||
- [**Obsoletion**(of nodes)](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#obsoleting-offline-nodes-from-a-space): Removing nodes from a space.
|
||||
|
||||
- [**Orchestrators**](https://github.com/netdata/netdata/blob/master/src/collectors/README.md#collector-architecture-and-terminology): External plugins that run and manage one or more modules. They run as independent processes.
|
||||
|
||||
## P
|
||||
|
||||
- [**Parent**](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md#streaming-basics): A node, running Netdata, that receives streamed metric data.
|
||||
|
||||
- [**Proxy**](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md#streaming-basics): A node, running Netdata, that receives metric data from a child and "forwards" them on to a separate parent node.
|
||||
|
||||
- [**Proxy Streaming**](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md#supported-streaming-configurations): Streaming configuration where child `A`, _with or without_ a database, sends metrics to proxy `C`, also _with or without_ a database. `C` sends metrics to parent `B`
|
||||
- [**Parent**](https://github.com/netdata/netdata/blob/master/docs/observability-centralization-points/metrics-centralization-points/README.md): A node, running Netdata, that receives streamed metric data.
|
||||
|
||||
## R
|
||||
|
||||
|
@ -140,13 +130,13 @@ metrics, troubleshoot complex performance problems, and make data interoperable
|
|||
|
||||
- [**Replication Streaming**](https://github.com/netdata/netdata/blob/master/src/streaming/README.md): Streaming configuration where child `A`, _with_ a database and web dashboard, streams metrics to parent `B`.
|
||||
|
||||
- [**Room** or **War Room**](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-war-rooms): War Rooms organize your connected nodes and provide infrastructure-wide dashboards using real-time metrics and visualizations.
|
||||
- [**Room** or **War Room**](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-war-rooms): War Rooms organize your connected nodes and provide infrastructure-wide dashboards using real-time metrics and visualizations.
|
||||
|
||||
## S
|
||||
|
||||
- [**Single Node Dashboard**](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md): A dashboard pre-configured with every installation of the Netdata agent, with thousand of metrics and hundreds of interactive charts that requires no set up.
|
||||
- [**Single Node Dashboard**](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md): A dashboard pre-configured with every installation of the Netdata agent, with thousand of metrics and hundreds of interactive charts that requires no set up.
|
||||
|
||||
- [**Space**](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-spaces): A high-level container and virtual collaboration area where you can organize team members, access levels,and the nodes you want to monitor.
|
||||
- [**Space**](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-spaces): A high-level container and virtual collaboration area where you can organize team members, access levels,and the nodes you want to monitor.
|
||||
|
||||
## T
|
||||
|
||||
|
@ -165,8 +155,8 @@ even thousands of nodes. There are no actual bottlenecks especially if you retai
|
|||
|
||||
## W
|
||||
|
||||
- [**War Room** or **Room**](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-war-rooms): War Rooms organize your connected nodes and provide infrastructure-wide dashboards using real-time metrics and visualizations.
|
||||
- [**War Room** or **Room**](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-war-rooms): War Rooms organize your connected nodes and provide infrastructure-wide dashboards using real-time metrics and visualizations.
|
||||
|
||||
## Z
|
||||
|
||||
- [**Zero Configuration**](https://github.com/netdata/netdata/blob/master/docs/getting-started/introduction.md#simple-to-deploy): Netdata is preconfigured and capable to autodetect and monitor any well known application that runs on your system. You just deploy and claim Netdata Agents in your Netdata space, and monitor them in seconds.
|
||||
- **Zero Configuration**: Netdata is preconfigured and capable to autodetect and monitor any well known application that runs on your system. You just deploy and claim Netdata Agents in your Netdata space, and monitor them in seconds.
|
||||
|
|
|
@ -35,7 +35,7 @@ Check out our [blog](https://github.com/netdata/blog#readme) repo! Any blog subm
|
|||
|
||||
#### Before you get started
|
||||
|
||||
Anyone interested in contributing significantly to documentation should first read the [Netdata style guide](https://github.com/netdata/netdata/blob/master/docs/contributing/style-guide.md) and the [Netdata Community Code of Conduct](https://github.com/netdata/.github/blob/main/CODE_OF_CONDUCT.md).
|
||||
Anyone interested in contributing significantly to documentation should first read the [Netdata style guide](https://github.com/netdata/netdata/blob/master/docs/developer-and-contributor-corner/style-guide.md) and the [Netdata Community Code of Conduct](https://github.com/netdata/.github/blob/main/CODE_OF_CONDUCT.md).
|
||||
|
||||
Netdata's documentation uses Markdown syntax. If you're not familiar with Markdown, read the [Mastering Markdown](https://guides.github.com/features/mastering-markdown/) guide from GitHub for the basics on creating paragraphs, styled text, lists, tables, and more.
|
||||
|
||||
|
|
|
@ -1,76 +0,0 @@
|
|||
<!--
|
||||
title: "Machine learning (ML) powered anomaly detection"
|
||||
sidebar_label: "Machine learning (ML) powered anomaly detection"
|
||||
description: "Detect anomalies in any system, container, or application in your infrastructure with machine learning and the open-source Netdata Agent."
|
||||
image: /img/seo/guides/monitor/anomaly-detection.png
|
||||
custom_edit_url: https://github.com/netdata/netdata/edit/master/docs/guides/monitor/anomaly-detection.md
|
||||
learn_status: "Published"
|
||||
learn_rel_path: "Operations"
|
||||
-->
|
||||
|
||||
# Machine learning (ML) powered anomaly detection
|
||||
|
||||
|
||||
## Overview
|
||||
|
||||
As of [`v1.32.0`](https://github.com/netdata/netdata/releases/tag/v1.32.0), Netdata comes with some ML powered [anomaly detection](https://en.wikipedia.org/wiki/Anomaly_detection) capabilities built into it and available to use out of the box, with zero configuration required (ML was enabled by default in `v1.35.0-29-nightly` in [this PR](https://github.com/netdata/netdata/pull/13158), previously it required a one line config change).
|
||||
|
||||
This means that in addition to collecting raw value metrics, the Netdata agent will also produce an [`anomaly-bit`](https://github.com/netdata/netdata/blob/master/src/ml/README.md#anomaly-bit---100--anomalous-0--normal) every second which will be `100` when recent raw metric values are considered anomalous by Netdata and `0` when they look normal. Once we aggregate beyond one second intervals this aggregated `anomaly-bit` becomes an ["anomaly rate"](https://github.com/netdata/netdata/blob/master/src/ml/README.md#anomaly-rate---averageanomaly-bit).
|
||||
|
||||
To be as concrete as possible, the below api call shows how to access the raw anomaly bit of the `system.cpu` chart from the [london.my-netdata.io](https://london.my-netdata.io) Netdata demo server. Passing `options=anomaly-bit` returns the anomaly bit instead of the raw metric value.
|
||||
|
||||
```
|
||||
https://london.my-netdata.io/api/v1/data?chart=system.cpu&options=anomaly-bit
|
||||
```
|
||||
|
||||
If we aggregate the above to just 1 point by adding `points=1` we get an "[Anomaly Rate](https://github.com/netdata/netdata/blob/master/src/ml/README.md#anomaly-rate---averageanomaly-bit)":
|
||||
|
||||
```
|
||||
https://london.my-netdata.io/api/v1/data?chart=system.cpu&options=anomaly-bit&points=1
|
||||
```
|
||||
|
||||
The fundamentals of Netdata's anomaly detection approach and implementation are covered in lots more detail in the [agent ML documentation](https://github.com/netdata/netdata/blob/master/src/ml/README.md).
|
||||
|
||||
This guide will explain how to get started using these ML based anomaly detection capabilities within Netdata.
|
||||
|
||||
## Anomaly Advisor
|
||||
|
||||
The [Anomaly Advisor](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md) is the flagship anomaly detection feature within Netdata. In the "Anomalies" tab of Netdata you will see an overall "Anomaly Rate" chart that aggregates node level anomaly rate for all nodes in a space. The aim of this chart is to make it easy to quickly spot periods of time where the overall "[node anomaly rate](https://github.com/netdata/netdata/blob/master/src/ml/README.md#node-anomaly-rate)" is elevated in some unusual way and for what node or nodes this relates to.
|
||||
|
||||

|
||||
|
||||
Once an area on the Anomaly Rate chart is highlighted netdata will append a "heatmap" to the bottom of the screen that shows which metrics were more anomalous in the highlighted timeframe. Each row in the heatmap consists of an anomaly rate sparkline graph that can be expanded to reveal the raw underlying metric chart for that dimension.
|
||||
|
||||

|
||||
|
||||
## Embedded Anomaly Rate Charts
|
||||
|
||||
Charts in both the [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md) and [single node tabs](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md) also expose the underlying anomaly rates for each dimension so users can easily see if the raw metrics are considered anomalous or not by Netdata.
|
||||
|
||||
Pressing the anomalies icon (next to the information icon in the chart header) will expand the anomaly rate chart to make it easy to see how the anomaly rate for any individual dimension corresponds to the raw underlying data. In the example below we can see that the spike in `system.pgpgio|in` corresponded in the anomaly rate for that dimension jumping to 100% for a small period of time until the spike passed.
|
||||
|
||||

|
||||
|
||||
## Anomaly Rate Based Alerts
|
||||
|
||||
It is possible to use the `anomaly-bit` when defining traditional Alerts within netdata. The `anomaly-bit` is just another `options` parameter that can be passed as part of an alert line lookup.
|
||||
|
||||
You can see some example ML based alert configurations below:
|
||||
|
||||
- [Anomaly rate based CPU dimensions alert](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md#example-8---anomaly-rate-based-cpu-dimensions-alert)
|
||||
- [Anomaly rate based CPU chart alert](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md#example-9---anomaly-rate-based-cpu-chart-alert)
|
||||
- [Anomaly rate based node level alert](https://github.com/netdata/netdata/blob/master/src/health/REFERENCE.md#example-10---anomaly-rate-based-node-level-alert)
|
||||
- More examples in the [`/health/health.d/ml.conf`](https://github.com/netdata/netdata/blob/master/src/health/health.d/ml.conf) file that ships with the agent.
|
||||
|
||||
## Learn More
|
||||
|
||||
Check out the resources below to learn more about how Netdata is approaching ML:
|
||||
|
||||
- [Agent ML documentation](https://github.com/netdata/netdata/blob/master/src/ml/README.md).
|
||||
- [Anomaly Advisor documentation](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md).
|
||||
- [Metric Correlations documentation](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md).
|
||||
- Anomaly Advisor [launch blog post](https://www.netdata.cloud/blog/introducing-anomaly-advisor-unsupervised-anomaly-detection-in-netdata/).
|
||||
- Netdata Approach to ML [blog post](https://www.netdata.cloud/blog/our-approach-to-machine-learning/).
|
||||
- `areal/ml` related [GitHub Discussions](https://github.com/netdata/netdata/discussions?discussions_q=label%3Aarea%2Fml).
|
||||
- Netdata Machine Learning Meetup [deck](https://docs.google.com/presentation/d/1rfSxktg2av2k-eMwMbjN0tXeo76KC33iBaxerYinovs/edit?usp=sharing) and [YouTube recording](https://www.youtube.com/watch?v=eJGWZHVQdNU).
|
||||
- Netdata Anomaly Advisor [YouTube Playlist](https://youtube.com/playlist?list=PL-P-gAHfL2KPeUcCKmNHXC-LX-FfdO43j).
|
|
@ -1,147 +0,0 @@
|
|||
# Troubleshoot Agent-Cloud connectivity issues
|
||||
|
||||
Learn how to troubleshoot connectivity issues leading to agents not appearing at all in Netdata Cloud, or
|
||||
appearing with a status other than `live`.
|
||||
|
||||
After installing an agent with the claiming token provided by Netdata Cloud, you should see charts from that node on
|
||||
Netdata Cloud within seconds. If you don't see charts, check if the node appears in the list of nodes
|
||||
(Nodes tab, top right Node filter, or Manage Nodes screen). If your node does not appear in the list, or it does appear with a status other than "Live", this guide will help you troubleshoot what's happening.
|
||||
|
||||
The most common explanation for connectivity issues usually falls into one of the following three categories:
|
||||
|
||||
- If the node does not appear at all in Netdata Cloud, [the claiming process was unsuccessful](#the-claiming-process-was-unsuccessful).
|
||||
- If the node appears as in Netdata Cloud, but is in the "Unseen" state, [the Agent was claimed but can not connect](#the-agent-was-claimed-but-can-not-connect).
|
||||
- If the node appears as in Netdata Cloud as "Offline" or "Stale", it is a [previously connected agent that can no longer connect](#previously-connected-agent-that-can-no-longer-connect).
|
||||
|
||||
## The claiming process was unsuccessful
|
||||
|
||||
If the claiming process fails, the node will not appear at all in Netdata Cloud.
|
||||
|
||||
First ensure that you:
|
||||
- Use the newest possible stable or nightly version of the agent (at least v1.32).
|
||||
- Your node can successfully issue an HTTPS request to https://app.netdata.cloud
|
||||
|
||||
Other possible causes differ between kickstart installations and Docker installations.
|
||||
|
||||
### Verify your node can access Netdata Cloud
|
||||
|
||||
If you run either `curl` or `wget` to do an HTTPS request to https://app.netdata.cloud, you should get
|
||||
back a 404 response. If you do not, check your network connectivity, domain resolution,
|
||||
and firewall settings for outbound connections.
|
||||
|
||||
If your firewall is configured to completely prevent outbound connections, you need to whitelist `app.netdata.cloud` and `mqtt.netdata.cloud`. If you can't whitelist domains in your firewall, you can whitelist the IPs that the hostnames resolve to, but keep in mind that they can change without any notice.
|
||||
|
||||
If you use an outbound proxy, you need to [take some extra steps]( https://github.com/netdata/netdata/blob/master/src/claim/README.md#connect-through-a-proxy).
|
||||
|
||||
### Troubleshoot claiming with kickstart.sh
|
||||
|
||||
Claiming is done by executing `netdata-claim.sh`, a script that is usually located under `${INSTALL_PREFIX}/netdata/usr/sbin/netdata-claim.sh`. Possible error conditions we have identified are:
|
||||
- No script found at all in any of our search paths.
|
||||
- The path where the claiming script should be does not exist.
|
||||
- The path exists, but is not a file.
|
||||
- The path is a file, but is not executable.
|
||||
Check the output of the kickstart script for any reported errors claiming and verify that the claiming script exists
|
||||
and can be executed.
|
||||
|
||||
### Troubleshoot claiming with Docker
|
||||
|
||||
First verify that the NETDATA_CLAIM_TOKEN parameter is correctly configured and then check for any errors during
|
||||
initialization of the container.
|
||||
|
||||
The most common issue we have seen claiming nodes in Docker is [running on older hosts with seccomp enabled](https://github.com/netdata/netdata/blob/master/src/claim/README.md#known-issues-on-older-hosts-with-seccomp-enabled).
|
||||
|
||||
## The Agent was claimed but can not connect
|
||||
|
||||
Agents that appear on the cloud with state "Unseen" have successfully been claimed, but have never
|
||||
been able to successfully establish an ACLK connection.
|
||||
|
||||
Agents that appear with state "Offline" or "Stale" were able to connect at some point, but are currently not
|
||||
connected. The difference between the two is that "Stale" nodes had some of their data replicated to a
|
||||
parent node that is still connected.
|
||||
|
||||
### Verify that the agent is running
|
||||
|
||||
#### Troubleshoot connection establishment with kickstart.sh
|
||||
|
||||
The kickstart script will install/update your Agent and then try to claim the node to the Cloud
|
||||
(if tokens are provided). To complete the second part, the Agent must be running. In some platforms,
|
||||
the Netdata service cannot be enabled by default and you must do it manually, using the following steps:
|
||||
|
||||
1. Check if the Agent is running:
|
||||
|
||||
```bash
|
||||
systemctl status netdata
|
||||
```
|
||||
|
||||
The expected output should contain info like this:
|
||||
|
||||
```bash
|
||||
Active: active (running) since Wed 2022-07-06 12:25:02 EEST; 1h 40min ago
|
||||
```
|
||||
|
||||
2. Enable and start the Netdata Service.
|
||||
|
||||
```bash
|
||||
systemctl enable netdata
|
||||
systemctl start netdata
|
||||
```
|
||||
|
||||
3. Retry the kickstart claiming process.
|
||||
|
||||
> ### Note
|
||||
>
|
||||
> In some cases a simple restart of the Agent can fix the issue.
|
||||
> Read more about [Starting, Stopping and Restarting the Agent](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md#maintaining-a-netdata-agent-installation).
|
||||
|
||||
#### Troubleshoot connection establishment with Docker
|
||||
|
||||
If a Netdata container exits or is killed before it properly starts, it may be able to complete the claiming
|
||||
process, but not have enough time to establish the ACLK connection.
|
||||
|
||||
### Verify that your firewall allows websockets
|
||||
|
||||
The agent initiates an SSL connection to `app.netdata.cloud` and then upgrades that connection to use secure
|
||||
websockets. Some firewalls completely prevent the use of websockets, even for outbound connections.
|
||||
|
||||
## Previously connected agent that can no longer connect
|
||||
|
||||
The states "Offline" and "Stale" suggest that the agent was able to connect at some point in the past, but
|
||||
that it is currently not connected.
|
||||
|
||||
### Verify that network connectivity is still possible
|
||||
|
||||
Verify that you can still issue HTTPS requests to app.netdata.cloud and that no firewall or proxy changes were made.
|
||||
|
||||
### Verify that the claiming info is persisted
|
||||
|
||||
If you use Docker, verify that the contents of `/var/lib/netdata` are preserved across container restarts, using a persistent volume.
|
||||
|
||||
### Verify that the claiming info is not cloned
|
||||
|
||||
A relatively common case we have seen especially with VMs is two or more nodes sharing the same credentials.
|
||||
This happens if you claim a node in a VM and then create an image based on that node. Netdata can't properly
|
||||
work this way, as we have unique node identification information under `/var/lib/netdata`.
|
||||
|
||||
### Verify that your IP is not blocked by Netdata Cloud
|
||||
|
||||
Most of the nodes change IPs dynamically. It is possible that your current IP has been restricted from accessing `app.netdata.cloud` due to security concerns, usually because it was spamming Netdata Coud with too many
|
||||
failed requests (old versions of the agent).
|
||||
|
||||
To verify this:
|
||||
|
||||
1. Check the Agent's `aclk-state`.
|
||||
|
||||
```bash
|
||||
sudo netdatacli aclk-state | grep "Banned By Cloud"
|
||||
```
|
||||
|
||||
The output will contain a line indicating if the IP is banned from `app.netdata.cloud`:
|
||||
|
||||
```bash
|
||||
Banned By Cloud: yes
|
||||
```
|
||||
|
||||
2. If your node's IP is banned, you can:
|
||||
|
||||
- Contact our team to whitelist your IP by submitting a ticket in the [Netdata forum](https://community.netdata.cloud/)
|
||||
- Change your node's IP
|
|
@ -8,7 +8,7 @@ Because Metric Correlations uses every available metric from your infrastructure
|
|||
|
||||
## Using Metric Correlations
|
||||
|
||||
When viewing the [Metrics tab or a single-node dashboard](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md), the **Metric Correlations** button appears in the top right corner of the page.
|
||||
When viewing the [Metrics tab or a single-node dashboard](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md), the **Metric Correlations** button appears in the top right corner of the page.
|
||||
|
||||
To start correlating metrics, click the **Metric Correlations** button, hold the `Alt` key (or `⌘` on macOS), and drag a selection of metrics on a single chart. The selected timeframe needs at least 15 seconds for Metric Correlation to work.
|
||||
|
||||
|
@ -52,7 +52,7 @@ When a Metric Correlations request is made to Netdata Cloud, if any node instanc
|
|||
|
||||
## Usage tips
|
||||
|
||||
- When running Metric Correlations from the [Metrics tab](docs/dashboard/metrics-tab-and-single-node-tabs.md) across multiple nodes, you might find better results if you iterate on the initial results by grouping by node to then filter to nodes of interest and rerun the Metric Correlations. So a typical workflow in this case would be to:
|
||||
- When running Metric Correlations from the [Metrics tab](docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md) across multiple nodes, you might find better results if you iterate on the initial results by grouping by node to then filter to nodes of interest and rerun the Metric Correlations. So a typical workflow in this case would be to:
|
||||
- If unsure which nodes you are interested in then run MC on all nodes.
|
||||
- Within the initial results returned group the most interesting chart by node to see if the changes are across all nodes or a subset of nodes.
|
||||
- If you see a subset of nodes clearly jump out when you group by node, then filter for just those nodes of interest and run the MC again. This will result in less aggregation needing to be done by Netdata and so should help give clearer results as you interact with the slider.
|
|
@ -1,226 +0,0 @@
|
|||
# How metrics streaming works
|
||||
|
||||
Each node running Netdata can stream the metrics it collects, in real time, to another node. Streaming allows you to
|
||||
replicate metrics data across multiple nodes, or centralize all your metrics data into a single time-series database
|
||||
(TSDB).
|
||||
|
||||
When one node streams metrics to another, the node receiving metrics can visualize them on the dashboard, run health checks to
|
||||
[trigger alerts](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md) and
|
||||
[send notifications](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md), and
|
||||
[export](https://github.com/netdata/netdata/blob/master/docs/export/external-databases.md) all metrics to an external TSDB. When Netdata streams metrics to another
|
||||
Netdata, the receiving one is able to perform everything a Netdata instance is capable of.
|
||||
|
||||
Streaming lets you decide exactly how you want to store and maintain metrics data. While we believe Netdata's distributed architecture is ideal for speed and scale, streaming provides centralization options and high data availability.
|
||||
|
||||
This document will get you started quickly with streaming. More advanced concepts and suggested production deployments
|
||||
can be found in the [streaming and replication reference](https://github.com/netdata/netdata/blob/master/src/streaming/README.md).
|
||||
|
||||
## Streaming basics
|
||||
|
||||
There are three types of nodes in Netdata's streaming ecosystem.
|
||||
|
||||
- **Parent**: A node, running Netdata, that receives streamed metric data.
|
||||
- **Child**: A node, running Netdata, that streams metric data to one or more parent.
|
||||
- **Proxy**: A node, running Netdata, that receives metric data from a child and "forwards" them on to a
|
||||
separate parent node.
|
||||
|
||||
Netdata uses API keys, which are just random GUIDs, to authorize the communication between child and parent nodes. We
|
||||
recommend using `uuidgen` for generating API keys, which can then be used across any number of streaming connections.
|
||||
Or, you can generate unique API keys for each parent-child relationship.
|
||||
|
||||
Once the parent node authorizes the child's API key, the child can start streaming metrics.
|
||||
|
||||
It's important to note that the streaming connection uses TCP, UDP, or Unix sockets, _not HTTP_. To proxy streaming
|
||||
metrics, you need to use a proxy that tunnels [OSI layer 4-7
|
||||
traffic](https://en.wikipedia.org/wiki/OSI_model#Layer_4:_Transport_Layer) without interfering with it, such as
|
||||
[SOCKS](https://en.wikipedia.org/wiki/SOCKS) or Nginx's
|
||||
[TCP/UDP load balancing](https://docs.nginx.com/nginx/admin-guide/load-balancer/tcp-udp-load-balancer/).
|
||||
|
||||
## Supported streaming configurations
|
||||
|
||||
Netdata supports any combination of parent, child, and proxy nodes that you can imagine. Any node can act as both a
|
||||
parent, child, or proxy at the same time, sending or receiving streaming metrics from any number of other nodes.
|
||||
|
||||
Here are a few example streaming configurations:
|
||||
|
||||
- **Headless collector**:
|
||||
- Child `A`, _without_ a database or web dashboard, streams metrics to parent `B`.
|
||||
- `A` metrics are only available via the local Agent dashboard for `B`.
|
||||
- `B` generates alerts for `A`.
|
||||
- **Replication**:
|
||||
- Child `A`, _with_ a database and web dashboard, streams metrics to parent `B`.
|
||||
- `A` metrics are available on both local Agent dashboards, and can be stored with the same or different metrics
|
||||
retention policies.
|
||||
- Both `A` and `B` generate alerts.
|
||||
- **Proxy**:
|
||||
- Child `A`, _with or without_ a database, sends metrics to proxy `C`, also _with or without_ a database. `C` sends
|
||||
metrics to parent `B`.
|
||||
- Any node with a database can generate alerts.
|
||||
|
||||
|
||||
|
||||
### A basic parent child setup
|
||||
|
||||

|
||||
|
||||
For a predictable number of non-ephemeral nodes, install a Netdata agent on each node and replicate its data to a
|
||||
Netdata parent, preferrably on a management/admin node outside your production infrastructure.
|
||||
There are two variations of the basic setup:
|
||||
|
||||
- When your nodes have sufficient RAM and disk IO the Netdata agents on each node can run with the default
|
||||
settings for data collection and retention.
|
||||
|
||||
- When your nodes have severe RAM and disk IO limitations (e.g. Raspberry Pis), you should
|
||||
[optimize the Netdata agent's performance](https://github.com/netdata/netdata/blob/master/docs/guides/configure/performance.md).
|
||||
|
||||
[Secure your nodes](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/secure-nodes.md) to
|
||||
protect them from the internet by making their UI accessible only via an nginx proxy, with potentially different subdomains
|
||||
for the parent and even each child, if necessary.
|
||||
|
||||
Both children and the parent are connected to the cloud, to enable infrastructure observability,
|
||||
[without transferring the collected data](https://github.com/netdata/netdata/blob/master/docs/security-and-privacy-design/README.md).
|
||||
Requests for data are always serverd by a connected Netdata agent. When both a child and a parent are connected,
|
||||
the cloud will always select the parent to query the user requested data.
|
||||
|
||||
### An advanced setup
|
||||
|
||||

|
||||
|
||||
When the nodes are ephemeral, we recommend using two parents in an active-active setup, and having the children not store data at all.
|
||||
|
||||
Both parents are configured on each child, so that if one is not available, they connect to the other.
|
||||
|
||||
The children in this set up are not connected to Netdata Cloud at all, as high availability is achieved with the second parent.
|
||||
|
||||
## Enable streaming between nodes
|
||||
|
||||
The simplest streaming configuration is **replication**, in which a child node streams its metrics in real time to a
|
||||
parent node, and both nodes retain metrics in their own databases.
|
||||
|
||||
To configure replication, you need two nodes, each running Netdata. First you'll first enable streaming on your parent
|
||||
node, then enable streaming on your child node. When you're finished, you'll be able to see the child node's metrics in
|
||||
the parent node's dashboard, quickly switch between the two dashboards, and be able to serve
|
||||
[alert notifications](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md) from either or both nodes.
|
||||
|
||||
### Enable streaming on the parent node
|
||||
|
||||
First, log onto the node that will act as the parent.
|
||||
|
||||
Run `uuidgen` to create a new API key, which is a randomly-generated machine GUID the Netdata Agent uses to identify
|
||||
itself while initiating a streaming connection. Copy that into a separate text file for later use.
|
||||
|
||||
> Find out how to [install `uuidgen`](https://command-not-found.com/uuidgen) on your node if you don't already have it.
|
||||
|
||||
Next, open `stream.conf` using [`edit-config`](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md#use-edit-config-to-edit-configuration-files)
|
||||
from within the [Netdata config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration.md#the-netdata-config-directory).
|
||||
|
||||
```bash
|
||||
cd /etc/netdata
|
||||
sudo ./edit-config stream.conf
|
||||
```
|
||||
|
||||
Scroll down to the section beginning with `[API_KEY]`. Paste the API key you generated earlier between the brackets, so
|
||||
that it looks like the following:
|
||||
|
||||
```conf
|
||||
[11111111-2222-3333-4444-555555555555]
|
||||
```
|
||||
|
||||
Set `enabled` to `yes`, and `default memory mode` to `dbengine`. Leave all the other settings as their defaults. A
|
||||
simplified version of the configuration, minus the commented lines, looks like the following:
|
||||
|
||||
```conf
|
||||
[11111111-2222-3333-4444-555555555555]
|
||||
enabled = yes
|
||||
default memory mode = dbengine
|
||||
```
|
||||
|
||||
Save the file and close it, then restart Netdata with `sudo systemctl restart netdata`, or the [appropriate
|
||||
method](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md#maintaining-a-netdata-agent-installation) for your system.
|
||||
|
||||
### Enable streaming on the child node
|
||||
|
||||
Connect to your child node with SSH.
|
||||
|
||||
Open `stream.conf` again. Scroll down to the `[stream]` section and set `enabled` to `yes`. Paste the IP address of your
|
||||
parent node at the end of the `destination` line, and paste the API key generated on the parent node onto the `api key`
|
||||
line.
|
||||
|
||||
Leave all the other settings as their defaults. A simplified version of the configuration, minus the commented lines,
|
||||
looks like the following:
|
||||
|
||||
```conf
|
||||
[stream]
|
||||
enabled = yes
|
||||
destination = 203.0.113.0
|
||||
api key = 11111111-2222-3333-4444-555555555555
|
||||
```
|
||||
|
||||
Save the file and close it, then restart Netdata with `sudo systemctl restart netdata`, or the [appropriate
|
||||
method](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md#maintaining-a-netdata-agent-installation) for your system.
|
||||
|
||||
### Enable TLS/SSL on streaming (optional)
|
||||
|
||||
While encrypting the connection between your parent and child nodes is recommended for security, it's not required to
|
||||
get started. If you're not interested in encryption, skip ahead to [view streamed
|
||||
metrics](#view-streamed-metrics-in-netdatas-dashboard).
|
||||
|
||||
In this example, we'll use self-signed certificates.
|
||||
|
||||
On the **parent** node, use OpenSSL to create the key and certificate, then use `chown` to make the new files readable
|
||||
by the `netdata` user.
|
||||
|
||||
```bash
|
||||
sudo openssl req -newkey rsa:2048 -nodes -sha512 -x509 -days 365 -keyout /etc/netdata/ssl/key.pem -out /etc/netdata/ssl/cert.pem
|
||||
sudo chown netdata:netdata /etc/netdata/ssl/cert.pem /etc/netdata/ssl/key.pem
|
||||
```
|
||||
|
||||
Next, enforce TLS/SSL on the web server. Open `netdata.conf`, scroll down to the `[web]` section, and look for the `bind
|
||||
to` setting. Add `^SSL=force` to turn on TLS/SSL. See the [web server
|
||||
reference](https://github.com/netdata/netdata/blob/master/src/web/server/README.md#enabling-tls-support) for other TLS/SSL options.
|
||||
|
||||
```conf
|
||||
[web]
|
||||
bind to = *=dashboard|registry|badges|management|streaming|netdata.conf^SSL=force
|
||||
```
|
||||
|
||||
Next, connect to the **child** node and open `stream.conf`. Add `:SSL` to the end of the existing `destination` setting
|
||||
to connect to the parent using TLS/SSL. Uncomment the `ssl skip certificate verification` line to allow the use of
|
||||
self-signed certificates.
|
||||
|
||||
```conf
|
||||
[stream]
|
||||
enabled = yes
|
||||
destination = 203.0.113.0:SSL
|
||||
ssl skip certificate verification = yes
|
||||
api key = 11111111-2222-3333-4444-555555555555
|
||||
```
|
||||
|
||||
Restart both the parent and child nodes with `sudo systemctl restart netdata`, or the [appropriate
|
||||
method](https://github.com/netdata/netdata/blob/master/packaging/installer/README.md#maintaining-a-netdata-agent-installation) for your system, to stream encrypted metrics using TLS/SSL.
|
||||
|
||||
### View streamed metrics in Netdata Cloud
|
||||
|
||||
In Netdata Cloud you should now be able to see a new parent showing up in the Home tab under "Nodes by data replication".
|
||||
The replication factor for the child node has now increased to 2, meaning that its data is now highly available.
|
||||
|
||||
You don't need to do anything else, as the cloud will automatically prefer to fetch data about the child from the parent
|
||||
and switch to querying the child only when the parent is unavailable, or for some reason doesn't have the requested
|
||||
data (e.g. the connection between parent and the child is broken).
|
||||
|
||||
### View streamed metrics in Netdata's dashboard
|
||||
|
||||
At this point, the child node is streaming its metrics in real time to its parent. Open the local Agent dashboard for
|
||||
the parent by navigating to `http://PARENT-NODE:19999` in your browser, replacing `PARENT-NODE` with its IP address or
|
||||
hostname.
|
||||
|
||||
This dashboard shows parent metrics. To see child metrics, open the left-hand sidebar with the hamburger icon
|
||||

|
||||
in the top panel. Both nodes appear under the **Replicated Nodes** menu. Click on either of the links to switch between
|
||||
separate parent and child dashboards.
|
||||
|
||||

|
||||
|
||||
The child dashboard is also available directly at `http://PARENT-NODE:19999/host/CHILD-HOSTNAME`, which in this example
|
||||
is `http://203.0.113.0:19999/host/netdata-child`.
|
||||
|
|
@ -69,7 +69,7 @@ stateDiagram-v2
|
|||
|
||||
2. **Automation**: Netdata is designed to automate most of the process of setting up and running an observability solution. It is designed to instantly provide comprehensive dashboards and fully automated alerts, with zero configuration.
|
||||
|
||||
3. **High Fidelity Monitoring**: Netdata was born from our need to kill the console for observability. So, it provides metrics and logs in the same granularity and fidelity console tools do, but also comes with tools that go beyond metrics and logs, to provide a holistic view of the monitored infrastructure (e.g. check [Top Monitoring](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md)).
|
||||
3. **High Fidelity Monitoring**: Netdata was born from our need to kill the console for observability. So, it provides metrics and logs in the same granularity and fidelity console tools do, but also comes with tools that go beyond metrics and logs, to provide a holistic view of the monitored infrastructure (e.g. check [Top Monitoring](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md)).
|
||||
|
||||
4. **Minimal impact on monitored systems and applications**: Netdata has been designed to have a minimal impact on the monitored systems and their applications. There are [independent studies](https://www.ivanomalavolta.com/files/papers/ICSOC_2023.pdf) reporting that Netdata excels in CPU usage, RAM utilization, Execution Time and the impact Netdata has on monitored applications and containers.
|
||||
|
||||
|
@ -79,6 +79,6 @@ stateDiagram-v2
|
|||
|
||||
The Netdata agents (Standalone, Children and Parents) **share the dashboard** of Netdata Cloud. However, when the user is logged-in and the Netdata agent is connected to Netdata Cloud, the following are enabled (which are otherwise disabled):
|
||||
|
||||
1. **Access to Sensitive Data**: Some data, like systemd-journal logs and several [Top Monitoring](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md) features expose sensitive data, like IPs, ports, process command lines and more. To access all these when the dashboard is served directly from a Netdata agent, Netdata Cloud is required to verify that the user accessing the dashboard has the required permissions.
|
||||
1. **Access to Sensitive Data**: Some data, like systemd-journal logs and several [Top Monitoring](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md) features expose sensitive data, like IPs, ports, process command lines and more. To access all these when the dashboard is served directly from a Netdata agent, Netdata Cloud is required to verify that the user accessing the dashboard has the required permissions.
|
||||
|
||||
2. **Dynamic Configuration**: Netdata agents are configured via configuration files, manually or through some provisioning system. The latest Netdata includes a feature to allow users change some of the configuration (collectors, alerts) via the dashboard. This feature is only available to users of paid Netdata Cloud plan.
|
||||
|
|
|
@ -6,7 +6,7 @@ When preparing to backup a Netdata Agent it is worth considering that there are
|
|||
|
||||
| Data type | Description | Location |
|
||||
|---------------------|------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------|
|
||||
| Agent configuration | Files controlling configuration of the Netdata Agent | [config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration.md) |
|
||||
| Agent configuration | Files controlling configuration of the Netdata Agent | [config directory](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) |
|
||||
| Metrics | Database files | /var/cache/netdata |
|
||||
| Identity | Claim token, API key and some other files | /var/lib/netdata |
|
||||
|
||||
|
@ -22,7 +22,7 @@ In this standard scenario, you are backing up your Netdata Agent in case of a no
|
|||
> **Note**
|
||||
> The specific paths may vary depending on installation method, Operating System, and whether it is a Docker/Kubernetes deployment.
|
||||
|
||||
2. It is recommended that you [stop the Netdata Agent](https://github.com/netdata/netdata/blob/master/docs/configure/start-stop-restart.md) when backing up the Metrics/database files.
|
||||
2. It is recommended that you [stop the Netdata Agent](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/start-stop-restart.md) when backing up the Metrics/database files.
|
||||
Backing up the Agent configuration and Identity folders is straightforward as they should not be changing very frequently.
|
||||
|
||||
3. Using a backup tool such as `tar` you will need to run the backup as _root_ or as the _netdata_ user to access all the files in the directories.
|
|
@ -1,6 +1,6 @@
|
|||
<!--
|
||||
title: "Anonymous telemetry events"
|
||||
custom_edit_url: https://github.com/netdata/netdata/edit/master/docs/anonymous-statistics.md
|
||||
custom_edit_url: https://github.com/netdata/netdata/edit/master/docs/netdata-agent/configuration/anonymous-telemetry-events.md
|
||||
sidebar_label: "Anonymous telemetry events"
|
||||
learn_status: "Published"
|
||||
learn_rel_path: "Configuration"
|
||||
|
@ -23,7 +23,7 @@ We use the statistics gathered from this information for two purposes:
|
|||
|
||||
Netdata collects usage information via two different channels:
|
||||
|
||||
- **Agent dashboard**: We use the [PostHog JavaScript integration](https://posthog.com/docs/integrations/js-integration) (with sensitive event attributes overwritten to be anonymized) to send product usage events when you access an [Agent's dashboard](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/accessing-netdata-dashboards.md).
|
||||
- **Agent dashboard**: We use the [PostHog JavaScript integration](https://posthog.com/docs/integrations/js-integration) (with sensitive event attributes overwritten to be anonymized) to send product usage events when you access an [Agent's dashboard](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/README.md).
|
||||
- **Agent backend**: The `netdata` daemon executes the [`anonymous-statistics.sh`](https://github.com/netdata/netdata/blob/6469cf92724644f5facf343e4bdd76ac0551a418/daemon/anonymous-statistics.sh.in) script when Netdata starts, stops cleanly, or fails.
|
||||
|
||||
You can opt-out from sending anonymous statistics to Netdata through three different [opt-out mechanisms](#opt-out).
|
|
@ -1,7 +1,7 @@
|
|||
<!--
|
||||
title: "Common configuration changes"
|
||||
description: "See the most popular configuration changes to make to the Netdata Agent, including longer metrics retention, reduce sampling, and more."
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/configure/common-changes.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/netdata-agent/configuration/common-configuration-changes.md"
|
||||
sidebar_label: "Common configuration changes"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Tasks"
|
||||
|
@ -15,12 +15,12 @@ systems, containers, and applications, but there are hundreds of settings to twe
|
|||
over your monitoring platform.
|
||||
|
||||
This document assumes familiarity with
|
||||
using [`edit-config`](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md) from the Netdata config
|
||||
using [`edit-config`](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) from the Netdata config
|
||||
directory.
|
||||
|
||||
## Change dashboards and visualizations
|
||||
|
||||
The Netdata Agent's [local dashboard](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/accessing-netdata-dashboards.md), accessible
|
||||
The Netdata Agent's [local dashboard](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/README.md), accessible
|
||||
at `http://NODE:19999` is highly configurable. If
|
||||
you use [Netdata Cloud](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/README.md)
|
||||
for infrastructure monitoring, you
|
||||
|
@ -30,9 +30,9 @@ changes reflected in those visualizations due to the way Netdata Cloud proxies m
|
|||
### Increase the long-term metrics retention period
|
||||
|
||||
Read our doc
|
||||
on [increasing long-term metrics storage](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md)
|
||||
on [increasing long-term metrics storage](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md)
|
||||
for details, including a
|
||||
[calculator](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md#calculate-the-system-resources-ram-disk-space-needed-to-store-metrics)
|
||||
[calculator](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md#calculate-the-system-resources-ram-disk-space-needed-to-store-metrics)
|
||||
to help you determine the exact settings for your desired retention period.
|
||||
|
||||
### Reduce the data collection frequency
|
||||
|
@ -94,7 +94,7 @@ Tweak existing alerts by editing files in the `health.d/` directory. For example
|
|||
the Agent responds to anomalies related to CPU utilization.
|
||||
|
||||
To see which configuration file you need to edit to configure a specific
|
||||
alert, [view your active alerts](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md) in
|
||||
alert, [view your active alerts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/alerts-tab.md) in
|
||||
Netdata Cloud or the local Agent dashboard and look for the **source** line. For example, it might
|
||||
read `source 4@/usr/lib/netdata/conf.d/health.d/cpu.conf`.
|
||||
|
||||
|
@ -122,7 +122,7 @@ section of `netdata.conf`.
|
|||
### Enable alert notifications
|
||||
|
||||
Open `health_alarm_notify.conf` for editing. First, read the [enabling
|
||||
notifications](https://github.com/netdata/netdata/blob/master/docs/monitor/enable-notifications.md#netdata-agent) doc
|
||||
notifications](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/README.md#netdata-agent) doc
|
||||
for an example of the process using Slack, then
|
||||
click on the link to your preferred notification method to find documentation for that specific endpoint.
|
||||
|
||||
|
@ -136,7 +136,7 @@ Learn more about the available options in the [security design documentation](ht
|
|||
## Reduce resource usage
|
||||
|
||||
Read
|
||||
our [performance optimization guide](https://github.com/netdata/netdata/blob/master/docs/guides/configure/performance.md)
|
||||
our [performance optimization guide](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimize-the-netdata-agents-performance.md)
|
||||
for a long list of specific changes
|
||||
that can reduce the Netdata Agent's CPU/memory footprint and IO requirements.
|
||||
|
||||
|
@ -147,7 +147,7 @@ and as metadata to Netdata Cloud, and help you organize the metrics coming from
|
|||
defined in the section `[host labels]`.
|
||||
|
||||
For a quick introduction, read
|
||||
the [host label guide](https://github.com/netdata/netdata/blob/master/docs/guides/using-host-labels.md).
|
||||
the [host label guide](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/organize-systems-metrics-and-alerts.md).
|
||||
|
||||
The following restrictions apply to host label names:
|
||||
|
|
@ -22,7 +22,7 @@ The following table summarizes the effect of each optimization on the CPU, RAM a
|
|||
| [Use streaming and replication](#use-streaming-and-replication) | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
|
||||
| [Disable unneeded plugins or collectors](#disable-unneeded-plugins-or-collectors) | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
|
||||
| [Reduce data collection frequency](#reduce-collection-frequency) | :heavy_check_mark: | | :heavy_check_mark: |
|
||||
| [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md) | | :heavy_check_mark: | :heavy_check_mark: |
|
||||
| [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md) | | :heavy_check_mark: | :heavy_check_mark: |
|
||||
| [Use a different metric storage database](https://github.com/netdata/netdata/blob/master/src/database/README.md) | | :heavy_check_mark: | :heavy_check_mark: |
|
||||
| [Disable machine learning](#disable-machine-learning) | :heavy_check_mark: | | |
|
||||
| [Use a reverse proxy](#run-netdata-behind-a-proxy) | :heavy_check_mark: | | |
|
||||
|
@ -72,7 +72,7 @@ The memory footprint of Netdata is mainly influenced by the number of metrics co
|
|||
To estimate and control memory consumption, you can (either one or a combination of the following actions):
|
||||
|
||||
1. [Disable unneeded plugins or collectors](#disable-unneeded-plugins-or-collectors)
|
||||
2. [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md)
|
||||
2. [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md)
|
||||
3. [Use a different metric storage database](https://github.com/netdata/netdata/blob/master/src/database/README.md).
|
||||
|
||||
### Disk footprint and I/O
|
||||
|
@ -91,7 +91,7 @@ To optimize your disk footprint in any aspect described below you can:
|
|||
|
||||
To configure retention, you can:
|
||||
|
||||
1. [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md).
|
||||
1. [Change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md).
|
||||
|
||||
To control disk I/O:
|
||||
|
||||
|
@ -108,7 +108,7 @@ Minimize deployment impact on the production system by optimizing disk footprint
|
|||
|
||||
For all production environments, parent Netdata nodes outside the production infrastructure should be receiving all
|
||||
collected data from children Netdata nodes running on the production infrastructure,
|
||||
using [streaming and replication](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md).
|
||||
using [streaming and replication](https://github.com/netdata/netdata/blob/master/docs/observability-centralization-points/README.md).
|
||||
|
||||
### Disable health checks on the child nodes
|
||||
|
||||
|
@ -171,7 +171,7 @@ The fastest way to improve the Agent's resource utilization is to reduce how oft
|
|||
### Global
|
||||
|
||||
If you don't need per-second metrics, or if the Netdata Agent uses a lot of CPU even when no one is viewing that node's
|
||||
dashboard, [configure the Agent](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md) to collect
|
||||
dashboard, [configure the Agent](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) to collect
|
||||
metrics less often.
|
||||
|
||||
Open `netdata.conf` and edit the `update every` setting. The default is `1`, meaning that the Agent collects metrics
|
||||
|
@ -216,13 +216,13 @@ update_every: 10
|
|||
## Lower memory usage for metrics retention
|
||||
|
||||
See how
|
||||
to [change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/store/change-metrics-storage.md).
|
||||
to [change how long Netdata stores metrics](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/optimizing-metrics-database/change-metrics-storage.md).
|
||||
|
||||
## Use a different metric storage database
|
||||
|
||||
Consider [using a different metric storage database](https://github.com/netdata/netdata/blob/master/src/database/README.md)
|
||||
when running Netdata on IoT devices, and for children in a parent-child set up based
|
||||
on [streaming and replication](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md).
|
||||
on [streaming and replication](https://github.com/netdata/netdata/blob/master/docs/observability-centralization-points/README.md).
|
||||
|
||||
## Disable machine learning
|
||||
|
||||
|
@ -241,7 +241,7 @@ internal [web server](https://github.com/netdata/netdata/blob/master/src/web/REA
|
|||
Nginx can handle more concurrent connections, reuse idle connections, and use fast gzip compression to reduce payloads.
|
||||
|
||||
For details on installing another web server as a proxy for the local Agent dashboard,
|
||||
see [reverse proxies](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/reverse-proxies.md).
|
||||
see [reverse proxies](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/README.md).
|
||||
|
||||
## Disable/lower gzip compression for the dashboard
|
||||
|
|
@ -156,7 +156,7 @@ setups, all the way to 100% in some edge cases.
|
|||
## Configure metric retention
|
||||
|
||||
Once you have decided how to size each tier, open `netdata.conf` with
|
||||
[`edit-config`](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md#use-edit-config-to-edit-configuration-files)
|
||||
[`edit-config`](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md#edit-netdataconf)
|
||||
and make your changes in the `[db]` subsection.
|
||||
|
||||
Save the file and restart the Agent with `sudo systemctl restart netdata`, or
|
|
@ -5,12 +5,12 @@ Netdata allows to organize your observability infrastructure with spaces, war ro
|
|||
|
||||
## Spaces and war rooms
|
||||
|
||||
[Spaces](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-spaces) are used for organization-level or infrastructure-level
|
||||
[Spaces](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-spaces) are used for organization-level or infrastructure-level
|
||||
grouping of nodes and people. A node can only appear in a single space, while people can have access to multiple spaces.
|
||||
|
||||
The [war rooms](https://github.com/netdata/netdata/edit/master/docs/cloud/war-rooms.md) in a space bring together nodes and people in
|
||||
collaboration areas. War rooms can also be used for fine-tuned
|
||||
[role based access control](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/role-based-access.md).
|
||||
[role based access control](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/role-based-access-model.md).
|
||||
|
||||
## Virtual nodes
|
||||
|
||||
|
@ -85,7 +85,7 @@ sudo ./edit-config netdata.conf
|
|||
```
|
||||
|
||||
Create a new `[host labels]` section defining a new host label and its value for the system in question. Make sure not
|
||||
to violate any of the [host label naming rules](https://github.com/netdata/netdata/blob/master/docs/configure/common-changes.md#organize-nodes-with-host-labels).
|
||||
to violate any of the [host label naming rules](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/common-configuration-changes.md#organize-nodes-with-host-labels).
|
||||
|
||||
```conf
|
||||
[host labels]
|
||||
|
@ -141,7 +141,7 @@ child system. It's a vastly simplified way of accessing critical information abo
|
|||
> kernel and operating system versions, you should secure streaming connections with SSL. See the [streaming
|
||||
> documentation](https://github.com/netdata/netdata/blob/master/src/streaming/README.md#securing-streaming-communications) for details. You may also want to use
|
||||
> [access lists](https://github.com/netdata/netdata/blob/master/src/web/server/README.md#access-lists) or [expose the API only to LAN/localhost
|
||||
> connections](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/secure-nodes.md#expose-netdata-only-in-a-private-lan).
|
||||
> connections](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/securing-netdata-agents.md#expose-netdata-only-in-a-private-lan).
|
||||
|
||||
You can also use `_is_parent`, `_is_child`, and any other host labels in both health entities and metrics
|
||||
exporting. Speaking of which...
|
|
@ -1,4 +1,4 @@
|
|||
# Running Netdata behind a reverse proxy
|
||||
# Running the Netdata Agent behind a reverse proxy
|
||||
|
||||
If you need to access a Netdata agent's user interface or API in a production environment we recommend you put Netdata behind
|
||||
another web server and secure access to the dashboard via SSL, user authentication and firewall rules.
|
||||
|
@ -6,12 +6,12 @@ another web server and secure access to the dashboard via SSL, user authenticati
|
|||
A dedicated web server also provides more robustness and capabilities than the Agent's [internal web server](https://github.com/netdata/netdata/blob/master/src/web/README.md).
|
||||
|
||||
We have documented running behind
|
||||
[nginx](https://github.com/netdata/netdata/blob/master/docs/Running-behind-nginx.md),
|
||||
[Apache](https://github.com/netdata/netdata/blob/master/docs/Running-behind-apache.md),
|
||||
[HAProxy](https://github.com/netdata/netdata/blob/master/docs/Running-behind-haproxy.md),
|
||||
[Lighttpd](https://github.com/netdata/netdata/blob/master/docs/Running-behind-lighttpd.md),
|
||||
[Caddy](https://github.com/netdata/netdata/blob/master/docs/Running-behind-caddy.md),
|
||||
and [H2O](https://github.com/netdata/netdata/blob/master/docs/Running-behind-h2o.md).
|
||||
[nginx](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-nginx.md),
|
||||
[Apache](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-apache.md),
|
||||
[HAProxy](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-haproxy.md),
|
||||
[Lighttpd](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-lighttpd.md),
|
||||
[Caddy](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-caddy.md),
|
||||
and [H2O](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-h2o.md).
|
||||
If you prefer a different web server, we suggest you follow the documentation for nginx and tell us how you did it
|
||||
by adding your own "Running behind webserverX" document.
|
||||
|
|
@ -1,6 +1,6 @@
|
|||
<!--
|
||||
title: "Running Netdata behind H2O"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/Running-behind-h2o.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-h2o.md"
|
||||
sidebar_label: "Running Netdata behind H2O"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Tasks"
|
|
@ -1,6 +1,6 @@
|
|||
<!--
|
||||
title: "Netdata via HAProxy"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/Running-behind-haproxy.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-haproxy.md"
|
||||
sidebar_label: "Netdata via HAProxy"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Tasks"
|
|
@ -1,6 +1,6 @@
|
|||
<!--
|
||||
title: "Netdata via lighttpd v1.4.x"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/Running-behind-lighttpd.md"
|
||||
custom_edit_url: "https://github.com/netdata/netdata/edit/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-lighttpd.md"
|
||||
sidebar_label: "Netdata via lighttpd v1.4.x"
|
||||
learn_status: "Published"
|
||||
learn_topic_type: "Tasks"
|
|
@ -1,4 +1,4 @@
|
|||
# Secure your nodes
|
||||
# Securing Netdata Agents
|
||||
|
||||
Netdata is a monitoring system. It should be protected, the same way you protect all your admin apps. We assume Netdata
|
||||
will be installed privately, for your eyes only.
|
||||
|
@ -65,7 +65,7 @@ using `sudo systemctl
|
|||
restart netdata`. If you try to visit the local dashboard to `http://NODE:19999` again, the connection will fail because
|
||||
that node no longer serves its local dashboard.
|
||||
|
||||
> See the [configuration basics doc](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md) for details on how to find
|
||||
> See the [configuration basics doc](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) for details on how to find
|
||||
`netdata.conf` and use
|
||||
> `edit-config`.
|
||||
|
||||
|
@ -144,12 +144,12 @@ dashboard in transit. The connection to Netdata Cloud is always secured with TLS
|
|||
Use one web server to provide authentication in front of **all your Netdata servers**. So, you will be accessing all your Netdata with
|
||||
URLs like `http://{HOST}/netdata/{NETDATA_HOSTNAME}/` and authentication will be shared among all of them (you will sign-in once for all your servers).
|
||||
Instructions are provided on how to set the proxy configuration to have Netdata run behind
|
||||
[nginx](https://github.com/netdata/netdata/blob/master/docs/Running-behind-nginx.md),
|
||||
[HAproxy](https://github.com/netdata/netdata/blob/master/docs/Running-behind-haproxy.md),
|
||||
[Apache](https://github.com/netdata/netdata/blob/master/docs/Running-behind-apache.md),
|
||||
[lighthttpd](https://github.com/netdata/netdata/blob/master/docs/Running-behind-lighttpd.md),
|
||||
[caddy](https://github.com/netdata/netdata/blob/master/docs/Running-behind-caddy.md), and
|
||||
[H2O](https://github.com/netdata/netdata/blob/master/docs/Running-behind-h2o.md).
|
||||
[nginx](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-nginx.md),
|
||||
[HAproxy](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-haproxy.md),
|
||||
[Apache](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-apache.md),
|
||||
[lighthttpd](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-lighttpd.md),
|
||||
[caddy](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-caddy.md), and
|
||||
[H2O](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/running-the-netdata-agent-behind-a-reverse-proxy/Running-behind-h2o.md).
|
||||
|
||||
## Use Netdata parents as Web Application Firewalls
|
||||
|
||||
|
@ -159,7 +159,7 @@ your production systems from the rest of the world. Netdata
|
|||
Parents receive metric data from Netdata Agents or other Netdata Parents on one side, and serve most queries using their own
|
||||
copy of the data to satisfy dashboard requests on the other side.
|
||||
|
||||
For more information see [Streaming and replication](https://github.com/netdata/netdata/blob/master/docs/metrics-storage-management/enable-streaming.md).
|
||||
For more information see [Streaming and replication](https://github.com/netdata/netdata/blob/master/docs/observability-centralization-points/README.md).
|
||||
|
||||
## Other methods
|
||||
|
|
@ -1,60 +0,0 @@
|
|||
# Getting started with Netdata Cloud On-Prem Light PoC
|
||||
Due to the high demand, we designed a very light and easy-to-install version of netdata for clients who do not have Kubernetes cluster installed. Please keep in mind that this is (for now) only designed to be used as a PoC with no built-in resiliency on failures of any kind.
|
||||
|
||||
Requirements:
|
||||
- Ubuntu 22.04 (clean installation will work best).
|
||||
- 10 CPU Cores and 24 GiB of memory.
|
||||
- Access to shell as a sudo.
|
||||
- TLS certificate for Netdata Cloud On-Prem PoC. A single endpoint is required. The certificate must be trusted by all entities connecting to the On-Prem installation by any means.
|
||||
- AWS ID and Key - contact Netdata Product Team - info@netdata.cloud
|
||||
- License Key - contact Netdata Product Team - info@netdata.cloud
|
||||
|
||||
To install the whole environment, log in to the designated host and run:
|
||||
```shell
|
||||
curl https://netdata-cloud-netdata-static-content.s3.amazonaws.com/provision.sh -o provision.sh
|
||||
chmod +x provision.sh
|
||||
sudo ./provision.sh install \
|
||||
-key-id "" \
|
||||
-access-key "" \
|
||||
-onprem-license-key "" \
|
||||
-onprem-license-subject "" \
|
||||
-onprem-url "" \
|
||||
-certificate-path "" \
|
||||
-private-key-path ""
|
||||
```
|
||||
|
||||
What does the script do during installation?
|
||||
1. Prompts for user to provide:
|
||||
- `-key-id` - AWS ECR access key ID.
|
||||
- `-access-key` - AWS ECR Access Key.
|
||||
- `-onprem-license-key` - Netdata Cloud On-Prem license key.
|
||||
- `-onprem-license-subject` - Netdata Cloud On-Prem license subject.
|
||||
- `-onprem-url` - URL for the On-prem (without http(s) protocol).
|
||||
- `-certificate-path` - path to your PEM encoded certificate.
|
||||
- `-private-key-path` - path to your PEM encoded key.
|
||||
2. After getting all of the information installation is starting. The script will install:
|
||||
- Helm
|
||||
- Kubectl
|
||||
- AWS CLI
|
||||
- K3s cluster (single node)
|
||||
3. When all the required software is installed script starts to provision the K3s cluster with gathered data.
|
||||
|
||||
After cluster provisioning netdata is ready to be used.
|
||||
|
||||
##### How to log in?
|
||||
Because this is a PoC with 0 configurations required, only log in by mail can work. What's more every mail that Netdata Cloud On-Prem sends will appear on the mailcatcher, which acts as the SMTP server with a simple GUI to read the mails. Steps:
|
||||
1. Open Netdata Cloud On-Prem PoC in the web browser on URL you specified
|
||||
2. Provide email and use the button to confirm
|
||||
3. Mailcatcher will catch all the emails so go to `<URL from point 1.>/mailcatcher`. Find yours and click the link.
|
||||
4. You are now logged into the netdata. Add your first nodes!
|
||||
|
||||
##### How to remove Netdata Cloud On-Prem PoC?
|
||||
To uninstall the whole PoC, use the same script that installed it, with the `uninstall` switch.
|
||||
|
||||
```shell
|
||||
cd <script dir>
|
||||
sudo ./provision.sh uninstall
|
||||
```
|
||||
|
||||
#### WARNING
|
||||
This script will automatically expose not only netdata but also a mailcatcher under `<URL from point 1.>/mailcatcher`.
|
|
@ -1,200 +0,0 @@
|
|||
# Getting started with Netdata Cloud On-Prem
|
||||
Helm charts are designed for Kubernetes to run as the local equivalent of the Netdata Cloud public offering. This means that no data is sent outside of your cluster. By default, On-Prem installation is trying to reach outside resources only when pulling the container images.
|
||||
There are 2 helm charts in total:
|
||||
- netdata-cloud-onprem - installs onprem itself.
|
||||
- netdata-cloud-dependency - installs all necessary dependency applications. Not for production use, PoC only.
|
||||
|
||||
## Requirements
|
||||
#### Install host:
|
||||
- [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/getting-started-install.html)
|
||||
- [Helm](https://helm.sh/docs/intro/install/) version 3.12+ with OCI Configuration (explained in the installation section)
|
||||
- [Kubectl](https://kubernetes.io/docs/tasks/tools/)
|
||||
|
||||
#### Kubernetes requirements:
|
||||
- Kubernetes cluster version 1.23+
|
||||
- Kubernetes metrics server (For autoscaling)
|
||||
- TLS certificate for Netdata Cloud On-Prem. A single endpoint is required but there is an option to split the frontend, api, and mqtt endpoints. The certificate must be trusted by all entities connecting to the On-Prem installation by any means.
|
||||
- Ingress controller to support HTTPS `*`
|
||||
- PostgreSQL version 13.7 `*` (Main persistent data app)
|
||||
- EMQX version 5.11 `*` (MQTT Broker that allows Agents to send messages to the On-Prem Cloud)
|
||||
- Apache Pulsar version 2.10+ `*` (Central communication hub. Applications exchange messages through Pulsar)
|
||||
- Traefik version 2.7.x `*` (Internal communication - API Gateway)
|
||||
- Elasticsearch version 8.8.x `*` (Holds Feed)
|
||||
- Redis version 6.2 `*` (Cache)
|
||||
- Some form of generating imagePullSecret `*` (Our ECR repos are secured)
|
||||
- Default storage class configured and working (Persistent volumes based on SSDs are preferred)
|
||||
`*` - available in dependencies helm chart for PoC applications.
|
||||
|
||||
#### Hardware requirements:
|
||||
##### How we tested it:
|
||||
- Several VMs on the AWS EC2, the size of the instance was c6a.32xlarge (128CPUs / 256GiB memory).
|
||||
- Host system - Ubuntu 22.04.
|
||||
- Each VM hosts 200 Agent nodes as docker containers.
|
||||
- Agents are connected directly to the Netdata Cloud On-Prem (no Parent-Child relationships). This is the worst option for the cloud.
|
||||
- Cloud hosted on 1 Kubernetes node c6a.8xlarge (32CPUs / 64GiB memory).
|
||||
- Dependencies were also installed on the same node.
|
||||
The maximum of nodes connected was ~2000.
|
||||
|
||||
##### Results
|
||||
There was no point in trying to connect more nodes as we are covering the PoC purposes.
|
||||
- In a peak connection phase - All nodes startup were triggered in ~15 minutes:
|
||||
- Up to 60% (20 cores) CPU usage of the Kubernetes node. Top usage came from:
|
||||
- Ingress controller (we used haproxy ingress controller)
|
||||
- Postgres
|
||||
- Pulsar
|
||||
- EMQX
|
||||
Combined they were responsible for ~30-35% of CPU usage of the node.
|
||||
- When all nodes connected and synchronized their state CPU usage floated between 30% and 40% - depending on what we did on the Cloud. Here top offenders were:
|
||||
- Pulsar
|
||||
- Postgres
|
||||
Combined they were responsible for ~15-20% of CPU usage of the node.
|
||||
- Memory usage - 45GiB in a peak. Most of it (~20GiB) was consumed by:
|
||||
- Postgres
|
||||
- Elasticsearch
|
||||
- Pulsar
|
||||
|
||||
For a comparison - Netdata Cloud On-prem installation with just 100 nodes connected, without dependencies is going to consume ~2CPUs and ~2GiB of memory (REAL usage, not requests on a Kubernetes).
|
||||
|
||||
## Pulling the helm chart
|
||||
The helm chart for the Netdata Cloud On-Prem installation on Kubernetes is available in the ECR registry.
|
||||
The ECR registry is private, so you need to log in first. Credentials are sent by our Product Team. If you do not have them, please contact our Product Team - info@netdata.cloud.
|
||||
|
||||
#### Configure AWS CLI
|
||||
The machine used for helm chart installation will also need [AWS CLI installed](https://docs.aws.amazon.com/cli/latest/userguide/getting-started-install.html).
|
||||
There are 2 options for configuring `aws cli` to work with the provided credentials. The first one is to set the environment variables:
|
||||
```bash
|
||||
export AWS_ACCESS_KEY_ID=<your_secret_id>
|
||||
export AWS_SECRET_ACCESS_KEY=<your_secret_key>
|
||||
```
|
||||
|
||||
The second one is to use an interactive shell:
|
||||
```bash
|
||||
aws configure
|
||||
```
|
||||
|
||||
#### Configure helm to use secured ECR repository
|
||||
Using `aws` command we will generate a token for helm to access the secured ECR repository:
|
||||
```bash
|
||||
aws ecr get-login-password --region us-east-1 | helm registry login --username AWS --password-stdin 362923047827.dkr.ecr.us-east-1.amazonaws.com
|
||||
```
|
||||
|
||||
After this step you should be able to add the repository to your helm or just pull the helm chart:
|
||||
```bash
|
||||
helm pull oci://362923047827.dkr.ecr.us-east-1.amazonaws.com/netdata-cloud-dependency --untar #optional
|
||||
helm pull oci://362923047827.dkr.ecr.us-east-1.amazonaws.com/netdata-cloud-onprem --untar
|
||||
```
|
||||
|
||||
Local folders with the newest versions of helm charts should appear on your working dir.
|
||||
|
||||
## Installation
|
||||
|
||||
Netdata provides access to two helm charts:
|
||||
1. netdata-cloud-dependency - required applications for netdata-cloud-onprem. Not for production use.
|
||||
2. netdata-cloud-onprem - the application itself + provisioning
|
||||
|
||||
### netdata-cloud-dependency
|
||||
The entire helm chart is designed around the idea that it allows the installation of the necessary applications:
|
||||
- Redis
|
||||
- Elasticsearch
|
||||
- EMQX
|
||||
- Apache Pulsar
|
||||
- PostgreSQL
|
||||
- Traefik
|
||||
- Mailcatcher
|
||||
- k8s-ecr-login-renew
|
||||
- kubernetes-ingress
|
||||
|
||||
Every configuration option is available through `values.yaml` in the folder that contains your netdata-cloud-dependency helm chart. All configuration options are described in README.md which is a part of the helm chart. It is enough to mention here that each component can be enabled/disabled individually. It is done by true/false switches in `values.yaml`. In this way, it is easier for the user to migrate to production-grade components gradually.
|
||||
|
||||
Unless you prefer a different solution to the problem, `k8s-ecr-login-renew` is responsible for calling out the `AWS API` for token regeneration. This token is then injected into the secret that every node is using for authentication with secured ECR when pulling the images.
|
||||
The default setting in `values.yaml` of `netdata-cloud-onprem` - `.global.imagePullSecrets` is configured to work out of the box with the dependency helm chart.
|
||||
|
||||
For helm chart installation - save your changes in `values.yaml` and execute:
|
||||
```shell
|
||||
cd [your helm chart location]
|
||||
helm upgrade --wait --install netdata-cloud-dependency -n netdata-cloud --create-namespace -f values.yaml .
|
||||
```
|
||||
|
||||
### netdata-cloud-onprem
|
||||
|
||||
Every configuration option is available through `values.yaml` in the folder that contains your netdata-cloud-onprem helm chart. All configuration options are described in README.md which is a part of the helm chart.
|
||||
|
||||
#### Installing Netdata Cloud On-Prem
|
||||
```shell
|
||||
cd [your helm chart location]
|
||||
helm upgrade --wait --install netdata-cloud-onprem -n netdata-cloud --create-namespace -f values.yaml .
|
||||
```
|
||||
|
||||
##### Important notes
|
||||
1. Installation takes care of provisioning the resources with migration services.
|
||||
1. During the first installation, a secret called the `netdata-cloud-common` is created. It contains several randomly generated entries. Deleting helm chart is not going to delete this secret, nor reinstalling the whole On-Prem, unless manually deleted by kubernetes administrator. The content of this secret is extremely relevant - strings that are contained there are essential parts of encryption. Losing or changing the data that it contains will result in data loss.
|
||||
|
||||
## Short description of services
|
||||
#### cloud-accounts-service
|
||||
Responsible for user registration & authentication. Manages user account information.
|
||||
#### cloud-agent-data-ctrl-service
|
||||
Forwards request from the cloud to the relevant agents.
|
||||
The requests include:
|
||||
* Fetching chart metadata from the agent
|
||||
* Fetching chart data from the agent
|
||||
* Fetching function data from the agent
|
||||
#### cloud-agent-mqtt-input-service
|
||||
Forwards MQTT messages emitted by the agent related to the agent entities to the internal Pulsar broker. These include agent connection state updates.
|
||||
#### cloud-agent-mqtt-output-service
|
||||
Forwards Pulsar messages emitted in the cloud related to the agent entities to the MQTT broker. From there, the messages reach the relevant agent.
|
||||
#### cloud-alarm-config-mqtt-input-service
|
||||
Forwards MQTT messages emitted by the agent related to the alarm-config entities to the internal Pulsar broker. These include the data for the alarm configuration as seen by the agent.
|
||||
#### cloud-alarm-log-mqtt-input-service
|
||||
Forwards MQTT messages emitted by the agent related to the alarm-log entities to the internal Pulsar broker. These contain data about the alarm transitions that occurred in an agent.
|
||||
#### cloud-alarm-mqtt-output-service
|
||||
Forwards Pulsar messages emitted in the cloud related to the alarm entities to the MQTT broker. From there, the messages reach the relevant agent.
|
||||
#### cloud-alarm-processor-service
|
||||
Persists latest alert statuses received from the agent in the cloud.
|
||||
Aggregates alert statuses from relevant node instances.
|
||||
Exposes API endpoints to fetch alert data for visualization on the cloud.
|
||||
Determines if notifications need to be sent when alert statuses change and emits relevant messages to Pulsar.
|
||||
Exposes API endpoints to store and return notification-silencing data.
|
||||
#### cloud-alarm-streaming-service
|
||||
Responsible for starting the alert stream between the agent and the cloud.
|
||||
Ensures that messages are processed in the correct order, and starts a reconciliation process between the cloud and the agent if out-of-order processing occurs.
|
||||
#### cloud-charts-mqtt-input-service
|
||||
Forwards MQTT messages emitted by the agent related to the chart entities to the internal Pulsar broker. These include the chart metadata that is used to display relevant charts on the cloud.
|
||||
#### cloud-charts-mqtt-output-service
|
||||
Forwards Pulsar messages emitted in the cloud related to the charts entities to the MQTT broker. From there, the messages reach the relevant agent.
|
||||
#### cloud-charts-service
|
||||
Exposes API endpoints to fetch the chart metadata.
|
||||
Forwards data requests via the `cloud-agent-data-ctrl-service` to the relevant agents to fetch chart data points.
|
||||
Exposes API endpoints to call various other endpoints on the agent, for instance, functions.
|
||||
#### cloud-custom-dashboard-service
|
||||
Exposes API endpoints to fetch and store custom dashboard data.
|
||||
#### cloud-environment-service
|
||||
Serves as the first contact point between the agent and the cloud.
|
||||
Returns authentication and MQTT endpoints to connecting agents.
|
||||
#### cloud-feed-service
|
||||
Processes incoming feed events and stores them in Elasticsearch.
|
||||
Exposes API endpoints to fetch feed events from Elasticsearch.
|
||||
#### cloud-frontend
|
||||
Contains the on-prem cloud website. Serves static content.
|
||||
#### cloud-iam-user-service
|
||||
Acts as a middleware for authentication on most of the API endpoints. Validates incoming token headers, injects the relevant ones, and forwards the requests.
|
||||
#### cloud-metrics-exporter
|
||||
Exports various metrics from an On-Prem Cloud installation. Uses the Prometheus metric exposition format.
|
||||
#### cloud-netdata-assistant
|
||||
Exposes API endpoints to fetch a human-friendly explanation of various netdata configuration options, namely the alerts.
|
||||
#### cloud-node-mqtt-input-service
|
||||
Forwards MQTT messages emitted by the agent related to the node entities to the internal Pulsar broker. These include the node metadata as well as their connectivity state, either direct or via parents.
|
||||
#### cloud-node-mqtt-output-service
|
||||
Forwards Pulsar messages emitted in the cloud related to the charts entities to the MQTT broker. From there, the messages reach the relevant agent.
|
||||
#### cloud-notifications-dispatcher-service
|
||||
Exposes API endpoints to handle integrations.
|
||||
Handles incoming notification messages and uses the relevant channels(email, slack...) to notify relevant users.
|
||||
#### cloud-spaceroom-service
|
||||
Exposes API endpoints to fetch and store relations between agents, nodes, spaces, users, and rooms.
|
||||
Acts as a provider of authorization for other cloud endpoints.
|
||||
Exposes API endpoints to authenticate agents connecting to the cloud.
|
||||
|
||||
## Infrastructure Diagram
|
||||
|
||||

|
||||
|
||||
### If you have any questions or suggestions please contact the Netdata team.
|
|
@ -1,21 +0,0 @@
|
|||
# Basic troubleshooting
|
||||
We cannot predict how your particular installation of Netdata Cloud On-prem is going to work. It is a mixture of underlying infrastructure, the number of agents, and their topology.
|
||||
You can always contact the Netdata team for recommendations!
|
||||
|
||||
#### Loading charts takes a long time or ends with an error
|
||||
Charts service is trying to collect the data from all of the agents in question. If we are talking about the overview screen, all of the nodes in space are going to be queried (`All nodes` room). If it takes a long time, there are a few things that should be checked:
|
||||
1. How many nodes are you querying directly?
|
||||
There is a big difference between having 100 nodes connected directly to the cloud compared to them being connected through a few parents. Netdata always prioritizes querying nodes through parents. This way, we can reduce some of the load by pushing the responsibility to query the data to the parent. The parent is then responsible for passing accumulated data from nodes connected to it to the cloud.
|
||||
1. If you are missing data from endpoints all the time.
|
||||
Netdata Cloud always queries nodes themselves for the metrics. The cloud only holds information about metadata, such as information about what charts can be pulled from any node, but not the data points themselves for any metric. This means that if a node is throttled by the network connection or under high resource pressure, the information exchange between the agent and cloud through the MQTT broker might take a long time. In addition to checking resource usage and networking, we advise using a parent node for such endpoints. Parents can hold the data from nodes that are connected to the cloud through them, eliminating the need to query those endpoints.
|
||||
1. Errors on the cloud when trying to load charts.
|
||||
If the entire data query is crashing and no data is displayed on the UI, it could indicate problems with the `cloud-charts-service`. The query you are performing might simply exceed the CPU and/or memory limits set on the deployment. We advise increasing those resources.
|
||||
It takes a long time to load anything on the Cloud UI
|
||||
When experiencing sluggishness and slow responsiveness, the following factors should be checked regarding the Postgres database:
|
||||
1. CPU: Monitor the CPU usage to ensure it is not reaching its maximum capacity. High and sustained CPU usage can lead to sluggish performance.
|
||||
1. Memory: Check if the database server has sufficient memory allocated. Inadequate memory could cause excessive disk I/O and slow down the database.
|
||||
1. Disk Queue / IOPS: Analyze the disk queue length and disk I/O operations per second (IOPS). A high disk queue length or limited IOPS can indicate a bottleneck and negatively impact database performance.
|
||||
By examining these factors and ensuring that CPU, memory, and disk IOPS are within acceptable ranges, you can mitigate potential performance issues with the Postgres database.
|
||||
|
||||
#### Nodes are not updated quickly on the Cloud UI
|
||||
If you're experiencing delays with information exchange between the Cloud UI and the Agent, and you've already checked the networking and resource usage on the agent side, the problem may be related to Apache Pulsar or the database. Slow alerts on node alerts or slow updates on node status (online/offline) could indicate issues with message processing or database performance. You may want to investigate the performance of Apache Pulsar, ensure it is properly configured, and consider scaling or optimizing the database to handle the volume of data being processed or written to it.
|
|
@ -129,6 +129,6 @@ Netdata Cloud prefers:
|
|||
|
||||
- The most distant (from the Child) Parent available, when doing metrics visualization queries (since usually these Parents have been added for this purpose).
|
||||
|
||||
- The closest (to the Child) Parent available, for [Top Monitoring](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md) (since top-monitoring provides live data, like the processes running, the list of sockets open, etc). The streaming protocol of Netdata Parents and Children is able to forward such requests to the right child, via the Parents, to respond with live and accurate data.
|
||||
- The closest (to the Child) Parent available, for [Top Monitoring](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md) (since top-monitoring provides live data, like the processes running, the list of sockets open, etc). The streaming protocol of Netdata Parents and Children is able to forward such requests to the right child, via the Parents, to respond with live and accurate data.
|
||||
|
||||
Netdata Children may be connected to Netdata Cloud for high-availability, in case the Netdata Parents are unreachable.
|
||||
|
|
|
@ -20,8 +20,8 @@ To sign in/sign up using Google or GitHub OAuth, visit [Netdata Cloud](https://a
|
|||
|
||||
Netdata integrates with SSO tools, allowing you to control how your team connects and authenticates to Netdata Cloud.
|
||||
|
||||
For more information, see [Enterprise SSO Authentication](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/enterprise-sso-authentication.md).
|
||||
For more information, see [Enterprise SSO Authentication](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/enterprise-sso-authentication.md).
|
||||
|
||||
## Authorization
|
||||
|
||||
Once logged in, you can manage role-based access in your space to give each team member the appropriate role. For more information, see [Role-Based Access model](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/role-based-access.md).
|
||||
Once logged in, you can manage role-based access in your space to give each team member the appropriate role. For more information, see [Role-Based Access model](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/role-based-access-model.md).
|
|
@ -79,9 +79,9 @@ In more detail, you can find on the following tables which functionalities are a
|
|||
| See all configured notifications on a Space | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | - | :heavy_check_mark: | |
|
||||
| Add new configuration | :heavy_check_mark: | - | - | - | - | - | |
|
||||
| Enable/Disable configuration | :heavy_check_mark: | - | - | - | - | - | |
|
||||
| Edit configuration | :heavy_check_mark: | - | - | - | - | - | Some exceptions apply depending on [service level](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/manage-notification-methods.md#available-actions-per-notification-methods-based-on-service-level) |
|
||||
| Edit configuration | :heavy_check_mark: | - | - | - | - | - | Some exceptions apply depending on [service level](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/manage-notification-methods.md#available-actions-per-notification-methods-based-on-service-level) |
|
||||
| Delete configuration | :heavy_check_mark: | - | - | - | - | - | |
|
||||
| Edit personal level notification settings | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | [Manage user notification settings](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/manage-notification-methods.md#manage-user-notification-settings) |
|
||||
| Edit personal level notification settings | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | [Manage user notification settings](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/manage-notification-methods.md#manage-user-notification-settings) |
|
||||
| See space alert notification silencing rules | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | - | - | - | |
|
||||
| Add new space alert notification silencing rule | :heavy_check_mark: | :heavy_check_mark: | - | - | - | - | |
|
||||
| Enable/Disable space alert notification silencing rule | :heavy_check_mark: | :heavy_check_mark: | - | - | - | - | |
|
||||
|
@ -91,7 +91,7 @@ In more detail, you can find on the following tables which functionalities are a
|
|||
|
||||
> **Note**
|
||||
>
|
||||
> Enable, Edit and Add actions over specific notification methods will only be allowed if your plan has access to those ([service classification](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md#service-classification))
|
||||
> Enable, Edit and Add actions over specific notification methods will only be allowed if your plan has access to those ([service classification](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md#service-classification))
|
||||
|
||||
### Dashboards
|
||||
|
|
@ -41,11 +41,11 @@ SRE team for the user-facing SaaS application, and a second IT team for managing
|
|||
don't monitor the same nodes, they can work in separate Spaces and then further organize their nodes into War Rooms.
|
||||
|
||||
Next, set up War Rooms. Netdata Cloud creates dashboards and visualizations based on the nodes added to a given War
|
||||
Room. You can [organize War Rooms](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#war-room-organization) in any way
|
||||
Room. You can [organize War Rooms](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#war-room-organization) in any way
|
||||
you want, such as by the application type, for end-to-end application monitoring, or as an incident response tool.
|
||||
|
||||
Learn more about [Spaces](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-spaces) and [War
|
||||
Rooms](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-war-rooms), including how to manage each, in their respective reference
|
||||
Learn more about [Spaces](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-spaces) and [War
|
||||
Rooms](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-war-rooms), including how to manage each, in their respective reference
|
||||
documentation.
|
||||
|
||||
### Invite your team
|
||||
|
@ -55,7 +55,7 @@ inviting others, you can better synchronize with your team or colleagues to unde
|
|||
When something goes wrong, you'll be ready to collaboratively troubleshoot complex performance problems from a single
|
||||
pane of glass.
|
||||
|
||||
To [invite new users](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#invite-your-team), click on **Invite Users** in the
|
||||
To [invite new users](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#invite-your-team), click on **Invite Users** in the
|
||||
Space management Area. Choose which War Rooms to add this user to, then click **Send**.
|
||||
|
||||
### See an overview of your infrastructure
|
||||
|
@ -65,28 +65,28 @@ These tabs can be separated into "static", meaning they are by default presented
|
|||
|
||||
#### Static tabs
|
||||
|
||||
- The default tab for any War Room is the [Home tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/home-tab.md), which gives you an overview of this Space.
|
||||
- The default tab for any War Room is the [Home tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/home-tab.md), which gives you an overview of this Space.
|
||||
Here you can see the number of Nodes claimed, data retention statics, users by role, alerts and more.
|
||||
|
||||
- The [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/nodes-tab.md) gives you the ability to see the status (offline or online), host details, alert status and also a short overview of some key metrics from all your nodes at a glance.
|
||||
- The [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/nodes-tab.md) gives you the ability to see the status (offline or online), host details, alert status and also a short overview of some key metrics from all your nodes at a glance.
|
||||
|
||||
- The third and most important tab is the [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md) which uses composite charts to display real-time metrics from every available node in a given War Room.
|
||||
- The third and most important tab is the [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md) which uses composite charts to display real-time metrics from every available node in a given War Room.
|
||||
|
||||
- [Kubernetes tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/kubernetes-tab.md) is a logical grouping of charts regarding your Kubernetes clusters. It contains a subset of the charts available in the **Overview tab**.
|
||||
- [Kubernetes tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/kubernetes-tab.md) is a logical grouping of charts regarding your Kubernetes clusters. It contains a subset of the charts available in the **Overview tab**.
|
||||
|
||||
- The [Dashboards tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/dashboards-tab.md) gives you the ability to have tailored made views of specific/targeted interfaces for your infrastructure using any number of charts from any number of nodes.
|
||||
- The [Dashboards tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md) gives you the ability to have tailored made views of specific/targeted interfaces for your infrastructure using any number of charts from any number of nodes.
|
||||
|
||||
- The [Alerts tab](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md) provides you with an overview for all the active alerts you receive for the nodes in this War Room, you can also see all the alerts that are configured to be triggered in any given moment.
|
||||
- The [Alerts tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/alerts-tab.md) provides you with an overview for all the active alerts you receive for the nodes in this War Room, you can also see all the alerts that are configured to be triggered in any given moment.
|
||||
|
||||
- The [Anomalies tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md) is dedicated to the Anomaly Advisor tool.
|
||||
- The [Anomalies tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/anomaly-advisor-tab.md) is dedicated to the Anomaly Advisor tool.
|
||||
|
||||
- The [Functions tab](https://github.com/netdata/netdata/blob/master/docs/cloud/netdata-functions.md) gives you the ability to visualize functions that the Netdata Agent collectors are able to expose.
|
||||
- The [Functions tab](https://github.com/netdata/netdata/blob/master/docs/top-monitoring-netdata-functions.md) gives you the ability to visualize functions that the Netdata Agent collectors are able to expose.
|
||||
|
||||
- The [Feed & events](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/events-feed.md) tab lets you investigate events that occurred in the past, which is invaluable for troubleshooting.
|
||||
- The [Feed & events](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/events-feed.md) tab lets you investigate events that occurred in the past, which is invaluable for troubleshooting.
|
||||
|
||||
#### Dynamic tabs
|
||||
|
||||
If you open a [new dashboard](https://github.com/netdata/netdata/blob/master/docs/dashboard/dashboards-tab.md), jump to a single-node dashboard, or navigate to a dedicated alert page, a new tab will open in War Room bar.
|
||||
If you open a [new dashboard](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md), jump to a single-node dashboard, or navigate to a dedicated alert page, a new tab will open in War Room bar.
|
||||
|
||||
Tabs can be rearranged with drag-and-drop or closed with the **X** button. Open tabs persist between sessions, so you can always come right back to your preferred setup.
|
||||
|
||||
|
@ -97,7 +97,7 @@ single-node dashboards in Netdata Cloud to drill down on specific issues, scrub
|
|||
historical data, and see like metrics presented meaningfully to help you troubleshoot performance problems.
|
||||
|
||||
Learn more about [interacting with
|
||||
dashboards and charts](https://github.com/netdata/netdata/blob/master/docs/dashboard/netdata-charts.md) to get the most from all of Netdata's real-time
|
||||
dashboards and charts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/netdata-charts.md) to get the most from all of Netdata's real-time
|
||||
metrics.
|
||||
|
||||
### Create new dashboards
|
||||
|
@ -109,7 +109,7 @@ from every node in your infrastructure on a single dashboard.
|
|||

|
||||
|
||||
Read more about [creating new dashboards](https://github.com/netdata/netdata/blob/master/docs/dashboard/dashboards-tab.md) for more details about the process and
|
||||
Read more about [creating new dashboards](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md) for more details about the process and
|
||||
additional tips on best leveraging the feature to help you troubleshoot complex performance problems.
|
||||
|
||||
## Set up your nodes
|
||||
|
@ -136,7 +136,7 @@ cd /etc/netdata
|
|||
sudo ./edit-config netdata.conf
|
||||
```
|
||||
|
||||
Our [configuration basics doc](https://github.com/netdata/netdata/blob/master/docs/configure/nodes.md) contains more information about `netdata.conf`, `edit-config`,
|
||||
Our [configuration basics doc](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md) contains more information about `netdata.conf`, `edit-config`,
|
||||
along with simple examples to get you familiar with editing your node's configuration.
|
||||
|
||||
After you've learned the basics, you should [secure your infrastructure's nodes](https://github.com/netdata/netdata/blob/master/docs/security-and-privacy-design/README.md) using
|
||||
|
@ -151,8 +151,8 @@ gather as many metrics as they can without you having to configure them individu
|
|||
|
||||
Most collectors work without configuration, should you want more info, you can read more on [how Netdata's metrics collectors work](https://github.com/netdata/netdata/blob/master/src/collectors/README.md) and the [Collectors configuration reference](https://github.com/netdata/netdata/blob/master/src/collectors/REFERENCE.md) documentation.
|
||||
|
||||
In addition, find detailed information about which [system](https://github.com/netdata/netdata/blob/master/docs/collect/system-metrics.md),
|
||||
[container](https://github.com/netdata/netdata/blob/master/docs/collect/container-metrics.md), and [application](https://github.com/netdata/netdata/blob/master/docs/collect/application-metrics.md) metrics you can
|
||||
In addition, find detailed information about which [system](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/system-metrics.md),
|
||||
[container](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/container-metrics.md), and [application](https://github.com/netdata/netdata/blob/master/docs/collecting-metrics/application-metrics.md) metrics you can
|
||||
collect from across your infrastructure with Netdata.
|
||||
|
||||
## Netdata Cloud features
|
||||
|
@ -161,62 +161,62 @@ collect from across your infrastructure with Netdata.
|
|||
<Box
|
||||
title="Spaces and War Rooms">
|
||||
<BoxList>
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-spaces)" title="Spaces" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-war-rooms)" title="War Rooms" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-spaces)" title="Spaces" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-war-rooms)" title="War Rooms" />
|
||||
</BoxList>
|
||||
</Box>
|
||||
<Box
|
||||
title="Dashboards">
|
||||
<BoxList>
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md)" title="Metrics tab" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboard/nodes-tab.md)" title="Nodes tab" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboard/kubernetes-tab.md)" title="Kubernetes" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboard/dashboards-tab.md)" title="Create new dashboards" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md)" title="Metrics tab" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/nodes-tab.md)" title="Nodes tab" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/kubernetes-tab.md)" title="Kubernetes" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md)" title="Create new dashboards" />
|
||||
</BoxList>
|
||||
</Box>
|
||||
<Box
|
||||
title="Alerts and notifications">
|
||||
<BoxList>
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md#netdata-cloud)" title="View active alerts" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md)" title="Alert notifications" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/alerts-tab.md#netdata-cloud)" title="View active alerts" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md)" title="Alert notifications" />
|
||||
</BoxList>
|
||||
</Box>
|
||||
<Box
|
||||
title="Troubleshooting with Netdata Cloud">
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md)" title="Metric Correlations" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md)" title="Anomaly Advisor" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/events-feed.md)" title="Events Feed" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md)" title="Metric Correlations" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/anomaly-advisor-tab.md)" title="Anomaly Advisor" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/events-feed.md)" title="Events Feed" />
|
||||
</Box>
|
||||
<Box
|
||||
title="Management and settings">
|
||||
<BoxList>
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/authentication-and-authorization.md)" title="Sign in with email, Google, GitHub or with an SSO tool" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#invite-your-team)" title="Invite your team" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/themes.md)" title="Choose your Netdata Cloud theme" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/role-based-access.md)" title="Role-Based Access" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/plans.md)" title="Paid Plans" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/README.md)" title="Sign in with email, Google, GitHub or with an SSO tool" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#invite-your-team)" title="Invite your team" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/themes.md)" title="Choose your Netdata Cloud theme" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/role-based-access-model.md)" title="Role-Based Access" />
|
||||
<BoxListItemRegexLink to="[](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/netdata-subscription-plans.md)" title="Paid Plans" />
|
||||
</BoxList>
|
||||
</Box>
|
||||
</Grid>
|
||||
|
||||
- Spaces and War Rooms
|
||||
- [Spaces](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-spaces)
|
||||
- [War Rooms](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#netdata-cloud-war-rooms)
|
||||
- [Spaces](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-spaces)
|
||||
- [War Rooms](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#netdata-cloud-war-rooms)
|
||||
- Dashboards
|
||||
- [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/metrics-tab-and-single-node-tabs.md)
|
||||
- [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboard/nodes-tab.md)
|
||||
- [Kubernetes](https://github.com/netdata/netdata/blob/master/docs/dashboard/kubernetes-tab.md)
|
||||
- [Create new dashboards](https://github.com/netdata/netdata/blob/master/docs/dashboard/dashboards-tab.md)
|
||||
- [Metrics tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/metrics-tab-and-single-node-tabs.md)
|
||||
- [Nodes tab](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/nodes-tab.md)
|
||||
- [Kubernetes](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/kubernetes-tab.md)
|
||||
- [Create new dashboards](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/dashboards-tab.md)
|
||||
- Alerts and notifications
|
||||
- [View active alerts](https://github.com/netdata/netdata/blob/master/docs/monitor/view-active-alerts.md#netdata-cloud)
|
||||
- [Alert notifications](https://github.com/netdata/netdata/blob/master/docs/cloud/alerts-notifications/notifications.md)
|
||||
- [View active alerts](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/alerts-tab.md#netdata-cloud)
|
||||
- [Alert notifications](https://github.com/netdata/netdata/blob/master/docs/alerts-and-notifications/notifications/centralized-cloud-notifications/centralized-cloud-notifications-reference.md)
|
||||
- Troubleshooting with Netdata Cloud
|
||||
- [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/metric-correlations.md)
|
||||
- [Anomaly Advisor](https://github.com/netdata/netdata/blob/master/docs/dashboard/anomaly-advisor-tab.md)
|
||||
- [Events Feed](https://github.com/netdata/netdata/blob/master/docs/cloud/insights/events-feed.md)
|
||||
- [Metric Correlations](https://github.com/netdata/netdata/blob/master/docs/metric-correlations.md)
|
||||
- [Anomaly Advisor](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/anomaly-advisor-tab.md)
|
||||
- [Events Feed](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/events-feed.md)
|
||||
- Management and settings
|
||||
- [Sign in with email, Google, GitHub or with an SSO tool](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/authentication-and-authorization.md)
|
||||
- [Invite your team](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/organize-your-infrastrucutre-invite-your-team.md#invite-your-team)
|
||||
- [Choose your Netdata Cloud theme](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/themes.md)
|
||||
- [Role-Based Access](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/role-based-access.md)
|
||||
- [Paid Plans](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/plans.md)
|
||||
- [Sign in with email, Google, GitHub or with an SSO tool](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/README.md)
|
||||
- [Invite your team](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/organize-your-infrastructure-invite-your-team.md#invite-your-team)
|
||||
- [Choose your Netdata Cloud theme](https://github.com/netdata/netdata/blob/master/docs/dashboards-and-charts/themes.md)
|
||||
- [Role-Based Access](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/role-based-access-model.md)
|
||||
- [Paid Plans](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/netdata-subscription-plans.md)
|
|
@ -29,7 +29,7 @@ Yearly plans require specifying a number of committed nodes, which receive a dis
|
|||
|
||||
## Plan Changes and Credit Balance
|
||||
|
||||
You can change your plan, billing frequency, or committed nodes at any time. For guidance, see [updating your plan](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/view-plan-billing.md#update-a-subscription-plan).
|
||||
You can change your plan, billing frequency, or committed nodes at any time. For guidance, see [updating your plan](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/view-plan-and-billing.md#update-a-subscription-plan).
|
||||
|
||||
> **Note**
|
||||
>
|
|
@ -158,10 +158,10 @@ You will be prompted to enter the email addresses of the users you want to invit
|
|||
Next, choose the War Rooms you want to invite these users to. Once logged in, these users are not restricted only to
|
||||
these War Rooms. They can be invited to others, or join any that are public.
|
||||
|
||||
Next, pick a role for the invited user. You can read more about [which roles are available](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/role-based-access.md#what-roles-are-available) based on your [subscription plan](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/plans.md).
|
||||
Next, pick a role for the invited user. You can read more about [which roles are available](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/role-based-access-model.md#what-roles-are-available) based on your [subscription plan](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/netdata-subscription-plans.md).
|
||||
|
||||
Click the **Send** button to send an email invitation, which will prompt them
|
||||
to [sign up](https://github.com/netdata/netdata/blob/master/docs/category-overview-pages/authentication-and-authorization.md) and join your Space.
|
||||
to [sign up](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/README.md) and join your Space.
|
||||
|
||||

|
||||
|
|
@ -14,7 +14,7 @@ This section outlines how to view and manage your Space's plan, billing settings
|
|||
1. Navigate to **Space settings** (the cog above your profile icon).
|
||||
2. Select the **Plan & Billing** tab.
|
||||
3. You'll see:
|
||||
- **Credit** amount, if applicable, usable for future invoices or subscription changes. More on this at [Plan changes and credit balance](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/plans.md#plan-changes-and-credit-balance).
|
||||
- **Credit** amount, if applicable, usable for future invoices or subscription changes. More on this at [Plan changes and credit balance](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/netdata-subscription-plans.md#plan-changes-and-credit-balance).
|
||||
- **Billing email** linked to your subscription, where all related notifications are sent.
|
||||
- A link to the **Billing options and Invoices** in our billing provider's Customer Portal, where you can:
|
||||
- Manage subscriptions and payment methods.
|
||||
|
@ -33,7 +33,7 @@ This section outlines how to view and manage your Space's plan, billing settings
|
|||
> **Note**
|
||||
>
|
||||
> - Checkout is performed directly if there's an active plan.
|
||||
> - Plan changes, including downgrades or cancellations, may impact notification settings or user access. More details at [Plan changes and credit balance](https://github.com/netdata/netdata/blob/master/docs/cloud/manage/plans.md#plan-changes-and-credit-balance).
|
||||
> - Plan changes, including downgrades or cancellations, may impact notification settings or user access. More details at [Plan changes and credit balance](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/netdata-subscription-plans.md#plan-changes-and-credit-balance).
|
||||
|
||||
## FAQ
|
||||
|
|
@ -1,78 +0,0 @@
|
|||
<!--
|
||||
title: "Netdata for IoT"
|
||||
custom_edit_url: https://github.com/netdata/netdata/edit/master/docs/netdata-for-IoT.md
|
||||
sidebar_label: "Netdata for IoT"
|
||||
learn_status: "Published"
|
||||
learn_rel_path: "Miscellaneous"
|
||||
-->
|
||||
|
||||
# Netdata for IoT
|
||||
|
||||

|
||||
|
||||
> New to Netdata? Check its demo: **<https://my-netdata.io/>**
|
||||
>
|
||||
>[](https://registry.my-netdata.io/#netdata_registry)
|
||||
> [](https://registry.my-netdata.io/#netdata_registry)
|
||||
> [](https://registry.my-netdata.io/#netdata_registry)
|
||||
>
|
||||
>[](https://registry.my-netdata.io/#netdata_registry)
|
||||
> [](https://registry.my-netdata.io/#netdata_registry)
|
||||
> [](https://registry.my-netdata.io/#netdata_registry)
|
||||
|
||||
---
|
||||
|
||||
Netdata is a [very efficient](https://github.com/netdata/netdata/blob/master/docs/guides/configure/performance.md)
|
||||
server performance monitoring solution. When running in server hardware, it can collect
|
||||
thousands of system and application metrics **per second** with just 1% CPU utilization of a single core. Its web server
|
||||
responds to most data requests in about **half a millisecond** making its web dashboards spontaneous, amazingly fast!
|
||||
|
||||
Netdata can also be a very efficient real-time monitoring solution for **IoT devices** (RPIs, routers, media players,
|
||||
wifi access points, industrial controllers and sensors of all kinds). Netdata will generally run everywhere a Linux
|
||||
kernel runs (and it is glibc and [musl-libc](https://www.musl-libc.org/) friendly).
|
||||
|
||||
You can use it as both a data collection agent (where you pull data using its API), for embedding its charts on other
|
||||
web pages / consoles, but also for accessing it directly with your browser to view its dashboard.
|
||||
|
||||
The Netdata web API already provides **reduce** functions allowing it to report **average** and **max** for any
|
||||
timeframe. It can also respond in many formats including JSON, JSONP, CSV, HTML. Its API is also a **google charts**
|
||||
provider so it can directly be used by google sheets, google charts, google widgets.
|
||||
|
||||

|
||||
|
||||
Although Netdata has been significantly optimized to lower the CPU and RAM resources it consumes, the plethora of data
|
||||
collection plugins may be inappropriate for weak IoT devices. Please follow
|
||||
the [Netdata Agent performance guide](https://github.com/netdata/netdata/blob/master/docs/guides/configure/performance.md)
|
||||
|
||||
## Monitoring RPi temperature
|
||||
|
||||
The python version of the sensors plugin uses `lm-sensors`. Unfortunately the temperature reading of RPi are not
|
||||
supported by `lm-sensors`.
|
||||
|
||||
Netdata also has a bash version of the sensors plugin that can read RPi temperatures. It is disabled by default to avoid
|
||||
the conflicts with the python version.
|
||||
|
||||
To enable it, run:
|
||||
|
||||
```bash
|
||||
cd /etc/netdata # Replace this path with your Netdata config directory
|
||||
sudo ./edit-config charts.d.conf
|
||||
```
|
||||
|
||||
and uncomment this line:
|
||||
|
||||
```sh
|
||||
sensors=force
|
||||
```
|
||||
|
||||
Then restart Netdata. You will get this:
|
||||
|
||||

|
||||
|
||||
|
|
@ -61,7 +61,7 @@ While encrypting the connection between your parent and child nodes is recommend
|
|||
This example uses self-signed certificates.
|
||||
|
||||
> **Note**
|
||||
> This section assumes you have read the documentation on [how to edit the Netdata configuration files](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration.md).
|
||||
> This section assumes you have read the documentation on [how to edit the Netdata configuration files](https://github.com/netdata/netdata/blob/master/docs/netdata-agent/configuration/README.md).
|
||||
<!-- here we need link to the section that will contain the restarting instructions -->
|
||||
|
||||
1. **Parent node**
|
||||
|
|
|
@ -55,7 +55,7 @@ The access settings for your client are the following:
|
|||
|
||||
### SP-initiated SSO
|
||||
|
||||
If you start your authentication flow from Netdata sign-in page please check [these steps](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/enterprise-sso-authentication.md#from-netdata-sign-up-page).
|
||||
If you start your authentication flow from Netdata sign-in page please check [these steps](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/enterprise-sso-authentication.md#from-netdata-sign-up-page).
|
||||
|
||||
|
||||
### Reference
|
||||
|
|
|
@ -50,6 +50,6 @@ Steps needed to be done on Okta Admin Portal:
|
|||
|
||||
### SP-initiated SSO
|
||||
|
||||
If you start your authentication flow from Netdata sign-in page please check [these steps](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/enterprise-sso-authentication.md#from-netdata-sign-up-page).
|
||||
If you start your authentication flow from Netdata sign-in page please check [these steps](https://github.com/netdata/netdata/blob/master/docs/netdata-cloud/authentication-and-authorization/enterprise-sso-authentication.md#from-netdata-sign-up-page).
|
||||
|
||||
|
||||
|
|
Some files were not shown because too many files have changed in this diff Show more
Loading…
Add table
Add a link
Reference in a new issue