* Add a `ping` command to netdatacli to check if agent is alive.
This provides a way to trivially check if the agent itself appears to be
running (namely, the command parser for netdatacli in the agent itself
is working and responding), allowing users to check this without having
to rely on us continuing to have `help` be a command sent to the agent
instead of executing locally.
* Add a basic health check to our docke rimages.
This adds a relatively basic health checker script to our Docker images.
By default it verifies that the `/api/v1/info` endpoint returns a 200
status code.
It also supports checking different endpoints or using `netdatacli ping`
to check that Netdata is running, all controlled by a new Docker
environment variable: `NETDATA_HEALTH_CHECK`.
* Avoid unnessecary `chmod` in Dockerfile.
Suggested by @prologic.
* Fix typo in docs.
* Update environment variable name to be more clear.
Also add `-L` to `curl` command in health check to follow redirects.
* Removeed Polyverse Polymorphic Linux from Docker images.
It ends up using roughly 140MB of extra space, only works for 64-bit x86,
and the benefits for a majority of our users range from questionale to
completely non-existent.
* Update docs and add runtime support for Polyverse.
This PR merges the feature-branch to make the cloud live. It contains the following work:
Co-authored-by: Andrew Moss <1043609+amoss@users.noreply.github.com(opens in new tab)>
Co-authored-by: Jacek Kolasa <jacek.kolasa@gmail.com(opens in new tab)>
Co-authored-by: Austin S. Hemmelgarn <austin@netdata.cloud(opens in new tab)>
Co-authored-by: James Mills <prologic@shortcircuit.net.au(opens in new tab)>
Co-authored-by: Markos Fountoulakis <44345837+mfundul@users.noreply.github.com(opens in new tab)>
Co-authored-by: Timotej S <6674623+underhood@users.noreply.github.com(opens in new tab)>
Co-authored-by: Stelios Fragkakis <52996999+stelfrag@users.noreply.github.com(opens in new tab)>
* dashboard with new navbars, v1.0-alpha.9: PR #8478
* dashboard v1.0.11: netdata/dashboard#76
Co-authored-by: Jacek Kolasa <jacek.kolasa@gmail.com(opens in new tab)>
* Added installer code to bundle JSON-c if it's not present. PR #8836
Co-authored-by: James Mills <prologic@shortcircuit.net.au(opens in new tab)>
* Fix claiming config PR #8843
* Adds JSON-c as hard dep. for ACLK PR #8838
* Fix SSL renegotiation errors in old versions of openssl. PR #8840. Also - we have a transient problem with opensuse CI so this PR disables them with a commit from @prologic.
Co-authored-by: James Mills <prologic@shortcircuit.net.au(opens in new tab)>
* Fix claiming error handling PR #8850
* Added CI to verify JSON-C bundling code in installer PR #8853
* Make cloud-enabled flag in web/api/v1/info be independent of ACLK build success PR #8866
* Reduce ACLK_STABLE_TIMEOUT from 10 to 3 seconds PR #8871
* remove old-cloud related UI from old dashboard (accessible now via /old suffix) PR #8858
* dashboard v1.0.13 PR #8870
* dashboard v1.0.14 PR #8904
* Provide feedback on proxy setting changes PR #8895
* Change the name of the connect message to update during an ongoing session PR #8927
* Fetch active alarms from alarm_log PR #8944
The default cloud url has been updated to app.netdata.cloud ready for the release. The claiming process now checks the current user executing claiming and refuses to perform the claim for the wrong user. If the current UID is 0 then claiming proceeds but the file ownership is adjusted to be the correct netdata user. The default expected user is `netdata` unless the script can identify the user from the current configuration. After the claiming script is executed the CLI is used to reload the claiming state.
This adds a build argument to the main Dockerfile to allow passing extra
options to the netdata-installer script during the build process.
The value specified in the `EXTRA_INSTALL_OPTS` argument wil be added
verbatm to the netdata-installer.sh script. The arguments
`--dont-start-it` and `--dont-wait` are still implicitly passed to the
script as those are required to get a correctly functioning Docker
image.
* Added support for opting out of telemtry via the DO_NOT_TRACK environment variable
* Added support for DO_NOT_TRACK=1 in anonymous-statistics.sh and minor cleanup in Dockerfile and run.sh entrypoint
* Allow DO_NOT_TRACK to be either non-zero or non-empty
* Update md5sum of kickstart-static64.sh in docs
* Fixed a bug in netdata-installer.sh
* Revert changes to daemon/main.c (testing onyl)
* Update docs/anonymous-statistics.md
Co-Authored-By: Mansour Behabadi <57921115+ncmans@users.noreply.github.com>
Co-authored-by: Mansour Behabadi <57921115+ncmans@users.noreply.github.com>
* Show notice of anonymous data collection
* Add DO_NOT_TRACK env var to docker run script
* Add disable-telemetry option to kickstart-static64
* Check if DO_NOT_TRACK is set
* Allow building without pushing
This enables easier local testing
* Refactor fetching Docker container names to be safer
Fixes#5680
* Wrap shell variable with quotes
And change spaces to tabs
* Make cgroup-name quieter
* Make DOCKER_USR overridable
* Update documentation to explain safe usage
* Remove recommended image for docker socket proxy
* Add capability to pass in a privileged GID
* Fix some documentation typos
* Update documentation to remove socket reference and clean up wording
1) Do not double COPY stuff at the second layer, that does not work like that. Install judy on / and move over just like we do with netdata to bring all over with one COPY
2) After you install judy, remove the stuff you bring in to make install command work for judy
The image has been tested with the latest fixes from the helper-images side.
This change should be merged in AFTER we merge in helper-images fix
* netdata/packaging/docker: fix docker permissions and other things
1. User more consistent shebangs
2. Do not use default values on REPOSITORY, its confusing. Just fail with a message if REPOSITORY not there
3. Set the ownership to root:netdata on the whole /usr/libexec/netdata directory
* netdata/packaging/ci: Attempt to detect CI repo slug value for REPOSITORY, before failing hard
* Fix py2-psycopg2 right after upgrade
This mitigates the issue https://github.com/netdata/netdata/issues/5588
* Improve reinstall script so it doesn't get into broken state
* Update and upgrade before polymorphic linux
* apk fix py2-psycopg2 right after reinstall.
* Removing apk fix - package is now blacklisted upstream
The problem:
After this PR was merged (https://github.com/netdata/netdata/pull/5137),
we noticed on our end that the default Polyverse install script
had room for improvement to ensure there's reliability.
To be more specific, we noticed that while we only expose x86_64 packages,
we noticed a pull to an x86 (32-bit package) that was not expected.
What this meant was, during our first update, we were completely removing
existing mirrors/packages in the install script.
The fix:
Our install script now merely comments out the existing repos/mirrors.
The install instructions then merely update/upgrade all packages in-place,
this preferring our packages as a priority, and the final command
uncomments all the other mirrors/repos that may exist.
The outcome:
Due to this step, ALL existing repos stay on the host after Polymorphic
repositories are sbuscribed. Everything works as expected. However,
the host will prefer Polymorphic repositories above others.