Provisioning real-time performance monitoring with Netdata

By Gcore

April 2, 2023

4 min read

Provisioning real-time performance monitoring with Netdata

Introduction

It goes without saying that implementing some type of metrics solution is an often neglected part of any organization’s digital transformation strategy. That said, not all solutions are made the same and many of them can have prohibitively expensive licensing fees. This is where OSS solutions such as Netdata can play a vital role in a company’s IT strategy.

To allow for swift adjustments, metrics must be based on real-time data as much as possible. In a world where data now increasingly drives decision-making across every imaginable realm, using it for internal progress would only make sense. Yet far too often, companies fail to use their data for this purpose.

What is Netdata?

Netdata is a distributed, real-time, performance and health monitoring for systems and applications. It is a highly-optimized monitoring agent you install on all your systems and containers.

Netdata provides insights of everything happening on the systems it runs (including web servers, databases, applications), using interactive web dashboards. It can run autonomously, without any third-party components, or it can be integrated into existing monitoring toolchains (Prometheus, Graphite, OpenTSDB, Kafka, Grafana, and more).

Netdata is highly interactive and **real-time**

Netdata is built around 4 principles:

Per second data collection for all metrics: It is impossible to monitor a 2 second SLA, with 10 second metrics.
Collect and visualize all the metrics from all possible sources: To troubleshoot slowdowns, we need all the available metrics. The console should not provide more metrics.
Meaningful presentation, optimized for visual anomaly detection: Metrics are a lot more than name-value pairs over time. The monitoring tool should know all the metrics. Users should not!
Immediate results, just install and use: Most of our infrastructure is standardized. There is no point to configure everything metric by metric.

Unlike other monitoring solutions that focus on metrics visualization, Netdata helps you troubleshoot slowdowns without touching the console.

What can I monitor?

Netdata data collection is extensible – you can monitor anything you can get a metric for. Its Plugin API supports all programing languages (anything can be a Netdata plugin, BASH, Python, Perl, Node.js, Java, Go, Ruby, etc).

For better performance, most system-related plugins (CPU, memory, disks, filesystems, networking, etc) have been written in C.
For faster development and easier contributions, most application related plugins (databases, web servers, etc) have been written in python.

So the world is your oyster with Netdata, and it’s better than paying for a super expensive enterprise license.

Installing Netdata

You can install Netdata in a variety of ways depending on your desired scenario.

One line Install

To install Netdata from source, and keep it up to date with our nightly releases automatically, run the following:

bash <(curl -Ss https://my-netdata.io/kickstart.sh)

Docker Install

Quickly start Netdata with the docker command. Netdata is then available at http://host:19999.

docker run -d --name=netdata \  -p 19999:19999 \  -v /etc/passwd:/host/etc/passwd:ro \  -v /etc/group:/host/etc/group:ro \  -v /proc:/host/proc:ro \  -v /sys:/host/sys:ro \  -v /etc/os-release:/host/etc/os-release:ro \  --cap-add SYS_PTRACE \  --security-opt apparmor=unconfined \  netdata/netdata

The above can be converted to a docker-compose.yml file for ease of management:

version: '3'services:  netdata:    image: netdata/netdata    hostname: example.com # set to fqdn of host    ports:      - 19999:19999    cap_add:      - SYS_PTRACE    security_opt:      - apparmor:unconfined    volumes:      - /etc/passwd:/host/etc/passwd:ro      - /etc/group:/host/etc/group:ro      - /proc:/host/proc:ro      - /sys:/host/sys:ro

Access the dashboard

Open up your web browser of choice and navigate to http://YOUR-HOST:19999. Welcome to Netdata!

Navigating the standard dashboard

Beyond charts, the standard dashboard can be broken down into three key areas:

Sections
Menus/Submenus
Nodes menu

Sections

Netdata is broken up into multiple sections, such as System Overview, CPU, Disk, and more. Inside each section, you’ll find a number of charts, broken down into contexts and families.

An example of the Memory section on a Linux desktop system.

All sections and their associated charts appear on a single “page,” so all you need to do to view different sections is scroll up and down the page. But it’s usually quicker to use the menus.

Menus

Menus appears on the right-hand side of the standard dashboard. Netdata generates a menu for each section, and menus link to the section they’re associated with.

Most menu items will contain several Submenu entries, which represent any families from that section. Netdata automatically generates these Submenu entries.

Here’s a Disks menu with several submenu entries for each disk drive and partition Netdata recognizes.

Nodes menu

The nodes menu appears in the top-left corner of the standard dashboard and is labeled with the hostname of the system Netdata is monitoring.

Clicking on it will display a drop-down menu of any nodes you might have connected via the Netdata registry. By default, you’ll find nothing under the My nodes heading, but you can try out any of the Demo Netdata Nodes to see how the nodes menu works.

Once you add nodes via Netdata Cloud or a private registry, you will see them appear under the My nodes heading.

The nodes menu will also show the master netdata node and all slave nodes streaming to that master, if you have configured streaming.

Customizing the standard dashboard

Netdata stores information about individual charts in the dashboard_info.js file. This file includes section and subsection headings, descriptions, colors, titles, tooltips, and other information for Netdata to render on the dashboard.

For example, here is how dashboard_info.js defines the System Overview section.

netdataDashboard.menu = {  'system': {    title: 'System Overview',    icon: '<i class="fas fa-bookmark"></i>',    info: 'Overview of the key system metrics.'  },

If you want to customize this information, you should avoid editing dashboard_info.js directly. These changes are not persistent; Netdata will overwrite the file when it’s updated. Instead, you should create a new file with your customizations.

That’s it!

Now you’re ready to monitor your metrics in real time with Netdata dashboards. I hope you liked this article and better yet learned something from it. Thanks for reading! See you next time.

Optimize your workload: a guide to selecting the best virtual machine configuration

Virtual machines (VMs) offer the flexibility, scalability, and cost-efficiency that businesses need to optimize workloads. However, choosing the wrong setup can lead to poor performance, wasted resources, and unnecessary costs.In this guide, we’ll walk you through the essential factors to consider when selecting the best virtual machine configuration for your specific workload needs.﹟1 Understand your workload requirementsThe first step in choosing the right virtual machine configuration is understanding the nature of your workload. Workloads can range from light, everyday tasks to resource-intensive applications. When making your decision, consider the following:Compute-intensive workloads: Applications like video rendering, scientific simulations, and data analysis require a higher number of CPU cores. Opt for VMs with multiple processors or CPUs for smoother performance.Memory-intensive workloads: Databases, big data analytics, and high-performance computing (HPC) jobs often need more RAM. Choose a VM configuration that provides sufficient memory to avoid memory bottlenecks.Storage-intensive workloads: If your workload relies heavily on storage, such as file servers or applications requiring frequent read/write operations, prioritize VM configurations that offer high-speed storage options, such as SSDs or NVMe.I/O-intensive workloads: Applications that require frequent network or disk I/O, such as cloud services and distributed applications, benefit from VMs with high-bandwidth and low-latency network interfaces.﹟2 Consider VM size and scalabilityOnce you understand your workload’s requirements, the next step is to choose the right VM size. VM sizes are typically categorized by the amount of CPU, memory, and storage they offer.Start with a baseline: Select a VM configuration that offers a balanced ratio of CPU, RAM, and storage based on your workload type.Scalability: Choose a VM size that allows you to easily scale up or down as your needs change. Many cloud providers offer auto-scaling capabilities that adjust your VM’s resources based on real-time demand, providing flexibility and cost savings.Overprovisioning vs. underprovisioning: Avoid overprovisioning (allocating excessive resources) unless your workload demands peak capacity at all times, as this can lead to unnecessary costs. Similarly, underprovisioning can affect performance, so finding the right balance is essential.﹟3 Evaluate CPU and memory considerationsThe central processing unit (CPU) and memory (RAM) are the heart of a virtual machine. The configuration of both plays a significant role in performance. Workloads that need high processing power, such as video encoding, machine learning, or simulations, will benefit from VMs with multiple CPU cores. However, be mindful of CPU architecture—look for VMs that offer the latest processors (e.g., Intel Xeon, AMD EPYC) for better performance per core.It’s also important that the VM has enough memory to avoid paging, which occurs when the system uses disk space as virtual memory, significantly slowing down performance. Consider a configuration with more RAM and support for faster memory types like DDR4 for memory-heavy applications.﹟4 Assess storage performance and capacityStorage performance and capacity can significantly impact the performance of your virtual machine, especially for applications requiring large data volumes. Key considerations include:Disk type: For faster read/write operations, opt for solid-state drives (SSDs) over traditional hard disk drives (HDDs). Some cloud providers also offer NVMe storage, which can provide even greater speed for highly demanding workloads.Disk size: Choose the right size based on the amount of data you need to store and process. Over-allocating storage space might seem like a safe bet, but it can also increase costs unnecessarily. You can always resize disks later, so avoid over-allocating them upfront.IOPS and throughput: Some workloads require high input/output operations per second (IOPS). If this is a priority for your workload (e.g., databases), make sure that your VM configuration includes high IOPS storage options.﹟5 Weigh up your network requirementsWhen working with cloud-based VMs, network performance is a critical consideration. High-speed and low-latency networking can make a difference for applications such as online gaming, video conferencing, and real-time analytics.Bandwidth: Check whether the VM configuration offers the necessary bandwidth for your workload. For applications that handle large data transfers, such as cloud backup or file servers, make sure that the network interface provides high throughput.Network latency: Low latency is crucial for applications where real-time performance is key (e.g., trading systems, gaming). Choose VMs with low-latency networking options to minimize delays and improve the user experience.Network isolation and security: Check if your VM configuration provides the necessary network isolation and security features, especially when handling sensitive data or operating in multi-tenant environments.﹟6 Factor in cost considerationsWhile it’s essential that your VM has the right configuration, cost is always an important factor to consider. Cloud providers typically charge based on the resources allocated, so optimizing for cost efficiency can significantly impact your budget.Consider whether a pay-as-you-go or reserved model (which offers discounted rates in exchange for a long-term commitment) fits your usage pattern. The reserved option can provide significant savings if your workload runs continuously. You can also use monitoring tools to track your VM’s performance and resource usage over time. This data will help you make informed decisions about scaling up or down so you’re not paying for unused resources.﹟7 Evaluate security featuresSecurity is a primary concern when selecting a VM configuration, especially for workloads handling sensitive data. Consider the following:Built-in security: Look for VMs that offer integrated security features such as DDoS protection, web application firewall (WAF), and encryption.Compliance: Check that the VM configuration meets industry standards and regulations, such as GDPR, ISO 27001, and PCI DSS.Network security: Evaluate the VM's network isolation capabilities and the availability of cloud firewalls to manage incoming and outgoing traffic.﹟8 Consider geographic locationThe geographic location of your VM can impact latency and compliance. Therefore, it’s a good idea to choose VM locations that are geographically close to your end users to minimize latency and improve performance. In addition, it’s essential to select VM locations that comply with local data sovereignty laws and regulations.﹟9 Assess backup and recovery optionsBackup and recovery are critical for maintaining data integrity and availability. Look for VMs that offer automated backup solutions so that data is regularly saved. You should also evaluate disaster recovery capabilities, including the ability to quickly restore data and applications in case of failure.﹟10 Test and iterateFinally, once you've chosen a VM configuration, testing its performance under real-world conditions is essential. Most cloud providers offer performance monitoring tools that allow you to assess how well your VM is meeting your workload requirements.If you notice any performance bottlenecks, be prepared to adjust the configuration. This could involve increasing CPU cores, adding more memory, or upgrading storage. Regular testing and fine-tuning means that your VM is always optimized.Choosing a virtual machine that suits your requirementsSelecting the best virtual machine configuration is a key step toward optimizing your workloads efficiently, cost-effectively, and without unnecessary performance bottlenecks. By understanding your workload’s needs, considering factors like CPU, memory, storage, and network performance, and continuously monitoring resource usage, you can make informed decisions that lead to better outcomes and savings.Whether you're running a small application or large-scale enterprise software, the right VM configuration can significantly improve performance and cost. Gcore offers a wide range of virtual machine options that can meet your unique requirements. Our virtual machines are designed to meet diverse workload requirements, providing dedicated vCPUs, high-speed storage, and low-latency networking across 30+ global regions. You can scale compute resources on demand, benefit from free egress traffic, and enjoy flexible pricing models by paying only for the resources in use, maximizing the value of your cloud investments.Contact us to discuss your VM needs

Provisioning real-time performance monitoring with Netdata

Introduction

What is Netdata?

What can I monitor?

Installing Netdata

One line Install

Docker Install

Access the dashboard

Navigating the standard dashboard

Sections

Menus

Nodes menu

Customizing the standard dashboard

That’s it!

Related articles

Pre-configure your dev environment with Gcore VM init scripts

How to cut egress costs and speed up delivery using Gcore CDN and Object Storage

Bare metal vs. virtual machines: performance, cost, and use case comparison

Optimize your workload: a guide to selecting the best virtual machine configuration

How to get the size of a directory in Linux

How to Run Hugging Face Spaces on Gcore Inference at the Edge

Subscribe to our newsletter