监视第3/4部分指标和Prometheus和Grafana

1年前
2020年10月6日
3个答复
2125意见

UserLevel 2

+12

frsbeckum
开拓者
24个答复

除了使用SNMP或基于代理的版本的“经典”监控之外，我们还在谈论指标和OpenSource Project的Livedata的Monitorng普罗米修斯。虚拟化是用强大的工具进行的格拉法纳。

对数据的访问将使用所谓的“ node-exporter”进行，我们可以在github下找到。一个出现nutanix呢

Homelab中的我的测试柜是由以下项目构建的：

nutanixcluster <-prometheus <-grafana

192.168.10.80 192.168.10.123 192.168.10.100

Nutanix CE Ubuntu 18.04 Lts Ubuntu 18.04 LTS

Prometheus 2.2.1 Grafana 7.0.4

走1.10

要求

在Nutanix Prism Central中创建新用户，并具有观众权利

2.在Ubuntu 18.04 LTS上安装Prometheus，并使用运行GO！

良好的源这里或者这里。

3.在第二个Ubuntun 18.04 LTS上安装von Grafana 7.x

找到好的来源这里。

开始与Nutanix的联系

我们下载去二进制对于GO/bin文件夹中Prometheus VM的Nutanix出口商

我们将测试与-help命令的GO/BIN共享：

应该这样。不，我们尝试检查我们的新创建的Viewer用户是否能够从Nutanix群集中获得一些指标。如果我们描述没有单独的端口Prometheus使用端口9405。如果您连接到更多，则必须为每个群集描述一个单独的端口！

您可以通过IP连接或为其创建DNS条目。如果需要，可以用变量掩盖用户名/密码！

端口上的结果：9494在我们的Prometheus服务器上的结果如下：

答对了。单击度量，您可以获得所有可用的指标...

这是马努尔的方式。

现在，我们构建一个小的shell脚本，该脚本正在自动进行这些调用。我们还在Prometheus服务器上创建服务，以重新启动！

我们在/etc/systemd/system中创建shell脚本作为新服务！

bash-script将在份额中创建/go/bin（示例代码）

2.创建新服务/etc/systemd/system

在我的Homelab中，我使用了根用户！不要在生产环境中这样做！为这些任务创建特定用户！

使用“ SystemCtl启用Prometheus_nutanix.Service”启用服务

我们重新启动VM并检查一切是否正常，现在自动运行：

SystemCtl状态Prometheus_nutanix.Service

现在，我们将在Prometheus上的“给予：港口”上找到指标。但是，对于与格拉法纳的合作，出口商需要被宣布为目标呢

modify/etc/prometheus/prometheus.yml并放入以下部分！重新启动Prometheus服务！

如果存在新创建的目标，请在端口上控制默认的Prometheus：9090。

美好的！现在我们切换到Grafana

我们将Prometheus宣布为Grafana的新数据源

现在，我们在Prometheus/Nutanix下创建了一个新的仪表板，并选择了选择指标。

注意力呢它使在Prometheus配置中使用有效名称声明每个群集！

我仅创建了一个简单的示例仪表板，其中包括VM帐户，内存等。

如果您想从此开始，只需从我的github存储库下载json文件这里并将其作为新的仪表板导入。

处理技巧：

将所有指标从Prometheus指标站点导出到记事本++ / sublime，以方便搜索！

帮助和类型背后的价值是无关紧要的。但是Metrik的名称是关键！将其从这里复制并粘贴到Grafana的选择！

传说默认显示了度量的名称。但是你可以

a）手工覆盖它

b）使用{{cluster}}或{{node}}的变量

通过/1024/1024/1024等转换字节，以MB/GB/TB为

玩仪表板玩得开心……。

Beside the \u201cclassical\u201d Monitoring with snmp or agent based version now we are talking about Metrics and the Monitorng of LiveData with\u00a0the OpenSource Project\u00a0Prometheus<\/a>. The Virtualization is made with the powerfull Tool of Grafana<\/a>.<\/p>

The access to the Data will be made with so called \u201cNode-Exporter\u201d which could we found under GitHUB. One is present for\u00a0 Nutanix<\/a>!<\/p>

\u00a0<\/p>

My Testcase in the homelab is build with the following items:<\/p>

NutanixCluster <- Prometheus <- Grafana<\/p>

192.168.10.80 192.168.10.123 192.168.10.100<\/p>

Nutanix CE Ubuntu 18.04 LTS Ubuntu 18.04 LTS<\/p>

Prometheus 2.2.1 Grafana 7.0.4<\/p>

GO 1.10<\/p>

Requirements<\/strong><\/p>

Create NEW User in Nutanix Prism Central with VIEWER Rights<\/li><\/ol> $\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
2. Installation of Prometheus on Ubuntu 18.04 LTS with running GO!<\/p>
Good Sourcse will\u00a0be available\u00a0 here<\/a>\u00a0or here<\/a>.<\/p>
3. Installation ofvon Grafana 7.x on a second Ubuntun 18.04 LTS<\/p>
Good Source are found here<\/a>.<\/p>
\u00a0<\/p>
Start of Connection to Nutanix<\/strong><\/p>
We download the GO Binary<\/a>\u00a0for the Nutanix Exporter to the Prometheus VM in the folder of GO\/BIN<\/li><\/ol>
We will test the go\/bin share with the --help command:<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
SO is should be. No we try to check if our new created VIEWER User is able to feth some metrics from the nutanix cluster. If we describe no seperate port Prometheus is using port 9405. If you are connection to more then one cluster you have to describe for each cluster a seperate port!<\/p>
\u00a0<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
You could connect via IP or create an DNS Entry for it. Username\/Password could be masked in variables if needed!<\/p>
The Result on Port :9494 on our Prometheus Server is as follows:\u00a0<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
BINGO. With a click on Metric you get all Metrics which are available...<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
THIS was the manuell way.<\/p>
Now we build a little shell script which is doing these call automatically. We also create a Service on the prometheus server to be reboot aware!<\/p>
We create the Shell Script\u00a0 in\u00a0\/etc\/systemd\/system as a new SERVICE!<\/p>
Bash-Script will be created in the share of\u00a0<\/strong>\u00a0\/go\/bin (Example Code)<\/li><\/ol>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$
Other Example could we found in the Github Repo!<\/figcaption><\/figure>
2. Creation of a new Service in\u00a0\/etc\/systemd\/system<\/strong><\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$
In my Homelab i used a root user! Dont do that in a production environment! Create a specific user for these task!<\/figcaption><\/figure>
Enable the service with \u201csystemctl enable prometheus_nutanix.service\u201d<\/em><\/strong><\/p>
We reboot the VM and check if all is fine and running automatically now:<\/p>
systemctl status prometheus_nutanix.service<\/strong>\u00a0<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
We will find the Metrics now on the give :port on the Prometheus. But for the work with Grafana the Exporter needs to be declared as a TARGET<\/strong>\u00a0!<\/p>
Modify \/etc\/prometheus\/prometheus.yml and put in the following section! Restart the Prometheus Service!<\/strong><\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
Controll the default Prometheus on Port :9090 if the new created TARGET is present and working.<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
Fine! Now we switch over to GRAFANA<\/p>
We declare Prometheus as a new Datasource in Grafana<\/li><\/ol>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$
Just use the PrometheusIP with the\u00a0 Default Port 9090!<\/figcaption><\/figure>
Now we created a new Dashboard and select METRICS under\u00a0Prometheus\/Nutanix.(btw. Exportername from the Prometheus Config)<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
Attention<\/strong>! It makes sence to declare EACH Cluster with a valid NAME in the Prometheus Config!<\/p>
I created just a simple Example Dashboard with VM Account, Memory etc.<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
If you like to start with that just download the JSON File from my Github Repo\u00a0here<\/a>\u00a0and import it as a new Dashboard.<\/p>
Handling Tips:<\/p>
Export all Metrics from the Prometheus Metrics Site to Notepad++ \/ Sublime for easy searching!<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
\u00a0<\/p>
The Values behind HELP and TYPE are irrelevant. But the NAME of the Metrik is key! Copy and Paste it from here to the selection in Grafana!<\/p>
\u00a0<\/p>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
The LEGENDS show in default the NAME of the Metric. But you could<\/p>
a) manuell overwrite it<\/p>
b) Use Variable like \u00a0{{cluster}} or\u00a0{{node}}<\/p>
Convert BYTES\u00a0via \/1024\/1024\/1024 etc. in MB\/GB\/TB\u00a0<\/li><\/ol>
$\"//www.jhbzcj.com/next/how-it-works-22/\"$ <\/figure>
\u00a0<\/p>
Have Fun with your Dashboards\u2026\u2026.<\/p>","quoteUsername":"frsbeckum","translations":{"Common":{"like":"Like","unlike":"Unlike"},"Forum":{"Quote":"Quote","Share":"Share"}}}">

喜欢

引用

分享

3个答复

最古老的第一

 新的先来最佳投票

m

mayurk

冒险家

4个答复

1年前
2020年10月6日

感谢您的文档。我们是否有解释这些指标的文档？理想情况应该为他们设置什么阈值？

Thanks for the documentation. Do we have documentation which explain\u00a0these metrics?\u00a0And what ideally should be the threshold set up for them?\u00a0<\/p>","quoteUsername":"MayurK","translations":{"Common":{"like":"Like","unlike":"Unlike"},"Forum":{"Quote":"Quote","Share":"Share"}}}">

喜欢

引用

UserLevel 2

+12

frsbeckum

作者

开拓者

24个答复

1年前
2020年10月6日

如果您看指标，则最多的声明是明确的。这也是Nutanix圣经中缺少的元素...但是我希望它会尽快参加……这是我的Homelab 3节点群的一个例子…

nutanix_cluster_num_random_io {cluster =“ admincafe”} -1.0
nutanix_cluster_num_read_iops {cluster =“ admincafe”} 3.0
nutanix_cluster_num_read_io {cluster =“ admincafe”} 93.0
nutanix_cluster_num_seq_io {cluster =“ admincafe”} -1.0
nutanix_cluster_num_write_iops {cluster =“ admincafe”} 7.0
nutanix_cluster_num_write_io {cluster =“ admincafe”} 212.0
nutanix_cluster_random_io_ppm {cluster =“ admincafe”} -1.0
nutanix_cluster_read_io_bandwidth_kbps {cluster =“ admincafe”} 29.0
nutanix_cluster_read_io_ppm {cluster =“ admincafe”} 304918.0
nutanix_cluster_seq_io_ppm {cluster =“ admincafe”} -1.0
nutanix_cluster_storage_capacity_bytes {cluster =“ admincafe”} 1.2839315724E+12
nutanix_cluster_storage_disk_physical_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_free_bytes {cluster =“ admincafe”} 1.145171744971e+12
nutanix_cluster_storage_logical_usage_bytes {cluster =“ admincafe”} 1.4583942348E+11
nutanix_cluster_storage_resveres_capacity_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_resvere_free_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_reseved_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_tier_das_sata_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_tier_ssd_usage_bytes {cluster =“ admincafe”} 1.39058477856E+11
nutanix_cluster_storage_unresveres_capacity_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_unresveres_free_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_unresverve_own_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_unresverve_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_usage_bytes {cluster =“ admincafe”} 1.38759827456E+11
nutanix_cluster_storage_user_capacity_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_container_own_ousage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_disk_physical_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_free_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_other_containers_reseved_capacity_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_resver_capacity_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_resver_free_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_resverd_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_storage_pool_capacity_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_unreseved_capacity_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_unresvere_free_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_unresverd_own_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_unresverd_shared_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_unresverd_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_storage_user_usage_bytes {cluster =“ admincafe”} 0.0
nutanix_cluster_timespan_usecs {cluster =“ admincafe”} 3E+07
nutanix_cluster_total_io_size_kbytes {cluster =“ admincafe”} 4013.0
nutanix_cluster_total_io_time_usecs {cluster =“ admincafe”} 62667.0
nutanix_cluster_total_read_io_size_kbytes {cluster =“ admincafe”} 889.0
nutanix_cluster_total_read_io_time_usecs {cluster =“ admincafe”} -1.0
nutanix_cluster_total_transformed_usage_bytes {cluster =“ admincafe”} -1.0
nutanix_cluster_total_untransformed_usage_bytes {cluster =“ admincafe”} -1.0
nutanix_cluster_write_io_io_bandwidth_kbps {cluster =“ admincafe”} 104.0
nutanix_cluster_write_io_ppm {cluster =“ admincafe”} 695081.0
nutanix_host_avg_io_latency_usecs {hostName =“ ntnx-739347ed-a”} 309.0
nutanix_host_avg_io_latency_usecs {hostName =“ ntnx-d5104c7d-a”} 153.0
nutanix_host_avg_io_latency_usecs {hostName =“ ntnx-ee409937-a”} 206.0

If you look on the metrics the most declarations are clear. It is also a missing element in the nutanix bible...but i hope it will take part there soon\u2026\u00a0here is an example out of my homelab 3 Node Cluster\u2026<\/p>
nutanix_cluster_num_random_io{cluster=\"Admincafe\"} -1.0
nutanix_cluster_num_read_iops{cluster=\"Admincafe\"} 3.0
nutanix_cluster_num_read_io{cluster=\"Admincafe\"} 93.0
nutanix_cluster_num_seq_io{cluster=\"Admincafe\"} -1.0
nutanix_cluster_num_write_iops{cluster=\"Admincafe\"} 7.0
nutanix_cluster_num_write_io{cluster=\"Admincafe\"} 212.0
nutanix_cluster_random_io_ppm{cluster=\"Admincafe\"} -1.0
nutanix_cluster_read_io_bandwidth_kbps{cluster=\"Admincafe\"} 29.0
nutanix_cluster_read_io_ppm{cluster=\"Admincafe\"} 304918.0
nutanix_cluster_seq_io_ppm{cluster=\"Admincafe\"} -1.0
nutanix_cluster_storage_capacity_bytes{cluster=\"Admincafe\"} 1.283931572427e+12
nutanix_cluster_storage_disk_physical_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_free_bytes{cluster=\"Admincafe\"} 1.145171744971e+12
nutanix_cluster_storage_logical_usage_bytes{cluster=\"Admincafe\"} 1.45839423488e+11
nutanix_cluster_storage_reserved_capacity_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_reserved_free_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_reserved_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_tier_das_sata_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_tier_ssd_usage_bytes{cluster=\"Admincafe\"} 1.39058477856e+11
nutanix_cluster_storage_unreserved_capacity_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_unreserved_free_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_unreserved_own_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_unreserved_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_usage_bytes{cluster=\"Admincafe\"} 1.38759827456e+11
nutanix_cluster_storage_user_capacity_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_container_own_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_disk_physical_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_free_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_other_containers_reserved_capacity_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_reserved_capacity_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_reserved_free_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_reserved_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_storage_pool_capacity_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_unreserved_capacity_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_unreserved_free_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_unreserved_own_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_unreserved_shared_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_unreserved_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_storage_user_usage_bytes{cluster=\"Admincafe\"} 0.0
nutanix_cluster_timespan_usecs{cluster=\"Admincafe\"} 3e+07
nutanix_cluster_total_io_size_kbytes{cluster=\"Admincafe\"} 4013.0
nutanix_cluster_total_io_time_usecs{cluster=\"Admincafe\"} 62667.0
nutanix_cluster_total_read_io_size_kbytes{cluster=\"Admincafe\"} 889.0
nutanix_cluster_total_read_io_time_usecs{cluster=\"Admincafe\"} -1.0
nutanix_cluster_total_transformed_usage_bytes{cluster=\"Admincafe\"} -1.0
nutanix_cluster_total_untransformed_usage_bytes{cluster=\"Admincafe\"} -1.0
nutanix_cluster_write_io_bandwidth_kbps{cluster=\"Admincafe\"} 104.0
nutanix_cluster_write_io_ppm{cluster=\"Admincafe\"} 695081.0
nutanix_host_avg_io_latency_usecs{hostname=\"NTNX-739347ed-A\"} 309.0
nutanix_host_avg_io_latency_usecs{hostname=\"NTNX-d5104c7d-A\"} 153.0
nutanix_host_avg_io_latency_usecs{hostname=\"NTNX-ee409937-A\"} 206.0<\/p>
\u00a0<\/p>","quoteUsername":"frsbeckum","translations":{"Common":{"like":"Like","unlike":"Unlike"},"Forum":{"Quote":"Quote","Share":"Share"}}}">

喜欢

引用

j

Jithinkp11

旅行者

1回复

7个月前
2021年6月4日

你好，

在对Nutanix身份验证后，我将面临一个问题。它显示“ 401未经授权”

我通过了URL，用户名和密码。它工作了几个小时，然后开始显示401个未经授权

你能帮忙吗

Hi,<\/p>
I am facing one issue after authenticating to nutanix. Its showing \u201c401 Unauthorized\u201d

I passed the url, username and password. It worked fine couple of hours and then it started showing\u00a0401 Unauthorized

Can you please help<\/p>","quoteUsername":"jithinkp11","translations":{"Common":{"like":"Like","unlike":"Unlike"},"Forum":{"Quote":"Quote","Share":"Share"}}}">

喜欢

引用

回复

由内部提供动力

注册

已经有一个帐户？登录

使用您的帐户登录

登录社区

使用您的帐户登录

输入您的用户名或电子邮件地址。我们将向您发送带有指令的电子邮件以重置您的密码。

用户名或电子邮件

返回概述

扫描病毒文件。

抱歉，我们仍在检查该文件的内容，以确保它可以安全下载。请在几分钟后再试一次。
好的

该文件无法下载

抱歉，我们的病毒扫描仪检测到该文件无法安全下载。
好的