解决了

远程站点复制调整

  • 2018年5月29日
  • 6个答复
  • 1966年的观点

UserLevel 2
徽章 +5
  • 开拓者
  • 10个答复
Nutanix中有哪些选项可以加快远程站点的复制?我能够在单个流上获得的最好的方法是约20Mbps(160Mbps),总速度约为60Mbps(480Mbps),分布在几个流上。但是,我们在网站之间有一个10Gbps的链接,因此我希望获得更好的吞吐量。
图标

最好的答案企鹅2018年6月8日,16:35

@dlink7<\/user-mention> for the good info on expected performance under ideal situations. That does give me a good baseline.
\r\n
\r\nOur issue is definitely latency. Our remote site is across the continental US, and our per VM size is 30TB (10TB used) on a single disk. as you can imagine, this creates a bottleneck for us. Replications with default settings are not meeting RPO.
\r\n
\r\nOur AOS version is 5.5.2.
\r\n
\r\nI have found (with help from support) these tunable settings within the cluster:
\r\n
\r\n
\r\n
code:<\/b>
nutanix@NTNX:~$ python \/home\/nutanix\/serviceability\/bin\/edit-aos-gflags | grep stargate_
2018-06-08 10:00:09 INFO zookeeper_session.py:110 edit-aos-gflags is attempting to connect to Zookeeper
stargate_cerebro_replication_max_rpc_vblocks = 16 #default 4
stargate_cerebro_replication_max_rpc_data = 4194304 #default 1048576
stargate_cerebro_max_outstanding_vdisk_replication_rpcs = 16 #default 4
stargate_cerebro_replication_param_multiplier = 32 #default 16
stargate_vdisk_read_extents_max_outstanding_egroup_reads = 6 #default 3
<\/pre><\/div>
\r\n
\r\nI am putting together a helpful howto on how to tune these parameters, but the gist of it is editing with...
\r\n
\r\n
code:<\/b>
nutanix@NTNX:~$ python \/home\/nutanix\/serviceability\/bin\/edit-aos-gflags --service=stargate
<\/pre><\/div>
\r\n
\r\n...then restarting stargate on each cvm.
\r\n
\r\nThe restarts went perfectly with a sleep 60 in there, we had no failure on running replications, and our throughput went up ~4x as expected. our 12 day running replications began to finish.
\r\n
\r\nmy ultimate solution is two fold. first, the above settings to accelerate each stream. second, we are working to break out our 30TB disk in to 10x 3TB disks.
\r\n
\r\nRE: going off stock; I think that as nutanix grows in to more environments (and they will, the technology is amazing and i foresee continued adoption) there are going to be many more flavors of environments that sticking to stock options just wont solve. I expect in the next few years that nutanix will pivot in to a more open position. I'd love to see some of these settings within reach of the prism GUI along with some better man, info pages on what each individual setting and switch does.
\r\n
\r\nThat being said, your warning about going off stock is noted and appreciated. i understand that nutanix attempts to tune AOS as well as possible out of the box, and that tuning these settings can have an impact (sometimes bad, sometimes good). Therefore, I will only seek a behavior change in the technology to achieve some desired result.","className":"post__content__best_answer"}">
查看原件

该主题已关闭以供评论

6个答复

UserLevel 4
徽章 +19
嗨,您正在摇摆什么AOS版本?

在雅典卫城,每个节点都可以复制四个文件,一次总计100 mb/s。因此,在四节点配置中,群集可以复制400 MB/s或3.2 GB/s。

什么会导致这个数字下降?群集上的其他任务,例如策展人如果策划链链的延迟太高和不良的延迟。

如果您仍在遇到RPO,我不会更改任何内容。下车将稍后再困扰您。
UserLevel 7
徽章 +34
你好 @penguindows

你看到的答复了吗 @dlink7
UserLevel 2
徽章 +5
是的,我看到dlink7,答复。谢谢 @dlink7有关理想情况下预期性能的好信息。这确实给了我一个很好的基准。

我们的问题绝对是延迟。我们的远程站点遍布美国大陆,我们的每VM大小为单个磁盘上的30TB(使用10TB)。可以想象,这为我们创造了瓶颈。具有默认设置的复制不符合RPO。

我们的AOS版本为5.5.2。

我已经(在支持的帮助下)在集群中发现了这些可调设置:


代码:
nutanix@ntnx:〜$ python/home/nutanix/serviceability/bin/edit-aos-gflags |grep stargate_
2018-06-08 10:00:09信息Zookeeper_session.py:110 Edit-Aos-Gflags正在尝试连接到Zookeeper
stargate_cerebro_replication_max_rpc_vblocks = 16 #default 4
stargate_cerebro_replication_max_rpc_data = 4194304 #default 1048576
stargate_cerebro_max_outantate_vdisk_replication_rpcs = 16 #default 4
stargate_cerebro_replication_param_multiplier = 32 #default 16
stargate_vdisk_read_extents_max_outantate_egroup_egroup_reads = 6 #default 3


我正在为如何调整这些参数提供一个有益的方法,但是它的要点正在编辑...

代码:
nutanix@ntnx:〜$ python/home/nutanix/serviceability/bin/edit-aos-gflags -service = stargate


...然后在每个CVM上重新启动星际之门。

重新启动时,那里有60个睡眠,我们在运行复制方面没有失败,我们的吞吐量按预期升高了约4倍。我们的12天运行复制开始完成。

我的最终解决方案是两倍。首先,以上设置加速了每个流。其次,我们正在努力将30TB磁盘分解为10x 3TB磁盘。

回复:库存;我认为,随着Nutanix的发展到更多的环境(它们将会,这项技术是惊人的,我预见的是继续采用),将会有更多的环境风味,这些环境会坚持使用库存期权。我希望在接下来的几年中,Nutanix将转移到更开放的位置。我很想在Prism GUI以及一些更好的男人以及每个单独的设置和切换所做的信息页面上看到其中的一些设置。

话虽这么说,您警告说,并感谢您的警告。我知道Nutanix试图从开箱即用的情况下调整AOS以及可能的调整,并且调整这些设置可能会产生影响(有时是不好的,有时是好)。因此,我只会寻求技术改变行为,以实现一些理想的结果。
Learn more about our cookies.<\/a>","cookiepolicy.button":"Accept cookies","cookiepolicy.button.deny":"Deny all","cookiepolicy.link":"Cookie settings","cookiepolicy.modal.title":"Cookie settings","cookiepolicy.modal.content":"We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.<\/a>","cookiepolicy.modal.level1":"Basic
Functional","cookiepolicy.modal.level2":"Normal
Functional + analytics","cookiepolicy.modal.level3":"Complete
Functional + analytics + social media + embedded videos"}}}">
Baidu