解决了

远程站点复制调整

  • 2018年5月29日
  • 6个答复
  • 1966年的观点

UserLevel 2
徽章 +5
  • 开拓者
  • 10个答复
Nutanix中有哪些选项可以加快复制到远程站点?我能够在一个流中获得的最好的方法是约20Mbps(160Mbps),总速度约为60Mbps(480Mbps),分布在几个流上。但是,我们在网站之间有一个10Gbps的链接,因此我希望获得更好的吞吐量。
图标

最好的答案企鹅2018年6月8日,16:35

@dlink7<\/user-mention> for the good info on expected performance under ideal situations. That does give me a good baseline.
\r\n
\r\nOur issue is definitely latency. Our remote site is across the continental US, and our per VM size is 30TB (10TB used) on a single disk. as you can imagine, this creates a bottleneck for us. Replications with default settings are not meeting RPO.
\r\n
\r\nOur AOS version is 5.5.2.
\r\n
\r\nI have found (with help from support) these tunable settings within the cluster:
\r\n
\r\n
\r\n
code:<\/b>
nutanix@NTNX:~$ python \/home\/nutanix\/serviceability\/bin\/edit-aos-gflags | grep stargate_
2018-06-08 10:00:09 INFO zookeeper_session.py:110 edit-aos-gflags is attempting to connect to Zookeeper
stargate_cerebro_replication_max_rpc_vblocks = 16 #default 4
stargate_cerebro_replication_max_rpc_data = 4194304 #default 1048576
stargate_cerebro_max_outstanding_vdisk_replication_rpcs = 16 #default 4
stargate_cerebro_replication_param_multiplier = 32 #default 16
stargate_vdisk_read_extents_max_outstanding_egroup_reads = 6 #default 3
<\/pre><\/div>
\r\n
\r\nI am putting together a helpful howto on how to tune these parameters, but the gist of it is editing with...
\r\n
\r\n
code:<\/b>
nutanix@NTNX:~$ python \/home\/nutanix\/serviceability\/bin\/edit-aos-gflags --service=stargate
<\/pre><\/div>
\r\n
\r\n...then restarting stargate on each cvm.
\r\n
\r\nThe restarts went perfectly with a sleep 60 in there, we had no failure on running replications, and our throughput went up ~4x as expected. our 12 day running replications began to finish.
\r\n
\r\nmy ultimate solution is two fold. first, the above settings to accelerate each stream. second, we are working to break out our 30TB disk in to 10x 3TB disks.
\r\n
\r\nRE: going off stock; I think that as nutanix grows in to more environments (and they will, the technology is amazing and i foresee continued adoption) there are going to be many more flavors of environments that sticking to stock options just wont solve. I expect in the next few years that nutanix will pivot in to a more open position. I'd love to see some of these settings within reach of the prism GUI along with some better man, info pages on what each individual setting and switch does.
\r\n
\r\nThat being said, your warning about going off stock is noted and appreciated. i understand that nutanix attempts to tune AOS as well as possible out of the box, and that tuning these settings can have an impact (sometimes bad, sometimes good). Therefore, I will only seek a behavior change in the technology to achieve some desired result.","className":"post__content__best_answer"}">
查看原件

该主题已关闭以供评论

6个答复

Userlevel 4
徽章 +19
嗨,您正在摇摆什么AOS版本?

在雅典卫城,每个节点都可以一次复制四个文件,一次总计100 MB/s。因此,在四节点配置中,群集可以复制400 Mb/s或3.2 GB/s。

什么会导致这个数字下降?群集上的其他任务,例如策展人如果策划链链的延迟太高且延迟不良。

如果您仍在遇到您的RP​​O,我不会更改任何内容。下车将稍后再困扰您。
UserLevel 7
徽章 +34
你好 @penguindows

你看到了 @dlink7
UserLevel 2
徽章 +5
是的,我看到Dlink7,答复。谢谢 @dlink7有关理想情况下预期性能的好信息。这确实给了我一个很好的基准。

我们的问题绝对是延迟。我们的远程站点位于美国大陆,我们的每VM大小为单个磁盘上的30TB(使用10TB)。可以想象,这为我们创造了瓶颈。具有默认设置的复制不符合RPO。

我们的AOS版本为5.5.2。

我发现(在支持的帮助下)这些可调设置在集群中:


代码:
nutanix@ntnx:〜$ python/home/nutanix/serviceability/bin/edit-aos-gflags |grep stargate_
2018-06-08 10:00:09信息Zookeeper_session.py:110 Edit-Aos-Gflags正在尝试连接到Zookeeper
stargate_cerebro_replication_max_rpc_vblocks = 16 #default 4
stargate_cerebro_replication_max_rpc_data = 4194304 #default 1048576
stargate_cerebro_max_outantate_vdisk_replication_rpcs = 16 #default 4
stargate_cerebro_replication_param_multiplier = 32 #default 16
stargate_vdisk_read_extents_max_outantate_egroup_egroup_reads = 6 #default 3


我正在为如何调整这些参数提供一个有益的方法,但是它的要点是...

代码:
nutanix@ntnx:〜$ python/home/nutanix/serviceability/bin/edit-aos-gflags -service = stargate


...然后在每个CVM上重新启动星际之门。

重新启动时,那里有60个睡眠,我们在跑步复制方面没有失败,我们的吞吐量如预期的〜4倍。我们的12天运行复制开始完成。

我的最终解决方案是两倍。首先,以上设置加速了每个流。其次,我们正在努力将30TB磁盘分解为10x 3TB磁盘。

回复:下库存;我认为,随着Nutanix在更多的环境中的发展(它们会的,这项技术是惊人的,我预见的是继续采用),将会有更多的环境风味,而这些环境会坚持使用股票期权。我希望在接下来的几年中,Nutanix将转移到更开放的位置。我很想在Prism Gui以及一些更好的人以及每个单独的设置和切换的信息页面上看到其中的一些设置。

话虽这么说,您要注意并感谢您警告。我知道Nutanix试图从开箱即用的情况下调整AOS以及可能的调整,并且调整这些设置可能会产生影响(有时是不好的,有时是好)。因此,我只会寻求技术的行为改变,以实现一些预期的结果。
Learn more about our cookies.<\/a>","cookiepolicy.button":"Accept cookies","cookiepolicy.button.deny":"Deny all","cookiepolicy.link":"Cookie settings","cookiepolicy.modal.title":"Cookie settings","cookiepolicy.modal.content":"We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.<\/a>","cookiepolicy.modal.level1":"Basic
Functional","cookiepolicy.modal.level2":"Normal
Functional + analytics","cookiepolicy.modal.level3":"Complete
Functional + analytics + social media + embedded videos"}}}">
Baidu