Glossary entry (derived from question below)
Feb 9, 2017 03:40
7 yrs ago
2 viewers *
English term
gradients
English to Chinese
Tech/Engineering
IT (Information Technology)
PaddlePaddle
Some other approaches use a set of parameter servers to collectively hold a very large model in the CPU memory space on multiple hosts. But in practice, it is not often that we have such big models, because it would be very inefficient to handle very large model due to the limitation of GPU memory. In our configuration, multiple parameter servers are mostly for fast communications. Suppose there is only one parameter server process working with all trainers, the parameter server would have to aggregate {gradients} from all trainers and becomes a bottleneck.
在深度学习领域,这个gradients是指“梯度”还是“梯度数据”?还是其他?
在深度学习领域,这个gradients是指“梯度”还是“梯度数据”?还是其他?
Proposed translations
(Chinese)
5 | 变化 | Patrick Cheng |
3 | 梯度 | Richard Lin |
Change log
Feb 11, 2017 05:24: Patrick Cheng Created KOG entry
Proposed translations
2 hrs
Selected
变化
读了一下原文,每台trainer有模型的当地拷贝,并利用当地数据对模型进行更新,然后把变化传送回去。premeter server负责把所有更新加以综合(aggregate gradients)。
gradient本身有‘变化率’的意思,这里翻成‘变化’似乎比‘更新’要好一点。下面是说明二者关系的原文:
During the training process, trainers send model updates to parameter servers, parameter servers are responsible for aggregating these updates, so that trainers can synchronize their local copy with the global model.
gradient本身有‘变化率’的意思,这里翻成‘变化’似乎比‘更新’要好一点。下面是说明二者关系的原文:
During the training process, trainers send model updates to parameter servers, parameter servers are responsible for aggregating these updates, so that trainers can synchronize their local copy with the global model.
4 KudoZ points awarded for this answer.
Comment: "谢谢!"
2 hrs
梯度
供參考
Something went wrong...