Translate p3. - arma-thesis

commit 8359cad1b7b74e34b3fcec2be2b9f771e6b6bf99
parent 4c124f0ab389147d2a46e0e31b1ed6a507ff0ee9
Author: Ivan Gankevich <igankevich@ya.ru>
Date:   Tue, 13 Jun 2017 14:27:42 +0300

Translate p3.

Diffstat:
arma-thesis-ru.org  | 14 ++++++++++++++
arma-thesis.org  | 5 +++--

2 files changed, 17 insertions(+), 2 deletions(-)
diff --git a/arma-thesis-ru.org b/arma-thesis-ru.org
@@ -2478,6 +2478,20 @@ downstream-объектов метод ~react~ их родителя вызыв
 объектом в качестве аргумента, чтобы позволить родителю собрать результат
 выполнения объекта.
 
+Не существует способа предоставить мелкозернистую отказоустойчивость к сбоям
+узлов кластера, если в программе присутствуют downstream-объекты кроме тех, что
+возвращаются к своим родителям. Вместо этого, код завершения управляющего
+объекта проверяется и пользовательский код для восстановления выполняется. Если
+проверки на ошибку не выполняется, система перезапускает выполнение, начиная с
+первого родительского объекта, который не создавал downstream-объектов. Это
+означает, что если решаемая программой задача имеет информационные зависимости
+между частями вычисляемыми параллельно, и узел выходит из строя во время
+вычисления этих частей, то эти вычисления перезапускается с самого начала,
+отбрасывая части, вычисленные ранее. Такого не происходит в высоко параллельных
+программах, где параллельные части не имеют таких информационных зависимостей
+между друг другом: в этом случае только части с вышедших из строя узлов
+вычисляются заново, а все ранее вычисленные части сохраняются.
+
 ** Реализация для систем с общей памятью (SMP)
 **** Алгоритм распределения нагрузки.
 Наиболее простым и широко применяемым подходом к распределению нагрузки на
diff --git a/arma-thesis.org b/arma-thesis.org
@@ -3349,7 +3349,7 @@ make it possible for a parent to collect the result of the execution.
 There is no way to provide fine-grained resilience to cluster node failures, if
 there are downstream kernels in the programme, except the ones returning to
 their parents. Instead, an exit code of the kernel is checked and a custom
-recovery action is executed. If there is no error checking, the system restarts
+recovery code is executed. If there is no error checking, the system restarts
 execution from the first parent kernel, which did not produce any downstream
 kernels. This means, that if a problem being solved by the programme has
 information dependencies between parts that are computed in parallel, and a node
@@ -3357,7 +3357,8 @@ failure occurs during computation of these parts, then this computation is
 restarted from the very beginning, discarding any already computed parts. This
 does not occur for embarrassingly parallel programmes, where parallel parts do
 not have such information dependencies between each other: in this case only
-failed parts are recomputed and all previously computed parts are retained.
+parts from failed nodes are recomputed and all previously computed parts are
+retained.
 
 ** SMP implementation
 **** Load balancing algorithm.

	arma-thesis
	git clone https://git.igankevich.com/arma-thesis.git
	Log \| Files \| Refs \| LICENSE