[0.0, 4.0, 16.0]
[0.48, 1.5808, 2.4989288]
[0.6089933, 1.087786, 1.1832783]
[0.68653125, 0.82346296, 0.6780912]
[0.7401202, 0.65483475, 0.42880854]
[0.77992785, 0.5370076, 0.28837714]
[0.8108626, 0.44982052, 0.2023385]
[0.83566165, 0.3826909, 0.14645232]
[0.85600054, 0.3294704, 0.10855074]
[0.8729749, 0.28632116, 0.08197981]
[0.8873373, 0.25071096, 0.06285599]
[0.8996254, 0.2208991, 0.048796415]
[0.9102352, 0.19564486, 0.03827691]
[0.91946596, 0.17403984, 0.030289866]
[0.9275488, 0.15540075, 0.024149394]
[0.93466556, 0.13920641, 0.019378424]
[0.9409614, 0.12504816, 0.015637042]
[0.94655395, 0.112605095, 0.0126799075]
[0.95153964, 0.101617336, 0.010326083]
[0.9559983, 0.09187603, 0.008441205]
这将收敛到1。
如您所见,结果的x坐标被解释为围绕最优解x=1的振荡。x坐标越远,产生的梯度越大。最后,损失超过最大值
tf.float32
可以表达,导致
inf
.