Structs
- GradAccumulator - Gradient accumulation helper.
Functions
- clip_grad_norm - Clip gradients by their global L2 norm.
- clip_grad_value - Clamp each gradient element to [-max_value, max_value].
- grad_norm - Compute the global L2 norm of all gradients without clipping.
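
The three operations listed above can be illustrated with a minimal, standalone sketch. It is not this crate's implementation: the function names (global_l2_norm, clip_by_global_norm, clamp_by_value), the use of plain Vec<f32> buffers in place of tensors, and the choice to return the pre-clip norm are all assumptions made for illustration. The actual signatures of clip_grad_norm, clip_grad_value, and grad_norm may differ.

```rust
/// Hypothetical helper: global L2 norm, i.e. the square root of the sum of
/// squares over every element of every gradient buffer.
fn global_l2_norm(grads: &[Vec<f32>]) -> f32 {
    grads
        .iter()
        .flat_map(|g| g.iter())
        .map(|x| x * x)
        .sum::<f32>()
        .sqrt()
}

/// Hypothetical helper: if the global norm exceeds `max_norm`, rescale every
/// gradient in place by `max_norm / norm`. Returns the norm measured before
/// clipping (an assumed convention, not confirmed by the source).
fn clip_by_global_norm(grads: &mut [Vec<f32>], max_norm: f32) -> f32 {
    let total = global_l2_norm(grads);
    if total > max_norm {
        let scale = max_norm / total;
        for g in grads.iter_mut() {
            for x in g.iter_mut() {
                *x *= scale;
            }
        }
    }
    total
}

/// Hypothetical helper: clamp each element to [-max_value, max_value].
/// `max_value` must be non-negative, otherwise `f32::clamp` panics.
fn clamp_by_value(grads: &mut [Vec<f32>], max_value: f32) {
    for g in grads.iter_mut() {
        for x in g.iter_mut() {
            *x = (*x).clamp(-max_value, max_value);
        }
    }
}
```

Norm clipping rescales all gradients by one shared factor, so the direction of the overall update is preserved. Value clamping acts on each element independently and can change that direction.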