Module nn

Re-exports neural network modules.

Modules§

activation
attention
batchnorm
conv
dropout
embedding
flatten
groupnorm
init
layernorm
linear
loss
metrics
module
rmsnorm
rnn
sequential
transformer

Structs§

AdaptiveAvgPool2d
Adaptive 2D Average Pooling — pools to a fixed output size.
AvgPool2d
2D average-pooling layer.
BatchNorm2d
2D Batch Normalization layer for convolutional networks.
ClassMetrics
Per-class metric report entry.
ConfusionMatrix
NxN confusion matrix. Entry [i][j] = count of samples with true class i predicted as class j.
Conv1d
1D convolution layer.
Conv2d
2D convolutional layer.
Dropout
Applies dropout regularization.
ELU
ELU activation: x if x > 0, alpha * (exp(x) - 1) otherwise
Embedding
A learnable lookup table mapping integer indices to dense vectors.
Flatten
Flatten layer: collapses dimensions [start_dim..=end_dim] into one.
GRU
A multi-step GRU that unrolls a GRUCell over the sequence dimension.
GRUCell
A single-step GRU cell.
GeLU
GELU activation (Gaussian Error Linear Unit). Used in Transformers (BERT, GPT, etc.); scalar sketches of these activation formulas follow this list.
GroupNorm
Group Normalization layer.
LSTM
A multi-step LSTM that unrolls an LSTMCell over the sequence dimension.
LSTMCell
A single-step LSTM cell.
LayerNorm
Layer Normalization: normalizes over the last N dimensions.
LeakyReLU
LeakyReLU activation: max(negative_slope * x, x)
Linear
A fully-connected (dense) layer: y = xW^T + b.
MaxPool2d
2D max-pooling layer.
Mish
Mish activation: x * tanh(softplus(x)) = x * tanh(ln(1 + exp(x)))
MultiHeadAttention
Multi-Head Self-Attention module.
RMSNorm
RMS Normalization layer (used in LLaMA, Mistral, etc.).
RNN
A multi-step vanilla RNN that unrolls an RNNCell over the sequence dimension.
RNNCell
A single-step vanilla RNN cell.
ReLU
ReLU activation: max(0, x)
Sequential
A container that chains modules sequentially.
SiLU
SiLU / Swish activation: x * σ(x). Used in modern architectures (EfficientNet, LLaMA, etc.).
Sigmoid
Sigmoid activation: 1 / (1 + e^(-x))
Tanh
Tanh activation: tanh(x) = (e^x - e^(-x)) / (e^x + e^(-x))
TransformerBlock
A single Transformer block (pre-norm style).
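
The activation structs above are documented by their formulas. As a plain-Rust illustration of those formulas only (scalar versions applied element-wise, not this crate's tensor API, and assuming the common tanh approximation of GELU):

```rust
// Scalar versions of the activation formulas listed above. Illustration only;
// the layers in this module apply them element-wise to tensors.

fn relu(x: f32) -> f32 {
    x.max(0.0)
}

fn leaky_relu(x: f32, negative_slope: f32) -> f32 {
    // max(negative_slope * x, x)
    (negative_slope * x).max(x)
}

fn elu(x: f32, alpha: f32) -> f32 {
    // x if x > 0, alpha * (exp(x) - 1) otherwise
    if x > 0.0 { x } else { alpha * (x.exp() - 1.0) }
}

fn sigmoid(x: f32) -> f32 {
    // 1 / (1 + e^(-x))
    1.0 / (1.0 + (-x).exp())
}

fn silu(x: f32) -> f32 {
    // x * σ(x)
    x * sigmoid(x)
}

fn mish(x: f32) -> f32 {
    // x * tanh(softplus(x)) = x * tanh(ln(1 + exp(x)))
    x * x.exp().ln_1p().tanh()
}

fn gelu(x: f32) -> f32 {
    // Common tanh approximation of GELU (the exact erf-based form differs slightly).
    0.5 * x * (1.0 + ((2.0 / std::f32::consts::PI).sqrt() * (x + 0.044715 * x * x * x)).tanh())
}

fn main() {
    for x in [-2.0_f32, -0.5, 0.0, 0.5, 2.0] {
        println!(
            "x={x:5.1}  relu={:.3}  leaky={:.3}  elu={:.3}  silu={:.3}  mish={:.3}  gelu={:.3}",
            relu(x), leaky_relu(x, 0.01), elu(x, 1.0), silu(x), mish(x), gelu(x)
        );
    }
}
```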

Enums§

Average
How to average per-class metrics for multi-class problems.
Reduction
Reduction mode for loss functions.
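
As a rough illustration of what a reduction mode does when paired with the *_with_reduction loss functions: the loss is first computed element-wise and then collapsed. The variant names in this sketch are assumptions for illustration, not this crate's actual enum.

```rust
// Illustration only: how a reduction mode typically collapses element-wise losses.
// The variant names below are assumed for the sketch, not taken from this crate.

enum Reduction {
    Mean, // average over all elements
    Sum,  // sum over all elements
    None, // return the element-wise losses unchanged
}

fn reduce(elementwise: Vec<f32>, reduction: Reduction) -> Vec<f32> {
    match reduction {
        Reduction::Mean => vec![elementwise.iter().sum::<f32>() / elementwise.len() as f32],
        Reduction::Sum => vec![elementwise.iter().sum::<f32>()],
        Reduction::None => elementwise,
    }
}

fn main() {
    // Element-wise L1 losses |prediction - target| for a toy batch.
    let losses = vec![0.5, 1.0, 1.5, 2.0];
    println!("mean = {:?}", reduce(losses.clone(), Reduction::Mean)); // [1.25]
    println!("sum  = {:?}", reduce(losses.clone(), Reduction::Sum));  // [5.0]
    println!("none = {:?}", reduce(losses, Reduction::None));         // unchanged
}
```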

Traits§

Module
The fundamental trait for all neural network layers.

Functions§

accuracy
Classification accuracy: fraction of correct predictions.
argmax_classes
Compute argmax along the last axis, returning class indices.
bce_loss
Binary Cross-Entropy loss for probabilities in [0, 1].
bce_with_logits_loss
Binary Cross-Entropy with Logits (numerically stable).
classification_report
Per-class precision, recall, F1, and support — like sklearn’s classification_report.
cross_entropy_loss
Cross-entropy loss with log-softmax for numerical stability.
f1_score
F1 Score — harmonic mean of precision and recall.
l1_loss
L1 Loss (Mean Absolute Error): mean(|prediction - target|)
l1_loss_with_reduction
L1 Loss with configurable reduction.
mae
Mean Absolute Error: mean(|y_true - y_pred|).
mape
Mean Absolute Percentage Error: mean(|y_true - y_pred| / |y_true|) * 100.
mse_loss
Mean Squared Error loss: mean((prediction - target)²)
mse_loss_with_reduction
MSE Loss with configurable reduction.
nll_loss
Negative Log-Likelihood Loss with integer class indices.
perplexity
Perplexity from cross-entropy loss: exp(loss).
perplexity_from_log_probs
Perplexity from a flat array of per-token log-probabilities.
precision
Precision for multi-class classification.
r2_score
R² (coefficient of determination).
recall
Recall for multi-class classification.
rmse
Root Mean Squared Error: sqrt(mean((y_true - y_pred)²)).
smooth_l1_loss
Smooth L1 Loss (Huber Loss): quadratic for small errors, linear for large ones (see the sketches after this list).
tensor_accuracy
Compute accuracy directly from logit tensors and one-hot/class-index targets.
top_k_accuracy
Top-K accuracy: fraction of samples where the true class is in the top-K predictions.
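
Several of the losses above advertise numerical stability. As a scalar sketch of the standard stable formulations (an illustration of the math only; whether this crate uses exactly these forms is not shown here): BCE-with-logits folds the sigmoid and the log into one expression, and cross-entropy goes through a max-shifted log-softmax.

```rust
// Scalar sketches of the numerically stable loss formulations mentioned above.
// Illustration only; the functions in this module operate on tensors.

/// Stable binary cross-entropy on a raw logit x with target y in {0, 1}:
/// max(x, 0) - x*y + ln(1 + exp(-|x|)).
fn bce_with_logits(logit: f32, target: f32) -> f32 {
    logit.max(0.0) - logit * target + (-logit.abs()).exp().ln_1p()
}

/// Log-softmax with the usual max-shift so exp() never overflows.
fn log_softmax(logits: &[f32]) -> Vec<f32> {
    let max = logits.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let log_sum_exp = logits.iter().map(|&x| (x - max).exp()).sum::<f32>().ln() + max;
    logits.iter().map(|&x| x - log_sum_exp).collect()
}

/// Cross-entropy for one sample: negative log-probability of the true class.
fn cross_entropy(logits: &[f32], target_class: usize) -> f32 {
    -log_softmax(logits)[target_class]
}

fn main() {
    // Large logits would overflow a naive sigmoid/softmax-then-log implementation.
    println!("bce = {:.4}", bce_with_logits(100.0, 1.0));               // ~0.0
    println!("ce  = {:.4}", cross_entropy(&[1000.0, 0.0, -1000.0], 0)); // ~0.0
    // Perplexity is simply exp of the (mean) cross-entropy loss.
    let loss = cross_entropy(&[2.0, 1.0, 0.5], 1);
    println!("ppl = {:.4}", loss.exp());
}
```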
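The classification metrics above all follow from the confusion-matrix convention documented for ConfusionMatrix: entry [i][j] counts samples of true class i predicted as class j. A plain-Rust sketch of that bookkeeping and of macro-averaged precision/recall/F1 (again an illustration, not this crate's API):

```rust
// Sketch of confusion-matrix-based metrics, using the convention above:
// matrix[i][j] = number of samples with true class i predicted as class j.
// Illustration only, not this crate's ConfusionMatrix type.

fn confusion_matrix(y_true: &[usize], y_pred: &[usize], num_classes: usize) -> Vec<Vec<u32>> {
    let mut m = vec![vec![0u32; num_classes]; num_classes];
    for (&t, &p) in y_true.iter().zip(y_pred) {
        m[t][p] += 1;
    }
    m
}

/// Accuracy = fraction of correct predictions = trace / total.
fn accuracy(m: &[Vec<u32>]) -> f32 {
    let correct: u32 = (0..m.len()).map(|i| m[i][i]).sum();
    let total: u32 = m.iter().flatten().sum();
    correct as f32 / total as f32
}

/// Macro-averaged precision, recall, and F1: compute per class, then average.
fn macro_precision_recall_f1(m: &[Vec<u32>]) -> (f32, f32, f32) {
    let n = m.len();
    let (mut p_sum, mut r_sum, mut f_sum) = (0.0, 0.0, 0.0);
    for c in 0..n {
        let tp = m[c][c] as f32;
        let predicted_c: f32 = (0..n).map(|i| m[i][c] as f32).sum(); // column sum
        let actual_c: f32 = m[c].iter().map(|&x| x as f32).sum();    // row sum
        let precision = if predicted_c > 0.0 { tp / predicted_c } else { 0.0 };
        let recall = if actual_c > 0.0 { tp / actual_c } else { 0.0 };
        let f1 = if precision + recall > 0.0 {
            2.0 * precision * recall / (precision + recall)
        } else {
            0.0
        };
        p_sum += precision;
        r_sum += recall;
        f_sum += f1;
    }
    (p_sum / n as f32, r_sum / n as f32, f_sum / n as f32)
}

fn main() {
    let y_true = [0, 0, 1, 1, 2, 2];
    let y_pred = [0, 1, 1, 1, 2, 0];
    let m = confusion_matrix(&y_true, &y_pred, 3);
    println!("accuracy = {:.3}", accuracy(&m));
    let (p, r, f1) = macro_precision_recall_f1(&m);
    println!("macro precision = {p:.3}, recall = {r:.3}, f1 = {f1:.3}");
}
```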
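Finally, the regression-style metrics and Smooth L1 are direct transcriptions of their formulas. A slice-level sketch (illustration only; the Smooth L1 threshold is assumed to be 1.0 here, the conventional default):

```rust
// Slice sketches of the regression metrics and Smooth L1 listed above.
// Illustration of the formulas only, not this crate's functions.

fn mae(y_true: &[f32], y_pred: &[f32]) -> f32 {
    // mean(|y_true - y_pred|)
    y_true.iter().zip(y_pred).map(|(t, p)| (t - p).abs()).sum::<f32>() / y_true.len() as f32
}

fn rmse(y_true: &[f32], y_pred: &[f32]) -> f32 {
    // sqrt(mean((y_true - y_pred)^2))
    (y_true.iter().zip(y_pred).map(|(t, p)| (t - p).powi(2)).sum::<f32>() / y_true.len() as f32)
        .sqrt()
}

fn mape(y_true: &[f32], y_pred: &[f32]) -> f32 {
    // mean(|y_true - y_pred| / |y_true|) * 100
    y_true.iter().zip(y_pred).map(|(t, p)| ((t - p) / t).abs()).sum::<f32>()
        / y_true.len() as f32
        * 100.0
}

fn r2_score(y_true: &[f32], y_pred: &[f32]) -> f32 {
    // 1 - SS_res / SS_tot
    let mean = y_true.iter().sum::<f32>() / y_true.len() as f32;
    let ss_res: f32 = y_true.iter().zip(y_pred).map(|(t, p)| (t - p).powi(2)).sum();
    let ss_tot: f32 = y_true.iter().map(|t| (t - mean).powi(2)).sum();
    1.0 - ss_res / ss_tot
}

fn smooth_l1(prediction: f32, target: f32) -> f32 {
    // Quadratic near zero, linear for large errors (threshold assumed to be 1.0).
    let diff = (prediction - target).abs();
    if diff < 1.0 { 0.5 * diff * diff } else { diff - 0.5 }
}

fn main() {
    let y_true = [3.0, -0.5, 2.0, 7.0];
    let y_pred = [2.5, 0.0, 2.0, 8.0];
    println!("mae  = {:.4}", mae(&y_true, &y_pred));
    println!("rmse = {:.4}", rmse(&y_true, &y_pred));
    println!("mape = {:.2}%", mape(&y_true, &y_pred));
    println!("r2   = {:.4}", r2_score(&y_true, &y_pred));
    println!(
        "smooth_l1(0.3) = {:.4}, smooth_l1(3.0) = {:.4}",
        smooth_l1(0.3, 0.0),
        smooth_l1(3.0, 0.0)
    );
}
```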