pub struct MultiHeadAttention<B>
where
    B: Backend,
{ /* private fields */ }
Implementations
impl<B> MultiHeadAttention<B>
where
    B: Backend,
pub fn new(
    d_model: usize,
    num_heads: usize,
    dtype: DType,
    device: &<B as Backend>::Device,
) -> Result<MultiHeadAttention<B>, Error>
Create a new Multi-Head Attention module.
Arguments
d_model: total model dimension (must be divisible by num_heads)
num_heads: number of attention heads
dtype: data type for parameters
device: device to create parameters on
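The divisibility requirement on d_model most likely exists because each head works on an equal slice of d_model / num_heads features. The sketch below shows that check in isolation. `checked_head_dim` is a hypothetical helper, not part of this crate, and it returns a plain `String` error because the crate's `Error` type is not documented here. Whether `new` itself reports this condition through its `Err` variant is an assumption.

```rust
/// Per-head dimension for a given model width, or an error message when
/// `d_model` cannot be split evenly across `num_heads`.
/// Hypothetical helper; the crate's own `Error` type is not used here.
fn checked_head_dim(d_model: usize, num_heads: usize) -> Result<usize, String> {
    // Guard against division by zero before checking divisibility.
    if num_heads == 0 {
        return Err("num_heads must be greater than zero".to_string());
    }
    if d_model % num_heads != 0 {
        return Err(format!(
            "d_model ({d_model}) must be divisible by num_heads ({num_heads})"
        ));
    }
    Ok(d_model / num_heads)
}
```

For example, a width of 512 split across 8 heads gives 64 features per head, while 512 across 7 heads is rejected.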
pub fn with_causal(self, causal: bool) -> MultiHeadAttention<B>
Enable causal (autoregressive) masking.
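In the conventional definition, a causal mask lets position i attend only to positions 0 through i, so no token sees later tokens. The sketch below builds such a mask as a boolean matrix. `causal_mask` is a hypothetical helper for illustration, and how this crate represents or applies its mask internally is not stated in the documentation.

```rust
/// Build a seq_len x seq_len causal mask: entry [i][j] is `true` when
/// query position `i` may attend to key position `j`, that is, when j <= i.
/// Hypothetical helper, not part of the documented API.
fn causal_mask(seq_len: usize) -> Vec<Vec<bool>> {
    (0..seq_len)
        .map(|i| (0..seq_len).map(|j| j <= i).collect())
        .collect()
}
```

The result is lower triangular: the first row allows only the first position, and the last row allows every position.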
pub fn num_heads(&self) -> usize
pub fn d_model(&self) -> usize
pub fn head_dim(&self) -> usize
Trait Implementations
impl<B> Module<B> for MultiHeadAttention<B>
where
    B: Backend,
fn forward(&self, x: &Tensor<B>) -> Result<Tensor<B>, Error>
Forward pass: self-attention on input x.
Input: [batch, seq_len, d_model]
Output: [batch, seq_len, d_model]
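To show why the output shape matches the input shape, here is a minimal sketch of conventional scaled dot-product self-attention for one sequence and one head, written on plain vectors. It is not this crate's implementation: `self_attention` is a hypothetical helper, the learned query, key, value, and output projections are omitted (the input serves as all three), there is no batch dimension, and the feature width is assumed to be non-zero.

```rust
/// Single-head scaled dot-product self-attention over one sequence.
/// `x` has shape [seq_len, d] with d > 0; the result has the same shape.
/// Hypothetical helper: queries, keys, and values are all `x` itself
/// (no learned projections), so this only illustrates the data flow.
fn self_attention(x: &[Vec<f32>]) -> Vec<Vec<f32>> {
    let seq_len = x.len();
    if seq_len == 0 {
        return Vec::new();
    }
    let d = x[0].len();
    let scale = 1.0 / (d as f32).sqrt();
    let mut out = vec![vec![0.0f32; d]; seq_len];

    for i in 0..seq_len {
        // Scaled dot-product score of position i against every position j.
        let scores: Vec<f32> = (0..seq_len)
            .map(|j| x[i].iter().zip(&x[j]).map(|(a, b)| a * b).sum::<f32>() * scale)
            .collect();

        // Numerically stable softmax: subtract the row maximum first.
        let max = scores.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
        let exps: Vec<f32> = scores.iter().map(|s| (s - max).exp()).collect();
        let total: f32 = exps.iter().sum();

        // Output row i is the attention-weighted sum of all input rows.
        for (j, e) in exps.iter().enumerate() {
            let w = e / total;
            for k in 0..d {
                out[i][k] += w * x[j][k];
            }
        }
    }
    out
}
```

Each output row is a weighted average of input rows, so it keeps the input's feature width, and there is one output row per input position.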
fn parameters(&self) -> Vec<Tensor<B>>
Return all trainable parameters of this module.
The optimizer uses these to update weights during training.
fn named_parameters(&self) -> Vec<(String, Tensor<B>)>
Return all trainable parameters with human-readable names.
fn set_training(&self, _training: bool)
Set training or evaluation mode.
fn is_training(&self) -> bool
Whether the module is in training mode (default: true).
fn num_parameters(&self) -> usize
Total number of scalar parameters in this module.
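A scalar count of this kind is usually the sum, over all parameter tensors, of the product of each tensor's dimensions. The sketch below computes that from a list of shapes. `count_scalars` is a hypothetical helper, and the shapes in the usage note are made up for illustration; the actual parameter layout of this module is not documented here.

```rust
/// Count scalar elements across a set of parameter shapes.
/// Hypothetical helper: each inner vector is one tensor's shape.
fn count_scalars(shapes: &[Vec<usize>]) -> usize {
    shapes
        .iter()
        // The element count of one tensor is the product of its dimensions.
        .map(|shape| shape.iter().product::<usize>())
        .sum()
}
```

For instance, one illustrative 4 x 4 weight matrix plus a length-4 bias vector would contribute 16 + 4 = 20 scalars.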
fn trainable_params_count(&self) -> usize
Number of trainable (variable) parameters.
Auto Trait Implementations
impl<B> Freeze for MultiHeadAttention<B>
impl<B> RefUnwindSafe for MultiHeadAttention<B>
impl<B> Send for MultiHeadAttention<B>
impl<B> Sync for MultiHeadAttention<B>
impl<B> Unpin for MultiHeadAttention<B>
impl<B> UnwindSafe for MultiHeadAttention<B>
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.