Search code examples
How does BertForSequenceClassification classify on the CLS vector?...


pythonmachine-learninghuggingface-transformersbert-language-modeltransformer-model

Read More
AttributeError: 'DynamicCache' object has no attribute 'seen_tokens'...


pythonlarge-language-modeltransformer-model

Read More
Choose available GPU devices with device_map...


pythonmachine-learninghuggingface-transformerstransformer-model

Read More
(NVIDIA/nv-embed-v2) ImportError: cannot import name 'MISTRAL_INPUTS_DOCSTRING' from 'tr...


pythonnvidiahuggingface-transformerstransformer-modelword-embedding

Read More
Using positional encoding in pytorch...


pythondeep-learningpytorchtransformer-model

Read More
Logits Don't Change in a Custom Reimplementation of a CLIP model [PyTorch]...


pythondeep-learningpytorchartificial-intelligencetransformer-model

Read More
I keep getting this error, cuda available 'RuntimeError: Expected all tensors to be on the same ...


pythonpytorchreinforcement-learningtransformer-modelrllib

Read More
Is positional encoding necessary for transformer in language modeling?...


transformer-modellanguage-model

Read More
Why does my keras model with multiple inputs accept the shape of the training data for .call() but n...


pythontensorflowkerastransformer-model

Read More
How to download a model from huggingface?...


huggingface-transformerstransformer-model

Read More
How to get cosine similarity of word embedding from BERT model...


pythonbert-language-modelword-embeddingtransformer-model

Read More
How to extract image hidden states in LLaVa's transformers (Huggingface) implementation?...


huggingface-transformerstransformer-modelmultimodal

Read More
ValueError: Exception encountered when calling layer 'tf_bert_model' (type TFBertModel)...


tensorflowtensorflow2.0huggingface-transformersbert-language-modeltransformer-model

Read More
How to correctly apply LayerNorm after MultiheadAttention with different input shapes (batch_first v...


audiodeep-learningpytorchtransformer-modelpattern-recognition

Read More
How to mask inputs with variable size in transformer model when the batches needs to be masked diffe...


pythonnumpytensorflowkerastransformer-model

Read More
Warning: Gradients do not exist for variables...


pythontensorflowkerastransformer-model

Read More
How to apply a pretrained transformer model from huggingface?...


huggingface-transformersnamed-entity-recognitiontransformer-model

Read More
How to reconstruct text entities with Hugging Face's transformers pipelines without IOB tags?...


nlptokenizetransformer-modelnamed-entity-recognitionhuggingface-transformers

Read More
Inference error after training an IP-Adapter plus model...


machine-learningdeep-learningpytorchtransformer-modelstable-diffusion

Read More
cannot import name 'split_torch_state_dict_into_shards' from 'huggingface_hub'...


pythonnlphuggingface-transformerstransformer-modelllama

Read More
Why do Transformers in Natural Language Processing need a stack of encoders?...


machine-learningdeep-learningnlptransformer-model

Read More
Transformers: Cross Attention Tensor Shapes During Inference Mode...


pytorchtransformer-model

Read More
Query padding mask and key padding mask in Transformer encoder...


pythonmachine-learningpytorchtransformer-modelattention-model

Read More
PyTorch Linear operations vary widely after reshaping...


pythondebuggingpytorchtransformer-modelattention-model

Read More
Why doesn't permuting positional encodings in GPT-2 affect the output as expected?...


pytorchhuggingface-transformerstransformer-modelgpt-2

Read More
Does Padding in a Batch of Sequences Affect Performance? How Effective is the Attention Mask?...


pytorchnlphuggingface-transformerstransformer-model

Read More
Why is the timm visual transformer position embedding initializing to zeros?...


pytorchtransformer-modelvision-transformer

Read More
Inference question through LoRA in Whisper model...


transformer-modelopenai-whisper

Read More
How to make huggingface transformer for translation return n translation inferences?...


pythonhuggingface-transformerstransformer-model

Read More
Understanding the results of Transformers Learn In Context with Gradient Descent...


machine-learningnlplarge-language-modeltransformer-modelmeta-learning

Read More
BackNext