The attention mechanism in foundation model architectures allows the model to focus on specific parts of the input data. Which of the following steps are key components of a standard attention mechanism?
------- is a model that uses a convolutional neural network (CNN) to classify texts.
In an image preprocessing experiment, the cv2.imread("lena.png", 1) function provided by OpenCV is used to read images. The parameter "1" in this function represents a --------- -channel image. (Fill in the blank with a number.)
John wants to deploy a large model locally to implement the Q&A assistant function for his company. Which of the following factors is unnecessary for John to consider?
In the field of deep learning, which of the following activation functions has a derivative not greater than 0.5?
Maximum likelihood estimation (MLE) can be used for parameter estimation in a Gaussian mixture model (GMM).
Which of the following ModelArts training parameters is used to customize hyperparameters?
In cases where the bright and dark areas of an image are too extreme, which of the following techniques can be used to improve the image?
-------- is a text representation method based on the bag of words (BoW) model. It decomposes words into subwords and then adds the vector representations of the subwords to obtain word vectors, fully utilizing character N-gram information. (Fill in the blank.)
Mel-frequency cepstral coefficients (MFCCs) take into account human auditory characteristics by first mapping the linear spectrum to the Mel nonlinear spectrum based on auditory perception, and then converting it to the cepstral domain.
The objective of -------- is to extract and classify named entities in a text into pre-defined classes such as names, organizations, locations, time expressions, monetary values, and percentages. (Enter the abbreviation.)