Skip to main content

04.6.11.3 bge-small-en-v1.5 (preview)

Node Description

bge-small-en-v1.5 (preview) - action type node needed to generate numeric vectors based on a given text.

caution

Maximum number of input tokens: 512. Output sizes: 384

note

Model IDbge-small-en-v1.5 (preview). BAAI general embedding (bge) models, i.e., object extraction models convert any text into a compact vector. These numerical representations or numerical vectors retain the basic information from the raw data (text) and can be further utilized in applications.

Embeddings are widely used in modern language models and natural language processing systems. Here are some of the main applications of text embeddings:

  1. Text analysis and classification: representing words and text as vectors allows efficient application of machine learning algorithms for text classification tasks, topic modeling, tone detection, etc.
  1. Information retrieval and recommendation: text embeddings are used to compute semantic proximity between documents, queries and content, which is important for search engines, recommendation services and other applications.
  1. Improving deep learning models: using textual embeddings as inputs to deep learning models can often improve their performance, especially in the presence of limited labeled data.

Node Configuration

To configure the bge-small-en-v1.5 (preview) node, the required Text field is required.

Text

Text field to be filled in. The text can be filled in manually or be the result of previous nodes.

If necessary, you can process several text fragments at once. To do this, add additional fields by clicking the "plus" icon (1). Deleting added fields is also available (2).

Example of launching a node

Let's run the bge-small-en-v1.5 (preview) node to process text and generate numeric vectors.

Example text
Cornish Rex are muscular, slender and lean cats. They are heavier than their appearance suggests. These cats have an elongated wedge-shaped head, a flat skull and large, high-set ears.

The output of the node execution is JSON:

  • with an array of 384 numeric vectors "data";
  • with the action execution status "success": true.
JSON (abridged version)
{
"result": {
"errors": [],
"messages": [],
"result": {
"data": [
[
0.011217856779694557,
-0.0001165232970379293,
0.07868301123380661,
0.04923422634601593,
-0.060261934995651245,
0.02206261269748211,
...
-0.02731413021683693,
-0.009005442261695862,
-0.10363060235977173,
0.036324430257081985,
0.13463462889194489,
0.00127658701967448
]
],
"shape": [
1,
384
]
},
"success": true
}
}