12.02.1 Using AI Speech-to-Text group nodes

note

As an example of using nodes of the AI Speech-To-Text group, let's create a scenario where an audio file will be passed for processing. The outcome of the scenario will be a text generated from the content of the audio file.

Several nodes must be added for the scenario to work successfully:

(1) A Trigger on Webhook node to whose URL is sent a POST request containing a file;

(2) A NVIDIA Canary-1B (Free Preview) node to process the file content. The file content can be an output parameter of the Trigger on Webhook node. To make the node return a response immediately without waiting for the results of file processing, the Async parameter should be checked.

(3) A NVIDIA Canary-1B - Check result (Free Preview) node to get the result of file processing. The ID can be obtained from the NVIDIA Canary-1B (Free Preview) node parameters. File processing takes some time, so the output parameters of the node will not show the result, but the initial status will be displayed;

(4) Wait node, to set the wait time, e.g. 30 seconds;

(5) A Node NVIDIA Canary-1B - Check result (Free Preview) to retrieve the result of file processing. After waiting for a while, the processing status will already be complete and the node will have a response in its output parameters;

(6) A Webhook response node, to send the result of the scenario execution.

The output of the scenario is text: