12.02.1 Using AI Speech-to-Text group nodes
note
Link to the scenario template
As an example of using the AI Speech-to-Text group nodes, let's create a scenario that receives an audio file for processing. The scenario returns the text transcribed from the content of the audio file.
The scenario requires the following nodes:
- (1) A Trigger on Webhook node, whose URL receives a POST request containing a file;
- (2) An NVIDIA Canary-1B (Free Preview) node to process the file content. The file content can be taken from an output parameter of the Trigger on Webhook node. To make the node return a response immediately, without waiting for file processing to finish, enable the Async parameter;
- (3) An NVIDIA Canary-1B - Check result (Free Preview) node to get the result of file processing. The ID can be obtained from the NVIDIA Canary-1B (Free Preview) node parameters. Because file processing takes some time, the output parameters of this node will not contain the result yet and will show only the initial status;
- (4) A Wait node to set the wait time, e.g. 30 seconds;
- (5) A second NVIDIA Canary-1B - Check result (Free Preview) node to retrieve the result of file processing. After the wait, processing should be complete, and the node's output parameters will contain the response;
- (6) A Webhook response node, to send the result of the scenario execution.
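To try the scenario from outside the platform, a client has to send the POST request described in step (1). The following is a minimal Python sketch using only the standard library. It rests on several assumptions not stated above: that the webhook accepts a `multipart/form-data` upload, that the form field is named `file`, and that the audio is a WAV file. The helper names `build_multipart` and `send_audio` are hypothetical, and the webhook URL must be copied from your own Trigger on Webhook node.

```python
import os
import uuid
import urllib.request


def build_multipart(field_name, filename, data, content_type="audio/wav"):
    """Encode one file as a multipart/form-data body.

    Returns a (body, content_type_header) pair.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; '
        f'filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n"
        "\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def send_audio(webhook_url, path, field_name="file"):
    """POST an audio file to the webhook and return the response body as text.

    Assumption: the field name "file" matches what the scenario expects.
    """
    with open(path, "rb") as f:
        audio = f.read()
    body, header = build_multipart(field_name, os.path.basename(path), audio)
    request = urllib.request.Request(
        webhook_url,
        data=body,
        headers={"Content-Type": header},
        method="POST",
    )
    # The scenario pauses in the Wait node before responding,
    # so the timeout is set well above the example 30-second wait.
    with urllib.request.urlopen(request, timeout=120) as response:
        return response.read().decode("utf-8")
```

A call such as `send_audio("<your webhook URL>", "sample.wav")` would then return whatever the Webhook response node sends back. Because the scenario waits before the second status check, the request is expected to stay open for at least the configured wait time.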
The output of the scenario is text: