NVIDIA Canary-1B
An AI model for speech recognizing.
Node Description
NVIDIA Canary-1B (Free Preview) an action-type node required to generate text based on the content of an mp3, .m3a audio file.
- transcribing interviews, video tutorials, lectures, etc.
- converting speech from video to text subtitles. This is useful for those who prefer to read text instead of watching a video or those who do not speak the original language of the video.
- recording conversations with customers in business and telecommunications, to provide better customer service and increase efficiency.
Node Configuration
To configure the NVIDIA Canary-1B (Free Preview) node, you must complete the required and optional fields. Required fields include:
- File;
- Source language;
- Destination language.
File
Field for specifying the content of an mp3, .m3a audio file. The file can be passed to the scenario using the Trigger on Webhook node. The content in this case will be specified in the File field in the format file.[0].content.
Source language
A drop-down list with a list of languages. In the field you should select the language in which the audio file is recorded. Available languages:
- English;
- German;
- French;
- Spanish.
Destination language
A drop-down list with a list of languages. In the field you should select the language to be used when translating the audio file. Available languages:
- English;
- German;
- French;
- Spanish.
Keep Punctuation
Switch to save punctuation.
Async
A switch to define the node's algorithm.
If enabled, the response will be returned by the node immediately, even if the information has not yet been processed. The node output will include the status Initial or Started.
If the parameter is off, the response will only be returned when the information has been processed. The output of the node will contain the result and the status Completed.
NVIDIA Canary-1B - Check result
Node Description
NVIDIA Canary-1B - Check result (Free Preview) an action-type node required to retrieve text generated from the content of an mp3, .m3a audio file. The result is identified by an identifier that is present among other output from the NVIDIA Canary-1B (Free Preview) node. The NVIDIA Canary-1B - Check result (Free Preview) node can be useful if the NVIDIA Canary-1B (Free Preview) node executes immediately without waiting for the translation to be generated, i.e. the Async option is checked.
- transcribing interviews, video tutorials, lectures, etc.
- converting speech from video to text subtitles. This is useful for those who prefer to read text instead of watching a video or those who do not speak the original language of the video.
- recording conversations with customers in business and telecommunications, to provide better customer service and increase efficiency.
Node Configuration
To configure the NVIDIA Canary-1B - Check result (Free Preview) node, the required Task Id (UUID) field must be filled in.
Task Id (UUID)
Field for entering the ID of the audio file translation task. The identifier can be entered manually or can be a parameter of the NVIDIA Canary-1B (Free Preview) node.