app icon
DupDub
0.0.3

DupDub

dupdub-dify/dupdub0 installs

dupdub

Author: dupdub
Version: 0.0.1
Type: extension

Description

dupdub is a powerful AI-powered platform that provides a wide range of services, including speech recognition, voice cloning, dubbing, and more. With its advanced AI algorithms and cutting-edge technology, dupdub offers a comprehensive suite of tools for enhancing the quality of audio content and improving user experiences.

Features

Transcribe speech

This feature provides an AI transcription service that converts audio or video content into text. By using DupDub's ASR (Automatic Speech Recognition) API, users can upload audio or video files and get transcriptions with each segment's start and end timestamps. The transcriptions can be used for applications like subtitle generation, content analysis, and speech-to-text conversion.

Response

namemessage
urlURL of the audio file processed
languageLanguage of the transcribed content
durationDuration of the audio in seconds
textTranscription result text
segmentsDetailed segment information within the transcription
idParagraph ID within the transcription
startStart time of the segment
endEnd time of the segment
textTranscribed text of the segment

Voice Cloning

This feature allows users to clone the voice of a specific speaker by providing a speech sample. The cloned voice can then be used for text-to-speech (TTS) or other voice applications, generating synthetic speech that closely resembles the original speaker's voice.

Response

namemessage
idUnique identifier of the cloned speaker
speakerSpeaker identifier for the cloned speaker.
nameSpeaker's name

Dubbing

This feature allows users to synthesize speech by providing a cloned speaker's identity, speech speed, pitch, and the text to be spoken. By sending these parameters via an API request, users can generate personalized speech content that mimics the voice of the selected speaker. This can be used for applications like voice assistants, voice-over services, and other text-to-speech (TTS) scenarios.

Response

namemessage
successIndicates if the dubbing operation was successful
codeStatus code related to the dubbing operation
messageDetailed message or information about the operation
lengthLength of the request in characters
resultContainer for detailed results of the operation
duration_addressURL or path to the audio duration information
lengthOfTimeDuration of the generated audio in seconds
ossFileURL or path to the audio file
sizeSize of the generated audio file (bytes)
srt_addressURL or path to the subtitle (SRT) file, if generated
codeResponse code indicating success or failure
200: Succeed
400: Parameter validation failed
403: No relevant permissions
500: Failed
502: Gateway service exception
2001: Login failed, please login again
messageA message providing additional information about the response code

Get speaker ID

This feature allows users to query a list of voiceover actors based on specified criteria such as language, accent, and domain. By using the searchSpeakerList API, users can filter voice actors according to their language preference (e.g., English), accent (e.g., American), and domain (e.g., commercial or educational). This is useful for selecting the most appropriate voice talent for specific projects

Response

namemessage
speakerIdUnique identifier of the cloned speaker
speakerSpeaker identifier for the cloned speaker.
nameSpeaker's name
CATEGORY
Tool
VERSION
0.0.3
dupdub-dify·06/30/2025 03:29 AM
REQUIREMENTS
Maximum memory
256MB
DupDub - Dify Marketplace