dupdub

Author: dupdub
Version: 0.0.1
Type: extension

Description

dupdub is a powerful AI-powered platform that provides a wide range of services, including speech recognition, voice cloning, dubbing, and more. With its advanced AI algorithms and cutting-edge technology, dupdub offers a comprehensive suite of tools for enhancing the quality of audio content and improving user experiences.

Features

Transcribe speech

This feature provides an AI transcription service that converts audio or video content into text. By using DupDub's ASR (Automatic Speech Recognition) API, users can upload audio or video files and get transcriptions with each segment's start and end timestamps. The transcriptions can be used for applications like subtitle generation, content analysis, and speech-to-text conversion.

Response

name	message
url	URL of the audio file processed
language	Language of the transcribed content
duration	Duration of the audio in seconds
text	Transcription result text
segments	Detailed segment information within the transcription
id	Paragraph ID within the transcription
start	Start time of the segment
end	End time of the segment
text	Transcribed text of the segment

Voice Cloning

This feature allows users to clone the voice of a specific speaker by providing a speech sample. The cloned voice can then be used for text-to-speech (TTS) or other voice applications, generating synthetic speech that closely resembles the original speaker's voice.

Response

name	message
id	Unique identifier of the cloned speaker
speaker	Speaker identifier for the cloned speaker.
name	Speaker's name

Dubbing

This feature allows users to synthesize speech by providing a cloned speaker's identity, speech speed, pitch, and the text to be spoken. By sending these parameters via an API request, users can generate personalized speech content that mimics the voice of the selected speaker. This can be used for applications like voice assistants, voice-over services, and other text-to-speech (TTS) scenarios.

Response

name	message
success	Indicates if the dubbing operation was successful
code	Status code related to the dubbing operation
message	Detailed message or information about the operation
length	Length of the request in characters
result	Container for detailed results of the operation
duration_address	URL or path to the audio duration information
lengthOfTime	Duration of the generated audio in seconds
ossFile	URL or path to the audio file
size	Size of the generated audio file (bytes)
srt_address	URL or path to the subtitle (SRT) file, if generated
code	Response code indicating success or failure
200: Succeed
400: Parameter validation failed
403: No relevant permissions
500: Failed
502: Gateway service exception
2001: Login failed, please login again
message	A message providing additional information about the response code

Get speaker ID

This feature allows users to query a list of voiceover actors based on specified criteria such as language, accent, and domain. By using the searchSpeakerList API, users can filter voice actors according to their language preference (e.g., English), accent (e.g., American), and domain (e.g., commercial or educational). This is useful for selecting the most appropriate voice talent for specific projects

Response

name	message
speakerId	Unique identifier of the cloned speaker
speaker	Speaker identifier for the cloned speaker.
name	Speaker's name