Azure AI Speech

1 answer

Sample Data for different styles of Custom Neural Voices (happy, excited, sad).

I could find individual utterances for neutral speech, questions, and exclamations here: https://github.com/Azure-Samples/Cognitive-Speech-TTS/blob/master/CustomVoice/Sample%20Data/Individual%20utterances%20%2B%20matching%20script/SampleScript.txt To…

asked

PAVAGEAU Perrine 20

answered

dupammi 7,400 Microsoft Vendor

0 answers

Training with mixed language in custom STT (English & Korean)

Hi, I am working on training a Korean custom STT model, but the training data has a few English words mixed in. Some of them are processed and accepted as training data, but others are rejected, such as winder, insulator, gripper, rewinding. Below…

asked

VPA 21

commented

Elias Salazar Zeledon (Manpower Costa Rica S A) 0 Microsoft Vendor

0 answers

How to estimate the time needed to train a custom STT model?

Hey! I'm thinking about fine-tuning an STT model with Audio + human-labeled transcript data in Speech Studio. However, as I read through the docs, I can see that "If you switch to a base model that supports customization with audio data, the training…

asked

Bruno Goncalves Vaz (P) 0

0 answers

How can I make Microsoft consider adding Faroese language to Speech Services

I need text-to-speech services for Faroese in Speech Services. How would I go about getting Microsoft to consider this request? Is there any way for me myself to train a custom voice, for a language that doesn't yet exist in Microsoft's repository of…

asked

68046286 0

commented

dupammi 7,400 Microsoft Vendor

1 answer

Can I re-train an already deployed custom voice model with newly added data without undergoing the entire training time again (approximately 24 hours)?

Here’s the context: We set up a voice talent, added training data, trained the model, and deployed it. We've now updated the dataset with more audios and transcripts, increasing the number of utterances from 1300 to 1500. When I try to train this voice…

asked

PAVAGEAU Perrine 20

accepted

PAVAGEAU Perrine 20

0 answers

Speech recognition service is not working correctly

Hi, I'm using your speech service to recognize phrases spoken by a user in real time and evaluate their pronunciation. However, I am facing the following issues: if I pass the reference text and set EnableMiscue = true, then all the wrong words the user…

asked

Miroslav 0

edited a comment

navba-MSFT 17,900 Microsoft Employee

1 answer

Why is the Isabella Multilingual voice available only in Clipchamp?

Hello, I noticed that the Isabella Multilingual voice for Thai Text to Speech is available in Clipchamp but not in Audio Content Creation. I'm interested in using this voice for my projects. I was wondering if there are any specific reasons why this…

asked

i'm MariOhn 61

accepted

i'm MariOhn 61

1 answer

How to output transcription on a word-level

With the provided callback function, the text is outputted as described by you, either after a short pause or after a maximum of 15 seconds. Is it possible to output word by word so that the text can be seen while speaking? def…

asked

Sophie 0

commented

Gowtham CP 1,970

0 answers

400 Bad request using whisper with AzureCliCredentials

I'm trying to use Whisper with AzureCliCredential and I always get the following error: { code: 'Request is badly formated', message: 'Resource Id is badly formed: NA' } My very simple code is: import * as fs from "fs"; import {…

asked

Julien C 0

commented

navba-MSFT 17,900 Microsoft Employee

1 answer

Azure TTS batch synthesis activity logs

Hi there, we're using Azure speech synthesis (batch, since we have content over 10 minutes long). In the Azure Portal, I can see metrics for my speech resource, but I can't see any records of past jobs. Is there any way to see these? Thanks, Tim

asked

Tim Schmidt 0

commented

navba-MSFT 17,900 Microsoft Employee

0 answers

Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS

Subject: Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS Description: The Azure Neural TTS system is mispronouncing the Welsh contraction "i’w." Instead of producing the correct pronunciation…

asked

Verbari LLC 0

commented

navba-MSFT 17,900 Microsoft Employee

1 answer

Cannot find the Custom Avatar option described in: "To create a custom avatar endpoint, follow these steps: Sign in to Speech Studio. Navigate to Custom Avatar > Your project name > Train model."

I cannot find the custom avatar key after signing in to Speech Studio.

asked

Praveen Jaganivasan 0

commented

santoshkc 5,165 Microsoft Vendor

0 answers

How do you do pronunciation

Recently I had a script for a programming video and needed the word GUID, pronounced "goo id". I tried typing it many different ways, and the only way I could get the word GUID was to type "goo hid" and then use an audio editor to remove the H sound. Azure Speech…

asked

Data Juggler 181

commented

navba-MSFT 17,900 Microsoft Employee
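
For the GUID pronunciation question above, one option worth trying is the SSML sub element, which keeps the written word in the script and supplies a spoken alias. The sketch below only builds the SSML string. The helper names apply_aliases and build_ssml are hypothetical, the voice name is an assumed default, and whether the alias "goo id" actually sounds right depends on the voice, so treat it as a starting point rather than a confirmed fix.

```python
import re
from xml.sax.saxutils import escape, quoteattr


def apply_aliases(text, aliases):
    """Wrap each whole-word match in <sub alias="...">, XML-escaping everything else."""
    if not aliases:
        return escape(text)
    # Longest words first so a longer entry wins over a shorter one it contains.
    words = sorted(aliases, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(w) for w in words) + r")\b")
    parts, last = [], 0
    for match in pattern.finditer(text):
        word = match.group(1)
        parts.append(escape(text[last:match.start()]))
        parts.append("<sub alias=%s>%s</sub>" % (quoteattr(aliases[word]), escape(word)))
        last = match.end()
    parts.append(escape(text[last:]))
    return "".join(parts)


def build_ssml(text, aliases, voice="en-US-JennyNeural", lang="en-US"):
    """Return a complete SSML document; voice and lang are assumed defaults."""
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        "xml:lang=%s><voice name=%s>%s</voice></speak>"
        % (quoteattr(lang), quoteattr(voice), apply_aliases(text, aliases))
    )


if __name__ == "__main__":
    print(build_ssml("Create a GUID & store it", {"GUID": "goo id"}))
```

The resulting string can be pasted into any tool that accepts SSML input. If a plain alias is not precise enough, the SSML phoneme element is the usual next thing to try.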

1 answer

Inquiry Regarding Azure AI Speech Error

Dear Azure Support Team, I recently encountered an issue while using Azure AI Speech service with recordings from the VoiceMemo app on iPhone. Specifically, when attempting to process recordings of approximately 30 minutes in length, I received the…

asked

y.ashibe 25

edited a comment

navba-MSFT 17,900 Microsoft Employee

0 answers

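TTS Traditional Chinese (Mandarin) pronunciation errors

The pronunciation of 「重考」 should be ㄔㄨㄥˊ ㄎㄠˇ, and the pronunciation of 「假期」 should be ㄐㄧㄚˋ ㄑㄧˊ. TTS is a paid service, so please fix this as soon as possible. Thank you.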

asked

疼目職人 0

commented

YutongTie-MSFT 46,996

2 answers

Speech Studio Audio Content Creation (x) Content Format and Audio Export Fail

I discovered https://speech.microsoft.com/portal, audio creation tile. (I think it should be the first one and described as "interactive batch TTS web interface.") I uploaded a file named test.txt, which has two paragraphs. For decades now,…

asked

ivo welch 40

commented

dupammi 7,400 Microsoft Vendor

0 answers

How to transcribe foreign names and words within English sentences

I use Azure Speech to transcribe audio files in English through a Java application. There are, however, some recurring foreign words and names (Arabic) used in the middle of the English sentences, and these are not properly transcribed. What is the…

asked

Hashim Khan 0

commented

VasaviLankipalle-MSFT 15,006

1 answer

Markdown to SSML ?

Does anyone know of a basic "preparer-converter" that takes a markdown (.md) file and converts it into an SSML file?

asked

ivo welch 40

commented

dupammi 7,400 Microsoft Vendor
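
For the Markdown to SSML question above, a very small "preparer-converter" can be sketched in a few lines. This is a minimal sketch, not a full Markdown parser: it assumes headings, blank-line-separated paragraphs, simple list items, links, and asterisk or backtick markup only. The function names and the default voice are hypothetical choices, and the 750ms pause after headings is an arbitrary value.

```python
import re
from xml.sax.saxutils import escape, quoteattr


def strip_inline(text):
    """Drop inline Markdown markup and keep only the text that should be spoken."""
    # Links and images: keep the visible text, drop the URL.
    text = re.sub(r"!?\[([^\]]*)\]\([^)]*\)", r"\1", text)
    # Asterisk emphasis and backticks. Underscores are left alone so that
    # identifiers such as snake_case survive.
    text = re.sub(r"[*`]+", "", text)
    return text.strip()


def markdown_to_ssml(markdown, voice="en-US-JennyNeural", lang="en-US"):
    """Convert a small Markdown subset into an SSML document string."""
    parts, paragraph = [], []

    def flush():
        # Close the paragraph collected so far, if any.
        if paragraph:
            parts.append("<p>" + escape(" ".join(paragraph)) + "</p>")
            paragraph.clear()

    for raw in markdown.splitlines():
        line = raw.strip()
        heading = re.match(r"#{1,6}\s+(.*)", line)
        if not line:
            flush()
        elif heading:
            flush()
            title = escape(strip_inline(heading.group(1)))
            parts.append("<p>" + title + '</p><break time="750ms"/>')
        else:
            # Remove a leading list marker such as "- ", "* ", or "1. ".
            line = re.sub(r"^(?:[-*+]|\d+\.)\s+", "", line)
            paragraph.append(strip_inline(line))
    flush()

    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        "xml:lang=%s><voice name=%s>%s</voice></speak>"
        % (quoteattr(lang), quoteattr(voice), "".join(parts))
    )


if __name__ == "__main__":
    print(markdown_to_ssml("# Title\n\nHello **world**.\n"))
```

Tables, code blocks, and nested lists are not handled here. For real documents, a proper Markdown parser feeding the same SSML templates would be the more robust design.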

0 answers

import user_config_helper

I am not able to find the library that provides user_config_helper for the statement: import user_config_helper

asked

AT13519148 0

commented

dupammi 7,400 Microsoft Vendor

0 answers

Regarding usage cost calculation using Azure Retail Price API

We are using an Azure subscription on the Standard tier. We have a requirement to calculate the monthly usage cost in JPY (Japanese Yen) of the Azure Speech to Text service and Azure Blob Storage in our application. We analyzed the Azure Retail Price API…

asked

Test Admin 171

edited a comment

Test Admin 171
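
For the cost calculation question above, the general shape of a Retail Prices API lookup in JPY can be sketched as follows. This is a sketch under stated assumptions: the serviceName value passed in is a placeholder that should be checked against what the API actually returns for the Speech resource, the meter name in the usage example is hypothetical, and the estimate ignores tiered pricing, free quotas, and regional differences.

```python
import json
import urllib.request
from urllib.parse import quote, urlencode

ENDPOINT = "https://prices.azure.com/api/retail/prices"


def build_price_url(service_name, currency="JPY"):
    """Build a query URL; the API expects the currency code in single quotes."""
    query = urlencode(
        {
            "currencyCode": "'%s'" % currency,
            "$filter": "serviceName eq '%s'" % service_name,
        },
        quote_via=quote,
    )
    return ENDPOINT + "?" + query


def fetch_prices(url):
    """Follow NextPageLink until every page of price items has been collected."""
    items = []
    while url:
        with urllib.request.urlopen(url) as response:
            page = json.load(response)
        items.extend(page.get("Items", []))
        url = page.get("NextPageLink")
    return items


def estimate_cost(items, usage_by_meter):
    """Multiply usage by unit price per meter.

    Assumes items were already narrowed down to one price per meterName
    (one region, one SKU); otherwise later entries overwrite earlier ones.
    """
    prices = {item["meterName"]: item["retailPrice"] for item in items}
    return sum(prices[meter] * quantity for meter, quantity in usage_by_meter.items())


if __name__ == "__main__":
    # Hypothetical price item; real items come from fetch_prices(build_price_url(...)).
    sample = [{"meterName": "hypothetical-meter", "retailPrice": 150.0}]
    print(estimate_cost(sample, {"hypothetical-meter": 2}))
```

Usage quantities have to come from elsewhere, for example the resource's own metrics or billing export, because the Retail Prices API only returns unit prices.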

1,448 questions with Azure AI Speech tags
