Add Text-to-Speech to your website using Azure Cognitive Services and Azure Functions to authorize requests.

This project uses Azure Cognitive Services speech synthesis and Azure Functions to provide text-to-speech in the browser.

This project consists of two parts: an HTML file that demonstrates usage in the browser with JavaScript, and a Node.js Azure Function that authorizes the HTML page to use Azure Cognitive Services.

The HTML file is based on the sample in the Azure-Samples repository, with features removed for readability. The full sample, including an Express backend server, can be found here: cognitive-services-speech-sdk/synthesis.html at master · Azure-Samples/cognitive-services-speech-sdk (github.com)

How it works

Azure Cognitive Services can be accessed using a subscription key. However, this is not recommended when calling Cognitive Services from the browser, because it would expose the key to end users.

Instead, we can use Azure Functions to create an authorization token that can be used by the browser JavaScript to make calls directly to Cognitive Services.

When it receives an HTTP request, the Azure Function calls Cognitive Services with the subscription key to retrieve an authorization token. The Azure Function then responds to the HTTP request by returning the authorization token.
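
The flow described above can be sketched as a Node.js HTTP-triggered function. This is a minimal sketch, not the repository's actual code. It assumes the speechRegion and speechKey settings are available as environment variables, that the global fetch of Node.js 18 or later is present, and that the function replies with a JSON body of the form { token, region }. The function names and the response shape are illustrative assumptions.

```javascript
// Sketch of an HTTP-triggered Azure Function that exchanges the
// subscription key for a short-lived authorization token.
// Assumptions: setting names speechRegion/speechKey, Node.js 18+ (global fetch),
// and a { token, region } response body chosen for illustration.

// Build the request for the Speech service token endpoint.
function buildTokenRequest(region, key) {
  return {
    url: `https://${region}.api.cognitive.microsoft.com/sts/v1.0/issueToken`,
    options: {
      method: "POST",
      headers: { "Ocp-Apim-Subscription-Key": key },
    },
  };
}

// Call the token endpoint; the service answers with the token as plain text.
async function issueToken(region, key, fetchImpl = fetch) {
  const { url, options } = buildTokenRequest(region, key);
  const response = await fetchImpl(url, options);
  if (!response.ok) {
    throw new Error(`Token request failed with status ${response.status}`);
  }
  return response.text();
}

// Function entry point: never sends the subscription key to the caller.
async function handler(context, req, fetchImpl = fetch) {
  const region = process.env.speechRegion;
  const key = process.env.speechKey;
  try {
    const token = await issueToken(region, key, fetchImpl);
    context.res = { status: 200, body: { token, region } };
  } catch (err) {
    context.res = { status: 500, body: { error: err.message } };
  }
}

// In the function's index.js this would be exported, for example:
// module.exports = handler;
```

Passing fetchImpl as a parameter is only a convenience so the handler can be exercised without a network connection.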

Steps 1-4 are for setup and run only once, usually on page load.

1. On page load, the browser requests an authorization token from the Azure Function
2. The Azure Function requests an authorization token from Cognitive Services using its subscription key
3. Cognitive Services returns an authorization token
4. The Azure Function returns the authorization token to the browser
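
From the browser's side, the setup steps above might look like the following. This is a hedged sketch: the URL is the local address mentioned in this README, while the helper name fetchAuthorizationToken and the { token, region } JSON response shape are assumptions, so adjust them to whatever your function actually returns.

```javascript
// Sketch of the setup steps from the browser's point of view.
// Assumption: the Azure Function responds with JSON shaped like { token, region }.
const TOKEN_URL = "http://localhost:7071/api/TTSAuthorizationToken";

async function fetchAuthorizationToken(url = TOKEN_URL, fetchImpl = fetch) {
  const response = await fetchImpl(url);
  if (!response.ok) {
    throw new Error(`Token request failed with status ${response.status}`);
  }
  const { token, region } = await response.json();
  if (!token || !region) {
    throw new Error("Response did not contain a token and region");
  }
  return { token, region };
}

// Request the token once when the page has loaded (skipped outside a browser).
if (typeof window !== "undefined") {
  window.addEventListener("load", async () => {
    try {
      window.speechAuthorization = await fetchAuthorizationToken();
    } catch (err) {
      console.error("Could not get an authorization token:", err.message);
    }
  });
}
```

Storing the result on window.speechAuthorization is only one option; any variable reachable from the play button handler works.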

Steps 5-8 are repeated every time the user presses the play button.

5. The user presses the play button
6. The browser requests speech from Cognitive Services, passing the authorization token and speech options as request parameters
7. Cognitive Services returns the speech audio
8. The browser plays the audio
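
The playback steps above can be sketched with the Speech SDK for JavaScript. This is an illustration under assumptions, not the code in synthesis.html: it assumes the SDK browser bundle is loaded and exposes a global SpeechSDK, that token and region were obtained during setup, and the helper names, the element id and the voice name are examples only.

```javascript
// Sketch of the playback steps using the Speech SDK for JavaScript.
// The SDK object is passed in as `sdk`; in the browser this would be the
// global SpeechSDK (an assumption about how the page loads the SDK).

// Configure a synthesizer that authenticates with the token, not the key.
function createSynthesizer(sdk, token, region, voiceName) {
  const speechConfig = sdk.SpeechConfig.fromAuthorizationToken(token, region);
  speechConfig.speechSynthesisVoiceName = voiceName;
  const audioConfig = sdk.AudioConfig.fromDefaultSpeakerOutput();
  return new sdk.SpeechSynthesizer(speechConfig, audioConfig);
}

// Wrap the callback-based speakTextAsync in a Promise and release the
// synthesizer once the request has finished or failed.
function speak(synthesizer, text) {
  return new Promise((resolve, reject) => {
    synthesizer.speakTextAsync(
      text,
      (result) => {
        synthesizer.close();
        resolve(result);
      },
      (error) => {
        synthesizer.close();
        reject(new Error(String(error)));
      }
    );
  });
}

// Example wiring (hypothetical element id "playButton"):
// document.getElementById("playButton").addEventListener("click", async () => {
//   const synthesizer = createSynthesizer(SpeechSDK, token, region, "en-US-JennyNeural");
//   await speak(synthesizer, "Hello from the browser");
// });
```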

Prerequisites

Required

Get started

In /Azure Function/local.settings.json, add your Speech region and Speech key under Values (the lines marked with +):

{
  "IsEncrypted": false,
  "Values": {
    "AzureWebJobsStorage": "",
    "FUNCTIONS_WORKER_RUNTIME": "node",
+   "speechRegion": "xxxxxx",
+   "speechKey": "xxxxxxxxxxxxxxxxxxxxxxxxxxxx"
  },
  "host":{
    "CORS": "*"
  }
}

From your command line, navigate to the /Azure Function/ folder and run npm start.

You should see the Azure Functions Core Tools start, and the function should be running at http://localhost:7071/api/TTSAuthorizationToken. If your address is different, you may need to adjust the HTML so that it sends requests to the correct URL.
Get the full path of /HTML/synthesis.html and open it in the browser.
Press the blue Update Voice List button to get the list of available voices.
Done! You can add text and start synthesis when you are ready.
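
When running locally, Azure Functions exposes the entries under Values in local.settings.json to the function as environment variables. A minimal sketch of how the function might read and validate the two Speech settings follows; the helper name readSpeechSettings is hypothetical and not part of this repository.

```javascript
// Hypothetical helper: reads the Speech settings from the environment and
// fails early with a clear message if one of them is missing or empty.
function readSpeechSettings(env = process.env) {
  const region = env.speechRegion;
  const key = env.speechKey;
  const missing = [];
  if (!region) missing.push("speechRegion");
  if (!key) missing.push("speechKey");
  if (missing.length > 0) {
    throw new Error(`Missing settings in local.settings.json: ${missing.join(", ")}`);
  }
  return { region, key };
}
```

Failing early like this makes a missing key or region show up as an explicit error rather than as silent speech failures in the browser.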

Common issues

If speech is not working, verify that you have added your key and region to local.settings.json.
If speech is not working after clicking start synthesis, ensure that the Update Voice List button was clicked and that a voice is selected.
Ensure that CORS is enabled in local.settings.json so that Azure Functions accepts requests from a different origin.

Screenshots

Resources

Microsoft Cognitive Services Speech Service and SDK Documentation

arnaudroy97/Azure-Cognitive-Services-Speech-Azure-Functions-demo
