Polly

A service that generates text-to-speech (TTS) encodings using AWS Polly and serves them via a RESTful API.

Installation

Requires Go to build. Install the dependencies and build the binary:

$ go get "github.com/aws/aws-sdk-go/aws" \
    "github.com/aws/aws-sdk-go/aws/session" \
    "github.com/aws/aws-sdk-go/service/polly" \
    "github.com/spf13/cobra" \
    "github.com/fsnotify/fsnotify" \
    "github.com/spf13/viper"
$ go build

Requires AWS authentication details in the .aws folder of the $HOME directory (for more information, see the AWS documentation on credentials):

.
└──.aws
   ├── credentials 
   └── config

Sample Credentials

[default]
aws_access_key_id=AKIAIOSFODNN7EXAMPL
aws_secret_access_key=wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY

Sample Config

[default]
region=us-west-2
output=json

Requires full access to an AWS S3 bucket. The bucket policy must allow public GET and HEAD requests.

Sample Bucket Policy

{
    "Version": "2012-10-17",
    "Id": "Policy1532096226680",
    "Statement": [
        {
            "Sid": "Stmt1532096224403",
            "Effect": "Allow",
            "Principal": "*",
            "Action": "s3:GetObject",
            "Resource": "arn:aws:s3:::uk.ac.ncl.sot/*"
        }
    ]
}
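
To adapt the sample policy to a different bucket, the same document can be generated programmatically. The sketch below is an illustration only: the type and function names are hypothetical, and it emits just the fields shown above (without the Id and Sid values). In S3, the s3:GetObject permission covers both GET and HEAD requests on objects, which is why a single action is enough.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// statement and policy mirror the fields of the sample bucket policy.
type statement struct {
	Effect    string `json:"Effect"`
	Principal string `json:"Principal"`
	Action    string `json:"Action"`
	Resource  string `json:"Resource"`
}

type policy struct {
	Version   string      `json:"Version"`
	Statement []statement `json:"Statement"`
}

// publicReadPolicy builds a policy that lets anyone read every object
// in the named bucket. Hypothetical helper, not part of the service.
func publicReadPolicy(bucket string) policy {
	return policy{
		Version: "2012-10-17",
		Statement: []statement{{
			Effect:    "Allow",
			Principal: "*",
			Action:    "s3:GetObject",
			Resource:  "arn:aws:s3:::" + bucket + "/*",
		}},
	}
}

func main() {
	out, err := json.MarshalIndent(publicReadPolicy("uk.ac.ncl.sot"), "", "    ")
	if err != nil {
		fmt.Println("cannot encode policy:", err)
		return
	}
	fmt.Println(string(out))
}
```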

Use the provided configuration to set the S3 bucket name, SNS logging (optional), and HTTP server settings:

{
    "webserver": {
        "addr": "0.0.0.0:8080",
        "clientAddr": "http://localhost:4200",
        "timeout": {
            "write": 15,
            "read": 15,
            "idle": 60,
            "cancel": 60
        }
    },
    "s3": {
        "bucketName": "uk.ac.ncl.sot",
        "outputFormat": "mp3",
        "maxRetryCount": 10
    },
    "sns": {
        "pollyTopicName": "arn:aws:sns:us-east-1:438791141487:Polly"
    }
}

Usage

$ ./polly webserver

Initialises the webserver with the following routes:

GET /languages

Query all supported languages

Response:

[
    {
        "Name": "[name]",
        "Code": "[code]"
    },
    ...
]
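
A client might consume this route as sketched below. The sketch assumes the response body is a JSON array of objects with Name and Code fields; the exact envelope is an assumption based on the schematic response above, and the function names, the base URL and the sample data are illustrative only.

```go
package main

import (
	"encoding/json"
	"fmt"
	"io"
	"net/http"
)

// Language is one entry of the /languages response.
type Language struct {
	Name string `json:"Name"`
	Code string `json:"Code"`
}

// decodeLanguages parses a response body, assumed to be a JSON array.
func decodeLanguages(body []byte) ([]Language, error) {
	var langs []Language
	if err := json.Unmarshal(body, &langs); err != nil {
		return nil, err
	}
	return langs, nil
}

// fetchLanguages queries GET /languages on a running server.
func fetchLanguages(baseURL string) ([]Language, error) {
	resp, err := http.Get(baseURL + "/languages")
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()
	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return nil, err
	}
	return decodeLanguages(body)
}

func main() {
	// Illustrative sample data; against a live server, call
	// fetchLanguages("http://localhost:8080") instead.
	sample := []byte(`[{"Name": "Welsh", "Code": "cy-GB"}]`)
	langs, err := decodeLanguages(sample)
	if err != nil {
		fmt.Println("cannot decode languages:", err)
		return
	}
	for _, l := range langs {
		fmt.Println(l.Code, l.Name)
	}
}
```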

GET /voices/{languageCode}

Query a specific language code to return all available voices for that language.

Response:

{
    "voices": {
        "Gender": "[gender]",
        "Id": "[id]",
        "LanguageCode": "[languageCode]",
        "LanguageName": "[languageName]",
        "Name": "[name]"
    },
    ...
}
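
The language code is part of the URL path, so a client should escape it before building the request. The sketch below shows one way to do that and a typed view of a single voice entry with the fields listed above. The helper names and the sample values are hypothetical, and the sketch deliberately decodes only one entry because the outer shape of the response is shown schematically here.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/url"
)

// Voice holds the fields of one voice entry in the response.
type Voice struct {
	Gender       string `json:"Gender"`
	ID           string `json:"Id"`
	LanguageCode string `json:"LanguageCode"`
	LanguageName string `json:"LanguageName"`
	Name         string `json:"Name"`
}

// voicesURL builds the request URL for a language code, escaping the
// code so that it is safe to use as a single path segment.
func voicesURL(baseURL, languageCode string) string {
	return baseURL + "/voices/" + url.PathEscape(languageCode)
}

// decodeVoice parses a single voice entry.
func decodeVoice(data []byte) (Voice, error) {
	var v Voice
	err := json.Unmarshal(data, &v)
	return v, err
}

func main() {
	fmt.Println(voicesURL("http://localhost:8080", "en-GB"))

	// Illustrative entry, not captured from a live server.
	entry := []byte(`{"Gender": "Female", "Id": "Amy", "LanguageCode": "en-GB",
		"LanguageName": "British English", "Name": "Amy"}`)
	v, err := decodeVoice(entry)
	if err != nil {
		fmt.Println("cannot decode voice:", err)
		return
	}
	fmt.Println(v.ID, v.Gender, v.LanguageName)
}
```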

GET /demo/{voiceID}

Retrieve a short demo .mp3 of the queried voice.

Response:

Content-Type: audio/mpeg

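Because the demo route returns audio rather than JSON, a client can check the Content-Type header before writing the body to disk. The following is a minimal sketch under stated assumptions: the server is reachable at http://localhost:8080, "Joanna" is used as an example voice ID, and the function names and the output file name are hypothetical.

```go
package main

import (
	"fmt"
	"io"
	"mime"
	"net/http"
	"net/url"
	"os"
)

// isMPEG reports whether a Content-Type header value denotes
// audio/mpeg, ignoring any parameters after the media type.
func isMPEG(contentType string) bool {
	mediaType, _, err := mime.ParseMediaType(contentType)
	return err == nil && mediaType == "audio/mpeg"
}

// saveDemo requests GET /demo/{voiceID} and stores the audio at path.
func saveDemo(baseURL, voiceID, path string) error {
	resp, err := http.Get(baseURL + "/demo/" + url.PathEscape(voiceID))
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("unexpected status: %s", resp.Status)
	}
	if ct := resp.Header.Get("Content-Type"); !isMPEG(ct) {
		return fmt.Errorf("unexpected Content-Type: %q", ct)
	}
	f, err := os.Create(path)
	if err != nil {
		return err
	}
	defer f.Close()
	_, err = io.Copy(f, resp.Body)
	return err
}

func main() {
	// Assumed address and example voice ID; adjust for your setup.
	if err := saveDemo("http://localhost:8080", "Joanna", "demo.mp3"); err != nil {
		fmt.Println("demo request failed:", err)
	}
}
```
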
POST /generate/?voice={voiceID}

Generate a text-to-speech (TTS) encoding of the request body and store it in AWS S3. The object URL is returned once the resource is available.

Response:

https://s3.us-east-1.amazonaws.com/uk.ac.ncl.sot/afd70890-8019-4b5e-90c3-165615727926.mp3
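
A request to this route might be built as sketched below: the text to synthesise goes in the request body and the voice ID in the voice query parameter. Several details are assumptions rather than documented behaviour: the text/plain content type, the server address, the example voice ID "Joanna", the client timeout, and the reading of the response body as a plain-text URL. The function names are hypothetical.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
	"strings"
	"time"
)

// newGenerateRequest builds POST /generate/?voice={voiceID} with the
// text to synthesise as the request body.
func newGenerateRequest(baseURL, voiceID, text string) (*http.Request, error) {
	u, err := url.Parse(baseURL)
	if err != nil {
		return nil, err
	}
	u.Path = "/generate/"
	q := url.Values{}
	q.Set("voice", voiceID)
	u.RawQuery = q.Encode()

	req, err := http.NewRequest(http.MethodPost, u.String(), strings.NewReader(text))
	if err != nil {
		return nil, err
	}
	// Assumption: the server accepts the body as plain text.
	req.Header.Set("Content-Type", "text/plain; charset=utf-8")
	return req, nil
}

// generate sends the request and returns the response body, which is
// expected to contain the S3 object URL.
func generate(client *http.Client, baseURL, voiceID, text string) (string, error) {
	req, err := newGenerateRequest(baseURL, voiceID, text)
	if err != nil {
		return "", err
	}
	resp, err := client.Do(req)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	if resp.StatusCode < 200 || resp.StatusCode > 299 {
		return "", fmt.Errorf("unexpected status: %s", resp.Status)
	}
	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return "", err
	}
	return strings.TrimSpace(string(body)), nil
}

func main() {
	// The URL is returned only once the resource is available, so allow
	// a generous timeout (the value here is an arbitrary choice).
	client := &http.Client{Timeout: 60 * time.Second}
	objectURL, err := generate(client, "http://localhost:8080", "Joanna", "Hello, world.")
	if err != nil {
		fmt.Println("generate request failed:", err)
		return
	}
	fmt.Println("audio available at", objectURL)
}
```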
