Speaker - Page to Speech Plugin for WordPress

Speaker converts text into human-like speech in more than 235 voices across 40+ languages and variants

Speaker is a WordPress plugin designed to convert website page content into human-like speech. The plugin uses the latest machine learning and artificial intelligence technology to produce a high-quality human voice, and it adds an audio player with the content to the page. The Speaker plugin is built on the Google Cloud Platform, which ensures the reliability and speed of the plugin anywhere in the world.

The Speaker WordPress Plugin converts text into human-like speech in more than 190 voices across 35+ languages and variants. It applies groundbreaking research in speech synthesis (WaveNet) and Google’s robust neural networks to deliver high-fidelity audio.
Speaker is the only WordPress plugin that supports the Speech Synthesis Markup Language (SSML) standard. SSML allows you to fine-tune speech for each article on your website. You can add pauses, adjust intonation, and have numbers and telephone numbers read in the format people normally use. In addition, SSML in Speaker allows you to use several voices and languages in a single article. For example, you can voice an interview article in several different voices, or use this feature for online language courses.
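As a rough illustration of the SSML features mentioned above, the sketch below assembles a small SSML string with a pause, a telephone number, and two voices, then checks that it is well-formed XML. The elements used (break, say-as, voice) are standard SSML. The voice names and the sample text are placeholders, and the exact set of tags the plugin accepts should be checked in its SSML article.

```python
import xml.etree.ElementTree as ET


def build_ssml() -> str:
    """Return a small SSML document with a pause, a phone number, and two voices."""
    return (
        "<speak>"
        "Welcome to our interview."
        # A pause of fixed duration.
        '<break time="700ms"/>'
        # Ask the synthesizer to read the digits as a telephone number.
        'Call us at <say-as interpret-as="telephone">+1 555 0100</say-as>.'
        # Placeholder voice names: choose real ones from the plugin's voice table.
        '<voice name="en-US-Wavenet-D">What inspired the project?</voice>'
        '<voice name="en-US-Wavenet-F">A love of audio books.</voice>'
        "</speak>"
    )


ssml = build_ssml()
# ET.fromstring raises ParseError if the markup is not well-formed.
root = ET.fromstring(ssml)
print(root.tag, [child.tag for child in root])
```

Parsing the string before using it is a cheap way to catch an unclosed tag, which is the most common mistake when SSML is written by hand.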

Speaker - New major version available
Powered by Google Cloud Text-to-Speech API
Visual Builder for Speech Templates
More than 235 voices across 40+ languages
Works Great with All Popular Themes
Multiple voices in one WordPress post
Batch Page Processing
Modern User Interface
Fully Compatible with Gutenberg Editor
Intro and Outro for every Post and Page
Fully compatible with Elementor
7 Player Positions
Pauses of variable duration anywhere in the post
Five Beautiful Player Skins
Completely Cross-Browser Support
Fully compatible with WPBakery Page Builder
Muting for any piece of content
Minified & optimized native JS without jQuery
Suitable for all kinds of Layouts and Templates
Tested with Popular Page Builders

We are glad to present Speaker 3.0, with a new advanced feature that provides flexible settings for voicing your posts and pages. The text-to-speech plugin now supports not only regular WordPress posts and pages but also custom post types, and it is compatible with almost all themes and plugins. Since posts can contain different types of fields, the question arises of how to generate audio for particular fields depending on the users' needs. Speech Templates answer this question: they allow you to create and apply separate speech templates with specific settings and elements for each type of post.

This product has been tested and is fully compatible with all popular WordPress editors. Incredibly flexible plugin settings allow you to enable or disable it on different pages, for different user groups, and on different devices (mobile, tablet, desktop).

A lot of useful functions and settings

Batch processing allows you to convert all the existing pages of your site into speech. To do this, just make a few clicks in the familiar WordPress interface. It doesn’t matter whether your site has 5 posts or 5,000: speech will be synthesized and an audio file generated for each of them.

The Custom Post Type support feature allows you to generate audio not only for regular posts and pages but also for third-party types like Product, Portfolio, Property, etc. This greatly extends the use of Speaker.

Speech Templates editor to create and apply separate speech templates with specific settings and elements for each type of post.

Customizable intro and outro for all synthesized speech files, a very convenient feature if you want all your files to be branded in the same style. For example, you can add a greeting or copyright at the beginning and/or end of each speech file.

Flexible design features allow you to customize the appearance of the player to your liking and place it in different positions on the page. Besides, you can place several players on one page. For example, if you want to insert audio versions of several other articles into one article to maintain the context of the story.

Many of the customers requested a link to download the audio file. And we did it for you! Now with just one setting, you can enable the display of the download link on both the Front-end and the Back-end.

We continuously monitor the latest innovations and trends in the development universe, and we have prepared the plugin to work with the newest version of PHP. However, you can use the plugin on older versions as well. Speaker is compatible with PHP versions from 5.6 up to the most recent stable versions.

The new user interface has been made specifically for version 2.0. The UI is inspired by the Material Design style and looks fresh and modern.

Elementor widget and WPBakery Page Builder addon support to add the Speaker player to the page.

The Automatic Speech Synthesis feature allows you to generate audio when saving or updating pages. This is useful if you publish a lot of posts on your site and want to save time on generating.

Speaker is fully cross-browser. It does not matter which browser visitors use to view your site: in Chrome, Firefox, Opera, Safari, Edge, or even outdated versions of Internet Explorer, they will get an incredible experience. This is especially important if your site has many long pages, such as a Blog, Portfolio, Corporate Website, eCommerce Project, Creative Agency, Real Estate Listing, Personal CV, a Restaurant or Retail Company.

Features of Page to Speech Plugin

Speaker Text-to-Speech Plugin tested and compatible with All Major WP Plugins: Yoast SEO, Contact Form 7, W3 Total Cache, NextGen Gallery, Slider Revolution, WooCommerce, etc. It also works perfectly and improves the user experience with popular Website Page Builders, like Elementor, Visual Composer, Gutenberg, Beaver Builder, Divi, Themify Builder, WP Bakery Page Builder, Layers WP, and many others. The plugin works perfectly with all significant WordPress frameworks: Genesis, Divi, Themify, Warp 7, Ultimatum, Beans, Unyson, Gantry, and others.

Learn a little more about what the plugin can do and how to configure it from the Online Documentation.

Installing the WordPress plugin is quick and straightforward. After you download and unzip the purchased product, you will find the plugin archive pluginname.zip inside. There are two ways to install it:

Installation of the Plugin through the WordPress Admin panel (Recommended for most users)

STEP 1

First of all, open the WordPress admin login page and log in to WordPress as Admin. To do this, visit http://yoursitename/wp-admin

STEP 2

Go to Plugins > Add New

Add New Plugin to WordPress

STEP 3

Click the Upload Plugin button

Upload New Plugin to WordPress

STEP 4

Choose the archive with the plugin and press the Install Now button

STEP 5

After installing, go back to the Plugins page and click Activate below the plugin title

Activate Installed Plugin

Installing the plugin through FTP or the built-in File manager

STEP 1

Use your login and password to connect to your web server via FTP, or use your hosting file manager. You can find out your username and password on the website of your hosting provider.

STEP 2

Upload the archive with the plugin to /wp-content/plugins

STEP 3

Unzip the archive with the plugin

STEP 4

The plugin installation is almost finished. Go to the Plugins page and click Activate below the plugin title.

Activate Installed Plugin

After installing and activating the plugin, you can go to settings of the Plugin. Also, you can read more helpful articles about WordPress Themes and WordPress Plugins.

The Speaker WordPress Plugin converts a post or webpage into human-like speech in more than 194 voices across 35+ languages and variants. It applies groundbreaking research in speech synthesis (WaveNet) and Google’s robust neural networks to deliver high-fidelity audio.

Notes

This manual was created for plugin version 4.0.0 and higher. If you use an older version of the plugin, use the Speaker settings manual for versions 1.0.^.

To open the plugin settings, go to Speaker in the WordPress sidebar menu

Speaker WordPress Plugin Settings
Speaker Settings

In the plugin settings, there are several tabs responsible for various groups of plugin settings.

Speech Synthesis Settings

The tab describes the basic settings of the plugin, the selection of voices and features of speech synthesis.

API Key File

When you first run the plugin, you need to connect the Key File. You can learn how to get the Key File from the article About Key File for the Speaker & Voicer WordPress Plugins

Now used

Shows which voice is currently selected and allows you to listen to a sample of that voice with default settings.

Now used preview
Now used player

You can also listen to the Supported Google Voices and Languages in the Google Cloud Documentation.

Language

Use the filter to find the voices you need in the table. After selecting a language, you can filter the table by voice type and gender.

To select and activate a voice, click on the row in the table. The selected voice will be highlighted in blue.

Advanced voice settings

The toggle enables advanced voice and audio file settings. For experienced users.

Audio Format

Select the format in which the audio will be sent: MP3 or WAV. All recordings in other formats will become unavailable. After switching the format in the settings, audio files need to be recreated for each post.

Audio Encoding

The encoding determines the format of the output audio.

Audio Profile

You can optimize the synthetic speech produced by Cloud Text-to-Speech API for playback on different types of hardware. For example, if your app runs primarily on smaller, 'wearable' types of devices, you can create a synthetic speech from Cloud Text-to-Speech API that is optimized specifically for smaller speakers.

Speaking Rate/Speed

This setting changes the voice playback speed. The normal speed is 1. You can select any value in the range from 0.25 to 4: the lower the number, the slower the synthesized voice speaks, and the higher the number, the faster it speaks.

Pitch

Speaking pitch, in the range [-20.0, 20.0]. 20 means an increase of 20 semitones from the original pitch. -20 means a decrease of 20 semitones from the original pitch. The default value is 0.

Volume Gain

The setting controls the audio gain via the slider. Values from -10 dB to 15 dB are available.

Sample Rate

Specify the synthesis sample rate.

We recommend a sample rate of at least 16 kHz in the audio files that you use for transcription with Speech-to-Text. Sample rates found in audio files are typically 16 kHz, 32 kHz, 44.1 kHz, and 48 kHz. Because intelligibility is greatly affected by the frequency range, especially in the higher frequencies, a sample rate of less than 16 kHz results in an audio file that has little or no information above 8 kHz. This can prevent Speech-to-Text from correctly transcribing spoken audio. Speech intelligibility requires information throughout the 2 kHz to 4 kHz range, although the harmonics (multiples) of those frequencies in the higher range are also important for preserving speech intelligibility. Therefore, keeping the sample rate to a minimum of 16 kHz is a good practice.
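The advanced voice settings above correspond to fields of the audioConfig object in the Google Cloud Text-to-Speech REST API (audioEncoding, speakingRate, pitch, volumeGainDb, sampleRateHertz). The sketch below validates values against the ranges listed in this manual and builds such an object. How the plugin itself assembles its requests is not documented here, so the function name and the validation are illustrative assumptions.

```python
def build_audio_config(encoding="MP3", speaking_rate=1.0, pitch=0.0,
                       volume_gain_db=0.0, sample_rate_hertz=None):
    """Validate the settings against the ranges in this manual and
    return a dict shaped like the Text-to-Speech audioConfig object."""
    if not 0.25 <= speaking_rate <= 4.0:
        raise ValueError("speaking rate must be between 0.25 and 4.0")
    if not -20.0 <= pitch <= 20.0:
        raise ValueError("pitch must be between -20.0 and 20.0 semitones")
    # Range of the plugin's slider as described above (an assumption about the UI).
    if not -10.0 <= volume_gain_db <= 15.0:
        raise ValueError("volume gain must be between -10 dB and 15 dB")

    config = {
        "audioEncoding": encoding,
        "speakingRate": speaking_rate,
        "pitch": pitch,
        "volumeGainDb": volume_gain_db,
    }
    # The sample rate is optional; leave it out to use the voice's default.
    if sample_rate_hertz is not None:
        config["sampleRateHertz"] = sample_rate_hertz
    return config


print(build_audio_config(speaking_rate=1.25, pitch=-2.0, sample_rate_hertz=24000))
```

Rejecting out-of-range values before a request is sent avoids spending quota on calls that would fail anyway.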

Synthesize audio on save

This setting enables automatic speech synthesis when you press the Update button or Publish Button.

Notes

This option can significantly increase the cost of your monthly Google Cloud quota.

When this option is turned on, every time you change a page and save it, all the contents of the page are sent for speech synthesis again.
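A rough way to see the effect on your quota is to multiply the page length by the number of saves. The sketch below is plain arithmetic with made-up numbers. It ignores any intro or outro text, and the actual billing is determined by Google Cloud.

```python
def characters_submitted(page_length: int, saves: int) -> int:
    """Estimate how many characters are sent for synthesis when the
    whole page is resubmitted on every save."""
    return page_length * saves


# A 10,000-character page saved 5 times while editing.
print(characters_submitted(10_000, 5))  # 50000
```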

Post Types

The tab contains settings for choosing the post types for audio generation and for applying a default speech template to each post type.

Post Types

Select the post types, including custom post types, that Speaker should work with. The available types are in the drop-down list. Once new post types are added to your site, they will be displayed in the list.

Post Types list of the Speaker

The Speaker can only work with public posts. There are various reasons for this:

If you use password-protected post types that are available only for certain users or a group of users, the Speaker can't work with them.

Speech Templates

The settings section allows you to apply Speech Templates as default for each of the post types that you select in the Post types field. You can create new Speech Templates when editing or creating a page/post.

Default Speech Templates of the Speaker

Audio Content

Before Audio

In this text field, you can add text that will be placed at the beginning of the audio file. For example, you can add a greeting or a copyright notice.

After Audio

In this text field, you can add text that will be placed at the end of the audio file. For example, you can add a farewell or a copyright notice.

Read the Title

The option allows you to voice the Title of the current post/page when generating audio.

The setting can be applied only to the default Speech Template "Content"

Read quotes

The option allows you to voice the quotes in the current post/page when generating audio.

Read the Image Caption

The option allows you to voice the Image caption of the current post/page when generating audio.

Read image alt

The option allows you to voice the Image alt in the current post/page when generating audio.

Punctuation

The option allows you to customize the length of pauses after dots and commas when generating audio.

Punctuation settings

Regular expression replacements

With this option, you can use regular expressions to replace any group of characters in the text that is voiced.

RegEx replacements settings

Enter the regular expression to be replaced and, on a new line, the term or SSML tag to replace it with. You can also use the WordPress filter speaker_after_content_regex_replace to manipulate content for voicing.

An example of replacing a number from 1 to 9 with the letter "a"

Shortcodes cannot be used in the regex rules. Use SSML instead.
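The replacement rules can be pictured as a list of pattern and replacement pairs applied one after another. The sketch below reproduces the example of replacing digits 1 to 9 with the letter "a" using Python's re module. The plugin runs on PHP, whose regular expression syntax differs in some details, so treat this as an illustration of the idea and not of the plugin's internals. The helper name and the second rule are hypothetical.

```python
import re


def apply_replacements(text, rules):
    """Apply each (pattern, replacement) pair to the text in order."""
    for pattern, replacement in rules:
        text = re.sub(pattern, replacement, text)
    return text


rules = [
    # Replace every digit from 1 to 9 with the letter "a".
    (r"[1-9]", "a"),
    # Replace an abbreviation with an SSML tag that spells it out.
    (r"\bWP\b", '<sub alias="WordPress">WP</sub>'),
]

print(apply_replacements("Room 7, floor 3", rules))  # Room a, floor a
print(apply_replacements("WP rocks", rules))
```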

Player

The design tab contains settings related to the design of the Player and its surrounding elements.

Player position

You can select one of the available player positions:

Important

Some themes may have problems displaying the player in Before Title and After Title positions. Switch the plugin to another position in such cases.

You can easily add the Speaker WordPress Plugin player anywhere on the page using the shortcode [speaker]. You can also show the player with the audio recording of another post by using the shortcode [speaker id=PAGEID], where PAGEID is the ID of the page whose audio version you want to use in the player.

Speaker WordPress Plugin
Page ID

You can find out the page ID in the address bar during editing or from the page code.

Player Style

The plugin has five player styles that you can use on the website pages. You can also customize any of the presented player templates with CSS.

Wavesurfer player

Flexibly customizable player with waves on the audio track

Wavesurfer player

MediaElement player

Core WordPress player which can be styled to suit your wishes.

MediaElement player

WordPress Default Player

Regular WordPress audio player:

WordPress Default Audio Player

Chrome Style player

Browser Default Player

Default browser player without any styles and scripts. The appearance of the player is different in different browsers:

Chrome & Opera Audio Player

The Chrome & Opera Audio Player has a link to download the audio file even when the Download Link is hidden in the plugin settings.

Safari Audio Player
FireFox Audio Player

Please note that when you choose the Browser Default Player, the standard WordPress player will be disabled on your site. This affects all players on all pages of your site.

To fine-tune the design of the player, use the CSS class .mdp-speaker-box

Basic player style settings

The settings block below is only for Wavesurfer and MediaElement players

Wave (for Wavesurfer player)

Flexible style settings for customizing each element of the Wavesurfer player.

Play/Pause icon

The options to select icons for the play and pause player buttons from the library or upload your own SVG. The option is available for Wavesurfer and MediaElement players.

Edit play and pause button icons

Large play button

The option allows you to enable a large play button with additional width settings and add a thumbnail as a background. The option is available for the Wavesurfer player.

Repeat button

The setting to enable/disable the repeat button in the Wavesurfer and MediaElement players. This also includes an option to select On/Off icons from the library or upload your own SVG.

Current time

The setting to enable/disable the current time display in the Wavesurfer and MediaElement players.

Duration

The setting to enable/disable the duration time display in the Wavesurfer and MediaElement players.

Mute button

The setting to enable/disable the mute button in the Wavesurfer and MediaElement players. This also includes an option to select Mute/Volume icons from the library or upload your own SVG.

Volume bar

The setting to enable/disable the volume bar display in the Wavesurfer and MediaElement players.

Download button

The setting to enable/disable the download button in the Wavesurfer and MediaElement players. There are multiple positions for displaying the button:

Download button in the Above player position
Download button in the Below player position
Download button in the Player layout position

Backend download link

The setting to enable/disable the download button in the backend when editing the post/page.

Backend download link

Speed button

The setting to enable/disable the speed button in the player. Additional options are available for the block:

Available speeds option
Audio speed control on frontend

Share button

The setting to enable/disable the share buttons display in the player. Additional options are available for the block:

The option to select the social networks
Icons type of the shared buttons on frontend

Audio Preload

The preload attribute specifies if and how the audio file should be loaded when the page loads.

The audio preload options

Thumbnail

Select the thumbnail display position in the player:

Background

The option allows you to set the post/page featured image as background of the Wavesurfer and MediaElement players.

Above

The option allows you to set the post/page featured image above the player and specify the thumbnail height.

Below

The option allows you to set the post/page featured image below the player and specify the thumbnail height.

In play button

The option allows you to set the post/page featured image as background of the Large play button in Wavesurfer player and specify the thumbnail height.

Autoplay

The option allows playing audio after page load. Some browsers do not allow autoplay for any audio or video according to their policy, so this feature may not work for certain browsers.

                          Chrome  FireFox  Opera  Safari  Edge
WordPress default player  No      No       Yes    No      Yes
Browser default player    No      No       Yes    Yes     Yes
The table with listed browsers that support autoplay feature

Loop

The option allows looping the audio playback.

Storage

This section configures audio file storage, metadata, and the naming of audio files.

Audio File Name

The option allows you to change the name of the generated audio files using available patterns: [id] - post ID; [name] - post slug; [title] - post title; [author] - post author; [type] - post type.

If you change the Audio File Name, the audio files with the old name will no longer be displayed as audio for the post.
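To make the patterns concrete, the sketch below expands a file name pattern for a sample post. The function name, the dictionary layout, and the sample values are hypothetical. Only the five placeholders come from the option description above, and the plugin's real expansion (for example, how it sanitizes titles) may differ.

```python
def expand_file_name(pattern: str, post: dict) -> str:
    """Replace the [id], [name], [title], [author] and [type]
    placeholders with values from the post."""
    result = pattern
    for key in ("id", "name", "title", "author", "type"):
        result = result.replace(f"[{key}]", str(post[key]))
    return result


# Hypothetical post data for illustration.
post = {
    "id": 42,
    "name": "hello-world",   # post slug
    "title": "Hello World",
    "author": "admin",
    "type": "post",
}

print(expand_file_name("[type]-[id]-[name]", post))  # post-42-hello-world
```

Including [id] in the pattern keeps file names unique even when two posts share a title.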

Custom fields

The toggle enables or disables saving data about the generated audio file in meta fields. Read more in the guide.

Visible in the Media Library

The toggle to make the audio file visible and available in the Media Library.

Note: The files that were generated before the option was enabled will not be displayed.

Storage

Media library

Selected by default. It saves audio files locally and does not require additional settings.

Library + Google Drive

Allows you to automatically save files both locally and on Google Drive after the audio generation. For this, you need to get your own Google Drive API key file following the guide. Once you get your JSON key file you need to upload it to the Google Drive API key field by drag and drop or upload by clicking on the field.

Once the key file has been successfully added, you need to get your token by allowing access to the disk to save files. After the installation is completed, your audio files will also be saved on Google Drive disk in the "Speaker" folder.

API key and token are successfully installed

Legacy Mode

The option to use the old method of storing audio files. It ignores the Audio File Name pattern and uses the old file names from Speaker versions lower than 4.

Speakable Markup

The toggle to enable/disable the Speakable markup option for the post/page selected in the Speaker>Post Types.

The speakable schema.org property identifies sections within an article or webpage that are best suited for audio playback using text-to-speech (TTS). Adding markup allows search engines and other applications to identify content to read aloud on Google Assistant-enabled devices using TTS. Web pages with speakable structured data can use the Google Assistant to distribute the content through new channels and reach a wider base of users.

The additional options are available when the Speakable Markup toggle is On.

Speakable markup settings
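For reference, speakable structured data is usually published as JSON-LD, with a SpeakableSpecification that points at the parts of the page to read aloud. The sketch below builds such an object. The page name, URL, and CSS selectors are placeholders, and the markup the plugin actually emits may be structured differently.

```python
import json


def speakable_json_ld(page_name, url, css_selectors):
    """Return JSON-LD for a web page whose speakable sections
    are identified by CSS selectors."""
    data = {
        "@context": "https://schema.org",
        "@type": "WebPage",
        "name": page_name,
        "url": url,
        "speakable": {
            "@type": "SpeakableSpecification",
            "cssSelector": css_selectors,
        },
    }
    return json.dumps(data, indent=2)


# Placeholder values for illustration.
markup = speakable_json_ld(
    "Sample article",
    "https://yoursitelink/sample-article/",
    [".entry-title", ".entry-summary"],
)
print(markup)
```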

Podcasts

An option that allows you to create an RSS podcast feed for any podcast service based on audio generated on your site.

Create RSS podcast feeds following the RSS feed guidelines for Google Podcasts, the Apple Podcaster’s Guide to RSS, the Spotify Podcast Delivery Specification, or the requirements of other podcast services.

The Header and Item fields of the template become available when the RSS option is enabled.

Podcasts feed settings

Once all the configuration is done the feed will be available at the link https://yoursitelink?feed=speaker-podcast

To get a feed only with posts of a certain category, use the link

https://yoursitelink?cat=7,8&feed=speaker-podcast

Where cat = "category ID"

To get a feed only with posts of a certain tag, use the link

https://yoursitelink?tag=tag1,tag2&feed=speaker-podcast

Where tag="tag slug"
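The feed links above differ only in their query parameters, so they can be assembled programmatically. The sketch below is a small helper (the function name is hypothetical) that builds the three link forms shown in this section. It keeps commas unescaped so the result matches the links as written.

```python
from urllib.parse import urlencode


def podcast_feed_url(site, categories=None, tags=None):
    """Build a Speaker podcast feed link, optionally limited to
    category IDs or tag slugs."""
    params = {}
    if categories:
        params["cat"] = ",".join(str(c) for c in categories)
    if tags:
        params["tag"] = ",".join(tags)
    params["feed"] = "speaker-podcast"
    # safe="," keeps the comma-separated lists readable in the URL.
    return f"{site}?{urlencode(params, safe=',')}"


print(podcast_feed_url("https://yoursitelink"))
print(podcast_feed_url("https://yoursitelink", categories=[7, 8]))
print(podcast_feed_url("https://yoursitelink", tags=["tag1", "tag2"]))
```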

Create root file

The option allows you to create an RSS file in the website's root: https://yoursitelink/podcast.xml

Multilingual support

This tab controls multilingual audio generation. You have to install and configure multilingualism on your site using WPML, Polylang or other similar plugins before enabling the Multilingual option. Select a locale and specify the language you want to use for it.

The toggle to enable the Multilingual settings

Press the Add locale button to add a new language. The locales selected in the settings must exactly match the languages that are installed in your multilingual plugin.

The option for locale selection

Speaker can only work with multilingual plugins that create each translation on a separate page. Speaker does not create audio for different languages if the translations are on one page but in different tabs, sections, or blocks, or if the translation is generated automatically from the main language.

Analytics Settings

This section contains the Analytics settings, which collect non-sensitive, anonymous listening data on generated audio files. The data is displayed only in the admin panel.

The toggle to enable the Analytics settings

Analytics precision

The slider sets the number of seconds after which the data will be sent to the server. Values that are too small can cause additional load on your database.

Analytics dashboard widget

The option displays a widget with generation data and quota in the admin dashboard.

Analytics dashboard widget

This is an estimate of the characters submitted for voiceover for the current site. You can find out the exact data on the use of your quota for the entire project from the Google Cloud dashboard.

Analytics metabox

The option displays analytics results in custom fields when editing a post/page.

Speaker analytics metadata in a post editor

Analytics column

The option displays analytics results in a column on the post list. It allows you to select the types of statistics data you want to display.

Analytics column settings for a posts list
The analytics column on the posts list

Advanced Settings

Advanced settings allow you to configure the plugin to work with your theme filters and hooks, as well as manage user rights. Please make sure you understand what you are doing before changing anything on this tab.

Content before the player

The toggle enables the input field for adding text, an image, or HTML markup before the audio player.

The toggle to display content before the player

Content after the player

The toggle enables the input field for adding text, an image, or HTML markup after the audio player.

Display in loop

The option to display a player only for loop queries.

Part length

Set the length of the parts into which the entire post text is divided before it is sent to Google. The maximum length of a text part is 5000 bytes.

Google limits a single request to 5000 characters or bytes. Keep in mind that Google counts characters for Standard and WaveNet voices but counts bytes for Studio and Neural2 voices. This matters because in some languages, such as Thai and Arabic, one character takes 2-3 bytes.
In addition, there is probably a bug on the Google side with Neural2 and Studio voices: when they are used, the limit becomes 500 bytes per request.

You can use the Part length option to split the text into smaller parts if you receive an error when generating an audio file.
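The difference between characters and bytes is easiest to see in code. The sketch below splits text into parts whose UTF-8 size stays within a byte limit, breaking at spaces where possible. It is an illustrative approach under that assumption and not the plugin's actual splitting algorithm. The function name and the sample text are hypothetical.

```python
def split_into_parts(text: str, max_bytes: int = 5000) -> list[str]:
    """Split text into parts of at most max_bytes UTF-8 bytes,
    preferring to break at spaces."""
    if max_bytes < 4:
        # A single UTF-8 character can take up to 4 bytes.
        raise ValueError("max_bytes must be at least 4")

    parts, current = [], ""
    for word in text.split(" "):
        candidate = word if not current else current + " " + word
        if len(candidate.encode("utf-8")) <= max_bytes:
            current = candidate
            continue
        # The word does not fit: close the current part and start a new one.
        if current:
            parts.append(current)
        current = ""
        # A word longer than the limit is cut character by character.
        for ch in word:
            if len((current + ch).encode("utf-8")) > max_bytes:
                parts.append(current)
                current = ch
            else:
                current += ch
    if current:
        parts.append(current)
    return parts


# Each Thai character below takes 3 bytes in UTF-8.
sample = " ".join(["สวัสดี"] * 1000)
parts = split_into_parts(sample, max_bytes=5000)
print(len(sample), len(sample.encode("utf-8")), len(parts))
```

Measuring the encoded size and not the character count is what keeps multi-byte languages within a byte-based limit.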

Can manage audio files

The option to select the user roles that will have access to create, re-create, remove audio and speech templates. Find out more about user roles and capabilities in WordPress.

Compatibility

Select the PHP version that is used on your server to improve compatibility with the Google Library version.

Select a compatible Google Library version

Updates

Check Updates

The toggle to enable/disable requests to the update server. When the option is enabled (by default), the plugin sends requests to our server when activating and updating our plugin.

You can turn off this setting if for some reason you need to disable requests to our server. In this case, automatic plugin updates will not be available but all the plugin functionality will be saved.

Please note: the plugin must be activated via your purchase code before the Check Updates feature is disabled.

Clear cache data

A button to clear cache data. Can be used when there is a problem with updates or activation.

Save Changes

Do not forget to click Save Changes after changing plugin settings. After clicking this button, the settings will be saved and applied to all pages of the website.

Now you can proceed to use the plugin on the pages of your WordPress website. Please read the article Converting WordPress page to speech and Speech Synthesis Markup Language (SSML) in the Speaker WordPress plugin.

Difference between Speaker and Voicer plugins

https://www.youtube.com/watch?v=zn-XrLDfDn0

Hosting

Fast and reliable hosting is significant for any WordPress site. We recommend all our customers use SiteGround WordPress Hosting. Many unique settings and features make this hosting the number 1 for WordPress: Free Website Transfer, Staging Tools, Free SSL, CDN, and much more for 3.95/mo.