You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A. Support for non-English (allowing our 600k non-English books to be read)
B. Reduce bandwidth (generating mp3's is not accessible to all and slow to load)
C. Pause button
D. Playback speed controls
E. Backwards compatibility with Festival (where possible)
Implementation
Use Web Speech API (which provides options for many of these) where possible. Investigate how many of these features are also compatible Festival so we don't have to support two separate UIs if we have to fallback to Festival.
UX Planning - Determine what we want the audiobook experience to be like. This will dictate the next steps. Results of this planning will be posted on/used to close Design Read Aloud UI Options #118 .
Motivation
Each year millions of patrons use the Internet Archive’s BookReader to explore book-formatted content. Many readers are surprised to learn Bookdeader has a read-aloud button which enables any book to be read out loud by a friendly robot voice. While for many this feature is icing on the cake, others with sight and print-disabilities depend on this feature to access our books. Our current read-aloud offering gets the job done, but there are a handful of major problems and opportunities for improvement. By switching from our current system, which uses Festival text-to-speech and generating mp3 files on the fly, to using the browser’s native Web Speech API, we could fix 5 major issues:
Generating mp3's is high bandwidth (not accessible to all users) and slow to load.
Our current read-aloud has no way to pause and resume your position!
There’s no UI for changing the playback speed of our Festival voices.
In most browsers, the Web Speech API voices are superior to our Festival voices.
Today, all our books are read with an English accent -- Web Speech API offers multiple voices and the ability for books to be read in their native language.
We have a working prototype of our books being read using the Web Speech API. The next step is to integrate the Web Speech API into our native Internet Archive BookReader, and we already have a start on the feature/native-audioreader branch!
Features we definitely want:
Implementation
Plan
(NOT CEMENTED YET)
plugin.tts.jsto include an abstract class,SpeechSynthesisEngine, and a concrete classFestivalSpeechEngine(necessary for E)plugin.tts.jsto include a classBrowserSpeechEnginewhich implementsSpeechSyntehsisEngine, and which is used on supported browser (Completes B)BrowserSpeechEngineshould choose voice based on book language (Completes A)Motivation
Each year millions of patrons use the Internet Archive’s BookReader to explore book-formatted content. Many readers are surprised to learn Bookdeader has a read-aloud button which enables any book to be read out loud by a friendly robot voice. While for many this feature is icing on the cake, others with sight and print-disabilities depend on this feature to access our books. Our current read-aloud offering gets the job done, but there are a handful of major problems and opportunities for improvement. By switching from our current system, which uses Festival text-to-speech and generating mp3 files on the fly, to using the browser’s native Web Speech API, we could fix 5 major issues:
We have a working prototype of our books being read using the Web Speech API. The next step is to integrate the Web Speech API into our native Internet Archive BookReader, and we already have a start on the feature/native-audioreader branch!
Related to #58