Watson Cognitive Services
Generates accurate transcriptions by applying grammar, language structure and composition guidelines to audio signals.
IBM Watson Text to Speech API is Capable of identifying and registering more than one speaker with accuracy and confidence.
Custom Model support
For improved accuracy the API can be customized for the preferred language and content such as names of individuals, sensitive subjects or product names.
IBM Watson Speech to Text provides meaningful analytics by transcribing and analyzing audio from a microphone in real-time to pre-recorded files.
support for multiple languages
The IBM Watson Speech to Text Service with its speech recognition capabilities automatically transcribes Arabic, English, Spanish, French, Brazilian Portuguese, Japanese, and Mandarin speech into text.
multiple audio formats supported
Identifies and transcribes discussions with precision, even if the audio quality is low. Supports multiple audio formats (.mp3, .mpeg, .wav, .flac, or .opus) and programming interfaces (HTTP REST, Asynchronous HTTP, Websocket)
context and custom words support
Watson Natural Language Understanding identifies and analyzes text to drive meta-data from content such as keywords, concepts, categories, entities, semantic roles and relations.
For more personalized services, following three Watson Cognitive Services API’s can be used:
IBM WATSON PERSONALITY INSIGHTS
Predicts the needs, values and personality characteristics of an individual, by extracting information from their digital communications, social media and written text.
ibm watson tone analyzer
Detects three types of language tones, using linguistic analysis from text: social tendencies, emotional state and language style.
ibm watson emotion analysis
A fraction of the Alchemy Language API, is useful in measuring the emotions of an individual by analysing his or her writing.