Open Access

Free tools and resources for Brazilian Portuguese speech recognition

  • Nelson Neto1,
  • Carlos Patrick1,
  • Aldebaro Klautau1 and
  • Isabel Trancoso2
Journal of the Brazilian Computer Society 2010, 17:23

Received: 5 July 2010

Accepted: 19 October 2010

Published: 4 November 2010


An automatic speech recognition system includes language-dependent modules, and while public resources are plentiful for some languages (e.g., English and Japanese), those for Brazilian Portuguese (BP) remain limited. This work describes the development of free tools and resources for BP speech recognition: text and audio corpora, a phonetic dictionary, a grapheme-to-phone converter, and language and acoustic models. All of them are publicly available and, together with a proposed application programming interface, have been used to develop several new applications, including a speech module for the OpenOffice suite. Performance tests compare the developed BP system with a commercial product. The paper also describes an application that combines speech synthesis and speech recognition with a natural language processing module dedicated to statistical machine translation. This application translates spoken conversations from BP to English and vice versa. The resources make it easier for other academic groups and industry to adopt BP speech technologies.


Speech recognition; Brazilian Portuguese; Grapheme-to-phone conversion; Application programming interface; Speech-based applications