Read It Aloud to Me
Cited by: 0
Authors:
Celaschi, Sergio [1]
Castro, Mauricio Sol [2]
da Cunha, Sidney Pinto [1]
Affiliations:
[1] Ctr Tecnol Informacao Renato Archer, Campinas, SP, Brazil
[2] Fundacao Apoio Capacitacao Tecnol Informacao, Rod DPedro I,Km 143,6, BR-13069901 Campinas, SP, Brazil
Source:
UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION: DESIGNING NOVEL INTERACTIONS, PT II | 2017 / Vol. 10278
Keywords:
Assistive technology; Text reading; Speech synthesis; OCR; Photo-to-speech; Blind; Visually impaired; Universal design
DOI:
10.1007/978-3-319-58703-5_19
Chinese Library Classification:
TP [Automation technology, computer technology]
Discipline classification code:
0812
Abstract:
Universal design applied to assistive technologies can help visually impaired people perform day-to-day tasks as well as anyone else. With this aim, the present work focuses on the development of photo-to-speech instruments for visually impaired people. These instruments allow the user to hear text typed on a sheet of paper or written or posted on a wall. To achieve this, a set of image capture and processing frameworks, including Optical Character Recognition (OCR) and Text-to-Speech Synthesis (TTS), was integrated. The first versions of the OCR-based speech synthesis systems were developed for our native language, Portuguese. A preliminary desktop version was designed for Windows, and a mobile version was developed as an Android application. In this paper, we summarize efforts to develop and test desktop and mobile versions of autonomous photo-to-speech instruments for the visually impaired. The project consisted of integrating selected components and the CPU applications that govern several functions: image capture by the CCD camera, image preprocessing, OCR for text recognition, and finally TTS, which produces a synthesized voice.
Pages: 260-268
Page count: 9