This document describes a karaoke-style read-aloud system built on text-to-speech and speech alignment. A text-to-speech API generates an audio file from the text, and the Hidden Markov Model Toolkit (HTK) then aligns that audio with the text to produce a timed text file. The timing data lets the text be highlighted as it is read aloud, as in karaoke, and it supports language learning through shadowing (repeating speech as it is heard). The process consists of text preprocessing, audio generation and processing, phonetic transcription, forced alignment with HTK, and output of a timed text file.
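As a rough illustration of the last step, the sketch below turns alignment output into highlight timings. It assumes the aligner emits a word-level HTK label file with one `start end label` entry per line, with times in HTK's 100 ns units. The function names `parse_htk_labels` and `active_cue` and the sample data are hypothetical, not taken from the original system.

```python
from bisect import bisect_right

# HTK label files express times in units of 100 ns.
HTK_UNITS_PER_SECOND = 10_000_000


def parse_htk_labels(text):
    """Parse 'start end label' lines into (start_s, end_s, label) tuples."""
    cues = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 3:
            continue  # skip blank or malformed lines
        start, end, label = int(parts[0]), int(parts[1]), parts[2]
        cues.append(
            (start / HTK_UNITS_PER_SECOND, end / HTK_UNITS_PER_SECOND, label)
        )
    return cues


def active_cue(cues, t):
    """Return the label being spoken at time t (in seconds), or None."""
    starts = [cue[0] for cue in cues]
    i = bisect_right(starts, t) - 1  # last cue starting at or before t
    if i >= 0 and t < cues[i][1]:
        return cues[i][2]
    return None


# Made-up alignment output for two words (assumption, for illustration only).
sample = """0 3500000 hello
3500000 8000000 world
"""
cues = parse_htk_labels(sample)
print(cues)
print(active_cue(cues, 0.5))
```

A player would call `active_cue` with the current playback position on each tick and highlight the returned word. Silence between words simply yields `None`.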
These are the slides from my talk at July Tech Festa 2015.
Later, Ogawa-san also posted the following slides explaining OpenVAS CLI operation:
"Security Checks You Can Do for Free: OpenVAS CLI Edition"
http://www.slideshare.net/abend_cve_9999_0001/openvas-cli-51048313
# Combine both buyer tables, flag royal customers, then join each buyer to their purchases.
purchaser_log = sqldf.execute("""
    select *
    from (
        select *, is_royal_customer(name) as royal
        from (
            select name, sex, age
            from buyer1
            union all
            select name, sex, age
            from buyer2
        )
    ) as b
    inner join
        purchase_log as p
        on b.name = p.buyer;
""")
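The query above calls a Python function, `is_royal_customer`, from inside SQL. The source does not show how `sqldf` registers that function, so the following is only a minimal sketch of the same pattern using the standard-library `sqlite3` module instead. The table contents, the `item` column, and the rule inside `is_royal_customer` are made up for illustration.

```python
import sqlite3


def is_royal_customer(name):
    # Hypothetical rule, for illustration only.
    return 1 if name in {"alice"} else 0


conn = sqlite3.connect(":memory:")
# Expose the Python function to SQL under the same name, taking one argument.
conn.create_function("is_royal_customer", 1, is_royal_customer)

# Small made-up tables mirroring the names used in the query.
conn.executescript("""
create table buyer1 (name text, sex text, age integer);
create table buyer2 (name text, sex text, age integer);
create table purchase_log (buyer text, item text);
insert into buyer1 values ('alice', 'F', 30);
insert into buyer2 values ('bob', 'M', 41);
insert into purchase_log values ('alice', 'book');
insert into purchase_log values ('bob', 'pen');
""")

rows = conn.execute("""
    select b.name, b.royal, p.item
    from (
        select *, is_royal_customer(name) as royal
        from (
            select name, sex, age from buyer1
            union all
            select name, sex, age from buyer2
        )
    ) as b
    inner join purchase_log as p
        on b.name = p.buyer
    order by b.name
""").fetchall()
print(rows)
```

SQLite accepts the unaliased inner subquery. Some other engines require an alias on every derived table, so the query may need one if it is moved to a different backend.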