Muutke küpsiste eelistusi

Spoken Language System and Corpus Design Reprint 2012 [Kõva köide]

Edited by , Edited by , Edited by
Teised raamatud teemal:
Teised raamatud teemal:
No detailed description available for "Spoken Language System and Corpus Design".
User's guide 1(1)
Background
1(3)
Eagles objectives
1(1)
Eagles organisational structure
2(1)
Eagles workplan
3(1)
Spoken Language systems, standards and resources
4(5)
Spoken Language systems
4(1)
Standards and resources for Spoken Language systems
5(4)
The Eagles Spoken Language Working Group (WG5)
9(5)
Subgroups of the Eagles Spoken Language Working Group
11(1)
Relationships with the other Eagles Working Groups
11(1)
Workshops
11(2)
Production of the handbook
13(1)
Consultation with the R&D Community
13(1)
Overview of the handbook
14(11)
Intended readership
14(1)
Scope
15(6)
The main chapters of the handbook
21(4)
The current state of play
25(1)
Possible future actions
25(2)
Revision and completion of existing documentation
25(1)
Extended survey of existing practice
25(1)
Extension of language base
26(1)
Terminology
26(1)
Move to prescriptive recommendations
26(1)
Publication and dissemination
26(1)
Coordination with other bodies
26(1)
Contact points
27(1)
Acknowledgements
28(1)
Part IV: Spoken language reference materials 29(184)
Character codes and computer readable alphabets
30(30)
Introduction
30(1)
ASCII
30(2)
Problems
31(1)
Multi-Byte encodings
32(1)
ISO 10646
32(1)
The Unicode standard
32(1)
Translation of code tables
33(1)
Specific vs. general converters
33(1)
MIME
34(1)
Computer readable phonetic alphabetes
34(4)
IPA subsets
35(1)
Full IPA alphabet encoding
36(2)
References
38(22)
SAMPA computer readable phonetic alphabet
60(48)
Introduction
60(4)
Notation issues
60(1)
Transcription
60(1)
Coding
61(2)
Further languages
63(1)
SAMPA: Present status
64(3)
The phonemic notation of individual languages
67(36)
Bulgarian
68(2)
Danish
70(3)
Dutch
73(2)
English
75(4)
Estonian
79(2)
French
81(2)
German
83(3)
Greek
86(2)
Hungarian
88(2)
Italian
90(3)
Norwegian
93(2)
Polish
95(1)
Portuguese
96(2)
Romanian
98(1)
Spanish
99(2)
Swedish
101(2)
Levels of annotation and extension of SAMPA
103(3)
SAMPA as a phonemic system
103(1)
Detailed phonetic or acoustic annotation
103(3)
SAMPROSA (SAM PROSodic Alphabet): symbol set
106(1)
References
107(1)
SAM file formats
108(15)
Introduction
108(1)
Speech file and associated description file formats
108(1)
Label file format
108(5)
Label file header format
109(1)
Label file body format
110(1)
Label types
110(1)
Label File example
111(2)
Files used in a recording session using EUROPEC
113(5)
Corpus File: CORPUS.DBF
113(2)
Speaker File: SPEAKERS.DBF
115(1)
Prompt file
116(2)
Protocol description file
118(3)
Codes used for presentation protocol (Protocol Description File)
118(1)
Example of Protocol Description File
119(2)
Recording conditions file
121(1)
PRIVATE.PEC file
121(2)
SAM recording protocols
123(11)
Definition of terms
123(1)
Classification of general strategies for recording and prompting
124(1)
Recording mode
124(1)
Prompting style
124(1)
Timing strategy
125(1)
Recording protocol
125(9)
Microphone
125(1)
Other sensors
126(1)
Speech data capture
126(1)
Recording environment
127(1)
Recording mode and prompting style
127(1)
Recording control
128(2)
Recording procedure
130(1)
Integrity checks
131(1)
Backup procedures
131(1)
Retrieval procedures
131(1)
Calibration
132(1)
Inter site consistency and recording procedure verification
132(1)
Collation of recordings
132(2)
SAM software tools
134(12)
DKISALA (Interactive Semi-Automatic Labelling Software)
134(1)
ELSA (ESPRIT Labelling System Assessment software)
135(1)
EUROPEC (European Program d'Enregistrement de Corpus)
136(1)
PTS (Progiciel de Traitement de Signal)
136(1)
RESAM
137(1)
SAMITPRO (Sam Iteractive Proportional Fitting)
138(1)
SAM_REC0 Isolated Word Recogniser
139(1)
SAM_SCOR (Sam Input Assessment Scoring Software)
140(1)
SAM_SLM
141(1)
SAM_SPEX (Speech Parameter Extractor)
142(1)
SAMTRA (Sam TRanscription Analysis)
143(1)
SOAP (Speech Output Assessment Package)
143(1)
PTM (Parametric Test Manager)
144(2)
EUROPEC recording tool
146(3)
Digital storage media
149(7)
Storage media dimensions
149(2)
Permanence
149(1)
Order of access
149(1)
Other storage media characteristics
150(1)
Compression
150(1)
CD-ROM
151(3)
Physical CD formats
151(1)
Logical data organisation
152(1)
CD-R software
152(1)
CD-R hardware
153(1)
Future developments
153(1)
Summary
154(1)
References
155(1)
Database Management Systems (DBMSs)
156(8)
Data model
156(5)
Hierarchical data model
157(1)
Network data model
157(1)
Relational data model
157(3)
Object-oriented data model
160(1)
Deductive data model
160(1)
Safe storage of data
161(1)
Application-independent storage of data
161(1)
Controlled access to data
162(1)
Summary
162(1)
References
163(1)
Speech standards
164(8)
Introduction
164(1)
Signal theory recalls
164(1)
Speech research
165(4)
Computer hardware and software
169(1)
Conclusion
170(1)
Appendix: Useful anonymous ftp sites
171(1)
EUROM-1 database overview
172(2)
Polyphone project overview
174(10)
Introduction
174(1)
Reusable resources
175(1)
The Dutch Polyphone corpus
176(2)
Recording workstation
176(1)
Speaker selection
176(1)
The speech material
177(1)
Postprocessing
178(1)
Use of Polyphone in application development
178(4)
Train time table information
178(3)
Phone card services
181(1)
Speaker verification
182(1)
Requirements for future corpora
182(2)
European speech resources
184(16)
Overview
184(1)
Criteria for assessment of the situation of Spoken Language Resources
185(3)
Types and specificities of corpora
186(1)
Actors in speech resource production
187(1)
Summary of the current situation on a per language basis
188(8)
DANISH (Denmark)
188(1)
DUTCH (The Netherlands)
189(1)
ENGLISH (United Kingdom)
189(1)
FRENCH (France, Belgium, Switzerland)
190(1)
GERMAN (Germany)
191(1)
GREEK (Greece)
192(1)
ITALIAN (Italy)
193(1)
NORWEGIAN (Norway)
193(1)
PORTUGUESE (Portugal)
194(1)
SPANISH (Spain)
195(1)
SWEDISH (Sweden)
196(1)
General conclusions
196(2)
Production costs
198(2)
Transcription and documentation conventions for Speechdat
200(10)
Introduction
200(1)
Points of departure
200(1)
Markings required for scoring
201(5)
Case
201(1)
Spelling
201(1)
Number sequences
202(1)
Letter sequences
202(1)
Punctuation
203(1)
Mispronuciations
203(1)
Unintelligible words
203(1)
Word fragments
203(1)
Verbal deletions
203(1)
Non-speech acoustic events
204(1)
Prosodic annotation
205(1)
Truncated waveforms
205(1)
Assessment (optional)
206(1)
Procedure
207(1)
Speaker information
207(1)
Some language-specific transcription options
207(3)
French
207(1)
English
208(1)
Italian
209(1)
The Bavarian Archive for Speech Signals
210(3)
Introduction
210(1)
General information
210(1)
Corpora
210(1)
Corpora in preparation
211(1)
Other activities
212(1)
Cooperations
212(1)
Future plans
212(1)
More information
212(1)
Bibliographical references 213(4)
Glossary 217(14)
List of abbreviations 231(6)
Index 237