Text, Speech, and Dialogue: 27th International Conference, TSD 2024, Brno, Czech Republic, September 9–13, 2024, Proceedings, Part II: Lecture Notes in Computer Science, volume 15049
Edited by Elmar Nöth, Aleš Horák, Petr Sojka. In English. Paperback – 26 Sep 2024
The 50 revised full papers presented in these proceedings were carefully reviewed and selected from 103 submissions.
The papers are organized in the following topical sections:
Part I: Text
Part II: Speech, Dialogue
| All formats and editions | Price | Express |
|---|---|---|
| Paperback (1) | 432.11 lei, 3-5 weeks | |
| Springer Nature Switzerland – 26 Sep 2024 | 432.11 lei, 3-5 weeks | |
| Paperback (1) | 374.99 lei, 6-8 weeks | |
| Springer Nature Switzerland – 26 Sep 2024 | 374.99 lei, 6-8 weeks | |
Price: 432.11 lei
Old price: 540.14 lei
-20%
Express points: 648
Estimated price in foreign currency:
76.52€ • 89.37$ • 66.48£
Book available
Economy delivery: 31 January-14 February
Orders by phone: 021 569.72.76
Specifications
ISBN-13: 9783031705656
ISBN-10: 3031705653
Pages: 326
Illustrations: XX, 312 p.
Dimensions: 155 x 235 mm
Weight: 0.5 kg
Edition: 2024
Publisher: Springer Nature Switzerland
Collection: Springer
Series: Lecture Notes in Computer Science, Lecture Notes in Artificial Intelligence
Place of publication: Cham, Switzerland
Contents
.- Speech.
.- Retrieval Augmented Spoken Language Generation for Transport Domain.
.- Adapting Audiovisual Speech Synthesis to Estonian.
.- Dysphonia Diagnosis Using Self-Supervised Speech Models in Mono- and Cross-Lingual Settings.
.- Sentences vs Phrases in Neural Speech Synthesis.
.- Zero-Shot vs. Few-Shot Multi-Speaker TTS Using Pre-trained Czech SpeechT5 Model.
.- Deep Speaker Embeddings for Speaker Verification of Children.
.- Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding.
.- Attention to Phonetics: A Visually Informed Explanation of Speech Transformers.
.- Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis.
.- Stream-Based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning.
.- Data Alignment and Duration Modelling in VITS.
.- Multiword Expressions Resources for Italian: Presenting a Manually Annotated Spoken Corpus.
.- Generating High-Quality F0 Embeddings Using the Vector-Quantized Variational Autoencoder.
.- Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation.
.- X-vector-based Speaker Diarization Using Bi-LSTM and Interim Voting-driven Post-processing.
.- A Paradigm for Interpreting Metrics and Measuring Error Severity in Automatic Speech Recognition.
.- Enhancing Speech Emotion Recognition Using Transfer Learning From Speaker Embeddings.
.- Dialogue.
.- Investigating Low-Cost LLM Annotation for Spoken Dialogue Understanding Datasets.
.- PiCo-VITS: Leveraging Pitch Contours for Fine-grained Emotional Speech Synthesis.
.- Improving and Understanding Clarifying Question Generation in Conversational Search.
.- Explainable Multimodal Fusion for Dementia Detection From Text and Speech.
.- Robust Classification of Parkinson’s Speech: an Approximation to a Scenario With Non-controlled Acoustic Conditions.
.- Leveraging Conceptual Similarities to Enhance Modeling of Factors Affecting Adolescents’ Well-Being.
.- Joint-Average Mean and Variance Feature Matching (JAMVFM) Semi-supervised GAN with Additional-Objective Training Function for Intent Detection.
.- Capturing Task-Related Information for Text-Based Grasp Classification Using Fine-Tuned Embeddings.
.- StepDP: A Step Towards Expressive and Pervasive Dialogue Platforms.
.- Automatic Classification of Parkinson’s Disease Using Wav2vec Embeddings at Phoneme, Syllable, and Word Levels.