14:00–15:00 |
OpeningChairs: Walter Kellermann, Reinhold Häb-Umbach Greetings from the Vice President for Research Greetings from the Vice Dean of the Technical Faculty Back to the Future of Digital Speech Communication |
15:00-15:15 |
Coffee Break |
15:15–16:30 |
Robust Speech RecognitionChair: Hans-Günter Hirsch Overview Talk Oral Presentations Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR Semi-Automatic Calibration for Dereverberation by Spectral Subtraction for Continous Speech Recognition Multimodal ASR by Turbo Decoding vs. Feature Concatenation: Where to Perform Information Integration? |
16:30–18:45 |
Hands-Free Speech Communication
Chairs: Gerald Enzner, Heinrich Löllmann Overview Talk Poster Presentations (17:15 - 18:45) P1: Effects of Resampling in Acoustic Echo Cancellation With Static Nonlinear Loudspeaker Distortion P2: Combined Nonlinear Echo Cancellation and Residual Echo Suppression P3: Efficient Multi-Channel Acoustic Echo Cancellation Using Constrained Sparse Filter Updates in the Subband Domain P4: Selflearning Codebook Speech Enhancement P5: An Open Source Corpus and Recording Software for Distant Speech Recognition With the Microsoft Kinect P6: Dual Microphone Wind Noise Reduction by Exploiting the Complex Coherence P7: A Differential Microphone Array With Input Level Alignment, Directional Equalization and Fast Notch Adaptation for Handsfree Communication |
17:15–18:45 |
Robust Speech RecognitionChair: Hans-Günter Hirsch Poster Presentations P8: Recognition of Noisy Speech by Starting the Likelihood Calculation at Voiced Segments P9: Robust Multimodal Human Machine Interaction Using the Kinect Sensor P10: Towards a Localised German Automatic Speech Recognition P11: Scoring and Re-Ranking of ASR Hypotheses Using Phoneme Error Models |
17:15–18:45 |
Demo Session: Speech Processing in the Real WorldChairs: Heinrich Löllmann, Andreas Schwarz Courtyard HD Voice Meets Car: A Hands-Free System With Bandwidth Extension and Wideband Echo Cancellation Audio lab Wave-domain Acoustic Echo Cancellation Demo room 2 Real-Time Listening Enhancement for Mobile Phones |
19:00–20:00 |
Meeting of the ITG-Fachausschüsse 4.3 and 4.4
|
8:30–9:30 |
IEEE SPS Distinguished Lecturer Talk - German Chapter Signal ProcessingPhase: Unexplored Wilderness in Signal EnhancementAkihiko K. Sugiyama Moderation: Walter Kellermann |
9:30–9:45 |
Coffee Break |
9:45–11:00 |
Spoken Language Understanding and Dialog SystemsChair: Dietrich Klakow Oral Presentations A Set of Quantitative User Experience Metrics for Multi-Modal Dialog Systems Modeling Graphical and Speech User Interfaces With Widgets and Spidgets The Impact of Word Alignment Accuracy on Audio-Visual Word Prominence Detection A New Evaluation Methodology for Speech Emotion Recognition With Confidence Output |
11:00–11:15 |
Coffee Break |
11:15–12:30 |
Speech Coding and EnhancementChair: Henning Puder Oral Presentations Impact of Coding Noise on the Convergence of Blind Source Separation Audio Coding for Beamforming With Distributed Microphones Declipping of Speech Signals Using Frequency Selective Extrapolation Scalar Quantization With Optimized Receiver-Sided Adaptive Codebook Reconstruction Levels Controlled by a Predictor On Reverse Waterfilling in Closed-Loop LPC With Noise Shaping |
12:30–13:30 |
Lunch Break |
13:30–16:30 |
Automotive Speech and Audio ProcessingChairs: Tim Fingscheidt, Gerhard Schmidt Overview Talk Poster Presentations (13:45-16:30) P1: Towards Acoustic Event Detection for Surveillance in Cars P2: Improved Performance Measures for Voice Activity Detection P3: Detection of Local Disturbances and Simultaneously Active Speakers for Distributed Speaker Dedicated Microphones in Cars P4: Application of Frequency Shifting in In-Car Communication Systems P5: SNR Estimation and Enhancement of Voiced Speech Based on Periodicity Analysis P6: Improvement in Listener Comfort Through Noise Shaping Using a Modified Wiener Filter Approach P7: Reduction of Comb-Filter Effects by Alternating Measurement Orientations in Automotive Environments |
13:45–16:30 |
Speech Coding and EnhancementChair: Henning Puder Poster Presentations P8: Linear Predictive Coding With Backward Adaptation and Noise Shaping P9: A Multi-Stage, Multi-Channel Processing System for Overlapping Speech Separation in a Real Scenario |
13:45–16:30 |
Demo Session: Speech Processing in the Real WorldChairs: Heinrich Löllmann, Andreas Schwarz Audio lab Advanced Binaural Beamforming System for Hearing Aids Demo room 1 Instrumental Quality Prediction for Text-To-Speech Systems Demo room 2 Novel Features of the Spoken Dialog System Halef A Multi-Channel Soundcard as an Acoustic Sensor Node Online Word Prominence Detection Demo room 3 Active Listening Assistant (AcListant) Speech Recognition Client for House Automation |
17:30–23:30 |
Social Event in the DB museum Nuremberg |
8:30–9:30 |
Automatic Speech Recognition Using Neural NetworksRalf Schlüter Moderation: Reinhold Häb-Umbach |
9:30–9:45 |
Coffee Break |
9:45–11:00 |
Selected Topics in Speech ProcessingChair: Peter Vary Oral Presentations Challenges in Acoustic Signal Enhancement for Human-Robot Communication System Identification With Perfect Sequence Excitation – Efficient NLMS vs. Inverse Cyclic Convolution I-Vector Speaker Verification for Speech Degraded by Narrowband and Wideband Channels On Bayesian Networks in Speech Signal Processing |
11:00–11:15 |
Coffee Break |
11:15–12:30 |
Speech and Audio Perception-Based Models for Quality EvaluationChairs: Sebastian Möller, Hans-Wilhelm Gierlich, Ulrich Heute Oral Presentations Advances in Perceptual Modeling of Speech Quality in Telecommunications Instrumental Evaluation of In-Car Communication Systems Speech Quality of VoIP: Bursty Packet Loss Revisited Orthogonal Audio Analyses for Disturbed Radio Broadcast New ITG Guideline for the Usability Evaluation of Smart Home Environments |
12:30–13:30 |
Lunch Break |
13:30–15:45 |
Acoustic Sensor NetworksChairs: Reinhold Häb-Umbach, Simon Doclo Overview Talk Oral Presentations A Subspace-Based Perspective on Spatial Filtering Performance With Distributed and Co-Located Microphone Arrays Generalized Multichannel Wiener Filter for Spatially Distributed Microphones Linear Combining of Audio Features for Signal Classification in Ad-Hoc Microphone Arrays Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking Poster Presentations (14:45 – 15:45) P1: Detection of Audio Events With Repetitive Structure Using Generalized Autocorrelations P2: Time-Frequency Dependent Multichannel Voice Activity Detection P3: Online Observation Error Model Estimation for Acoustic Sensor Network Synchronization |
14:45–15:45 |
Demo Session: Speech Processing in the Real WorldChairs: Heinrich Löllmann, Andreas Schwarz Audio lab Personalized Sound Rendering Demo room 1 Upcoming “Enhanced Voice Service” Speech Coding Standard From 3GPP Demo room 2 POLQA as an App: Embedded Perceptual Voice Quality Testing on Smartphones |
15:45–16:00 |
Closing Session
Chairs: Walter Kellermann, Reinhold Häb-Umbach |
You find the program overview here and the conference proceedings can be found here.