Social robots are becoming more and more common in our everyday lives. In the field of conversational robotics, the development goes towards socially engaging robots with humanlike conversation. This project looked into one of the technical aspects when recognizing speech, videlicet speech activity detection (SAD). The presented solution uses a convolutional neural network (CNN) based system to detect speech in a forward azimuth area. The project used a dataset from FestVox, called CMU Artic and was complimented by adding recorded noises. A library called Pyroomacoustics were used to simulate a real world setup to create a robust system. A simplified version was built, this model only detected speech activity and a accuracy of 95%was reache...
The work presented in this study is based on the long-term goal of developing a social robot that ca...
There are many aspects of human communication that affects the nature of an interaction; examples in...
The paper investigates the problem of voice activity detection from a noisy sound signal. An extreme...
A conversational robot will in many cases have todeal with multi-party spoken interaction in which ...
In conversations between humans, not only the content of the utterances but also our social signals ...
In multiparty multimodal dialogue setup, where the robot is set to interact with multiple people, a ...
This study examines two different approaches to dialogue management system in order to achieve dynam...
The aim of this work is to design and create a robust speech activity detector that is able to detec...
Abstract—Speech has become an important part in Human Robot Interaction (HRI), e.g. for person detec...
Unintentional encounters between robots and humans will increase in the future and require concepts...
Recently, Deep Learning has revolutionized many fields, where one such area is Voice Activity Detect...
For a robot to succeed at speech recognition, it is advantageous to have a strong and clear signal t...
The interest in social robots has grown dramatically in the last decade. Several studies have invest...
Abstract. This paper presents a multi-modal system for finding out where to direct the attention of ...
This thesis is partly a theoretical introduction to some basic concepts of signal processing such as...
The work presented in this study is based on the long-term goal of developing a social robot that ca...
There are many aspects of human communication that affects the nature of an interaction; examples in...
The paper investigates the problem of voice activity detection from a noisy sound signal. An extreme...
A conversational robot will in many cases have todeal with multi-party spoken interaction in which ...
In conversations between humans, not only the content of the utterances but also our social signals ...
In multiparty multimodal dialogue setup, where the robot is set to interact with multiple people, a ...
This study examines two different approaches to dialogue management system in order to achieve dynam...
The aim of this work is to design and create a robust speech activity detector that is able to detec...
Abstract—Speech has become an important part in Human Robot Interaction (HRI), e.g. for person detec...
Unintentional encounters between robots and humans will increase in the future and require concepts...
Recently, Deep Learning has revolutionized many fields, where one such area is Voice Activity Detect...
For a robot to succeed at speech recognition, it is advantageous to have a strong and clear signal t...
The interest in social robots has grown dramatically in the last decade. Several studies have invest...
Abstract. This paper presents a multi-modal system for finding out where to direct the attention of ...
This thesis is partly a theoretical introduction to some basic concepts of signal processing such as...
The work presented in this study is based on the long-term goal of developing a social robot that ca...
There are many aspects of human communication that affects the nature of an interaction; examples in...
The paper investigates the problem of voice activity detection from a noisy sound signal. An extreme...