A classification approach combining machine learning and representational similarity analysis was performed on neurophysiological data (i.e., electrophysiological responses) to distinguish speakers of different stances. The trial-based classification ...
Given that metaphoric gestures may play a non-negligible role in multimodal communication, this paper probes into how metaphoric gestures co-contribute to the construction of rhetorical behavior in the discourse type of public ...
In recent years, research on multimodal interaction has made rapid progress owing to the development of artificial intelligence and big data technology, as well as new findings in the study of human psychology and behavior. Nevertheless, the human- ...
We propose a neural-network-based method to detect language anomalies using electroencephalogram (EEG) signals. To the best of our knowledge, there have been few studies on classifying single-trial EEG signals related to language processing such ...
Audio-visual understanding is usually challenged by the gap in bridging complementary audio and visual information. Motivated by recent audio-visual studies, a closed-set word-level speech recognition scheme is proposed for the Mandarin Audio-...
Spoken language understanding (SLU) converts user utterances into structured semantic forms. There are still two main issues for SLU: robustness to ASR errors and the data sparsity of new and extended domains. In this paper, we propose a robust SLU ...
Spoken language understanding (SLU) is an important part of a spoken dialogue system (SDS). In this paper, we focus on how to extract a set of act-slot-value tuples from users’ utterances in the 1st Chinese Audio-Textual Spoken Language Understanding ...
Spoken language understanding (SLU) is a key component of conversational dialogue systems, converting user utterances into semantic representations. Previous works focus almost exclusively on parsing semantics from textual inputs (top hypothesis of speech ...
In this paper, we present a series of methods to improve the performance of spoken language understanding in the 1st Chinese Audio-Textual Spoken Language Understanding Challenge (CATSLU 2019), which aims to improve robustness to automatic ...
Mental health disorders are among the leading causes of disability. Despite the prevalence of mental health disorders, there is a large gap between the needs and resources available for their assessment and treatment. Automatic behaviour analysis for ...
The ever-growing research in computer vision has created new avenues for user interaction. Speech commands and gesture recognition are already being applied alongside various touch-based inputs. It is therefore foreseeable that the use of multimodal input ...
High-accuracy physiological emotion recognition typically requires participants to wear or attach obtrusive sensors (e.g., Electroencephalograph). To achieve precise emotion recognition using only wearable body-worn physiological sensors, my doctoral ...
Emotion recognition in the wild has been a hot research topic in the field of affective computing. Though some progress has been achieved, emotion recognition in the wild remains an unsolved problem due to the challenges of head movement, ...
Group cohesiveness is a compelling and often-studied construct in group dynamics and group performance. The enormous number of web images of groups of people can be used to develop an effective method to detect group cohesiveness. This paper ...
In this paper, we propose a hybrid deep learning network for predicting group cohesion in images. This is a regression problem whose objective is to predict the Group Cohesion Score (GCS), which lies in the range [0, 3]. In order to solve this ...
With the rapid progress in computing and sensory technologies, we will enter the era of human-robot coexistence in the not-too-distant future, and it is time to address the challenges of multimodal interaction. Should a robot take a humanoid form? ...
Recent years have initiated a paradigm shift from pure task-based human-machine interfaces towards socially-aware interaction. Advances in deep learning have led to anthropomorphic interfaces with robust sensing capabilities that come close to or even ...
Nowadays, more and more papers are submitted to various periodicals and conferences. Typically, reviewers need to read through a paper and give it a review comment and score based on certain criteria. This review process is labor ...
Internet of Things technologies yield large amounts of real-life speech data related to human emotions. Yet, labelled data of human emotion from spontaneous speech are extremely limited due to the difficulties in the annotation of such large volumes of ...
Using neural networks to classify infant vocalisations into important subclasses (such as crying versus speech) is an emergent task in speech technology. One of the biggest roadblocks standing in the way of progress lies in the datasets: The performance ...