Please login to be able to save your searches and receive alerts for new content matching your search criteria.
Interactions based on automatic speech recognition (ASR) have become widely used, with speech input being increasingly utilized to create documents. However, as there is no easy way to distinguish between commands being issued and text required to be ...
This paper describes a novel framework of voice conversion to improve the conversion performance against the amount of training data. In voice conversion, deep neural networks are used as conversion models that map source to target features. In this ...
Audio source separation is often used for the preprocessing of various tasks, and one of its ultimate goals is to construct a single versatile preprocessor that can handle every variety of audio signal. One of the most important varieties of the discrete-...
In this paper, we propose a novel approach to capture inter-company relationships from banking transaction data using graph neural networks with a special attention mechanism and textual industry or sector information. Transaction data owned by ...
High-speed finger tracking is necessary for augmented reality and operation in human-machine cooperation without latency discomfort, but conventional markerless finger tracking methods are not fast enough and the marker-based methods have low ...
In this study, we focused on the fact that the color information of an image has a significant effect on the emotion recalled, and we created a dataset with discrete and continuous emotion labels for color and grayscale image pairs, which are not ...
Food image recognition tasks are generally addressed by using a closed dataset. In a real-world setting, however, the dataset is updated as the new class of food appears, and it is impossible to train a model that distinguishes all kinds of food in ...
A coronavirus pandemic is forcing people to be "at home" all over the world. In a life of hardly ever going out, we would have realized how the food we eat affects our bodies. What can we do to know our food more and control it better? To give us a clue,...
Many municipalities and local road authorities seek to implement automated evaluation of road damage. However, they often lack technology, know-how, and funds to afford state-of-the-art data collection equipment for collection and analysis of road ...
Rapidly developing location acquisition technologies provide a powerful tool for understanding and predicting human mobility in cities, which is very significant for urban planning, traffic regulation, and emergency management. However, with the ...
Nowadays, GPS devices have increased explosively and produced huge amounts of trajectory data related to people's outgoing. Through those big location data, many researches aim to analyze human mobility for urban development, such as human movement ...
We propose a face detection method for semi-automatic annotation of faces on pre-modern Japanese artworks to assist art historians identify objects in the art collection. Our method is based on R-CNN, such as Faster R-CNN and Cascade R-CNN, for object ...
Generative models based on deep neural networks often have a high-dimensional latent space, ranging sometimes to a few hundred dimensions or even higher, which typically makes them hard for a user to explore directly. We propose differential subspace ...
Anytime algorithms for optimization problems are of particular interest since they allow to trade off execution time with result quality. However, the selection of the best anytime algorithm for a given problem instance has been focused on a particular ...
YouTubers have recently become highly popular. Generating eye-catching thumbnails is an important factor in attracting viewers. In this study, we propose an automatic YouTube-video-thumbnail generation method that ensures the following: rich facial ...
People change their hairstyles to make their appearance attractive, however it is difficult to determine which hairstyles are attractive. In this study, we aim to recommend a hairstyle that improves the attractiveness for an input face using ...
It is important for beginners to imitate poses of experts in various sports; especially in sport climbing, performance depends greatly on the pose that should be taken for given holds. However, it is difficult for beginners to learn the proper poses for ...
PoseAsQuery is an interactive browsing system used to repeatedly replay any specific segment of a video. To acquire and improve the body control skills that are essential for physical performance, it is necessary to continuously observe an individual's ...
In recent years, with the rapid development of machine learning and artificial intelligence, the problem of target recognition and classification has made a breakthrough. Single mode data cannot summarize the feature information of the target well, ...
Online dating services have become popular in modern society. Pair matching prediction between two users in these services can help efficiently increase the possibility of finding their life partners. Deep learning based methods with automatic feature ...