Akinori Ito
Publications
- 2025
- [j51] Changlong Wang, Akinori Ito, Takashi Nose: Adaptive Depth-Wise Pruning for Efficient Environmental Sound Classification. IEEE Access 13: 69751-69759 (2025)
- [j50] Changlong Wang, Akinori Ito, Takashi Nose: Adaptive Fine-Grained Pruning via Binary Search for Efficient Environmental Sound Classification. IEEE Access 13: 173201-173208 (2025)
- 2024
- [j49] Xuecheng Niu, Akinori Ito, Takashi Nose: Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. IEEE Access 12: 46940-46952 (2024)
- [j48] Xuecheng Niu, Akinori Ito, Takashi Nose: A Replaceable Curiosity-Driven Candidate Agent Exploration Approach for Task-Oriented Dialog Policy Learning. IEEE Access 12: 142640-142650 (2024)
- [j47] Rui Zhou, Takaki Koshikawa, Akinori Ito, Takashi Nose, Chia-Ping Chen: Multilingual Meta-Transfer Learning for Low-Resource Speech Recognition. IEEE Access 12: 158493-158504 (2024)
- [c148] Rui Zhou, Akinori Ito, Takashi Nose: Improving Speaker Consistency in Speech-to-Speech Translation Using Speaker Retention Unit-to-Mel Techniques. APSIPA 2024: 1-6
- [c147] Zikai Shu, Takashi Nose, Akinori Ito: Toward Photo-Realistic Facial Animation Generation Based on Keypoint Features. ICMLC 2024: 334-339
- [c146] Rui Zhou, Akinori Ito, Takashi Nose: Character Expressions in Meta-Learning for Extremely Low Resource Language Speech Recognition. ICMLC 2024: 525-529
- [c145] Changlong Wang, Akinori Ito, Takashi Nose, Chia-Ping Chen: Evaluation of Environmental Sound Classification using Vision Transformer. ICMLC 2024: 665-669
- [c144] Tomoki Fujihara, Akinori Ito, Takashi Nose: Estimation of Offensiveness of Posts on Social Media and Its Application to a Conversation Assistance System. NLPIR 2024: 369-373
- [i3] Xuecheng Niu, Akinori Ito, Takashi Nose: Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. CoRR abs/2402.00085 (2024)
- [i1] Rui Zhou, Akinori Ito, Takashi Nose: Preserving Speaker Information in Direct Speech-to-Speech Translation with Non-Autoregressive Generation and Pretraining. CoRR abs/2412.07316 (2024)
- 2023
- [c143] Simon Jolibois, Akinori Ito, Takashi Nose: Multimodal Expressive Embodied Conversational Agent Design. HCI (43) 2023: 244-249
- 2021
- [c139] Daisuke Horii, Akinori Ito, Takashi Nose: Analysis of Feature Extraction by Convolutional Neural Network for Speech Emotion Recognition. GCCE 2021: 425-426
- [c138] Yoshihiro Yamazaki, Yuya Chiba, Takashi Nose, Akinori Ito: Neural Spoken-Response Generation Using Prosodic and Linguistic Context for Conversational Systems. Interspeech 2021: 246-250
- [c137] Satsuki Naijo, Akinori Ito, Takashi Nose: Improvement of Automatic English Pronunciation Assessment with Small Number of Utterances Using Sentence Speakability. Interspeech 2021: 4473-4477
- [c136] Ryota Yahagi, Yuya Chiba, Takashi Nose, Akinori Ito: Multimodal Dialogue Response Timing Estimation Using Dialogue Context Encoder. IWSDS 2021: 133-141
- 2020
- [j44] Kosuke Nakamura, Takashi Nose, Yuya Chiba, Akinori Ito: A Symbol-level Melody Completion Based on a Convolutional Neural Network with Generative Adversarial Learning. J. Inf. Process. 28: 248-257 (2020)
- [j43] Jiang Fu, Yuya Chiba, Takashi Nose, Akinori Ito: Automatic assessment of English proficiency for Japanese learners without reference sentences based on deep neural network acoustic models. Speech Commun. 116: 86-97 (2020)
- [c134] Rikiya Takahashi, Takashi Nose, Yuya Chiba, Akinori Ito: Successive Japanese Lyrics Generation Based on Encoder-Decoder Model. GCCE 2020: 126-127
- [c133] Ryota Yahagi, Yuya Chiba, Takashi Nose, Akinori Ito: Incremental Response Generation Using Prefix-to-Prefix Model for Dialogue System. GCCE 2020: 349-350
- [c132] Satoru Mizuochi, Yuya Chiba, Takashi Nose, Akinori Ito: Spoken Term Detection Based on Acoustic Models Trained in Multiple Languages for Zero-Resource Language. GCCE 2020: 351-352
- [c131] Satsuki Naijo, Yuya Chiba, Takashi Nose, Akinori Ito: Analysis and Estimation of Sentence Speakability for English Pronunciation Evaluation. GCCE 2020: 353-355
- [c130] Aoi Kanagaki, Masaya Tanaka, Takashi Nose, Ryohei Shimizu, Akira Ito, Akinori Ito: CycleGAN-Based High-Quality Non-Parallel Voice Conversion with Spectrogram and WaveRNN. GCCE 2020: 356-357
- [c129] Daisuke Fujimaki, Takashi Nose, Akinori Ito: Integration of Accent Sandhi and Prosodic Features Estimation for Japanese Text-to-Speech Synthesis. GCCE 2020: 358-359
- [c128] Yoshihiro Yamazaki, Yuya Chiba, Takashi Nose, Akinori Ito: Filler Prediction Based on Bidirectional LSTM for Generation of Natural Response of Spoken Dialog. GCCE 2020: 360-361
- [c127] Takuma Hayasaka, Takashi Nose, Akinori Ito: A Study on Minimum Spectral Error Analysis of Speech. GCCE 2020: 362-363
- [c126] Takuto Fujimura, Takashi Nose, Akinori Ito: LJSing: Large-Scale Singing Voice Corpus of Single Japanese Singer. GCCE 2020: 364-365
- [c125] Shuhei Imai, Takashi Nose, Aoi Kanagaki, Satoshi Watanabe, Akinori Ito: Improving Pronunciation Clarity of Dysarthric Speech Using CycleGAN with Multiple Speakers. GCCE 2020: 366-367
- [c124] Yuya Chiba, Takashi Nose, Akinori Ito: Multi-Stream Attention-Based BLSTM with Feature Segmentation for Speech Emotion Recognition. INTERSPEECH 2020: 3301-3305
- [c123] Yoshihiro Yamazaki, Yuya Chiba, Takashi Nose, Akinori Ito: Construction and Analysis of a Multimodal Chat-talk Corpus for Dialog Systems Considering Interpersonal Closeness. LREC 2020: 443-448
- 2019
- [j39] Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito: Improving human scoring of prosody using parametric speech synthesis. Speech Commun. 111: 14-21 (2019)
- 2018
- [c121] Shunsuke Tada, Yuya Chiba, Takashi Nose, Akinori Ito: Effect of Mutual Self-Disclosure in Spoken Dialog System on User Impression. APSIPA 2018: 806-810
- [c117] Jiang Fu, Yuya Chiba, Takashi Nose, Akinori Ito: Evaluation of English Speech Recognition for Japanese Learners Using DNN-Based Acoustic Models. IIH-MSP (2) 2018: 93-100
- [c116] Mai Yamanaka, Yuya Chiba, Takashi Nose, Akinori Ito: A Study on a Spoken Dialogue System with Cooperative Emotional Speech Synthesis Using Acoustic and Linguistic Information. IIH-MSP (2) 2018: 101-108
- [c115] Takashi Kimura, Takashi Nose, Shinji Hirooka, Yuya Chiba, Akinori Ito: Comparison of Speech Recognition Performance Between Kaldi and Google Cloud Speech API. IIH-MSP (2) 2018: 109-115
- [c114] Kosuke Nakamura, Takashi Nose, Yuya Chiba, Akinori Ito: Melody Completion Based on Convolutional Neural Networks and Generative Adversarial Learning. IIH-MSP (2) 2018: 116-123
- [c113] Shinya Hanabusa, Takashi Nose, Akinori Ito: Segmental Pitch Control Using Speech Input Based on Differential Contexts and Features for Customizable Neural Speech Synthesis. IIH-MSP (2) 2018: 124-131
- [c112] Sou Miyamoto, Takashi Nose, Kazuyuki Hiroshiba, Yuri Odagiri, Akinori Ito: Two-Stage Sequence-to-Sequence Neural Voice Conversion with Low-to-High Definition Spectrogram Mapping. IIH-MSP (2) 2018: 132-139
- [c111] Hiroto Aoyama, Takashi Nose, Yuya Chiba, Akinori Ito: Improvement of Accent Sandhi Rules Based on Japanese Accent Dictionaries. IIH-MSP (2) 2018: 140-148
- [c110] Takahiro Furuya, Yuya Chiba, Takashi Nose, Akinori Ito: Data Collection and Analysis for Automatically Generating Record of Human Behaviors by Environmental Sound Recognition. IIH-MSP (2) 2018: 149-156
- [c109] Toru Ishikawa, Takashi Nose, Akinori Ito: DNN-Based Talking Movie Generation with Face Direction Consideration. IIH-MSP (2) 2018: 157-164
- [c108] Haoran Wu, Yuya Chiba, Takashi Nose, Akinori Ito: Analyzing Effect of Physical Expression on English Proficiency for Multimodal Computer-Assisted Language Learning. INTERSPEECH 2018: 1746-1750
- [c107] Yukiko Kageyama, Yuya Chiba, Takashi Nose, Akinori Ito: Improving User Impression in Spoken Dialog System with Gradual Speech Form Control. SIGDIAL Conference 2018: 235-240
- [c106] Yuya Chiba, Takashi Nose, Taketo Kase, Mai Yamanaka, Akinori Ito: An Analysis of the Effect of Emotional Speech Synthesis on Non-Task-Oriented Dialogue System. SIGDIAL Conference 2018: 371-375
- 2017
- [j32] Yuya Chiba, Takashi Nose, Akinori Ito: Cluster-based approach to discriminate the user's state whether a user is embarrassed or thinking to an answer to a prompt. J. Multimodal User Interfaces 11(2): 185-196 (2017)
- [c105] Yuya Chiba, Takashi Nose, Akinori Ito: Analysis of efficient multimodal features for estimating user's willingness to talk: Comparison of human-machine and human-human dialog. APSIPA 2017: 428-431
- [c104] Yukiko Kageyama, Yuya Chiba, Takashi Nose, Akinori Ito: Collection of Example Sentences for Non-task-Oriented Dialog Using a Spoken Dialog System and Comparison with Hand-Crafted DB. HCI (29) 2017: 458-464
- [c103] Hayato Mori, Yuya Chiba, Takashi Nose, Akinori Ito: Dialog-Based Interactive Movie Recommendation: Comparison of Dialog Strategies. IIH-MSP (2) 2017: 77-83
- [c102] Shunsuke Tada, Yuya Chiba, Takashi Nose, Akinori Ito: Response Selection of Interview-Based Dialog System Using User Focus and Semantic Orientation. IIH-MSP (2) 2017: 84-90
- [c101] Yusuke Yamada, Takashi Nose, Yuya Chiba, Akinori Ito, Takahiro Shinozaki: Development and Evaluation of Julius-Compatible Interface for Kaldi ASR. IIH-MSP (2) 2017: 91-96
- [c100] Sou Miyamoto, Takashi Nose, Suzunosuke Ito, Harunori Koike, Yuya Chiba, Akinori Ito, Takahiro Shinozaki: Voice Conversion from Arbitrary Speakers Based on Deep Neural Networks with Adversarial Learning. IIH-MSP (2) 2017: 97-103
- [c99] Kosuke Nakamura, Yuya Chiba, Takashi Nose, Akinori Ito: Evaluation of Nonlinear Tempo Modification Methods Based on Sinusoidal Modeling. IIH-MSP (2) 2017: 104-111
- [c98] Kazuki Sato, Takashi Nose, Akira Ito, Yuya Chiba, Akinori Ito, Takahiro Shinozaki: A Study on 2D Photo-Realistic Facial Animation Generation Using 3D Facial Feature Points and Deep Neural Networks. IIH-MSP (2) 2017: 112-118
- [c97] Isao Miyagawa, Yuya Chiba, Takashi Nose, Akinori Ito: Detection of Singing Mistakes from Singing Voice. IIH-MSP (2) 2017: 130-136
- 2015
- [c91] Taketo Kase, Takashi Nose, Akinori Ito: On Appropriateness and Estimation of the Emotion of Synthesized Response Speech in a Spoken Dialogue System. HCI (27) 2015: 747-752
- [c89] Tsukasa Nishino, Takashi Nose, Akinori Ito: Tempo Modification of Mixed Music Signal by Nonlinear Time Scaling and Sinusoidal Modeling. IIH-MSP 2015: 146-149
- [c88] Yuki Saito, Takashi Nose, Takahiro Shinozaki, Akinori Ito: Conversion of Speaker's Face Image Using PCA and Animation Unit for Video Chatting. IIH-MSP 2015: 433-436
- [c85] Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito: Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts. INTERSPEECH 2015: 3491-3495
- 2014
- [c81] Kohei Machida, Takashi Nose, Akinori Ito: Speech recognition in a home environment using parallel decoding with GMM-based noise modeling. APSIPA 2014: 1-4
- [c80] Naoto Suzuki, Takashi Nose, Yutaka Hiroi, Akinori Ito: Controlling Switching Pause Using an AR Agent for Interactive CALL System. HCI (27) 2014: 588-593
- [c79] Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito, Kazuyuki Sato: A study on the effect of speech rate on perception of spoken easy Japanese using speech synthesis. ICAILP 2014: 476-479
- [c78] Masahito Okamoto, Takashi Nose, Akinori Ito, Takeshi Nagano: Subjective evaluation of packet loss recovery techniques for voice over IP. ICAILP 2014: 711-714
- [c77] Noriko Totsuka, Yuya Chiba, Takashi Nose, Akinori Ito: Robot: Have I done something wrong? - Analysis of prosodic features of speech commands under the robot's unintended behavior. ICAILP 2014: 887-890
- [c76] Kazumichi Yoshida, Takashi Nose, Akinori Ito: Analysis of English Pronunciation of Singing Voices Sung by Japanese Speakers. IIH-MSP 2014: 554-557
- [c74] Takashi Nose, Akinori Ito: Analysis of spectral enhancement using global variance in HMM-based speech synthesis. INTERSPEECH 2014: 2917-2921
- [c73] Yuya Chiba, Masashi Ito, Takashi Nose, Akinori Ito: User Modeling by Using Bag-of-Behaviors for Building a Dialog System Sensitive to the Interlocutor's Internal State. SIGDIAL Conference 2014: 74-78
last updated on 2025-11-16 00:37 CET by the dblp team
all metadata released as open data under CC0 1.0 license