2024
Junwon Lee, Modan Tailleur, Mathieu Lagrange, Keunwoo Choi, Laurie M. Heller, Brian McFee, Keisuke Imoto, and Yuki Okamoto
Challenges in Text-to-Audio Synthesis: From Foley to Sound Scenes
Proc. NeurIPS 2024 Workshop on AI-Driven Speech, Music, and Sound Generation (Audio Imagination), pp. xxx-xxx, 2024. (Accepted)
Naoki Koga, Yoshiaki Bando, and Keisuke Imoto
LEAD Dataset: How Can Labels for Sound Event Detection Vary Depending on Annotators?
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. xxx-xxx, 2024. (Accepted)
Tomoya Nishida, Noboru Harada, Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, Keisuke Imoto, Kota Dohi, Harsh Purohit, Takashi Endo, and Yohei Kawaguchi
Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 111-115, 2024.
Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Masahiro Yasuda, Shunsuke Tsubaki, and Keisuke Imoto
M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
Proc. INTERSPEECH, pp. 57-61, 2024.
Shunsuke Tsubaki, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, and Keisuke Imoto
Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Proc. European Signal Processing Conference (EUSIPCO), pp. 71-75, 2024.
Modan Tailleur, Junwon Lee, Mathieu Lagrange, Keunwoo Choi, Laurie Heller, Keisuke Imoto, and Yuki Okamoto
Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Proc. European Signal Processing Conference (EUSIPCO), pp. 56-60, 2024.
Takuya Fujimura, Keisuke Imoto, and Tomoki Toda
Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Proc. European Signal Processing Conference (EUSIPCO), pp. 156-160, 2024.
Takezo Ohta, Yoshiaki Bando, Keisuke Imoto, and Masaki Onishi
A Sequential Audio Spectrogram Transformer for Real-Time Sound Event Detection
Proc. European Signal Processing Conference (EUSIPCO), pp. 101-105, 2024.
Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryotaro Nagase, Takahiro Fukumori, and Yoichi Yamashita
Environmental Sound Synthesis From Vocal Imitations and Sound Event Labels
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 411-415, 2024.
Kevin Wilkinghoff and Keisuke Imoto
F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-supervised Anomaly Detection
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 256-260, 2024.
2023
Yoto Fujita, Yoshiaki Bando, Keisuke Imoto, Masaki Onishi, and Kazuyoshi Yoshii
DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 2037-2043, 2023.
Ami Igarashi, Shunsuke Tsubaki, Daisuke Niizumi, Daiki Takeuchi, Noboru Harada, and Keisuke Imoto
Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Approach
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 2050-2056, 2023.
Shunsuke Tsubaki, Yohei Kawaguchi, Keisuke Imoto, Tomoya Nishida, Kota Dohi, Takashi Endo, and Yuki Okamoto
Audio-Change Captioning to Explain Machine-Sound Anomalies
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 201-205, 2023.
Noriyuki Tonami, Sakiko Mishima, Reishi Kondo, Keisuke Imoto, and Tomoyuki Hino
Event Classification With Class-Level Gated Unit Using Large-Scale Pretrained Model for Optical Fiber Sensing
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 196-200, 2023.
Keunwoo Choi, Jaekwon Im, Laurie Heller, Mathieu Lagrange, Keisuke Imoto, Yuki Okamoto, Shinnosuke Takamichi, and Brian McFee
Foley Sound Synthesis at the DCASE 2023 Challenge
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 16-20, 2023.
Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Takashi Endo, Masaaki Yamamoto, and Yohei Kawaguchi
Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 31-35, 2023.
Yuki Okamoto, Kanta Shimonishi, Keisuke Imoto, Kota Dohi, Shota Horiguchi, and Yohei Kawaguchi
CAPTDURE: Captioned Sound Dataset of Single Sources
Proc. INTERSPEECH, pp. 1683-1687, 2023.
Hien Ohnaka, Shinnosuke Takamichi, Keisuke Imoto, Yuki Okamoto, Kazuki Fujii, and Hiroshi Saruwatari
Visual Onoma-to-Wave: Environmental Sound Synthesis From Visual Onomatopoeias and Sound-Source Images
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1-5, 2023.
2022
Ami Igarashi, Keisuke Imoto, Yuka Komatsu, Shunsuke Tsubaki, Shuto Hario, and Tatsuya Komatsu
How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 7-11, 2022.
Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Takahiro Fukumori, and Yoichi Yamashita
How Should We Evaluate Synthesized Environmental Sounds
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 307-312, 2022.
Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Takashi Endo, Masaaki Yamamoto, and Yohei Kawaguchi
Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 31-35, 2022.
Rie Koga, Sawa Takamuku, Keisuke Imoto, and Naotake Natori
Model Training that Prioritizes Rare Overlapped Labels for Polyphonic Sound Event Detection
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 56-60, 2022.
Shunsuke Tsubaki, Keisuke Imoto, and Nobutaka Ono
Joint Analysis of Acoustic Scenes and Sound Events with Weakly labeled Data
Proc. International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5, 2022.
Yukiko Takahashi, Sawa Takamuku, Keisuke Imoto, and Naotake Natori
Semi-supervised Domain Adaptation for Acoustic Scene Classification by Minimax Entropy and Self-supervision Approaches
Proc. International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5, 2022.
Yuki Shiroma, Yuma Kinoshita, Keisuke Imoto, Sayaka Shiota, Nobutaka Ono, and Hitoshi Kiya
Missing Data Recovery Using Autoencoder for Multi-Channel Acoustic Scene Classification
Proc. European Signal Processing Conference (EUSIPCO), pp. 767-771, 2022.
Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, and Yohei Kawaguchi
Environmental Sound Extraction Using Onomatopoeia
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 221-225, 2022.
Noriyuki Tonami, Keisuke Imoto, Ryotaro Nagase, Yuki Okamoto, Takahiro Fukumori, and Yoichi Yamashita
Sound Event Detection Guided by Semantic Contexts of Scenes
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 801-805, 2022.
2021
Kayo Nada, Keisuke Imoto, Reina Iwamae, and Takao Tsuchiya
Multitask Learning of Acoustic Scenes and Events Using Dynamic Weight Adaptation Based on Multi-Focal Loss
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1156-1160, 2021.
Yuki Shiroma, Keisuke Imoto, Sayaka Shiota, Nobutaka Ono, and Hitoshi Kiya
Investigation on Spatial and Frequency-based Features for Asynchronous Acoustic Scene Analysis
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1161-1166, 2021.
Yohei Kawaguchi, Keisuke Imoto, Yuma Koizumi, Noboru Harada, Daisuke Niizumi, Kota Dohi, Ryo Tanabe, Harsh Purohit, and Takashi Endo
Description and Discussion on DCASE2021 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Under Domain Shifted Conditions
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 186-190, 2021.
Keisuke Imoto
Acoustic Scene Classification Using Multichannel Observation with Partially Missing Channels
Proc. European Signal Processing Conference (EUSIPCO), pp. 875-879, 2021.
Keisuke Imoto, Sakiko Mishima, Yumi Arai, and Reishi Kondo
Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 860-864, 2021.
Noriyuki Tonami, Keisuke Imoto, Yuki Okamoto, Takahiro Fukumori, and Yoichi Yamashita
Sound Event Detection Based on Curriculum Learning Considering Difficulty of Events
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 875-879, 2021.
2020
Taiga Kawamura, Ryoichi Miyazaki, Keisuke Imoto, and Nobutaka Ono
Experimental Investigation of Robustness of Spatial Cepstrum Features Under Various Recording Conditions
Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 701-704, 2020.
Yuma Koizumi, Yohei Kawaguchi, Keisuke Imoto, Toshiki Nakamura, Yuki Nikaido, Ryo Tanabe, Harsh Purohit, Kaori Suefusa, Takashi Endo, Masahiro Yasuda, and Noboru Harada
Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 81-85, 2020.
Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, and Yoichi Yamashita
RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 125-129, 2020.
Noriyuki Tonami, Keisuke Imoto, Takahiro Fukumori, and Yoichi Yamashita
Evaluation Metric of Sound Event Detection Considering Severe Misdetections by Scenes
Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 195-199, 2020.
Keisuke Imoto, Noriyuki Tonami, Yuma Koizumi, Masahiro Yasuda, Ryosuke Yamanishi, and Yoichi Yamashita
Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 621-625, 2020.
Tatsuya Komatsu, Keisuke Imoto, and Masahito Togami
Scene-dependent Acoustic Event Detection with Scene Conditioning and Fake-scene-condition Loss
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 646-650, 2020.
Masahiro Yasuda, Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, and Keisuke Imoto
Sound Event Localization Based on Sound Intensity Vector Refined by DNN-based Denoising and Source Separation
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 651-655, 2020.
Yuki Okamoto, Keisuke Imoto, Naoki Tsukahara, Ken Nagata, Koh Sueda, Ryosuke Yamanishi, and Yoichi Yamashita
Crow Call Detection Using Gated Convolutional Recurrent Neural Network
Proc. RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP), pp. 171-174, 2020.