Research Achievements

Journal Papers

FY2024

Tomoya Yoshinaga, Keitaro Tanaka, Yoshiaki Bando, Keisuke Imoto, and Shigeo Morishima

Onset-and-Offset-Aware Sound Event Detection via Differentiable Frame-to-Event Mapping

IEEE Signal Processing Letters, Vol. 32, pp. 186-190, 2024.

FY2023

Kayo Nada, Keisuke Imoto, and Takao Tsuchiya

Joint Analysis of Acoustic Scenes and Sound Events Based on Multitask Learning with Dynamic Weight Adaptation

Acoustical Science and Technology, Vol. 44, No. 3, pp. 167-175, 2023.

Yuki Shiroma, Yuma Kinoshita, Keisuke Imoto, Sayaka Shiota, Nobutaka Ono, and Hitoshi Kiya

Missing Data Completion of Multi-Channel Signals Using Autoencoder for Acoustic Scene Classification

APSIPA Transactions on Signal and Information Processing, Vol. 12, No. 3, e16, 2023.

FY2022

Noriyuki Tonami and Keisuke Imoto

Sound Event Triage: Detecting Sound Events Considering Priority of Classes

EURASIP Journal on Audio, Speech, and Music Processing, Vol. 2023, No. 5, pp. 1-13, 2023.

Keisuke Imoto, Sakiko Mishima, Yumi Arai, and Reishi Kondo

Impact of Data Imbalance Caused by Inactive Frames and Difference in Sound Duration on Sound Event Detection Performance

Applied Acoustics, Vol. 196, 108882, 2022.

Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, and Yoichi Yamashita

Onoma-to-wave: Environmental sound synthesis from onomatopoeic words

APSIPA Transactions on Signal and Information Processing, Vol. 11, No. 1, e13, 2022.

砺波紀之, 井本桂右, 岡本悠希, 福森隆寛, 山下洋一

誤検出の深刻さを考慮した音響イベント検出のための評価指標

日本音響学会誌, Vol. 78, No. 5, pp. 217-226, 2022.

FY2020

Noriyuki Tonami, Keisuke Imoto, Ryosuke Yamanishi, and Yoichi Yamashita

Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning

IEICE Transactions on Information and Systems, Vol. E104-D, No. 2, 2021.

https://www.jstage.jst.go.jp/article/transinf/E104.D/2/E104.D_2020EDP7036/_pdf

Keisuke Imoto and Seisuke Kyochi

Sound Event Detection Utilizing Graph Laplacian Regularization with Event Co-occurrence

IEICE Transactions on Information and Systems, Vol. E103-D, No. 9, 2020.

https://www.jstage.jst.go.jp/article/transinf/E103.D/9/E103.D_2019EDP7323/_pdf/

Keisuke Imoto

Graph Cepstrum: Spatial Feature Extracted from Partially Connected Microphones

IEICE Transactions on Information and Systems, Vol. E103-D, No. 3, 2020.

https://www.jstage.jst.go.jp/article/transinf/E103.D/3/E103.D_2019EDP7162/_pdf/

山西良典, 田中一星, 井本桂右, 山下洋一

音声エンタテインメントからのウェブ音声マイニングの可能性

情報処理学会論文誌, Vol. 61, No. 11, pp. 1708-1717, 2020.

辻野雄大, 山西良典, 山下洋一, 井本桂右

ダンスゲーム譜面の特性分析とクラスタリングに基づく特徴的な譜面の自動生成

情報処理学会論文誌, Vol. 61, No. 11, pp. 1718-1728, 2020.

秋山大知, 石川智希, 井本桂右, 新妻雅弘, 山西良典, 山下洋一

音声を用いた感情認識のための学習話者の選択

日本音響学会誌, Vol. 76, No. 10, pp. 554-561, 2020.

International Conferences

FY2025

Yuto Shibata, Keitaro Tanaka, Yoshiaki Bando, Keisuke Imoto, Hirokatsu Kataoka, and Yoshimitsu Aoki

Formula-Supervised Sound Event Detection: Pre-Training Without Real Data

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. xxx-xxx, 2025. (Accepted)

Noriyuki Tonami, Wataru Kohno, Keisuke Imoto, Yoshiyuki Yajima, Sakiko Mishima, Reishi Kondo, and Tomoyuki Hino

Trainingless Adaptation of Pretrained Models for Environmental Sound Classification

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. xxx-xxx, 2025. (Accepted)

Takahiro Morohashi, Keisuke Imoto, Masato Oka, Yasuyuki Kitahara, and Riku Matsuda

Estimation of Work Activities in Construction Sites Using Ambient Sounds: A Case Study with Cloud Cameras

Artificial Intelligence in Architecture, Engineering and Construction (AI in AEC), 2025.

FY2024

Kevin Wilkinghoff, Takuya Fujimura, Keisuke Imoto, and Jonathan Le Roux

Handling Domain Shifts for Anomalous Sound Detection: A Review

Proc. Joint Annual Meeting of German and Danish Acoustical Societies, pp. xxx-xxx, 2025. (Accepted)

Junwon Lee, Modan Tailleur, Mathieu Lagrange, Keunwoo Choi, Laurie M. Heller, Brian McFee, Keisuke Imoto, and Yuki Okamoto

Challenges in Text-to-Audio Synthesis: From Foley to Sound Scenes

Proc. NeurIPS 2024 Workshop on AI-Driven Speech, Music, and Sound Generation (Audio Imagination), pp. 1-9, 2024.

Naoki Koga, Yoshiaki Bando, and Keisuke Imoto

LEAD Dataset: How Can Labels for Sound Event Detection Vary Depending on Annotators?

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1-6, 2024.

Tomoya Nishida, Noboru Harada, Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, Keisuke Imoto, Kota Dohi, Harsh Purohit, Takashi Endo, and Yohei Kawaguchi

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 111-115, 2024.

Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Masahiro Yasuda, Shunsuke Tsubaki, and Keisuke Imoto

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation

Proc. INTERSPEECH, pp. 57-61, 2024.

Shunsuke Tsubaki, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, and Keisuke Imoto

Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval

Proc. European Signal Processing Conference (EUSIPCO), pp. 71-75, 2024.

Modan Tailleur, Junwon Lee, Mathieu Lagrange, Keunwoo Choi, Laurie Heller, Keisuke Imoto, and Yuki Okamoto

Correlation of Frechet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant

Proc. European Signal Processing Conference (EUSIPCO), pp. 56-60, 2024.

Takuya Fujimura, Keisuke Imoto, and Tomoki Toda

Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection

Proc. European Signal Processing Conference (EUSIPCO), pp. 156-160, 2024.

Takezo Ohta, Yoshiaki Bando, Keisuke Imoto, and Masaki Onishi

A Sequential Audio Spectrogram Transformer for Real-Time Sound Event Detection

Proc. European Signal Processing Conference (EUSIPCO), pp. 101-105, 2024.

Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryotaro Nagase, Takahiro Fukumori, and Yoichi Yamashita

Environmental Sound Synthesis From Vocal Imitations and Sound Event Labels

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 411-415, 2024.

https://arxiv.org/pdf/2305.00302.pdf

Kevin Wilkinghoff and Keisuke Imoto

F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-supervised Anomaly Detection

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 256-260, 2024.

https://arxiv.org/pdf/2312.09143.pdf

FY2023

Yoto Fujita, Yoshiaki Bando, Keisuke Imoto, Masaki Onishi, and Kazuyoshi Yoshii

DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 2037-2043, 2023.

Ami Igarashi, Shunsuke Tsubaki, Daisuke Niizumi, Daiki Takeuchi, Noboru Harada, and Keisuke Imoto

Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Approach

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 2050-2056, 2023.

Shunsuke Tsubaki, Yohei Kawaguchi, Keisuke Imoto, Tomoya Nishida, Kota Dohi, Takashi Endo, and Yuki Okamoto

Audio-Change Captioning to Explain Machine-Sound Anomalies

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 201-205, 2023.

Noriyuki Tonami, Sakiko Mishima, Reishi Kondo, Keisuke Imoto, and Tomoyuki Hino

Event Classification With Class-Level Gated Unit Using Large-Scale Pretrained Model for Optical Fiber Sensing

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 196-200, 2023.

Keunwoo Choi, Jaekwon Im, Laurie Heller, Mathieu Lagrange, Keisuke Imoto, Yuki Okamoto, Shinnosuke Takamichi, and Brian McFee

Foley Sound Synthesis at the DCASE 2023 Challenge

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 16-20, 2023.

Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Takashi Endo, Masaaki Yamamoto, and Yohei Kawaguchi

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 31-35, 2023.

Yuki Okamoto, Kanta Shimonishi, Keisuke Imoto, Kota Dohi, Shota Horiguchi, and Yohei Kawaguchi

CAPTDURE: Captioned Sound Dataset of Single Sources

Proc. INTERSPEECH, pp. 1683-1687, 2023.

Hien Ohnaka, Shinnosuke Takamichi, Keisuke Imoto, Yuki Okamoto, Kazuki Fujii, and Hiroshi Saruwatari

Visual Onoma-to-Wave: Environmental Sound Synthesis From Visual Onomatopoeias and Sound-Source Images

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1-5, 2023.

FY2022

Ami Igarashi, Keisuke Imoto, Yuka Komatsu, Shunsuke Tsubaki, Shuto Hario, and Tatsuya Komatsu

How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 7-11, 2022.

Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Takahiro Fukumori, and Yoichi Yamashita

How Should We Evaluate Synthesized Environmental Sounds

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 307-312, 2022.

Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Takashi Endo, Masaaki Yamamoto, and Yohei Kawaguchi

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 31-35, 2022.

Rie Koga, Sawa Takamuku, Keisuke Imoto, and Naotake Natori

Model Training that Prioritizes Rare Overlapped Labels for Polyphonic Sound Event Detection

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 56-60, 2022.

Shunsuke Tsubaki, Keisuke Imoto, and Nobutaka Ono

Joint Analysis of Acoustic Scenes and Sound Events with Weakly labeled Data

Proc. International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5, 2022.

Yukiko Takahashi, Sawa Takamuku, Keisuke Imoto, and Naotake Natori

Semi-supervised Domain Adaptation for Acoustic Scene Classification by Minimax Entropy and Self-supervision Approaches

Proc. International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5, 2022.

Yuki Shiroma, Yuma Kinoshita, Keisuke Imoto, Sayaka Shiota, Nobutaka Ono, and Hitoshi Kiya

Missing Data Recovery Using Autoencoder for Multi-Channel Acoustic Scene Classification

Proc. European Signal Processing Conference (EUSIPCO), pp. 767-771, 2022.

Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, and Yohei Kawaguchi

Environmental Sound Extraction Using Onomatopoeia

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 221-225, 2022.

Noriyuki Tonami, Keisuke Imoto, Ryotaro Nagase, Yuki Okamoto, Takahiro Fukumori, and Yoichi Yamashita

Sound Event Detection Guided by Semantic Contexts of Scenes

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 801-805, 2022.

FY2021

Kayo Nada, Keisuke Imoto, Reina Iwamae, and Takao Tsuchiya

Multitask Learning of Acoustic Scenes and Events Using Dynamic Weight Adaptation Based on Multi-Focal Loss

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1156-1160, 2021.

Yuki Shiroma, Keisuke Imoto, Sayaka Shiota, Nobutaka Ono, and Hitoshi Kiya

Investigation on Spatial and Frequency-based Features for Asynchronous Acoustic Scene Analysis

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 1161-1166, 2021.

Yohei Kawaguchi, Keisuke Imoto, Yuma Koizumi, Noboru Harada, Daisuke Niizumi, Kota Dohi, Ryo Tanabe, Harsh Purohit, and Takashi Endo

Description and Discussion on DCASE2021 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Under Domain Shifted Condition

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 186-190, 2021.

Keisuke Imoto

Acoustic Scene Classification Using Multichannel Observation with Partially Missing Channels

Proc. European Signal Processing Conference (EUSIPCO), pp. 875-879, 2021.

Keisuke Imoto, Sakiko Mishima, Yumi Arai, and Reishi Kondo

Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 860-864, 2021.

Noriyuki Tonami, Keisuke Imoto, Yuki Okamoto, Takahiro Fukumori, and Yoichi Yamashita

Sound Event Detection Based on Curriculum Learning Considering Difficulty of Events

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 875-879, 2021.

FY2020

Taiga Kawamura, Ryoichi Miyazaki, Keisuke Imoto, and Nobutaka Ono

Experimental Investigation of Robustness of Spatial Cepstrum Features Under Various Recording Conditions

Proc. Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 701-704, 2020.

Yuma Koizumi, Yohei Kawaguchi, Keisuke Imoto, Toshiki Nakamura, Yuki Nikaido, Ryo Tanabe, Harsh Purohit, Kaori Suefusa, Takashi Endo, Masahiro Yasuda, and Noboru Harada

Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 81-85, 2020.

Yuki Okamoto, Keisuke Imoto, Shinnosuke Takamichi, Ryosuke Yamanishi, Takahiro Fukumori, and Yoichi Yamashita

RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 125-129, 2020.

Noriyuki Tonami, Keisuke Imoto, Takahiro Fukumori, and Yoichi Yamashita

Evaluation Metric of Sound Event Detection Considering Severe Misdetections by Scenes

Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop, pp. 195-199, 2020.

Keisuke Imoto, Noriyuki Tonami, Yuma Koizumi, Masahiro Yasuda, Ryosuke Yamanishi, and Yoichi Yamashita

Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 621-625, 2020.

Tatsuya Komatsu, Keisuke Imoto, and Masahito Togami

Scene-dependent Acoustic Event Detection with Scene Conditioning and Fake-scene-condition Loss

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 646-650, 2020.

Masahiro Yasuda, Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, and Keisuke Imoto

Sound Event Localization Based on Sound Intensity Vector Refined by DNN-based Denoising and Source Separation

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 651-655, 2020.

Yuki Okamoto, Keisuke Imoto, Naoki Tsukahara, Ken Nagata, Koh Sueda, Ryosuke Yamanishi, and Yoichi Yamashita

Crow Call Detection Using Gated Convolutional Recurrent Neural Network

Proc. RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing (NCSP), pp. 171-174, 2020.

Awards

FY2024

大中緋慧

第8回 IEEE Signal Processing Society (SPS) Tokyo Joint Chapter Student Award

https://www.ieee-jp.org/section/tokyo/chapter/SP-01/past-tjc-student-paper.htm

吉永朋矢

日本音響学会第29回学生優秀発表賞

https://acoustics.jp/awards/student/

吉永朋矢

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会学生研究奨励賞

FY2023

小倉稜也

日本音響学会第27回学生優秀発表賞

https://acoustics.jp/awards/student/

大田竹蔵

情報処理学会第86回全国大会学生奨励賞

https://www.ipsj.or.jp/award/taikaigakusei.html

岡本悠希, 高道慎之介, 森松亜衣, 渡邊亞椰, 井本桂右, 山下洋一

言語処理学会第30回年次大会 (NLP2024) スポンサー賞 (Kotoba Technologies, Inc.)

岡本悠希

第17回 IEEE Signal Processing Society (SPS) Japan Student Conference Paper Award

https://www.ieee-jp.org/section/tokyo/chapter/SP-01/past-student-paper.htm

井本桂右

日本音響学会第10回学会活動貢献賞

https://acoustics.jp/awards/contri/

FY2021

岡本悠希

第5回 IEEE Signal Processing Society (SPS) Tokyo Joint Chapter Student Award

https://www.ieee-jp.org/section/tokyo/chapter/SP-01/past-tjc-student-paper.htm

岡本悠希

日本音響学会第22回学生優秀発表賞

https://acoustics.jp/awards/student/

FY2020

井本桂右

日本音響学会論文賞佐藤賞

https://acoustics.jp/awards/sato/

Books

FY2017

井本桂右他

音響学入門ペディア

コロナ社, 2017.

Review Articles

FY2024

岡本悠希, 井本桂右

統計的手法による環境音・効果音合成

日本音響学会誌, Vol. 80, No. 12, pp. 658-666, 2024.

土肥宏太, 川口洋平, 井本桂右

機械学習を用いた異常音検知

騒音制御, Vol. 48, No. 4, pp. 186-189, 2024.

井本桂右, 矢田部浩平

小特集「アクティブ騒音制御の今」にあたって

日本音響学会誌, Vol. 80, No. 5, pp. 257-258, 2024.

FY2023

井本桂右

DCASE Challenge: 環境音分析・理解のための統合的コンペティション

日本音響学会誌, Vol. 79, No. 9, pp. 470-476, 2023.

井本桂右

環境音分析

電子情報通信学会誌, Vol. 106, No. 8, pp. 774-776, 2023.

FY2022

橘亮輔, 井本桂右

小特集「ヒトと動物の音声の感情・情動伝達」にあたって

日本音響学会誌, Vol. 79, No. 1, pp. 26-27, 2023.

井本桂右

ドメイン知識を利用した環境音分析

電子情報通信学会誌, Vol. 105, No. 12, pp. 1434-1440, 2022.

井本桂右, 川口洋平

環境音分析・異常音検知の研究動向

電子情報通信学会基礎・境界ソサイエティ Fundamentals Review, Vol. 15, No. 4, pp. 268-280, 2022.

https://doi.org/10.1587/essfr.15.4_268

FY2019

井本桂右

環境音分析の研究動向

日本音響学会誌, Vol. 75, No. 9, pp. 512-518, 2019.

https://www.jstage.jst.go.jp/article/jasj/75/9/75_512/_pdf/

FY2018

Keisuke Imoto

Introduction to Acoustic Event and Scene Analysis

Acoustical Science and Technology, Vol. 39, No. 3, pp. 182-188, 2018.

https://www.jstage.jst.go.jp/article/ast/39/3/39_E183002/_pdf/

FY2017

井本桂右

音響イベントと音響シーンの分析

日本音響学会誌, Vol. 74, No. 4, pp. 198-207, 2018.

https://www.jstage.jst.go.jp/article/jasj/74/4/74_198/_pdf/

Invited Talks

FY2024

Keisuke Imoto

x-to-audio: General Audio Synthesis From Various Input Prompt

Eighth International Workshop on Symbolic-Neural Learning (SNL), 2024.

FY2023

井本桂右

これから始める環境音分析・合成

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会, 2024.

井本桂右

環境音の分析・合成と自然言語処理との交差点

NLP若手の会 (YANS) 第18回シンポジウム, 2023年08月.

FY2022

井本桂右

計算機による環境音の理解・解釈に向けた統合的コンペティションDCASE Challengeへの招待

日本音響学会 2023年春季研究発表会, pp. 1151-1152, 2023.

井本桂右

ドメイン知識を活用した環境音分析・合成研究の動向

日本ロボット学会第144回ロボット工学セミナー「ロボットのための音声・音響処理技術」

FY2021

Keisuke Imoto

Fundamentals and Recent Advances in Environmental Sound Analysis

Overview Session (OS-2 #3), Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021.

井本桂右

環境音の特徴を活用した音響イベント検出・シーン分類

第46回産総研人工知能セミナー「AIによる音環境理解を目的とした環境音分析」

https://www.slideshare.net/ksuke-i/ss-243475589

FY2020

井本桂右

環境音分析ことはじめ

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会, 2020.

Domestic Presentations

FY2024

仁泉大輔, 竹内大起, 大石康智, 原田登, 安田昌弘, 椿竣介, 井本桂右

汎用言語音響表現 M2D-CLAP

日本音響学会 2025年春季研究発表会, pp. xxx-xxx, 2025.

恩田健太郎, 深山覚, 井本桂右, 齋藤大輔, 峯松信明

母語話者音声のみを用いた外国語訛りに頑健な自動音声認識の実現に向けた離散トークンの活用の検討

日本音響学会 2025年春季研究発表会, pp. xxx-xxx, 2025.

佐藤僚, 春田智穂, 昼間信彦, 井本桂右

クラスベース目的音抽出における抽出スケール制御手法の検討

日本音響学会 2025年春季研究発表会, pp. xxx-xxx, 2025.

恩田健太郎, 朴浚鎔, 井本桂右, 深山覚, 齋藤大輔, 峯松信明

離散トークンの継続長予測に基づく母語話者音声コーパスのみを用いた外国語訛り音声合成手法の改善

情報処理学会音声言語情報処理研究会第154回研究会 (音声言語シンポジウム・自然言語処理シンポジウム), pp. xxx-xxx, 2024.

吉永朋矢, 田中啓太郎, 坂東宜昭, 井本桂右, 大西正輝, 森島繁生

汎用事前学習済みモデルを用いた音響イベント検出のためのHSMMに基づくイベント単位学習

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会, pp. 7-10, 2024.

大田竹蔵, 坂東宜昭, 井本桂右, 大西正輝

音響イベント物体の視聴覚教師あり検出

日本音響学会 2024年秋季研究発表会, pp. 161-162, 2024.

柴田優斗, 田中啓太郎, 坂東宜昭, 井本桂右, 片岡裕雄, 青木義満

音源信号の数式ドリブン合成に基づく音響イベント検出の事前学習

日本音響学会 2024年秋季研究発表会, pp. 163-164, 2024.

吉永朋矢, 坂東宜昭, 田中啓太郎, 井本桂右, 大西正輝, 森島繁生

音響イベント検出のための隠れセミマルコフモデルに基づくイベント単位損失

日本音響学会 2024年秋季研究発表会, pp. 165-166, 2024.

砺波紀之, 井本桂右, 美島咲子, 近藤玲史, 樋野智之

学習済み環境音認識モデルに対する学習レス適応

日本音響学会 2024年秋季研究発表会, pp. 347-348, 2024.

諸橋俊大, 井本桂右, 岡尚人, 北原靖之, 松田陸

建設現場環境音と深層学習による作業推定システムの開発その３

日本建築学会大会学術講演会, pp. 91-92, 2024.

岡本悠希, 井本桂右, 高道慎之介, 永瀬亮太郎, 福森隆寛, 山下洋一

環境音の模倣音声を利用した環境音合成とデータセット構築

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会, p. 22, 2024.

Shunsuke Tsubaki, Yohei Kawaguchi, Tomoya Nishida, Keisuke Imoto, Yuki Okamoto, Kota Dohi, and Takashi Endo

Audio-change Captioning to Explain Machine-sound Anomalies

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会, pp. 29-33, 2024.

井上かほり, 福本有花, 古賀直樹, 井本桂右

音響シーンと音響イベントの同時分析における継続学習の検討

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会, pp. 34-37, 2024.

FY2023

古賀直樹, 坂東宜昭, 井本桂右

アノテータごとのばらつきを考慮した音響イベント検出

情報処理学会第86回全国大会, pp. 367-368, 2024.

大田竹蔵, 坂東宜昭, 井本桂右, 大西正輝

実時間で動作する音響イベント検出の大規模事前学習

情報処理学会第86回全国大会, pp. 365-366, 2024.

岡本悠希, 高道慎之介, 森松亜衣, 渡邊亞椰, 井本桂右, 山下洋一

環境音に対する日本語自由記述文コーパスとベンチマーク分析

言語処理学会第30回年次大会, pp. 1269-1273, 2024.

井上かほり, 井本桂右

環境音分析における事前学習済みモデルのバイアス調査

日本音響学会 2024年春季研究発表会, pp. 119-122, 2024.

岡本悠希, 井本桂右, 高道慎之介, 永瀬亮太郎, 福森隆寛, 山下洋一

環境音の模倣音声を用いた環境音合成の検討とデータセット構築

IDRユーザフォーラム 2023.

岩川光一, 井本桂右

調理音とレシピテキストを用いた調理工程の推定

IDRユーザフォーラム 2023.

岡本悠希, 井本桂右, 高道慎之介, 永瀬亮太郎, 福森隆寛, 山下洋一

Voice-to-foley: 環境音を模倣した音声を入力とする環境音合成

日本音響学会 2023年秋季研究発表会, pp. 1071-1074, 2023.

吉永朋矢, 坂東宜昭, 井本桂右, 大西正輝, 森島繁生

イベント間の共起構造を導入した隠れセミマルコフモデルに基づく音響イベント検出

日本音響学会 2023年秋季研究発表会, pp. 181-182, 2023.

大田竹蔵, 坂東宜昭, 井本桂右, 大西正輝

時間的連続性を導入した視聴覚自己教師あり学習に基づく音響イベント検出

日本音響学会 2023年秋季研究発表会, pp. 183-184, 2023.

坂東宜昭, 井本桂右, 升山義紀, 佐々木洋子

複数仮説トラッキングに基づく音響イベント定位・検出

日本音響学会 2023年秋季研究発表会, pp. 189-190, 2023.

小倉稜也, 井本桂右, 貴家仁志, 塩田さやか

距離に基づく音源分離を用いたシングルチャンネル環境音分類

日本音響学会 2023年秋季研究発表会, pp. 375-378, 2023.

砺波紀之, 美島咲子, 近藤玲史, 井本桂右, 樋野智之

光ファイバセンシングのための学習済み環境音認識モデルを用いたイベント分類

日本音響学会 2023年秋季研究発表会, pp. 379-382, 2023.

諸橋俊大, 井本桂右, 岡尚人, 北原靖之

建設現場環境音と深層学習による作業推定システムの開発その２

日本建築学会大会学術講演会, pp. 111-112, 2023.

FY2022

岡本悠希, 井本桂右, 土肥宏太, 川口洋平

DCASECaps: 単一音源に説明文を付与した環境音データセット

日本音響学会 2023年春季研究発表会, pp. 133-136, 2023.

Andreas Mayer, Soshi Yoshida, Keisuke Imoto, Shizuko Hiryu

Distinguishing Conspecific Bats by Their Echolocation Calls Using a Convolutional Neural Network

日本音響学会 2023年春季研究発表会, pp. 503-504, 2023.

大田竹蔵, 坂東宜昭, 井本桂右, 大西正輝

視聴覚自己教師あり学習に基づく音響イベント検出

情報処理学会第85回全国大会, pp. 441-442, 2023.

五十嵐彩美, 椿竣介, 井本桂右

半教師あり学習に基づく音響シーンと音響イベントの同時分析

電子情報通信学会音声研究会, pp. 165-170, 2023.

藤田陽斗, 坂東宜昭, 井本桂右, 大西正輝, 吉井和佳

音響イベント定位・検出のための全方位動画と多チャネル音響信号を用いた自己教師あり学習

電子情報通信学会音声研究会, pp. 78-82, 2023.

大中緋慧, 高道慎之介, 井本桂右, 岡本悠希, 藤井一貴, 猿渡洋

Visual onoma-to-wave：画像オノマトペと音源画像を利用した環境音合成の提案

電子情報通信学会音声研究会, pp. 83-88, 2023.

小倉稜也, 塩田さやか, 井本桂右, 貴家仁志

距離に基づく音源分離を用いたシングルチャンネル環境音分類

電子情報通信学会音声研究会 (ショートオーラルセッション), 2023.

砺波紀之, 井本桂右

音響イベントトリアージ：クラスの優先度を考慮したイベント検出

日本音響学会 2022年秋季研究発表会, pp. 251-254, 2022.

諸橋俊大, 井本桂右, 吉田康仁, 岡尚人

建設現場環境音を用いた作業管理システムの開発－音源定位と深層学習による作業推定技術の適用－

日本音響学会 2022年秋季研究発表会, pp. 771-774, 2022.

岡本悠希, 井本桂右, 高道慎之介, 福森隆寛, 山下洋一

環境音合成の入力情報に応じた主観評価手法の検討

日本音響学会 2022年秋季研究発表会, pp. 1257-1260, 2022.

諸橋俊大, 井本桂右, 吉田康仁, 岡尚人

建設現場環境音と深層学習による作業推定システムの開発

日本建築学会大会学術講演会, pp. 53-54, 2022.

FY2021

井本桂右, 賀谷采珠, 椿竣介

音響イベントの強ラベル付与におけるアノテーター間のばらつきの分析

日本音響学会 2022年春季研究発表会, pp. 161-162, 2022.

小松由佳, 井本桂右, 小松達也

音響シーンとイベントが相互に及ぼす影響の調査

日本音響学会 2022年春季研究発表会, pp. 163-164, 2022.

椿竣介, 宇都瑛祐, 井本桂右, 小野順貴

弱ラベルを用いた音響シーンとイベントの同時分析

日本音響学会 2022年春季研究発表会, pp. 165-166, 2022.

砺波紀之, 井本桂右, 永瀬亮太郎, 岡本悠希, 福森隆寛, 山下洋一

事前定義されていないシーン情報を利用可能な音響イベント検出

日本音響学会 2022年春季研究発表会, pp. 243-246, 2022.

岡本悠希, 堀口翔太, 山本正明, 井本桂右, 川口洋平

擬音語を用いた環境音抽出

日本音響学会 2022年春季研究発表会, pp. 247-250, 2022.

髙橋皓大, 井本桂右, 土屋隆生

グラフ深層学習を用いた音響イベントとシーンの同時分析

日本音響学会 2022年春季研究発表会, pp. 251-254, 2022.

岡本悠希, 井本桂右, 高道慎之介, 福森隆寛, 山下洋一

環境音合成における主観評価手法の検討

日本音響学会 2022年春季研究発表会, pp. 1071-1074, 2022.

城間佑樹, 木下裕磨, 井本桂右, 塩田さやか, 小野順貴, 貴家仁志

自己符号化器を用いた多チャンネル信号の欠損復元法と環境音分類における評価

日本音響学会電気音響研究会/電子情報通信学会応用音響研究会, pp. 140-145, 2022.

井本桂右, 岡本悠希, 高道慎之介, 山下洋一

RWCP音声・音響データベースを用いた環境音・効果音合成の検討とオノマトペ拡張データセットの構築

IDRユーザフォーラム 2021.

岡本悠希, 井本桂右, 高道慎之介, 山西良典, 福森隆寛, 山下洋一

Transformerを用いたオノマトペからの環境音合成

日本音響学会 2021年秋季研究発表会, pp. 943-946, 2021.

FY2020

岩前玲那, 白波瀬壮, 髙橋皓大, 井本桂右, 土屋隆生

音響イベントとシーンのマルチタスク学習における評価関数の重みの自動調整

日本音響学会 2021年春季研究発表会, pp. 303-304, 2021.

井本桂右, 美島咲子, 荒井友督, 近藤玲史

音響イベント長とイベント非活性区間長の不均衡が検出性能に及ぼす影響

日本音響学会 2021年春季研究発表会, pp. 305-308, 2021.

岡本悠希, 井本桂右, 高道慎之介, 山西良典, 福森隆寛, 山下洋一

Onoma-to-wave: オノマトペからの環境音合成手法の提案

日本音響学会 2021年春季研究発表会, pp. 843-846, 2021.

河村泰雅, 宮崎亮一, 井本桂右

実環境におけるマイクロホンの移動に対する空間ケプストラムの頑健性の調査

日本音響学会 2020年秋季研究発表会, pp. 161-164, 2020.

砺波紀之, 井本桂右, 福森隆寛, 山下洋一

音響シーンを用いて検出誤りの深刻さを考慮したイベント検出の評価指標

日本音響学会 2020年秋季研究発表会, pp. 301-304, 2020.

Others

FY2020

戸上真人, 木田祐介, 井本桂右, 山本龍一

ここまで来た音声技術・今後の展望

LINE Developer Day 2020 Casual Track, 2020年 11月.
