December 2022 Archive

Tutorial on Gabor Filters

Summary: https://inc.ucsd.edu/mplab/75/media//gabor.pdf

posted @ 2022-12-29 21:49 prettysky Views(158) Comments(0) Recommendations(0)

Wavelets

Summary: The information of interest is often a combination of phenomena that are transient (e.g., spike and action potentials) and diffuse (e.g., small oscill…

posted @ 2022-12-29 13:45 prettysky Views(63) Comments(0) Recommendations(0)

over_lap_and_add

Summary: Colab version: from keras.layers.normalization.batch_normalization_v1 import BatchNormalization. Local version: from keras.layers.normalization~~.batch_normalization_v1~…

posted @ 2022-12-25 16:20 prettysky Views(107) Comments(0) Recommendations(0)

PE 26+27+28

Summary: 26 Historical Perspective of the Field of ASR/NLU| 27 HMMs and Related Speech Recognition Technologies| 28 Speech Recognition with Weighted Finite-State Transducers

posted @ 2022-12-25 15:00 prettysky Views(24) Comments(0) Recommendations(0)

DMs

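Summary: Introduction, principles, forward diffusion, reverse process, optimization (derivation omitted), conditional audio: NUWAVE NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling; WSRGlow WSRGlow: A Glow-based Waveform Gener…

posted @ 2022-12-24 17:10 prettysky Views(70) Comments(0) Recommendations(0)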

2012,Normalization of spectro-temporal Gabor filter bank features for improved robust automatic speech recognition systems

Summary: DOI:10.21437/Interspeech.2012-493

posted @ 2022-12-21 13:02 prettysky Views(14) Comments(0) Recommendations(0)

PD 22+23+24+25

Summary: 22 Linguistic Processing for Speech Synthesis| 23 Prosodic Processing| 24 Voice Transformation| 25 Expressive/Affective Speech Synthesis

posted @ 2022-12-16 19:08 prettysky Views(23) Comments(0) Recommendations(0)

DCASE

Summary: DCASE2022 Challenge Task 1, Low-Complexity Acoustic Scene Classification Task 2, Unsupervised Anomalous Sound Detection for Machine Condition Monitori…

posted @ 2022-12-16 12:00 prettysky Views(177) Comments(0) Recommendations(0)

PD 19+20+21

Summary: 19 Basic Principles of Speech Synthesis| 20 Rule-Based Speech Synthesis| 21 Corpus-Based Speech Synthesis

posted @ 2022-12-14 17:38 prettysky Views(19) Comments(0) Recommendations(0)

Non-Negative Matrix Factorization (NMF)

Summary: NMF is an unsupervised machine learning technique popularized by Lee & Seung in 1999.

posted @ 2022-12-14 11:26 prettysky Views(19) Comments(0) Recommendations(0)

2016,Wavelet features for classification of VOTE snore sounds

Summary: DOI: 10.1109/ICASSP.2016.7471669

posted @ 2022-12-12 20:36 prettysky Views(11) Comments(0) Recommendations(0)

2015,Histogram of Gradients of Time–Frequency Representations for Audio Scene Classification

Summary: DOI: 10.1109/TASLP.2014.2375575

posted @ 2022-12-12 18:46 prettysky Views(16) Comments(0) Recommendations(0)

2019,SubSpectralNet – Using Sub-spectrogram Based Convolutional Neural Networks for Acoustic Scene Classification

Summary: DOI:10.1109/ICASSP.2019.8683288| DCASE 2018

posted @ 2022-12-11 20:58 prettysky Views(54) Comments(0) Recommendations(0)

2017,The Details That Matter: Frequency Resolution of Spectrograms in Acoustic Scene Classification

Summary: DCASE 2017

posted @ 2022-12-11 20:54 prettysky Views(13) Comments(0) Recommendations(0)

2019,Enhancing Sound Texture in CNN-based Acoustic Scene Classification

Summary: DOI: 10.1109/ICASSP.2019.8683490| Texture

posted @ 2022-12-11 20:40 prettysky Views(11) Comments(0) Recommendations(0)

2017,Acoustic Scene Classification Using a CNN-SuperVector System Trained with Auditory and Spectrogram Image Features

Summary: DOI:10.21437/INTERSPEECH.2017-431| SAI (Stabilised Auditory Image)

posted @ 2022-12-11 19:17 prettysky Views(11) Comments(0) Recommendations(0)

2010,Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

Summary: DOI: 10.48550/arXiv.1003.4083| MFCCs

posted @ 2022-12-11 18:29 prettysky Views(21) Comments(0) Recommendations(0)

2022,SpectNet: End-to-End Audio Signal Classification Using Learnable Spectrograms

Summary: DOI:10.48550/arXiv.2211.09352| learnable MFCCs

posted @ 2022-12-11 18:05 prettysky Views(27) Comments(0) Recommendations(0)

2009,Non-speech audio event detection

Summary: DOI: 10.1109/ICASSP.2009.4959998

posted @ 2022-12-11 18:02 prettysky Views(5) Comments(0) Recommendations(0)

librosa

Summary: DOI:10.25080/MAJORA-7B98E3ED-003| librosa is a library for music and audio analysis

posted @ 2022-12-10 16:53 prettysky Views(72) Comments(0) Recommendations(0)

2018,Acoustic Scene Classification: An Overview of DCASE 2017 Challenge Entries

Summary: DOI: 10.1109/IWAENC.2018.8521242

posted @ 2022-12-10 15:02 prettysky Views(9) Comments(0) Recommendations(0)

2010,Machine Hearing: An Emerging Field

Summary: DOI: 10.1109/MSP.2010.937498

posted @ 2022-12-10 14:51 prettysky Views(7) Comments(0) Recommendations(0)

2018,Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network

Summary: DOI https://doi.org/10.1007/978-3-030-00764-5_2

posted @ 2022-12-10 13:57 prettysky Views(12) Comments(0) Recommendations(0)

2019,SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Summary: arXiv:1904.08779| log mel spectrogram

posted @ 2022-12-10 13:47 prettysky Views(18) Comments(0) Recommendations(0)

2020,Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework

Summary: DOI:10.1016/j.dsp.2020.102943| ASC

posted @ 2022-12-10 12:21 prettysky Views(13) Comments(0) Recommendations(0)

2020,Deep feature embedding and hierarchical classification for audio scene classification

Summary: DOI: 10.1109/IJCNN48605.2020.9206866| ASC

posted @ 2022-12-09 00:09 prettysky Views(7) Comments(0) Recommendations(0)

2018, A Multi-device Dataset for Urban Acoustic Scene Classification

Summary: DOI: 10.48550/arXiv.1807.09840| ASC| DCASE 2018

posted @ 2022-12-08 23:54 prettysky Views(7) Comments(0) Recommendations(0)

2017, An Investigation of High-Resolution Modeling Units of Deep Neural Networks for Acoustic Scene Classification

Summary: DOI:10.1109/IJCNN.2017.7966232| ASC| DCASE 2016

posted @ 2022-12-08 23:44 prettysky Views(18) Comments(0) Recommendations(0)

2021, A Low-Complexity Deep Learning Framework For Acoustic Scene Classification

Summary: DOI https://doi.org/10.1007/978-3-658-36295-9_4| ASC| DCASE 2021 TASK 1A Top-6

posted @ 2022-12-08 23:11 prettysky Views(16) Comments(0) Recommendations(0)

scikit-maad

Summary: DOI:10.1111/2041-210X.13711| scikit-maad is a signal processing library for soundscape analysis

posted @ 2022-12-08 10:07 prettysky Views(289) Comments(0) Recommendations(0)

PC 17+18

Summary: 17 Analysis-by-Synthesis Speech Coding| 18 Perceptual Audio Coding of Speech Signals

posted @ 2022-12-07 20:27 prettysky Views(64) Comments(0) Recommendations(0)