Audio Perceptual Hashing Based on NMF and MDCT Coefficients

来源 :Chinese Journal of Electronics | 被引量 : 0次 | 上传用户:qq1256280577
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Audio perceptual hashing is a digest of audio contents, which is independent of content preserving manipulations, such as MP3 compression, amplitude scaling, noise addition, etc. It provides a fast and reliable tool for identification, retrieval, and authentication of audio signals. A new audio hashing scheme based on nonNegative matrix factorization(NMF) of Modified discrete cosine transform(MDCT) coefficients is proposed. MDCT coefficients, which have been widely used in audio coding,exhibit good discrimination for different audio contents and highly robustness against content preserving manipulations, especially MDCT based compression such as MP3,AAC, etc. Based on the extraction of MDCT coefficients of the audio frames firstly, NMF is used to construct hash bits. Experiment results demonstrate that, compared with methods mentioned in literature, the proposed scheme exhibits a high efficiency in terms of discrimination, perceptual robustness identification rate and time consumption. Audio perceptual hashing is a digest of audio contents, which is independent of content preserving manipulations, such as MP3 compression, amplitude scaling, noise addition, etc. It provides a fast and reliable tool for identification, retrieval, and authentication of audio signals. A new audio hashing scheme based on nonNegative matrix factorization (NMF) of Modified discrete cosine transform (MDCT) coefficients is proposed. MDCT coefficients, which have been widely used in audio coding, exhibit good discrimination for different audio contents and highly robustness against content preserving manipulations , especially MDCT based compression such as MP3, AAC, etc. Based on the extraction of MDCT coefficients of the audio frames first, NMF is used to construct hash bits. Experiment results demonstrate that, compared with methods mentioned in literature, the proposed scheme a high efficiency in terms of discrimination, perceptual robustness identification rate and time consumption.
其他文献
通过不断创新实践,不断研发提高抽油机工作效率装置.抽油机驴头与游梁固定方式中最常见的是采用销子固定,为防止驴头销子发生外窜退出现象,研制了抽油机驴头销子锁,不仅提高
9:00前……“大家好!我又来你们这儿‘出差兼旅行’啦。”4月的一天,还不到9点,中国工商银行北京分行人事专员小耿就拖着拉杆箱、背着旅行包来到北京市西城区社保中心大户办
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
人物篇采访rn初识林炜教授,是在一次行业活动中多蒙已故张铭让教授的引荐,此后,北京、上海,曾有几次邂逅,然终未得深晤.对于她,我是先闻其名,后识其人.一篇关于皮革工业可持
结节病是多系统受累的非干酪性类上皮细胞肉芽肿性疾病。病因尚不清楚,目前认为,结节病主要是遗传易感人群暴露于原因未明的可传播因素,引起过度的细胞免疫反应导致肉芽肿所致[1]。人类白细胞抗原(HLA)具有多态性,是人体免疫功能的细胞分子学基础。在不同的种族研究中发现,结节病与HLA基因有一定的相关性[2]。本研究对华东地区汉族结节病患者进行HLA-A、HLA-B、HLA-DRB1基因分型,探讨HLA基
Remote sensing(RS) multi-spectral images are usually suffered from cloud and fog cover, which can lead to analysis troubles and application limitations. A novel patch-based dark channel prior dehaze m
自1996年起,北京赢康科技开发有限公司(以下简称赢康)立志于“将世界最优秀的AV系统呈献给中国的客户”.“合作共赢,健康发展”,赢康秉承这一宗旨和理念,经过十几年的不断努
Electroencephalogram(EEG) signal is often contaminated by electronic noise as well as movement artifacts. This paper presented an algorithm based on Canonical c
去年五一前夕,《女职工劳动保护特别规定》(简称《规定》)公布实施,它惠及半边天的女性劳动者,让她们在生育、产假、职场保护等方面有了法律的保护.时间过去一年,《规定》是
研究了单克隆免疫亲和柱-高效液相色谱法测定牛乳中黄曲霉毒素M1的方法.样品通过离心脱脂,脱脂乳用免疫亲和柱净化,SpherisorM C18色谱柱分离,荧光检测器检测,外标法定量.对