基于密集连接卷积神经网络的食品图像识别

来源 :浙江理工大学 | 被引量 : 0次 | 上传用户:yxyqt
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In recent years, image recognition has become important in computer vision and image processing.Additionally, it is used in many fields such as driverless vehicles, healthcare, face recognition, search engines, etc.With its increase usage in mobile camera, many applications use image recognition algorithms such as navigations, dietary assessment, etc.
  Food image classification and recognition are a branch field in computer vision.This field of dimension has attracted more attention because of its critical role-play in helping humans keep track of their nutrition, which in turn have significant impacts on human health.
  Food image recognition based on a convolutional neural network has enjoyed a good number of applications with a reported high accuracy.This is true because convolutional neural networks have the capability to extract features directly from images thereby branding convolutional neural network as an efficient tool for image recognition.
  Therefore, in its strives to contribute to this field of knowledge, this thesis sits on the foundation of the DenseFood model based on densely connected convolutional network architecture, that consists of initial layers, dense block layers, transition layers, and fully connected layers.Our model not very depth, but we increase the width to improve the performance.We use a convolution layer, as the initial layer to extract information as much as from food images before feeding into dense blocks, as well as use dense connectivity to extract new features.Also, we use max pooling to down-sample features and extract main features and food structures form images.Furthermore, we use ELU activation function to tackle the vanishing gradient problem, and to speed up the training process.
  Additionally, the combination of Softmax loss and center loss was employed during the training process to minimize the variance in the same category at the same time maximizing the variance in different categories.The DenseFood, DenseNet121 and ResNet50 models were trained from scratch using the VIREO-172 dataset.In addition, we fine-tuned DenseNetl21 and ResNet50 pre-trained models, which trained on the ImageNet dataset to extract features from images of our dataset.
  Experimental results showed that the DenseFood model has achieved accuracy better than other models that train from scratch.DenseFood accuracy is very close to pre-trained models and has achieved 81.68% for top-1 accuracy, whereas DenseNet and ResNet achieved 83.92%, 82.49% respectively for top-1 accuracy.Furthermore, the use of the densely connected convolutional neural network has achieved higher accuracy better than the ResNet model.
其他文献
学位
学位
学位
近年来,随着高光谱遥感技术的快速发展,基于高光谱图像的分类技术在目标检测、环境管理、矿物测绘中发挥着极其重要的作用,这些应用通常需要对特定场景内的图像进行分类。一些学者已将表示学习应用于高光谱图像分类,但传统的高光谱图像分类仍存在一些挑战和局限:(1)高光谱图像的维度比多光谱图像大得多,而传统的表示学习技术是专门为多光谱图像设计的,利用传统的技术对高光谱图像进行处理效果会受到一定程度的制约;(2)
学位
学位
高光谱图像的分类应用在地质勘探,城市扩张,农业和林业监测,军事等行业中起着至关重要的作用。高光谱图像具有优良的光谱信息和丰富的空间信息,其特征质量是影响分类性能的关键因素之一。由于特征的类内差异以及广泛的光照和规模变化,分类问题仍然具有挑战性。因此,如何从高光谱数据中提取本质特征是本文的主要研究重点。主要工作如下:(1)高光谱图像由于其光谱维数高,相关性强,数据量大等特点,在特征提取方面有很大的难
学位
学位
随着互联网的快速发展,信息与通信技术的日益提高,使得基于互联网的服务与应用和人们的生活越来越密不可分。社会网络、经济、医疗保健、工业和科学等领域产生海量数据,加上网络边界的消失以及攻击类型的多样化,增加了网络入侵的风险。如果没有敏捷的安全基础设施,基于物联网技术发展的智能城市将无法可靠运行。网络入侵检测系统(Intrusion Detection System,IDS)已成为监控网络活动和检测入侵
学位
信息爆炸时代的来临和云存储的高速发展造成了数据量成倍的增长,物联网的发展也使物和物之间增多了联系,信息技术的蓬勃发展带来了社会的欣欣向荣,同时都造成了数据量日益增加,数据存储的承载量和存储设备可靠性问题受到业界人们的关注,科研人员希望寻找有效的办法来应对这类挑战。RAID-6存储系统相比其它的RAID存储系统具有更高的数据可靠性,通过纠删码在RAID中的应用,设计出高效的扩容方案一直探索的方向。在
学位
The goal of this thesis is to examine how video games are designed and to see how different game mechanics work and how to use them in the development of a game,as well as examine what are both the po
学位